WO2019075711A1 - Method for eliminating residue of regenerative agent - Google Patents

Method for eliminating residue of regenerative agent Download PDF

Info

Publication number
WO2019075711A1
WO2019075711A1 PCT/CN2017/106972 CN2017106972W WO2019075711A1 WO 2019075711 A1 WO2019075711 A1 WO 2019075711A1 CN 2017106972 W CN2017106972 W CN 2017106972W WO 2019075711 A1 WO2019075711 A1 WO 2019075711A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequencing
solid support
group
reagent
optionally substituted
Prior art date
Application number
PCT/CN2017/106972
Other languages
French (fr)
Chinese (zh)
Inventor
谢槟
徐讯
杨晋
徐崇钧
刘二凯
陈奥
章文蔚
德马纳克•斯内扎娜
卡洛•马修
拉哈戈泊尔•田西
马可心
赵芳
李计广
Original Assignee
深圳华大智造科技有限公司
完整基因有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳华大智造科技有限公司, 完整基因有限公司 filed Critical 深圳华大智造科技有限公司
Priority to PCT/CN2017/106972 priority Critical patent/WO2019075711A1/en
Priority to CN201780094653.2A priority patent/CN111051533B/en
Publication of WO2019075711A1 publication Critical patent/WO2019075711A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids

Definitions

  • the invention relates to the field of nucleic acid sequencing.
  • the present invention relates to a method of eliminating residual reagents of regenerating reagents, comprising: a. providing a solid support on which a nucleic acid molecule is immobilized, b. treating the solid support with a regenerating reagent, the regenerating reagent comprising a reducing agent, And c.
  • step b treating the solid support with a wash solution to which a compound having a disulfide bond structure is added, wherein prior to step b, the solid support is optionally treated with one or more reaction solutions containing a sequencing reagent Generating a signal representative of the nucleotide sequence of the nucleic acid molecule on a solid support, and wherein the regeneration reagent can eliminate the signal on the solid support, wherein after the treatment of step c, optionally further The solid support is treated with one or more reaction solutions containing sequencing reagents.
  • the second generation of sequencing technology is based on the development of the first generation of Sanger sequencing technology, with low cost, high throughput, automation and other features, greatly promoting the development of the gene sequencing industry.
  • Second-generation sequencing technology has been widely used in whole-genome sequencing, transcriptome sequencing, and metagenomic sequencing. It is a powerful tool for analyzing the evolution and classification of organisms, studying cancer-related genes such as autism, and conducting in vitro diagnostics. It has promoted people's understanding of life sciences and promoted the development of health industry.
  • SBS Sequencing-by-Synthesis
  • cPAS Joint Probe Anchor Polymerization
  • SBL ligation sequencing
  • the regenerated reagent will remain on the surface or reagent of the solid support after resection of the detectable label with the regenerating reagent during the sequencing process.
  • the subsequent sequencing reagents are polluted, so that the quality of the sequencing data is degraded, which affects the sequencing accuracy.
  • high-salt solutions are used to clean the chips or tubes, high salt solutions require a large amount of high salt due to dilution and by reducing the physical binding of the regenerating reagent to the chip, nucleic acid or tubing to reduce the residual amount of regenerating reagents.
  • the solution, and only to minimize the residual of the regenerating reagent in the sequencing reagent is difficult to completely remove the regenerating reagent, resulting in reduced sequencing data quality and sequencing accuracy.
  • the present invention solves the above problems by using a washing solution containing a compound having a disulfide bond structure.
  • the invention provides a method of eliminating regenerative reagent residues comprising: a. providing a solid support having nucleic acid molecules immobilized thereon, b. treating the solid support with a regeneration reagent, the regenerating reagent comprising reducing And c. treating the solid support with a washing solution to which a compound having a disulfide bond structure is added.
  • the solid support is free or substantially free of regenerating reagent after the treatment of step c.
  • the solid support is free or substantially free of reducing agent contained in the regenerating agent.
  • the solid support is optionally treated with one or more reaction solutions containing a sequencing reagent to produce a nucleotide sequence representing the nucleic acid molecule on a solid support
  • the signal and wherein the regeneration reagent can eliminate the signal on the solid support.
  • the solid support is optionally also treated with one or more reaction solutions containing sequencing reagents.
  • the solid support does not contain or substantially does not contain regeneration.
  • Reagent preferably, the solid support is free or substantially free of reducing agent contained in the regenerating agent.
  • substantially free of regenerating agent means that the amount of regenerating agent (or preferably, the reducing agent in the regenerating agent) contained on the solid support is sufficiently small that the solid support and the sequencing reagent are included.
  • reaction solution When the reaction solution is contacted, it does not cause significant contamination of the reaction solution with respect to the washing solution to which the compound containing the disulfide bond structure is not added, with respect to the regenerating agent (or preferably, the reducing agent in the regenerating agent).
  • the solid support After treating a solid support which has been treated with a regenerating agent using a washing solution to which a compound having a disulfide bond structure is added, the solid support is subsequently treated with a reaction solution containing a sequencing reaction reagent.
  • the concentration of the regenerating agent (or preferably, the reducing agent in the regenerating agent) in the reaction solution is not more than about 10%, not more than about 5%, not more than about 2%, not more than about 1%, Not more than about 0.5%, no more than about 0.2%, no more than about 0.1%, no more than about 0.05%, no more than about 0.02%, no more than about 0.001%, or even lower.
  • treating a solid support includes any means of contacting a solid support with a reagent or solution, including but not limited to flowing a reagent or solution through a solid support or soaking a solid support in an agent or solution.
  • the means for flowing the reagent or solution through the solid support includes, for example, transporting the reagent or solution to the solid support using an inlet flow channel in fluid communication with the solid support and using the outlet flow channel in fluid communication with the solid support to remove the reagent or solution from the reagent or solution Solid support discharge (described in International Application Publication WO 91/06678) a kind of treatment).
  • Soaking the solid support in the reagent or solution includes any means of immersing the solid support partially or completely in the reagent or solution.
  • the solid support may be partially or fully immersed in the reagent or solution, for example, in a vertical, oblique or horizontal manner, preferably all nucleic acid molecules immobilized on the solid support are contacted with reagents in the reaction vessel.
  • nucleic acid may be used interchangeably with “nucleic acid molecule", "polynucleotide”, which may be any type of nucleic acid, eg, the nucleic acid may be deoxyribonucleic acid (DNA), ribonucleic acid (RNA) or An analog of DNA or RNA made from a nucleotide analog. Nucleic acids can be single stranded, double stranded or contain both single stranded and double stranded sequences.
  • the nucleic acid molecule may be derived from a double-stranded DNA (dsDNA) form (eg, genomic DNA, PCR, and amplification products, etc.), or may be derived from a single-stranded form such as DNA (ssDNA) or RNA and which may be converted to the dsDNA form, and vice versa.
  • dsDNA double-stranded DNA
  • ssDNA single-stranded form
  • RNA RNA
  • the sequenced nucleic acid can be in the form of a single molecule (which can be a natural molecule, a modified molecule such as a labeled molecule, or a nucleic acid comprising a nucleotide analog), a concatamer of a sequence, etc.
  • nucleic acid molecule can be amplified (eg, amplified as a concatamer, amplified into multiple individual molecules having the same or similar sequences, etc.), and/or can be in any other form.
  • the exact sequence of the nucleic acid molecule can be known or unknown.
  • nucleic acids genes or gene fragments (eg, probes, primers, EST or SAGE tags), genomic DNA, genomic DNA fragments, exons, introns, messenger RNA (mRNA), transfer RNA, Ribosome RNA, ribozyme, cDNA, nucleic acid library, recombinant polynucleotide, synthetic polynucleotide, branched polynucleotide, plasmid, vector, isolated DNA of any sequence, isolated RNA of any sequence, any of the above sequences Nucleic acid probes, primers or amplified copies.
  • genes or gene fragments eg, probes, primers, EST or SAGE tags
  • genomic DNA genomic DNA fragments, exons, introns
  • messenger RNA messenger RNA
  • transfer RNA Ribosome RNA
  • Ribozyme Ribosome RNA
  • cDNA nucleic acid library
  • recombinant polynucleotide synthetic polynucleotide,
  • Nucleic acids can include nucleotides or nucleotide analogs. Nucleotides typically contain a sugar, a nucleobase, and at least one phosphate group. Nucleotides can be abasic (ie, lacking nucleobases). Nucleotides include deoxyribonucleotides, modified deoxyribonucleotides, ribonucleotides, modified ribonucleotides, peptide nucleotides, modified peptide nucleotides, modified phosphate sugar backbone nucleosides Acids and mixtures thereof.
  • nucleotides include, for example, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), thymidine monophosphate (TMP), thymidine diphosphate (TDP), and thorax.
  • AMP adenosine monophosphate
  • ADP adenosine diphosphate
  • ATP adenosine triphosphate
  • TMP thymidine monophosphate
  • TDP thymidine diphosphate
  • thorax examples include, for example, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), thymidine monophosphate (TMP), thymidine diphosphate (TDP), and thorax.
  • Nucleotide analogs comprising modified nucleobases can also be used in the methods described herein.
  • Exemplary modified nucleobases that may be included in a polynucleotide, whether having a native backbone or a similar structure include, for example, inosine, xanthine, hypoxanthine, isocytosine, isoguanine, 2-amino Indole, 5-methylcytosine, 5-hydroxymethylcytosine, 2-aminoadenine, 6-methyladenine, 6-methylguanine, 2-propylguanine, 2-propyladenine , 2-thiouracil, 2-thiothymidine, 2-thiocytosine, 15-halouracil, 15-halocytosine, 5-propynyl uracil, 5-propynyl cytosine, 6 - azouracil, 6-azocytosine, 6-azothymine, 5-uracil, 4-thiouracil, 8-haloadenine or guanine,
  • Nucleic acid molecules that are sequenced in particular embodiments of the invention can be of any length.
  • exemplary lengths of useful nucleic acids include, for example, at least about 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 300, 400, 500, 1,000. , 5,000 or 10,000, 100,000 nucleotides or longer.
  • the length may be no longer than 1,000,000, 100,000, 10,000, 1,000, 100 nucleotides or less.
  • the length of the nucleic acid molecule can also include all integers between the above exemplary numbers.
  • the sequenced nucleic acid can be, for example, within the scope of short polynucleotides, fragments, cDNA, genes, and genomic fragments.
  • Nucleic acid molecules that are sequenced in particular embodiments of the invention may be in any number, for example, may be 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70 , 80, 90 or 100 or more identical or different nucleic acid molecules. Is the number of sequencing a nucleic acid molecule may also be, for example 200,300,400,500,1000,5000,10000,50000,1x10 5, 2x10 5, 3x10 5 , 4x10 5, 5x10 5, 6x10 5, 7x10 5, 8x10 5 , 9x10 5 , 1x10 6 , 2x10 6 , 3x10 6 , 4x10 6 , 5x10 6 , 6x10 6 , 7x10 6 , 8x10 6 , 9x10 6 or 1x10 7 or more identical or different nucleic acid molecules. The number of nucleic acid molecules that are sequenced may also include all integers between the above exemplary numbers.
  • Nucleic acids can be obtained from any source.
  • a nucleic acid can be prepared from a nucleic acid molecule obtained from an organism, or from a population of nucleic acid molecules obtained from a natural source comprising one or more organisms.
  • Sources of nucleic acid molecules include, but are not limited to, organelles, cells, tissues, organs, or organisms.
  • Cells that can be used as a source of nucleic acid molecules can Prokaryotic (eg bacteria); or eukaryotic, such as fungi (eg yeast), plants, protozoa and other parasites, and animals (including insects, nematodes), and mammals (eg, rats, mice, Monkeys, non-human primates, and humans); or nucleic acid molecules can be derived from viruses.
  • Prokaryotic eg bacteria
  • eukaryotic such as fungi (eg yeast), plants, protozoa and other parasites, and animals (including insects, nematodes), and mammals (eg, rats, mice, Monkeys, non-human primates, and humans); or nucleic acid molecules can be derived from viruses.
  • the nucleic acid can be obtained from a particular biological source.
  • the nucleic acid is a human nucleic acid obtained from a human, for example, a sample of human tissue.
  • the nucleic acid is a human mitochondrial nucleic acid.
  • the nucleic acid can be obtained from a metagenomic sample.
  • the nucleic acid can be obtained from an environmental source that no longer contains living organisms.
  • the term "immobilized" when used in reference to a nucleic acid means attached directly or indirectly to a solid support via a covalent bond or a non-covalent bond.
  • covalent attachment can be used, but typically only required is nucleic acid retention under conditions where it is desired to use a solid support (eg, in applications requiring nucleic acid amplification and/or sequencing) Fixed or attached to a solid support.
  • the oligonucleotide to be used as a capture primer or amplification primer is immobilized such that the 3' end is available for enzymatic extension and at least a portion of the primer sequence is capable of hybridizing to the complementary nucleic acid sequence.
  • Immobilization can occur via hybridization to surface-attached oligonucleotides, in which case the immobilized oligonucleotide or polynucleotide can be in the 3'-5' direction.
  • Another way of non-covalent attachment may be to bind the nucleic acid binding protein to the solid support by amination modification and to capture the nucleic acid molecule by the nucleic acid binding protein.
  • immobilization can occur by other means than base pair hybridization, such as the covalent attachment described above.
  • Non-limiting examples of nucleic acid attachment to solid support include nucleic acid hybridization, biotin streptavidin binding, sulfhydryl binding, photoactivated binding, covalent binding, antibody-antigen, via hydrogel or other porous polymer. Physical limitations, etc.
  • Various exemplary methods for immobilizing nucleic acids on a solid support can be found, for example, in G. Steinberg-Tatman et al, Bioconjugate Chemistry 2006, 17, 841-848; Xu x. et al.
  • solid support means any insoluble substrate or matrix to which a nucleic acid can be attached, such as, for example, latex beads, dextran beads, polystyrene surfaces, polypropylene surfaces, polypropylene. Amide gel, gold surface, glass surface, chip and silicon wafer.
  • the surface of the solid support can be any desired shape including, for example, planar, spherical or porous suitable for a particular application.
  • solid support The object can be a flat glass surface.
  • a solid support can be mounted inside the flow cell to allow interaction with solutions of multiple reagents. The solid support can also be moved to contact the solution in a plurality of reaction vessels, respectively.
  • the solid support can comprise an inert substrate or matrix that has been chemically functionalized, for example by applying a layer or coating of an intermediate material that allows covalent attachment to the multicore A reactive group of a glycoside.
  • the intermediate material can be attached directly or indirectly to the solid support via a covalent bond or a non-covalent bond.
  • a support can include a polyacrylamide hydrogel layer on an inert substrate such as glass.
  • the polynucleotide can be directly covalently attached to an intermediate layer (eg, a hydrogel), but the intermediate layer itself can be non-covalently attached to other substrates or substrates (eg, glass substrates) layers.
  • the term “signal” includes any signal that can be detected, including but not limited to optical signals, electrical signals, electromagnetic signals, radiated signals, and the like.
  • the term “signal representing a nucleotide sequence of a nucleic acid molecule” means that the signal carries information of the nucleotide sequence and can be decoded into the nucleotide sequence of the nucleic acid molecule being sequenced. For example, where a labeled (eg, fluorescently labeled) sequencing probe is used, the signal generated by the label carries the nucleotide sequence information of the sequencing probe, and the signal is used to know the sequenced nucleic acid molecule and the sequencing The nucleotide sequence of the position where the probe is complementary to the pair.
  • a labeled eg, fluorescently labeled
  • the signal generated by the label carries the identity information of the nucleotide, and the signal is used to know the nucleotide molecule being sequenced and the nucleotide The nucleotide sequence of the position of the complementary pair.
  • the signal is a fluorescent signal.
  • the label is a fluorescent label.
  • the manner in which fluorescent labels or signals are detected is well known in the art. For example, it can be realized by a device that detects the wavelength of fluorescence. Such devices are well known in the art.
  • such a device can be a confocal scanning microscope that scans the surface of the solid support with a laser to image the fluorophore directly on the nucleic acid molecule being sequenced.
  • each of the generated signals can be observed, for example, with a sensitive 2-D detector, such as a charge coupled detector (CCD).
  • CCD charge coupled detector
  • Other techniques such as scanning near-field optical microscopy (SNOM) can also be used, for example.
  • sequencing reagent refers to an agent that is suitable for use in any sequencing method for sequencing nucleic acid molecules on a solid support.
  • sequencing methods can include, but are not limited to, electrophoretic sequencing, synthetic sequencing (including combinatorial probe anchor polymerization sequencing), ligation sequencing, hybrid sequencing, single molecule sequencing, and real-time sequencing methods.
  • a signal eg, a fluorescent signal
  • sequencing reagent used in the method of the invention and the one or more reaction solutions containing the sequencing reagent will depend on the sequencing method employed. It is within the ability of those skilled in the art to determine the particular sequencing reagents used in the methods of the invention, as well as one or more reaction solutions containing sequencing reagents, according to sequencing methods.
  • the sequencing reagent comprises a sequencing reagent for ligation sequencing.
  • Such reagents can produce a ligation product on the sequenced nucleic acid molecule and generate a signal representative of the nucleotide sequence of the nucleic acid molecule.
  • Ligation sequencing as used herein is a variety of ligation sequencing methods well known in the art. Basically, ligation sequencing involves the hybridization and ligation of labeled (eg, fluorescently labeled) sequencing probes and anchor probes (also referred to as "primers" in SOLiD sequencing) to DNA strands.
  • the sequencing probe contains one or two fixed known sequencing bases (single base sequencing probe or double base sequencing probe) and a series of degenerate or universal bases that allow for sequencing probes and nucleic acid templates Perform complementary pairing.
  • the anchor probe sequence is complementary to an adaptor sequence on the nucleic acid template (an adaptor sequence means an oligonucleotide of known sequence in the nucleic acid template) and provides a known sequence of sites for initiation of ligation. After ligation, the template is imaged and the one or two known bases in the sequencing probe are identified. After the complete removal of the anchor probe-sequencing probe complex or after cleavage removal of the label (eg, fluorophore) and regeneration of the junction site, the next ligation sequencing cycle begins.
  • an adaptor sequence means an oligonucleotide of known sequence in the nucleic acid template
  • the sequencing reagent comprises an anchor probe, a labeled (eg, fluorescently labeled) sequencing probe, a ligase, or a mixture thereof, and one comprising a sequencing reagent
  • the one or more reaction solutions include a solution comprising an anchor probe, a labeled (eg, fluorescently labeled) sequencing probe, a ligase, or a mixture thereof, provided that the solid support and anchor probe, labeled sequencing probe
  • Each of the ligases is contacted, and wherein the signal on the solid support is produced by a labeled sequencing probe that is complementary to a nucleic acid molecule on a solid support, the labeled sequencing probe
  • An anchor probe that is complementary to the same nucleic acid molecule is ligated via a ligase.
  • the sequencing reagent may not comprise an anchor probe and the solid support and the anchor probe (also referred to as "in SOLiD sequencing") prior to treatment of the solid support with one or more reaction solutions containing the sequencing reagent
  • the primer contacts such that the anchor probe hybridizes to the nucleic acid molecule on the solid support.
  • the one or more reaction solutions containing the sequencing reagent can include only one reaction solution comprising a ligase and a labeled sequencing probe, optionally, Also included is an anchor probe; or one or more reaction solutions containing a sequencing reagent may also include two reaction solutions, one of which contains an anchor probe and the other of which contains a labeled sequencing probe , And at least one of the two reaction solutions comprises a ligase, in which case sequencing of the nucleic acid molecule comprises immersing the solid support in a reaction solution comprising an anchor probe, washing, and then immersing in the inclusion of the labeled The reaction solution of the sequencing probe is washed and washed.
  • the one or more reaction solutions containing the sequencing reagent may further comprise three reaction solutions comprising an anchor probe, a labeled sequencing probe and a ligase, respectively, in which case the nucleic acid molecule is sequenced.
  • the method includes: immersing a solid support in a reaction solution containing an anchor probe, washing, then immersing in a reaction solution containing a labeled sequencing probe, washing, and then immersing in a reaction solution containing a ligase, washing.
  • the one or more reaction solutions containing the sequencing reagent can include both a ligase, a first anchor probe, a second anchor probe, and labeled sequencing. Only one reaction solution of the probe, or a plurality of reaction solutions containing a ligase, a first anchor probe, a second anchor probe, a labeled sequencing probe, or a mixture thereof, respectively.
  • ligation sequencing in particular with regard to anchoring probes, sequencing probes, see, for example, WO2013066975, U.S. Patent Nos. 60/992,485, 61/026,337, 61/035,914, and 61/061,134.
  • ligation sequencing includes combined probe anchor ligation (cPAL) sequencing (see document WO2013066975).
  • ligase refers to a nucleic acid modifying enzyme that catalyzes the intramolecular formation and intermolecular formation of a phosphodiester bond between the 5'-phosphate and 3'-hydroxyl ends of a nucleic acid strand.
  • Ligase can be obtained from recombinant or natural sources.
  • One or more low temperature (eg, room temperature or lower) ligases eg, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, and/or E. coli DNA ligase
  • the ligase may also be a thermostable ligase.
  • a thermostable ligase from a thermophilic organism can be used.
  • thermostable DNA ligases include, but are not limited to, Tth DNA ligase (from Thermus thermophilus, available from, for example, Eurogentec and GeneCraft); Pfu DNA ligase (from intense Hyperthermophilic ligase of Pyrococcus furiosus; Taq ligase (from Thermus aquaticus), and any other suitable thermostable ligase, or any combination thereof.
  • the sequencing reagent comprises a sequencing reagent for sequencing by synthesis.
  • a sequencing reagent for sequencing by synthesis Such an agent may be a reagent that performs polymerization by using the sequenced nucleic acid molecule as a template and generates a signal representing a nucleotide sequence of the nucleic acid molecule,
  • Synthetic sequencing as used herein is a variety of synthetic sequencing methods well known in the art. Basically, synthetic sequencing involves first hybridizing the sequenced nucleic acid molecule to a sequencing primer, followed by polymerization of the labeled nucleic acid molecule as a template at the 3' end of the sequencing primer (eg, fluorescent labeling) in the presence of a polymerase. Nucleotide. After polymerization, the labeled nucleotide is identified by detecting the label. After removal of the label (eg, fluorophore) from the labeled nucleotide, the next polymerization sequencing cycle begins.
  • a sequencing primer e.g, fluorescent labeling
  • the sequenced nucleic acid molecule is immobilized on a solid support by hybridization to a sequencing primer immobilized on a solid support.
  • the solid support is contacted with a sequencing primer prior to treatment of the solid support on which the nucleic acid molecule is immobilized using one or more reaction solutions containing the sequencing reagent, such that the sequencing primer hybridizes to A nucleic acid molecule on the solid support.
  • the nucleic acid molecules on the solid support are sequenced using the BGISEQ-500 sequencing platform.
  • the sequencing reagent comprises a polymerase, a labeled (eg, fluorescently labeled) nucleotide, or a mixture thereof, and one or more reactions comprising a sequencing reagent
  • the solution includes a solution comprising a polymerase, a labeled (eg, fluorescently labeled) nucleotide, or a mixture thereof, provided that the solid support contacts each of the polymerase, the labeled nucleotide, and wherein the solid
  • the signal on the support is produced by a labeled nucleotide that is complementary bound to a nucleic acid molecule on a solid support, the labeled nucleotide being polymerized via a polymerase with a nucleic acid molecule on a solid support as a template to The 3' end of the primer was sequenced.
  • the regeneration reagent comprises an agent for removing the label from the labeled (eg, fluorescently labeled) nucleotide.
  • the solid support is optionally treated with a reaction solution comprising a polymerase and a labeled (e.g., fluorescently labeled) nucleotide.
  • the sequencing reagent comprises a reagent in a sequencing kit for the BGISEQ-500 sequencing platform, such as the BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizhi Technology Co., Ltd., article number Sequencing reagent in PF-UM-PEV30).
  • a sequencing kit for the BGISEQ-500 sequencing platform such as the BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizhi Technology Co., Ltd., article number Sequencing reagent in PF-UM-PEV30).
  • Labeled (e.g., fluorescently labeled) nucleotides suitable for use in synthetic sequencing and reagents for removing labels from labeled (e.g., fluorescently labeled) nucleotides are well known in the art, such nucleotides and reagents Non-limiting examples can be found in, for example, the labeled nucleotides disclosed in WO04018497, WO04018493, U.S. Patent No. 7,427, 673, and U.S. Pat. Nucleotide removal of labeled reagents.
  • the labeled nucleotide may further comprise a 3' blocking group.
  • the 3' blocking group prevents the incorporation of other nucleotides when the labeled nucleotide is polymerized onto the growing nucleotide strand.
  • 3' blocking The group is removed along with the label.
  • Suitable 3' blocking groups and reagents for removing 3' blocking groups from nucleotides are well known in the art, and non-limiting examples of such 3' blocking groups and reagents can be found, for example, in Greene. & Wuts, Protective Groups in Organic Synthesis, John Wiley & Sons. Guillier; Metzker et al.
  • the label on the labeled nucleotide itself can be used as a 3' blocking group.
  • a label may be of a size or structure sufficient to function to block the incorporation of other nucleotides into the polynucleotide strand. The blockage may be due to steric hindrance or may be due to a combination of size, charge and structure.
  • the labeling and blocking groups on the labeled nucleotides can be different, but preferably the labeling and blocking groups can be removed from the nucleotides in the same manner.
  • the regenerative reagents described herein comprise an agent that removes both the label and the 3' blocking group from the labeled nucleotide.
  • the regenerative reagents described herein can include an agent that can remove the label from the labeled nucleotide and an agent that can remove the 3' blocking group from the labeled nucleotide.
  • polymerase refers to an enzyme that synthesizes a nucleic acid strand or polymer, including DNA polymerases and RNA polymerases.
  • the polymerase used herein is a DNA polymerase. May be used is one polymerase Sequenase TM (7 DNA polymerase from bacteriophage enzyme, which sequence is modified to improve its properties - see Tabor and Richarson, Proc.Nat.Acad.Sci.USA, 84: 4767-4771 (1987), available from, for example, United States Biochemical Corporation, Cleveland, Ohio).
  • TM Sequenase
  • Other polymerases that can be used in place of Sequenase (TM) include, but are not limited to, the Klenow fragment of DNA polymerase I, AMV reverse transcriptase, and Taq polymerase. Further description of the polymerase can also be found in WO05024010 and WO06120433, the entire contents of which are hereby incorporated by reference.
  • the polymerization conditions generally used are the polymerization conditions of these enzymes known in the art.
  • the polymerization conditions include a temperature in the range of from about room temperature to about 45 °C; and pH 7 buffer to 8, preferably pH 7.3 to 7.7; an enzyme concentration of from about 0.01 units / [mu] l to about 1 unit / The microliter is for a period of from about 1 to about 20 minutes, preferably from 1 to 5 minutes.
  • Typical for Sequenase TM buffer consisting of: 0.040M Tris HCI (pH7.5) 0.050M sodium chloride, 0.010M magnesium chloride, 0.010M dithiothreitol.
  • these typical conditions include a temperature in the range of from about 10 ° C to about 45 ° C, preferably from about 15 ° C to about 40 ° C; a buffer of pH 6.8 to 7.4, preferably pH 7.0 To 7.4; the enzyme concentration is from about 0.01 unit/microliter to about 1 unit/microliter, preferably from about 0.02 to about 0.15 unit/microliter, and the reaction time is from about 1 to about 40 minutes.
  • a typical buffer for the Klenow fragment of DNA polymerase I consists of 0.05 M trishydroxymethylammonium chloride, pH 7.5 0.05 M magnesium chloride, 0.05 M sodium chloride, 0.010 M dithiothreitol.
  • complementary refers to hybridization or base pairing or duplex formation between nucleotides or nucleic acids. If a nucleotide of one nucleic acid at a given position is capable of forming a hydrogen bond with a nucleotide of another nucleic acid, then the two nucleic acids are considered to be complementary to each other at that position.
  • Complementary nucleotides are typically A and T (or A and U) or C and G.
  • RNA or DNA molecules are said to be substantially complementary.
  • hybridization refers to a sufficient hydrogen bonding between complementary nucleotide or nucleotide bases, which may be, for example, Watson-Crick, Hoogsteen or reverse Hoogsteen hydrogen bonding, Stable and specific binding occurs between nucleic acid strands.
  • Hybridization capacity is determined according to stringent conditions, including appropriate buffer concentrations and temperatures, which allow for specific hybridization to a target nucleic acid having a fully or partially complementary region. Therefore, not all nucleotides of a nucleic acid need to be complementary.
  • a nucleic acid strand is "substantially complementary" when the nucleic acid strand hybridizes to all, part or overlapping regions of the target nucleic acid.
  • the term "regeneration agent” refers to an agent that, upon sequencing a nucleic acid molecule on a solid support, is capable of eliminating sequencing signals (eg, fluorescent signals) on a solid support to enable next round of sequencing.
  • the particular regenerating reagent used in the methods of the invention will depend on the sequencing method employed. Determining the specific regenerative reagents used in the methods of the invention according to sequencing methods is within the skill of those skilled in the art. Inside.
  • a regenerating reagent in ligation sequencing, includes an agent that eliminates a signal representing a nucleotide sequence of the nucleic acid molecule from the sequenced nucleic acid molecule and enables initiation of the next ligation sequencing reaction, preferably such a reagent It is a reducing agent.
  • the regenerating reagent comprises an agent that removes a label (eg, a fluorescent label) from a labeled (eg, fluorescently labeled) sequencing probe, preferably such reagent is a reducing agent.
  • the regenerating reagent comprises reagents for eliminating a signal representative of the nucleotide sequence of the nucleic acid molecule from the sequenced nucleic acid molecule and enabling initiation of the next polymerization sequencing reaction, preferably, Such an agent is a reducing agent.
  • the regenerating reagent comprises an agent that removes a label (eg, a fluorescent label) from a labeled (eg, fluorescently labeled) nucleotide, preferably such reagent is a reducing agent.
  • the reagent that removes the label (eg, fluorescent label) from the labeled (eg, fluorescently labeled) sequencing probe or nucleotide can be a reducing agent, such as by linking the label with a functional group that is susceptible to reductive cleavage. In the case of nucleotides.
  • a reducing agent examples include, but are not limited to, a metal catalyst such as a nickel catalyst, a platinum catalyst, a palladium catalyst, an alcohol compound, a phenol compound, an aldehyde compound, an olefin compound, an amine compound, a ketone compound, a phosphorus-containing compound, Peroxides, thiol compounds such as ⁇ -mercaptoethanol, dithiothreitol, phosphines (eg, tris(hydroxymethyl)phosphine (THP), tris(2-carboxyethyl)phosphine, etc.) (TCEP), etc.
  • a metal catalyst such as a nickel catalyst, a platinum catalyst, a palladium catalyst
  • an alcohol compound such as a nickel catalyst, a platinum catalyst, a palladium catalyst
  • an alcohol compound such as a phenol compound, an aldehyde compound, an olefin compound, an amine compound, a ketone compound, a
  • the reducing agent is any suitable reducing agent capable of reducing disulfide bonds, including but not limited to thiol compounds such as ⁇ -mercaptoethanol, dithiothreitol, phosphines (eg, tris(hydroxymethyl)phosphine (THP) ), tris(2-carboxyethyl)phosphine, etc.) (TCEP) and the like.
  • thiol compounds such as ⁇ -mercaptoethanol, dithiothreitol, phosphines (eg, tris(hydroxymethyl)phosphine (THP) ), tris(2-carboxyethyl)phosphine, etc.) (TCEP) and the like.
  • the regenerant containing reductant used in embodiments of the invention should be capable of removing labels (e.g., fluorescent labels) from labeled (e.g., fluorescently labeled) sequencing probes or nucleotides without disrupting sequencing.
  • labels e.g., fluorescent labels
  • the regenerating reagent comprises a regenerative reagent for use in a sequencing kit for the BGISEQ-500 sequencing platform, such as the BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhicheng Technology Co., Ltd., Regeneration reagent in item number PF-UM-PEV30).
  • a sequencing kit for the BGISEQ-500 sequencing platform such as the BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhicheng Technology Co., Ltd., Regeneration reagent in item number PF-UM-PEV30).
  • the invention relates to a method of eliminating regenerative reagent residues comprising: a. providing a solid support on which a nucleic acid molecule is immobilized, b. using a sequencing kit for the BGISEQ-500 sequencing platform
  • the regenerative reagent such as the BGISEQ-500 high-throughput sequencing kit (SE50 V3.0, Shenzhen Huada Zhizhi Technology Co., Ltd., Cat. No. PF-UM-PEV30), treats the solid support, the regenerating reagent Containing a reducing agent, and c. treating the solid with a washing solution to which a compound having a disulfide bond structure is added Support.
  • the solid support is free or substantially free of regenerating reagent after the treatment of step c.
  • a sequencing reagent containing a sequencing kit for the BGISEQ-500 sequencing platform eg, BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Hua) is optionally used.
  • the solid support is treated with one or more reaction solutions of a sequencing reagent in Dazhicheng Technology Co., Ltd., No.
  • PF-UM-PEV30 to generate a nucleotide sequence representing the nucleic acid molecule on a solid support a signal (eg, a fluorescent signal), and wherein the regeneration reagent can eliminate the signal (eg, a fluorescent signal) on the solid support.
  • a sequencing reagent in a sequencing kit for the BGI-SEQ500 sequencing platform eg, BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0
  • SE50 V3.0 the sequencing kit for the BGI-SEQ500 sequencing platform
  • the solid support is treated with one or more reaction solutions of Shenzhen Huada Zhixing Technology Co., Ltd., the sequencing reagent in PF-UM-PEV30).
  • the invention also relates to a kit for sequencing, the kit comprising a sequencing reagent, a washing reagent, a regenerating reagent comprising a reducing agent, and a washing solution to which a compound containing a disulfide bond structure is added, as described herein,
  • the sequencing reagent can generate a signal (eg, a fluorescent signal) representing a nucleotide sequence of the nucleic acid molecule on a solid support to which the nucleic acid molecule is immobilized, and wherein the regeneration reagent can eliminate the solid support The signal.
  • the sequencing reagent comprises a sequencing reagent for ligation sequencing or synthetic sequencing.
  • a wash reagent is any solution that is capable of washing away a substance on its solid support that is non-specifically bound thereto and does not adversely affect subsequent reactions.
  • the wash reagent contains a buffer, such as an organic salt, to maintain a stable pH of from about pH 6 to pH 9, and may also contain monovalent or divalent cations to remove non-specifically bound molecules from the solid support.
  • Exemplary washing reagents can include, for example, 100 mM Tris-HCl buffer at pH 6.5, TE buffer (Tris-HCl pH 8, 10 mM and EDTA, 1 mM), and the like.
  • the buffering agent can also be, for example, a buffering agent as described below.
  • a compound having a disulfide bond structure includes any compound having a bond (also referred to as a disulfide bond) between sulfur atoms in the -S-S-form in its chemical structure.
  • the compound containing a disulfide bond structure has the formula -RSS-R'-, wherein R and R' are each independently selected from hydrogen, optionally substituted alkyl, optionally substituted alkenyl, Optionally substituted alkynyl, optionally substituted cycloalkyl, optionally substituted cycloalkenyl, optionally substituted cycloalkynyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted Heterocyclyl, optionally substituted heteroalicyclic, optionally substituted aralkyl or optionally substituted (heteroalicyclic)alkyl.
  • R and R' are each independently selected from hydrogen, optionally substituted alkyl, optionally substituted alkenyl, Optionally substituted alkynyl, optionally substituted cycloalkyl, optionally substituted cycloalkenyl, optionally substituted cycloalkynyl, optionally substituted aryl, optionally substituted heteroaryl,
  • the compound containing a disulfide bond structure has the formula -C(R 1 ) m (R 2 ) m' -C(R 3 ) p -SSC(R 4 ) q -C(R 5 ) n (R 6 ) n' -, wherein R 1 , R 2 , R 3 , R 4 , R 5 and R 6 are each independently selected from the group consisting of hydrogen, oxo, hydroxy, protected hydroxy, alkoxy, aryloxy Base, acyl, decyl, alkylthio, arylthio, cyano, halogen, thiocarbonyl, O-carbamoyl, N-carbamoyl, O-thiocarbamoyl, N-thiocarbamoyl , C-amido, N-acylamino, S-sulfonylamino, N-sulfonylamino, C-carboxyl
  • the compound containing a disulfide bond structure may be selected from, but not limited to, such as diethyl disulfide, di-n-propyl disulfide, di-n-butyl disulfide, di- Sec-butyl disulfide, di-tert-butyl disulfide, di-n-pentyl disulfide, di-tert-amyl disulfide, di-tert-hexyl disulfide, di-n-octyl Dithioether, di-tert-octyl disulfide, di-n-dodecyl disulfide, di-tert-dodecyl disulfide, di-n-stearyl disulfide, ethyl- Dialkyl disulfide compounds such as n-propyl disulfide, ethyl-tert-butyl disulfide, ethyl-tert-but
  • the compound containing a disulfide bond structure is selected from the group consisting of cystamine, cystine, 5,5'-dithiobis(2-nitrobenzoic acid), and 2-hydroxyethyl disulfide. Its derivatives.
  • the R, R', R 1 , R 2 , R 3 , R 4 , R 5 and R 6 groups represent substituents which are capable of attaching to a specified atom.
  • the R, R', R 1 , R 2 , R 3 , R 4 , R 5 and R 6 groups may be substituted or unsubstituted.
  • the R, R', R 1 , R 2 , R 3 , R 4 , R 5 and R 6 groups together with the atoms to which they are attached are capable of forming a cycloalkyl, aryl, heteroaryl or heterocyclic ring.
  • the group may be unsubstituted or substituted with one or more specified substituents.
  • the substituent may be selected from one or more of the specified substituents.
  • substituent it is meant that the specified "optionally substituted” or “substituted” group may be substituted, individually and independently, by one or more groups, which are individually and independently selected from the group consisting of Functional groups, including but not limited to: alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl, heteroarylalkyl, (heteroalicyclic)alkyl, hydroxy, protected hydroxy, alkoxy, aryloxy, acyl, decyl, alkylthio, arylthio, cyano, halogen, thiocarbonyl, O-carbamoyl , N-carbamoyl, O-thiocarbamoyl, N-thiocarbamoyl, C-amido, N-acylamino, S-sulfonyla
  • Functional groups
  • alkyl refers to a straight or branched hydrocarbon chain that includes a fully saturated (no double or triple bond) hydrocarbon group.
  • the alkyl group can have from 1 to 20 carbon atoms (when it appears herein, such as the numerical range of "1-20” refers to a range including endpoints within a given range.
  • Each integer; for example, “1-20 carbon atoms” means that the alkyl group may consist of 1 carbon atom, 2 carbon atoms, 3 carbon atoms, etc. up to and including 20 carbon atoms, however this definition also covers The occurrence of the term "alkyl” which does not specify a range of values).
  • the alkyl group can also be a medium size alkyl group having from about 7 to about 10 carbon atoms.
  • the alkyl group may also be a lower alkyl group having 1 to 6 carbon atoms.
  • the alkyl group of the compound can be designated as "C 1 -C 4 alkyl group” or the like.
  • “C 1 -C 4 alkyl” means that one to four carbon atoms are present in the alkyl chain, ie, the alkyl chain is selected from the group consisting of methyl, ethyl, propyl, isopropyl, n-butyl Base, isobutyl, sec-butyl and tert-butyl.
  • Typical alkyl groups include, but are not limited to, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, tert-butyl, pentyl, and hexyl.
  • the alkyl group can be substituted or unsubstituted.
  • alkenyl refers to a hydrocarbyl group containing one or more double bonds in a straight or branched hydrocarbon chain.
  • the alkenyl group can be unsubstituted or substituted.
  • alkynyl refers to a hydrocarbyl group containing one or more triple bonds in a straight or branched hydrocarbon chain.
  • An alkynyl group can be unsubstituted or substituted.
  • cycloalkyl refers to a fully saturated (no double or triple bond) monocyclic or polycyclic hydrocarbon ring system. When composed of two or more rings, the rings may be bonded together in a fused form.
  • the cycloalkyl group may have 3 to 10 atoms in the ring. In certain embodiments, a cycloalkyl group can contain from 3 to 8 atoms in the ring.
  • a cycloalkyl group can be unsubstituted or substituted.
  • Typical cycloalkyl groups include, but are in no way limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl and cyclooctyl.
  • aryl refers to a monocyclic or polycyclic aromatic ring system of carbon rings (all carbon) (including, for example, fused, bridged or spiro ring systems in which two carbon rings share a chemical bond, For example, one or more aromatic rings carry one or more aromatic or non-aromatic rings) having a fully delocalized pi-electron system throughout at least one ring.
  • the number of carbon atoms in the aryl group is variable.
  • the aryl group can be a C 6 -C 14 aryl group, a C 6 -C 1o aryl group, or a C 6 aryl group.
  • Examples of aryl groups include, but are not limited to, benzene, naphthalene, and chamomile rings.
  • the aryl group can be substituted or unsubstituted.
  • heterocyclyl refers to a ring system comprising at least one hetero atom (eg, O, N, S). Such systems may be unsaturated, may include partial unsaturation, or may contain some aromatic moieties, or be completely aromatic.
  • the heterocyclic group may be unsubstituted or substituted.
  • heteroaryl refers to a monocyclic or polycyclic aromatic ring system (a ring system having at least one ring that is a fully delocalized pi-electron system) that contains one or more heteroatoms, ie, carbon Elements other than, including but not limited to nitrogen, oxygen and sulfur, and comprising at least one aromatic ring.
  • the number of atoms in the ring of the heteroaryl group is variable.
  • a heteroaryl group can contain 4-14 ring atoms, 5-10 ring atoms, or 5-6 ring atoms.
  • heteroaryl includes fused ring systems in which two rings, for example at least one aromatic ring and at least one heteroaryl ring, or at least two heteroaryl rings share at least one chemical bond.
  • heteroaromatic rings include, but are not limited to, furan, furazan, thiophene, benzothiophene, pyridazine, pyrrole, oxazole, benzoxazole, 1,2,3-oxadiazole, 1,2,4-oxa Diazole, thiazole, 1,2,3-thiadiazole, 1,2,4-thiadiazole, benzothiazole, imidazole, benzimidazole, indole, oxazole, pyrazole, benzopyrazole, different Oxazole, benzisoxazole, isothiazole, triazole, benzotriazole, thiadiazole, tetrazole, pyridine, pyridazine, pyr
  • heteroalicyclic refers to three, four, five, six, seven, eight, nine, ten, up to 18-membered monocyclic, bicyclic, and tricyclic
  • a ring system in which a carbon atom and 1-5 heteroatoms together form the ring system.
  • the heterocycle can optionally contain one or more unsaturated bonds positioned in such a manner that there is no fully delocalized pi-electron system throughout all of the rings.
  • the heteroatoms are independently selected from the group consisting of oxygen, sulfur, and nitrogen.
  • the heterocyclic ring may further comprise one or more carbonyl or thiocarbonyl functional groups, such that the definition includes oxo systems and sulfur Generation systems such as lactams, lactones, cyclic imides, cyclic thioimides, and cyclic carbamates. When composed of two or more rings, the rings may be bonded together in a fused form. Additionally, any nitrogen in the heteroalicyclic ring can be quaternized.
  • the heteroalicyclic or heteroalicyclic group may be unsubstituted or substituted.
  • heteroalicyclic or “heteroalicyclic” groups include, but are not limited to, 1,3-dioxin, 1,3-dioxane, 1,4-dioxane, 1,2-dioxolane, 1,3-dioxolan, 1,4-dioxolane, 1,3-oxathiane, 1,4-oxethiohexadiene, 1,3-oxathiolane, 1,3-dithiolepine, 1,3-dithiolane, 1,4-oxathiane, tetrahydro-1,4- Thiazide, 2H-1,2-oxazine, maleimide, succinimide, barbituric acid, thiobarbituric acid, dioxopiperazine, hydantoin, dihydrouracil , trioxane, hexahydro-1,3,5-triazine, imidazoline, imidazolidine, isox
  • aralkyl and aryl (alkyl) refer to an aryl group as a substituent attached through a lower alkylene group.
  • the lower alkylene and aryl groups of the aralkyl group may be substituted or unsubstituted. Examples include, but are not limited to, benzyl, 2-phenylalkyl, 3-phenylalkyl, and naphthylalkyl.
  • heteroaryl and “heteroaryl (alkyl)” refers to a heteroaryl group attached as a substituent through a lower alkylene group.
  • the lower alkylene and heteroaryl groups of the heteroarylalkyl group may be substituted or unsubstituted. Examples include, but are not limited to, 2-thienylalkyl, 3-thienylalkyl, furylalkyl, thienylalkyl, pyrrolylalkyl, pyridylalkyl, isoxazolylalkyl, and imidazolylalkyl And their benzo-fused analogs.
  • hydrocarbyloxy refers to the formula -OR, wherein R is alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl or cycloalkynyl as defined above.
  • alkoxy groups are methoxy, ethoxy, n-propoxy, 1-methylethoxy (isopropoxy), n-butoxy, isobutoxy, sec-butoxy And tert-butoxy.
  • the alkoxy group may be substituted or unsubstituted.
  • the C-acylamino group may be substituted or unsubstituted.
  • R and R a may independently be hydrogen, alkyl, alkenyl, alkynyl.
  • the N-acylamino group may be substituted or unsubstituted.
  • halogen atom means any of the radiation-stable atoms of the seventh column of the periodic table, such as fluorine, chlorine, bromine and iodine.
  • amine refers to a -NH 2 group, wherein one or more hydrogen may optionally be substituted with R groups.
  • R may independently be hydrogen, alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl or (heteroalicyclic) alkyl.
  • aldehyde refers to a -R c -C (O) H group, wherein R c may be absent, or independently selected from alkylene, alkenylene, alkynylene, cycloalkylene, alkylene Cycloalkenyl, cycloalkynylene, arylene, heteroarylene, heteroalicyclic, aralkylene or (heteroalicyclic)alkyl.
  • amino refers to a -NH 2 group.
  • hydroxy refers to an -OH group.
  • cyano refers to a "-CN” group.
  • azido refers to a -N 3 group.
  • mercapto refers to a -SH group.
  • carboxylic acid refers to -C(O)OH.
  • thiocyanate refers to a -S-C ⁇ N group.
  • amine group refers to the group -O-NH 2, wherein one or more hydrogens on -NH 2 may optionally be substituted with R groups.
  • R may independently be hydrogen, alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl or (heteroalicyclic) alkyl.
  • the term "derivative” means a similar compound that is derived from a specified compound by physical or chemical processes. Derivatives can be prepared using standard procedures known to those skilled in the art of synthetic organic chemistry and are for example, by J. March, "Advanced Organic Chemistry: Reactions, Mechanisms and Structure (Advanced Organic Chemistry: Reactions, Mechanisms). And Structure ”)", 4th edition (New York: Wiley-Interscience, 1992).
  • a base addition salt is prepared from a compound using conventional means, including reacting one or more free hydroxyl groups of the compound with a suitable base.
  • the compound is dissolved in a polar organic solvent such as methanol or ethanol and a base is added thereto.
  • Salts suitable for the formation of cystamines of base addition salts refer to cystamine salts derived from inorganic and organic acids and bases.
  • suitable acids include, but are not limited to, hydrochloric acid, hydrobromic acid, sulfuric acid, k nitrate, perchloric acid, fumaric acid, maleic acid, phosphoric acid, glycolic acid, lactic acid, salicylic acid, succinic acid, p-toluenesulfonic acid, Tartaric acid, acetic acid, trifluoroacetic acid, citric acid, methanesulfonic acid, formic acid, benzoic acid, malonic acid, naphthalene-2-sulfonic acid, benzenesulfonic acid and oxalic acid.
  • salts derived from amino acids such as L-arginine, L-lysine.
  • suitable bases include, but are not limited to, inorganic bases such as sodium hydroxide, potassium hydroxide, ammonium hydroxide, calcium hydroxide, trimethylamine, and the like.
  • a wash solution comprising a compound having a disulfide bond structure also contains a buffer.
  • the buffer is any buffer suitable for treating a fixed support to which a nucleic acid molecule is immobilized. It should be understood that the treatment of such a buffer with a solid support should not deal with the structure of the nucleic acid molecule on the solid support, the binding of the nucleic acid molecule to the solid support, and the sequencing enzyme, probe and nucleotide structure in subsequent sequencing reagents and Activity has an adverse effect. Suitable buffers can be readily determined by those skilled in the art.
  • Suitable buffering agents may contain, for example, an organic salt to maintain a stable pH of from about pH 6 to pH 9, and may also contain monovalent or divalent cations or detergents to remove non-specifically bound molecules from the solid support.
  • Exemplary buffers can include, for example, Tris-HCl buffer (eg, 100 mM Tris-HCl buffer at pH 6.5), SSC buffer (eg, 0.3 x SSC/0.1% Tween), TE buffer (eg, 10 mM Tris with pH 8). -HCl and 1 mM EDTA in TE buffer) and the like.
  • the disulfide-containing compound of the present invention is capable of undergoing a redox reaction with a reducing agent in a regenerating agent, and at the same time, such a disulfide-containing bond of the present invention
  • the compound does not cause any significant adverse effects on the sequencing of the nucleic acid, such as the nature of the sequencing reagent (eg, sequencing enzyme, labeled probe or nucleotide), the nature of the nucleic acid molecule on the solid support, and The binding of the solid support to the nucleic acid molecule immobilized thereon causes any significant adverse effects.
  • many other commonly used oxidants can adversely affect sequencing, such as disrupting the structure of dNTPs, affecting the activity of polymerases, and the like.
  • the oxidizing property of the disulfide bond-containing compound of the present invention may be derived only from a disulfide bond structure. That is, the disulfide bond-containing compound of the present invention may not contain any other oxidizing group other than -S-S-.
  • the disulfide bond-containing compound of the present invention may contain at least one disulfide bond.
  • a compound containing a disulfide bond structure may contain at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more disulfide bonds.
  • the compound containing a disulfide bond structure contains 1-3 disulfide bonds.
  • the compound containing a disulfide bond structure contains 1-2 disulfide bonds.
  • the compound containing a disulfide bond structure contains only one disulfide bond.
  • the amount of the disulfide-containing structure-containing compound in the solution containing the compound is sufficient to significantly reduce or even eliminate or be in fluid communication with the solid support during the sequencing process. Any amount of residual reagent regenerating on the flow channel.
  • the amount of the disulfide-containing structure-containing compound in the solution containing the compound can be, for example, at least about 0.1 mM, at least about 0.2 mM, at least about 0.3 mM, at least about 0.4 mM, at least about 0.5 mM, At least about 0.6 mM, at least about 0.7 mM, at least about 0.8 mM, at least about 0.9 mM, at least about 0.10 mM, at least about 1 mM, at least about 2 mM, at least about 3 mM, at least about 4 mM, at least about 5 mM, at least about 6 mM, at least About 7 mM, at least about 8 mM, at least about 9 mM, at least about 10 mM, at least about 15 mM, at least about 20 mM, at least about 25 mM, at least about 30 mM, at least about 35 mM, at least about 40 mM, at least about 45 mM, at
  • the amount of the disulfide-containing structure-containing compound in the solution containing the compound can be, for example, up to about 1 mM, up to about 2 mM, up to about 3 mM, up to about 4 mM, up to about 5 mM, up to about 6 mM, up to About 7 mM, up to about 8 mM, up to about 9 mM, up to about 10 mM, up to about 15 mM, up to about 20 mM, up to about 25 mM, up to about 30 mM, up to about 35 mM, up to about 40 mM, up to about 45 mM, up to about 50 mM, up to about 55 mM Up to about 60 mM, up to about 70 mM, up to about 80 mM, up to about 90 mM, up to about 100 mM, or a range between any two of the foregoing.
  • the amount of the compound containing a disulfide bond structure in the solution containing the compound may be, for example, from about 1 mM to about 50 mM. In one embodiment, the amount of the compound containing a disulfide bond structure in the solution containing the compound may be, for example, from about 2 mM to about 20 mM. In one embodiment, the amount of the compound containing a disulfide bond structure in the solution containing the compound may be, for example, from about 5 mM to about 10 mM.
  • RunOn is used to evaluate the phenomenon of base synthesis advancement in sequencing.
  • Advance base synthesis directly leads to a decline in the quality of sequencing data.
  • RunOn detection can be performed and reported according to the software that comes with the BGISEQ-500 sequencing platform (Shenzhen Huada Zhizhi Technology Co., Ltd.) or other suitable methods. Residues of regenerated reagents typically result in higher RunOn values during sequencing.
  • the inventors have discovered that the addition of a compound containing a disulfide bond structure as described herein during the washing of the regenerating agent in the sequencing procedure can significantly reduce the RunOn value and is not detrimental
  • the effect of the sequencing reagent used in the subsequent sequencing cycle is affected.
  • other oxidants such as hydrogen peroxide, iron sulfate, and copper hydroxide severely interfere with the sequencing reaction, resulting in the inability to measure the RunOn value.
  • Figure 1 shows a comparison of RunOn values for sequencing using a wash solution with or without added cystamine.
  • Figure 2 shows a comparison of RunOn values for sequencing using a wash solution with or without added cystine.
  • Figure 3 shows a comparison of RunOn values for sequencing using a wash solution with or without addition of 5,5'-dithiobis(2-nitrobenzoic acid).
  • Figure 4 shows a comparison of RunOn values for sequencing using a wash solution with or without addition of 2-hydroxyethyl disulfide.
  • Figure 5 shows a comparison of RunOn values for sequencing using a wash solution with or without added cystamine.
  • Figure 6 shows a comparison of signal intensities for sequencing using a wash solution supplemented with the addition of hydrogen peroxide, iron sulfate or copper hydroxide or 2-hydroxyethyl disulfide.
  • MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip.
  • the prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions.
  • the platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 30 bases and selected 1*1 of the photographed area.
  • MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip.
  • the prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions.
  • the platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 20 bases and selected 1*1 of the photographed area.
  • MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip.
  • the prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions.
  • the platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 40 bases and selected 1*1 of the photographed area.
  • MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip.
  • the prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions.
  • the platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 50 bases and selected 1*1 of the photographed area.
  • MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip.
  • the prepared sequencing chip was subjected to the sequence determination of 50 bases according to the following procedure, and the photographing area of 1*1 was selected:
  • the chip was immersed in the polymerization reagent for about 1 min, the chip was removed, and then immersed in the washing reagent 2 for about 1 min, the chip was removed, and then immersed in the protective reagent for about 1 min, the chip was removed, and photographed for 20-30 min to detect the representative.
  • Fluorescence signal of identity information of one base then immersing the chip in the regeneration reagent for about 1 min, removing the chip, then immersing in the washing reagent 1 for about 1 min, removing the chip; then repeating the foregoing steps on the chip to proceed to the next base Sequencing of the base; the reagents used were all from the BGISEQ-500 sequencing kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30), the temperature and other reaction conditions of each step and the photographing procedure were referenced. Instructions for the kit and standard procedures for the BGISEQ-500 sequencing platform were performed.
  • E. coli genomic DNA prepared using MGIEasy TM DNA library kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip on.
  • the prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions.
  • the platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the 50-base sequence determination and selected 1*1 of the photographed area, using the elution reagent 1 in the BGISEQ-500 sequencing kit V3.0 (recycling reagent in use) 50 mM 2-hydroxyethyl disulfide was added to the reagent for washing the chip after processing the chip.
  • the fluorescence signal intensity values of the four sets of sequencing experiments were output using the program provided by the BGISEQ-500 sequencing platform, and the results obtained are shown in FIG. 6.

Abstract

Provided is a method for treating a solid support fixed with nucleic acid molecules thereon, wherein the solid support is already treated by a regenerative agent and the regenerative agent comprises a reducing agent that can remove a blocking group and label from a 3' blocked labeled nucleotide. The method comprises using a washing solution added with a compound containing a disulfide bond structure to treat the solid support. The solid support is also optionally treated by using a reaction solution containing a polymerase, a 3' blocked labeled nucleotide and optional primers following the treatment with the washing solution, for instance, the primers contain the sequence complementary to at least a part of each nucleic acid molecule on the solid support.

Description

消除再生试剂残留的方法Method for eliminating residual reagent residues 技术领域Technical field
本发明涉及核酸测序领域。具体地,本发明涉及消除再生试剂残留的方法,其包括:a.提供其上固定有核酸分子的固体支持物,b.使用再生试剂处理所述固体支持物,所述再生试剂包含还原剂,和c.使用添加了含有二硫键结构的化合物的洗涤溶液处理所述固体支持物,其中在步骤b之前,任选地使用含有测序试剂的一种或多种反应溶液处理所述固体支持物,以在固体支持物上产生代表所述核酸分子的核苷酸序列的信号,并且其中所述再生试剂可以消除固体支持物上的所述信号,其中经过步骤c的处理之后,任选地还使用含有测序试剂的一种或多种反应溶液处理所述固体支持物。The invention relates to the field of nucleic acid sequencing. In particular, the present invention relates to a method of eliminating residual reagents of regenerating reagents, comprising: a. providing a solid support on which a nucleic acid molecule is immobilized, b. treating the solid support with a regenerating reagent, the regenerating reagent comprising a reducing agent, And c. treating the solid support with a wash solution to which a compound having a disulfide bond structure is added, wherein prior to step b, the solid support is optionally treated with one or more reaction solutions containing a sequencing reagent Generating a signal representative of the nucleotide sequence of the nucleic acid molecule on a solid support, and wherein the regeneration reagent can eliminate the signal on the solid support, wherein after the treatment of step c, optionally further The solid support is treated with one or more reaction solutions containing sequencing reagents.
背景技术Background technique
第二代测序技术基于第一代Sanger测序技术发展而来,具有低成本,高通量,自动化等特征,极大地推进了基因测序产业的发展。二代测序技术目前已经广泛应用于全基因组测序,转录组测序,宏基因组测序等,是分析生物的进化与分类,研究癌症,自闭症等疾病相关基因,以及进行体外诊断等的有力工具,促进了人们对于生命科学的进一步了解,也推动了健康产业的发展。The second generation of sequencing technology is based on the development of the first generation of Sanger sequencing technology, with low cost, high throughput, automation and other features, greatly promoting the development of the gene sequencing industry. Second-generation sequencing technology has been widely used in whole-genome sequencing, transcriptome sequencing, and metagenomic sequencing. It is a powerful tool for analyzing the evolution and classification of organisms, studying cancer-related genes such as autism, and conducting in vitro diagnostics. It has promoted people's understanding of life sciences and promoted the development of health industry.
现有的第二代测序技术主要包括合成法测序(Sequencing-by-Synthesis,SBS)技术(包括BGI的联合探针锚定聚合技术(cPAS)),半导体测序技术,以及连接法测序(Sequencing-by-Ligation,SBL)技术。Existing second-generation sequencing technologies mainly include Sequencing-by-Synthesis (SBS) technology (including BGI's Joint Probe Anchor Polymerization (cPAS)), semiconductor sequencing technology, and ligation sequencing (Sequencing- by-Ligation, SBL) technology.
对于现有的广泛使用的合成法或连接法测序平台,例如BGISEQ-500测序平台,在测序过程中,在用再生试剂对可检测标记进行切除后,再生试剂会残留在固体支持物表面或试剂输送管道中,对后续的测序试剂造成污染,从而使测序数据质量下降,影响测序准确性。尽管采用了高盐溶液来清洗芯片或管道,但是由于高盐溶液通过稀释作用和通过降低再生试剂与芯片、核酸或管道的物理结合力来降低再生试剂的残留量,因而需要耗费大量的高盐溶液,而且仅能够尽量地降低测序试剂中再生试剂的残留,而难以完全除去再生试剂,从而导致测序数据质量和测序准确性降低。For existing widely used synthetic or ligation sequencing platforms, such as the BGISEQ-500 sequencing platform, the regenerated reagent will remain on the surface or reagent of the solid support after resection of the detectable label with the regenerating reagent during the sequencing process. In the pipeline, the subsequent sequencing reagents are polluted, so that the quality of the sequencing data is degraded, which affects the sequencing accuracy. Although high-salt solutions are used to clean the chips or tubes, high salt solutions require a large amount of high salt due to dilution and by reducing the physical binding of the regenerating reagent to the chip, nucleic acid or tubing to reduce the residual amount of regenerating reagents. The solution, and only to minimize the residual of the regenerating reagent in the sequencing reagent, is difficult to completely remove the regenerating reagent, resulting in reduced sequencing data quality and sequencing accuracy.
因此,本领域迫切需要一种新的方法来解决测序试剂中再生试剂残留的问题。Therefore, there is an urgent need in the art for a new method to solve the problem of regenerating reagent residues in sequencing reagents.
发明内容 Summary of the invention
本发明通过使用包含含有二硫键结构的化合物的洗涤溶液解决了上述问题。The present invention solves the above problems by using a washing solution containing a compound having a disulfide bond structure.
在一个方面,本发明提供了消除再生试剂残留的方法,其包括:a.提供其上固定有核酸分子的固体支持物,b.使用再生试剂处理所述固体支持物,所述再生试剂包含还原剂,和c.使用添加了含有二硫键结构的化合物的洗涤溶液处理所述固体支持物。优选地,经过步骤c的处理之后,所述固体支持物上不含或基本上不含再生试剂。优选地,经过步骤c的处理之后,所述固体支持物上不含或基本上不含包含在再生试剂中的还原剂。In one aspect, the invention provides a method of eliminating regenerative reagent residues comprising: a. providing a solid support having nucleic acid molecules immobilized thereon, b. treating the solid support with a regeneration reagent, the regenerating reagent comprising reducing And c. treating the solid support with a washing solution to which a compound having a disulfide bond structure is added. Preferably, the solid support is free or substantially free of regenerating reagent after the treatment of step c. Preferably, after the treatment of step c, the solid support is free or substantially free of reducing agent contained in the regenerating agent.
在一个实施方案中,在步骤b之前,任选地使用含有测序试剂的一种或多种反应溶液处理所述固体支持物,以在固体支持物上产生代表所述核酸分子的核苷酸序列的信号,并且其中所述再生试剂可以消除固体支持物上的所述信号。In one embodiment, prior to step b, the solid support is optionally treated with one or more reaction solutions containing a sequencing reagent to produce a nucleotide sequence representing the nucleic acid molecule on a solid support The signal, and wherein the regeneration reagent can eliminate the signal on the solid support.
在一个实施方案中,经过步骤c的处理之后,任选地还使用含有测序试剂的一种或多种反应溶液处理所述固体支持物。In one embodiment, after the treatment of step c, the solid support is optionally also treated with one or more reaction solutions containing sequencing reagents.
令人惊讶地发现,在使用本发明的添加了含有二硫键结构的化合物的洗涤溶液处理已用再生试剂处理过的固体支持物后,所述固体支持物上不含或基本上不含再生试剂,优选地,所述固体支持物上不含或基本上不含包含在再生试剂中的还原剂。如本文所用,术语“基本上不含再生试剂”意指固体支持物上含有的再生试剂(或优选地,在再生试剂中的还原剂)的量足够小以使得将固体支持物与含有测序试剂的反应溶液接触时,相对于未添加含有二硫键结构的化合物的洗涤溶液,不会造成再生试剂(或优选地,在再生试剂中的还原剂)对反应溶液的显著污染。例如,在使用本发明的添加了含有二硫键结构的化合物的洗涤溶液处理已用再生试剂处理过的固体支持物后,随后将这样的固体支持物使用含有测序反应试剂的反应溶液进行处理时,反应溶液中的再生试剂(或优选地,在再生试剂中的还原剂)的浓度不多于约10%,不多于约5%,不多于约2%,不多于约1%,不多于约0.5%,不多于约0.2%,不多于约0.1%,不多于约0.05%,不多于约0.02%,不多于约0.001%,或甚至更低。Surprisingly, it has been found that after treating a solid support which has been treated with a regenerating agent using a washing solution of the present invention having a compound containing a disulfide bond structure, the solid support does not contain or substantially does not contain regeneration. Reagent, preferably, the solid support is free or substantially free of reducing agent contained in the regenerating agent. As used herein, the term "substantially free of regenerating agent" means that the amount of regenerating agent (or preferably, the reducing agent in the regenerating agent) contained on the solid support is sufficiently small that the solid support and the sequencing reagent are included. When the reaction solution is contacted, it does not cause significant contamination of the reaction solution with respect to the washing solution to which the compound containing the disulfide bond structure is not added, with respect to the regenerating agent (or preferably, the reducing agent in the regenerating agent). For example, after treating a solid support which has been treated with a regenerating agent using a washing solution to which a compound having a disulfide bond structure is added, the solid support is subsequently treated with a reaction solution containing a sequencing reaction reagent. The concentration of the regenerating agent (or preferably, the reducing agent in the regenerating agent) in the reaction solution is not more than about 10%, not more than about 5%, not more than about 2%, not more than about 1%, Not more than about 0.5%, no more than about 0.2%, no more than about 0.1%, no more than about 0.05%, no more than about 0.02%, no more than about 0.001%, or even lower.
如本文所用,“处理固体支持物”包括使固体支持物与试剂或溶液接触的任何方式,包括但不限于使试剂或溶液流动通过固体支持物或将固体支持物浸泡在试剂或溶液中。使试剂或溶液流动通过固体支持物的方式包括例如使用与固体支持物流体连通的入口流动通道将试剂或溶液输送至固体支持物和使用与固体支持物流体连通的出口流动通道将试剂或溶液从固体支持物排出(国际申请公开WO 91/06678中描述了这 样的一种处理方式)。将固体支持物浸泡在试剂或溶液中包括使得固体支持物部分或全部浸没在试剂或溶液中的任何方式。固体支持物可以例如以垂直、倾斜或水平的方式部分或全部浸泡在试剂或溶液中,优选地,固体支持物上固定的所有核酸分子与反应容器中的试剂接触。As used herein, "treating a solid support" includes any means of contacting a solid support with a reagent or solution, including but not limited to flowing a reagent or solution through a solid support or soaking a solid support in an agent or solution. The means for flowing the reagent or solution through the solid support includes, for example, transporting the reagent or solution to the solid support using an inlet flow channel in fluid communication with the solid support and using the outlet flow channel in fluid communication with the solid support to remove the reagent or solution from the reagent or solution Solid support discharge (described in International Application Publication WO 91/06678) a kind of treatment). Soaking the solid support in the reagent or solution includes any means of immersing the solid support partially or completely in the reagent or solution. The solid support may be partially or fully immersed in the reagent or solution, for example, in a vertical, oblique or horizontal manner, preferably all nucleic acid molecules immobilized on the solid support are contacted with reagents in the reaction vessel.
如本文所用,术语“核酸”可以与“核酸分子”、“多核苷酸”互换使用,其可以是任何类型的核酸,例如核酸可以是脱氧核糖核酸(DNA)、核糖核酸(RNA)或由核苷酸类似物制成的DNA或RNA的类似物。核酸可以是单链的、双链的或含有单链和双链序列两者。核酸分子可以来源于双链DNA(dsDNA)形式(例如,基因组DNA、PCR和扩增产物等),或者可以来源于如DNA(ssDNA)或RNA的单链形式并且其可以转化为dsDNA形式,并且反之亦然。在一些实施方案中,被测序的核酸可以是单分子的形式(可以是天然分子、修饰分子如标记的分子,或包括核苷酸类似物的核酸)、序列的多联体(concatamer),等等),可以扩增(例如扩增为多联体、扩增为多个具有相同或相似序列的个体分子,等等),和/或可以为任何其它形式。核酸分子的准确序列可以是已知的或未知的。以下是核酸的示例性实例:基因或基因片段(例如,探针、引物、EST或SAGE标签)、基因组DNA、基因组DNA片段、外显子、内含子、信使RNA(mRNA)、转运RNA、核糖体RNA、核糖酶、cDNA、核酸文库、重组多核苷酸、合成多核苷酸、分枝多核苷酸、质粒、载体、任何序列的分离的DNA、任何序列的分离的RNA、任何上述序列的核酸探针、引物或扩增拷贝。As used herein, the term "nucleic acid" may be used interchangeably with "nucleic acid molecule", "polynucleotide", which may be any type of nucleic acid, eg, the nucleic acid may be deoxyribonucleic acid (DNA), ribonucleic acid (RNA) or An analog of DNA or RNA made from a nucleotide analog. Nucleic acids can be single stranded, double stranded or contain both single stranded and double stranded sequences. The nucleic acid molecule may be derived from a double-stranded DNA (dsDNA) form (eg, genomic DNA, PCR, and amplification products, etc.), or may be derived from a single-stranded form such as DNA (ssDNA) or RNA and which may be converted to the dsDNA form, and vice versa. In some embodiments, the sequenced nucleic acid can be in the form of a single molecule (which can be a natural molecule, a modified molecule such as a labeled molecule, or a nucleic acid comprising a nucleotide analog), a concatamer of a sequence, etc. Alternatively, it can be amplified (eg, amplified as a concatamer, amplified into multiple individual molecules having the same or similar sequences, etc.), and/or can be in any other form. The exact sequence of the nucleic acid molecule can be known or unknown. The following are illustrative examples of nucleic acids: genes or gene fragments (eg, probes, primers, EST or SAGE tags), genomic DNA, genomic DNA fragments, exons, introns, messenger RNA (mRNA), transfer RNA, Ribosome RNA, ribozyme, cDNA, nucleic acid library, recombinant polynucleotide, synthetic polynucleotide, branched polynucleotide, plasmid, vector, isolated DNA of any sequence, isolated RNA of any sequence, any of the above sequences Nucleic acid probes, primers or amplified copies.
核酸可以包括核苷酸或核苷酸类似物。核苷酸通常含有糖、核碱基和至少一个磷酸基。核苷酸可以是无碱基的(即,缺少核碱基)。核苷酸包括脱氧核糖核苷酸、修饰的脱氧核糖核苷酸、核糖核苷酸、修饰的核糖核苷酸、肽核苷酸、修饰的肽核苷酸、修饰磷酸盐糖主链核苷酸及其混合物。核苷酸的实例包括(例如)腺苷一磷酸(AMP)、腺苷二磷酸(ADP)、腺苷三磷酸(ATP)、胸苷一磷酸(TMP)、胸苷二磷酸(TDP)、胸苷三磷酸(TTP)、胞苷酸(CMP)、胞苷二磷酸(CDP)、胞苷三磷酸(CTP)、鸟苷一磷酸(GMP)、鸟苷二磷酸(GDP)、鸟苷三磷酸(GTP)、尿苷一磷酸(UMP)、尿苷二磷酸(UDP)、尿苷三磷酸(UTP)、脱氧腺苷酸(dAMP)、脱氧腺苷二磷酸(dADP)、脱氧腺苷三磷酸(dATP)、脱氧胸腺嘧啶核苷一磷酸(dTMP)、脱氧胸腺嘧啶核苷二磷酸(dTDP)、脱氧胸苷三磷酸(dTTP)、去氧胞二磷(dCDP)、脱氧胞苷三磷酸(dCTP)、脱氧鸟苷一磷酸(dGMP)、脱氧鸟苷二磷酸(dGDP)、脱氧鸟苷三磷酸(dGTP)、脱氧尿苷一磷酸(dUMP)、 脱氧尿苷二磷酸(dUDP)和脱氧尿苷三磷酸(dUTP)。还可以在本文所述的方法中使用包含修饰的核碱基的核苷酸类似物。无论是具有天然主链还是类似结构,可以包含在多核苷酸中的示例性修饰的核碱基包括(例如)肌苷、黄嘌呤、次黄嘌呤、异胞嘧啶、异鸟嘌呤、2-氨基嘌呤、5-甲基胞嘧啶、5-羟甲基胞嘧啶、2-氨基腺嘌呤、6-甲基腺嘌呤、6-甲基鸟嘌呤、2-丙基鸟嘌呤、2-丙基腺嘌呤、2-硫脲嘧啶、2-硫胸腺嘧啶、2-硫胞嘧啶、15-卤代脲嘧啶、15-卤代胞嘧啶、5-丙炔基尿嘧啶、5-丙炔基胞嘧啶、6-偶氮尿嘧啶、6-偶氮胞嘧啶、6-偶氮胸腺嘧啶、5-尿嘧啶、4-硫尿嘧啶、8-卤代腺嘌呤或鸟嘌呤、8-氨基腺嘌呤或鸟嘌呤、8-硫腺嘌呤或鸟嘌呤、8-硫烷基腺嘌呤或鸟嘌呤、8-羟基腺嘌呤或鸟嘌呤、5-卤素取代的尿嘧啶或胞嘧啶、7-甲基鸟嘌呤、7-甲基腺嘌呤、8-氮杂鸟嘌呤、8-氮杂腺嘌呤、7-去氮鸟嘌呤、7-去氮腺嘌呤、3-去氮鸟嘌呤、3-去氮腺嘌呤等。如本领域中已知的,某些核苷酸类似物不能引入多核苷酸,例如,核苷酸类似物,如腺苷5′-磷酰硫酸。Nucleic acids can include nucleotides or nucleotide analogs. Nucleotides typically contain a sugar, a nucleobase, and at least one phosphate group. Nucleotides can be abasic (ie, lacking nucleobases). Nucleotides include deoxyribonucleotides, modified deoxyribonucleotides, ribonucleotides, modified ribonucleotides, peptide nucleotides, modified peptide nucleotides, modified phosphate sugar backbone nucleosides Acids and mixtures thereof. Examples of nucleotides include, for example, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), thymidine monophosphate (TMP), thymidine diphosphate (TDP), and thorax. Triphosphate triphosphate (TTP), cytidine (CMP), cytidine diphosphate (CDP), cytidine triphosphate (CTP), guanosine monophosphate (GMP), guanosine diphosphate (GDP), guanosine triphosphate (GTP), uridine monophosphate (UMP), uridine diphosphate (UDP), uridine triphosphate (UTP), deoxyadenosine (dAMP), deoxyadenosine diphosphate (dADP), deoxyadenosine triphosphate (dATP), deoxythymidine monophosphate (dTMP), deoxythymidine diphosphate (dTDP), deoxythymidine triphosphate (dTTP), deoxycysteines (dCDP), deoxycytidine triphosphate ( dCTP), deoxyguanosine monophosphate (dGMP), deoxyguanosine diphosphate (dGDP), deoxyguanosine triphosphate (dGTP), deoxyuridine monophosphate (dUMP), Deoxyuridine diphosphate (dUDP) and deoxyuridine triphosphate (dUTP). Nucleotide analogs comprising modified nucleobases can also be used in the methods described herein. Exemplary modified nucleobases that may be included in a polynucleotide, whether having a native backbone or a similar structure, include, for example, inosine, xanthine, hypoxanthine, isocytosine, isoguanine, 2-amino Indole, 5-methylcytosine, 5-hydroxymethylcytosine, 2-aminoadenine, 6-methyladenine, 6-methylguanine, 2-propylguanine, 2-propyladenine , 2-thiouracil, 2-thiothymidine, 2-thiocytosine, 15-halouracil, 15-halocytosine, 5-propynyl uracil, 5-propynyl cytosine, 6 - azouracil, 6-azocytosine, 6-azothymine, 5-uracil, 4-thiouracil, 8-haloadenine or guanine, 8-aminoadenine or guanine, 8-thioadenine or guanine, 8-sulfanyl adenine or guanine, 8-hydroxyadenine or guanine, 5-halo substituted uracil or cytosine, 7-methylguanine, 7-A Adenine, 8-azaguanine, 8-azadenine, 7-azepine guanine, 7-deaza adenine, 3-azepine guanine, 3-deaza adenine, and the like. Certain nucleotide analogs are not capable of introducing a polynucleotide, such as a nucleotide analog, such as adenosine 5'-phosphorylsulfate, as is known in the art.
在本发明的具体实施方案中被测序的核酸分子可以具有任意长度。一般说来,有用的核酸的示例性长度包括(例如)至少约5、10、20、30、40、50、60、70、80、90、100、150、200、300、400、500、1,000、5,000或10,000、100,000个核苷酸或更长。作为另外一种选择或者另外,所述长度可以不长于1,000,000、100,000、10,000、1,000、100个核苷酸或更少。核酸分子的长度还可以包括以上示例性数目之间的所有整数。因此,被测序的核酸可以(例如)在短多核苷酸、片段、cDNA、基因和基因组片段的范围内。Nucleic acid molecules that are sequenced in particular embodiments of the invention can be of any length. In general, exemplary lengths of useful nucleic acids include, for example, at least about 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 300, 400, 500, 1,000. , 5,000 or 10,000, 100,000 nucleotides or longer. Alternatively or additionally, the length may be no longer than 1,000,000, 100,000, 10,000, 1,000, 100 nucleotides or less. The length of the nucleic acid molecule can also include all integers between the above exemplary numbers. Thus, the sequenced nucleic acid can be, for example, within the scope of short polynucleotides, fragments, cDNA, genes, and genomic fragments.
在本发明的具体实施方案中被测序的核酸分子可以为任意数量,例如可以是2、3、4、5、6、7、8、9、10、20、30、40、50、60、70、80、90或100个或者更多个相同或不同的核酸分子。被测序的核酸分子的数量还可以是例如200、300、400、500、1000、5000、10000、50000、1x105、2x105、3x105、4x105、5x105、6x105、7x105、8x105、9x105、1x106、2x106、3x106、4x106、5x106、6x106、7x106、8x106、9x106或1x107个或更多个相同或不同的核酸分子。被测序的核酸分子的数量还可以包括以上示例性数目之间的所有整数。Nucleic acid molecules that are sequenced in particular embodiments of the invention may be in any number, for example, may be 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70 , 80, 90 or 100 or more identical or different nucleic acid molecules. Is the number of sequencing a nucleic acid molecule may also be, for example 200,300,400,500,1000,5000,10000,50000,1x10 5, 2x10 5, 3x10 5 , 4x10 5, 5x10 5, 6x10 5, 7x10 5, 8x10 5 , 9x10 5 , 1x10 6 , 2x10 6 , 3x10 6 , 4x10 6 , 5x10 6 , 6x10 6 , 7x10 6 , 8x10 6 , 9x10 6 or 1x10 7 or more identical or different nucleic acid molecules. The number of nucleic acid molecules that are sequenced may also include all integers between the above exemplary numbers.
核酸可以从任何来源获得。例如,核酸可以由从一种生物获得的核酸分子制备,或者由从包括一种或多种生物的天然来源获得的核酸分子群制备。核酸分子的来源包括但不限于细胞器、细胞、组织、器官或生物体。可以用作核酸分子来源的细胞可以 是原核的(例如细菌);或真核的,如真菌(例如酵母)、植物、原生动物和其它寄生虫,和动物(包括昆虫、线虫),和哺乳动物(例如,大鼠、小鼠、猴、非人类灵长类动物和人类));或者核酸分子可以来源于病毒。Nucleic acids can be obtained from any source. For example, a nucleic acid can be prepared from a nucleic acid molecule obtained from an organism, or from a population of nucleic acid molecules obtained from a natural source comprising one or more organisms. Sources of nucleic acid molecules include, but are not limited to, organelles, cells, tissues, organs, or organisms. Cells that can be used as a source of nucleic acid molecules can Prokaryotic (eg bacteria); or eukaryotic, such as fungi (eg yeast), plants, protozoa and other parasites, and animals (including insects, nematodes), and mammals (eg, rats, mice, Monkeys, non-human primates, and humans); or nucleic acid molecules can be derived from viruses.
在一些实施方案中,核酸可以从特定生物来源获得。在一个优选的实施方案中,核酸是从人获得的人核酸,例如,人组织的样品。在另一个优选的实施方案中,核酸是人线粒体核酸。在另一个优选的实施方案中,核酸可以从宏基因组样品获得。在其它实施方案中,核酸可以从不再包含活生物的环境来源获得。In some embodiments, the nucleic acid can be obtained from a particular biological source. In a preferred embodiment, the nucleic acid is a human nucleic acid obtained from a human, for example, a sample of human tissue. In another preferred embodiment, the nucleic acid is a human mitochondrial nucleic acid. In another preferred embodiment, the nucleic acid can be obtained from a metagenomic sample. In other embodiments, the nucleic acid can be obtained from an environmental source that no longer contains living organisms.
如本文所用的,术语“固定”当提及核酸使用时,意指经由共价键或非共价键直接或间接附接至固体支持物。在本公开内容的某些实施方案中,可以使用共价附接,但是通常仅需要的是在期望使用固体支持物的条件下(例如在需要核酸扩增和/或测序的应用中)核酸保持固定或附接至固体支持物。通常,待用作捕获引物或扩增引物的寡核苷酸被固定,使得3′末端对于酶促延伸是可利用的并且该引物序列的至少一部分能够杂交至互补核酸序列。固定可以经由杂交至表面附接的寡核苷酸发生,在这种情况下固定的寡核苷酸或多核苷酸可以为3′-5′方向。另一种非共价附接的方式可以是通过氨基化修饰将核酸结合蛋白质结合在固体支持物上,并通过核酸结合蛋白质捕获核酸分子。可选地,固定可以通过除碱基配对杂交之外的其他方式发生,例如上文描述的共价附接。核酸与固体支持物附接方式的非限制性示例包括核酸杂交、生物素链霉亲和素结合、巯基结合、光活化结合、共价结合、抗体-抗原、经由水凝胶或其他多孔聚合物的物理限制等。用于将核酸固定在固体支持物上的各种示例性方法可参见例如G.Steinberg-Tatman等人,Bioconjugate Chemistry 2006,17,841-848;Xu x.等人Journal of the American Chemical Society 128(2006)9286-9287;美国专利申请US 5639603、US 5641658、US2010248991;国际专利申请WO 2001062982、WO 2001012862、WO 2007111937、WO0006770,为了所有目的,特别是为了与制备形成其上固定有核酸的固体支持物有关的全部教导,以上文献均通过引用全文并入本文。As used herein, the term "immobilized" when used in reference to a nucleic acid, means attached directly or indirectly to a solid support via a covalent bond or a non-covalent bond. In certain embodiments of the present disclosure, covalent attachment can be used, but typically only required is nucleic acid retention under conditions where it is desired to use a solid support (eg, in applications requiring nucleic acid amplification and/or sequencing) Fixed or attached to a solid support. Typically, the oligonucleotide to be used as a capture primer or amplification primer is immobilized such that the 3' end is available for enzymatic extension and at least a portion of the primer sequence is capable of hybridizing to the complementary nucleic acid sequence. Immobilization can occur via hybridization to surface-attached oligonucleotides, in which case the immobilized oligonucleotide or polynucleotide can be in the 3'-5' direction. Another way of non-covalent attachment may be to bind the nucleic acid binding protein to the solid support by amination modification and to capture the nucleic acid molecule by the nucleic acid binding protein. Alternatively, immobilization can occur by other means than base pair hybridization, such as the covalent attachment described above. Non-limiting examples of nucleic acid attachment to solid support include nucleic acid hybridization, biotin streptavidin binding, sulfhydryl binding, photoactivated binding, covalent binding, antibody-antigen, via hydrogel or other porous polymer. Physical limitations, etc. Various exemplary methods for immobilizing nucleic acids on a solid support can be found, for example, in G. Steinberg-Tatman et al, Bioconjugate Chemistry 2006, 17, 841-848; Xu x. et al. Journal of the American Chemical Society 128 ( 2006) 9286-9287; US Patent Application US 5,639, 603, US 5, 641, 658, US 2010 248 991; International Patent Application WO 2001062982, WO 2001012862, WO 2007111937, WO0006770, for all purposes, in particular for the preparation of solid supports on which nucleic acids are immobilized All of the relevant teachings are hereby incorporated by reference in their entirety.
如本文所使用的,术语“固体支持物”意指核酸可以与其附接的任何不可溶的基底或基质,诸如例如,乳胶珠、葡聚糖珠、聚苯乙烯表面、聚丙烯表面、聚丙烯酰胺凝胶、金表面、玻璃表面、芯片和硅晶片。固体支持物的表面可以是任何所需形状,所述形状包括,例如,适合用于特定应用的平面的、球形的或多孔的。例如,固体支持 物可以是平面的玻璃表面。固体支持物可以被安装在流动池的内部,以允许与多个试剂的溶液相互作用。也可以移动固体支持物以使其分别与多个反应容器中的溶液接触。As used herein, the term "solid support" means any insoluble substrate or matrix to which a nucleic acid can be attached, such as, for example, latex beads, dextran beads, polystyrene surfaces, polypropylene surfaces, polypropylene. Amide gel, gold surface, glass surface, chip and silicon wafer. The surface of the solid support can be any desired shape including, for example, planar, spherical or porous suitable for a particular application. For example, solid support The object can be a flat glass surface. A solid support can be mounted inside the flow cell to allow interaction with solutions of multiple reagents. The solid support can also be moved to contact the solution in a plurality of reaction vessels, respectively.
在某些实施方案中,固体支持物可以包括惰性基底或基质,该惰性基底或基质已经例如通过应用中间材料的层或涂层被化学官能化,所述中间材料具有允许共价附接至多核苷酸的反应性基团。中间材料可以经由共价键或非共价键直接或间接附接至固体支持物。作为用于非共价附接至固体支持物的非限制性实例,这类支持物可以包括在惰性基底例如玻璃上的聚丙烯酰胺水凝胶层。在这类实施方案中,多核苷酸可以直接共价附接至中间层(例如,水凝胶),但是中间层本身可以非共价附接至其他基底或基质(例如玻璃基底)层。In certain embodiments, the solid support can comprise an inert substrate or matrix that has been chemically functionalized, for example by applying a layer or coating of an intermediate material that allows covalent attachment to the multicore A reactive group of a glycoside. The intermediate material can be attached directly or indirectly to the solid support via a covalent bond or a non-covalent bond. As a non-limiting example for non-covalent attachment to a solid support, such a support can include a polyacrylamide hydrogel layer on an inert substrate such as glass. In such embodiments, the polynucleotide can be directly covalently attached to an intermediate layer (eg, a hydrogel), but the intermediate layer itself can be non-covalently attached to other substrates or substrates (eg, glass substrates) layers.
如本文所用,术语“信号”包括能够被检测到的任何信号,包括但不限于光信号、电信号、电磁信号、放射信号等。术语“代表核酸分子的核苷酸序列的信号”意指该信号携带核苷酸序列的信息并能够被解码为被测序的核酸分子的核苷酸序列。例如,在使用经标记(例如荧光标记)的测序探针的情况下,由标记产生的信号携带测序探针的核苷酸序列信息,并且通过该信号可获知被测序的核酸分子上与该测序探针互补配对的位置的核苷酸序列。又例如,在使用经标记(例如荧光标记)的核苷酸的情况下,由标记产生的信号携带核苷酸的身份信息,并且通过该信号可获知被测序的核酸分子上与该核苷酸互补配对的位置的核苷酸序列。在优选的实施方案中,所述信号是荧光信号。在优选的实施方案中,所述标记是荧光标记。检测荧光标记或信号的方式是本领域熟知的。例如,可以通过检测荧光的波长的装置来实现。这样的装置是本领域熟知的。例如,这样的装置可以是共焦扫描显微镜,其用激光扫描固体支持物的表面,以便使直接结合被测序的核酸分子上的荧光团成像。另外,可以例如用灵敏的2-D探测器,如电荷偶连的探测器(CCD)观察所产生的每一种信号。还可以例如使用诸如扫描近场光学显微方法(SNOM)的其他技术。As used herein, the term "signal" includes any signal that can be detected, including but not limited to optical signals, electrical signals, electromagnetic signals, radiated signals, and the like. The term "signal representing a nucleotide sequence of a nucleic acid molecule" means that the signal carries information of the nucleotide sequence and can be decoded into the nucleotide sequence of the nucleic acid molecule being sequenced. For example, where a labeled (eg, fluorescently labeled) sequencing probe is used, the signal generated by the label carries the nucleotide sequence information of the sequencing probe, and the signal is used to know the sequenced nucleic acid molecule and the sequencing The nucleotide sequence of the position where the probe is complementary to the pair. In another example, where a labeled (eg, fluorescently labeled) nucleotide is used, the signal generated by the label carries the identity information of the nucleotide, and the signal is used to know the nucleotide molecule being sequenced and the nucleotide The nucleotide sequence of the position of the complementary pair. In a preferred embodiment, the signal is a fluorescent signal. In a preferred embodiment, the label is a fluorescent label. The manner in which fluorescent labels or signals are detected is well known in the art. For example, it can be realized by a device that detects the wavelength of fluorescence. Such devices are well known in the art. For example, such a device can be a confocal scanning microscope that scans the surface of the solid support with a laser to image the fluorophore directly on the nucleic acid molecule being sequenced. Alternatively, each of the generated signals can be observed, for example, with a sensitive 2-D detector, such as a charge coupled detector (CCD). Other techniques such as scanning near-field optical microscopy (SNOM) can also be used, for example.
如本文所用,术语“测序试剂”是指适合用于对固体支持物上的核酸分子进行测序的任何测序方法中所使用的试剂。例如,这样的测序方法可以包括但不限于电泳测序、合成法测序(包括联合探针锚定聚合测序)、连接法测序、杂交测序、单分子测序和实时测序方法。一般而言,将测序试剂与固体支持物上的核酸分子进行接触后,在固体支持物上会产生代表所述核酸分子的核苷酸序列的信号(例如,荧光信号)。在本 发明的方法中所使用的具体的测序试剂以及含有测序试剂的一种或多种反应溶液将取决于所采用的测序方法。根据测序方法确定在本发明的方法中所使用的具体的测序试剂以及含有测序试剂的一种或多种反应溶液在本领域技术人员的能力范围内。As used herein, the term "sequencing reagent" refers to an agent that is suitable for use in any sequencing method for sequencing nucleic acid molecules on a solid support. For example, such sequencing methods can include, but are not limited to, electrophoretic sequencing, synthetic sequencing (including combinatorial probe anchor polymerization sequencing), ligation sequencing, hybrid sequencing, single molecule sequencing, and real-time sequencing methods. In general, upon contact of a sequencing reagent with a nucleic acid molecule on a solid support, a signal (eg, a fluorescent signal) representative of the nucleotide sequence of the nucleic acid molecule is produced on the solid support. In this The particular sequencing reagent used in the method of the invention and the one or more reaction solutions containing the sequencing reagent will depend on the sequencing method employed. It is within the ability of those skilled in the art to determine the particular sequencing reagents used in the methods of the invention, as well as one or more reaction solutions containing sequencing reagents, according to sequencing methods.
在一个实施方案中,测序试剂包括用于连接法测序的测序试剂。这样的试剂可在被测序的核酸分子上产生连接产物并生成代表所述核酸分子的核苷酸序列的信号。In one embodiment, the sequencing reagent comprises a sequencing reagent for ligation sequencing. Such reagents can produce a ligation product on the sequenced nucleic acid molecule and generate a signal representative of the nucleotide sequence of the nucleic acid molecule.
如本文所用的连接法测序是本领域熟知的各种连接法测序方法。基本地,连接法测序涉及经标记(例如荧光标记)的测序探针和锚定探针(在SOLiD测序中也被称为“引物”)与DNA链的杂交和连接。测序探针包含一或两个固定已知的测序碱基(单碱基测序探针或双碱基测序探针)和一系列简并或通用碱基,其使得测序探针与核酸模板之间进行互补配对。锚定探针序列与核酸模板上的衔接子序列(衔接子序列意指核酸模板中的序列已知的寡核苷酸)互补并提供起始连接的位点的已知序列。连接之后,对模板成像并鉴定测序探针中的所述一或两个已知碱基。在锚定探针-测序探针复合物的完全移除或者经过切割除去标记(例如荧光团)之后并再生连接位点之后,开始下一个连接测序循环。Ligation sequencing as used herein is a variety of ligation sequencing methods well known in the art. Basically, ligation sequencing involves the hybridization and ligation of labeled (eg, fluorescently labeled) sequencing probes and anchor probes (also referred to as "primers" in SOLiD sequencing) to DNA strands. The sequencing probe contains one or two fixed known sequencing bases (single base sequencing probe or double base sequencing probe) and a series of degenerate or universal bases that allow for sequencing probes and nucleic acid templates Perform complementary pairing. The anchor probe sequence is complementary to an adaptor sequence on the nucleic acid template (an adaptor sequence means an oligonucleotide of known sequence in the nucleic acid template) and provides a known sequence of sites for initiation of ligation. After ligation, the template is imaged and the one or two known bases in the sequencing probe are identified. After the complete removal of the anchor probe-sequencing probe complex or after cleavage removal of the label (eg, fluorophore) and regeneration of the junction site, the next ligation sequencing cycle begins.
因此,在使用用于连接法测序的测序试剂的一个实施方案中,测序试剂包括锚定探针、经标记(例如荧光标记)的测序探针、连接酶或其混合物,并且含有测序试剂的一种或多种反应溶液包括包含锚定探针、经标记(例如荧光标记)的测序探针、连接酶或其混合物的溶液,前提是固体支持物与锚定探针、经标记的测序探针、连接酶中的每一种均发生接触,并且其中固体支持物上的所述信号由互补结合至固体支持物上的核酸分子的经标记的测序探针产生,所述经标记的测序探针经由连接酶连接至互补结合至同一核酸分子的锚定探针。或者,测序试剂可不包含锚定探针,并且在使用含有测序试剂的一种或多种反应溶液处理固体支持物之前,将固体支持物与锚定探针(在SOLiD测序中也被称为“引物”)接触,以使得锚定探针杂交至所述固体支持物上的核酸分子。Thus, in one embodiment using a sequencing reagent for ligation sequencing, the sequencing reagent comprises an anchor probe, a labeled (eg, fluorescently labeled) sequencing probe, a ligase, or a mixture thereof, and one comprising a sequencing reagent The one or more reaction solutions include a solution comprising an anchor probe, a labeled (eg, fluorescently labeled) sequencing probe, a ligase, or a mixture thereof, provided that the solid support and anchor probe, labeled sequencing probe Each of the ligases is contacted, and wherein the signal on the solid support is produced by a labeled sequencing probe that is complementary to a nucleic acid molecule on a solid support, the labeled sequencing probe An anchor probe that is complementary to the same nucleic acid molecule is ligated via a ligase. Alternatively, the sequencing reagent may not comprise an anchor probe and the solid support and the anchor probe (also referred to as "in SOLiD sequencing") prior to treatment of the solid support with one or more reaction solutions containing the sequencing reagent The primer ") contacts such that the anchor probe hybridizes to the nucleic acid molecule on the solid support.
所述含有测序试剂的一种或多种反应溶液的具体组成取决于所采用的具体连接法测序方法而不同。例如,在采用单一锚定探针的情况下,含有测序试剂的一种或多种反应溶液可以包括仅一种反应溶液,该反应溶液包含连接酶和经标记的测序探针,任选地,还包含锚定探针;或者含有测序试剂的一种或多种反应溶液也可以包括两种反应溶液,其中一种反应溶液包含锚定探针,另一种反应溶液包含经标记的测序探针, 并且这两种反应溶液至少之一包含连接酶,在这样的情况下,对核酸分子的测序包括将固体支持物浸泡在包含锚定探针的反应溶液中、洗涤,随后浸泡在包含经标记的测序探针的反应溶液中、洗涤。或者,含有测序试剂的一种或多种反应溶液还可以包括三种反应溶液,其分别包含锚定探针、经标记的测序探针和连接酶,在这样的情况下,对核酸分子的测序包括如下过程:将固体支持物浸泡在包含锚定探针的反应溶液中、洗涤,随后浸泡在包含经标记的测序探针的反应溶液中、洗涤,随后浸泡在包含连接酶的反应溶液中、洗涤。类似地,在采用双重锚定探针的情况下,含有测序试剂的一种或多种反应溶液可以包括同时包含连接酶、第一锚定探针、第二锚定探针和经标记的测序探针的仅一种反应溶液,或分别包含连接酶、第一锚定探针、第二锚定探针、经标记的测序探针或其混合物的多种反应溶液。The specific composition of the one or more reaction solutions containing the sequencing reagent will vary depending on the particular ligation sequencing method employed. For example, where a single anchor probe is employed, the one or more reaction solutions containing the sequencing reagent can include only one reaction solution comprising a ligase and a labeled sequencing probe, optionally, Also included is an anchor probe; or one or more reaction solutions containing a sequencing reagent may also include two reaction solutions, one of which contains an anchor probe and the other of which contains a labeled sequencing probe , And at least one of the two reaction solutions comprises a ligase, in which case sequencing of the nucleic acid molecule comprises immersing the solid support in a reaction solution comprising an anchor probe, washing, and then immersing in the inclusion of the labeled The reaction solution of the sequencing probe is washed and washed. Alternatively, the one or more reaction solutions containing the sequencing reagent may further comprise three reaction solutions comprising an anchor probe, a labeled sequencing probe and a ligase, respectively, in which case the nucleic acid molecule is sequenced. The method includes: immersing a solid support in a reaction solution containing an anchor probe, washing, then immersing in a reaction solution containing a labeled sequencing probe, washing, and then immersing in a reaction solution containing a ligase, washing. Similarly, where a dual anchor probe is employed, the one or more reaction solutions containing the sequencing reagent can include both a ligase, a first anchor probe, a second anchor probe, and labeled sequencing. Only one reaction solution of the probe, or a plurality of reaction solutions containing a ligase, a first anchor probe, a second anchor probe, a labeled sequencing probe, or a mixture thereof, respectively.
关于连接法测序,特别是关于锚定探针、测序探针的描述,可参见例如WO2013066975、美国专利No.60/992,485、61/026,337、61/035,914和61/061,134中的记载。关于连接法测序的详细描述还可参见例如Landegren,U.,Kaiser,R.,Sanders,J.& Hood,L.A ligase-mediated gene detection technique.Science 241,1077-1080(1988),美国专利No.6969488、美国专利No.6172218和美国专利No.6306597(其公开内容在此全部引入作为参考)。在优选的实施方案中,连接法测序包括联合探针锚定连接(cPAL)测序(参见WO2013066975的记载)。For the description of ligation sequencing, in particular with regard to anchoring probes, sequencing probes, see, for example, WO2013066975, U.S. Patent Nos. 60/992,485, 61/026,337, 61/035,914, and 61/061,134. For a detailed description of ligation sequencing, see, for example, Landegren, U., Kaiser, R., Sanders, J. & Hood, LA ligase-mediated gene detection technique. Science 241, 1077-1080 (1988), U.S. Patent No. U.S. Patent No. 6, 721, 218, and U.S. Patent No. 6,306, 597, the disclosures of In a preferred embodiment, ligation sequencing includes combined probe anchor ligation (cPAL) sequencing (see document WO2013066975).
如本文所用,术语“连接酶”是指催化核酸链的5′-磷酸和3′-羟基末端之间磷酸二酯键的分子内形成和分子间形成的核酸修饰酶。连接酶可以获自重组或天然来源。可以使用一种或多种低温(例如,室温或更低)连接酶(例如,T3 DNA连接酶、T4 DNA连接酶、T7 DNA连接酶和/或大肠杆菌DNA连接酶)。连接酶也可以是热稳定的连接酶。可以使用来自嗜热生物的热稳定连接酶。热稳定DNA连接酶的例子包括但不限于:Tth DNA连接酶(来自嗜热栖热菌(Thermus thermophilus),可购自例如欧基公司(Eurogentec)和GeneCraft公司);Pfu DNA连接酶(来自激烈火球菌(Pyrococcusfuriosus)的超嗜热连接酶);Taq连接酶(来自水生栖热菌(Thermus aquaticus)),以及任何其他合适的热稳定连接酶,或其任意组合。As used herein, the term "ligase" refers to a nucleic acid modifying enzyme that catalyzes the intramolecular formation and intermolecular formation of a phosphodiester bond between the 5'-phosphate and 3'-hydroxyl ends of a nucleic acid strand. Ligase can be obtained from recombinant or natural sources. One or more low temperature (eg, room temperature or lower) ligases (eg, T3 DNA ligase, T4 DNA ligase, T7 DNA ligase, and/or E. coli DNA ligase) can be used. The ligase may also be a thermostable ligase. A thermostable ligase from a thermophilic organism can be used. Examples of thermostable DNA ligases include, but are not limited to, Tth DNA ligase (from Thermus thermophilus, available from, for example, Eurogentec and GeneCraft); Pfu DNA ligase (from intense Hyperthermophilic ligase of Pyrococcus furiosus; Taq ligase (from Thermus aquaticus), and any other suitable thermostable ligase, or any combination thereof.
在另一个实施方案中,测序试剂包括用于合成法测序的测序试剂。这样的试剂可以以被测序的核酸分子为模板进行聚合反应并生成代表所述核酸分子的核苷酸序列的信号的试剂, In another embodiment, the sequencing reagent comprises a sequencing reagent for sequencing by synthesis. Such an agent may be a reagent that performs polymerization by using the sequenced nucleic acid molecule as a template and generates a signal representing a nucleotide sequence of the nucleic acid molecule,
如本文所用的合成法测序是本领域熟知的各种合成法测序方法。基本地,合成法测序涉及首先将被测序的核酸分子与测序引物杂交,随后在聚合酶的存在下,以被测序的核酸分子为模板在测序引物的3’端聚合经标记(例如荧光标记)的核苷酸。聚合之后,通过检测所述标记来鉴定该经标记的核苷酸。在从经标记的核苷酸上除去标记(例如荧光团)之后,开始下一个聚合测序循环。在一个实施方案中,被测序的核酸分子通过与固定在固体支持物上的测序引物杂交而固定在固体支持物上。在另一个实施方案中,在使用含有测序试剂的一种或多种反应溶液处理其上固定有核酸分子的固体支持物之前,将所述固体支持物与测序引物接触,以使得测序引物杂交至所述固体支持物上的核酸分子。在优选的实施方案中,利用BGISEQ-500测序平台对固体支持物上的核酸分子进行测序。Synthetic sequencing as used herein is a variety of synthetic sequencing methods well known in the art. Basically, synthetic sequencing involves first hybridizing the sequenced nucleic acid molecule to a sequencing primer, followed by polymerization of the labeled nucleic acid molecule as a template at the 3' end of the sequencing primer (eg, fluorescent labeling) in the presence of a polymerase. Nucleotide. After polymerization, the labeled nucleotide is identified by detecting the label. After removal of the label (eg, fluorophore) from the labeled nucleotide, the next polymerization sequencing cycle begins. In one embodiment, the sequenced nucleic acid molecule is immobilized on a solid support by hybridization to a sequencing primer immobilized on a solid support. In another embodiment, the solid support is contacted with a sequencing primer prior to treatment of the solid support on which the nucleic acid molecule is immobilized using one or more reaction solutions containing the sequencing reagent, such that the sequencing primer hybridizes to A nucleic acid molecule on the solid support. In a preferred embodiment, the nucleic acid molecules on the solid support are sequenced using the BGISEQ-500 sequencing platform.
因此,在使用用于合成法测序的测序试剂的一个实施方案中,测序试剂包括聚合酶、经标记(例如荧光标记)的核苷酸或其混合物,并且含有测序试剂的一种或多种反应溶液包括包含聚合酶、经标记(例如荧光标记)的核苷酸或其混合物的溶液,前提是固体支持物与聚合酶、经标记的核苷酸中的每一种均发生接触,并且其中固体支持物上的所述信号由互补结合至固体支持物上的核酸分子的经标记的核苷酸产生,所述经标记的核苷酸经由聚合酶以固体支持物上的核酸分子为模板聚合至测序引物的3’端。在这样的实施方案中,再生试剂包括用于从经标记(例如荧光标记)的核苷酸除去标记的试剂。在使用用于合成法测序的测序试剂的优选实施方案中,在步骤b之前,任选地使用包含聚合酶和经标记(例如荧光标记)的核苷酸的反应溶液处理所述固体支持物。在优选的实施方案中,测序试剂包括用于BGISEQ-500测序平台的测序试剂盒中的试剂,例如BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30)中的测序试剂。Thus, in one embodiment using a sequencing reagent for sequencing of a synthesis, the sequencing reagent comprises a polymerase, a labeled (eg, fluorescently labeled) nucleotide, or a mixture thereof, and one or more reactions comprising a sequencing reagent The solution includes a solution comprising a polymerase, a labeled (eg, fluorescently labeled) nucleotide, or a mixture thereof, provided that the solid support contacts each of the polymerase, the labeled nucleotide, and wherein the solid The signal on the support is produced by a labeled nucleotide that is complementary bound to a nucleic acid molecule on a solid support, the labeled nucleotide being polymerized via a polymerase with a nucleic acid molecule on a solid support as a template to The 3' end of the primer was sequenced. In such embodiments, the regeneration reagent comprises an agent for removing the label from the labeled (eg, fluorescently labeled) nucleotide. In a preferred embodiment using a sequencing reagent for sequencing of the synthesis, prior to step b, the solid support is optionally treated with a reaction solution comprising a polymerase and a labeled (e.g., fluorescently labeled) nucleotide. In a preferred embodiment, the sequencing reagent comprises a reagent in a sequencing kit for the BGISEQ-500 sequencing platform, such as the BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizhi Technology Co., Ltd., article number Sequencing reagent in PF-UM-PEV30).
适合用于合成法测序的经标记(例如荧光标记)的核苷酸以及用于从经标记(例如荧光标记)的核苷酸除去标记的试剂是本领域熟知的,这样的核苷酸和试剂的非限制性实例可参见例如WO04018497、WO04018493、美国专利No.7427673和美国专利No.7057026(其公开内容在此全部引入作为参考)中公开的经标记的核苷酸和用于从经标记的核苷酸除去标记的试剂。Labeled (e.g., fluorescently labeled) nucleotides suitable for use in synthetic sequencing and reagents for removing labels from labeled (e.g., fluorescently labeled) nucleotides are well known in the art, such nucleotides and reagents Non-limiting examples can be found in, for example, the labeled nucleotides disclosed in WO04018497, WO04018493, U.S. Patent No. 7,427, 673, and U.S. Pat. Nucleotide removal of labeled reagents.
在某些实施方案中,经标记的核苷酸还可包含3’阻断基团。所述3’阻断基团在该经标记的核苷酸聚合到生长的核苷酸链上时阻止其他核苷酸的掺入。优选地,3’阻断 基团与标记一起除去。合适的3’阻断基团以及用于从核苷酸除去3’阻断基团的试剂是本领域熟知的,这样的3’阻断基团和试剂的非限制性实例可参见例如:Greene &Wuts,Protective Groups in Organic Synthesis,John Wiley & Sons。Guillier;Metzker等人(Nucleic Acids Research,22(20):4259-4267,1994);WO91/06678、WO2002/029003、WO2014139596、WO2004/018497,上述参考文献均通过引用整体并入本文。In certain embodiments, the labeled nucleotide may further comprise a 3' blocking group. The 3' blocking group prevents the incorporation of other nucleotides when the labeled nucleotide is polymerized onto the growing nucleotide strand. Preferably, 3' blocking The group is removed along with the label. Suitable 3' blocking groups and reagents for removing 3' blocking groups from nucleotides are well known in the art, and non-limiting examples of such 3' blocking groups and reagents can be found, for example, in Greene. & Wuts, Protective Groups in Organic Synthesis, John Wiley & Sons. Guillier; Metzker et al. (Nucleic Acids Research, 22(20): 4259-4267, 1994); WO 91/06678, WO 2002/029003, WO 2014139596, WO 2004/018497, each of which is incorporated herein in its entirety by reference.
在优选的实施方案中,经标记的核苷酸上的标记本身可用作3’阻断基团。不受理论约束,例如这样的标记可以具有足以发挥阻断其他核苷酸掺入到多核苷酸链上的大小或结构。所述阻断可能是由于空间位阻造成的,或者可能是由于大小、电荷和结构的组合造成的。在某些实施方案中,经标记的核苷酸上的标记和阻断基团可以是不同的,但优选可采用相同的方式从核苷酸上除去所述标记和阻断基团。在这样的情况下,本文所述的再生试剂包含可从经标记的核苷酸上同时除去标记和3’阻断基团的试剂。在另外的一些实施方案中,本文所述的再生试剂可以包括可从经标记的核苷酸上除去标记的试剂和可从经标记的核苷酸上除去3’阻断基团的试剂。In a preferred embodiment, the label on the labeled nucleotide itself can be used as a 3' blocking group. Without being bound by theory, for example, such a label may be of a size or structure sufficient to function to block the incorporation of other nucleotides into the polynucleotide strand. The blockage may be due to steric hindrance or may be due to a combination of size, charge and structure. In certain embodiments, the labeling and blocking groups on the labeled nucleotides can be different, but preferably the labeling and blocking groups can be removed from the nucleotides in the same manner. In such cases, the regenerative reagents described herein comprise an agent that removes both the label and the 3' blocking group from the labeled nucleotide. In still other embodiments, the regenerative reagents described herein can include an agent that can remove the label from the labeled nucleotide and an agent that can remove the 3' blocking group from the labeled nucleotide.
如本文所用,术语“聚合酶”是指合成核酸链或聚合物的酶,包括DNA聚合酶和RNA聚合酶。优选地,本文所用的聚合酶是DNA聚合酶。可以使用的一种聚合酶是SequenaseTM(来源于噬菌体7 DNA聚合酶的酶,其被修饰以改善其测序性质-参见Tabor and Richarson,Proc.Nat.Acad.Sci.USA,84:4767-4771(1987),可购自例如United States Biochemical Corporation,Cleveland,Ohio)。可用于代替SequenaseTM的其它聚合酶包括但不限于DNA聚合酶I的Klenow片段,AMV逆转录酶以及Taq聚合酶。关于聚合酶的其他描述还可参见WO05024010和WO06120433,其全部内容通过引用并入本文。As used herein, the term "polymerase" refers to an enzyme that synthesizes a nucleic acid strand or polymer, including DNA polymerases and RNA polymerases. Preferably, the polymerase used herein is a DNA polymerase. May be used is one polymerase Sequenase TM (7 DNA polymerase from bacteriophage enzyme, which sequence is modified to improve its properties - see Tabor and Richarson, Proc.Nat.Acad.Sci.USA, 84: 4767-4771 (1987), available from, for example, United States Biochemical Corporation, Cleveland, Ohio). Other polymerases that can be used in place of Sequenase (TM) include, but are not limited to, the Klenow fragment of DNA polymerase I, AMV reverse transcriptase, and Taq polymerase. Further description of the polymerase can also be found in WO05024010 and WO06120433, the entire contents of which are hereby incorporated by reference.
通常使用的聚合条件是本领域已知的这些酶的聚合条件。在SequenaseTM的情况下,聚合条件包括在约室温至约45℃范围内的温度;pH 7至8的缓冲液,优选pH 7.3至7.7;酶浓度为约0.01单位/微升至约1单位/微升,反应时间为约1至约20分钟,优选1至5分钟。用于SequenaseTM的典型缓冲液由以下组成:0.040M Tris HCI(pH7.5)0.050M氯化钠,0.010M氯化镁,0.010M二硫苏糖醇。在DNA聚合酶I的Klenow片段的情况下,这些典型的条件包括在约10℃至约45℃范围内的温度,优选约15℃至约40℃;pH 6.8至7.4的缓冲液,优选pH 7.0至7.4;酶浓度为约0.01单位/微升至 约1单位/微升,优选约0.02至约0.15单位/微升,反应时间为约1至约40分钟。用于DNA聚合酶I的Klenow片段的典型缓冲液由以下组成:0.05M三羟甲基氨基甲烷氯化物,pH 7.5 0.05M氯化镁,0.05M氯化钠,0.010M二硫苏糖醇。The polymerization conditions generally used are the polymerization conditions of these enzymes known in the art. In the case of Sequenase TM, the polymerization conditions include a temperature in the range of from about room temperature to about 45 ℃; and pH 7 buffer to 8, preferably pH 7.3 to 7.7; an enzyme concentration of from about 0.01 units / [mu] l to about 1 unit / The microliter is for a period of from about 1 to about 20 minutes, preferably from 1 to 5 minutes. Typical for Sequenase TM buffer consisting of: 0.040M Tris HCI (pH7.5) 0.050M sodium chloride, 0.010M magnesium chloride, 0.010M dithiothreitol. In the case of the Klenow fragment of DNA polymerase I, these typical conditions include a temperature in the range of from about 10 ° C to about 45 ° C, preferably from about 15 ° C to about 40 ° C; a buffer of pH 6.8 to 7.4, preferably pH 7.0 To 7.4; the enzyme concentration is from about 0.01 unit/microliter to about 1 unit/microliter, preferably from about 0.02 to about 0.15 unit/microliter, and the reaction time is from about 1 to about 40 minutes. A typical buffer for the Klenow fragment of DNA polymerase I consists of 0.05 M trishydroxymethylammonium chloride, pH 7.5 0.05 M magnesium chloride, 0.05 M sodium chloride, 0.010 M dithiothreitol.
应当理解,这些条件仅是示例性的。当使用其它聚合酶时,应该使用最适合它们的条件,因为通常希望尽可能快地进行聚合反应。为此,通常对于逆转录酶需要使用42℃的温度;对于Klenow聚合酶为24℃;对于SequenaseTM为37℃;和对于Taq聚合酶为72℃。此外,为了增强反应,特别是在使用经修饰的dNTP的情况下,使用显著过量的dNTP(超过化学计量)或修饰其他条件如盐浓度可以是有利的。It should be understood that these conditions are merely exemplary. When other polymerases are used, the conditions most suitable for them should be used, as it is generally desirable to carry out the polymerization as quickly as possible. For this reason, a temperature of 42 ° C is usually required for the reverse transcriptase; 24 ° C for Klenow polymerase; 37 ° C for Sequenase TM ; and 72 ° C for Taq polymerase. Furthermore, in order to enhance the reaction, particularly in the case of modified dNTPs, it may be advantageous to use a significant excess of dNTPs (beyond stoichiometry) or to modify other conditions such as salt concentration.
如本文所用,术语“互补”或“基本互补”是指核苷酸或核酸之间的杂交或碱基配对或者双链体形成。如果在一个给定位置上一个核酸的核苷酸能够与另一个核酸的核苷酸形成氢键,则这两个核酸被认为在该位置上是彼此互补的。互补核苷酸一般是A和T(或A和U)或者C和G。当在最佳地比对和比较并适当地进行了核苷酸插入或缺失的情况下,一条链的核苷酸与另一条链的至少大约80%、通常至少大约90%到约95%,甚至大约98%到100%配对时,这两个单链RNA或DNA分子被称为基本互补。As used herein, the term "complementary" or "substantially complementary" refers to hybridization or base pairing or duplex formation between nucleotides or nucleic acids. If a nucleotide of one nucleic acid at a given position is capable of forming a hydrogen bond with a nucleotide of another nucleic acid, then the two nucleic acids are considered to be complementary to each other at that position. Complementary nucleotides are typically A and T (or A and U) or C and G. Where, in the case of optimal alignment and comparison and appropriate nucleotide insertions or deletions, at least about 80%, typically at least about 90% to about 95% of the nucleotides of one strand, and the other strand, Even when about 98% to 100% are paired, the two single-stranded RNA or DNA molecules are said to be substantially complementary.
如本文所用,术语“杂交”是指互补的核苷酸或核苷酸碱基之间足够的氢键结合,其可以是,例如,Watson-Crick,Hoogsteen或反向的Hoogsteen氢键结合,以使得核酸链之间发生稳定和特异性结合。杂交能力根据严格条件(包括合适的缓冲液浓度和温度,其允许特异性杂交至具有完全或部分互补性区域的靶核酸)确定。因此,不是核酸的所有核苷酸都需要是互补的。此外,当核酸链与靶核酸的所有、部分或重叠区域杂交时,核酸链是“基本互补”的。建立用于设计本发明的寡核苷酸或引物的严格杂交条件的定性和定量考虑事项是本领域已知的,参见,例如,Ausubel等人,Short Protocols in Molecular Biology(4th ed.,John Wiley & Sons 1999);Sambrook等人,Molecular Cloning:A Laboratory Manual(3d ed.,Cold Spring Harbor Laboratory Press 2001):Nucleic Acid Hybridisation:A Practical Approach(B.D.Hames & S.J.Higgins eds.,IRL Press 1985)。As used herein, the term "hybridization" refers to a sufficient hydrogen bonding between complementary nucleotide or nucleotide bases, which may be, for example, Watson-Crick, Hoogsteen or reverse Hoogsteen hydrogen bonding, Stable and specific binding occurs between nucleic acid strands. Hybridization capacity is determined according to stringent conditions, including appropriate buffer concentrations and temperatures, which allow for specific hybridization to a target nucleic acid having a fully or partially complementary region. Therefore, not all nucleotides of a nucleic acid need to be complementary. Furthermore, a nucleic acid strand is "substantially complementary" when the nucleic acid strand hybridizes to all, part or overlapping regions of the target nucleic acid. Qualitative and quantitative considerations for establishing stringent hybridization conditions for designing oligonucleotides or primers of the invention are known in the art, see, for example, Ausubel et al., Short Protocols in Molecular Biology (4th ed., John Wiley). & Sons 1999); Sambrook et al., Molecular Cloning: A Laboratory Manual (3d ed., Cold Spring Harbor Laboratory Press 2001): Nucleic Acid Hybridisation: A Practical Approach (BD Hames & SJ Higgins eds., IRL Press 1985).
如本文所用,术语“再生试剂”是指在对固体支持物上的核酸分子进行测序后,能够消除固体支持物上的测序信号(例如,荧光信号)以使得能够进行下一轮测序的试剂。在本发明的方法中所使用的具体的再生试剂将取决于所采用的测序方法。根据测序方法确定在本发明的方法中所使用的具体的再生试剂在本领域技术人员的能力范 围内。As used herein, the term "regeneration agent" refers to an agent that, upon sequencing a nucleic acid molecule on a solid support, is capable of eliminating sequencing signals (eg, fluorescent signals) on a solid support to enable next round of sequencing. The particular regenerating reagent used in the methods of the invention will depend on the sequencing method employed. Determining the specific regenerative reagents used in the methods of the invention according to sequencing methods is within the skill of those skilled in the art. Inside.
例如,在连接法测序中,再生试剂包括从被测序的核酸分子上消除代表所述核酸分子的核苷酸序列的信号并使得能够起始下一个连接测序反应的试剂,优选地,这样的试剂是还原剂。在一个实施方案中,再生试剂包括从经标记(例如荧光标记)的测序探针上除去标记(例如荧光标记)的试剂,优选地,这样的试剂是还原剂。For example, in ligation sequencing, a regenerating reagent includes an agent that eliminates a signal representing a nucleotide sequence of the nucleic acid molecule from the sequenced nucleic acid molecule and enables initiation of the next ligation sequencing reaction, preferably such a reagent It is a reducing agent. In one embodiment, the regenerating reagent comprises an agent that removes a label (eg, a fluorescent label) from a labeled (eg, fluorescently labeled) sequencing probe, preferably such reagent is a reducing agent.
又例如,在合成法测序中,再生试剂包含用于从被测序的核酸分子上消除代表所述核酸分子的核苷酸序列的信号并使得能够起始下一个聚合测序反应的试剂,优选地,这样的试剂是还原剂。在一个实施方案中,再生试剂包括从经标记(例如荧光标记)的核苷酸除去标记(例如荧光标记)的试剂,优选地,这样的试剂是还原剂。In another example, in synthetic sequencing, the regenerating reagent comprises reagents for eliminating a signal representative of the nucleotide sequence of the nucleic acid molecule from the sequenced nucleic acid molecule and enabling initiation of the next polymerization sequencing reaction, preferably, Such an agent is a reducing agent. In one embodiment, the regenerating reagent comprises an agent that removes a label (eg, a fluorescent label) from a labeled (eg, fluorescently labeled) nucleotide, preferably such reagent is a reducing agent.
在优选的实施方案中,从经标记(例如荧光标记)的测序探针或核苷酸上除去标记(例如荧光标记)的试剂可以是还原剂,例如在使用对还原裂解敏感的官能团将标记连接至核苷酸的情况下。这样的还原剂的实例包括但不限于金属催化剂例如镍催化剂、铂催化剂、钯催化剂,醇类化合物,酚类化合物,醛类化合物,烯烃类化合物,胺类化合物,酮类化合物,含磷化合物,过氧化物、硫醇类化合物例如β-巯基乙醇、二硫苏糖醇,膦(例如三(羟甲基)膦(THP),三(2-羧乙基)膦等)(TCEP),等,其他还原剂的实例还可参见例如网页http://www.organic-chemistry.org/chemicals/reductions/上列举的那些。优选地,还原剂是能够还原二硫键的任何合适的还原剂,包括但不限于硫醇类化合物例如β-巯基乙醇、二硫苏糖醇,膦(例如三(羟甲基)膦(THP),三(2-羧乙基)膦等)(TCEP)等。应理解,在本发明的实施方案中使用的含有还原剂的再生试剂应当能够从经标记(例如荧光标记)的测序探针或核苷酸上除去标记(例如荧光标记)同时不会破坏测序探针、核苷酸、合成的多核苷酸、固体支持物上的核酸分子的化学结构以及核酸分子与固体支持物的结合。在优选的实施方案中,再生试剂包括用于BGISEQ-500测序平台的测序试剂盒中的再生试剂,例如BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30)中的再生试剂。In a preferred embodiment, the reagent that removes the label (eg, fluorescent label) from the labeled (eg, fluorescently labeled) sequencing probe or nucleotide can be a reducing agent, such as by linking the label with a functional group that is susceptible to reductive cleavage. In the case of nucleotides. Examples of such a reducing agent include, but are not limited to, a metal catalyst such as a nickel catalyst, a platinum catalyst, a palladium catalyst, an alcohol compound, a phenol compound, an aldehyde compound, an olefin compound, an amine compound, a ketone compound, a phosphorus-containing compound, Peroxides, thiol compounds such as β-mercaptoethanol, dithiothreitol, phosphines (eg, tris(hydroxymethyl)phosphine (THP), tris(2-carboxyethyl)phosphine, etc.) (TCEP), etc. Examples of other reducing agents can also be found, for example, at the web page http://www.organic-chemistry.org/chemicals/reductions/. Preferably, the reducing agent is any suitable reducing agent capable of reducing disulfide bonds, including but not limited to thiol compounds such as β-mercaptoethanol, dithiothreitol, phosphines (eg, tris(hydroxymethyl)phosphine (THP) ), tris(2-carboxyethyl)phosphine, etc.) (TCEP) and the like. It will be appreciated that the regenerant containing reductant used in embodiments of the invention should be capable of removing labels (e.g., fluorescent labels) from labeled (e.g., fluorescently labeled) sequencing probes or nucleotides without disrupting sequencing. A needle, a nucleotide, a synthetic polynucleotide, a chemical structure of a nucleic acid molecule on a solid support, and a binding of a nucleic acid molecule to a solid support. In a preferred embodiment, the regenerating reagent comprises a regenerative reagent for use in a sequencing kit for the BGISEQ-500 sequencing platform, such as the BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhicheng Technology Co., Ltd., Regeneration reagent in item number PF-UM-PEV30).
因此,在一个实施方案中,本发明涉及消除再生试剂残留的方法,其包括:a.提供其上固定有核酸分子的固体支持物,b.使用用于BGISEQ-500测序平台的测序试剂盒中的再生试剂,例如BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30)中的再生试剂处理所述固体支持物,所述再生试剂包含还原剂,和c.使用添加了含有二硫键结构的化合物的洗涤溶液处理所述固体 支持物。优选地,经过步骤c的处理之后,所述固体支持物上不含或基本上不含再生试剂。在一个实施方案中,在步骤b之前,任选地使用含有用于BGISEQ-500测序平台的测序试剂盒中的测序试剂(例如BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30)中的测序试剂)的一种或多种反应溶液处理所述固体支持物,以在固体支持物上产生代表所述核酸分子的核苷酸序列的信号(例如荧光信号),并且其中所述再生试剂可以消除固体支持物上的所述信号(例如荧光信号)。在一个实施方案中,经过步骤c的处理之后,任选地还使用含有用于BGI-SEQ500测序平台的测序试剂盒中的测序试剂(例如BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30)中的测序试剂)的一种或多种反应溶液处理所述固体支持物。Accordingly, in one embodiment, the invention relates to a method of eliminating regenerative reagent residues comprising: a. providing a solid support on which a nucleic acid molecule is immobilized, b. using a sequencing kit for the BGISEQ-500 sequencing platform The regenerative reagent, such as the BGISEQ-500 high-throughput sequencing kit (SE50 V3.0, Shenzhen Huada Zhizhi Technology Co., Ltd., Cat. No. PF-UM-PEV30), treats the solid support, the regenerating reagent Containing a reducing agent, and c. treating the solid with a washing solution to which a compound having a disulfide bond structure is added Support. Preferably, the solid support is free or substantially free of regenerating reagent after the treatment of step c. In one embodiment, prior to step b, a sequencing reagent containing a sequencing kit for the BGISEQ-500 sequencing platform (eg, BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0, Shenzhen Hua) is optionally used. The solid support is treated with one or more reaction solutions of a sequencing reagent in Dazhicheng Technology Co., Ltd., No. PF-UM-PEV30) to generate a nucleotide sequence representing the nucleic acid molecule on a solid support a signal (eg, a fluorescent signal), and wherein the regeneration reagent can eliminate the signal (eg, a fluorescent signal) on the solid support. In one embodiment, after the treatment of step c, optionally using a sequencing reagent in a sequencing kit for the BGI-SEQ500 sequencing platform (eg, BGISEQ-500 High Throughput Sequencing Kit (SE50 V3.0) The solid support is treated with one or more reaction solutions of Shenzhen Huada Zhixing Technology Co., Ltd., the sequencing reagent in PF-UM-PEV30).
在一个方面,本发明还涉及测序用的试剂盒,所述试剂盒包括如本文所述的测序试剂、洗涤试剂、包含还原剂的再生试剂和添加了含有二硫键结构的化合物的洗涤溶液,其中所述测序试剂可以在其上固定有核酸分子的固体支持物上产生代表所述核酸分子的核苷酸序列的信号(例如,荧光信号),并且其中所述再生试剂可以消除固体支持物上的所述信号。在一个实施方案中,所述测序试剂包括用于连接法测序或合成法测序的测序试剂。如本文所用,洗涤试剂是能够将固体支持物上与其非特异性结合的物质洗掉并且不会不利地影响后续反应的任何溶液。适当地,洗涤试剂含有缓冲剂,例如有机盐,以保持约pH 6至pH 9的稳定pH,并且可能还包含一价或二价阳离子,从而从固体支持物除去非特异性结合的分子。示例性的洗涤试剂可以包括例如pH 6.5的100mM Tris-HCl缓冲液、TE缓冲液(Tris-HCl pH 8,10mM和EDTA,1mM)等。缓冲剂还可以是例如如下文所述的缓冲剂。In one aspect, the invention also relates to a kit for sequencing, the kit comprising a sequencing reagent, a washing reagent, a regenerating reagent comprising a reducing agent, and a washing solution to which a compound containing a disulfide bond structure is added, as described herein, Wherein the sequencing reagent can generate a signal (eg, a fluorescent signal) representing a nucleotide sequence of the nucleic acid molecule on a solid support to which the nucleic acid molecule is immobilized, and wherein the regeneration reagent can eliminate the solid support The signal. In one embodiment, the sequencing reagent comprises a sequencing reagent for ligation sequencing or synthetic sequencing. As used herein, a wash reagent is any solution that is capable of washing away a substance on its solid support that is non-specifically bound thereto and does not adversely affect subsequent reactions. Suitably, the wash reagent contains a buffer, such as an organic salt, to maintain a stable pH of from about pH 6 to pH 9, and may also contain monovalent or divalent cations to remove non-specifically bound molecules from the solid support. Exemplary washing reagents can include, for example, 100 mM Tris-HCl buffer at pH 6.5, TE buffer (Tris-HCl pH 8, 10 mM and EDTA, 1 mM), and the like. The buffering agent can also be, for example, a buffering agent as described below.
如本文所用,含有二硫键结构的化合物包括其化学结构中含有-S-S-形式的硫原子间的键(也称为二硫键)的任何化合物。As used herein, a compound having a disulfide bond structure includes any compound having a bond (also referred to as a disulfide bond) between sulfur atoms in the -S-S-form in its chemical structure.
在某些实施方案中,含有二硫键结构的化合物具有通式-R-S-S-R'-,其中R和R’各自独立地选自氢、任选取代的烷基、任选取代的烯基、任选取代的炔基、任选取代的环烷基、任选取代的环烯基、任选取代的环炔基、任选取代的芳基、任选取代的杂芳基、任选取代的杂环基、任选取代的杂脂环基、任选取代的芳烷基或任选取代的(杂脂环基)烷基。In certain embodiments, the compound containing a disulfide bond structure has the formula -RSS-R'-, wherein R and R' are each independently selected from hydrogen, optionally substituted alkyl, optionally substituted alkenyl, Optionally substituted alkynyl, optionally substituted cycloalkyl, optionally substituted cycloalkenyl, optionally substituted cycloalkynyl, optionally substituted aryl, optionally substituted heteroaryl, optionally substituted Heterocyclyl, optionally substituted heteroalicyclic, optionally substituted aralkyl or optionally substituted (heteroalicyclic)alkyl.
在某些实施方案中,含有二硫键结构的化合物具有通式- C(R1)m(R2)m’-C(R3)p-S-S-C(R4)q-C(R5)n(R6)n’-,其中R1、R2、R3、R4、R5和R6各自独立地选自氢、氧代、羟基、被保护的羟基、烃氧基、芳氧基、酰基、巯基、烷硫基、芳硫基、氰基、卤素、硫代羰基、O-氨基甲酰基、N-氨甲酰基、O-硫代氨基甲酰基、N-硫代氨基甲酰基、C-酰氨基、N-酰氨基、S-磺酰氨基、N-磺酰氨基、C-羧基、被保护的C-羧基、O-羧基、异氰酸基、硫代氰酸基、异硫代氰酸基、硝基、甲硅烷基、氧硫基、亚磺酰基、磺酰基、卤代烷基、卤代烃氧基、三卤代甲磺酰基、三卤代甲磺酰氨基、氨基、单取代的氨基、二取代的氨基、任选取代的烷基、任选取代的烯基、任选取代的炔基、任选取代的环烷基、任选取代的环烯基、任选取代的环炔基、任选取代的芳基、任选取代的杂芳基、任选取代的杂环基、任选取代的杂脂环基、任选取代的芳烷基或任选取代的(杂脂环基)烷基,其中m和m’各自为0-3的整数,条件是0<m+m’≤3,n和n’各自为0-3的整数,条件是0<m+m’≤3,p和q各自为1或2的整数。在一些实施方案中,R1、R2、R5和R6各自独立地选自氢、羟基或氨基,R3和R4选自氢。In certain embodiments, the compound containing a disulfide bond structure has the formula -C(R 1 ) m (R 2 ) m' -C(R 3 ) p -SSC(R 4 ) q -C(R 5 ) n (R 6 ) n' -, wherein R 1 , R 2 , R 3 , R 4 , R 5 and R 6 are each independently selected from the group consisting of hydrogen, oxo, hydroxy, protected hydroxy, alkoxy, aryloxy Base, acyl, decyl, alkylthio, arylthio, cyano, halogen, thiocarbonyl, O-carbamoyl, N-carbamoyl, O-thiocarbamoyl, N-thiocarbamoyl , C-amido, N-acylamino, S-sulfonylamino, N-sulfonylamino, C-carboxyl, protected C-carboxyl, O-carboxyl, isocyanato, thiocyanate, iso Thiocyanate, nitro, silyl, oxythio, sulfinyl, sulfonyl, haloalkyl, halohydrocarbyloxy, trihalomethylsulfonyl, trihalomethylsulfonylamino, amino, Monosubstituted amino, disubstituted amino, optionally substituted alkyl, optionally substituted alkenyl, optionally substituted alkynyl, optionally substituted cycloalkyl, optionally substituted cycloalkenyl, optionally substituted Cycloalkynyl, optionally substituted aryl, optionally substituted heteroaryl An optionally substituted heterocyclic group, an optionally substituted heteroalicyclic group, an optionally substituted aralkyl group or an optionally substituted (heteroalicyclic)alkyl group, wherein m and m' are each 0-3 An integer, the condition is 0 < m + m' ≤ 3, n and n' are each an integer of 0-3, provided that 0 < m + m' ≤ 3, and p and q are each an integer of 1 or 2. In some embodiments, R 1 , R 2 , R 5 , and R 6 are each independently selected from hydrogen, hydroxy, or amino, and R 3 and R 4 are selected from hydrogen.
在优选的实施方案中,含有二硫键结构的化合物可以选自但不限于诸如二乙基二硫醚、二-n-丙基二硫醚、二-n-丁基二硫醚、二-仲-丁基二硫醚、二-叔丁基二硫醚、二-n-戊基二硫醚、二-叔戊基二硫醚、二-叔-己基二硫醚、二-n-辛基二硫醚、二-叔辛基二硫醚、二-n-十二烷基二硫醚、二-叔十二烷基二硫醚、二-n-硬脂二硫醚、乙基-n-丙基二硫醚、乙基-叔-丁基二硫醚、乙基-叔-丁基二硫醚、n-丙基-异丙基二硫醚等的二烷基二硫醚化合物;诸如二苯基二硫醚、二苄基硫醚、二-p-甲苯基二硫醚等的芳香族二硫醚化合物;诸如二丙撑二硫醚、二丁撑二硫醚、二环己基二硫醚等的环状二硫醚化合物;诸如胱胺、二氨基二苯基二硫醚等的含氨基的二硫醚化合物;诸如二硫代二乙醇酸、二硫代二丙酸、二硫代二乙醇酸-2-乙基己基等的羧酸及羧酸衍生物的二硫醚化合物;诸如二烯丙基二硫醚等的不饱和键合的二硫醚化合物等;诸如2-羟乙基二硫化物、2-n-羟丙基二硫化物、2-羟丁基二硫化物、二-羟戊基二硫醚、二-羟己基二硫醚、二-羟辛基二硫醚、等羟基取代的二烷基二硫醚化合物;以及其任意衍生物。In a preferred embodiment, the compound containing a disulfide bond structure may be selected from, but not limited to, such as diethyl disulfide, di-n-propyl disulfide, di-n-butyl disulfide, di- Sec-butyl disulfide, di-tert-butyl disulfide, di-n-pentyl disulfide, di-tert-amyl disulfide, di-tert-hexyl disulfide, di-n-octyl Dithioether, di-tert-octyl disulfide, di-n-dodecyl disulfide, di-tert-dodecyl disulfide, di-n-stearyl disulfide, ethyl- Dialkyl disulfide compounds such as n-propyl disulfide, ethyl-tert-butyl disulfide, ethyl-tert-butyl disulfide, n-propyl-isopropyl disulfide An aromatic disulfide compound such as diphenyl disulfide, dibenzyl sulfide, di-p-tolyl disulfide or the like; such as dipropylene disulfide, dibutyl disulfide, bicyclo a cyclic disulfide compound such as hexyl disulfide; an amino group-containing disulfide compound such as cystamine or diaminodiphenyl disulfide; such as dithiodiglycolic acid, dithiodipropionic acid, a dicarboxylic acid compound of a carboxylic acid such as dithiodiglycol-2-ethylhexyl or a carboxylic acid derivative; An unsaturatedly bonded disulfide compound such as diallyl disulfide; etc.; such as 2-hydroxyethyl disulfide, 2-n-hydroxypropyl disulfide, 2-hydroxybutyl disulfide , bis-hydroxypentyl disulfide, di-hydroxyhexyl disulfide, di-hydroxyoctyl disulfide, an iso-hydroxy substituted dialkyl disulfide compound; and any derivative thereof.
在优选的实施方案中,含有二硫键结构的化合物选自胱胺、胱胺酸、5,5′-二硫代双(2-硝基苯甲酸)和2-羟乙基二硫化物及其衍生物。In a preferred embodiment, the compound containing a disulfide bond structure is selected from the group consisting of cystamine, cystine, 5,5'-dithiobis(2-nitrobenzoic acid), and 2-hydroxyethyl disulfide. Its derivatives.
如本文中使用的,R、R’、R1、R2、R3、R4、R5和R6基团代表能够与指定的原子连接的取代基。R、R’、R1、R2、R3、R4、R5和R6基团可以是取代的也可以是未 取代的。R、R’、R1、R2、R3、R4、R5和R6基团与它们所连接的原子一起能够形成环烷基、芳基、杂芳基或杂环。As used herein, the R, R', R 1 , R 2 , R 3 , R 4 , R 5 and R 6 groups represent substituents which are capable of attaching to a specified atom. The R, R', R 1 , R 2 , R 3 , R 4 , R 5 and R 6 groups may be substituted or unsubstituted. The R, R', R 1 , R 2 , R 3 , R 4 , R 5 and R 6 groups together with the atoms to which they are attached are capable of forming a cycloalkyl, aryl, heteroaryl or heterocyclic ring.
当将基团描述为“任选取代的”时,该基团可以为未取代的,或者被一个或多个指定的取代基取代。同样的,当描述基团为“未取代的或取代的”时,如果是取代的,则取代基可选自一个或多个指定的取代基。如果没有指定取代基,则意味着指定的“任选取代的”或“取代的”基团可以单独地且独立地被一个或多个基团取代,所述基团单独地且独立地选自官能团,其包括但不限于:烷基、烯基、炔基、环烷基、环烯基、环炔基、芳基、杂芳基、杂脂环基、芳烷基、杂芳烷基、(杂脂环基)烷基、羟基、被保护的羟基、烃氧基、芳氧基、酰基、巯基、烷硫基、芳硫基、氰基、卤素、硫代羰基、O-氨基甲酰基、N-氨甲酰基、O-硫代氨基甲酰基、N-硫代氨基甲酰基、C-酰氨基、N-酰氨基、S-磺酰氨基、N-磺酰氨基、C-羧基、被保护的C-羧基、O-羧基、异氰酸基(isocyanato)、硫代氰酸基(thiocyanato)、异硫代氰酸基(isothiocyanato)、硝基、甲硅烷基、氧硫基、亚磺酰基、磺酰基、卤代烷基、卤代烃氧基、三卤代甲磺酰基、三卤代甲磺酰氨基、氨基、单取代的氨基、二取代的氨基及其被保护的衍生物。When a group is described as "optionally substituted," the group may be unsubstituted or substituted with one or more specified substituents. Likewise, when the group is described as being "unsubstituted or substituted," if substituted, the substituent may be selected from one or more of the specified substituents. If no substituent is specified, it is meant that the specified "optionally substituted" or "substituted" group may be substituted, individually and independently, by one or more groups, which are individually and independently selected from the group consisting of Functional groups, including but not limited to: alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl, heteroarylalkyl, (heteroalicyclic)alkyl, hydroxy, protected hydroxy, alkoxy, aryloxy, acyl, decyl, alkylthio, arylthio, cyano, halogen, thiocarbonyl, O-carbamoyl , N-carbamoyl, O-thiocarbamoyl, N-thiocarbamoyl, C-amido, N-acylamino, S-sulfonylamino, N-sulfonylamino, C-carboxy, quilt Protected C-carboxyl, O-carboxyl, isocyanato, thiocyanato, isothiocyanato, nitro, silyl, oxythio, sulfin Acyl, sulfonyl, haloalkyl, halohydrocarbyloxy, trihalomethylsulfonyl, trihalomethylsulfonylamino, amino, monosubstituted amino, disubstituted amino and Protected derivatives.
如本文中使用的,“烷基”指直链或支链的烃链,其包括完全饱和的(无双键或三键)的烃基。在某些实施方案中,所述烷基可以具有1-20个碳原子(每当其在本文中出现时,如“1-20”的数值范围是指给定范围内的包括端点在内的每个整数;例如,“1-20个碳原子”意味着烷基可以由1个碳原子、2个碳原子、3个碳原子等组成,直到并包括20个碳原子,然而本定义也涵盖了未指定数值范围的术语“烷基”的出现)。所述烷基也可以是具有约7个-约10个碳原子的中等大小的烷基。所述烷基也可以是具有1-6个碳原子的低级烷基。化合物的烷基可以被指定为“C1-C4的烷基”或类似的指定。仅举例说明,“C1-C4的烷基”表示烷基链中存在一到四个碳原子,即所述烷基链选自甲基、乙基、丙基、异丙基、正丁基、异丁基、仲丁基和叔丁基。典型的烷基包括但不限于甲基、乙基、丙基、异丙基、丁基、异丁基、叔丁基、戊基和己基。所述烷基可以是取代的或未取代的。As used herein, "alkyl" refers to a straight or branched hydrocarbon chain that includes a fully saturated (no double or triple bond) hydrocarbon group. In certain embodiments, the alkyl group can have from 1 to 20 carbon atoms (when it appears herein, such as the numerical range of "1-20" refers to a range including endpoints within a given range. Each integer; for example, "1-20 carbon atoms" means that the alkyl group may consist of 1 carbon atom, 2 carbon atoms, 3 carbon atoms, etc. up to and including 20 carbon atoms, however this definition also covers The occurrence of the term "alkyl" which does not specify a range of values). The alkyl group can also be a medium size alkyl group having from about 7 to about 10 carbon atoms. The alkyl group may also be a lower alkyl group having 1 to 6 carbon atoms. The alkyl group of the compound can be designated as "C 1 -C 4 alkyl group" or the like. By way of example only, "C 1 -C 4 alkyl" means that one to four carbon atoms are present in the alkyl chain, ie, the alkyl chain is selected from the group consisting of methyl, ethyl, propyl, isopropyl, n-butyl Base, isobutyl, sec-butyl and tert-butyl. Typical alkyl groups include, but are not limited to, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, tert-butyl, pentyl, and hexyl. The alkyl group can be substituted or unsubstituted.
如本文中使用的,“烯基”指在直链或支链的烃链中含有一个或多个双键的烃基基团。烯基可以是未取代的或取代的。As used herein, "alkenyl" refers to a hydrocarbyl group containing one or more double bonds in a straight or branched hydrocarbon chain. The alkenyl group can be unsubstituted or substituted.
如本文中使用的,“炔基”指在直链或支链的烃链中含有一个或多个三键的烃基基团。炔基可以是未取代的或取代的。 As used herein, "alkynyl" refers to a hydrocarbyl group containing one or more triple bonds in a straight or branched hydrocarbon chain. An alkynyl group can be unsubstituted or substituted.
如本文中使用的,“环烷基”指完全饱和的(无双键或三键)单环或多环的烃环系统。当由两个或更多个环组成时,所述环可以以稠合的形式结合在一起。环烷基可以在环中含有3-10个原子。在某些实施方案中,环烷基可以在环中含有3-8个原子。环烷基可以是未取代的或取代的。典型的环烷基包括但绝不限于环丙基、环丁基、环戊基、环己基、环庚基和环辛基。As used herein, "cycloalkyl" refers to a fully saturated (no double or triple bond) monocyclic or polycyclic hydrocarbon ring system. When composed of two or more rings, the rings may be bonded together in a fused form. The cycloalkyl group may have 3 to 10 atoms in the ring. In certain embodiments, a cycloalkyl group can contain from 3 to 8 atoms in the ring. A cycloalkyl group can be unsubstituted or substituted. Typical cycloalkyl groups include, but are in no way limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl and cyclooctyl.
如本文中使用的,“芳基”指碳环(全部为碳)的单环或多环芳香环系统(包括,例如,稠环、桥环或螺环系统,其中两个碳环共享化学键,例如,一个或多个芳环带有一个或多个芳环或非芳环),其具有遍及至少一个环的完全离域的π电子系统。芳基中的碳原子数可变。例如,在某些实施方案中,所述芳基可以为C6-C14芳基、C6-C1o芳基或C6芳基。芳基的实例包括但不限于苯、萘和甘菊环。芳基可以是取代的或未取代的。As used herein, "aryl" refers to a monocyclic or polycyclic aromatic ring system of carbon rings (all carbon) (including, for example, fused, bridged or spiro ring systems in which two carbon rings share a chemical bond, For example, one or more aromatic rings carry one or more aromatic or non-aromatic rings) having a fully delocalized pi-electron system throughout at least one ring. The number of carbon atoms in the aryl group is variable. For example, in certain embodiments, the aryl group can be a C 6 -C 14 aryl group, a C 6 -C 1o aryl group, or a C 6 aryl group. Examples of aryl groups include, but are not limited to, benzene, naphthalene, and chamomile rings. The aryl group can be substituted or unsubstituted.
如本文中使用的,“杂环基”指包含至少一个杂原子(例如,O、N、S)的环系统。这类系统可以是不饱和的,可以包括部分不饱和,或可以包含一些芳香部分,或为完全芳香的。杂环基可以是未取代的或取代的。As used herein, "heterocyclyl" refers to a ring system comprising at least one hetero atom (eg, O, N, S). Such systems may be unsaturated, may include partial unsaturation, or may contain some aromatic moieties, or be completely aromatic. The heterocyclic group may be unsubstituted or substituted.
如本文中使用的,“杂芳基”指单环或多环的芳香环系统(环系统具有至少一个为完全离域的π电子系统的环),其包含一个或多个杂原子,即碳以外的元素,包括但不限于氮、氧和硫,并且包含至少一个芳香环。杂芳基的环中的原子数可变。例如,在某些实施方案中,杂芳基可包含4-14个环内原子、5-10个环内原子或5-6个环内原子。此外,术语“杂芳基”包括稠环系统,其中两个环,例如至少一个芳环和至少一个杂芳环,或者至少两个杂芳环共享至少一个化学键。杂芳环的实例包括但不限于呋喃、呋咱、噻吩、苯并噻吩、酞嗪、吡咯、噁唑、苯并噁唑、1,2,3-噁二唑、1,2,4-噁二唑、噻唑、1,2,3-噻二唑、1,2,4-噻二唑、苯并噻唑、咪唑、苯并咪唑、吲哚、吲唑、吡唑、苯并吡唑、异噁唑、苯并异噁唑、异噻唑、三唑、苯并三唑、噻二唑、四唑、吡啶、哒嗪、嘧啶、吡嗪、嘌呤、蝶啶、喹啉、异喹啉、喹唑啉、喹喔啉、邻二氮杂萘和三嗪。杂芳基可以是取代的或未取代的。As used herein, "heteroaryl" refers to a monocyclic or polycyclic aromatic ring system (a ring system having at least one ring that is a fully delocalized pi-electron system) that contains one or more heteroatoms, ie, carbon Elements other than, including but not limited to nitrogen, oxygen and sulfur, and comprising at least one aromatic ring. The number of atoms in the ring of the heteroaryl group is variable. For example, in certain embodiments, a heteroaryl group can contain 4-14 ring atoms, 5-10 ring atoms, or 5-6 ring atoms. Furthermore, the term "heteroaryl" includes fused ring systems in which two rings, for example at least one aromatic ring and at least one heteroaryl ring, or at least two heteroaryl rings share at least one chemical bond. Examples of heteroaromatic rings include, but are not limited to, furan, furazan, thiophene, benzothiophene, pyridazine, pyrrole, oxazole, benzoxazole, 1,2,3-oxadiazole, 1,2,4-oxa Diazole, thiazole, 1,2,3-thiadiazole, 1,2,4-thiadiazole, benzothiazole, imidazole, benzimidazole, indole, oxazole, pyrazole, benzopyrazole, different Oxazole, benzisoxazole, isothiazole, triazole, benzotriazole, thiadiazole, tetrazole, pyridine, pyridazine, pyrimidine, pyrazine, indole, pteridine, quinoline, isoquinoline, quin Oxazoline, quinoxaline, o-naphthyridine and triazine. Heteroaryl groups can be substituted or unsubstituted.
如本文中使用的,“杂脂环族的”或“杂脂环基”指三、四、五、六、七、八、九、十,直至18元的单环、二环和三环的环系统,其中碳原子与1-5个杂原子一起组成所述环系统。杂环可以任选地含有一个或多个以如下这样方式定位的不饱和键,然而在遍及所有的环中不存在完全离域的π电子系统。所述杂原子独立地选自氧、硫和氮。杂环可以进一步包含一个或多个羰基或硫代羰基官能团,使本定义包括氧代系统和硫 代系统,诸如内酰胺、内酯、环状酰亚胺、环状硫代酰亚胺和环状氨基甲酸酯。当由两个或更多个环组成时,这些环可以以稠合形式结合在一起。另外,在杂脂环中的任何氮可以被季铵化。杂脂环基或杂脂环族的基团可以是未取代的或取代的。这种“杂脂环族的”或“杂脂环基”基团的实例包括但不限于1,3-二噁英、1,3-二氧六环、1,4-二氧六环、1,2-二氧戊环、1,3-二氧戊环、1,4-二氧戊环、1,3-氧硫杂环己烷、1,4-氧硫杂环己二烯、1,3-氧硫杂环戊烷、1,3-二硫杂环戊二烯、1,3-二硫戊环、1,4-氧硫杂环己烷、四氢-1,4-噻嗪、2H-1,2-噁嗪、马来酰亚胺、琥珀酰亚胺、巴比妥酸、硫代巴比妥酸、二氧代哌嗪、乙内酰脲、二氢尿嘧啶、三噁烷、六氢-1,3,5-三嗪、咪唑啉、咪唑烷、异噁唑啉、异噁唑烷、噁唑啉、噁唑烷、噁唑烷酮、噻唑啉、噻唑烷、吗啉、环氧乙烷、哌啶N-氧化物、哌啶、哌嗪、吡咯烷、吡咯烷酮、吡咯烷二酮(pyrrolidione)、4-哌啶酮、吡唑啉、吡唑烷、2-氧代吡咯烷、四氢吡喃、4H-吡喃、四氢噻喃、硫吗啉、硫吗啉亚砜、硫吗啉砜,以及它们的苯并-稠合的类似物(例如,苯并咪唑啉酮、四氢喹啉、3,4-亚甲基二氧基苯基)。As used herein, "heteroalicyclic" or "heteroalicyclic" refers to three, four, five, six, seven, eight, nine, ten, up to 18-membered monocyclic, bicyclic, and tricyclic A ring system in which a carbon atom and 1-5 heteroatoms together form the ring system. The heterocycle can optionally contain one or more unsaturated bonds positioned in such a manner that there is no fully delocalized pi-electron system throughout all of the rings. The heteroatoms are independently selected from the group consisting of oxygen, sulfur, and nitrogen. The heterocyclic ring may further comprise one or more carbonyl or thiocarbonyl functional groups, such that the definition includes oxo systems and sulfur Generation systems such as lactams, lactones, cyclic imides, cyclic thioimides, and cyclic carbamates. When composed of two or more rings, the rings may be bonded together in a fused form. Additionally, any nitrogen in the heteroalicyclic ring can be quaternized. The heteroalicyclic or heteroalicyclic group may be unsubstituted or substituted. Examples of such "heteroalicyclic" or "heteroalicyclic" groups include, but are not limited to, 1,3-dioxin, 1,3-dioxane, 1,4-dioxane, 1,2-dioxolane, 1,3-dioxolan, 1,4-dioxolane, 1,3-oxathiane, 1,4-oxethiohexadiene, 1,3-oxathiolane, 1,3-dithiolepine, 1,3-dithiolane, 1,4-oxathiane, tetrahydro-1,4- Thiazide, 2H-1,2-oxazine, maleimide, succinimide, barbituric acid, thiobarbituric acid, dioxopiperazine, hydantoin, dihydrouracil , trioxane, hexahydro-1,3,5-triazine, imidazoline, imidazolidine, isoxazoline, isoxazolidine, oxazoline, oxazolidine, oxazolidinone, thiazoline, thiazole Alkane, morpholine, ethylene oxide, piperidine N-oxide, piperidine, piperazine, pyrrolidine, pyrrolidone, pyrrolidione, 4-piperidone, pyrazoline, pyrazolidine, 2-oxopyrrolidine, tetrahydropyran, 4H-pyran, tetrahydrothiopyran, thiomorpholine, thiomorpholine sulfoxide, thiomorpholine sulfone, and their benzo-fused analogs (eg , Benzimidazolidinone, tetrahydroquinoline, 3,4-methylenedioxyphenyl).
如本文中使用的,“芳烷基”和“芳基(烷基)”指通过低级亚烷基连接的作为取代基的芳基。芳烷基的低级亚烷基和芳基可以是取代的或未取代的。实例包括但不限于苄基、2-苯基烷基、3-苯基烷基和萘基烷基。As used herein, "aralkyl" and "aryl (alkyl)" refer to an aryl group as a substituent attached through a lower alkylene group. The lower alkylene and aryl groups of the aralkyl group may be substituted or unsubstituted. Examples include, but are not limited to, benzyl, 2-phenylalkyl, 3-phenylalkyl, and naphthylalkyl.
如本文中使用的,“杂芳烷基”和“杂芳基(烷基)”指通过低级亚烷基连接的作为取代基的杂芳基。杂芳烷基的低级亚烷基和杂芳基可以是取代的或未取代的。实例包括但不限于2-噻吩基烷基、3-噻吩基烷基、呋喃基烷基、噻吩基烷基、吡咯基烷基、吡啶基烷基、异噁唑基烷基和咪唑基烷基,以及它们的苯并-稠合的类似物。As used herein, "heteroaralkyl" and "heteroaryl (alkyl)" refers to a heteroaryl group attached as a substituent through a lower alkylene group. The lower alkylene and heteroaryl groups of the heteroarylalkyl group may be substituted or unsubstituted. Examples include, but are not limited to, 2-thienylalkyl, 3-thienylalkyl, furylalkyl, thienylalkyl, pyrrolylalkyl, pyridylalkyl, isoxazolylalkyl, and imidazolylalkyl And their benzo-fused analogs.
如本文中使用的,“烃氧基”指式-OR,其中R为上文定义的烷基、烯基、炔基、环烷基、环烯基或环炔基。烃氧基的非限制性实例为甲氧基、乙氧基、正丙氧基、1-甲基乙氧基(异丙氧基)、正丁氧基、异丁氧基、仲丁氧基和叔丁氧基。烃氧基可以是取代的或未取代的。As used herein, "hydrocarbyloxy" refers to the formula -OR, wherein R is alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl or cycloalkynyl as defined above. Non-limiting examples of alkoxy groups are methoxy, ethoxy, n-propoxy, 1-methylethoxy (isopropoxy), n-butoxy, isobutoxy, sec-butoxy And tert-butoxy. The alkoxy group may be substituted or unsubstituted.
如本文中使用的,“C-酰氨基”基团指“-C(=O)N(RaRb)”基团,其中Ra和Rb可以独立地为氢、烷基、烯基、炔基、环烷基、环烯基、环炔基、芳基、杂芳基、杂脂环基、芳烷基或(杂脂环基)烷基。C-酰氨基可以是取代的或未取代的。As used herein, a "C-amido" group refers to a "-C(=O)N(R a R b )" group, wherein R a and R b may independently be hydrogen, alkyl, alkenyl , alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl or (heteroalicyclic)alkyl. The C-acylamino group may be substituted or unsubstituted.
如本文中使用的,“N-酰氨基”基团指“RC(=O)N(Ra)-”基团,其中R和Ra可以独立地为氢、烷基、烯基、炔基、环烷基、环烯基、环炔基、芳基、杂芳基、杂脂环基、 芳烷基或(杂脂环基)烷基。N-酰氨基可以是取代的或未取代的。As used herein, an "N-amido" group refers to a "RC(=O)N(R a )-" group, wherein R and R a may independently be hydrogen, alkyl, alkenyl, alkynyl. A cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl or (heteroalicyclic)alkyl group. The N-acylamino group may be substituted or unsubstituted.
本文使用的术语“卤素原子”、“卤素”或“卤代”意为元素周期表的第七列的放射稳定原子中的任一种,诸如氟、氯、溴和碘。The term "halogen atom", "halogen" or "halo" as used herein means any of the radiation-stable atoms of the seventh column of the periodic table, such as fluorine, chlorine, bromine and iodine.
本文使用的术语“胺”指-NH2基团,其中一个或多个氢可以任选地被R基团取代。R可以独立地为氢、烷基、烯基、炔基、环烷基、环烯基、环炔基、芳基、杂芳基、杂脂环基、芳烷基或(杂脂环基)烷基。The term "amine" as used herein refers to a -NH 2 group, wherein one or more hydrogen may optionally be substituted with R groups. R may independently be hydrogen, alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl or (heteroalicyclic) alkyl.
本文使用的术语“醛”指-Rc-C(O)H基团,其中Rc可以不存在,或独立地选自亚烷基、亚烯基、亚炔基、亚环烷基、亚环烯基、亚环炔基、亚芳基、亚杂芳基、亚杂脂环基、亚芳烷基或亚(杂脂环基)烷基。The term "aldehyde" as used herein refers to a -R c -C (O) H group, wherein R c may be absent, or independently selected from alkylene, alkenylene, alkynylene, cycloalkylene, alkylene Cycloalkenyl, cycloalkynylene, arylene, heteroarylene, heteroalicyclic, aralkylene or (heteroalicyclic)alkyl.
本文使用的术语“氨基”指-NH2基团。The term "amino" as used herein refers to a -NH 2 group.
本文使用的术语“羟基”指-OH基团。The term "hydroxy" as used herein refers to an -OH group.
本文使用的术语“氰基”指“-CN”基团。The term "cyano" as used herein refers to a "-CN" group.
本文使用的术语“叠氮基”指-N3基团。As used herein, the term "azido" refers to a -N 3 group.
本文使用的术语“巯基”指-SH基团。The term "mercapto" as used herein refers to a -SH group.
本文使用的术语“羧酸”指-C(O)OH。The term "carboxylic acid" as used herein refers to -C(O)OH.
本文使用的术语“硫氰酸根”指-S-C≡N基团。The term "thiocyanate" as used herein refers to a -S-C≡N group.
本文使用的术语“氧基胺(oxo-amine)”指-O-NH2基团,其中-NH2上的一个或多个氢可以任选地被R基团取代。R可以独立地为氢、烷基、烯基、炔基、环烷基、环烯基、环炔基、芳基、杂芳基、杂脂环基、芳烷基或(杂脂环基)烷基。As used herein, the term "amine group (oxo-amine)" refers to the group -O-NH 2, wherein one or more hydrogens on -NH 2 may optionally be substituted with R groups. R may independently be hydrogen, alkyl, alkenyl, alkynyl, cycloalkyl, cycloalkenyl, cycloalkynyl, aryl, heteroaryl, heteroalicyclic, aralkyl or (heteroalicyclic) alkyl.
如本文所用,术语“衍生物”意指通过物理或化学过程从指定化合物衍生的相似化合物。衍生物可以使用标准程序来制备,所述标准程序是合成有机化学领域的技术人员已知的并且例如由J.March,“Advanced Organic Chemistry:Reactions,Mechanisms and Structure(《高等有机化学:反应、机制及结构》)”,第4版(纽约:Wiley-Interscience,1992)描述。举例来说,碱加成盐是使用常规手段从化合物制备而来的,包括使化合物的一个或多个游离羟基与合适的碱反应。一般来说,将化合物溶解于如甲醇或乙醇的极性有机溶剂中并且将碱添加于其中。所得的盐沉淀或者可以通过添加极性较小的溶剂而从溶液中析出。适用于形成碱加成盐的胱胺的盐是指衍生自无机和有机酸和碱的胱胺盐。合适酸的实例包括但不限于盐酸、氢溴酸、硫酸、硝酸k、高氯酸、富马酸、马来酸、磷酸、乙醇酸、乳酸、水杨酸、琥珀酸、对甲苯磺酸、 酒石酸、乙酸、三氟乙酸、柠檬酸、甲磺酸、甲酸、苯甲酸、丙二酸、萘-2-磺酸、苯磺酸和草酸。还包括衍生自氨基酸(例如L-精氨酸,L-赖氨酸)的盐。合适的碱的实例包括但不限于无机碱,如氢氧化钠、氢氧化钾、氢氧化铵、氢氧化钙、三甲胺等。As used herein, the term "derivative" means a similar compound that is derived from a specified compound by physical or chemical processes. Derivatives can be prepared using standard procedures known to those skilled in the art of synthetic organic chemistry and are for example, by J. March, "Advanced Organic Chemistry: Reactions, Mechanisms and Structure (Advanced Organic Chemistry: Reactions, Mechanisms). And Structure ")", 4th edition (New York: Wiley-Interscience, 1992). For example, a base addition salt is prepared from a compound using conventional means, including reacting one or more free hydroxyl groups of the compound with a suitable base. Generally, the compound is dissolved in a polar organic solvent such as methanol or ethanol and a base is added thereto. The resulting salt precipitates may be precipitated from the solution by the addition of a less polar solvent. Salts suitable for the formation of cystamines of base addition salts refer to cystamine salts derived from inorganic and organic acids and bases. Examples of suitable acids include, but are not limited to, hydrochloric acid, hydrobromic acid, sulfuric acid, k nitrate, perchloric acid, fumaric acid, maleic acid, phosphoric acid, glycolic acid, lactic acid, salicylic acid, succinic acid, p-toluenesulfonic acid, Tartaric acid, acetic acid, trifluoroacetic acid, citric acid, methanesulfonic acid, formic acid, benzoic acid, malonic acid, naphthalene-2-sulfonic acid, benzenesulfonic acid and oxalic acid. Also included are salts derived from amino acids such as L-arginine, L-lysine. Examples of suitable bases include, but are not limited to, inorganic bases such as sodium hydroxide, potassium hydroxide, ammonium hydroxide, calcium hydroxide, trimethylamine, and the like.
如本文所用,包含含有二硫键结构的化合物的洗涤溶液还含有缓冲剂。所述缓冲剂是适合处理其上固定有核酸分子的固定支持物的任何缓冲液。应理解,这样的缓冲剂对固体支持物的处理不应对固体支持物上的核酸分子的结构、核酸分子与固体支持物的结合以及后续的测序试剂中测序酶、探针和核苷酸结构及活性造成不利的影响。合适的缓冲剂是本领域技术人员能够容易确定的。合适的缓冲剂可以包含例如有机盐,以保持约pH 6至pH 9的稳定pH,并且可能还包含一价或二价阳离子或去垢剂,从而从固体支持物除去非特异性结合的分子。示例性的缓冲剂可以包括例如Tris-HCl缓冲液(例如pH 6.5的100mMTris-HCl缓冲液)、SSC缓冲液(例如0.3xSSC/0.1%Tween)、TE缓冲液(例如含有pH 8的10 mM Tris-HCl和1 mM EDTA的TE缓冲液)等。As used herein, a wash solution comprising a compound having a disulfide bond structure also contains a buffer. The buffer is any buffer suitable for treating a fixed support to which a nucleic acid molecule is immobilized. It should be understood that the treatment of such a buffer with a solid support should not deal with the structure of the nucleic acid molecule on the solid support, the binding of the nucleic acid molecule to the solid support, and the sequencing enzyme, probe and nucleotide structure in subsequent sequencing reagents and Activity has an adverse effect. Suitable buffers can be readily determined by those skilled in the art. Suitable buffering agents may contain, for example, an organic salt to maintain a stable pH of from about pH 6 to pH 9, and may also contain monovalent or divalent cations or detergents to remove non-specifically bound molecules from the solid support. Exemplary buffers can include, for example, Tris-HCl buffer (eg, 100 mM Tris-HCl buffer at pH 6.5), SSC buffer (eg, 0.3 x SSC/0.1% Tween), TE buffer (eg, 10 mM Tris with pH 8). -HCl and 1 mM EDTA in TE buffer) and the like.
在作出本发明的过程中,本发明人令人惊讶地发现,本发明的含有二硫键的化合物能够与再生试剂中的还原剂发生氧化还原反应,同时,本发明的这样的含有二硫键的化合物还不会对核酸的测序造成任何显著的不利影响,例如不会对测序试剂(例如测序酶、经标记的探针或核苷酸)的性质、固体支持物上的核酸分子的性质和固体支持物与其上固定的核酸分子的结合造成任何显著的不利影响。相比之下,其它很多常用的氧化剂会对测序引起不良影响,例如破坏dNTP的结构,影响聚合酶的活性等。In the course of making the present invention, the inventors have surprisingly found that the disulfide-containing compound of the present invention is capable of undergoing a redox reaction with a reducing agent in a regenerating agent, and at the same time, such a disulfide-containing bond of the present invention The compound does not cause any significant adverse effects on the sequencing of the nucleic acid, such as the nature of the sequencing reagent (eg, sequencing enzyme, labeled probe or nucleotide), the nature of the nucleic acid molecule on the solid support, and The binding of the solid support to the nucleic acid molecule immobilized thereon causes any significant adverse effects. In contrast, many other commonly used oxidants can adversely affect sequencing, such as disrupting the structure of dNTPs, affecting the activity of polymerases, and the like.
在优选的实施方案中,本发明的含有二硫键结构的化合物的氧化性可以仅来自于二硫键结构。也就是说,本发明的含有二硫键结构的化合物可以不含有除-S-S-以外的任何其它氧化基团。在一些实施方案中,本发明的含有二硫键结构的化合物可以含有至少一个二硫键。例如,含有二硫键结构的化合物可以含有至少2个、3个、4个、5个、6个、7个、8个、9个、10个或更多个二硫键。在优选的实施方案中,含有二硫键结构的化合物含有1-3个二硫键。在优选的实施方案中,含有二硫键结构的化合物含有1-2个二硫键。在优选的实施方案中,含有二硫键结构的化合物仅含有1个二硫键。In a preferred embodiment, the oxidizing property of the disulfide bond-containing compound of the present invention may be derived only from a disulfide bond structure. That is, the disulfide bond-containing compound of the present invention may not contain any other oxidizing group other than -S-S-. In some embodiments, the disulfide bond-containing compound of the present invention may contain at least one disulfide bond. For example, a compound containing a disulfide bond structure may contain at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more disulfide bonds. In a preferred embodiment, the compound containing a disulfide bond structure contains 1-3 disulfide bonds. In a preferred embodiment, the compound containing a disulfide bond structure contains 1-2 disulfide bonds. In a preferred embodiment, the compound containing a disulfide bond structure contains only one disulfide bond.
在本发明的实施方案中,含有二硫键结构的化合物在含有该化合物的溶液中的量是足以显著降低或甚至消除在测序过程中固体支持物上或与固体支持物流体连通的 流动通道上的再生试剂的残留的任何量。在一个实施方案中,含有二硫键结构的化合物在含有该化合物的溶液中的量可以是例如至少约0.1mM、至少约0.2mM、至少约0.3mM、至少约0.4mM、至少约0.5mM、至少约0.6mM、至少约0.7mM、至少约0.8mM、至少约0.9mM、至少约0.10mM、至少约1mM、至少约2mM、至少约3mM、至少约4mM、至少约5mM、至少约6mM、至少约7mM、至少约8mM、至少约9mM、至少约10mM、至少约15mM、至少约20mM、至少约25mM、至少约30mM、至少约35mM、至少约40mM、至少约45mM、至少约50mM、至少约55mM、至少约60mM、至少约70mM、至少约80mM、至少约90mM、至少约100mM或更高,或前述任何两个值之间的范围。在一个实施方案中,含有二硫键结构的化合物在含有该化合物的溶液中的量可以是例如至多约1mM、至多约2mM、至多约3mM、至多约4mM、至多约5mM、至多约6mM、至多约7mM、至多约8mM、至多约9mM、至多约10mM、至多约15mM、至多约20mM、至多约25mM、至多约30mM、至多约35mM、至多约40mM、至多约45mM、至多约50mM、至多约55mM、至多约60mM、至多约70mM、至多约80mM、至多约90mM、至多约100mM,或前述任何两个值之间的范围。在一个实施方案中,含有二硫键结构的化合物在含有该化合物的溶液中的量可以是例如约1mM-约50mM。在一个实施方案中,含有二硫键结构的化合物在含有该化合物的溶液中的量可以是例如约2mM-约20mM。在一个实施方案中,含有二硫键结构的化合物在含有该化合物的溶液中的量可以是例如约5mM-约10mM。In an embodiment of the invention, the amount of the disulfide-containing structure-containing compound in the solution containing the compound is sufficient to significantly reduce or even eliminate or be in fluid communication with the solid support during the sequencing process. Any amount of residual reagent regenerating on the flow channel. In one embodiment, the amount of the disulfide-containing structure-containing compound in the solution containing the compound can be, for example, at least about 0.1 mM, at least about 0.2 mM, at least about 0.3 mM, at least about 0.4 mM, at least about 0.5 mM, At least about 0.6 mM, at least about 0.7 mM, at least about 0.8 mM, at least about 0.9 mM, at least about 0.10 mM, at least about 1 mM, at least about 2 mM, at least about 3 mM, at least about 4 mM, at least about 5 mM, at least about 6 mM, at least About 7 mM, at least about 8 mM, at least about 9 mM, at least about 10 mM, at least about 15 mM, at least about 20 mM, at least about 25 mM, at least about 30 mM, at least about 35 mM, at least about 40 mM, at least about 45 mM, at least about 50 mM, at least about 55 mM At least about 60 mM, at least about 70 mM, at least about 80 mM, at least about 90 mM, at least about 100 mM or higher, or a range between any two of the foregoing values. In one embodiment, the amount of the disulfide-containing structure-containing compound in the solution containing the compound can be, for example, up to about 1 mM, up to about 2 mM, up to about 3 mM, up to about 4 mM, up to about 5 mM, up to about 6 mM, up to About 7 mM, up to about 8 mM, up to about 9 mM, up to about 10 mM, up to about 15 mM, up to about 20 mM, up to about 25 mM, up to about 30 mM, up to about 35 mM, up to about 40 mM, up to about 45 mM, up to about 50 mM, up to about 55 mM Up to about 60 mM, up to about 70 mM, up to about 80 mM, up to about 90 mM, up to about 100 mM, or a range between any two of the foregoing. In one embodiment, the amount of the compound containing a disulfide bond structure in the solution containing the compound may be, for example, from about 1 mM to about 50 mM. In one embodiment, the amount of the compound containing a disulfide bond structure in the solution containing the compound may be, for example, from about 2 mM to about 20 mM. In one embodiment, the amount of the compound containing a disulfide bond structure in the solution containing the compound may be, for example, from about 5 mM to about 10 mM.
如本文所用,术语“RunOn”用于评价测序中的碱基合成超前现象。RunOn值越高,表明碱基合成超前现象越严重。碱基合成超前直接导致测序数据质量下降。可以根据BGISEQ-500测序平台自带的软件(深圳华大智造科技有限公司)或其它合适的方法来执行和报告RunOn的检测。在测序过程中,再生试剂的残留通常会导致较高的RunOn值。令人惊讶地,如实施例所证实的,本发明人发现在测序程序中洗涤再生试剂的过程中,如本文所述的含有二硫键结构的化合物的添加能够显著降低RunOn值并且不会不利地影响后续测序循环中使用的测序试剂的反应活性。相比之下,其它氧化剂例如过氧化氢、硫酸铁和氢氧化铜则严重干扰测序反应,导致无法测量RunOn值。As used herein, the term "RunOn" is used to evaluate the phenomenon of base synthesis advancement in sequencing. The higher the RunOn value, the more serious the base synthesis advancement phenomenon. Advance base synthesis directly leads to a decline in the quality of sequencing data. RunOn detection can be performed and reported according to the software that comes with the BGISEQ-500 sequencing platform (Shenzhen Huada Zhizhi Technology Co., Ltd.) or other suitable methods. Residues of regenerated reagents typically result in higher RunOn values during sequencing. Surprisingly, as demonstrated by the examples, the inventors have discovered that the addition of a compound containing a disulfide bond structure as described herein during the washing of the regenerating agent in the sequencing procedure can significantly reduce the RunOn value and is not detrimental The effect of the sequencing reagent used in the subsequent sequencing cycle is affected. In contrast, other oxidants such as hydrogen peroxide, iron sulfate, and copper hydroxide severely interfere with the sequencing reaction, resulting in the inability to measure the RunOn value.
除非另外定义,否则本文使用的所有技术术语和科学术语具有与本领域普通技术人员所通常理解含义相同的含义。使用术语“包括”以及该术语的其他形式并非是限制 性的。使用术语“具有”以及该术语的其他形式并非是限制性的。如本说明书中所使用的,不论在过渡短语中还是在权利要求书中,术语“包含”应解释为具有开放性的含义。即,以上术语应与短语“至少含有”或“至少包括”同义地进行解释。例如,当在方法的上下文中使用时,术语“包含”意味着该方法至少包括已叙述的步骤,但可以包括另外的步骤。当在化合物、组合物、或装置的上下文中使用时,术语“包含”意味着该化合物、组合物或装置至少包括已叙述的特征或组分,但可以包括另外的特征或组分。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. The use of the term "include" and other forms of the term are not limiting Sexual. The use of the term "having" and other forms of the term is not limiting. As used in this specification, the term "comprising", whether in the transitional phrase or in the claims, is to be interpreted as having an open meaning. That is, the above terms should be interpreted synonymously with the phrase "containing at least" or "including at least". For example, when used in the context of a method, the term "comprising" means that the method includes at least the steps recited, but may include additional steps. The term "comprising" when used in the context of a compound, composition, or device means that the compound, composition or device includes at least the recited features or components, but may include additional features or components.
附图说明DRAWINGS
图1显示使用添加或未添加胱胺的洗涤溶液进行测序的RunOn值的比较。Figure 1 shows a comparison of RunOn values for sequencing using a wash solution with or without added cystamine.
图2显示使用添加或未添加胱胺酸的洗涤溶液进行测序的RunOn值的比较。Figure 2 shows a comparison of RunOn values for sequencing using a wash solution with or without added cystine.
图3显示使用添加或未添加5,5′-二硫代双(2-硝基苯甲酸)的洗涤溶液进行测序的RunOn值的比较。Figure 3 shows a comparison of RunOn values for sequencing using a wash solution with or without addition of 5,5'-dithiobis(2-nitrobenzoic acid).
图4显示使用添加或未添加2-羟乙基二硫化物的洗涤溶液进行测序的RunOn值的比较。Figure 4 shows a comparison of RunOn values for sequencing using a wash solution with or without addition of 2-hydroxyethyl disulfide.
图5显示使用添加或未添加胱胺的洗涤溶液进行测序的RunOn值的比较。Figure 5 shows a comparison of RunOn values for sequencing using a wash solution with or without added cystamine.
图6显示使用添加添加过氧化氢、硫酸铁或氢氧化铜或2-羟乙基二硫化物的洗涤溶液进行测序的信号强度的比较。Figure 6 shows a comparison of signal intensities for sequencing using a wash solution supplemented with the addition of hydrogen peroxide, iron sulfate or copper hydroxide or 2-hydroxyethyl disulfide.
实施例Example
下面将结合实施例对本发明的实施方案进行详细描述,但是本领域技术人员将会理解,下列实施例仅用于说明本发明,而不应视为限定本发明的范围。实施例中未注明具体条件者,按照常规条件或制造商建议的条件进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规产品。The embodiments of the present invention will be described in detail below with reference to the accompanying drawings, however, the following examples are intended to illustrate the invention and are not intended to limit the scope of the invention. Those who do not specify the specific conditions in the examples are carried out according to the conventional conditions or the conditions recommended by the manufacturer. The reagents or instruments used are not indicated by the manufacturer, and are conventional products that can be obtained commercially.
实施例1Example 1
按照制造商的说明书,利用MGIEasyTM DNA文库制备试剂盒(深圳华大智造科技有限公司)以大肠杆菌标准菌株为原料提取DNA构建用于测序的DNA文库,加载到测序芯片上。按照制造商的说明书,使用BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30),将制备好的测序芯片加载至BGISEQ-500测序平台(深圳华大智造科技有限公司)完成30个碱基的序列测 定,选择1*1的拍照面积。According to the manufacturer's instructions, using MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip. The prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions. The platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 30 bases and selected 1*1 of the photographed area.
在BGI-SEQ500测序试剂盒V3.0的洗涤试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加2mM胱胺,采用上面段落中所述的相同程序完成30个碱基序列测定。Add 2 mM cystamine to Washing Reagent 1 of BGI-SEQ500 Sequencing Kit V3.0 (reagent for washing the chip after treating the chip with the regenerating reagent), and complete the 30 base sequence using the same procedure as described in the above paragraph. Determination.
利用BGI-SEQ500测序平台自带的软件分别评估两组测序实验的每个循环的RunOn值,得到的结果如图1所示,另外,利用BGISEQ-500测序平台自带的软件评估了平均RunOn值。The RunOn value of each cycle of the two sets of sequencing experiments was evaluated using the software provided by the BGI-SEQ500 sequencing platform. The results obtained are shown in Figure 1. In addition, the average RunOn value was evaluated using the software provided by the BGISEQ-500 sequencing platform. .
结论:胱胺的添加使得每个循环RunOn值显著降低,使得平均RunOn值从0.15降低到0.11。Conclusion: The addition of cystamine resulted in a significant decrease in RunOn values per cycle, resulting in a reduction in the average RunOn value from 0.15 to 0.11.
实施例2Example 2
按照制造商的说明书,利用MGIEasyTM DNA文库制备试剂盒(深圳华大智造科技有限公司)以大肠杆菌标准菌株为原料提取DNA构建用于测序的DNA文库,加载到测序芯片上。按照制造商的说明书,使用BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30),将制备好的测序芯片加载至BGISEQ-500测序平台(深圳华大智造科技有限公司)完成20个碱基的序列测定,选择1*1的拍照面积。According to the manufacturer's instructions, using MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip. The prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions. The platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 20 bases and selected 1*1 of the photographed area.
在BGISEQ-500测序试剂盒V3.0的洗涤试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加10mM胱胺酸,采用上面段落中所述的相同程序完成20个碱基序列测定。Add 10 mM cystine to Washing Reagent 1 of BGISEQ-500 Sequencing Kit V3.0 (reagent for washing the chip after treatment of the chip with regenerating reagent) and complete 20 bases using the same procedure as described in the paragraph above. Sequence determination.
利用BGISEQ-500测序平台自带的软件分别评估两组测序实验的每个循环的RunOn值,得到的结果如图2所示,另外,利用BGISEQ-500测序平台自带的软件评估了平均RunOn值。The RunOn value of each cycle of the two sets of sequencing experiments was evaluated using the software provided by the BGISEQ-500 sequencing platform. The results obtained are shown in Figure 2. In addition, the average RunOn value was evaluated using the software provided by the BGISEQ-500 sequencing platform. .
结论:胱胺的添加使得每个循环RunOn值显著降低,使得平均RunOn值从0.14降低到0.09。Conclusion: The addition of cystamine resulted in a significant decrease in RunOn values per cycle, resulting in a reduction in the average RunOn value from 0.14 to 0.09.
实施例3Example 3
按照制造商的说明书,利用MGIEasyTM DNA文库制备试剂盒(深圳华大智造科技有限公司)以大肠杆菌标准菌株为原料提取DNA构建用于测序的DNA文库,加载到测序芯片上。按照制造商的说明书,使用BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30),将制备好的测序芯片加 载至BGISEQ-500测序平台(深圳华大智造科技有限公司)完成40个碱基的序列测定,选择1*1的拍照面积。According to the manufacturer's instructions, using MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip. The prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions. The platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 40 bases and selected 1*1 of the photographed area.
在BGISEQ-500测序试剂盒V3.0的洗涤试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加1mM 5,5′-二硫代双(2-硝基苯甲酸),采用上面段落中所述的相同程序完成40个碱基序列测定。1 mM 5,5'-dithiobis(2-nitrobenzoic acid) was added to Washing Reagent 1 of BGISEQ-500 Sequencing Kit V3.0 (reagent for washing the chip after treating the chip with the regenerating reagent), The 40 base sequence determination was performed using the same procedure as described in the paragraph above.
利用BGISEQ-500测序平台自带的软件分别评估两组测序实验的每个循环的RunOn值,得到的结果如图3所示,另外,利用BGISEQ-500测序平台自带的软件评估了平均RunOn值。The RunOn value of each cycle of the two sets of sequencing experiments was evaluated using the software provided by the BGISEQ-500 sequencing platform. The results obtained are shown in Figure 3. In addition, the average RunOn value was evaluated using the software provided by the BGISEQ-500 sequencing platform. .
结论:胱胺的添加使得每个循环RunOn值显著降低,使得平均RunOn值从0.12降低到0.07。Conclusion: The addition of cystamine resulted in a significant decrease in RunOn values per cycle, resulting in a reduction in the average RunOn value from 0.12 to 0.07.
实施例4Example 4
按照制造商的说明书,利用MGIEasyTM DNA文库制备试剂盒(深圳华大智造科技有限公司)以大肠杆菌标准菌株为原料提取DNA构建用于测序的DNA文库,加载到测序芯片上。按照制造商的说明书,使用BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30),将制备好的测序芯片加载至BGISEQ-500测序平台(深圳华大智造科技有限公司)完成50个碱基的序列测定,选择1*1的拍照面积。According to the manufacturer's instructions, using MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip. The prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions. The platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the sequence determination of 50 bases and selected 1*1 of the photographed area.
在BGISEQ-500测序试剂盒V3.0的洗涤试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加20mM 2-羟乙基二硫化物,采用上面段落中所述的相同程序完成50个碱基序列测定。Add 20 mM 2-hydroxyethyl disulfide to Washing Reagent 1 of BGISEQ-500 Sequencing Kit V3.0 (reagent for washing the chip after treatment of the chip with regenerating reagent) using the same procedure as described in the above paragraph Complete 50 base sequence determination.
利用BGISEQ-500测序平台自带的软件分别评估两组测序实验的每个循环的RunOn值,得到的结果如图4所示,另外,利用BGISEQ-500测序平台自带的软件评估了平均RunOn值。The RunOn value of each cycle of the two sets of sequencing experiments was evaluated using the software provided by the BGISEQ-500 sequencing platform. The results obtained are shown in Figure 4. In addition, the average RunOn value was evaluated using the software provided by the BGISEQ-500 sequencing platform. .
结论:2-羟乙基二硫化物的添加使得每个循环RunOn值显著降低,使得平均RunOn值从0.11降低到0.07。Conclusion: The addition of 2-hydroxyethyl disulfide resulted in a significant decrease in RunOn values per cycle, resulting in a reduction in the average RunOn value from 0.11 to 0.07.
实施例5Example 5
按照制造商的说明书,利用MGIEasyTM DNA文库制备试剂盒(深圳华大智造科技有限公司)以大肠杆菌标准菌株为原料提取DNA构建用于测序的DNA文库,加载到测序芯片上。将制备好的测序芯片按照以下程序完成50个碱基的序列测定,选择 1*1的拍照面积:According to the manufacturer's instructions, using MGIEasy TM DNA library preparation kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip. The prepared sequencing chip was subjected to the sequence determination of 50 bases according to the following procedure, and the photographing area of 1*1 was selected:
将芯片浸泡在聚合反应试剂中约1min,移出芯片,随后浸泡在洗涤试剂2中约1min,移出芯片,随后浸泡在保护试剂中约1min,移出芯片,对其进行20-30min的拍照以检测代表一个碱基的身份信息的荧光信号,随后将芯片浸泡在再生试剂中约1min,移出芯片,随后浸泡在洗涤试剂1中约1min,移出芯片;随后对芯片重复前述的操作步骤以进行下一个碱基的测序;所使用的试剂均来自BGISEQ-500测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30),各步骤的温度等反应条件以及拍照程序均参照试剂盒的说明书以及BGISEQ-500测序平台的标准程序进行。The chip was immersed in the polymerization reagent for about 1 min, the chip was removed, and then immersed in the washing reagent 2 for about 1 min, the chip was removed, and then immersed in the protective reagent for about 1 min, the chip was removed, and photographed for 20-30 min to detect the representative. Fluorescence signal of identity information of one base, then immersing the chip in the regeneration reagent for about 1 min, removing the chip, then immersing in the washing reagent 1 for about 1 min, removing the chip; then repeating the foregoing steps on the chip to proceed to the next base Sequencing of the base; the reagents used were all from the BGISEQ-500 sequencing kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30), the temperature and other reaction conditions of each step and the photographing procedure were referenced. Instructions for the kit and standard procedures for the BGISEQ-500 sequencing platform were performed.
在BGISEQ-500测序试剂盒V3.0的洗涤试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加20mM胱胺,采用上面段落中所述的相同程序完成30个碱基序列测定。Add 20 mM cystamine to Washing Reagent 1 of BGISEQ-500 Sequencing Kit V3.0 (reagent for washing the chip after treating the chip with the regenerating reagent), and complete the 30 base sequence using the same procedure as described in the paragraph above. Determination.
利用BGISEQ-500测序平台自带的软件分别评估两组测序实验的每个循环的RunOn值,得到的结果如图5所示,另外,利用BGISEQ-500测序平台自带的软件评估了平均RunOn值。The RunOn value of each cycle of the two sets of sequencing experiments was evaluated using the software provided by the BGISEQ-500 sequencing platform. The results obtained are shown in Figure 5. In addition, the average RunOn value was evaluated using the software provided by the BGISEQ-500 sequencing platform. .
结论:胱胺的添加使得每个循环RunOn值显著降低,使得平均RunOn值从0.94降低到0.14。Conclusion: The addition of cystamine resulted in a significant decrease in the RunOn value per cycle, resulting in a reduction in the average RunOn value from 0.94 to 0.14.
实施例6Example 6
按照制造商的说明书,使用大肠杆菌基因组DNA,利用MGIEasyTM DNA文库制备试剂盒(深圳华大智造科技有限公司)以大肠杆菌标准菌株为原料提取DNA构建用于测序的DNA文库,加载到测序芯片上。按照制造商的说明书,使用BGISEQ-500高通量测序试剂盒(SE50 V3.0,深圳华大智造科技有限公司,货号PF-UM-PEV30),将制备好的测序芯片加载至BGISEQ-500测序平台(深圳华大智造科技有限公司)完成50个碱基的序列测定,选择1*1的拍照面积,其中使用的BGISEQ-500测序试剂盒V3.0中的洗脱试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加50mM 2-羟乙基二硫化物。According to the manufacturer's instructions using E. coli genomic DNA, prepared using MGIEasy TM DNA library kit (Shenzhen Hua Taizhi Ltd. made) to a standard strain of E. coli extract as raw material DNA construct a DNA library for sequencing, loaded onto a sequencing chip on. The prepared sequencing chip was loaded into BGISEQ-500 sequencing using the BGISEQ-500 High-throughput Sequencing Kit (SE50 V3.0, Shenzhen Huada Zhizao Technology Co., Ltd., item number PF-UM-PEV30) according to the manufacturer's instructions. The platform (Shenzhen Huada Zhizao Technology Co., Ltd.) completed the 50-base sequence determination and selected 1*1 of the photographed area, using the elution reagent 1 in the BGISEQ-500 sequencing kit V3.0 (recycling reagent in use) 50 mM 2-hydroxyethyl disulfide was added to the reagent for washing the chip after processing the chip.
在BGISEQ-500测序试剂盒V3.0的洗脱试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加2%过氧化氢,采用上面段落中所述的相同程序完成50个碱基序列测定。 Add 2% hydrogen peroxide to the elution reagent 1 of BGISEQ-500 Sequencing Kit V3.0 (reagent for washing the chip after treating the chip with the regenerating reagent), and complete 50 with the same procedure as described in the above paragraph. Base sequence determination.
在BGISEQ-500测序试剂盒V3.0的洗脱试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加2mM硫酸铁,采用上面段落中所述的相同程序完成50个碱基序列测定。Add 2 mM ferric sulfate to the elution reagent 1 of BGISEQ-500 Sequencing Kit V3.0 (reagent for washing the chip after treating the chip with the regenerating reagent), and complete 50 bases using the same procedure as described in the above paragraph. Sequence determination.
在BGISEQ-500测序试剂盒V3.0的洗脱试剂1(在用再生试剂处理芯片后用于洗涤芯片的试剂)中添加2mM氢氧化铜,采用上面段落中所述的相同程序完成50个碱基序列测定。Add 2 mM copper hydroxide to the elution reagent 1 of BGISEQ-500 Sequencing Kit V3.0 (reagent for washing the chip after treating the chip with the regenerating reagent), and complete 50 bases using the same procedure as described in the above paragraph. Base sequence determination.
利用BGISEQ-500测序平台自带的程序输出四组测序实验的荧光信号强度值,得到的结果如图6所示。The fluorescence signal intensity values of the four sets of sequencing experiments were output using the program provided by the BGISEQ-500 sequencing platform, and the results obtained are shown in FIG. 6.
结论:添加过氧化氢、硫酸铁或氢氧化铜使荧光信号强度(荧光信号强度是测序过程获得的最基础数据,如果信号值过低,RunOn等其它质量指标都无法计算)急剧下降,已接近背景值,说明过氧化氢、硫酸铁和氢氧化铜严重影响测序过程;而添加2-羟乙基二硫化物使信号强度呈线性稳定下降,属于测序过程中信号的正常下降情况。 Conclusion: The addition of hydrogen peroxide, ferric sulfate or copper hydroxide makes the fluorescence signal intensity (fluorescence signal intensity is the most basic data obtained during the sequencing process. If the signal value is too low, RunOn and other quality indicators cannot be calculated), it drops sharply and is approaching. The background value indicates that hydrogen peroxide, ferric sulfate and copper hydroxide seriously affect the sequencing process; while the addition of 2-hydroxyethyl disulfide causes the signal intensity to decrease linearly and stably, which is a normal decline of the signal during the sequencing process.

Claims (12)

  1. 一种消除再生试剂残留的方法,其包括:A method of eliminating residues of regenerating reagents, comprising:
    a.提供其上固定有核酸分子的固体支持物,a. providing a solid support on which the nucleic acid molecule is immobilized,
    b.使用再生试剂处理所述固体支持物,所述再生试剂包含还原剂,和b. treating the solid support with a regenerant reagent comprising a reducing agent, and
    c.使用添加了含有二硫键结构的化合物的洗涤溶液处理所述固体支持物。c. The solid support is treated with a wash solution to which a compound having a disulfide bond structure is added.
  2. 权利要求1的方法,其中在步骤b之前,任选地使用含有测序试剂的一种或多种反应溶液处理所述固体支持物,以在固体支持物上产生代表所述核酸分子的核苷酸序列的信号(例如,荧光信号),并且其中所述再生试剂可以消除固体支持物上的所述信号。The method of claim 1 wherein prior to step b, said solid support is optionally treated with one or more reaction solutions containing a sequencing reagent to produce a nucleotide representative of said nucleic acid molecule on a solid support A sequence of signals (eg, a fluorescent signal), and wherein the regeneration reagent can eliminate the signal on the solid support.
  3. 权利要求1或2的方法,其中经过步骤c的处理之后,任选地还使用含有测序试剂的一种或多种反应溶液处理所述固体支持物。The method of claim 1 or 2, wherein after the treatment of step c, the solid support is optionally also treated with one or more reaction solutions containing sequencing reagents.
  4. 权利要求2或3的方法,其中所述测序试剂包括用于连接法测序或合成法测序的测序试剂。The method of claim 2 or 3, wherein the sequencing reagent comprises a sequencing reagent for ligation sequencing or synthetic sequencing.
  5. 前述权利要求中任一项的的方法,其中所述含有二硫键结构的化合物具有通式-R-S-S-R’-,其中R和R’各自独立地选自氢、任选取代的烷基、任选取代的烯基、任选取代的炔基、任选取代的环烷基、任选取代的环烯基、任选取代的环炔基、任选取代的芳基、任选取代的杂芳基、任选取代的杂环基、任选取代的杂脂环基、任选取代的芳烷基或任选取代的(杂脂环基)烷基。The method of any of the preceding claims, wherein the compound having a disulfide bond structure has the formula -RSS-R'-, wherein R and R' are each independently selected from hydrogen, optionally substituted alkyl, Optionally substituted alkenyl, optionally substituted alkynyl, optionally substituted cycloalkyl, optionally substituted cycloalkenyl, optionally substituted cycloalkynyl, optionally substituted aryl, optionally substituted An aryl group, an optionally substituted heterocyclic group, an optionally substituted heteroalicyclic group, an optionally substituted aralkyl group or an optionally substituted (heteroalicyclic)alkyl group.
  6. 权利要求5的方法,其中所述含有二硫键结构的化合物具有通式-C(R1)m(R2)m’-C(R3)p-S-S-C(R4)q-C(R5)n(R6)n’-,The method of claim 5, wherein said compound having a disulfide bond structure has the formula -C(R 1 ) m (R 2 ) m' -C(R 3 ) p -SSC(R 4 ) q -C(R 5 ) n (R 6 ) n' -,
    其中R1、R2、R3、R4、R5和R6各自独立地选自氢、氧代、羟基、被保护的羟基、烃氧基、芳氧基、酰基、巯基、烷硫基、芳硫基、氰基、卤素、硫代羰基、O-氨基甲酰基、N-氨甲酰基、O-硫代氨基甲酰基、N-硫代氨基甲酰基、C-酰氨基、N-酰氨基、 S-磺酰氨基、N-磺酰氨基、C-羧基、被保护的C-羧基、O-羧基、异氰酸基、硫代氰酸基、异硫代氰酸基、硝基、甲硅烷基、氧硫基、亚磺酰基、磺酰基、卤代烷基、卤代烃氧基、三卤代甲磺酰基、三卤代甲磺酰氨基、氨基、单取代的氨基、二取代的氨基、任选取代的烷基、任选取代的烯基、任选取代的炔基、任选取代的环烷基、任选取代的环烯基、任选取代的环炔基、任选取代的芳基、任选取代的杂芳基、任选取代的杂环基、任选取代的杂脂环基、任选取代的芳烷基或任选取代的(杂脂环基)烷基,Wherein R 1 , R 2 , R 3 , R 4 , R 5 and R 6 are each independently selected from the group consisting of hydrogen, oxo, hydroxy, protected hydroxy, alkoxy, aryloxy, acyl, decyl, alkylthio , arylthio, cyano, halogen, thiocarbonyl, O-carbamoyl, N-carbamoyl, O-thiocarbamoyl, N-thiocarbamoyl, C-amido, N-acyl Amino group, S-sulfonylamino group, N-sulfonylamino group, C-carboxyl group, protected C-carboxyl group, O-carboxyl group, isocyanate group, thiocyanate group, isothiocyanate group, nitro group, Silyl, oxythio, sulfinyl, sulfonyl, haloalkyl, halohydrocarbyloxy, trihalomethylsulfonyl, trihalomethylsulfonylamino, amino, monosubstituted amino, disubstituted amino , optionally substituted alkyl, optionally substituted alkenyl, optionally substituted alkynyl, optionally substituted cycloalkyl, optionally substituted cycloalkenyl, optionally substituted cycloalkynyl, optionally substituted An aryl group, an optionally substituted heteroaryl group, an optionally substituted heterocyclic group, an optionally substituted heteroalicyclic group, an optionally substituted aralkyl group or an optionally substituted (heteroalicyclic)alkyl group,
    其中m和m’各自为0-3的整数,条件是0<m+m’≤3,Wherein m and m' are each an integer of 0-3, provided that 0 < m + m' ≤ 3,
    n和n’各自为0-3的整数,条件是0<m+m’≤3,n and n' are each an integer of 0-3, provided that 0 < m + m' ≤ 3,
    p和q各自为1或2的整数,Each of p and q is an integer of 1 or 2,
    例如,R1、R2、R5和R6各自独立地选自氢、羟基或氨基,For example, R 1 , R 2 , R 5 and R 6 are each independently selected from hydrogen, hydroxy or amino.
    例如,R3和R4选自氢。For example, R 3 and R 4 are selected from hydrogen.
  7. 权利要求5或6的方法,其中所述含有二硫键结构的化合物不含有其它氧化基团,The method of claim 5 or 6, wherein said compound having a disulfide bond structure does not contain other oxidizing groups,
    优选地,所述含有二硫键结构的化合物含有不多于5个,不多于4个,不多于3个,优选不多于2个,优选不多于1个的二硫键结构。Preferably, the compound containing a disulfide bond structure contains no more than 5, no more than 4, no more than 3, preferably no more than 2, preferably no more than 1 disulfide bond structure.
  8. 前述权利要求中任一项的方法,其中所述含有二硫键结构的化合物选自二烷基二硫醚化合物、芳香族二硫醚化合物、环状二硫醚化合物、含氨基的二硫醚化合物、羧酸及羧酸衍生物的二硫醚化合物、不饱和键合的二硫醚化合物等、羟基取代的二烷基二硫醚及其衍生物。The method according to any one of the preceding claims, wherein the compound containing a disulfide bond structure is selected from the group consisting of a dialkyl disulfide compound, an aromatic disulfide compound, a cyclic disulfide compound, and an amino group-containing disulfide. a disulfide compound of a compound, a carboxylic acid and a carboxylic acid derivative, an unsaturatedly bonded disulfide compound, or the like, a hydroxy-substituted dialkyl disulfide, and a derivative thereof.
  9. 前述权利要求中任一项的方法,其中所述含有二硫键结构的化合物选自胱胺、胱胺酸、5,5′-二硫代双(2-硝基苯甲酸)和2-羟乙基二硫化物及其衍生物。The method according to any one of the preceding claims, wherein the compound containing a disulfide bond structure is selected from the group consisting of cystamine, cystine, 5,5'-dithiobis(2-nitrobenzoic acid) and 2-hydroxyl Ethyl disulfide and its derivatives.
  10. 前述权利要求中任一项的方法,其中所述洗涤溶液还包含缓冲剂,The method of any one of the preceding claims, wherein the washing solution further comprises a buffering agent,
    例如,其中所述含有二硫键结构的化合物在所述洗涤溶液中的浓度为1-50mM,优选为2mM或20mM。For example, the concentration of the compound having a disulfide bond structure in the washing solution is 1 to 50 mM, preferably 2 mM or 20 mM.
  11. 如权利要求5-9中任一项定义的含有二硫键结构的化合物用于消除再生试剂 残留的用途,其包括:A disulfide bond-containing compound as defined in any one of claims 5 to 9 for use in eliminating regenerating reagents Residual uses, including:
    a.提供其上固定有核酸分子的固体支持物,a. providing a solid support on which the nucleic acid molecule is immobilized,
    b.使用再生试剂处理所述固体支持物,所述再生试剂包含还原剂,和b. treating the solid support with a regenerant reagent comprising a reducing agent, and
    c.使用添加了含有二硫键结构的化合物的洗涤溶液处理所述固体支持物,c. treating the solid support with a washing solution to which a compound having a disulfide bond structure is added,
    例如,其中在步骤b之前,任选地使用含有测序试剂的一种或多种反应溶液处理所述固体支持物,以在固体支持物上产生代表所述核酸分子的核苷酸序列的信号(例如,荧光信号),并且其中所述再生试剂可以消除固体支持物上的所述信号,For example, wherein prior to step b, the solid support is optionally treated with one or more reaction solutions containing sequencing reagents to generate a signal representative of the nucleotide sequence of the nucleic acid molecule on the solid support ( For example, a fluorescent signal), and wherein the regeneration reagent can eliminate the signal on the solid support,
    例如,其中经过步骤c的处理之后,任选地还使用含有测序试剂的一种或多种反应溶液处理所述固体支持物,For example, wherein after the treatment of step c, the solid support is optionally also treated with one or more reaction solutions containing sequencing reagents,
    例如,其中所述测序试剂包括用于连接法测序或合成法测序的测序试剂For example, wherein the sequencing reagent comprises a sequencing reagent for ligation sequencing or synthetic sequencing
    例如,所述洗涤溶液还包含缓冲剂,For example, the washing solution further comprises a buffering agent,
    例如,所述含有二硫键结构的化合物在洗涤溶液中的浓度为1-50mM,优选为2mM或20mM。For example, the compound having a disulfide bond structure has a concentration in the washing solution of 1 to 50 mM, preferably 2 mM or 20 mM.
  12. 一种测序用的试剂盒,所述试剂盒包括测序试剂、洗涤试剂、包含还原剂的再生试剂和添加了含有二硫键结构的化合物的洗涤溶液,A kit for sequencing, the kit comprising a sequencing reagent, a washing reagent, a regenerating reagent containing a reducing agent, and a washing solution to which a compound having a disulfide bond structure is added,
    其中所述测序试剂可以在其上固定有核酸分子的固体支持物上产生代表所述核酸分子的核苷酸序列的信号(例如,荧光信号),并且其中所述再生试剂可以消除固体支持物上的所述信号,Wherein the sequencing reagent can generate a signal (eg, a fluorescent signal) representing a nucleotide sequence of the nucleic acid molecule on a solid support to which the nucleic acid molecule is immobilized, and wherein the regeneration reagent can eliminate the solid support The signal,
    例如,所述测序试剂包括用于连接法测序或合成法测序的测序试剂For example, the sequencing reagent includes sequencing reagents for ligation sequencing or synthetic sequencing
    例如,所述洗涤溶液还包含缓冲剂,For example, the washing solution further comprises a buffering agent,
    例如,所述含有二硫键结构的化合物在洗涤溶液中的浓度为1-50mM,优选为2mM或20mM。 For example, the compound having a disulfide bond structure has a concentration in the washing solution of 1 to 50 mM, preferably 2 mM or 20 mM.
PCT/CN2017/106972 2017-10-20 2017-10-20 Method for eliminating residue of regenerative agent WO2019075711A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2017/106972 WO2019075711A1 (en) 2017-10-20 2017-10-20 Method for eliminating residue of regenerative agent
CN201780094653.2A CN111051533B (en) 2017-10-20 2017-10-20 Method for eliminating regeneration reagent residue

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/106972 WO2019075711A1 (en) 2017-10-20 2017-10-20 Method for eliminating residue of regenerative agent

Publications (1)

Publication Number Publication Date
WO2019075711A1 true WO2019075711A1 (en) 2019-04-25

Family

ID=66173131

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/106972 WO2019075711A1 (en) 2017-10-20 2017-10-20 Method for eliminating residue of regenerative agent

Country Status (2)

Country Link
CN (1) CN111051533B (en)
WO (1) WO2019075711A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102140523A (en) * 2011-01-28 2011-08-03 东南大学 Sequencing method for in-situ copying high-flux sequencing template and increasing reading length thereof
CN102329884A (en) * 2011-10-20 2012-01-25 东南大学 Synchronous synthesis and DNA sequencing method for two nucleotides and application thereof
CN102971335A (en) * 2009-03-23 2013-03-13 史蒂芬·阿尔伯特·本纳 Reagents for reversibly terminating primer extension
CN103484106A (en) * 2013-09-05 2014-01-01 上海交通大学 Four-color fluorescence labeling reversible terminal and use thereof in DNA (Deoxyribonucleic Acid) sequencing
US20140363852A1 (en) * 2013-04-02 2014-12-11 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acids
CN106117288A (en) * 2015-05-08 2016-11-16 桑特里莱恩科技控股公司 The reversible terminator that disulfide bond connects

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102971335A (en) * 2009-03-23 2013-03-13 史蒂芬·阿尔伯特·本纳 Reagents for reversibly terminating primer extension
CN102140523A (en) * 2011-01-28 2011-08-03 东南大学 Sequencing method for in-situ copying high-flux sequencing template and increasing reading length thereof
CN102329884A (en) * 2011-10-20 2012-01-25 东南大学 Synchronous synthesis and DNA sequencing method for two nucleotides and application thereof
US20140363852A1 (en) * 2013-04-02 2014-12-11 Molecular Assemblies, Inc. Methods and apparatus for synthesizing nucleic acids
CN103484106A (en) * 2013-09-05 2014-01-01 上海交通大学 Four-color fluorescence labeling reversible terminal and use thereof in DNA (Deoxyribonucleic Acid) sequencing
CN106117288A (en) * 2015-05-08 2016-11-16 桑特里莱恩科技控股公司 The reversible terminator that disulfide bond connects

Also Published As

Publication number Publication date
CN111051533B (en) 2024-02-02
CN111051533A (en) 2020-04-21

Similar Documents

Publication Publication Date Title
US10113195B2 (en) Nucleic acid amplification
CN109486902B (en) Nucleic acid amplification
US11047004B2 (en) Thiolated nucleotide analogues for nucleic acid synthesis
EP3461910B1 (en) Nucleic acid amplification
AU2017394644B2 (en) Allele-specific capture of nucleic acids
US20230323445A1 (en) Nucleic acid sequencing method
WO2017196676A1 (en) Metal chelation post incorporation detection methods
CN111051533B (en) Method for eliminating regeneration reagent residue
US10858696B2 (en) Methods of reducing density-dependent GC bias in amplification
WO2023122491A1 (en) Periodate compositions and methods for chemical cleavage of surface-bound polynucleotides
AU2022421156A1 (en) Periodate compositions and methods for chemical cleavage of surface-bound polynucleotides
US20140127698A1 (en) Reiterative oligonucleotide synthesis
CN117940577A (en) Periodate compositions and methods for chemically cleaving surface-bound polynucleotides
CN115997033A (en) Catalytic control sequencing by synthetic generation of traceless DNA

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17929067

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17929067

Country of ref document: EP

Kind code of ref document: A1