WO2022105640A1 - Nucleic acid sequencing method - Google Patents

Nucleic acid sequencing method Download PDF

Info

Publication number
WO2022105640A1
WO2022105640A1 PCT/CN2021/129461 CN2021129461W WO2022105640A1 WO 2022105640 A1 WO2022105640 A1 WO 2022105640A1 CN 2021129461 W CN2021129461 W CN 2021129461W WO 2022105640 A1 WO2022105640 A1 WO 2022105640A1
Authority
WO
WIPO (PCT)
Prior art keywords
polymerase
nucleic acid
alkyl
dna
cysteines
Prior art date
Application number
PCT/CN2021/129461
Other languages
French (fr)
Chinese (zh)
Inventor
白净卫
杜娟娟
徐扬
Original Assignee
清华大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 清华大学 filed Critical 清华大学
Publication of WO2022105640A1 publication Critical patent/WO2022105640A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/12Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
    • C12N9/1241Nucleotidyltransferases (2.7.7)
    • C12N9/1252DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00Preparation of compounds containing saccharide radicals
    • C12P19/26Preparation of nitrogen-containing carbohydrates
    • C12P19/28N-glycosides
    • C12P19/30Nucleotides
    • C12P19/34Polynucleotides, e.g. nucleic acids, oligoribonucleotides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y207/00Transferases transferring phosphorus-containing groups (2.7)
    • C12Y207/07Nucleotidyltransferases (2.7.7)
    • C12Y207/07007DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase

Definitions

  • the invention belongs to the field of biotechnology, and relates to a method for determining a nucleic acid sequence.
  • DNA deoxyribonucleic acid is the genetic material that composes an organism. It is composed of A, T, C, and G in different sequences. These genetic materials composed of A, T, C, and G in different sequences contain rich genetic information. and the meaning of life. Nucleic acid sequences of different genetic materials can be obtained through nucleic acid sequencing technology to help people understand the differences between individual genetic materials, which is of great significance to life science research, disease diagnosis, and personalized medicine.
  • the first generation sequencing technology is represented by the dideoxy chain end termination method invented by Sanger. This method has disadvantages such as high cost, slow speed and low throughput, which cannot meet the needs of research and application.
  • the second-generation sequencing technology Illumina's Solexa sequencing technology has relatively low cost, so it is widely used, but this technology takes 6 days from the beginning of library construction to the completion of sequencing, which is time-consuming.
  • SMRT of Pacific Bioscience is the only commercially applied technology. SMRT technology realizes single-molecule sequencing and has the advantages of no amplification and long read length. Although this technology is compared with Solexa sequencing technology from library construction The time to complete sequencing was shorter, but it still took about two days to complete.
  • Patent US5302509 describes a method of sequencing a polynucleotide template comprising a multiple extension reaction using DNA polymerase or DNA ligase to successively incorporate labeled nucleotides or polynucleotides complementary to the template strand.
  • a new nucleotide base-paired to the template strand is constructed in a 5' to 3' orientation by successively incorporating single nucleotides complementary to the template strand chain.
  • the nucleoside triphosphate substrate used in the sequencing reaction is blocked to prevent overincorporation, and the nucleoside triphosphate substrate is differentially labeled to enable determination of the species of incorporated nucleotide that is added as a subsequent nucleotide.
  • Patent CN106244712A discloses a DNA sequencing method, including the following steps: (1) adding a tag sequence at the 3' end of the DNA to be tested to form the DNA to be tested containing the tag sequence; the nucleotide sequence of the tag sequence and the sequencing primer (2) mixing the DNA to be tested containing the tag sequence and the sequencing primer to form a product in the form of a double-stranded main body at the 5' end; (3) completing step (2) Then, the products were mixed with dATP, dCTP, dTTP and dGTP respectively to obtain four systems, and each system was added to a specific DNA polymerase-modified single-molecule device to read the electrical signal.
  • the synthase chain extension reaction (Sequencing by synthesis), the core of Illumina's sequencing technology, is a three-substrate reaction, mainly including template-primer hybrid, dNTP, and DNA synthase, and the reaction kinetics is a secondary reaction; the present invention
  • the provided method uses dNTP-DNA synthase and reversible ligation complexes as reaction substrates, reacts with template-primer hybrids, and the reaction kinetics are first-order reactions, so it is faster than Illumina's SBS technology.
  • Illumina's SBS fluorescent molecule is connected to the base of dNTP, and there is usually only one fluorescent molecule, so the optical signal intensity is limited; the method provided by the present invention connects fluorescent molecules on the synthase, which can provide more connection sites and connect More than one fluorescent group can therefore increase the optical signal intensity.
  • WO2007076057A2 provides compositions comprising polymerases having features of improved entry of nucleotide analogs into the active site region and coordination of nucleotide analogs in the active site region. Also provided are methods of making such polymerases and using such polymerases for sequencing and DNA replication and amplification, as well as kinetic models of polymerase activity and computer-implemented methods using such models.
  • TdT terminal deoxynucleotidyl transferase
  • the system has a repulsive effect on other TdT-dNTPs, severing the connection between TdT and dNTPs to release the primer and allow subsequent extension, thus reversibly terminating the extension of a dNTP at the 3' end of single-stranded DNA.
  • this method only uses the extension reaction of single-stranded nucleic acid, that is, DNA synthesis, and does not involve identifying the sequence of template DNA, nor does it involve connecting fluorescent molecules or other luminescent groups to nucleic acid synthase.
  • the present invention provides a method capable of rapidly determining a nucleic acid sequence.
  • a first aspect of the present invention provides a method for determining a nucleic acid sequence, the sequencing method comprising the following steps:
  • the polymerase is connected to the dNTP through a cleavable group (linker), and the polymerase can emit a light signal;
  • the attached polymerase prevents the reaction of another dNTP-polymerase with the 3' end of the newly formed primer, resulting in chain termination.
  • the nucleic acid to be detected in the present invention can be DNA or RNA, and preferably the nucleic acid sequence to be detected is DNA, such as genomic DNA, cDNA and DNA fragments.
  • the polymerase can be a currently known DNA polymerase and its variants, for example, the polymerase is selected from Bst DNA Pol and its variants, DNA Pol I and its variants, DNA Pol ⁇ and its variants. , T3 DNA Pol and its variants, T5 DNA Pol and its variants, L5 DNA Pol and its variants, DNA Pol II and its variants, DNA Pol B and its variants, DNA Pol ⁇ and its variants, DNA Pol ⁇ and its variants, DNA Pol ⁇ and its variants, DNA Pol ⁇ and its variants, DNA Pol III and its variants, DNA Pol ⁇ and its variants, DNA Pol ⁇ and its variants, DNA Pol ⁇ and its variants, DNA Pol ⁇ and its variants, DNA Pol ⁇ and its variants, DNA Pol One or more of IV and its variants, DNA Pol V and its variants, terminal transferase and its variants; wherein, the terminal transferase is terminal deoxynucleotidyl transfera
  • the polymerase is DNA Pol ⁇ and a variant thereof, and the sequence of the DNA Pol ⁇ is SEQ ID NO: 1.
  • the DNA Pol ⁇ variant is a mutation of the DNA Pol ⁇ at one or more (for example, 2, 3, 4 or 5) amino acid sequence positions, the mutation comprising a substitution, deletion and/or insertion, or At least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5 with DNA Pol ⁇ coding sequence % or more sequence identity, and the DNA Pol ⁇ variant has polymerase activity.
  • the mutation site of the DNA Pol ⁇ variant is in the region near the catalytic center of DNA synthesis, and is not the amino acid residue position necessary for the catalytic reaction, which can ensure that the catalytic activity is not affected, and that the dNTP is away from the catalytic center. More recently, it does not require a particularly long cleavable group to contact the catalytic center to participate in the reaction.
  • the variant of the DNA Pol ⁇ is to mutate any one, two or three cysteines at positions 145-355 of SEQ ID NO: 1 to non-cysteines, preferably, the SEQ ID NO: 1 mutates any one, two or three cysteines at positions 145-355 to serines, and mutates any one, two or three non-cysteines at positions 1-236 to cysteine.
  • the variant of DNA Pol ⁇ is to mutate any one, two or three cysteines at positions 155-294 of SEQ ID NO: 1 to serines, and mutate any of positions 20-154 into serines.
  • One, two or three non-cysteines are mutated to cysteines.
  • the variant of the DNA Pol ⁇ is to mutate the cysteine at the 178th position, the 239th position and/or the 267th position of SEQ ID NO: 1 to serine, and mutate the 28th position, the 28th position, the 267th position Any one, two or three non-cysteines at positions 33 and/or 149 are mutated to cysteine.
  • the polymerase is a variant of DNA Pol ⁇ , and the variant of DNA Pol ⁇ is selected from one or a combination of two or more of the following variants:
  • the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 3-6 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Pol ⁇ variant has polymerase activity.
  • amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 3-6.
  • the polymerase is Bst DNA Pol and a variant thereof, and the sequence of the Bst DNA Pol is SEQ ID NO: 2.
  • the Bst DNA Pol variant is a mutation of the Bst DNA Pol at one or more (for example, 2, 3, 4 or 5) amino acid sequence positions, the mutation comprising a substitution, deletion and/or insertion , or at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% with the Bst DNA Pol coding sequence %, more than 99.5% sequence identity, and the Bst DNA Pol variant has polymerase activity.
  • the mutation site of the Bst DNA Pol variant is in the region near the catalytic center of DNA synthesis, and is not the amino acid residue position necessary for the catalytic reaction, which can ensure that the catalytic activity is not affected, and that the dNTP is separated from the catalytic reaction.
  • the center is relatively close, and it does not require a particularly long cleavable group to contact the catalytic center to participate in the reaction.
  • the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 to non-cysteines,
  • any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 are mutated to serines, and any one of positions 250-280 or 320-380 is mutated , two or three non-cysteines are mutated to cysteines.
  • the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 90-95 or 549-555 of SEQ ID NO: 2 to serine, and mutate the 260- Any one, two or three non-cysteines at positions 270 or 330-370 were mutated to cysteine.
  • the variant of the Bst DNA Pol is to mutate the cysteine at the 93rd position and/or the 550th position of SEQ ID NO:2 to serine, and the 264th position, the 334th position and/ Or any one, two or three non-cysteines at position 364 are mutated to cysteine.
  • the polymerase is a variant of Bst DNA Pol, and the variant of Bst DNA Pol is selected from one or a combination of two or more of the following variants:
  • the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 7-10 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Pol ⁇ variant has polymerase activity.
  • amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 7-10.
  • the dNTP linked to the polymerase by the cleavable group is selected from any one of dATP or modified dATP, dGTP or modified dGTP, dCTP or modified dCTP, dTTP or modified dTTP, dUTP or modified dUTP .
  • the cleavable group is used to link polymerase and modified dNTP, or link polymerase and dNTP.
  • the cleavable group contains a cleavable structure X, and the X is selected from:
  • j is an integer from 1 to 3
  • R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (
  • R' is selected from N or O.
  • the cleavable group has the structure of L1-L2-X-L3-L4, wherein L1 is an end group bound to dNTP or modified dNTP, L2 and L3 do not exist or are non-cleavable linking groups, L4 is the terminal group that binds to the polymerase, and X is the cleavable group.
  • the modified dNTP is a base-modified dNTP
  • the base-modified dNTP is a base-modified dNTP containing an N atom. More preferably, the base-modified dNTP is an amino-containing base.
  • Modified dNTPs are those whose bases are Modified dNTP
  • the base-modified dNTPs of the present invention can be either 3'-modified dNTPs or 3'-unmodified dNTPs, and the 3'-modified dNTPs can block the extension of nucleic acid chains.
  • the base-modified dNTP is selected from:
  • the linking site is selected from the following formula A, formula B, formula C or formula D.
  • the X is selected from:
  • j is an integer from 1 to 3
  • R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (
  • R' is selected from N or O.
  • said L1 or L4 is independently selected from maleimide group, carboxyl group, mercapto group, azido group, alkynyl group, cyclooctynyl group and its derivatives, tetrazinyl group, dithiopyridyl group, vinyl group , alkenyl sulfone group, succinimidyl group, aldehyde group, hydrazide group, amineoxy group and ⁇ -halogenated carbonyl group.
  • said L2 or L3 is independently selected from -O(CH 2 CH 2 O) i -, -(CH 2 ) i -, -(CH 2 ) i NH(CH 2 ) i -, -(CH 2 ) i COO(CH 2 ) i -, -(CH 2 ) i CONH(CH 2 ) i -, -(CH 2 ) i O(CH 2 ) i -, -(CH 2 ) i CO(CH 2 ) i -or
  • the i is selected from an integer of 0-10.
  • the cleavable group is selected from:
  • the cleavable group linking the polymerase to the dNTP can be cleaved under the action of a method commonly used in the prior art, for example, the group is cleaved after being irradiated by ultraviolet light or by the action of a chemical substance, thereby separating the polymerase from the dNTP. .
  • the polymerase of the present invention can be a luminescent polymerase, for example, the polymerase is mutated into a luminescent polymerase, or a traditional polymerase is fused with a luminescent protein to form a fusion protein to become a luminescent polymerase.
  • the polymerase of the present invention can also be a modified polymerase containing one or more fluorescent markers, phosphorescent markers or chemiluminescent markers, the modification does not affect the activity of the polymerase, the fluorescent markers, Phosphorescent labels or chemiluminescent labels emit a light signal upon exposure to light.
  • the fluorescent markers include fluorescein, Cy2, Cy3, Cy5, Cy7, Alexa Fluor series dyes, fluorescein isothiocyanate, 5-hexachlorofluorescein phosphoramidate, 6-carboxy-2',4,7 ,7'-Tetrachlorofluorescein succinimide ester, 6-carboxy-4',5'-dichloro-2',7'-dimethoxyfluorescein succinimide ester, Texas Red , rhodamine 110, fluorescein maleimide dye, fluoroborodipyrrole, xanthene, carbocyanine, 1,1'-bis(octadecyl)-3,3,3',3'-tetra Methyl indocarbocyanine perchlorate, 3,3'-bis(octadecyl)-oxacarbocyanine perchlorate, pyrene, phthalocyanine, 6-carboxyrhodamine 6G
  • Said phosphorescent label includes transition metal iridium (Ir) or ruthenium (Ru) aza-heteroaryl complexes, preferably as shown in the following formula:
  • the polymerase provided in the step (1) connected to dNTP through a cleavable group is to provide four types of polymerases connected with different dNTPs, and the four types of polymerases connected with different dNTPs
  • the enzymes are linked with different fluorescent, phosphorescent or chemiluminescent labels.
  • the nucleic acid to be detected is immobilized on the support.
  • the supports can be obtained commercially, such as supports prepared from glass, ceramic, silica and silicon materials, and supports having a gold surface can also be used.
  • the support typically comprises a flat surface (plane), or at least a structure in which the polynucleic acids to be linked are substantially in the same plane.
  • the solid support may be non-planar, such as microbeads. Any suitable size can be used.
  • the support may be on the order of 1 to 10 cm in each direction.
  • the amplification reaction can be carried out substantially as described in WO98/44151. Briefly, after primer attachment, the solid support is brought into contact with the template to be amplified under conditions that allow hybridization of the template and the immobilized primer.
  • the template is typically added to the free solution under suitable hybridization conditions, as will be apparent to those skilled in the art. Typical hybridization conditions are eg 40°C, 5xSSC after an initial denaturation step.
  • Solid phase amplification can then be performed, the first step of which is a primer extension step in which nucleotides are added to the 3' end of the immobilized primer hybridized to the template to generate a fully extended complementary strand.
  • the complementary strand contains at its 3' end a sequence capable of binding to the second primer molecule immobilized on the solid support. Further rounds of amplification result in the formation of clusters or clusters of template molecules bound to the solid support.
  • the step (2) further includes the step of adding primers to the nucleic acid to be detected.
  • the polymerase is excited by light to emit a light signal.
  • the light signal emitted on the polymerase is detected by a detection system commonly used in the prior art.
  • the polymerase can be detected by CCD or other suitable detection methods.
  • the light signal emitted on it can be used to measure the fluorescent signal. Either by adding enzymatic chemiluminescence, direct chemiluminescence substrates and or catalytic reagents, or by applying voltage, electrocatalytic chemiluminescence is performed, and the luminescent signal is detected.
  • the nucleic acid sequencing method of the present invention further comprises step (4) of cleaving between the polymerase and the dNTP.
  • the cleavage between the polymerase and the dNTP is to break the cleavable group connecting the polymerase and the dNTP, and the cleavage of the cleavable group is performed by a method commonly used in the prior art. This is accomplished, for example, by cleavage of the group linking the polymerase and the dNTP by UV irradiation or by chemical substances, such as the action of reducing agents, pH changes.
  • the ultraviolet rays can be ultraviolet rays of 365nm-410nm;
  • the reducing agent can be DTT, TCEP, THPP, and the reagents for changing the solution include organic acids, inorganic acids, organic bases, inorganic alkali.
  • the step (4) of the present invention also includes removing the polymerase with an eluent.
  • the nucleic acid sequencing method provided by the present invention further comprises step (5), that is, repeating steps (1)-(4).
  • the nucleic acid sequencing method provided by the present invention further includes nucleic acid sample pretreatment, and the nucleic acid sample pretreatment includes nucleic acid sample library construction and amplification.
  • the nucleic acid sample library construction includes adding adapters at both ends of the nucleic acid to obtain the nucleic acid to be tested.
  • the nucleic acid sample library construction includes adding a linker at both ends of the nucleic acid after fragmenting the nucleic acid sample to obtain the nucleic acid to be tested.
  • Linkers are typically short oligonucleotides that can be synthesized by conventional methods. Adapters can be attached to the 5' and 3' ends of target nucleic acid fragments in a variety of ways. Preferably, two different linker sequences are attached to the nucleic acid to be amplified, such that one linker is attached to one end of the nucleic acid molecule and the other linker molecule is attached to the other end of the nucleic acid molecule.
  • Adapters contain sequences that allow amplification of nucleic acid using amplification primer molecules immobilized on a solid support.
  • one single strand of the template construct must contain a sequence complementary to the sequence in the forward amplification primer (so that the forward primer molecule can bind and initiate complementary strand synthesis) and a reverse The sequence corresponding to the sequence in the amplification primer molecule (so that the reverse primer molecule can bind to the complementary strand).
  • the sequences in the linker that allow hybridization to the primer molecule are typically about 20-30 nucleotides in length, although the invention is not limited to sequences of this length.
  • sequences in the amplification primers and the corresponding sequences in the adapters are generally not critical to the present invention, so long as the primer molecules can interact with the amplification sequences to direct bridge amplification.
  • the principles of primer design are generally familiar to those skilled in the art.
  • the amplification is the amplification of the nucleic acid to be detected by performing bridge PCR or rolling circle amplification technology with primers immobilized on the carrier.
  • a second aspect of the present invention provides a polymerase, the polymerase is a variant of DNA Pol ⁇ , and the variant of DNA Pol ⁇ is a combination of any one or two of positions 145-355 of SEQ ID NO: 1 Or three cysteines are mutated to non-cysteines, preferably, any one, two or three cysteines at positions 145-355 of SEQ ID NO: 1 are mutated to serine, and the first Any one, two or three non-cysteines at positions 1-236 were mutated to cysteine.
  • the variant of DNA Pol ⁇ is to mutate any one, two or three cysteines at positions 155-294 of SEQ ID NO: 1 to serines, and mutate any one of positions 20-154 of SEQ ID NO: 1 , two or three non-cysteines are mutated to cysteines. More preferably, the variant of the DNA Pol ⁇ is to mutate the cysteine at the 178th position, the 239th position and/or the 267th position of SEQ ID NO: 1 to serine, and mutate the 28th position, the 28th position, the 267th position Any one, two or three non-cysteines at positions 33 and/or 149 are mutated to cysteine.
  • the polymerase is a variant of DNA Pol ⁇ , and the variant of DNA Pol ⁇ is selected from one or a combination of two or more of the following variants:
  • the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 3-6 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Pol ⁇ variant has polymerase activity.
  • amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 3-6.
  • the third aspect of the present invention provides a polymerase, the polymerase is Bst DNA Pol and a variant thereof, and the sequence of the Bst DNA Pol is SEQ ID NO: 2.
  • the Bst DNA Pol variant is a mutation of the Bst DNA Pol at one or more (for example, 2, 3, 4 or 5) amino acid sequence positions, the mutation comprising a substitution, deletion and/or insertion , or at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% with the Bst DNA Pol coding sequence %, more than 99.5% sequence identity, and the Bst DNA Pol variant has polymerase activity.
  • the mutation site of the Bst DNA Pol variant is in the region near the catalytic center of DNA synthesis, and is not the amino acid residue position necessary for the catalytic reaction, which can ensure that the catalytic activity is not affected, and that the dNTP is separated from the catalytic reaction.
  • the center is relatively close, and it does not require a particularly long cleavable group to contact the catalytic center to participate in the reaction.
  • the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 to non-cysteines,
  • any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 are mutated to serines, and any one of positions 250-280 or 320-380 is mutated , two or three non-cysteines are mutated to cysteines.
  • the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 90-95 or 549-555 of SEQ ID NO: 2 to serine, and mutate the 260- Any one, two or three non-cysteines at positions 270 or 330-370 were mutated to cysteine. More preferably, the variant of the Bst DNA Pol is to mutate the cysteine at the 93rd position and/or the 550th position of SEQ ID NO: 2 to serine, and mutate the 264th position, the 334th position and the /or any one, two or three non-cysteines at position 364 are mutated to cysteine.
  • the polymerase is a variant of Bst DNA Pol, and the variant of Bst DNA Pol is selected from one or a combination of two or more of the following variants:
  • the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 7-10 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Pol ⁇ variant has polymerase activity.
  • amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 7-10.
  • the fourth aspect of the present invention also provides a polymerase complex prepared by linking the polymerase and dNTP through a cleavable group.
  • the cleavable group is used to connect the polymerase and the modified dNTP, or, the polymerase and the dNTP.
  • the cleavable group contains a cleavable structure X, and the X is selected from:
  • j is an integer from 1 to 3
  • R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (
  • R' is selected from N or O.
  • the cleavable group has a structure of L1-L2-X-L3-L4, wherein L1 is an end group bound to dNTP or modified dNTP, and L2 and L3 do not exist or are non-cleavable linking groups. , L4 is the end group bound to the polymerase, and X is the cleavable group.
  • the modified dNTP is a base-modified dNTP
  • the base-modified dNTP is a base-modified dNTP containing an N atom. More preferably, the base-modified dNTP is an amino-containing base.
  • Modified dNTPs are those whose bases are Modified dNTP
  • the base-modified dNTPs of the present invention can be either 3'-modified dNTPs or 3'-unmodified dNTPs, and the 3'-modified dNTPs can block the extension of nucleic acid chains.
  • the base-modified dNTP is selected from:
  • the linking site is selected from the following formula A, formula B, formula C or formula D.
  • the X is selected from:
  • j is an integer from 1 to 3
  • R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (
  • R' is selected from N or O.
  • said L1 or L4 is independently selected from maleimide group, carboxyl group, mercapto group, azido group, alkynyl group, cyclooctynyl group and its derivatives, tetrazinyl group, dithiopyridyl group, vinyl group , alkenyl sulfone group, succinimidyl group, aldehyde group, hydrazide group, amineoxy group and ⁇ -halogenated carbonyl group.
  • said L2 or L3 is independently selected from -O(CH 2 CH 2 O) i -, -(CH 2 ) i -, -(CH 2 ) i NH(CH 2 ) i -, -(CH 2 ) i COO(CH 2 ) i -, -(CH 2 ) i CONH(CH 2 ) i -, -(CH 2 ) i O(CH 2 ) i -, -(CH 2 ) i CO(CH 2 ) i -or
  • the i is selected from an integer of 0-10.
  • the cleavable group is selected from:
  • the cleavable group linking the polymerase and the dNTP can be cleaved under the action of methods commonly used in the prior art, for example, the group is cleaved after being irradiated by ultraviolet light or by the action of a chemical substance, so that the polymerase is separated from the dNTP. .
  • the polymerase complexes of the present invention may also include fluorescent labels, phosphorescent labels or chemiluminescent labels.
  • the fluorescent labels include fluorescein, Cy2, Cy3, Cy5, Cy7, Alexa Fluor series dyes, fluorescein isothiocyanate, 5-hexachlorofluorescein phosphoramidate, 6-carboxyl -2',4,7,7'-Tetrachlorofluorescein succinimide ester, 6-carboxy-4',5'-dichloro-2',7'-dimethoxyfluorescein succinimide Ester, Texas Red, Rhodamine 110, Fluorescein Maleimide Dye, Fluoroborodipyrrole, Xanthene, Carbocyanine, 1,1'-Di(octadecyl)-3,3, 3',3'-Tetramethylindocarbocyanine perchlorate, 3,3'-bis(octadecyl)-oxacarbocyanine perchlorate, pyrene, phthalocyanine, 6-carboxy Rhodamine 6G, Fluor
  • Said phosphorescent label includes transition metal iridium (Ir) or ruthenium (Ru) aza-heteroaryl complexes, preferably as shown in the following formula:
  • a fifth aspect of the present invention provides a nucleic acid synthesis method, the nucleic acid synthesis method comprising:
  • the polymerase is connected to the dNTP through a cleavable group
  • the dNTP on the polymerase is complementary to the bases on the sequence of the nucleic acid to be tested and enzymatically catalyzed by the polymerase
  • the reaction was added to the 3' end of the primer.
  • the attached polymerase prevents the reaction of another dNTP-polymerase with the 3' end of the newly formed primer, resulting in chain termination.
  • the nucleic acid synthesis method further comprises step (3) cleavage between the polymerase and dNTP; and (4) repeating steps (1)-(3).
  • the cleavable group is used for linking polymerase and modified dNTP, or linking polymerase and dNTP.
  • the cleavable group contains a cleavable structure X, and the X is selected from:
  • j is an integer from 1 to 3
  • R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (
  • R' is selected from N or O.
  • the cleavable group has the structure of L1-L2-X-L3-L4, wherein L1 is an end group bound to dNTP or modified dNTP, L2 and L3 do not exist or are non-cleavable linking groups, L4 is the terminal group that binds to the polymerase, and X is the cleavable group.
  • the modified dNTP is a base-modified dNTP
  • the base-modified dNTP is a base-modified dNTP containing an N atom. More preferably, the base-modified dNTP is an amino-containing base.
  • Modified dNTPs are those whose bases are Modified dNTP
  • the base-modified dNTPs of the present invention can be either 3'-modified dNTPs or 3'-unmodified dNTPs, and the 3'-modified dNTPs can block the extension of nucleic acid chains.
  • the base-modified dNTP is selected from:
  • the linking site is selected from the following formula A, formula B, formula C or formula D.
  • the X is selected from:
  • j is an integer from 1 to 3
  • R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (
  • R' is selected from N or O.
  • said L1 or L4 is independently selected from maleimide group, carboxyl group, mercapto group, azido group, alkynyl group, cyclooctynyl group and its derivatives, tetrazinyl group, dithiopyridyl group, vinyl group , alkenyl sulfone group, succinimidyl group, aldehyde group, hydrazide group, amineoxy group and ⁇ -halogenated carbonyl group.
  • said L2 or L3 is independently selected from -O(CH 2 CH 2 O) i -, -(CH 2 ) i -, -(CH 2 ) i NH(CH 2 ) i -, -(CH 2 ) i COO(CH 2 ) i -, -(CH 2 ) i CONH(CH 2 ) i -, -(CH 2 ) i O(CH 2 ) i -, -(CH 2 ) i CO(CH 2 ) i -or
  • the i is selected from an integer of 0-10.
  • the cleavable group is selected from:
  • the primers can be free or immobilized on a support.
  • the cleavage between the polymerase and dNTP in the step (3) is to break the cleavable group between the ligation polymerase and the dNTP, and the cleavage of the cleavable group between the ligation polymerase and the dNTP is This is achieved by methods commonly used in the prior art, for example, by UV irradiation or by the cleavage of the group linking the polymerase and the dNTP by chemical substances, such as the action of reducing agents, pH changes.
  • the polymerase used in the nucleic acid sequencing method provided by the present invention is connected to the dNTP through a cleavable group, and contains a fluorescent marker, a chemiluminescent marker or a phosphorescent marker, and the nucleic acid sequencing method implemented by the polymerase can realize the nucleic acid sequence. Quick determination.
  • Figure 1 Structural formula of linker connecting dNTP and polymerase, specifically PC Mal-NHS carbonate lipid;
  • Figure 2 Purification results of DNA polymerase Bst DNA Pol and its variants, in which lane M represents Marker; lane CF represents Bst DNA Pol Cys-Free mutation site is C93S/C550S, lane D264C represents mutation site is C93S /C550S/D264C Bst DNA Pol variant, lane R334C represents the Bst DNA Pol variant with the mutation site C93S/C550S/R334C, and lane L364C represents the Bst DNA Pol variant with the mutation site C93S/C550S/L364C;
  • Figure 3 DNA polymerase Bst DNA Pol polymerase activity detection prepared in Example 1, wherein, lane M represents Marker, lane PC represents positive control, lane NC represents negative control, and lane CF represents Bst DNA Pol Cys-Free mutation
  • the site is C93S/C550S
  • lane D264C represents the Bst DNA Pol variant with the mutation site C93S/C550S/D264C
  • lane R334C represents the Bst DNA Pol variant with the mutation site C93S/C550S/R334C
  • lane L364C represents the mutation site
  • the point is the Bst DNA Pol variant of C93S/C550S/L364C;
  • Figure 4A dATP base modification structural formula, specifically 7-deaza-Propargylamino-dATP, with a molecular weight of 543.26;
  • Figure 4B dCTP base modification structural formula, specifically Propargylamino-dCTP, with a molecular weight of 520.22;
  • Figure 4C dGTP base modification structural formula, specifically 7-deaza-Propargylamino-dGTP, with a molecular weight of 559.26;
  • Figure 4D The structural formula of dUTP base modification, specifically Propargylamino-dUTP, the molecular weight is 521.20;
  • Figure 5 The junction site of dNTP and linker
  • FIG. 6A Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol Cys-Free and dUTP;
  • FIG. 6B Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the mass spectrogram of the Bst DNA Pol variant whose mutation site is C93S/C550S/L364C;
  • FIG. 6C Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol variants with C93S/C550S/L364C mutation sites and dUTP;
  • Figure 6D Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol variants with C93S/C550S/R334C mutation sites and dUTP;
  • Figure 6E Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol variants with C93S/C550S/D264C mutation sites and dUTP;
  • Figure 7 The reaction mode diagram of the sequencing process, specifically designing a template-primer complex, carrying out an extension reaction with dNTP-linker-polymerase, washing away the unbound dNTP-linker-polymerase, and performing fluorescence or luminescence detection; Catalytic cleavage of the linker frees the polymerase from the primer-template complex. Use buffer to wash away the polymerase that has been cut off by light, and repeat the above steps to add the next nucleotide;
  • Figure 8 Single nucleotide extension detection electropherogram, in which lane P is the primer alone; lane P+T is the primer-template complex; lane 3nt is the extension of the primer-template complex with a separate polymerase and dTTP Reaction to increase the primer length by 3 nucleotides; mark +365 lane as primer-template complex and dUTP-linker-polymerase for extension reaction, since the polymerase and dUTP are linked by the linker, the primer will not be detached after the extension is completed -Template complex, which prevents other dUTP-linker-polymerase from binding to the primer-template complex, and cannot carry out the subsequent extension reaction, so it can only extend by 1 nucleotide.
  • the primers are separated from the polymerase, so the bands in the lane are primers that extend only 1 nucleotide in length; the reaction conditions in the marker-365 lane are the same as those in the marker+365 lane, because 365nm light is not used for cleavage, the primers and the polymerase The enzyme is covalently bound, and the denaturing gel cannot be opened, so the molecular weight of the primer band is extremely high.
  • nucleic acid in the present invention refers to at least two nucleotides covalently linked together.
  • dNTP in the present invention refers to deoxy-ribonucleotide triphosphate or ribonucleoside triphosphate, and usually includes dATP, dGTP, dTTP, dCTP or dUTP and the like.
  • mutant refers to an enzyme or protein with the same function formed by changing one or more amino acid sites in the wild-type polymerase.
  • the "combination of groups" mentioned in the present invention refers to a new group formed by connecting one or more substituent groups in the form of covalent bonds.
  • Modification refers to the substitution of one or more groups in the molecular structure of a substance.
  • C 0-10 alkyl refers to H, therefore, C 0-10 alkyl includes H, C 1 alkyl, C 2 alkyl, C 3 alkyl, C 4 Alkyl, C5 alkyl, C6 alkyl, C7 alkyl, C8 alkyl, C9 alkyl, C10 alkyl.
  • C 1-10 straight chain/branched chain alkyl described in the present invention includes methyl, ethyl, C 3 straight chain/branched chain alkyl, C 4 straight chain/branched chain alkyl, C 5 straight chain /branched alkyl, C6 linear/branched alkyl, C7 linear/branched alkyl, C8 linear/branched alkyl, C9 linear/branched alkyl, C10 linear /branched alkyl.
  • C 3-10 branched chain alkyl described in the present invention includes isopropyl, isobutyl, tert-butyl, and isoamyl.
  • C 3-10 cycloalkyl described in the present invention includes C 3 cycloalkyl, C 4 cycloalkyl, C 5 cycloalkyl, C 6 cycloalkyl, C 7 cycloalkyl, C 8 cycloalkane group, C 9 cycloalkyl, C 10 cycloalkyl.
  • halogen described in the present invention includes fluorine, chlorine, bromine and iodine.
  • heterocycloalkyl refers to a non-aromatic saturated mono- or polycyclic ring system containing 3-10 ring atoms, preferably 5-10 ring atoms, one or more of which is not carbon atoms, but for example nitrogen, oxygen or sulfur atoms.
  • Preferred heterocycloalkyl groups contain 5-6 ring atoms.
  • aza, oxa or thia before heterocycloalkyl means that there is at least one nitrogen, oxygen or sulfur atom respectively as a ring atom.
  • heterocyclic aromatic group in the present invention refers to an aromatic monocyclic or polycyclic ring system containing 5-14 ring atoms, preferably 5-10 ring atoms, wherein one or more ring atoms are not carbon atoms, and are, for example, nitrogen, oxygen or sulfur atoms.
  • Preferred heterocyclic aromatic groups contain 5-6 ring atoms.
  • heterocyclic aromatic groups include pyrazinyl, furyl, thienyl, pyridyl, pyrimidinyl, isoxazolyl, isothiazolyl, oxazolyl, thiazolyl, pyrazolyl, pyrrolyl, pyrazolyl , triazolyl, pyrazinyl, pyridazinyl, quinoxalinyl, 2,3-diazanaphthyl, imidazo[1,2-a]pyridine, imidazo[2,1-b]thiazolyl , indolyl, azaindolyl, benzimidazolyl, benzothienyl, quinolinyl, imidazolyl, thienopyridyl, quinazolinyl, thienopyrimidinyl, pyrrolopyridyl, imidazolyl Pyridyl, isoquinolinyl, 1,2,4-triazin
  • the expression construct pET15-Bst DNA Pol for the production of N-terminal 6His-tagged Bst DNA Pol and its variants and variants thereof was used.
  • N-terminal 6His-tagged Bst DNA Pol and its variants are expressed in the cytoplasm due to increased yield.
  • N-terminal 6His-tagged Bst DNA Pol and its variants were introduced into the expression strain of BL21 (DE3) for expression. Bacteria were disrupted and centrifuged using buffer A (50 mM Tris-HCl pH 8.0, 800 mM NaCl, 1 mM EDTA, 1 mM DTT). DNA and other impurity proteins in the bacterial supernatant were removed using PEI solution and saturated ammonium sulfate solution in sequence. The N-terminal 6His-tagged Bst DNA Pol and its variant proteins were purified using His-trap FF column and AKTA protein purification system (GE Healthcare).
  • the protein eluate was collected and purified using a Heparin HP column (GE Healthcare). The protein eluate was collected and used SDS-PAGE gel to detect the protein purity. The results are shown in Figure 2.
  • the purity of the purified DNA polymerase Bst DNA Pol and its variants is greater than 95%, and lane M is the protein Marker (Thermo Fisher), Lane CF is DNA polymerase Bst DNA Pol Cys-Free, lane D264C is DNA polymerase Bst DNA Pol Cys-Free D264C mutant, lane R334C is DNA polymerase Bst DNA Pol Cys-Free R334C mutant, lane L364C is DNA polymerase Enzyme Bst DNA Pol Cys-Free L364C mutant. Measure the concentration of purified DNA polymerase Bst DNA Pol and its variants, and store in aliquots and freeze at -80°C ultra-low temperature until the dNTP-linker-polymerase complex is prepared.
  • the purified DNA polymerase Bst DNA Pol and its variants need to be tested for its activity first, and can be linked with small molecules after confirming that it has polymerase activity.
  • the purified DNA polymerase Bst DNA Pol and its variants were mixed with annealed bound primer-template complex, 10 mM MgCl 2 , dNTP mix and buffer (50 mM NaCl, 10 mM Tris-HCl pH7.9, 1 mM DTT), and the reaction The volume of the system is 10uL. Incubate at 37°C for 30 min, and then use non-denaturing polyacrylamide gel electrophoresis for detection.
  • lane M is DNA Marker (Thermo Fisher)
  • lane PC is a positive control, using polymerase Klenow (NEB)
  • lane NC is a negative control
  • lane CF is DNA polymerase Bst DNA Pol Cys-Free
  • lane D264C is DNA polymerase Bst DNA Pol Cys-Free D264C mutant
  • lane R334C is DNA polymerase Bst DNA Pol Cys-Free R334C mutant
  • lane L364C is DNA polymerase Bst DNA Pol Cys-Free L364C mutant.
  • a higher-purity polymerase is obtained by the above method, and the obtained polymerase has a higher polymerase activity.
  • the above-mentioned ligation product small molecule was dissolved in DMSO, mixed with the aforementioned expressed and purified N-terminal 6His-labeled Bst DNA Pol and its variant polymerase protein at a small molecule molar ratio of 10:1, and incubated at room temperature for 2 hours in the dark. . Afterwards, Zeba Spin Desalting Column (Thermo) was used to remove excess small molecules in the solution. Among them, the junction site of dNTP and linker is shown in Figure 5.
  • Figures 6A-6E show that the dNTP is linked to the polymerase through the linker, and the polymerase linked with the dNTP modified by the single molecule base is obtained.
  • a DNA template design a random sequence of 60 bp in length.
  • the 20-22nd position from the 3' end of the template is three consecutive adenines (A), and at the 5' end of the template, add Biotin tag, design a primer sequence complementary to the 3' end of the template, the length is 19nt, the sequence is shown in Table 1.
  • the extension reaction was performed by incubating the pre-annealed primers and templates with dNTP-linker-polymerase at the optimum temperature for enzymatic activity, 60°C, for 5 minutes.
  • the reacted solution was placed on ice and irradiated with an LED lamp with a wavelength of 365nm-405nm for 15 minutes, to perform photocleavage of the linker connected to the polymerase and dNTP, so that the polymerase was separated from the primer-template complex.
  • the reaction system after irradiation was incubated with streptavidin-labeled magnetic beads for 5 minutes to allow the extended primer-template complexes to bind to the magnetic beads. Photocleaved polymerase and unbound dNTP-linker-polymerase were washed away using magnetic bead wash buffer (0.5 M NaCl, 20 mM Tris-HCl pH 7.5, 1 mM EDTA).
  • the primer-template complex is opened with 50 mM KOH solution, the primer is eluted, neutralized with an equal volume of 50 mM HCl solution, and then used 15% urea denatured PAGE gel for detection. Since only the 5' end of the primers has a fluorophore (FAM), the size of the primers on the urea-denatured PAGE gel can be detected by using the Typhoon fluorescent gel scanner of GE Company, and photographed for comparative analysis.
  • FAM fluorophore
  • Lane P is a separate primer
  • lane P+T is a primer-template complex
  • lane 3nt is an extension reaction of the primer-template complex with a separate polymerase Bst L364C and dTTP to increase the primer length by 3 nucleotides
  • mark +365 lane is the primer-template complex and the polymerase Bst L364C-dUTP for extension reaction.
  • the bands in the lanes are primers that only extend 1 nucleotide; the reaction conditions in the marker-365 lanes are the same as those in the marker+365 lanes. Since 365nm light is not used for cleavage, the primers are covalently bound to the polymerase, and the denatured gel cannot be used. open, so the molecular weight of the primer band is extremely high.

Abstract

The present invention provides a nucleic acid sequencing method, comprising: providing a polymerase linked to dNTP by means of a cleavable group and capable of emitting an optical signal; making the polymerase in contact with 3' end of a primer of a nucleic acid template-primer complex to be measured, complementarily pairing dNTP on the polymerase with a base group on a nucleic acid sequence to be measured, and adding the 3' end of the primer into a polymerase enzymatic reaction; and detecting the optical signal emitted by the polymerase. Subsequently, dNTP and the polymerase are broken, and the new template-primer complex after the leaving of the polymerase may enter the next sequencing cycle.

Description

一种核酸测序方法A nucleic acid sequencing method 技术领域technical field
本发明属于生物技术领域,涉及一种核酸序列的测定方法。The invention belongs to the field of biotechnology, and relates to a method for determining a nucleic acid sequence.
背景技术Background technique
DNA(脱氧核糖核酸)是组成生物体的遗传物质,由不同排列顺序的A、T、C、G组成,这些由不同顺序的A、T、C、G组成的遗传物质蕴含着丰富的遗传信息以及生命内涵。通过核酸测序技术可以获取不同遗传物质的核酸序列,帮助人们了解个体遗传物质之间的差异,这对生命科学研究、疾病诊断、个性化用药等均具有十分重要的意义。DNA (deoxyribonucleic acid) is the genetic material that composes an organism. It is composed of A, T, C, and G in different sequences. These genetic materials composed of A, T, C, and G in different sequences contain rich genetic information. and the meaning of life. Nucleic acid sequences of different genetic materials can be obtained through nucleic acid sequencing technology to help people understand the differences between individual genetic materials, which is of great significance to life science research, disease diagnosis, and personalized medicine.
第一代测序技术以Sanger发明的双脱氧链末端终止法为代表,该方法成本高,速度慢,通量低等不足,不能满足研究及应用的需求。第二代测序技术中Illumina公司的Solexa测序技术成本相对较低,所以被广泛的应用,但是该技术从开始建库到完成测序需要6天的时间,耗时较长。第三代测序技术中Pacific Bioscience公司的SMRT是唯一商业化应用的技术,SMRT技术实现了单分子测序,具有不需要扩增,读长长等优势,虽然该技术相对于Solexa测序技术从建库到完成测序时间短了一些,但是仍需约两天时间才能完成。The first generation sequencing technology is represented by the dideoxy chain end termination method invented by Sanger. This method has disadvantages such as high cost, slow speed and low throughput, which cannot meet the needs of research and application. In the second-generation sequencing technology, Illumina's Solexa sequencing technology has relatively low cost, so it is widely used, but this technology takes 6 days from the beginning of library construction to the completion of sequencing, which is time-consuming. Among the third-generation sequencing technologies, SMRT of Pacific Bioscience is the only commercially applied technology. SMRT technology realizes single-molecule sequencing and has the advantages of no amplification and long read length. Although this technology is compared with Solexa sequencing technology from library construction The time to complete sequencing was shorter, but it still took about two days to complete.
专利US5302509描述了一种对多核苷酸模板测序的方法,该方法包括使用DNA聚合酶或DNA连接酶连续掺入与模板链互补的经标记核苷酸或多核苷酸而实施的多延伸反应。在这种“合成测序(sequencing by synthesis)”反应中,通过连续掺入与模板链互补的单个核苷酸以5'到3'的方向构建了一条与模板链碱基配对的新核苷酸链。将测序反应使用的核苷三磷酸底物封闭以防止过度掺入,差异标记核苷三磷酸底物以使得能够确定作为后续核苷酸被加入的掺入核苷酸的种类。Patent US5302509 describes a method of sequencing a polynucleotide template comprising a multiple extension reaction using DNA polymerase or DNA ligase to successively incorporate labeled nucleotides or polynucleotides complementary to the template strand. In this "sequencing by synthesis" reaction, a new nucleotide base-paired to the template strand is constructed in a 5' to 3' orientation by successively incorporating single nucleotides complementary to the template strand chain. The nucleoside triphosphate substrate used in the sequencing reaction is blocked to prevent overincorporation, and the nucleoside triphosphate substrate is differentially labeled to enable determination of the species of incorporated nucleotide that is added as a subsequent nucleotide.
专利CN106244712A公开了一种DNA测序方法,包括如下步骤:(1)在待测DNA的3'末端增加标签序列,形成含有标签序列的待测DNA;所述标签序列的核苷酸序列与测序引物的核苷酸序列反向互补;(2)将所述含有标签序列的待测DNA和所述测序引物混合,形成5'末端双链主体单链形式的产物;(3)完成步骤(2)后,将产物分别与dATP、dCTP、dTTP和dGTP混合,得到四个体系,分别将每个体系加入特异DNA聚合酶修饰的单分子 器件,读取电信号。实验证明,采用本发明提供的方法可进行DNA测序,具有重要的应用价值。Illumina的测序技术的核心的合成酶链伸反应(Sequencing by synthesis)是一个三个底物的反应,主要包括模板-引物杂交体、dNTP、DNA合成酶,反应动力学是二级反应;本发明提供的方法以dNTP-DNA合成酶和可逆连接复合物为反应底物,与模板-引物杂交体反应,反应动力学是一级反应,因此比Illumina的SBS技术速度更快。此外,Illumina的SBS荧光分子连接在dNTP的碱基上,通常只有一个荧光分子,因此光学信号强度有限;本发明提供的方法在合成酶上连接荧光分子,可以提供更多的连接位点,连接一个以上的荧光集团,因此可以提高光学信号强度。Patent CN106244712A discloses a DNA sequencing method, including the following steps: (1) adding a tag sequence at the 3' end of the DNA to be tested to form the DNA to be tested containing the tag sequence; the nucleotide sequence of the tag sequence and the sequencing primer (2) mixing the DNA to be tested containing the tag sequence and the sequencing primer to form a product in the form of a double-stranded main body at the 5' end; (3) completing step (2) Then, the products were mixed with dATP, dCTP, dTTP and dGTP respectively to obtain four systems, and each system was added to a specific DNA polymerase-modified single-molecule device to read the electrical signal. Experiments show that DNA sequencing can be performed by using the method provided by the invention, which has important application value. The synthase chain extension reaction (Sequencing by synthesis), the core of Illumina's sequencing technology, is a three-substrate reaction, mainly including template-primer hybrid, dNTP, and DNA synthase, and the reaction kinetics is a secondary reaction; the present invention The provided method uses dNTP-DNA synthase and reversible ligation complexes as reaction substrates, reacts with template-primer hybrids, and the reaction kinetics are first-order reactions, so it is faster than Illumina's SBS technology. In addition, Illumina's SBS fluorescent molecule is connected to the base of dNTP, and there is usually only one fluorescent molecule, so the optical signal intensity is limited; the method provided by the present invention connects fluorescent molecules on the synthase, which can provide more connection sites and connect More than one fluorescent group can therefore increase the optical signal intensity.
WO2007076057A2提供了含有具有改善核苷酸类似物进入活性位点区和在活性位点区与核苷酸类似物协调作用的特征的聚合酶的组合物。也提供了制备这种聚合酶和将这种聚合酶用于测序以及DNA复制和扩增的方法,以及聚合酶活性的动力学模型和使用此模型的计算机实现方法。WO2007076057A2 provides compositions comprising polymerases having features of improved entry of nucleotide analogs into the active site region and coordination of nucleotide analogs in the active site region. Also provided are methods of making such polymerases and using such polymerases for sequencing and DNA replication and amplification, as well as kinetic models of polymerase activity and computer-implemented methods using such models.
Sebastian Palluk等(Nature Biotechnology,2018(36):645-650)公开了一种寡核苷酸的合成方法,该寡核苷酸的合成方法是利用末端脱氧核苷酸转移酶(TdT)进行核苷酸的合成。合成的原理是TdT分子可以与单个脱氧核糖核苷三磷酸(dNTP)分子形成连接复合物,引物与形成酶连接体系的dNTP结合后,引物的3'末端与TdT共价结合,形成的新的体系对其它的TdT-dNTP有排斥作用,切断TdT与dNTP之间的连接释放引物并允许随后的延伸,这样就在单链DNA 3'末端上可逆终止的延伸了一个dNTP。但此方法只是应用单链核酸的延伸反应即DNA合成,不涉及鉴定模板DNA的序列,也不涉及给核酸合成酶连接荧光分子或者其他发光基团。Sebastian Palluk et al. (Nature Biotechnology, 2018(36): 645-650) disclosed a method for synthesizing an oligonucleotide using terminal deoxynucleotidyl transferase (TdT) for nuclear Synthesis of Glycosides. The principle of synthesis is that TdT molecule can form a ligation complex with a single deoxyribonucleoside triphosphate (dNTP) molecule. After the primer is combined with the dNTP that forms the enzymatic ligation system, the 3' end of the primer is covalently combined with TdT to form a new complex. The system has a repulsive effect on other TdT-dNTPs, severing the connection between TdT and dNTPs to release the primer and allow subsequent extension, thus reversibly terminating the extension of a dNTP at the 3' end of single-stranded DNA. However, this method only uses the extension reaction of single-stranded nucleic acid, that is, DNA synthesis, and does not involve identifying the sequence of template DNA, nor does it involve connecting fluorescent molecules or other luminescent groups to nucleic acid synthase.
Fei Chen等(Genomics Proteomics Bioinformatics,2013(11):34-40)对可逆终止子的分类及可逆终止子在基因测序技术中应用的发展历史及工作机制做了介绍。Fei Chen et al. (Genomics Proteomics Bioinformatics, 2013(11): 34-40) introduced the classification of reversible terminators and the development history and working mechanism of the application of reversible terminators in gene sequencing technology.
针对于现有技术的不足,本发明提供一种能够快速测定核酸序列的方法。In view of the deficiencies of the prior art, the present invention provides a method capable of rapidly determining a nucleic acid sequence.
发明内容SUMMARY OF THE INVENTION
本发明第一方面提供一种测定核酸序列的方法,所述的测序方法包括如下步骤:A first aspect of the present invention provides a method for determining a nucleic acid sequence, the sequencing method comprising the following steps:
(1)提供聚合酶(polymerase),所述的聚合酶通过可断裂基团(linker)与dNTP连接,并且,所述的聚合酶可发出光信号;(1) providing a polymerase, the polymerase is connected to the dNTP through a cleavable group (linker), and the polymerase can emit a light signal;
(2)使所述聚合酶与待测核酸模板-引物复合体的引物3'端接触,所述聚合酶上的dNTP与待测核酸的序列上的碱基互补配对并在聚合酶酶促反应加入引物3'端;(2) contacting the polymerase with the 3' end of the primer of the nucleic acid template-primer complex to be tested, the dNTPs on the polymerase are complementary to the bases on the sequence of the nucleic acid to be tested, and enzymatically react with the polymerase Add primer 3' end;
(3)检测所述聚合酶发出的光信号。(3) Detecting the light signal emitted by the polymerase.
其中,酶促反应形成一个碱基的延伸后,所连接的聚合酶阻止了另一个dNTP-聚合酶与新形成的引物3'端的反应,造成了链终止。Here, after the enzymatic reaction forms an extension of one base, the attached polymerase prevents the reaction of another dNTP-polymerase with the 3' end of the newly formed primer, resulting in chain termination.
本发明所述的待测核酸可以为DNA或RNA,优选的所述的待测核酸序列为DNA,例如基因组DNA、cDNA以及DNA片段。The nucleic acid to be detected in the present invention can be DNA or RNA, and preferably the nucleic acid sequence to be detected is DNA, such as genomic DNA, cDNA and DNA fragments.
所述的聚合酶可以为目前已知的DNA聚合酶及其变体,例如,所述的聚合酶选自Bst DNA Pol及其变体、DNA Pol I及其变体、DNA Polγ及其变体、T3 DNA Pol及其变体、T5 DNA Pol及其变体、L5 DNA Pol及其变体、DNA Po1II及其变体、DNA Pol B及其变体、DNA Polα及其变体、DNA Pol△及其变体、DNA Polε及其变体、DNA Pol III及其变体、DNA Polβ及其变体、DNA Polσ及其变体、DNA Polλ及其变体、DNA Polμ及其变体、DNA Pol IV及其变体、DNA Pol V及其变体、末端转移酶及其变体中的一种或者两种以上;其中,所述的末端转移酶为末端脱氧核苷酸转移酶(TdT)。The polymerase can be a currently known DNA polymerase and its variants, for example, the polymerase is selected from Bst DNA Pol and its variants, DNA Pol I and its variants, DNA Polγ and its variants. , T3 DNA Pol and its variants, T5 DNA Pol and its variants, L5 DNA Pol and its variants, DNA Pol II and its variants, DNA Pol B and its variants, DNA Polα and its variants, DNA Pol△ and its variants, DNA Polε and its variants, DNA Pol III and its variants, DNA Polβ and its variants, DNA Polσ and its variants, DNA Polλ and its variants, DNA Polμ and its variants, DNA Pol One or more of IV and its variants, DNA Pol V and its variants, terminal transferase and its variants; wherein, the terminal transferase is terminal deoxynucleotidyl transferase (TdT).
优选的,所述的聚合酶为DNA Polβ及其变体,所述的DNA Polβ的序列为SEQ ID NO:1。所述的DNA Polβ变体为DNA Polβ在一个或多个(例如,2个、3个、4个或5个)氨基酸序列位置的突变,所述的突变包含取代、缺失和/或插入,或者与DNA Polβ编码序列具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的DNA Polβ变体具有聚合酶活性。Preferably, the polymerase is DNA Polβ and a variant thereof, and the sequence of the DNA Polβ is SEQ ID NO: 1. The DNA Polβ variant is a mutation of the DNA Polβ at one or more (for example, 2, 3, 4 or 5) amino acid sequence positions, the mutation comprising a substitution, deletion and/or insertion, or At least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5 with DNA Polβ coding sequence % or more sequence identity, and the DNA Polβ variant has polymerase activity.
优选的,所述的DNA Polβ变体的突变位点是在DNA合成的催化中心附近的区域,又不是催化反应所必须的氨基酸残基位置,可以保证不影响催化活性,又保证dNTP离催化中心比较近,不需要特别长的可断裂基团就可以与催化中心接触参与反应。Preferably, the mutation site of the DNA Polβ variant is in the region near the catalytic center of DNA synthesis, and is not the amino acid residue position necessary for the catalytic reaction, which can ensure that the catalytic activity is not affected, and that the dNTP is away from the catalytic center. More recently, it does not require a particularly long cleavable group to contact the catalytic center to participate in the reaction.
更优选的,所述的DNA Polβ的变体是将SEQ ID NO:1第145-355位的任意一个、 两个或三个半胱氨酸突变为非半胱氨酸,优选的,是将SEQ ID NO:1第145-355位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第1-236位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。More preferably, the variant of the DNA Polβ is to mutate any one, two or three cysteines at positions 145-355 of SEQ ID NO: 1 to non-cysteines, preferably, the SEQ ID NO: 1 mutates any one, two or three cysteines at positions 145-355 to serines, and mutates any one, two or three non-cysteines at positions 1-236 to cysteine.
更优选的,所述的DNA Polβ的变体是将SEQ ID NO:1第155-294位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第20-154位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。More preferably, the variant of DNA Polβ is to mutate any one, two or three cysteines at positions 155-294 of SEQ ID NO: 1 to serines, and mutate any of positions 20-154 into serines. One, two or three non-cysteines are mutated to cysteines.
更优选的,将所述的DNA Polβ的变体是将SEQ ID NO:1的第178位、第239位和/或第267位的半胱氨酸突变为丝氨酸,并且将第28位、第33位和/或第149位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。More preferably, the variant of the DNA Polβ is to mutate the cysteine at the 178th position, the 239th position and/or the 267th position of SEQ ID NO: 1 to serine, and mutate the 28th position, the 28th position, the 267th position Any one, two or three non-cysteines at positions 33 and/or 149 are mutated to cysteine.
特别优选的,所述的聚合酶为DNA Polβ的变体,所述的DNA Polβ的变体选自如下变体中的一种或者两种以上的组合:Particularly preferably, the polymerase is a variant of DNA Polβ, and the variant of DNA Polβ is selected from one or a combination of two or more of the following variants:
(1)将SEQ ID NO:1第178位的半胱氨酸突变为丝氨酸,第239位的半胱氨酸突变为丝氨酸,第267位的半胱氨酸突变为丝氨酸,第28位的天冬酰胺突变为半胱氨酸;(1) mutate the cysteine at position 178 of SEQ ID NO: 1 to serine, the cysteine at position 239 to serine, the cysteine at position 267 to serine, and the cysteine at position 28 to serine Paragine is mutated to cysteine;
(2)将SEQ ID NO:1第178位的半胱氨酸突变为丝氨酸,第239位的半胱氨酸突变为丝氨酸,第267位的半胱氨酸突变为丝氨酸,第33位的异亮氨酸突变为半胱氨酸;(2) mutate the cysteine at position 178 of SEQ ID NO: 1 to serine, the cysteine at position 239 to serine, the cysteine at position 267 to serine, the isotope at position 33 to serine Leucine is mutated to cysteine;
(3)将SEQ ID NO:1第178位的半胱氨酸突变为丝氨酸,第239位的半胱氨酸突变为丝氨酸,第267位的半胱氨酸突变为丝氨酸,第149位的精氨酸突变为半胱氨酸。(3) mutate the cysteine at position 178 of SEQ ID NO: 1 to serine, the cysteine at position 239 to serine, the cysteine at position 267 to serine, the sperm at position 149 amino acid to cysteine.
在本发明的一个具体实施方式中,所述的聚合酶的氨基酸序列与SEQ ID NO:3-6具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的DNA Polβ变体具有聚合酶活性。In a specific embodiment of the present invention, the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 3-6 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Polβ variant has polymerase activity.
在本发明的另一个具体实施方式中,所述的聚合酶的氨基酸序列如SEQ ID NO:3-6任一所示。In another specific embodiment of the present invention, the amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 3-6.
优选的,所述的聚合酶为Bst DNA Pol及其变体,所述的Bst DNA Pol的序列为SEQ ID NO:2。所述的Bst DNA Pol变体为Bst DNA Pol在一个或多个(例如,2个、3个、4个或5个)氨基酸序列位置的突变,所述的突变包含取代、缺失和/或插入,或者与Bst DNA Pol 编码序列具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的Bst DNA Pol变体具有聚合酶活性。Preferably, the polymerase is Bst DNA Pol and a variant thereof, and the sequence of the Bst DNA Pol is SEQ ID NO: 2. The Bst DNA Pol variant is a mutation of the Bst DNA Pol at one or more (for example, 2, 3, 4 or 5) amino acid sequence positions, the mutation comprising a substitution, deletion and/or insertion , or at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% with the Bst DNA Pol coding sequence %, more than 99.5% sequence identity, and the Bst DNA Pol variant has polymerase activity.
优选的,所述的Bst DNA Pol变体的突变位点是在DNA合成的催化中心附近的区域,又不是催化反应所必须的氨基酸残基位置,可以保证不影响催化活性,又保证dNTP离催化中心比较近,不需要特别长的可断裂基团就可以与催化中心接触参与反应。Preferably, the mutation site of the Bst DNA Pol variant is in the region near the catalytic center of DNA synthesis, and is not the amino acid residue position necessary for the catalytic reaction, which can ensure that the catalytic activity is not affected, and that the dNTP is separated from the catalytic reaction. The center is relatively close, and it does not require a particularly long cleavable group to contact the catalytic center to participate in the reaction.
更优选的,所述的Bst DNA Pol的变体是将SEQ ID NO:2第85-100或540-570位的任意一个、两个或三个半胱氨酸突变为非半胱氨酸,优选的,是将SEQ ID NO:2第85-100或540-570位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第250-280或320-380位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。More preferably, the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 to non-cysteines, Preferably, any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 are mutated to serines, and any one of positions 250-280 or 320-380 is mutated , two or three non-cysteines are mutated to cysteines.
优选的,所述的Bst DNA Pol的变体是将SEQ ID NO:2第90-95或549-555位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第260-270或330-370位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。Preferably, the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 90-95 or 549-555 of SEQ ID NO: 2 to serine, and mutate the 260- Any one, two or three non-cysteines at positions 270 or 330-370 were mutated to cysteine.
更优选的,所述的Bst DNA Pol的变体是将SEQ ID NO:2的第93位和/或第550位的半胱氨酸突变为丝氨酸,并且将第264位、第334位和/或第364位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。More preferably, the variant of the Bst DNA Pol is to mutate the cysteine at the 93rd position and/or the 550th position of SEQ ID NO:2 to serine, and the 264th position, the 334th position and/ Or any one, two or three non-cysteines at position 364 are mutated to cysteine.
特别优选的,所述的聚合酶为Bst DNA Pol的变体,所述的Bst DNA Pol的变体选自如下变体中的一种或者两种以上的组合:Particularly preferably, the polymerase is a variant of Bst DNA Pol, and the variant of Bst DNA Pol is selected from one or a combination of two or more of the following variants:
(1)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变为丝氨酸;(1) mutate the cysteine at position 93 of SEQ ID NO: 2 to serine, and mutate the cysteine at position 550 to serine;
(2)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变为丝氨酸,第264位的天冬氨酸突变为半胱氨酸;(2) mutating the cysteine at position 93 of SEQ ID NO: 2 to serine, the cysteine at position 550 to serine, and the aspartic acid at position 264 to cysteine;
(3)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变为丝氨酸,第334位的精氨酸突变为半胱氨酸;(3) mutating the cysteine at position 93 of SEQ ID NO: 2 to serine, the cysteine at position 550 to serine, and the arginine at position 334 to cysteine;
(4)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变 为丝氨酸,第364位的亮氨酸突变为半胱氨酸。(4) The cysteine at position 93 of SEQ ID NO: 2 is mutated to serine, the cysteine at position 550 is mutated to serine, and the leucine at position 364 is mutated to cysteine.
在本发明的一个具体实施方式中,所述的聚合酶的氨基酸序列与SEQ ID NO:7-10具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的DNA Polβ变体具有聚合酶活性。In a specific embodiment of the present invention, the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 7-10 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Polβ variant has polymerase activity.
在本发明的另一个具体实施方式中,所述的聚合酶的氨基酸序列如SEQ ID NO:7-10任一所示。In another specific embodiment of the present invention, the amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 7-10.
所述的通过可断裂基团与聚合酶连接的dNTP选自dATP或修饰的dATP、dGTP或修饰的dGTP、dCTP或修饰的dCTP、dTTP或修饰的dTTP、dUTP或修饰的dUTP中的任意一种。The dNTP linked to the polymerase by the cleavable group is selected from any one of dATP or modified dATP, dGTP or modified dGTP, dCTP or modified dCTP, dTTP or modified dTTP, dUTP or modified dUTP .
所述的可断裂基团用于连接聚合酶和修饰的dNTP,或者连接聚合酶和dNTP。优选的,所述的可断裂基团含有可断裂结构X,所述的X选自:The cleavable group is used to link polymerase and modified dNTP, or link polymerase and dNTP. Preferably, the cleavable group contains a cleavable structure X, and the X is selected from:
Figure PCTCN2021129461-appb-000001
Figure PCTCN2021129461-appb-000001
其中,j为1-3的整数,where j is an integer from 1 to 3,
R选自H、C 1- 10直链或支链烷基、C 2- 10烯基、C 2- 10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1- 10烷氧基,芳氧基,含有 N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1- 10直链或支链烷基, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, -S hetero Cycloalkyl, -N heterocyclic aromatic group, -O heterocyclic aromatic group, -S heterocyclic aromatic group, R 1 is selected from H, O 0-10 alkyl group, C 1 - 10 straight chain or branched chain alkyl group,
R'选自N或O。R' is selected from N or O.
更优选的,所述的可断裂基团具有L1-L2-X-L3-L4结构,其中L1为与dNTP或修饰的dNTP结合的端基,L2和L3不存在或者为不可断裂连接基团,L4为与聚合酶结合的端基,X为可断裂基团。More preferably, the cleavable group has the structure of L1-L2-X-L3-L4, wherein L1 is an end group bound to dNTP or modified dNTP, L2 and L3 do not exist or are non-cleavable linking groups, L4 is the terminal group that binds to the polymerase, and X is the cleavable group.
所述的修饰的dNTP为碱基修饰的dNTP,所述的碱基修饰的dNTP为含N原子的碱基修饰的dNTP,更优选的,所述的碱基修饰的dNTP为含氨基的碱基修饰的dNTP。特别优选的,所述的碱基修饰的dNTP为碱基被
Figure PCTCN2021129461-appb-000002
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
The modified dNTP is a base-modified dNTP, and the base-modified dNTP is a base-modified dNTP containing an N atom. More preferably, the base-modified dNTP is an amino-containing base. Modified dNTPs. Particularly preferably, the base-modified dNTPs are those whose bases are
Figure PCTCN2021129461-appb-000002
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
本发明所述的碱基修饰的dNTP既可以是3'位修饰的dNTP,也可以是3'位非修饰的dNTP,所述的3'位修饰的dNTP可以达到阻隔核酸链的延展。The base-modified dNTPs of the present invention can be either 3'-modified dNTPs or 3'-unmodified dNTPs, and the 3'-modified dNTPs can block the extension of nucleic acid chains.
优选的,所述的碱基修饰的dNTP为嘌呤碱基的7号位或者是嘧啶碱基的3号位被以下基团修饰:
Figure PCTCN2021129461-appb-000003
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
Preferably, the base-modified dNTP is that the 7th position of the purine base or the 3rd position of the pyrimidine base is modified by the following groups:
Figure PCTCN2021129461-appb-000003
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
优选的,所述的3'位修饰的dNTP可以为3'位的H原子被以下基团取代:
Figure PCTCN2021129461-appb-000004
Figure PCTCN2021129461-appb-000005
修饰的dNTP,所述的n选自0-12的整数,优选的, 所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
Preferably, the dNTP modified at the 3' position can be substituted by the following group at the H atom at the 3' position:
Figure PCTCN2021129461-appb-000004
Figure PCTCN2021129461-appb-000005
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
在本发明具体实施方式中,所述的碱基修饰的dNTP选自:In a specific embodiment of the present invention, the base-modified dNTP is selected from:
Propargylamino-3'-azidomethyl-dCTP;Propargylamino-3'-azidomethyl-dCTP;
Propargyl-3'-azidomethyl-dCTP;Propargyl-3'-azidomethyl-dCTP;
Propargylamino-3'-azidomethyl-dUTP;Propargylamino-3'-azidomethyl-dUTP;
Propargyl-3'-azidomethyl-dUTP;Propargyl-3'-azidomethyl-dUTP;
7-Deaza-7-Propargylamino-3'-azidomethyl-dGTP;7-Deaza-7-Propargylamino-3'-azidomethyl-dGTP;
7-deaza-propargyl-3'-azidomethyl-dGTP;7-deaza-propargyl-3'-azidomethyl-dGTP;
7-Deaza-7-Propargylamino-3'-azidomethyl-dATP;7-Deaza-7-Propargylamino-3'-azidomethyl-dATP;
7-deaza-propargyl-3'-azidomethyl-dATP;7-deaza-propargyl-3'-azidomethyl-dATP;
Propargylamino dCTP,propargyl-dCTP;Propargylamino dCTP, propargyl-dCTP;
Propargylamino dUTP,propargyl-dUTP;Propargylamino dUTP, propargyl-dUTP;
7-Deaza-7-Propargylamino dATP,7-deaza-propargyl-dATP;7-Deaza-7-Propargylamino dATP, 7-deaza-propargyl-dATP;
7-Deaza-7-Propargylamino dGTP,7-deaza-propargyl-dGTP。7-Deaza-7-Propargylamino dGTP, 7-deaza-propargyl-dGTP.
优选的,所述的连接位点选自下式A、式B、式C或式D。Preferably, the linking site is selected from the following formula A, formula B, formula C or formula D.
Figure PCTCN2021129461-appb-000006
Figure PCTCN2021129461-appb-000006
优选的,所述的X选自:Preferably, the X is selected from:
Figure PCTCN2021129461-appb-000007
Figure PCTCN2021129461-appb-000007
其中,j为1-3的整数,where j is an integer from 1 to 3,
R选自H、C 1- 10直链或支链烷基、C 2- 10烯基、C 2- 10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1- 10烷氧基,芳氧基,含有N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1- 10直链或支链烷基, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, -S hetero Cycloalkyl, -N heterocyclic aromatic group, -O heterocyclic aromatic group, -S heterocyclic aromatic group, R 1 is selected from H, O 0-10 alkyl group, C 1 - 10 straight chain or branched chain alkyl group,
R'选自N或O。R' is selected from N or O.
优选的,所述L1或L4独立的选自马来酰亚胺基、羧基、巯基、叠氮基、炔基、环辛炔基及其衍生物、四嗪基、二硫吡啶基、乙烯基、烯砜基、琥珀酰亚胺基、醛基、酰肼基、胺氧基和α-卤代羰基。Preferably, said L1 or L4 is independently selected from maleimide group, carboxyl group, mercapto group, azido group, alkynyl group, cyclooctynyl group and its derivatives, tetrazinyl group, dithiopyridyl group, vinyl group , alkenyl sulfone group, succinimidyl group, aldehyde group, hydrazide group, amineoxy group and α-halogenated carbonyl group.
优选的,所述L2或L3独立的选自-O(CH 2CH 2O) i-、-(CH 2) i-、-(CH 2) iNH(CH 2) i-、-(CH 2) iCOO(CH 2) i-、-(CH 2) iCONH(CH 2) i-、-(CH 2) iO(CH 2) i-、-(CH 2) iCO(CH 2) i-或
Figure PCTCN2021129461-appb-000008
中的一个或二个以上基团的组合,所述的i选自0-10的整数。
Preferably, said L2 or L3 is independently selected from -O(CH 2 CH 2 O) i -, -(CH 2 ) i -, -(CH 2 ) i NH(CH 2 ) i -, -(CH 2 ) i COO(CH 2 ) i -, -(CH 2 ) i CONH(CH 2 ) i -, -(CH 2 ) i O(CH 2 ) i -, -(CH 2 ) i CO(CH 2 ) i -or
Figure PCTCN2021129461-appb-000008
One or a combination of two or more groups in , the i is selected from an integer of 0-10.
优选的,所述的可断裂基团选自:Preferably, the cleavable group is selected from:
Figure PCTCN2021129461-appb-000009
。所述连接聚合酶与dNTP的可断裂基团在现有技术中常用的方法的作用下可发生断裂,例如,经紫外线照射或者经化学物质作用后基团发生断裂,从而使聚合酶与dNTP分离。
Figure PCTCN2021129461-appb-000009
. The cleavable group linking the polymerase to the dNTP can be cleaved under the action of a method commonly used in the prior art, for example, the group is cleaved after being irradiated by ultraviolet light or by the action of a chemical substance, thereby separating the polymerase from the dNTP. .
本发明所述的聚合酶可以为可发光聚合酶,例如所述的聚合酶经突变为可发光聚合酶,或将传统聚合酶与发光蛋白融合形成融合蛋白成为可发光聚合酶。The polymerase of the present invention can be a luminescent polymerase, for example, the polymerase is mutated into a luminescent polymerase, or a traditional polymerase is fused with a luminescent protein to form a fusion protein to become a luminescent polymerase.
本发明所述聚合酶还可以是经修饰后含有一个或多个荧光标记物、磷光标记物或者化学发光标记物的聚合酶,所述的修饰不影响聚合酶的活性,所述荧光标记物、磷光标记物或者化学发光标记物经光照后发出光信号。所述的化学发光标记物为目前已知的在可见光和/或紫外线的照射下发出光信号的基团,例如,辣根过氧化酶、碱性磷酸酶、荧光素酶及其衍生物,吖啶酯类、过氧化草酸酯类、洛粉碱、光泽精、鲁米诺及其衍生物,催化化学发光标记物的金属离子复合物、电催化化学发光标记物、苯并二呋喃、次甲基、三苯甲烷、吖嗪、三吩嗪、萘二甲酰亚胺、吡唑、萘醌、蒽醌、单偶氮和双偶氮以及上述基团的衍生物,具有可见光区吸收带的苯的衍生物,具有紫外吸收带的C=C、C=O、-N=N-、-NO 2、-C=S等。 The polymerase of the present invention can also be a modified polymerase containing one or more fluorescent markers, phosphorescent markers or chemiluminescent markers, the modification does not affect the activity of the polymerase, the fluorescent markers, Phosphorescent labels or chemiluminescent labels emit a light signal upon exposure to light. The chemiluminescent label is a group currently known to emit light signal under the irradiation of visible light and/or ultraviolet light, for example, horseradish peroxidase, alkaline phosphatase, luciferase and its derivatives, acridine Pyridinyl esters, peroxyoxalates, lofenacine, lucigenin, luminol and its derivatives, metal ion complexes of catalytic chemiluminescent markers, electrocatalytic chemiluminescent markers, benzodifuran, methine base, triphenylmethane, azine, triphenazine, naphthalimide, pyrazole, naphthoquinone, anthraquinone, monoazo and bisazo and derivatives of the above groups, with absorption bands in the visible light region Derivatives of benzene, C=C, C=O, -N=N-, -NO 2 , -C=S, etc. with ultraviolet absorption bands.
所述的荧光标记物包括荧光素,Cy2,Cy3,Cy5,Cy7,Alexa Fluor系列染料,异硫氰酸荧光素,5-六氯荧光素氨基磷酸酯,6-羧基-2',4,7,7'-四氯荧光素琥珀酰亚胺酯,6-羧基-4',5'-二氯-2',7'-二甲氧基荧光素琥珀酰亚胺酯,德克萨斯红,罗丹明110,荧光素马来酰亚胺染料,氟硼二吡咯,呫吨、羰花青、1,1'-二(十八烷基)-3,3,3',3'-四甲基吲哚羰花青高 氯酸盐、3,3'-二(十八烷基)-氧杂羰花青高氯酸盐、芘、酞菁、6-羧基罗丹明6G、异硫氰酸荧光素、6-羧基荧光素琥珀酰亚胺酯、5-羧基荧光素琥珀酰亚胺酯、5-羧基荧光素、6-羧基荧光素、罗丹明B、罗丹明6G、7-氨基-4-甲基香豆素、二氢罗丹明123、四甲基罗丹明-6-马来酰亚胺、四甲基罗丹明-5-马来酰亚胺、5-吲哚乙酰氨基荧光素、双[NN-双(羧甲基)氨甲基]荧光素四钠盐、荧光素-5-马来酰亚胺、磺基罗丹明G、7-羟基-4-甲基香豆素、3-氰基-7-羟基香豆素、荧光素二钠盐、四甲基罗丹明-6-异硫氰酸、6-羧基-X-罗丹明琥珀酰亚胺酯、5-羧基-X-罗丹明琥珀酰亚胺酯、6-羧基-X-罗丹明、5-羧基四甲基罗丹明琥珀酰亚胺酯、6-羧基四甲基罗丹明、5-羧基四甲基罗丹明、产生能量转移染料及荧光蛋白,例如GFP(绿色荧光蛋白),CFP(青色荧光蛋白),BFP(蓝色荧光蛋白),YFP(黄色荧光蛋白)及衍生物中的一种或多种。The fluorescent markers include fluorescein, Cy2, Cy3, Cy5, Cy7, Alexa Fluor series dyes, fluorescein isothiocyanate, 5-hexachlorofluorescein phosphoramidate, 6-carboxy-2',4,7 ,7'-Tetrachlorofluorescein succinimide ester, 6-carboxy-4',5'-dichloro-2',7'-dimethoxyfluorescein succinimide ester, Texas Red , rhodamine 110, fluorescein maleimide dye, fluoroborodipyrrole, xanthene, carbocyanine, 1,1'-bis(octadecyl)-3,3,3',3'-tetra Methyl indocarbocyanine perchlorate, 3,3'-bis(octadecyl)-oxacarbocyanine perchlorate, pyrene, phthalocyanine, 6-carboxyrhodamine 6G, isothiocyanate acid fluorescein, 6-carboxyfluorescein succinimidyl ester, 5-carboxyfluorescein succinimidyl ester, 5-carboxyfluorescein, 6-carboxyfluorescein, rhodamine B, rhodamine 6G, 7-amino- 4-Methylcoumarin, Dihydrorhodamine 123, Tetramethylrhodamine-6-maleimide, Tetramethylrhodamine-5-maleimide, 5-indoleacetamidofluorescein , bis[NN-bis(carboxymethyl)aminomethyl]fluorescein tetrasodium salt, fluorescein-5-maleimide, sulforhodamine G, 7-hydroxy-4-methylcoumarin, 3-cyano-7-hydroxycoumarin, fluorescein disodium salt, tetramethylrhodamine-6-isothiocyanate, 6-carboxy-X-rhodamine succinimide ester, 5-carboxy-X -Rhodamine succinimide ester, 6-carboxy-X-rhodamine, 5-carboxytetramethylrhodamine succinimide ester, 6-carboxytetramethylrhodamine, 5-carboxytetramethylrhodamine, Energy transfer dyes and fluorescent proteins such as one or more of GFP (green fluorescent protein), CFP (cyan fluorescent protein), BFP (blue fluorescent protein), YFP (yellow fluorescent protein) and derivatives are produced.
所述的磷光标记物包括过渡金属铱(Ir)或钌(Ru)的氮杂芳基络合物,优选如下式所示:Said phosphorescent label includes transition metal iridium (Ir) or ruthenium (Ru) aza-heteroaryl complexes, preferably as shown in the following formula:
Figure PCTCN2021129461-appb-000010
Figure PCTCN2021129461-appb-000010
优选的,所述的步骤(1)中提供的通过可断裂基团与dNTP连接的聚合酶为提供四种连接有不同有dNTP的聚合酶,并且所述的四种连接有不同有dNTP的聚合酶连接有不同的荧光标记物、磷光标记物或者化学发光标记物。Preferably, the polymerase provided in the step (1) connected to dNTP through a cleavable group is to provide four types of polymerases connected with different dNTPs, and the four types of polymerases connected with different dNTPs The enzymes are linked with different fluorescent, phosphorescent or chemiluminescent labels.
优选的,所述的步骤(2)中待测核酸固定在支持物上。Preferably, in the step (2), the nucleic acid to be detected is immobilized on the support.
本领域技术人员可以理解,所述的支持物可以市售获得,例如玻璃、陶瓷、二氧化硅和硅的材料制备得到支持物,也可以使用具有金表面的支持物。所述支持物通常包含一个平的表面(平面),或至少包含其中待连接的多核酸大致在同一平面上的结构。或者,所述固体支持物可以为非平面,例如微珠。可使用任何合适的尺寸。例如,所述支持物可以在 每个方向上为1到10cm的数量级。Those skilled in the art can understand that the supports can be obtained commercially, such as supports prepared from glass, ceramic, silica and silicon materials, and supports having a gold surface can also be used. The support typically comprises a flat surface (plane), or at least a structure in which the polynucleic acids to be linked are substantially in the same plane. Alternatively, the solid support may be non-planar, such as microbeads. Any suitable size can be used. For example, the support may be on the order of 1 to 10 cm in each direction.
如果仅将引物混合物接枝到固体支持物表面,并且待扩增模板存在于游离溶液中,则扩增反应大体上可如WO98/44151中所述进行。简单的说,引物附着之后,在允许模板和固定化引物杂交的条件下将所述固体支持物与待扩增模板相接触。通常在合适的杂交条件下将所述模板加入到游离溶液中,这对本领域技术人员是明显的。通常杂交条件为例如起始变性步骤之后为40℃,5×SSC。接着可以进行固相扩增,扩增的第一步是引物延伸步骤,其中将核苷酸添加到与模板杂交的固定化引物的3'端,以生成完整延伸的互补链。这样,该互补链在其3'端包含能够与固定在固体支持物上的第二引物分子结合的序列。进一步的扩增轮次导致形成结合于固体支持物的模板分子簇或集群。If only the primer mixture is grafted to the surface of the solid support, and the template to be amplified is present in free solution, the amplification reaction can be carried out substantially as described in WO98/44151. Briefly, after primer attachment, the solid support is brought into contact with the template to be amplified under conditions that allow hybridization of the template and the immobilized primer. The template is typically added to the free solution under suitable hybridization conditions, as will be apparent to those skilled in the art. Typical hybridization conditions are eg 40°C, 5xSSC after an initial denaturation step. Solid phase amplification can then be performed, the first step of which is a primer extension step in which nucleotides are added to the 3' end of the immobilized primer hybridized to the template to generate a fully extended complementary strand. Thus, the complementary strand contains at its 3' end a sequence capable of binding to the second primer molecule immobilized on the solid support. Further rounds of amplification result in the formation of clusters or clusters of template molecules bound to the solid support.
优选的,所述的步骤(2)中还包括在待测核酸上添加引物的步骤。Preferably, the step (2) further includes the step of adding primers to the nucleic acid to be detected.
所述的步骤(3)中通过光照激发聚合酶发出光信号。In the step (3), the polymerase is excited by light to emit a light signal.
所述的步骤(3)中通过现有技术中常用的检测系统检测聚合酶上发出的光信号,例如,利用特定波长激光或使用其它合适的光源,可以通过CCD或其它合适检测方法检测聚合酶上发出的光信号。例如,专利PCT/US07/007991中所描述的方法可以用于测定荧光信号。或者通过加入酶促化学发光、直接化学发光的底物和或催化试剂,或者通过施加电压进行电催化化学发光,并检测发光信号。In the step (3), the light signal emitted on the polymerase is detected by a detection system commonly used in the prior art. For example, by using a specific wavelength laser or using other suitable light sources, the polymerase can be detected by CCD or other suitable detection methods. the light signal emitted on it. For example, the method described in patent PCT/US07/007991 can be used to measure the fluorescent signal. Either by adding enzymatic chemiluminescence, direct chemiluminescence substrates and or catalytic reagents, or by applying voltage, electrocatalytic chemiluminescence is performed, and the luminescent signal is detected.
优选的,本发明所述的核酸测序方法中还包括步骤(4)使聚合酶与dNTP之间断裂。Preferably, the nucleic acid sequencing method of the present invention further comprises step (4) of cleaving between the polymerase and the dNTP.
所述的步骤(4)中聚合酶与dNTP之间断裂是使连接聚合酶和dNTP之间的可断裂基团发生断裂,所述的可断裂基团的断裂是通过现有技术中常用的方法实现的,例如,通过紫外线照射或者经化学物质,例如是还原剂作用、pH变化使连接聚合酶和dNTP之间的基团的断裂。In the step (4), the cleavage between the polymerase and the dNTP is to break the cleavable group connecting the polymerase and the dNTP, and the cleavage of the cleavable group is performed by a method commonly used in the prior art. This is accomplished, for example, by cleavage of the group linking the polymerase and the dNTP by UV irradiation or by chemical substances, such as the action of reducing agents, pH changes.
在本发明的实施方式中,所述的紫外线可以为365nm-410nm的紫外线;所述的还原剂可以为DTT、TCEP、THPP,所述改变溶液的试剂包括有机酸,无机酸,有机碱,无机碱。In the embodiment of the present invention, the ultraviolet rays can be ultraviolet rays of 365nm-410nm; the reducing agent can be DTT, TCEP, THPP, and the reagents for changing the solution include organic acids, inorganic acids, organic bases, inorganic alkali.
本发明所述的步骤(4)步骤中还包括用洗脱液去除聚合酶。The step (4) of the present invention also includes removing the polymerase with an eluent.
本发明提供的核酸测序方法还包括步骤(5),即重复步骤(1)-(4)。The nucleic acid sequencing method provided by the present invention further comprises step (5), that is, repeating steps (1)-(4).
优选的,本发明提供的核酸测序方法还包括核酸样品前处理,所述的核酸样品前处理包括核酸样品文库构建及扩增。Preferably, the nucleic acid sequencing method provided by the present invention further includes nucleic acid sample pretreatment, and the nucleic acid sample pretreatment includes nucleic acid sample library construction and amplification.
所述的核酸样品文库构建包括在核酸两端添加接头(adapter),得到待测核酸。The nucleic acid sample library construction includes adding adapters at both ends of the nucleic acid to obtain the nucleic acid to be tested.
优选的,所述的核酸样品文库构建包括将核酸样品片段化后在核酸两端添加接头,得到待测核酸。Preferably, the nucleic acid sample library construction includes adding a linker at both ends of the nucleic acid after fragmenting the nucleic acid sample to obtain the nucleic acid to be tested.
接头通常是可以通过常规方法合成的短寡核苷酸。接头可以通过多种方式附着至靶标核酸片段的5'端和3'端。优选的,将两种不同的接头序列附着至待扩增核酸上,以使一个接头附着至核酸分子的一端而另一个接头分子附着至核酸分子另一端。Linkers are typically short oligonucleotides that can be synthesized by conventional methods. Adapters can be attached to the 5' and 3' ends of target nucleic acid fragments in a variety of ways. Preferably, two different linker sequences are attached to the nucleic acid to be amplified, such that one linker is attached to one end of the nucleic acid molecule and the other linker molecule is attached to the other end of the nucleic acid molecule.
接头包含允许使用固定于固体支持物上的扩增引物分子扩增核酸的序列。为了起到核酸扩增模板的作用,模板构建体的一条单链必须包含与正向扩增引物中序列互补的序列(以使所述正向引物分子可结合并引发互补链合成)以及与反向扩增引物分子中序列相对应的序列(以使反向引物分子可以结合到互补链上)。接头中允许与引物分子杂交的序列通常长约20-30个核苷酸,但是本发明并不限于此长度的序列。Adapters contain sequences that allow amplification of nucleic acid using amplification primer molecules immobilized on a solid support. In order to function as a nucleic acid amplification template, one single strand of the template construct must contain a sequence complementary to the sequence in the forward amplification primer (so that the forward primer molecule can bind and initiate complementary strand synthesis) and a reverse The sequence corresponding to the sequence in the amplification primer molecule (so that the reverse primer molecule can bind to the complementary strand). The sequences in the linker that allow hybridization to the primer molecule are typically about 20-30 nucleotides in length, although the invention is not limited to sequences of this length.
扩增引物中序列以及接头中的相应序列通常来说对本发明并不重要,只要所述引物分子可以与扩增序列相互作用来指导桥式扩增即可。引物设计的原则通常对本领域技术人员来说都是很熟悉的。The sequences in the amplification primers and the corresponding sequences in the adapters are generally not critical to the present invention, so long as the primer molecules can interact with the amplification sequences to direct bridge amplification. The principles of primer design are generally familiar to those skilled in the art.
所述的扩增是待测核酸通过与固定在载体上的引物进行桥式PCR或者滚环扩增技术进行待测核酸的扩增。The amplification is the amplification of the nucleic acid to be detected by performing bridge PCR or rolling circle amplification technology with primers immobilized on the carrier.
本发明第二方面提供了一种聚合酶,所述的聚合酶为DNA Polβ的变体,所述的DNA Polβ的变体是将SEQ ID NO:1第145-355位的任意一个、两个或三个半胱氨酸突变为非半胱氨酸,优选的,是将SEQ ID NO:1第145-355位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第1-236位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A second aspect of the present invention provides a polymerase, the polymerase is a variant of DNA Polβ, and the variant of DNA Polβ is a combination of any one or two of positions 145-355 of SEQ ID NO: 1 Or three cysteines are mutated to non-cysteines, preferably, any one, two or three cysteines at positions 145-355 of SEQ ID NO: 1 are mutated to serine, and the first Any one, two or three non-cysteines at positions 1-236 were mutated to cysteine.
优选的,所述的DNA Polβ的变体是将SEQ ID NO:1第155-294位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第20-154位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。更优选的,将所述的DNA Polβ的变体是将SEQ ID NO:1的第178位、 第239位和/或第267位的半胱氨酸突变为丝氨酸,并且将第28位、第33位和/或第149位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。Preferably, the variant of DNA Polβ is to mutate any one, two or three cysteines at positions 155-294 of SEQ ID NO: 1 to serines, and mutate any one of positions 20-154 of SEQ ID NO: 1 , two or three non-cysteines are mutated to cysteines. More preferably, the variant of the DNA Polβ is to mutate the cysteine at the 178th position, the 239th position and/or the 267th position of SEQ ID NO: 1 to serine, and mutate the 28th position, the 28th position, the 267th position Any one, two or three non-cysteines at positions 33 and/or 149 are mutated to cysteine.
特别优选的,所述的聚合酶为DNA Polβ的变体,所述的DNA Polβ的变体选自如下变体中的一种或者两种以上的组合:Particularly preferably, the polymerase is a variant of DNA Polβ, and the variant of DNA Polβ is selected from one or a combination of two or more of the following variants:
(1)将SEQ ID NO:1第178位的半胱氨酸突变为丝氨酸,第239位的半胱氨酸突变为丝氨酸,第267位的半胱氨酸突变为丝氨酸,第28位的天冬酰胺突变为半胱氨酸;(1) mutate the cysteine at position 178 of SEQ ID NO: 1 to serine, the cysteine at position 239 to serine, the cysteine at position 267 to serine, and the cysteine at position 28 to serine Paragine is mutated to cysteine;
(2)将SEQ ID NO:1第178位的半胱氨酸突变为丝氨酸,第239位的半胱氨酸突变为丝氨酸,第267位的半胱氨酸突变为丝氨酸,第33位的异亮氨酸突变为半胱氨酸;(2) mutate the cysteine at position 178 of SEQ ID NO: 1 to serine, the cysteine at position 239 to serine, the cysteine at position 267 to serine, the isotope at position 33 to serine Leucine is mutated to cysteine;
(3)将SEQ ID NO:1第178位的半胱氨酸突变为丝氨酸,第239位的半胱氨酸突变为丝氨酸,第267位的半胱氨酸突变为丝氨酸,第149位的精氨酸突变为半胱氨酸。(3) mutate the cysteine at position 178 of SEQ ID NO: 1 to serine, the cysteine at position 239 to serine, the cysteine at position 267 to serine, the sperm at position 149 amino acid to cysteine.
在本发明的一个具体实施方式中,所述的聚合酶的氨基酸序列与SEQ ID NO:3-6具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的DNA Polβ变体具有聚合酶活性。In a specific embodiment of the present invention, the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 3-6 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Polβ variant has polymerase activity.
在本发明的另一个具体实施方式中,所述的聚合酶的氨基酸序列如SEQ ID NO:3-6任一所示。In another specific embodiment of the present invention, the amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 3-6.
本发明的第三方面,提供一种聚合酶,所述的聚合酶为Bst DNA Pol及其变体,所述的Bst DNA Pol的序列为SEQ ID NO:2。所述的Bst DNA Pol变体为Bst DNA Pol在一个或多个(例如,2个、3个、4个或5个)氨基酸序列位置的突变,所述的突变包含取代、缺失和/或插入,或者与Bst DNA Pol编码序列具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的Bst DNA Pol变体具有聚合酶活性。The third aspect of the present invention provides a polymerase, the polymerase is Bst DNA Pol and a variant thereof, and the sequence of the Bst DNA Pol is SEQ ID NO: 2. The Bst DNA Pol variant is a mutation of the Bst DNA Pol at one or more (for example, 2, 3, 4 or 5) amino acid sequence positions, the mutation comprising a substitution, deletion and/or insertion , or at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% with the Bst DNA Pol coding sequence %, more than 99.5% sequence identity, and the Bst DNA Pol variant has polymerase activity.
优选的,所述的Bst DNA Pol变体的突变位点是在DNA合成的催化中心附近的区域,又不是催化反应所必须的氨基酸残基位置,可以保证不影响催化活性,又保证dNTP离催化中心比较近,不需要特别长的可断裂基团就可以与催化中心接触参与反应。Preferably, the mutation site of the Bst DNA Pol variant is in the region near the catalytic center of DNA synthesis, and is not the amino acid residue position necessary for the catalytic reaction, which can ensure that the catalytic activity is not affected, and that the dNTP is separated from the catalytic reaction. The center is relatively close, and it does not require a particularly long cleavable group to contact the catalytic center to participate in the reaction.
更优选的,所述的Bst DNA Pol的变体是将SEQ ID NO:2第85-100或540-570位的 任意一个、两个或三个半胱氨酸突变为非半胱氨酸,优选的,是将SEQ ID NO:2第85-100或540-570位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第250-280或320-380位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。优选的,所述的Bst DNA Pol的变体是将SEQ ID NO:2第90-95或549-555位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第260-270或330-370位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。更优选的,将所述的Bst DNA Pol的变体是将SEQ ID NO:2的第93位和/或第550位的半胱氨酸突变为丝氨酸,并且将第264位、第334位和/或第364位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。More preferably, the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 to non-cysteines, Preferably, any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 are mutated to serines, and any one of positions 250-280 or 320-380 is mutated , two or three non-cysteines are mutated to cysteines. Preferably, the variant of the Bst DNA Pol is to mutate any one, two or three cysteines at positions 90-95 or 549-555 of SEQ ID NO: 2 to serine, and mutate the 260- Any one, two or three non-cysteines at positions 270 or 330-370 were mutated to cysteine. More preferably, the variant of the Bst DNA Pol is to mutate the cysteine at the 93rd position and/or the 550th position of SEQ ID NO: 2 to serine, and mutate the 264th position, the 334th position and the /or any one, two or three non-cysteines at position 364 are mutated to cysteine.
特别优选的,所述的聚合酶为Bst DNA Pol的变体,所述的Bst DNA Pol的变体选自如下变体中的一种或者两种以上的组合:Particularly preferably, the polymerase is a variant of Bst DNA Pol, and the variant of Bst DNA Pol is selected from one or a combination of two or more of the following variants:
(1)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变为丝氨酸;(1) mutate the cysteine at position 93 of SEQ ID NO: 2 to serine, and mutate the cysteine at position 550 to serine;
(2)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变为丝氨酸,第264位的天冬氨酸突变为半胱氨酸;(2) mutating the cysteine at position 93 of SEQ ID NO: 2 to serine, the cysteine at position 550 to serine, and the aspartic acid at position 264 to cysteine;
(3)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变为丝氨酸,第334位的精氨酸突变为半胱氨酸;(3) mutating the cysteine at position 93 of SEQ ID NO: 2 to serine, the cysteine at position 550 to serine, and the arginine at position 334 to cysteine;
(4)将SEQ ID NO:2第93位的半胱氨酸突变为丝氨酸,第550位的半胱氨酸突变为丝氨酸,第364位的亮氨酸突变为半胱氨酸。(4) The cysteine at position 93 of SEQ ID NO: 2 is mutated to serine, the cysteine at position 550 is mutated to serine, and the leucine at position 364 is mutated to cysteine.
在本发明的一个具体实施方式中,所述的聚合酶的氨基酸序列与SEQ ID NO:7-10具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的DNA Polβ变体具有聚合酶活性。In a specific embodiment of the present invention, the amino acid sequence of the polymerase has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93% with SEQ ID NOs: 7-10 %, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Polβ variant has polymerase activity.
在本发明的另一个具体实施方式中,所述的聚合酶的氨基酸序列如SEQ ID NO:7-10任一所示。In another specific embodiment of the present invention, the amino acid sequence of the polymerase is shown in any of SEQ ID NOs: 7-10.
本发明的第四方面还提供了一种聚合酶复合物,所述的聚合酶复合物为通过可断裂基团将聚合酶与dNTP连接制备得到。其中,所述的可断裂基团用于连接聚合酶和修饰的 dNTP,或者,连接聚合酶和dNTP。优选的,所述的可断裂基团含有可断裂结构X,所述的X选自:The fourth aspect of the present invention also provides a polymerase complex prepared by linking the polymerase and dNTP through a cleavable group. Wherein, the cleavable group is used to connect the polymerase and the modified dNTP, or, the polymerase and the dNTP. Preferably, the cleavable group contains a cleavable structure X, and the X is selected from:
Figure PCTCN2021129461-appb-000011
Figure PCTCN2021129461-appb-000011
其中,j为1-3的整数,where j is an integer from 1 to 3,
R选自H、C 1- 10直链或支链烷基、C 2- 10烯基、C 2- 10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1- 10烷氧基,芳氧基,含有N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1- 10直链或支链烷基, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, -S hetero Cycloalkyl, -N heterocyclic aromatic group, -O heterocyclic aromatic group, -S heterocyclic aromatic group, R 1 is selected from H, O 0-10 alkyl group, C 1 - 10 straight chain or branched chain alkyl group,
R'选自N或O。R' is selected from N or O.
更优选的,所述的可断裂基团具有L1-L2-X-L3-L4结构,其中,L1为与dNTP或修饰的dNTP结合的端基,L2和L3不存在或为不可断裂连接基团,L4为与聚合酶结合的端基,X为可断裂基团。More preferably, the cleavable group has a structure of L1-L2-X-L3-L4, wherein L1 is an end group bound to dNTP or modified dNTP, and L2 and L3 do not exist or are non-cleavable linking groups. , L4 is the end group bound to the polymerase, and X is the cleavable group.
所述的修饰的dNTP为碱基修饰的dNTP,所述的碱基修饰的dNTP为含N原子的碱 基修饰的dNTP,更优选的,所述的碱基修饰的dNTP为含氨基的碱基修饰的dNTP。特别优选的,所述的碱基修饰的dNTP为碱基被
Figure PCTCN2021129461-appb-000012
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
The modified dNTP is a base-modified dNTP, and the base-modified dNTP is a base-modified dNTP containing an N atom. More preferably, the base-modified dNTP is an amino-containing base. Modified dNTPs. Particularly preferably, the base-modified dNTPs are those whose bases are
Figure PCTCN2021129461-appb-000012
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
优选的,所述的碱基修饰的dNTP为嘌呤碱基的7号位或者是嘧啶碱基的3号位被以下基团修饰:
Figure PCTCN2021129461-appb-000013
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
Preferably, the base-modified dNTP is that the 7th position of the purine base or the 3rd position of the pyrimidine base is modified by the following groups:
Figure PCTCN2021129461-appb-000013
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
本发明所述的碱基修饰的dNTP既可以是3'位修饰的dNTP,也可以是3'位非修饰的dNTP,所述的3'位修饰的dNTP可以达到阻隔核酸链的延展。所述的3'位修饰的dNTP可以为3'位的H原子被以下基团取代:
Figure PCTCN2021129461-appb-000014
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
The base-modified dNTPs of the present invention can be either 3'-modified dNTPs or 3'-unmodified dNTPs, and the 3'-modified dNTPs can block the extension of nucleic acid chains. The dNTP modified at the 3' position can be substituted by the following group at the H atom at the 3' position:
Figure PCTCN2021129461-appb-000014
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
在本发明具体实施方式中,所述的碱基修饰的dNTP选自:In a specific embodiment of the present invention, the base-modified dNTP is selected from:
Propargylamino-3'-azidomethyl-dCTP;Propargylamino-3'-azidomethyl-dCTP;
Propargyl-3'-azidomethyl-dCTP;Propargyl-3'-azidomethyl-dCTP;
Propargylamino-3'-azidomethyl-dUTP;Propargylamino-3'-azidomethyl-dUTP;
Propargyl-3'-azidomethyl-dUTP;Propargyl-3'-azidomethyl-dUTP;
7-Deaza-7-Propargylamino-3'-azidomethyl-dGTP;7-Deaza-7-Propargylamino-3'-azidomethyl-dGTP;
7-deaza-propargyl-3'-azidomethyl-dGTP;7-deaza-propargyl-3'-azidomethyl-dGTP;
7-Deaza-7-Propargylamino-3'-azidomethyl-dATP;7-Deaza-7-Propargylamino-3'-azidomethyl-dATP;
7-deaza-propargyl-3'-azidomethyl-dATP;7-deaza-propargyl-3'-azidomethyl-dATP;
Propargylamino dCTP,propargyl-dCTP;Propargylamino dCTP, propargyl-dCTP;
Propargylamino dUTP,propargyl-dUTP;Propargylamino dUTP, propargyl-dUTP;
7-Deaza-7-Propargylamino dATP,7-deaza-propargyl-dATP;7-Deaza-7-Propargylamino dATP, 7-deaza-propargyl-dATP;
7-Deaza-7-Propargylamino dGTP,7-deaza-propargyl-dGTP。7-Deaza-7-Propargylamino dGTP, 7-deaza-propargyl-dGTP.
优选的,所述的连接位点选自下式A、式B、式C或式D。Preferably, the linking site is selected from the following formula A, formula B, formula C or formula D.
Figure PCTCN2021129461-appb-000015
Figure PCTCN2021129461-appb-000015
优选的,所述的X选自:Preferably, the X is selected from:
Figure PCTCN2021129461-appb-000016
Figure PCTCN2021129461-appb-000016
其中,j为1-3的整数,where j is an integer from 1 to 3,
R选自H、C 1- 10直链或支链烷基、C 2- 10烯基、C 2- 10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1- 10烷氧基,芳氧基,含有N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1- 10直链或支链烷基, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, -S hetero Cycloalkyl, -N heterocyclic aromatic group, -O heterocyclic aromatic group, -S heterocyclic aromatic group, R 1 is selected from H, O 0-10 alkyl group, C 1 - 10 straight chain or branched chain alkyl group,
R'选自N或O。R' is selected from N or O.
优选的,所述L1或L4独立的选自马来酰亚胺基、羧基、巯基、叠氮基、炔基、环辛炔基及其衍生物、四嗪基、二硫吡啶基、乙烯基、烯砜基、琥珀酰亚胺基、醛基、酰肼基、胺氧基和α-卤代羰基。Preferably, said L1 or L4 is independently selected from maleimide group, carboxyl group, mercapto group, azido group, alkynyl group, cyclooctynyl group and its derivatives, tetrazinyl group, dithiopyridyl group, vinyl group , alkenyl sulfone group, succinimidyl group, aldehyde group, hydrazide group, amineoxy group and α-halogenated carbonyl group.
优选的,所述L2或L3独立的选自-O(CH 2CH 2O) i-、-(CH 2) i-、-(CH 2) iNH(CH 2) i-、-(CH 2) iCOO(CH 2) i-、-(CH 2) iCONH(CH 2) i-、-(CH 2) iO(CH 2) i-、-(CH 2) iCO(CH 2) i-或
Figure PCTCN2021129461-appb-000017
中的一个或二个以上基团的组合,所述的i选自0-10的整数。
Preferably, said L2 or L3 is independently selected from -O(CH 2 CH 2 O) i -, -(CH 2 ) i -, -(CH 2 ) i NH(CH 2 ) i -, -(CH 2 ) i COO(CH 2 ) i -, -(CH 2 ) i CONH(CH 2 ) i -, -(CH 2 ) i O(CH 2 ) i -, -(CH 2 ) i CO(CH 2 ) i -or
Figure PCTCN2021129461-appb-000017
One or a combination of two or more groups in , the i is selected from an integer of 0-10.
优选的,所述的可断裂基团选自:Preferably, the cleavable group is selected from:
Figure PCTCN2021129461-appb-000018
Figure PCTCN2021129461-appb-000018
所述连接聚合酶与dNTP的可断裂基团在现有技术中常用的方法的作用下可发生断裂,例如,经紫外线照射或者经化学物质作用后基团发生断裂,从而使聚合酶与dNTP分离。The cleavable group linking the polymerase and the dNTP can be cleaved under the action of methods commonly used in the prior art, for example, the group is cleaved after being irradiated by ultraviolet light or by the action of a chemical substance, so that the polymerase is separated from the dNTP. .
本发明所述的聚合酶复合物还可以包括荧光标记物、磷光标记物或者化学发光标记物。The polymerase complexes of the present invention may also include fluorescent labels, phosphorescent labels or chemiluminescent labels.
所述的化学发光标记物为目前已知的在可见光和/或紫外线的照射下发出光信号的基团,例如,辣根过氧化酶、碱性磷酸酶、荧光素酶及其衍生物,吖啶酯类、过氧化草酸酯类、洛粉碱、光泽精、鲁米诺及其衍生物,催化化学发光标记物的金属离子复合物、电催化化学发光标记物、苯并二呋喃、次甲基、三苯甲烷、吖嗪、三吩嗪、萘二甲酰亚胺、吡唑、萘醌、蒽醌、单偶氮和双偶氮以及上述基团的衍生物,具有可见光区吸收带的苯的衍生物,具有紫外吸收带的C=C、C=O、-N=N-、-NO 2、-C=S等。 The chemiluminescent label is a group currently known to emit light signal under the irradiation of visible light and/or ultraviolet light, for example, horseradish peroxidase, alkaline phosphatase, luciferase and its derivatives, acridine Pyridine esters, peroxyoxalates, lofenacine, lucigenin, luminol and their derivatives, metal ion complexes of catalytic chemiluminescent markers, electrocatalytic chemiluminescent markers, benzodifuran, methine base, triphenylmethane, azine, triphenazine, naphthalimide, pyrazole, naphthoquinone, anthraquinone, monoazo and bisazo and derivatives of the above groups, with absorption bands in the visible light region Derivatives of benzene, C=C, C=O, -N=N-, -NO 2 , -C=S, etc. with ultraviolet absorption bands.
所述的荧光标记物包括所述的荧光标记物包括荧光素,Cy2,Cy3,Cy5,Cy7,Alexa Fluor系列染料,异硫氰酸荧光素,5-六氯荧光素氨基磷酸酯,6-羧基-2',4,7,7'-四氯荧光素琥珀酰亚胺酯,6-羧基-4',5'-二氯-2',7'-二甲氧基荧光素琥珀酰亚胺酯,德克萨斯红,罗丹明110,荧光素马来酰亚胺染料,氟硼二吡咯,呫吨、羰花青、1,1'-二(十八烷基)-3,3,3',3'-四甲基吲哚羰花青高氯酸盐、3,3'-二(十八烷基)-氧杂羰花青高氯酸盐、芘、酞菁、6-羧基罗丹明6G、异硫氰酸荧光素、6-羧基荧光素琥珀酰亚胺酯、5-羧基荧光素琥珀酰亚胺酯、5- 羧基荧光素、6-羧基荧光素、罗丹明B、罗丹明6G、7-氨基-4-甲基香豆素、二氢罗丹明123、四甲基罗丹明-6-马来酰亚胺、四甲基罗丹明-5-马来酰亚胺、5-吲哚乙酰氨基荧光素、双[NN-双(羧甲基)氨甲基]荧光素四钠盐、荧光素-5-马来酰亚胺、磺基罗丹明G、7-羟基-4-甲基香豆素、3-氰基-7-羟基香豆素、荧光素二钠盐、四甲基罗丹明-6-异硫氰酸、6-羧基-X-罗丹明琥珀酰亚胺酯、5-羧基-X-罗丹明琥珀酰亚胺酯、6-羧基-X-罗丹明、5-羧基四甲基罗丹明琥珀酰亚胺酯、6-羧基四甲基罗丹明、5-羧基四甲基罗丹明、产生能量转移染料及荧光蛋白,例如GFP(绿色荧光蛋白),CFP(青色荧光蛋白),BFP(蓝色荧光蛋白),YFP(黄色荧光蛋白)及衍生物中的一种或多种。The fluorescent labels include fluorescein, Cy2, Cy3, Cy5, Cy7, Alexa Fluor series dyes, fluorescein isothiocyanate, 5-hexachlorofluorescein phosphoramidate, 6-carboxyl -2',4,7,7'-Tetrachlorofluorescein succinimide ester, 6-carboxy-4',5'-dichloro-2',7'-dimethoxyfluorescein succinimide Ester, Texas Red, Rhodamine 110, Fluorescein Maleimide Dye, Fluoroborodipyrrole, Xanthene, Carbocyanine, 1,1'-Di(octadecyl)-3,3, 3',3'-Tetramethylindocarbocyanine perchlorate, 3,3'-bis(octadecyl)-oxacarbocyanine perchlorate, pyrene, phthalocyanine, 6-carboxy Rhodamine 6G, Fluorescein isothiocyanate, 6-carboxyfluorescein succinimidyl ester, 5-carboxyfluorescein succinimidyl ester, 5-carboxyfluorescein, 6-carboxyfluorescein, rhodamine B, rhodamine Ming 6G, 7-amino-4-methylcoumarin, dihydrorhodamine 123, tetramethylrhodamine-6-maleimide, tetramethylrhodamine-5-maleimide, 5 -Indoleacetamidofluorescein, bis[NN-bis(carboxymethyl)aminomethyl]fluorescein tetrasodium salt, fluorescein-5-maleimide, sulforhodamine G, 7-hydroxy-4 - Methylcoumarin, 3-cyano-7-hydroxycoumarin, fluorescein disodium salt, tetramethylrhodamine-6-isothiocyanate, 6-carboxy-X-rhodamine succinimide Ester, 5-carboxy-X-rhodamine succinimide ester, 6-carboxy-X-rhodamine, 5-carboxytetramethylrhodamine succinimide ester, 6-carboxytetramethylrhodamine, 5-carboxytetramethylrhodamine Carboxytetramethylrhodamine, produces energy transfer dyes and fluorescent proteins, such as GFP (green fluorescent protein), CFP (cyan fluorescent protein), BFP (blue fluorescent protein), YFP (yellow fluorescent protein) and one of its derivatives one or more.
所述的磷光标记物包括过渡金属铱(Ir)或钌(Ru)的氮杂芳基络合物,优选如下式所示:Said phosphorescent label includes transition metal iridium (Ir) or ruthenium (Ru) aza-heteroaryl complexes, preferably as shown in the following formula:
Figure PCTCN2021129461-appb-000019
Figure PCTCN2021129461-appb-000019
本发明的第五方面提供了一种核酸的合成方法,所述的核酸的合成方法包括:A fifth aspect of the present invention provides a nucleic acid synthesis method, the nucleic acid synthesis method comprising:
(1)提供聚合酶,所述的聚合酶通过可断裂基团与dNTP连接;(1) providing a polymerase, the polymerase is connected to the dNTP through a cleavable group;
(2)将所述聚合酶与待测核酸的模板-引物复合体的引物3'端接触,所述聚合酶上的dNTP与待测核酸的序列上的碱基互补配对并在聚合酶酶促反应加入引物3'端上。(2) contacting the polymerase with the 3' end of the primer of the template-primer complex of the nucleic acid to be tested, the dNTP on the polymerase is complementary to the bases on the sequence of the nucleic acid to be tested and enzymatically catalyzed by the polymerase The reaction was added to the 3' end of the primer.
其中,酶促反应形成一个碱基的延伸后,所连接的聚合酶阻止了另一个dNTP-聚合酶与新形成的引物3'端的反应,造成了链终止。Here, after the enzymatic reaction forms an extension of one base, the attached polymerase prevents the reaction of another dNTP-polymerase with the 3' end of the newly formed primer, resulting in chain termination.
优选的,所述的核酸的合成方法还包括步骤(3)使聚合酶与dNTP之间断裂;及(4)重复(1)-(3)步骤。Preferably, the nucleic acid synthesis method further comprises step (3) cleavage between the polymerase and dNTP; and (4) repeating steps (1)-(3).
其中,所述的可断裂基团用于连接聚合酶和修饰的dNTP,或者连接聚合酶和dNTP。Wherein, the cleavable group is used for linking polymerase and modified dNTP, or linking polymerase and dNTP.
优选的,所述的可断裂基团含有可断裂结构X,所述的X选自:Preferably, the cleavable group contains a cleavable structure X, and the X is selected from:
Figure PCTCN2021129461-appb-000020
Figure PCTCN2021129461-appb-000020
其中,j为1-3的整数,where j is an integer from 1 to 3,
R选自H、C 1- 10直链或支链烷基、C 2- 10烯基、C 2- 10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1- 10烷氧基,芳氧基,含有N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1- 10直链或支链烷基, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, -S hetero Cycloalkyl, -N heterocyclic aromatic group, -O heterocyclic aromatic group, -S heterocyclic aromatic group, R 1 is selected from H, O 0-10 alkyl group, C 1 - 10 straight chain or branched chain alkyl group,
R'选自N或O。R' is selected from N or O.
更优选的,所述的可断裂基团具有L1-L2-X-L3-L4结构,其中L1为与dNTP或修饰的dNTP结合的端基,L2和L3不存在或者为不可断裂连接基团,L4为与聚合酶结合的端基,X为可断裂基团。More preferably, the cleavable group has the structure of L1-L2-X-L3-L4, wherein L1 is an end group bound to dNTP or modified dNTP, L2 and L3 do not exist or are non-cleavable linking groups, L4 is the terminal group that binds to the polymerase, and X is the cleavable group.
所述的修饰的dNTP为碱基修饰的dNTP,所述的碱基修饰的dNTP为含N原子的碱基修饰的dNTP,更优选的,所述的碱基修饰的dNTP为含氨基的碱基修饰的dNTP。特别 优选的,所述的碱基修饰的dNTP为碱基被
Figure PCTCN2021129461-appb-000021
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
The modified dNTP is a base-modified dNTP, and the base-modified dNTP is a base-modified dNTP containing an N atom. More preferably, the base-modified dNTP is an amino-containing base. Modified dNTPs. Particularly preferably, the base-modified dNTPs are those whose bases are
Figure PCTCN2021129461-appb-000021
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
本发明所述的碱基修饰的dNTP既可以是3'位修饰的dNTP,也可以是3'位非修饰的dNTP,所述的3'位修饰的dNTP可以达到阻隔核酸链的延展。The base-modified dNTPs of the present invention can be either 3'-modified dNTPs or 3'-unmodified dNTPs, and the 3'-modified dNTPs can block the extension of nucleic acid chains.
优选的,所述的碱基修饰的dNTP为嘌呤碱基的7号位或者是嘧啶碱基的3号位被以下基团修饰:
Figure PCTCN2021129461-appb-000022
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
Preferably, the base-modified dNTP is that the 7th position of the purine base or the 3rd position of the pyrimidine base is modified by the following groups:
Figure PCTCN2021129461-appb-000022
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
优选的,所述的3'位修饰的dNTP可以为3'位的H原子被以下基团取代:
Figure PCTCN2021129461-appb-000023
Figure PCTCN2021129461-appb-000024
修饰的dNTP,所述的n选自0-12的整数,优选的,所述的n选自1-6的整数,例如n选自1、2、3、4、5或6,所述的G选自-COOH、-SH、-NH 2、-N 3、-CH=CH 2、-C≡CH。
Preferably, the dNTP modified at the 3' position can be substituted by the following group at the H atom at the 3' position:
Figure PCTCN2021129461-appb-000023
Figure PCTCN2021129461-appb-000024
Modified dNTP, the n is selected from an integer of 0-12, preferably, the n is selected from an integer of 1-6, for example, n is selected from 1, 2, 3, 4, 5 or 6, the said G is selected from -COOH, -SH, -NH2 , -N3 , -CH= CH2 , -C≡CH.
在本发明具体实施方式中,所述的碱基修饰的dNTP选自:In a specific embodiment of the present invention, the base-modified dNTP is selected from:
Propargylamino-3'-azidomethyl-dCTP;Propargylamino-3'-azidomethyl-dCTP;
Propargyl-3'-azidomethyl-dCTP;Propargyl-3'-azidomethyl-dCTP;
Propargylamino-3'-azidomethyl-dUTP;Propargylamino-3'-azidomethyl-dUTP;
Propargyl-3'-azidomethyl-dUTP;Propargyl-3'-azidomethyl-dUTP;
7-Deaza-7-Propargylamino-3'-azidomethyl-dGTP;7-Deaza-7-Propargylamino-3'-azidomethyl-dGTP;
7-deaza-propargyl-3'-azidomethyl-dGTP;7-deaza-propargyl-3'-azidomethyl-dGTP;
7-Deaza-7-Propargylamino-3'-azidomethyl-dATP;7-Deaza-7-Propargylamino-3'-azidomethyl-dATP;
7-deaza-propargyl-3'-azidomethyl-dATP;7-deaza-propargyl-3'-azidomethyl-dATP;
Propargylamino dCTP,propargyl-dCTP;Propargylamino dCTP, propargyl-dCTP;
Propargylamino dUTP,propargyl-dUTP;Propargylamino dUTP, propargyl-dUTP;
7-Deaza-7-Propargylamino dATP,7-deaza-propargyl-dATP;7-Deaza-7-Propargylamino dATP, 7-deaza-propargyl-dATP;
7-Deaza-7-Propargylamino dGTP,7-deaza-propargyl-dGTP。7-Deaza-7-Propargylamino dGTP, 7-deaza-propargyl-dGTP.
优选的,所述的连接位点选自下式A、式B、式C或式D。Preferably, the linking site is selected from the following formula A, formula B, formula C or formula D.
Figure PCTCN2021129461-appb-000025
Figure PCTCN2021129461-appb-000025
优选的,所述的X选自:Preferably, the X is selected from:
Figure PCTCN2021129461-appb-000026
Figure PCTCN2021129461-appb-000026
其中,j为1-3的整数,where j is an integer from 1 to 3,
R选自H、C 1- 10直链或支链烷基、C 2- 10烯基、C 2- 10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1- 10烷氧基,芳氧基,含有N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1- 10直链或支链烷基, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl , C 2-10 alkynyl , C 3-10 cycloalkyl, aryl, heterocyclic containing N, O or S Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1 - 10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, -S hetero Cycloalkyl, -N heterocyclic aromatic group, -O heterocyclic aromatic group, -S heterocyclic aromatic group, R 1 is selected from H, O 0-10 alkyl group, C 1 - 10 straight chain or branched chain alkyl group,
R'选自N或O。R' is selected from N or O.
优选的,所述L1或L4独立的选自马来酰亚胺基、羧基、巯基、叠氮基、炔基、环辛炔基及其衍生物、四嗪基、二硫吡啶基、乙烯基、烯砜基、琥珀酰亚胺基、醛基、酰肼基、胺氧基和α-卤代羰基。Preferably, said L1 or L4 is independently selected from maleimide group, carboxyl group, mercapto group, azido group, alkynyl group, cyclooctynyl group and its derivatives, tetrazinyl group, dithiopyridyl group, vinyl group , alkenyl sulfone group, succinimidyl group, aldehyde group, hydrazide group, amineoxy group and α-halogenated carbonyl group.
优选的,所述L2或L3独立的选自-O(CH 2CH 2O) i-、-(CH 2) i-、-(CH 2) iNH(CH 2) i-、-(CH 2) iCOO(CH 2) i-、-(CH 2) iCONH(CH 2) i-、-(CH 2) iO(CH 2) i-、-(CH 2) iCO(CH 2) i-或
Figure PCTCN2021129461-appb-000027
中的一个或二个以上基团的组合,所述的i选自0-10的整数。
Preferably, said L2 or L3 is independently selected from -O(CH 2 CH 2 O) i -, -(CH 2 ) i -, -(CH 2 ) i NH(CH 2 ) i -, -(CH 2 ) i COO(CH 2 ) i -, -(CH 2 ) i CONH(CH 2 ) i -, -(CH 2 ) i O(CH 2 ) i -, -(CH 2 ) i CO(CH 2 ) i -or
Figure PCTCN2021129461-appb-000027
One or a combination of two or more groups in , the i is selected from an integer of 0-10.
优选的,所述的可断裂基团选自:Preferably, the cleavable group is selected from:
Figure PCTCN2021129461-appb-000028
。所述的引物可以为游离状态或者固定在支持物上。
Figure PCTCN2021129461-appb-000028
. The primers can be free or immobilized on a support.
所述的步骤(3)中聚合酶与dNTP之间断裂是使连接聚合酶和dNTP之间的可断裂基团发生断裂,所述的连接聚合酶和dNTP之间的可断裂基团的断裂是通过现有技术中常用的方法实现的,例如,通过紫外线照射或者经化学物质,例如是还原剂作用、pH变化使连接聚合酶和dNTP之间的基团的断裂。The cleavage between the polymerase and dNTP in the step (3) is to break the cleavable group between the ligation polymerase and the dNTP, and the cleavage of the cleavable group between the ligation polymerase and the dNTP is This is achieved by methods commonly used in the prior art, for example, by UV irradiation or by the cleavage of the group linking the polymerase and the dNTP by chemical substances, such as the action of reducing agents, pH changes.
本发明提供的核酸测序方法中所用的聚合酶通过可断裂基团与dNTP连接,且含有荧光标记物、化学发光标记物或者磷光标记物,利用该聚合酶实施的核酸测序方法可以实现核酸序列的快速测定。The polymerase used in the nucleic acid sequencing method provided by the present invention is connected to the dNTP through a cleavable group, and contains a fluorescent marker, a chemiluminescent marker or a phosphorescent marker, and the nucleic acid sequencing method implemented by the polymerase can realize the nucleic acid sequence. Quick determination.
附图说明Description of drawings
以下,结合附图来详细说明本发明的实施例,其中:Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings, wherein:
图1:连接dNTP与聚合酶的linker的结构式,具体为PC Mal-NHS碳酸盐脂;Figure 1: Structural formula of linker connecting dNTP and polymerase, specifically PC Mal-NHS carbonate lipid;
图2:DNA聚合酶Bst DNA Pol及其变体的纯化结果图,其中,泳道M代表Marker;泳道CF代表Bst DNA Pol Cys-Free突变位点为C93S/C550S,泳道D264C代表突变位点为C93S/C550S/D264C的Bst DNA Pol变体,泳道R334C代表突变位点为C93S/C550S/R334C的Bst DNA Pol变体,泳道L364C代表突变位点为C93S/C550S/L364C的Bst DNA Pol变体;Figure 2: Purification results of DNA polymerase Bst DNA Pol and its variants, in which lane M represents Marker; lane CF represents Bst DNA Pol Cys-Free mutation site is C93S/C550S, lane D264C represents mutation site is C93S /C550S/D264C Bst DNA Pol variant, lane R334C represents the Bst DNA Pol variant with the mutation site C93S/C550S/R334C, and lane L364C represents the Bst DNA Pol variant with the mutation site C93S/C550S/L364C;
图3:实施例1制备获得的DNA聚合酶Bst DNA Pol聚合酶酶活检测,其中,泳道M 代表Marker,泳道PC代表阳性对照,泳道NC代表阴性对照,泳道CF代表Bst DNA Pol Cys-Free突变位点为C93S/C550S,泳道D264C代表突变位点为C93S/C550S/D264C的Bst DNA Pol变体,泳道R334C代表突变位点为C93S/C550S/R334C的Bst DNA Pol变体,泳道L364C代表突变位点为C93S/C550S/L364C的Bst DNA Pol变体;Figure 3: DNA polymerase Bst DNA Pol polymerase activity detection prepared in Example 1, wherein, lane M represents Marker, lane PC represents positive control, lane NC represents negative control, and lane CF represents Bst DNA Pol Cys-Free mutation The site is C93S/C550S, lane D264C represents the Bst DNA Pol variant with the mutation site C93S/C550S/D264C, lane R334C represents the Bst DNA Pol variant with the mutation site C93S/C550S/R334C, and lane L364C represents the mutation site The point is the Bst DNA Pol variant of C93S/C550S/L364C;
图4A:dATP碱基修饰结构式,具体为7-deaza-Propargylamino-dATP,分子量为543.26;Figure 4A: dATP base modification structural formula, specifically 7-deaza-Propargylamino-dATP, with a molecular weight of 543.26;
图4B:dCTP碱基修饰结构式,具体为Propargylamino-dCTP,分子量为520.22;Figure 4B: dCTP base modification structural formula, specifically Propargylamino-dCTP, with a molecular weight of 520.22;
图4C:dGTP碱基修饰结构式,具体为7-deaza-Propargylamino-dGTP,分子量为559.26;Figure 4C: dGTP base modification structural formula, specifically 7-deaza-Propargylamino-dGTP, with a molecular weight of 559.26;
图4D:dUTP碱基修饰结构式,具体为Propargylamino-dUTP,分子量为521.20;Figure 4D: The structural formula of dUTP base modification, specifically Propargylamino-dUTP, the molecular weight is 521.20;
图5:dNTP与linker连接位点;Figure 5: The junction site of dNTP and linker;
图6A:质谱检测DNA聚合酶Bst DNA Pol及其变体与小分子的连接,具体为Bst DNA Pol Cys-Free与dUTP的连接质谱图;Figure 6A: Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol Cys-Free and dUTP;
图6B:质谱检测DNA聚合酶Bst DNA Pol及其变体与小分子的连接,具体为突变位点为C93S/C550S/L364C的Bst DNA Pol变体的质谱图;Figure 6B: Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the mass spectrogram of the Bst DNA Pol variant whose mutation site is C93S/C550S/L364C;
图6C:质谱检测DNA聚合酶Bst DNA Pol及其变体与小分子的连接,具体为突变位点为C93S/C550S/L364C的Bst DNA Pol变体与dUTP的连接质谱图;Figure 6C: Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol variants with C93S/C550S/L364C mutation sites and dUTP;
图6D:质谱检测DNA聚合酶Bst DNA Pol及其变体与小分子的连接,具体为突变位点为C93S/C550S/R334C的Bst DNA Pol变体与dUTP的连接质谱图;Figure 6D: Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol variants with C93S/C550S/R334C mutation sites and dUTP;
图6E:质谱检测DNA聚合酶Bst DNA Pol及其变体与小分子的连接,具体为突变位点为C93S/C550S/D264C的Bst DNA Pol变体与dUTP的连接质谱图;Figure 6E: Mass spectrometry detects the connection of DNA polymerase Bst DNA Pol and its variants to small molecules, specifically the connection mass spectrum of Bst DNA Pol variants with C93S/C550S/D264C mutation sites and dUTP;
图7:测序过程的反应模式图,具体为设计模板引物复合体,将其与dNTP-linker-聚合酶进行延伸反应,洗去未结合的dNTP-linker-聚合酶后进行荧光或者发光检测;光催化切断linker,使聚合酶脱离引物-模板复合体。使用缓冲液将被光切掉的聚合酶洗去,重复上述步骤进行下一个核苷酸的加入;Figure 7: The reaction mode diagram of the sequencing process, specifically designing a template-primer complex, carrying out an extension reaction with dNTP-linker-polymerase, washing away the unbound dNTP-linker-polymerase, and performing fluorescence or luminescence detection; Catalytic cleavage of the linker frees the polymerase from the primer-template complex. Use buffer to wash away the polymerase that has been cut off by light, and repeat the above steps to add the next nucleotide;
图8:单核苷酸延伸检测电泳图,其中,泳道P为单独的引物;泳道P+T为引物-模板复合体;泳道3nt为引物-模板复合体与单独的聚合酶和外加dTTP进行延伸反应,使引物 长度增加3个核苷酸;标记+365泳道为引物-模板复合体与dUTP-linker-聚合酶进行延伸反应,由于聚合酶与dUTP被linker连接,在延伸完成后不会脱离引物-模板复合体,阻碍了其他dUTP-linker-聚合酶结合在引物-模板复合体上,无法进行后续的延伸反应,因此只能延伸增加1个核苷酸,在延伸反应后经过365nm光照射后,引物与聚合酶分离,因此泳道中的条带为只延伸1个核苷酸长度的引物;标记-365泳道中反应条件与标记+365泳道相同,由于未使用365nm光进行切割,引物与聚合酶共价结合,变性胶无法打开,因此引物条带的分子量极高。Figure 8: Single nucleotide extension detection electropherogram, in which lane P is the primer alone; lane P+T is the primer-template complex; lane 3nt is the extension of the primer-template complex with a separate polymerase and dTTP Reaction to increase the primer length by 3 nucleotides; mark +365 lane as primer-template complex and dUTP-linker-polymerase for extension reaction, since the polymerase and dUTP are linked by the linker, the primer will not be detached after the extension is completed -Template complex, which prevents other dUTP-linker-polymerase from binding to the primer-template complex, and cannot carry out the subsequent extension reaction, so it can only extend by 1 nucleotide. After the extension reaction, after 365nm light irradiation , the primers are separated from the polymerase, so the bands in the lane are primers that extend only 1 nucleotide in length; the reaction conditions in the marker-365 lane are the same as those in the marker+365 lane, because 365nm light is not used for cleavage, the primers and the polymerase The enzyme is covalently bound, and the denaturing gel cannot be opened, so the molecular weight of the primer band is extremely high.
具体实施方式Detailed ways
本发明所述“核酸”是指至少两个共价连接在一起的核苷酸。The term "nucleic acid" in the present invention refers to at least two nucleotides covalently linked together.
本发明所述的“dNTP”是指脱氧核糖核苷三磷酸(deoxy-ribonucleotide triphosphate)或核糖核苷三磷酸,通常包括dATP、dGTP、dTTP、dCTP或dUTP等。The "dNTP" in the present invention refers to deoxy-ribonucleotide triphosphate or ribonucleoside triphosphate, and usually includes dATP, dGTP, dTTP, dCTP or dUTP and the like.
本发明所述的“突变体”或“变体”是指野生型聚合酶中一个或多个氨基酸位点发生改变形成的具有同一功能的酶或蛋白。The "mutant" or "variant" in the present invention refers to an enzyme or protein with the same function formed by changing one or more amino acid sites in the wild-type polymerase.
本发明所述的“基团的组合”是指一个或多个取代基基团之间以共价键的形式连接所形成新的基团。The "combination of groups" mentioned in the present invention refers to a new group formed by connecting one or more substituent groups in the form of covalent bonds.
本发明所述的“修饰”是指在物质的分子结构中一个或多个基团被取代。"Modification" as used in the present invention refers to the substitution of one or more groups in the molecular structure of a substance.
本发明中所述的术语C 0-10烷基,C 0烷基是指H,因此,C 0-10烷基包括H、C 1烷基、C 2烷基、C 3烷基、C 4烷基、C 5烷基、C 6烷基、C 7烷基、C 8烷基、C 9烷基、C 10烷基。 The term C 0-10 alkyl described in the present invention, C 0 alkyl refers to H, therefore, C 0-10 alkyl includes H, C 1 alkyl, C 2 alkyl, C 3 alkyl, C 4 Alkyl, C5 alkyl, C6 alkyl, C7 alkyl, C8 alkyl, C9 alkyl, C10 alkyl.
本发明中所述的术语C 1-10直链/支链烷基,包括甲基、乙基、C 3直链/支链烷基、C 4直链/支链烷基、C 5直链/支链烷基、C 6直链/支链烷基、C 7直链/支链烷基、C 8直链/支链烷基、C 9直链/支链烷基、C 10直链/支链烷基。 The term C 1-10 straight chain/branched chain alkyl described in the present invention includes methyl, ethyl, C 3 straight chain/branched chain alkyl, C 4 straight chain/branched chain alkyl, C 5 straight chain /branched alkyl, C6 linear/branched alkyl, C7 linear/branched alkyl, C8 linear/branched alkyl, C9 linear/branched alkyl, C10 linear /branched alkyl.
本发明中所述的术语C 3-10支链烷基,包括异丙基、异丁基、叔丁基、异戊基。 The term C 3-10 branched chain alkyl described in the present invention includes isopropyl, isobutyl, tert-butyl, and isoamyl.
本发明中所述的术语C 3-10环烷基,包括C 3环烷基、C 4环烷基、C 5环烷基、C 6环烷基、C 7环烷基、C 8环烷基、C 9环烷基、C 10环烷基。 The term C 3-10 cycloalkyl described in the present invention includes C 3 cycloalkyl, C 4 cycloalkyl, C 5 cycloalkyl, C 6 cycloalkyl, C 7 cycloalkyl, C 8 cycloalkane group, C 9 cycloalkyl, C 10 cycloalkyl.
本发明所述的术语卤素,包括氟、氯、溴、碘。The term halogen described in the present invention includes fluorine, chlorine, bromine and iodine.
本发明所述的术语杂环烷基是指含3-10个环原子,优选5-10个环原子的非芳香的饱和单环或多环环系,其中的一个或多个环原子不是碳原子,而是例如氮、氧或硫原子。优选的杂环烷基含有5-6个环原子。杂环烷基前的前缀氮杂、氧杂或硫杂分别是指至少有一个氮、氧或硫原子作为环原子。The term heterocycloalkyl as used herein refers to a non-aromatic saturated mono- or polycyclic ring system containing 3-10 ring atoms, preferably 5-10 ring atoms, one or more of which is not carbon atoms, but for example nitrogen, oxygen or sulfur atoms. Preferred heterocycloalkyl groups contain 5-6 ring atoms. The prefix aza, oxa or thia before heterocycloalkyl means that there is at least one nitrogen, oxygen or sulfur atom respectively as a ring atom.
本发明所述的术语杂环芳香基是指含5-14个环原子,优选5-10个环原子的芳香单环或多环环系,其中的一个或多个环原子不是碳原子,而是例如氮、氧或硫原子。优选的杂环芳香基含有5-6个环原子。代表性的杂环芳香基包括吡嗪基、呋喃基、噻吩基、吡啶基、嘧啶基、异噁唑基、异噻唑基、噁唑基、噻唑基、吡唑基、吡咯基、吡唑基、三唑基、吡嗪基、哒嗪基、喹喔啉基、2,3-二氮杂萘基、咪唑并[1,2-a]吡啶、咪唑并[2,1-b]噻唑基、吲哚基、氮杂吲哚基、苯并咪唑基、苯并噻吩基、喹啉基、咪唑基、噻吩并吡啶基、喹唑啉基、噻吩并嘧啶基、吡咯并吡啶基、咪唑并吡啶基、异喹啉基、1,2,4-三嗪基、苯并噻唑基等。The term heterocyclic aromatic group in the present invention refers to an aromatic monocyclic or polycyclic ring system containing 5-14 ring atoms, preferably 5-10 ring atoms, wherein one or more ring atoms are not carbon atoms, and are, for example, nitrogen, oxygen or sulfur atoms. Preferred heterocyclic aromatic groups contain 5-6 ring atoms. Representative heterocyclic aromatic groups include pyrazinyl, furyl, thienyl, pyridyl, pyrimidinyl, isoxazolyl, isothiazolyl, oxazolyl, thiazolyl, pyrazolyl, pyrrolyl, pyrazolyl , triazolyl, pyrazinyl, pyridazinyl, quinoxalinyl, 2,3-diazanaphthyl, imidazo[1,2-a]pyridine, imidazo[2,1-b]thiazolyl , indolyl, azaindolyl, benzimidazolyl, benzothienyl, quinolinyl, imidazolyl, thienopyridyl, quinazolinyl, thienopyrimidinyl, pyrrolopyridyl, imidazolyl Pyridyl, isoquinolinyl, 1,2,4-triazinyl, benzothiazolyl and the like.
实施例1聚合酶制备Example 1 Preparation of polymerase
一、克隆和Bst DNA Pol表达菌株1. Cloning and Bst DNA Pol expression strains
使用用于生产N末端6His标记的Bst DNA Pol及其变体及其变体的表达构建体pET15-Bst DNA Pol。对于N末端6His标记的Bst DNA Pol及其变体由于产率增加而在细胞质中进行表达。The expression construct pET15-Bst DNA Pol for the production of N-terminal 6His-tagged Bst DNA Pol and its variants and variants thereof was used. For N-terminal 6His-tagged Bst DNA Pol and its variants are expressed in the cytoplasm due to increased yield.
二、蛋白的表达和纯化2. Protein expression and purification
使用通用的化学转化方法,将N末端6His标记的Bst DNA Pol及其变体的表达质粒导入BL21(DE3)的表达菌株中进行表达。使用缓冲液A(50mM Tris-HCl pH8.0、800mM NaCl、1mM EDTA、1mM DTT)对细菌进行破碎离心。依次使用PEI溶液和饱和硫酸铵溶液除去细菌上清中的DNA和其他杂质蛋白。利用His-trap FF层析柱和AKTA蛋白纯化系统(GE Healthcare)对N末端6His标记Bst DNA Pol及其变体蛋白进行纯化。收集蛋白洗脱液,利用Heparin HP层析柱(GE Healthcare)进行纯化。收集蛋白洗脱液利用SDS-PAGE胶进行蛋白纯度的检测,结果如图2显示,纯化的DNA聚合酶Bst DNA Pol及其变体的纯度大 于95%,泳道M为蛋白Marker(Thermo Fisher),泳道CF为DNA聚合酶Bst DNA Pol Cys-Free,泳道D264C为DNA聚合酶Bst DNA Pol Cys-Free D264C突变体,泳道R334C为DNA聚合酶Bst DNA Pol Cys-Free R334C突变体,泳道L364C为DNA聚合酶Bst DNA Pol Cys-Free L364C突变体。测定纯化的DNA聚合酶Bst DNA Pol及其变体浓度,分装冻存在-80℃超低温冰箱,留待制备dNTP-linker-聚合酶复合体。Using a general chemical transformation method, the expression plasmids of N-terminal 6His-tagged Bst DNA Pol and its variants were introduced into the expression strain of BL21 (DE3) for expression. Bacteria were disrupted and centrifuged using buffer A (50 mM Tris-HCl pH 8.0, 800 mM NaCl, 1 mM EDTA, 1 mM DTT). DNA and other impurity proteins in the bacterial supernatant were removed using PEI solution and saturated ammonium sulfate solution in sequence. The N-terminal 6His-tagged Bst DNA Pol and its variant proteins were purified using His-trap FF column and AKTA protein purification system (GE Healthcare). The protein eluate was collected and purified using a Heparin HP column (GE Healthcare). The protein eluate was collected and used SDS-PAGE gel to detect the protein purity. The results are shown in Figure 2. The purity of the purified DNA polymerase Bst DNA Pol and its variants is greater than 95%, and lane M is the protein Marker (Thermo Fisher), Lane CF is DNA polymerase Bst DNA Pol Cys-Free, lane D264C is DNA polymerase Bst DNA Pol Cys-Free D264C mutant, lane R334C is DNA polymerase Bst DNA Pol Cys-Free R334C mutant, lane L364C is DNA polymerase Enzyme Bst DNA Pol Cys-Free L364C mutant. Measure the concentration of purified DNA polymerase Bst DNA Pol and its variants, and store in aliquots and freeze at -80°C ultra-low temperature until the dNTP-linker-polymerase complex is prepared.
三、聚合酶活性的检测3. Detection of polymerase activity
纯化的DNA聚合酶Bst DNA Pol及其变体需要先对其活性进行检测,待确定具有聚合酶活性后,方可与小分子进行连接。将纯化的DNA聚合酶Bst DNA Pol及其变体与退火结合的引物-模板复合体、10mM MgCl 2、dNTP混合物和缓冲液(50mM NaCl、10mM Tris-HCl pH7.9、1mM DTT)混合,反应体系定容为10uL。在37℃条件下孵育30min,之后利用非变性的聚丙烯酰胺凝胶电泳进行检测。在与阴性对照组的比较中,引物-模板复合体分子量会增加,说明聚合酶具有酶活性将引物延伸至模板长度。如图3,泳道M为DNA Marker(Thermo Fisher),泳道PC为阳性对照,使用聚合酶Klenow(NEB),泳道NC为阴性对照,泳道CF为DNA聚合酶Bst DNA Pol Cys-Free,泳道D264C为DNA聚合酶Bst DNA Pol Cys-Free D264C突变体,泳道R334C为DNA聚合酶Bst DNA Pol Cys-Free R334C突变体,泳道L364C为DNA聚合酶Bst DNA Pol Cys-Free L364C突变体。 The purified DNA polymerase Bst DNA Pol and its variants need to be tested for its activity first, and can be linked with small molecules after confirming that it has polymerase activity. The purified DNA polymerase Bst DNA Pol and its variants were mixed with annealed bound primer-template complex, 10 mM MgCl 2 , dNTP mix and buffer (50 mM NaCl, 10 mM Tris-HCl pH7.9, 1 mM DTT), and the reaction The volume of the system is 10uL. Incubate at 37°C for 30 min, and then use non-denaturing polyacrylamide gel electrophoresis for detection. In comparison with the negative control, the molecular weight of the primer-template complex will increase, indicating that the polymerase has enzymatic activity to extend the primer to the template length. As shown in Figure 3, lane M is DNA Marker (Thermo Fisher), lane PC is a positive control, using polymerase Klenow (NEB), lane NC is a negative control, lane CF is DNA polymerase Bst DNA Pol Cys-Free, lane D264C is DNA polymerase Bst DNA Pol Cys-Free D264C mutant, lane R334C is DNA polymerase Bst DNA Pol Cys-Free R334C mutant, lane L364C is DNA polymerase Bst DNA Pol Cys-Free L364C mutant.
如图2-3显示,通过上述方法获得了较高纯度的聚合酶,且获得的聚合酶具有较高聚合酶活性。As shown in Figures 2-3, a higher-purity polymerase is obtained by the above method, and the obtained polymerase has a higher polymerase activity.
实施例2 dNTP-linker-聚合酶的制备Example 2 Preparation of dNTP-linker-polymerase
一、dNTP与linker连接1. dNTP and linker connection
采用摩尔比1:2将上述的dNTP(如图4A-4D)与linker(如图1)分子进行混合,避光室温震荡孵育2小时,利用乙酸乙酯沉淀连接产物,离心挥干上清,将沉淀置于-20℃冰箱保存。The above-mentioned dNTP (as shown in Figure 4A-4D) and the linker (as shown in Figure 1) were mixed in a molar ratio of 1:2, incubated in the dark at room temperature for 2 hours, the ligation product was precipitated with ethyl acetate, and the supernatant was evaporated by centrifugation. Store the pellet in a -20°C refrigerator.
二、小分子与聚合酶链接的制备2. Preparation of Small Molecules Linked to Polymerase
将上述连接产物小分子用DMSO溶解,按小分子摩尔比10:1与前述的表达纯化的N 末端6His标记的Bst DNA Pol及其变体的聚合酶蛋白进行混合,避光室温震荡孵育2小时。之后采用Zeba Spin Desalting Column(Thermo)除去溶液内多余的小分子。其中,dNTP与linker连接位点见图5。The above-mentioned ligation product small molecule was dissolved in DMSO, mixed with the aforementioned expressed and purified N-terminal 6His-labeled Bst DNA Pol and its variant polymerase protein at a small molecule molar ratio of 10:1, and incubated at room temperature for 2 hours in the dark. . Afterwards, Zeba Spin Desalting Column (Thermo) was used to remove excess small molecules in the solution. Among them, the junction site of dNTP and linker is shown in Figure 5.
使用质谱方法,取部分连接产物进行连接效果和效率的检测。如图6A-图6E。由图6A-图6E显示将dNTP通过linker连接到聚合酶上,获得了连有单分子碱基修饰的dNTP的聚合酶。Using mass spectrometry, a part of the ligation product was taken for detection of ligation effect and efficiency. Figures 6A-6E. Figures 6A-6E show that the dNTP is linked to the polymerase through the linker, and the polymerase linked with the dNTP modified by the single molecule base is obtained.
实施例3测序Example 3 Sequencing
按照反应模式图(图7),设计一段随机序列的DNA模板,其长度为60bp,从模板的3'端第20-22位为连续三个腺嘌呤(A),在模板的5'端加生物素标签,设计一段与模板3'端互补的引物序列,长度为19nt,序列见表1。将预先退火结合在一起的引物和模板与dNTP-linker-聚合酶在酶活性最适温度60℃下孵育5分钟进行延伸反应。将反应后的溶液放置在冰上,用波长为365nm-405nm的LED灯照射15分钟,对聚合酶与dNTP连接的linker进行光切割,使聚合酶脱离引物-模板复合体。将照射之后的反应体系与带有链霉亲和素标记的磁珠孵育5分钟,使延长了的引物-模板复合体结合在磁珠上。使用磁珠洗涤缓冲液(0.5M NaCl、20mM Tris-HCl pH 7.5、1mM EDTA)将被光切掉的聚合酶和未结合的dNTP-linker-聚合酶洗去。为了检测经过一轮反应之后,是否在引物上延伸了一个核苷酸,使用50mM KOH溶液将引物-模板复合体打开,将引物洗脱下来,再用等体积的50mM HCl溶液中和,然后利用15%尿素变性的PAGE胶进行检测。由于只有引物的5'端带有荧光基团(FAM),使用GE公司的Typhoon荧光扫胶仪可以检测尿素变性的PAGE胶上引物的大小,并拍照进行对比分析。According to the reaction pattern diagram (Fig. 7), design a DNA template with a random sequence of 60 bp in length. The 20-22nd position from the 3' end of the template is three consecutive adenines (A), and at the 5' end of the template, add Biotin tag, design a primer sequence complementary to the 3' end of the template, the length is 19nt, the sequence is shown in Table 1. The extension reaction was performed by incubating the pre-annealed primers and templates with dNTP-linker-polymerase at the optimum temperature for enzymatic activity, 60°C, for 5 minutes. The reacted solution was placed on ice and irradiated with an LED lamp with a wavelength of 365nm-405nm for 15 minutes, to perform photocleavage of the linker connected to the polymerase and dNTP, so that the polymerase was separated from the primer-template complex. The reaction system after irradiation was incubated with streptavidin-labeled magnetic beads for 5 minutes to allow the extended primer-template complexes to bind to the magnetic beads. Photocleaved polymerase and unbound dNTP-linker-polymerase were washed away using magnetic bead wash buffer (0.5 M NaCl, 20 mM Tris-HCl pH 7.5, 1 mM EDTA). In order to detect whether a nucleotide is extended on the primer after one round of reaction, the primer-template complex is opened with 50 mM KOH solution, the primer is eluted, neutralized with an equal volume of 50 mM HCl solution, and then used 15% urea denatured PAGE gel for detection. Since only the 5' end of the primers has a fluorophore (FAM), the size of the primers on the urea-denatured PAGE gel can be detected by using the Typhoon fluorescent gel scanner of GE Company, and photographed for comparative analysis.
表1引物与模板序列Table 1 Primer and template sequences
Figure PCTCN2021129461-appb-000029
Figure PCTCN2021129461-appb-000029
反应结果如图8泳道P为单独的引物;泳道P+T为引物-模板复合体;泳道3nt为引物 -模板复合体与单独的聚合酶Bst L364C和外加dTTP进行延伸反应,使引物长度增加3个核苷酸;标记+365泳道为引物-模板复合体与聚合酶Bst L364C-dUTP进行延伸反应,由于聚合酶与dUTP被linker连接,在延伸完成后不会脱离引物-模板复合体,阻碍了其他dUTP-linker-聚合酶结合在引物-模板复合体上,无法进行后续的延伸反应,因此只能延伸增加1个核苷酸,在延伸反应后经过365nm光照射后,引物与聚合酶分离,因此泳道中的条带为只延伸1个核苷酸的引物;标记-365泳道中反应条件与标记+365泳道相同,由于未使用365nm光进行切割,引物与聚合酶共价结合,变性胶无法打开,因此引物条带的分子量极高。The reaction results are shown in Figure 8. Lane P is a separate primer; lane P+T is a primer-template complex; lane 3nt is an extension reaction of the primer-template complex with a separate polymerase Bst L364C and dTTP to increase the primer length by 3 nucleotides; mark +365 lane is the primer-template complex and the polymerase Bst L364C-dUTP for extension reaction. Since the polymerase and dUTP are linked by the linker, it will not detach from the primer-template complex after the extension is completed, hindering the Other dUTP-linker-polymerases are bound to the primer-template complex and cannot carry out the subsequent extension reaction, so only one nucleotide can be added for extension. Therefore, the bands in the lanes are primers that only extend 1 nucleotide; the reaction conditions in the marker-365 lanes are the same as those in the marker+365 lanes. Since 365nm light is not used for cleavage, the primers are covalently bound to the polymerase, and the denatured gel cannot be used. open, so the molecular weight of the primer band is extremely high.
结果显示连有dUTP的聚合酶Bst L364C具有较高的聚合酶活性,无法脱离引物-模板复合体的聚合酶Bst L364C有效的阻碍了后续的延伸反应,且阻碍基团能够被365nm光切掉,解除阻碍。The results showed that the dUTP-linked polymerase Bst L364C had high polymerase activity, and the polymerase Bst L364C that could not escape the primer-template complex effectively hindered the subsequent extension reaction, and the hindering group could be cleaved by 365nm light. Remove obstacles.

Claims (28)

  1. 一种核酸测序方法,所述的测序方法包括如下步骤:A nucleic acid sequencing method comprising the steps of:
    (1)提供聚合酶,所述的聚合酶通过可断裂基团与dNTP连接,并且所述的聚合酶可发出光信号;(1) providing a polymerase, the polymerase is connected to the dNTP through a cleavable group, and the polymerase can emit a light signal;
    (2)使所述聚合酶与待测核酸模板-引物复合体的引物3'端接触,所述聚合酶上的dNTP与待测核酸的序列上的碱基互补配对并在聚合酶酶促反应加入引物3'端;(2) contacting the polymerase with the 3' end of the primer of the nucleic acid template-primer complex to be tested, the dNTPs on the polymerase are complementary to the bases on the sequence of the nucleic acid to be tested, and enzymatically react with the polymerase Add primer 3' end;
    (3)检测所述聚合酶发出的光信号。(3) Detecting the light signal emitted by the polymerase.
  2. 根据权利要求1所述的一种核酸测序方法,其特征在于,所述聚合酶选自Bst DNA Pol及其变体、DNA Pol I及其变体、DNA Polγ及其变体、T3 DNA Pol及其变体、T5 DNA Pol及其变体、L5 DNA Pol及其变体、DNA Po1 II及其变体、DNA Pol B及其变体、DNA Polα及其变体、DNA Pol△及其变体、DNA Polε及其变体、DNA Pol III及其变体、DNA Polβ及其变体、DNA Polσ及其变体、DNA Polλ及其变体、DNA Polμ及其变体、DNA Pol IV及其变体、DNA Pol V及其变体、末端转移酶及其变体中的一种或者两种以上。A nucleic acid sequencing method according to claim 1, wherein the polymerase is selected from the group consisting of Bst DNA Pol and its variants, DNA Pol I and its variants, DNA Polγ and its variants, T3 DNA Pol and Its variants, T5 DNA Pol and its variants, L5 DNA Pol and its variants, DNA Po1 II and its variants, DNA Pol B and its variants, DNA Polα and its variants, DNA Pol△ and its variants , DNA Polε and its variants, DNA Pol III and its variants, DNA Polβ and its variants, DNA Polσ and its variants, DNA Polλ and its variants, DNA Polμ and its variants, DNA Pol IV and its variants One or more of DNA Pol V and its variants, terminal transferase and its variants.
  3. 根据权利要求2所述的一种核酸测序方法,其特征在于,所述聚合酶为DNA Polβ变体,所述的DNA Polβ变体为DNA Polβ在一个或多个氨基酸序列位置的突变,所述的突变包含取代、缺失和/或插入,或者所述的DNA Polβ变体与DNA Polβ编码序列具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的DNA Polβ变体具有聚合酶活性。A nucleic acid sequencing method according to claim 2, wherein the polymerase is a DNA Polβ variant, and the DNA Polβ variant is a mutation of DNA Polβ at one or more amino acid sequence positions, and the DNA Polβ variant is The mutation comprises substitution, deletion and/or insertion, or the DNA Polβ variant has at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the DNA Polβ variant has polymerase activity.
  4. 根据权利要求3所述的一种核酸测序方法,其特征在于,所述的DNA Polβ的变体是将SEQ ID NO:1第145-355位的任意一个、两个或三个半胱氨酸突变为非半胱氨酸,并且将第1-236位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A nucleic acid sequencing method according to claim 3, wherein the variant of the DNA Polβ is any one, two or three cysteines from positions 145-355 of SEQ ID NO: 1 Mutations to non-cysteines and any one, two or three non-cysteines at positions 1-236 are mutated to cysteines.
  5. 根据权利要求4所述的一种核酸测序方法,其特征在于,所述的DNA Polβ的变体是将SEQ ID NO:1第145-355位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第20-154位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A nucleic acid sequencing method according to claim 4, wherein the variant of the DNA Polβ is any one, two or three cysteines at positions 145-355 of SEQ ID NO: 1 Mutation to serine and any one, two or three non-cysteines at positions 20-154 were mutated to cysteine.
  6. 根据权利要求5所述的一种核酸测序方法,其特征在于,所述的DNA Polβ的变体是将SEQ ID NO:1的第178位、第239位和/或第267位的半胱氨酸突变为丝氨酸,并 且将第28位、第33位和/或第149位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A nucleic acid sequencing method according to claim 5, wherein the variant of the DNA Polβ is the cysteine at the 178th position, the 239th position and/or the 267th position of SEQ ID NO: 1 Acids were mutated to serines and any one, two or three non-cysteines at positions 28, 33 and/or 149 were mutated to cysteines.
  7. 根据权利要求2所述的一种核酸测序方法,其特征在于,所述聚合酶为Bst DNA Pol变体,所述的Bst DNA Pol变体为Bst DNA Pol在一个或多个氨基酸序列位置的突变,所述的突变包含取代、缺失和/或插入,或者所述的Bst DNA Pol变体与Bst DNA Pol编码序列具有至少70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、99.5%以上的序列同一性,并且所述的Bst DNA Pol变体具有聚合酶活性。A nucleic acid sequencing method according to claim 2, wherein the polymerase is a Bst DNA Pol variant, and the Bst DNA Pol variant is a mutation of the Bst DNA Pol at one or more amino acid sequence positions , the mutation comprises a substitution, deletion and/or insertion, or the Bst DNA Pol variant and the Bst DNA Pol coding sequence have at least 70%, 75%, 80%, 85%, 90%, 91%, 92% %, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5% or more sequence identity, and the Bst DNA Pol variant has polymerase activity.
  8. 根据权利要求7所述的一种核酸测序方法,其特征在于,所述的Bst DNA Pol的变体是将SEQ ID NO:2的第93位和/或第550位的半胱氨酸突变为丝氨酸,并且将第264位、第334位和/或第364位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A nucleic acid sequencing method according to claim 7, wherein the variant of the Bst DNA Pol is to mutate the cysteine at the 93rd position and/or the 550th position of SEQ ID NO: 2 to serine, and mutating any one, two or three non-cysteines at positions 264, 334 and/or 364 to cysteines.
  9. 根据权利要求1所述的一种核酸测序方法,其特征在于,所述的可断裂基团含有可断裂结构X,所述的X选自:A nucleic acid sequencing method according to claim 1, wherein the cleavable group contains a cleavable structure X, and the X is selected from:
    Figure PCTCN2021129461-appb-100001
    Figure PCTCN2021129461-appb-100001
    其中,j为1-3的整数,where j is an integer from 1 to 3,
    R选自H、C 1-10直链或支链烷基、C 2-10烯基、C 2-10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1-10烷氧基,芳氧基,含有 N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1-10直链或支链烷基, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl, C 2-10 alkynyl, C 3-10 cycloalkyl, aryl, hetero N, O or S containing Cycloaryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl)(C 0-10 alkyl), -CON(C 0-10 alkyl)(C 0-10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1-10 alkoxy, aryloxy, containing N, O or S heteroaryloxy group, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N (C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, -S hetero Cycloalkyl, -N heterocyclic aryl, -O heterocyclic aryl, -S heterocyclic aryl, R 1 is selected from H, O 0-10 alkyl, C 1-10 straight or branched chain alkyl,
    R'选自N或O。R' is selected from N or O.
  10. 根据权利要求9所述的一种核酸测序方法,其特征在于,所述的可断裂基团具有L1-L2-X-L3-L4结构,其中,L1为与dNTP或修饰的dNTP结合的端基,L2和L3不存在或为不可断裂连接基团,L4为与聚合酶结合的端基。The nucleic acid sequencing method according to claim 9, wherein the cleavable group has a structure of L1-L2-X-L3-L4, wherein L1 is an end group bound to dNTP or modified dNTP , L2 and L3 are absent or non-cleavable linking groups, and L4 is the end group bound to the polymerase.
  11. 根据权利要求10所述的一种核酸测序方法,其特征在于,所述的L1或L4独立的选自马来酰亚胺基、羧基、巯基、叠氮基、炔基、环辛炔基及其衍生物、四嗪基、二硫吡啶基、乙烯基、烯砜基、琥珀酰亚胺基、醛基、酰肼基、胺氧基和α-卤代羰基。A nucleic acid sequencing method according to claim 10, wherein said L1 or L4 is independently selected from maleimide, carboxyl, sulfhydryl, azide, alkynyl, cyclooctynyl and Its derivatives, tetrazinyl, dithiopyridyl, vinyl, alkenyl, succinimidyl, aldehyde, hydrazide, amineoxy and α-halogenated carbonyl.
  12. 根据权利要求10所述的一种核酸测序方法,其特征在于,所述的L2或L3独立的选自-O(CH 2CH 2O) i-、-(CH 2) i-、-(CH 2) iNH(CH 2) i-、-(CH 2) iCOO(CH 2) i-、-(CH 2) iCONH(CH 2) i-、-(CH 2) iO(CH 2) i-、-(CH 2) iCO(CH 2) i-或
    Figure PCTCN2021129461-appb-100002
    中的一个或二个以上基团的组合,所述的i选自0-10的整数。
    The nucleic acid sequencing method according to claim 10, wherein said L2 or L3 is independently selected from -O(CH 2 CH 2 O) i -, -(CH 2 ) i -, -(CH 2 ) i NH(CH 2 ) i -, -(CH 2 ) i COO(CH 2 ) i -, -(CH 2 ) i CONH(CH 2 ) i -, -(CH 2 ) i O(CH 2 ) i -, -(CH 2 ) i CO(CH 2 ) i - or
    Figure PCTCN2021129461-appb-100002
    One or a combination of two or more groups in , the i is selected from an integer of 0-10.
  13. 根据权利要求1所述的一种核酸测序方法,其特征在于,所述聚合酶连接有一个或多个荧光标记物、磷光标记物或者化学发光标记物。The nucleic acid sequencing method according to claim 1, wherein the polymerase is linked with one or more fluorescent labels, phosphorescent labels or chemiluminescent labels.
  14. 根据权利要求13所述的一种核酸测序方法,其特征在于,所述的化学发光标记物选自辣根过氧化酶、碱性磷酸酶、荧光素酶及其衍生物,吖啶酯类、过氧化草酸酯类、洛粉碱、光泽精、鲁米诺及其衍生物,催化化学发光标记物的金属离子复合物、电催化化学发光标记物、苯并二呋喃、次甲基、三苯甲烷、吖嗪、三吩嗪、萘二甲酰亚胺、吡唑、萘醌、蒽醌、单偶氮和双偶氮以及上述基团的衍生物,具有可见光区吸收带的苯的衍生物,具有紫外吸收带的C=C、C=O、-N=N-、-NO 2、-C=S;所述的荧光标记物选自荧光素,Cy2,Cy3,Cy5,Cy7,Alexa Fluor系列染料,异硫氰酸荧光素,5-六氯荧光素氨基磷酸酯,6- 羧基-2',4,7,7'-四氯荧光素琥珀酰亚胺酯,6-羧基-4',5'-二氯-2',7'-二甲氧基荧光素琥珀酰亚胺酯,德克萨斯红,罗丹明110,荧光素马来酰亚胺染料,氟硼二吡咯,呫吨、羰花青、1,1'-二(十八烷基)-3,3,3',3'-四甲基吲哚羰花青高氯酸盐、3,3'-二(十八烷基)-氧杂羰花青高氯酸盐、芘、酞菁、6-羧基罗丹明6G、异硫氰酸荧光素、6-羧基荧光素琥珀酰亚胺酯、5-羧基荧光素琥珀酰亚胺酯、5-羧基荧光素、6-羧基荧光素、罗丹明B、罗丹明6G、7-氨基-4-甲基香豆素、二氢罗丹明123、四甲基罗丹明-6-马来酰亚胺、四甲基罗丹明-5-马来酰亚胺、5-吲哚乙酰氨基荧光素、双[N、N-双(羧甲基)氨甲基]荧光素四钠盐、荧光素-5-马来酰亚胺、磺基罗丹明G、7-羟基-4-甲基香豆素、3-氰基-7-羟基香豆素、荧光素二钠盐、四甲基罗丹明-6-异硫氰酸、6-羧基-X-罗丹明琥珀酰亚胺酯、5-羧基-X-罗丹明琥珀酰亚胺酯、6-羧基-X-罗丹明、5-羧基四甲基罗丹明琥珀酰亚胺酯、6-羧基四甲基罗丹明、5-羧基四甲基罗丹明、产生能量转移染料及荧光蛋白,例如GFP(绿色荧光蛋白),CFP(青色荧光蛋白),BFP(蓝色荧光蛋白),YFP(黄色荧光蛋白)及衍生物中的一种或多种;所述的磷光标记物包括过渡金属铱(Ir)或钌(Ru)的氮杂芳基络合物。 A nucleic acid sequencing method according to claim 13, wherein the chemiluminescent marker is selected from the group consisting of horseradish peroxidase, alkaline phosphatase, luciferase and derivatives thereof, acridine esters, Peroxyoxalates, Lofenine, Lucigenin, Luminol and its derivatives, Metal ion complexes of catalytic chemiluminescent markers, Electrocatalytic chemiluminescent markers, benzodifuran, methine, triphenyl Methane, azine, triphenazine, naphthalimide, pyrazole, naphthoquinone, anthraquinone, monoazo and bisazo and derivatives of the above groups, derivatives of benzene with absorption bands in the visible region , C=C, C=O, -N=N-, -NO 2 , -C=S with ultraviolet absorption band; the fluorescent label is selected from fluorescein, Cy2, Cy3, Cy5, Cy7, Alexa Fluor Series Dyes, Fluorescein isothiocyanate, 5-Hexachlorofluorescein phosphoramidate, 6-carboxy-2',4,7,7'-tetrachlorofluorescein succinimidyl ester, 6-carboxy-4',5'-Dichloro-2',7'-dimethoxyfluorescein succinimidyl ester, Texas red, Rhodamine 110, Fluorescein maleimide dye, fluoroborodipyrrole, xanthium Tonne, carbocyanine, 1,1'-bis(octadecyl)-3,3,3',3'-tetramethylindocarbocyanine perchlorate, 3,3'-bis(decyl) Octaalkyl)-oxacarbocyanine perchlorate, pyrene, phthalocyanine, 6-carboxyrhodamine 6G, fluorescein isothiocyanate, 6-carboxyfluorescein succinimidyl ester, 5-carboxyfluorescein Succinimidyl ester, 5-carboxyfluorescein, 6-carboxyfluorescein, rhodamine B, rhodamine 6G, 7-amino-4-methylcoumarin, dihydrorhodamine 123, tetramethylrhodamine- 6-maleimide, tetramethylrhodamine-5-maleimide, 5-indoleacetamidofluorescein, bis[N,N-bis(carboxymethyl)aminomethyl]fluorescein tetra sodium salt, fluorescein-5-maleimide, sulforhodamine G, 7-hydroxy-4-methylcoumarin, 3-cyano-7-hydroxycoumarin, fluorescein disodium salt, Tetramethylrhodamine-6-isothiocyanate, 6-carboxy-X-rhodamine succinimide ester, 5-carboxy-X-rhodamine succinimide ester, 6-carboxy-X-rhodamine, 5-Carboxytetramethylrhodamine succinimide ester, 6-carboxytetramethylrhodamine, 5-carboxytetramethylrhodamine, produce energy transfer dyes and fluorescent proteins such as GFP (green fluorescent protein), CFP ( One or more of cyan fluorescent protein), BFP (blue fluorescent protein), YFP (yellow fluorescent protein) and derivatives; the phosphorescent label includes nitrogen of transition metal iridium (Ir) or ruthenium (Ru) Heteroaryl complexes.
  15. 根据权利要求1所述的一种核酸测序方法,其特征在于,所述的步骤(1)中提供通过可断裂基团与dNTP连接的聚合酶为提供四种连接有不同dNTP的聚合酶,并且所述的四种连接有不同dNTP的聚合酶连接有不同的荧光标记物、化学发光标记物或者磷光标记物。A nucleic acid sequencing method according to claim 1, wherein the step (1) provides that the polymerase connected to the dNTP through the cleavable group is to provide four polymerases connected with different dNTPs, and The four polymerases linked with different dNTPs are linked with different fluorescent labels, chemiluminescent labels or phosphorescent labels.
  16. 根据权利要求1所述的一种核酸测序方法,其特征在于,所述的步骤(2)中待测核酸固定在支持物上。The nucleic acid sequencing method according to claim 1, wherein in the step (2), the nucleic acid to be detected is immobilized on a support.
  17. 根据权利要求1所述的一种核酸测序方法,其特征在于,所述的核酸测序方法中还包括步骤(4)使聚合酶与dNTP之间断裂。The nucleic acid sequencing method according to claim 1, wherein the nucleic acid sequencing method further comprises step (4) of cleaving between the polymerase and the dNTP.
  18. 根据权利要求17所述的一种核酸测序方法,其特征在于,所述的核酸测序方法还包括步骤(5):重复步骤(1)-(4)。The nucleic acid sequencing method according to claim 17, wherein the nucleic acid sequencing method further comprises step (5): repeating steps (1)-(4).
  19. 根据权利要求1所述的一种核酸测序方法,其特征在于,所述的核酸测序方法还包括核酸样品前处理,所述的核酸样品前处理包括核酸样品文库构建及扩增。The nucleic acid sequencing method according to claim 1, wherein the nucleic acid sequencing method further comprises nucleic acid sample pretreatment, and the nucleic acid sample pretreatment includes nucleic acid sample library construction and amplification.
  20. 一种聚合酶,其特征在于,所述的聚合酶是将SEQ ID NO:1第145-355位的任意一个、两个或三个半胱氨酸突变为非半胱氨酸,并且将第1-236位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A polymerase, characterized in that the polymerase mutates any one, two or three cysteines at positions 145 to 355 of SEQ ID NO: 1 to non-cysteines, and mutates the first cysteines to non-cysteines. Any one, two or three non-cysteines at positions 1-236 were mutated to cysteine.
  21. 根据权利要求20所述的一种聚合酶,其特征在于,所述的聚合酶是将SEQ ID NO:1第145-355位的任意一个、两个或三个半胱氨酸突变为丝氨酸,并且将第20-154位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A polymerase according to claim 20, wherein the polymerase mutates any one, two or three cysteines from positions 145 to 355 of SEQ ID NO: 1 to serine, And mutate any one, two or three non-cysteines at positions 20-154 to cysteine.
  22. 根据权利要求21所述的一种聚合酶,其特征在于,所述的聚合酶是将SEQ ID NO:1的第178位、第239位和/或第267位的半胱氨酸突变为丝氨酸,并且将第28位、第33位和/或第149位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A polymerase according to claim 21, wherein the polymerase mutates the cysteine at position 178, 239 and/or 267 of SEQ ID NO: 1 to serine , and any one, two or three non-cysteines at positions 28, 33 and/or 149 are mutated to cysteine.
  23. 一种聚合酶,其特征在于,所述的聚合酶是将SEQ ID NO:2的第85-100或540-570位的任意一个、两个或三个半胱氨酸突变为非半胱氨酸,并且将第250-280或320-380位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。A polymerase, characterized in that the polymerase mutates any one, two or three cysteines at positions 85-100 or 540-570 of SEQ ID NO: 2 to non-cysteine acid and mutate any one, two or three non-cysteines at positions 250-280 or 320-380 to cysteines.
  24. 根据权利要求23所述的聚合酶,其特征在于,所述的聚合酶是将SEQ ID NO:2的第93位和/或第550位的半胱氨酸突变为丝氨酸,并且将第264位、第334位和/或第364位的任意一个、两个或三个非半胱氨酸突变为半胱氨酸。The polymerase according to claim 23, wherein the polymerase mutates the cysteine at the 93rd position and/or the 550th position of SEQ ID NO: 2 to serine, and mutates the 264th position , any one, two or three non-cysteines at positions 334 and/or 364 are mutated to cysteine.
  25. 一种聚合酶复合物,其特征在于,所述的聚合酶复合物为通过可断裂基团将聚合酶与dNTP连接制备得到。A polymerase complex is characterized in that, the polymerase complex is prepared by connecting the polymerase and dNTP through a cleavable group.
  26. 根据权利要求25所述的一种聚合酶复合物,其特征在于,所述的可断裂基团含有可断裂结构X,所述的X选自:A polymerase complex according to claim 25, wherein the cleavable group contains a cleavable structure X, and the X is selected from:
    Figure PCTCN2021129461-appb-100003
    Figure PCTCN2021129461-appb-100003
    其中,R选自H、C 1-10直链或支链烷基、C 2-10烯基、C 2-10炔基、C 3-10环烷基、芳基、含有N、O或S的杂环芳基、含有N、O或S的杂环烷基、-N(C 0-10烷基)(C 0-10烷基)、-CON(C 0-10烷基)(C 0-10烷基)、-N(C 0-10烷基)CO(C 0-10烷基)、COR 1、CN,C 1-10烷氧基,芳氧基,含有N、O或S杂芳氧基,上述基团中的H可被以下基团取代:卤素、-CN、-OCH 2F、-OCHF 2、-OCF 3、-OH、C 1-10直链或支链烷基、-N(C 0-10烷基)(C 0-10烷基)、-OC 0-10烷基、C 3-10环烷基、-O杂环烷基、-N杂环烷基、-S杂环烷基、-N杂环芳香基、-O杂环芳香基、-S杂环芳香基,R 1选自H、OC 0-10烷基、C 1-10直链或支链烷基, Wherein, R is selected from H, C 1-10 straight or branched chain alkyl, C 2-10 alkenyl, C 2-10 alkynyl, C 3-10 cycloalkyl, aryl, containing N, O or S Heterocyclic aryl, heterocycloalkyl containing N, O or S, -N(C 0-10 alkyl) (C 0-10 alkyl), -CON(C 0-10 alkyl) (C 0 -10 alkyl), -N(C 0-10 alkyl) CO(C 0-10 alkyl), COR 1 , CN, C 1-10 alkoxy, aryloxy, containing N, O or S hetero Aryloxy, the H in the above groups can be substituted by the following groups: halogen, -CN, -OCH 2 F, -OCHF 2 , -OCF 3 , -OH, C 1-10 straight or branched chain alkyl, -N(C 0-10 alkyl) (C 0-10 alkyl), -OC 0-10 alkyl, C 3-10 cycloalkyl, -O heterocycloalkyl, -N heterocycloalkyl, - S heterocycloalkyl, -N heterocyclic aryl, -O heterocyclic aryl, -S heterocyclic aryl, R 1 is selected from H, O 0-10 alkyl, C 1-10 straight chain or branched alkane base,
    R'选自N或O。R' is selected from N or O.
  27. 一种核酸的合成方法,其特征在于,所述的核酸的合成方法包括:A method for synthesizing nucleic acid, wherein the method for synthesizing nucleic acid comprises:
    (1)提供聚合酶,所述的聚合酶通过可断裂基团与dNTP连接;(1) providing a polymerase, the polymerase is connected to the dNTP through a cleavable group;
    (2)将所述聚合酶与待测核酸的模板-引物复合体的引物3'端接触,所述聚合酶上的dNTP与待测核酸的序列上的碱基互补配对并在聚合酶酶促反应加入引物3'端上。(2) contacting the polymerase with the 3' end of the primer of the template-primer complex of the nucleic acid to be tested, the dNTP on the polymerase is complementary to the bases on the sequence of the nucleic acid to be tested and enzymatically catalyzed by the polymerase The reaction was added to the 3' end of the primer.
  28. 根据权利要求27所述的一种核酸的合成方法,其特征在于,所述的核酸的合成方法还包括步骤(3)使聚合酶与dNTP之间断裂;及(4)重复(1)-(3)步骤。The method for synthesizing nucleic acid according to claim 27, wherein the method for synthesizing nucleic acid further comprises step (3) cleaving between the polymerase and dNTP; and (4) repeating (1)-( 3) step.
PCT/CN2021/129461 2020-11-17 2021-11-09 Nucleic acid sequencing method WO2022105640A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011289898.0A CN112322715B (en) 2020-11-17 2020-11-17 Nucleic acid sequencing method
CN202011289898.0 2020-11-17

Publications (1)

Publication Number Publication Date
WO2022105640A1 true WO2022105640A1 (en) 2022-05-27

Family

ID=74322615

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/129461 WO2022105640A1 (en) 2020-11-17 2021-11-09 Nucleic acid sequencing method

Country Status (2)

Country Link
CN (1) CN112322715B (en)
WO (1) WO2022105640A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112322715B (en) * 2020-11-17 2022-11-25 清华大学 Nucleic acid sequencing method

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101636406A (en) * 2006-12-05 2010-01-27 激光基因公司 The Nucleotide of labeled nucleotide that light can rupture and nucleosides and mark and nucleosides with and using method in dna sequencing
WO2010117804A2 (en) * 2009-03-30 2010-10-14 The Regents Of The University Of California Mostly natural dna sequencing by synthesis
US20110212437A1 (en) * 2010-02-18 2011-09-01 Pacific Biosciences Of California, Inc. Single molecule sequencing with two distinct chemistry steps
CN105001292A (en) * 2015-07-14 2015-10-28 深圳市瀚海基因生物科技有限公司 Light-fractured fluorescence-labeling reversible terminal compound and use thereof in DNA (Deoxyribonucleic Acid) or RNA (Ribonucleic Acid) sequencing
CN108795900A (en) * 2017-04-27 2018-11-13 深圳华大智造科技有限公司 Archaeal dna polymerase and preparation method thereof
US20190078065A1 (en) * 2017-09-08 2019-03-14 Sigma-Aldrich Co. Llc Modified dna polymerases
CN109477080A (en) * 2016-06-14 2019-03-15 Dna斯克瑞普特公司 The variant of the archaeal dna polymerase of polX family
CN109642255A (en) * 2016-06-24 2019-04-16 加利福尼亚大学董事会 Use the nucleic acid synthesis and sequencing of the ribonucleoside triphosphote tied
CN112322715A (en) * 2020-11-17 2021-02-05 清华大学 Nucleic acid sequencing method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CZ293215B6 (en) * 1996-08-06 2004-03-17 F. Hoffmann-La Roche Ag Enzyme of thermally stable DNA polymerase, process of its preparation and a pharmaceutical composition and a kit containing thereof
CN106244712B (en) * 2016-08-31 2019-11-19 北京大学 DNA sequencing method

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101636406A (en) * 2006-12-05 2010-01-27 激光基因公司 The Nucleotide of labeled nucleotide that light can rupture and nucleosides and mark and nucleosides with and using method in dna sequencing
WO2010117804A2 (en) * 2009-03-30 2010-10-14 The Regents Of The University Of California Mostly natural dna sequencing by synthesis
US20110212437A1 (en) * 2010-02-18 2011-09-01 Pacific Biosciences Of California, Inc. Single molecule sequencing with two distinct chemistry steps
CN105001292A (en) * 2015-07-14 2015-10-28 深圳市瀚海基因生物科技有限公司 Light-fractured fluorescence-labeling reversible terminal compound and use thereof in DNA (Deoxyribonucleic Acid) or RNA (Ribonucleic Acid) sequencing
CN109477080A (en) * 2016-06-14 2019-03-15 Dna斯克瑞普特公司 The variant of the archaeal dna polymerase of polX family
CN109642255A (en) * 2016-06-24 2019-04-16 加利福尼亚大学董事会 Use the nucleic acid synthesis and sequencing of the ribonucleoside triphosphote tied
CN108795900A (en) * 2017-04-27 2018-11-13 深圳华大智造科技有限公司 Archaeal dna polymerase and preparation method thereof
US20190078065A1 (en) * 2017-09-08 2019-03-14 Sigma-Aldrich Co. Llc Modified dna polymerases
CN112322715A (en) * 2020-11-17 2021-02-05 清华大学 Nucleic acid sequencing method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SERRAO ERIK, CHEREPANOV PETER, ENGELMAN ALAN N.: "Amplification, Next-generation Sequencing, and Genomic DNA Mapping of Retroviral Integration Sites", JOURNAL OF VISUALIZED EXPERIMENTS, vol. 109379153840, no. 109, 22 March 2016 (2016-03-22), pages 53840, XP055932288, DOI: 10.3791/53840 *

Also Published As

Publication number Publication date
CN112322715B (en) 2022-11-25
CN112322715A (en) 2021-02-05

Similar Documents

Publication Publication Date Title
US11827932B2 (en) Methods and compositions for nucleic acid sequencing
US10059986B2 (en) Reversible terminator molecules and methods of their use
JP3415627B2 (en) Fluorescence detection assays for homogeneous PCR hybridization systems
JP5085837B2 (en) Terminal phosphate-labeled nucleotide and method of use
US20180327829A1 (en) Super-Resolution Sequencing
US20060160125A1 (en) Compositions and methods for nonenzymatic ligation of oligonucleotides and detection of genetic polymorphisms
KR102592367B1 (en) Systems and methods for clonal replication and amplification of nucleic acid molecules for genomic and therapeutic applications
JP2004509613A (en) Asynchronous stimulus PCR
KR20180057702A (en) Polymethine compounds and their use as fluorescent labels
JP2005501567A5 (en)
ES2805874T3 (en) Enzymatic ligation of nucleic acids
JP2020530988A (en) High-strength labeled reactant composition and method for sequencing
WO2023060858A1 (en) Monochromatic fluorescence mrt gene sequencing reagent and method based on environment-sensitive dye
WO2020227953A1 (en) Single-channel sequencing method based on self-luminescence
WO2022105640A1 (en) Nucleic acid sequencing method
WO2021216868A1 (en) Methods for detecting and sequencing a target nucleic acid
WO2021031109A1 (en) Method for sequencing polynucleotides on basis of optical signal dynamics of luminescent label and secondary luminescent signal
Qiu et al. Design and synthesis of cleavable biotinylated dideoxynucleotides for DNA sequencing by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
WO2020073274A1 (en) Method for sequencing polynucleotide
US20240141428A1 (en) Methods and compositions for nucleic acid sequencing
JPS613063A (en) Measurement of polynucleotide utilizing luminous substance
Chaudhuri AA-Dutp-Cy3 a novel fluorescent labeling agent for nucleotides
Guo Design and synthesis of novel nucleotide analogs and protein conjugates for DNA sequencing

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21893787

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21893787

Country of ref document: EP

Kind code of ref document: A1