WO2020243207A1 - Methods to identify components in nucleic acid sequences - Google Patents

Methods to identify components in nucleic acid sequences Download PDF

Info

Publication number
WO2020243207A1
WO2020243207A1 PCT/US2020/034759 US2020034759W WO2020243207A1 WO 2020243207 A1 WO2020243207 A1 WO 2020243207A1 US 2020034759 W US2020034759 W US 2020034759W WO 2020243207 A1 WO2020243207 A1 WO 2020243207A1
Authority
WO
WIPO (PCT)
Prior art keywords
pol
polymerase
nanostructure
dna
group
Prior art date
Application number
PCT/US2020/034759
Other languages
French (fr)
Inventor
Peiming Zhang
Ming Lei
Original Assignee
Universal Sequencing Technology Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Universal Sequencing Technology Corporation filed Critical Universal Sequencing Technology Corporation
Priority to EP20814921.1A priority Critical patent/EP3976814A4/en
Priority to JP2021570366A priority patent/JP2022535746A/en
Priority to CN202080053393.6A priority patent/CN114555832A/en
Priority to US17/595,758 priority patent/US20220251638A1/en
Priority to KR1020217042311A priority patent/KR20220012920A/en
Publication of WO2020243207A1 publication Critical patent/WO2020243207A1/en

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/483Physical analysis of biological material
    • G01N33/487Physical analysis of biological material of liquid biological material
    • G01N33/48785Electrical and electronic details of measuring devices for physical analysis of liquid biological material not specific to a particular test method, e.g. user interface or power supply
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N27/00Investigating or analysing materials by the use of electric, electrochemical, or magnetic means
    • G01N27/26Investigating or analysing materials by the use of electric, electrochemical, or magnetic means by investigating electrochemical variables; by using electrolysis or electrophoresis
    • G01N27/28Electrolytic cell components
    • G01N27/30Electrodes, e.g. test electrodes; Half-cells
    • G01N27/327Biochemical electrodes, e.g. electrical or mechanical details for in vitro measurements
    • G01N27/3275Sensing specific biomolecules, e.g. nucleic acid strands, based on an electrode surface reaction
    • G01N27/3278Sensing specific biomolecules, e.g. nucleic acid strands, based on an electrode surface reaction involving nanosized elements, e.g. nanogaps or nanoparticles
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/483Physical analysis of biological material
    • G01N33/487Physical analysis of biological material of liquid biological material
    • G01N33/48707Physical analysis of biological material of liquid biological material by electrical means
    • G01N33/48721Investigating individual macromolecules, e.g. by translocation through nanopores
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B82NANOTECHNOLOGY
    • B82YSPECIFIC USES OR APPLICATIONS OF NANOSTRUCTURES; MEASUREMENT OR ANALYSIS OF NANOSTRUCTURES; MANUFACTURE OR TREATMENT OF NANOSTRUCTURES
    • B82Y15/00Nanotechnology for interacting, sensing or actuating, e.g. quantum dots as markers in protein assays or molecular motors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2565/00Nucleic acid analysis characterised by mode or means of detection
    • C12Q2565/50Detection characterised by immobilisation to a surface
    • C12Q2565/531Detection characterised by immobilisation to a surface characterised by the capture moiety being a protein for target oligonucleotides
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2565/00Nucleic acid analysis characterised by mode or means of detection
    • C12Q2565/60Detection means characterised by use of a special device
    • C12Q2565/607Detection means characterised by use of a special device being a sensor, e.g. electrode
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/88Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86
    • G01N2030/8809Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample
    • G01N2030/8813Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample biological materials
    • G01N2030/8827Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample biological materials involving nucleic acids

Definitions

  • Embodiments of the present invention are related to methods and biochemical materials for an electronic sequencing device to read out individual nucleotides in a nucleic acid sequence using enzymes.
  • a prior art discloses a method to detect the nucleotide incorporation into DNA by monitoring the conformational changes of a DNA polymerase labeled with fluorescent dyes. Depending on the enzymes and dyes, different nucleotides, including those naturally occurring and modified, produced different amplitudes and durations for the fluorescent emissions. The method determines a nucleotide sequence from an ensemble of nucleic acid molecules.
  • a prior art application (WO 2016/183218) claims that a mixture in which one or more of the native nucleotide triphosphates is replaced with an analog having a non natural moiety that alters signal polarity in a distinguishable way without hurting the ability of the analog to base pair with its cognate nucleotide in a template strand during sequencing.
  • a-thio-dATP results in a negative change in signal polarity
  • 2-thio-dTTP results in a positive change in signal polarity so that they can be used to distinguish between the T and A in a template by means of the said device charge sensor (see Figure 2 for their structures).
  • TS transition state
  • Figure 1 a prior art device composed of a carbon nanotube attached to two electrodes (source and drain) and functionalized with a DNA polymerase for monitoring the enzyme activity in real-time.
  • Figure 2 Chemical structure of modified nucleotides a-thio-dATP and 2-thio- dTTP.
  • Figure 3 A schematic diagram of a single molecule DNA sequencing device with polymerase on a nanostructure attached to two electrodes, (a) a DNA
  • nanostructure (b) a peptide nanostructure.
  • Figure 4 (a) Free energy profile for single nucleotide incorporation by different DNA polymerase Pol b WT, R258A mutant, KF, and Pol X; (b) Qualitative free energy profile of matched and mismatched dNTP incorporation by Pol b versus I260Q mutant.
  • Figure 5 the reaction in incorporating a nucleotide substrate to a DNA chain.
  • Figure 7 Chemical structures of naturally occurring nucleoside g-substituted triphosphates.
  • Figure 8 Chemical structures of b, g-C analogies of naturally occurring nucleoside triphosphates.
  • Figure 9 Chemical structures of naturally occurring nucleoside a-thio- triphosphates (a-thio-dNTP).
  • Figure 10 Chemical structures of naturally occurring nucleoside a-borano- triphosphates (a-borano-dNTP).
  • Figure 1 1 Chemical structures of naturally occurring nucleoside a-borano-a- thio-tri phosphates (a-borano-a-thio-dNTPs).
  • Figure 12 Chemical structures of naturally occurring nucleoside a-seleno- triphosphates (a-seleno-dNTP).
  • Figure 13 Chemical structures of naturally occurring deoxyribonucleoside a- R-phosphonyl-b, g-diphosphates.
  • Figure 14 Chemical structures of naturally occurring nucleoside
  • Figure 15 Chemical structures of naturally occurring nucleoside
  • Figure 17 Chemical structures of representative xeno nucleic acid (XNA) nucleosides.
  • Figure 18 Diagram of Watson-Crick base pairs and modification sites in this invention.
  • Figure 19 Chemical structures of modified pyrimidine nucleobases.
  • Figure 20 Chemical structures of modified purine nucleobases.
  • This invention includes a biopolymer nanostructure coupling with a DNA polymerase as an electronic sensor for nucleic acid sequencing (see Figure 3a, a DNA nanostructure, and Figure 3b, a peptide nanostructure), as disclosed in the provisional patent applications, US 62/794,096, US 62/812,736, US 62/833,870, and
  • Both the DNA nanostructure and peptide nanostructure illustrated in Figure 3 are conductors of electron charges under certain conditions through tunneling and hopping.
  • a DNA polymerase is attached to the nanostructure at the predefined locations, each through a short flexible linker.
  • the DNA polymerase first forms a binary complex with a target-primer duplex, existing in an“open” conformation, which can, in turn, form a ternary complex with a correct nucleoside triphosphate through the Watson-Crick base pairing.
  • the ternary complex turns the DNA polymerase to a“closed” conformation, facilitating the elongation reaction.
  • This invention provides methods and chemicals for identifying individual components (or units or bases) that constitute biopolymers, especially DNAs and RNAs.
  • DNAs and RNAs For example, to sequence a target DNA molecule, we use it as a template for DNA synthesis on the said nanostructure sequencing device, with which nucleoside triphosphate substrates are incorporated into a growing DNA strand following the Watson Crick base pairing rule. The DNA sequence is determined by reading the nucleotide incorporation.
  • a recent study has shown that the DNA synthesis is a two Mg 2+ ion assisted stepwise associative SN2 reaction, 6 albeit a third divalent metal ion may be present during DNA synthesis.
  • pyrophosphate (PPi) group released from the SN2 reaction is hydrolyzed to phosphates during the DNA synthesis catalyzed by DNA polymerases.
  • a general mechanism of DNA polymerization is illustrated in Figure 5.
  • the terminal 3’ oxygen of the growing strand acts as a
  • this invention provides modified nucleotide substrates, which affect the kinetics of the polymerase enzymatic reactions in ways different from the naturally occurring nucleotides, generating distinguishable electric signals in the nanostructure that can be used to differentiate individual nucleotides in the target DNA template so that the target DNA can be sequenced.
  • the DNA polymerases used in this invention include those that have been classified by structural homology into the families of A, B, C, D, X, Y, and RT. For example, those in Family A include T7 DNA polymerase and Bacillus
  • RT reverse transcriptase family of DNA polymerases include, for example, retrovirus reverse transcriptases and eukaryotic telomerases.
  • a polymerase is attached to the nanostructure, fed with a duplex composed of DNA primer and a target to be sequenced, and followed by a mixture of nucleoside triphosphates or dNTPs.
  • the DNA polymerase incorporates the dNTPs into the DNA primer according to the Watson-Crick pairing rule, and each incorporating step evokes an electric spike that is recorded in the sensor.
  • nucleoside triphosphate mixtures include: • 0-4 of naturally occurring nucleoside triphosphates ( Figure 6).
  • the substituents are either electron donating or electron withdrawing groups that affect the activities of DNA polymerases, 8 and also may affect the hydrolysis of pyrophosphate to phosphates, resulting in altered reaction rates and, in turn, the electric signals.
  • the Sp-diastereomers of deoxyribonucleoside and ribonucleoside 5'-0-(1 -thio-triphosphates) are analogs of the naturally occurring nucleotides and are incorporated readily into nucleic acids by DNA or RNA polymerases. 12 ’ 13
  • a-borano-dNTPs • 0-4 of a-borano-dNTPs ( Figure 10).
  • the a-borano-dNTPs and a-borano- NTPs are good to excellent substrates for DNA and RNA polymerases, allowing for ready enzymatic syntheses of DNA and RNA. 14 ⁇ 15
  • the said nucleoside triphosphates include modified sugars.
  • Figure 16 shows one form of the modifications, in which the oxygen in the sugar ring is replaced by another atom. These atoms have different electron negativities, which would affect the pK a of the neighbor 3’-OH, and in turn its nucleophilicity.
  • dSNTPs 2'-deoxy-4'-thioribonucleoside 5'-triphosphate
  • X S
  • R H
  • Base A
  • C, G, T with unmodified triphosphate
  • An RNA dependent RNA polymerase (RdRP) is attached to the DNA nanostructure device for RNA sequencing.
  • the enzyme is polio virus RdRP and others.
  • the said nucleoside triphosphates have the nucleoside units including artificial genetic polymer xeno nucleic acids (XNA), a set of nucleic acid polymers with their backbone structures distinct from those found in nature, which is capable of specifically base pairing with DNA nucleobases ( Figure 17).
  • XNAs genetic polymer xeno nucleic acids
  • Figure 17 Some of XNAs have their sugar units flexible or rigid conformations, and others have different configurations and structures from their naturally occurring counterparts. These make their binding to targets in the enzyme differently from those naturally occurring counterparts.
  • some XNAs carry an electron-donating or withdrawing group that make its neighbor OH more or less nucleophilic, compared to the naturally occurring counterpart.
  • TNA can be incorporated into a DNA primer by a laboratory evolved polymerase that derives from a replicative B-family polymerase isolated from the archaeal hyperthermophilic species Thermococcus kodakarensis (Kod). 20 21 These XNA substrates are useful to distinguish a specific DNA nucleotide from the rest of them in a DNA target.
  • the RNA polymerase attached to the biopolymer nanostructure sensor for RNA sequencing includes, but not limited to, viral RNA polymerases such as T7 RNA polymerase; Eukaryotic RNA polymerases such as RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V; and Archaea RNA polymerase.
  • viral RNA polymerases such as T7 RNA polymerase
  • Eukaryotic RNA polymerases such as RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V
  • Archaea RNA polymerase Archaea RNA polymerase.
  • the RNA polymerase attached to sensor is fed with a mixture of canonical ribonucleoside triphosphates for reading out RNA sequences.
  • the said mixture contains 0-4 of canonical ribonucleoside triphosphates.
  • This invention further provides modified bases to further tune both DNA and RNA polymerase for their reactivities.
  • These compounds have a common feature of the preserved Watson-Crick hydrogen bonding edges for inserting a correct incoming nucleotide to interact with the template following the Watson-Crick base pairing rule and hydrogen bonding acceptor sites for a polymerase to interact with the base pair from the minor groove. 22 ’ 23
  • the modifications do not disturb the fidelity of the enzyme.
  • the said nucleoside triphosphates are composed of the pyrimidine bases with their 5- positions modified with a series of electron- withdrawing groups, electron donor groups, as well as ethyl, ethylene, and acetylene, to which various functional groups are attached ( Figure 19). These modifications allow us to tune the transition state of the enzymatic reaction.
  • the said nucleoside triphosphates are composed of the purines bases with their 7- positions modified with a series of electron-withdrawing groups, electron donor groups, as well as ethyl, ethylene, and acetylene, to which various functional groups are attached ( Figure 20). These modifications allow us to tune the transition state of the enzymatic reaction.
  • the nucleoside triphosphates are composed of the said modified bases, modified sugars or sugar analogies, and modified triphosphates or triphosphate analogies.
  • a plurality of nanostructure sensors are used to read the nucleic acid sequences in parallel.
  • a plurality of nanostructure sensors can be fabricated in an array format with the number of nanostructure sensors from 10 to 10 9 on a solid surface or in a well, preferably 10 3 to 10 7 or more preferably 10 4 to 10 6 .
  • All of the nanostructure sensors in the said array is configured with one type of nucleic acid polymerase or different types of nucleic acid polymerases.
  • the target sample can be double or single-stranded, linear, or circular DNA.
  • the target sample can also be double or single-stranded, linear, or circular RNA.
  • the primer for the sequencing can be DNA, RNA, conjugates of DNA and RNA, or DNA containing modified nucleosides.
  • a polymerase can be attached to a biopolymer nanostructure sensor at a predefined location or locations using the attachments chemistries provided in the previous provisional patent applications (ref. US 62/794,096, US 62/812,736, US 62/833,870, and US62/803,100).
  • a DNA nanostructure is functionalized with organic functional groups at the predefined DNA nucleoside or nucleosides.
  • the DNA polymerase is bioengineered to contain unnatural amino acids that bear the function against those in the DNA nanostructure for the click reaction.
  • the biopolymer nanostructure in all the above descriptions is replaced by a solid nanowire made of material selected from the group of platinum (Pt), palladium (Pd), Tungsten (W), gold (Au), copper (Cu), titanium (Ti), Tantalum (Ta), Chromium (Cr), TiN, TiNx, TaN, TaNx, silver (Ag), aluminum (Al), and other metals, preferably Pt, Pd, Au, Ti, and TiN.
  • the nanowire is 3nm to 10pm in length, preferably 20nm to 1 pm; 5nm to 50nm in width, preferably 5nm to 20nm; and 3nm to 50nm in thickness, preferably 4nm to 10nm.
  • the nanowire is a carbon nanotube or a graphene sheet, single layer or multilayer, with dimension similar to the nanowire.
  • the nanostructure in all the above descriptions is replaced by a molecular wire, such as those disclosed in patent applications,
  • WO2018208505 US20180305727A1
  • WO2018136148A1 All the nucleoside triphosphate and ribonucleoside triphosphate designs, modifications, variations, natural or unnatural, their interaction with polymerase, their methods of use and principles of distinguishing individual nucleotides apply to the polymerase-molecular wire coupled DNA/RNA sequencing system.
  • a DNA polymerase is directly attached to the two electrodes, bridging the nanogap between the two electrodes and allowing electrons or electric current to pass through, such as those disclosed in patent applications
  • a FET type polymerase sequencing system such as those disclosed in the provisional patent application US62/833,870.
  • a FET type polymerase sequencing system such as those disclosed in the provisional patent application US62/833,870.
  • the mechanism of polymerase conformational change affecting the electrical signal passing through the nanogap is somehow different, the nucleoside triphosphate and ribonucleoside triphosphate designs, modifications, variations, natural or unnatural, and their interaction with polymerase, their methods of use and principles of distinguishing individual nucleotides also apply to the FET type polymerase DNA/RNA sequencing system.
  • Cited Literature Patents or patent applications are incorporated into where they are mentioned in the text.
  • the cited journal publications are listed in Cited Literature.
  • SHAW B. R.; DOBRIKOV, M.; WANG, X.; WAN, J.; HE, K.; LIN, J.-L; LI, P.; RAIT, V.; SERGUEEVA, Z. A.; SERGUEEV, D., Reading, Writing, and Modulating

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Organic Chemistry (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Pathology (AREA)
  • General Physics & Mathematics (AREA)
  • Nanotechnology (AREA)
  • Biotechnology (AREA)
  • Urology & Nephrology (AREA)
  • Medicinal Chemistry (AREA)
  • Food Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • Microbiology (AREA)
  • Hematology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Human Computer Interaction (AREA)
  • Electrochemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

This invention provides methods to identify or sequence a DNA or RNA molecule electronically in a single molecule level based on polymerase synthesis.

Description

METHODS TO IDENTIFY COMPONENTS IN NUCLEIC ACID SEQUENCES
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority to U.S. Provisional Application Serial No. 62/853,1 19 filed May 27, 2019, and U.S. Provisional Application Serial No. 62/861 ,675 filed June 14, 2019, the entire disclosures of which are hereby incorporated herein by reference.
FIELD
[0002] Embodiments of the present invention are related to methods and biochemical materials for an electronic sequencing device to read out individual nucleotides in a nucleic acid sequence using enzymes.
THE PRIOR ART AND BACKGROUND
[0001] Throughout this application, various publications, patents, and patent applications to which this invention pertains have been referenced. The disclosures of these publications in their entireties are delineated as the state of the art. Also, in this application, the invention is illustrated primarily with DNA and DNA polymerase synthesis. The same mechanisms, principles, and features apply to RNA and RNA polymerase synthesis too with only slight modifications, e.g., replacing
deoxyribonucleoside triphosphate (dNTP) with nucleoside triphosphate (NTP), replacing deoxyribose with ribose, and so on. [0002] A prior art (US 9,862,998 and 10,233,493) discloses a method to detect the nucleotide incorporation into DNA by monitoring the conformational changes of a DNA polymerase labeled with fluorescent dyes. Depending on the enzymes and dyes, different nucleotides, including those naturally occurring and modified, produced different amplitudes and durations for the fluorescent emissions. The method determines a nucleotide sequence from an ensemble of nucleic acid molecules.
[0003] Another prior art has demonstrated that a carbon nanotube charge sensor (Figure 1 ) could electronically monitor the process of a single DNA polymerase incorporating individual naturally-occurring nucleotides into a DNA primer in real time1 so that such an electronic device could potentially be used in the sequencing of nucleic acid polymers. However, the electric signals (appearing as spikes) cannot be effectively applied to identify individually incorporated nucleotides. As shown in the table below (adopted from Reference 1 ), there are overlaps between the characteristic parameters derived from the electric signals of the DNA polymerase incorporation overlap correspondingly among the naturally occurring nucleotides. Thus, neither tΐo, xm, nor H values are sufficient to identify the incorporation of a particular dNTP with any degree of reliability.
Figure imgf000003_0001
[0004] A prior art application (WO 2016/183218) claims that a mixture in which one or more of the native nucleotide triphosphates is replaced with an analog having a non natural moiety that alters signal polarity in a distinguishable way without hurting the ability of the analog to base pair with its cognate nucleotide in a template strand during sequencing. As an example, a-thio-dATP results in a negative change in signal polarity and 2-thio-dTTP results in a positive change in signal polarity so that they can be used to distinguish between the T and A in a template by means of the said device charge sensor (see Figure 2 for their structures). However, Weiss and coworkers have reported that the 2-thiopyrimidine-5'-triphosphate (2-thio-dNTP) analogs produced mixed behaviors in which the DNA polymerase I Klenow fragment (KF) activity produced negative excursions during 1 minute, and positive excursions during another minute.2 That indicates that the modification on the Watson-Crick base pairing edge of a nucleobase causes uncertainty of the electronic signals, in turn resulting in uncertainties as to the determination of the nucleotide incorporation for the nucleic acid sequencing.
[0005] The enzymatic incorporation of nucleoside triphosphates to a DNA strand has a similar kinetic pathway between the mismatched and matched dNTPs3 as well as between different DNA polymerases4, as shown in Figure 4. In general, the DNA polymerization catalyzed by DNA polymerases is a kinetically controlled process. There are several major steps involved in the process: (1 ) the conformation closing, (2) the triphosphate coupling to 3’ end of DNA, and (3) DNA translocation and the conformation reopening, in which the coupling reaction is the rate-limiting step. Figure 4 suggests that the mismatch base pair would not affect the closing and reopening of a DNA
polymerase significantly but affect the enzyme catalyzed transition state (TS). Thus, the modification of the non-base pairing section of naturally occurring nucleosides should be able to tune their kinetic parameters, making them distinguishable by an electronical sensor.
BRIEF DESCRIPTION OF THE DRAWINGS
[0006] Figure 1 : a prior art device composed of a carbon nanotube attached to two electrodes (source and drain) and functionalized with a DNA polymerase for monitoring the enzyme activity in real-time. [0007] Figure 2: Chemical structure of modified nucleotides a-thio-dATP and 2-thio- dTTP.
[0008] Figure 3: A schematic diagram of a single molecule DNA sequencing device with polymerase on a nanostructure attached to two electrodes, (a) a DNA
nanostructure, (b) a peptide nanostructure.
[0009] Figure 4: (a) Free energy profile for single nucleotide incorporation by different DNA polymerase Pol b WT, R258A mutant, KF, and Pol X; (b) Qualitative free energy profile of matched and mismatched dNTP incorporation by Pol b versus I260Q mutant. [0010] Figure 5: the reaction in incorporating a nucleotide substrate to a DNA chain.
[0011] Figure 6: Chemical structures of naturally occurring nucleoside
triphosphates.
[0012] Figure 7: Chemical structures of naturally occurring nucleoside g-substituted triphosphates.
[0013] Figure 8: Chemical structures of b, g-C analogies of naturally occurring nucleoside triphosphates.
[0014] Figure 9: Chemical structures of naturally occurring nucleoside a-thio- triphosphates (a-thio-dNTP). [0015] Figure 10: Chemical structures of naturally occurring nucleoside a-borano- triphosphates (a-borano-dNTP).
[0016] Figure 1 1 : Chemical structures of naturally occurring nucleoside a-borano-a- thio-tri phosphates (a-borano-a-thio-dNTPs). [0017] Figure 12: Chemical structures of naturally occurring nucleoside a-seleno- triphosphates (a-seleno-dNTP).
[0018] Figure 13: Chemical structures of naturally occurring deoxyribonucleoside a- R-phosphonyl-b, g-diphosphates. [0019] Figure 14: Chemical structures of naturally occurring nucleoside
triphosphates with both oxygen bridges modified (b,g-C-a-Z-dNTP).
[0020] Figure 15: Chemical structures of naturally occurring nucleoside
triphosphates with one of g-, and a-phosphorus’ oxygens replaced by other atoms or organic groups. [0021] Figure 16: Chemical structures of nucleotide with their sugar ring oxygen replaced by other atoms.
[0022] Figure 17: Chemical structures of representative xeno nucleic acid (XNA) nucleosides.
[0023] Figure 18: Diagram of Watson-Crick base pairs and modification sites in this invention.
[0024] Figure 19: Chemical structures of modified pyrimidine nucleobases.
[0025] Figure 20: Chemical structures of modified purine nucleobases.
SUMMARY OF THE INVENTION
[0026] This invention includes a biopolymer nanostructure coupling with a DNA polymerase as an electronic sensor for nucleic acid sequencing (see Figure 3a, a DNA nanostructure, and Figure 3b, a peptide nanostructure), as disclosed in the provisional patent applications, US 62/794,096, US 62/812,736, US 62/833,870, and
US62/803,100, which are included herein by their entirety. Both the DNA nanostructure and peptide nanostructure illustrated in Figure 3 are conductors of electron charges under certain conditions through tunneling and hopping. A DNA polymerase is attached to the nanostructure at the predefined locations, each through a short flexible linker. For sequencing, the DNA polymerase first forms a binary complex with a target-primer duplex, existing in an“open” conformation, which can, in turn, form a ternary complex with a correct nucleoside triphosphate through the Watson-Crick base pairing. In the presence of metal ions, the ternary complex turns the DNA polymerase to a“closed” conformation, facilitating the elongation reaction. When a new phosphodiester bond is formed, the nascent base pair at the end of the duplex is overstretched, which triggers a stacking interaction with the nearest-neighbor base-pair. Such a process shifts DNA and DNA polymerase in opposite directions, hence giving rise to an open conformation for the next round of incorporation.5 All of these mechanical movements, including the conformation change and DNA translocation, exert forces on the underneath
nanostructure and disturb its base pairing and stacking, resulting in fluctuations in the charge transport as a signature of the nucleotide incorporation.
[0027] This invention provides methods and chemicals for identifying individual components (or units or bases) that constitute biopolymers, especially DNAs and RNAs. For example, to sequence a target DNA molecule, we use it as a template for DNA synthesis on the said nanostructure sequencing device, with which nucleoside triphosphate substrates are incorporated into a growing DNA strand following the Watson Crick base pairing rule. The DNA sequence is determined by reading the nucleotide incorporation. A recent study has shown that the DNA synthesis is a two Mg2+ ion assisted stepwise associative SN2 reaction,6 albeit a third divalent metal ion may be present during DNA synthesis.7 Moreover, the pyrophosphate (PPi) group released from the SN2 reaction is hydrolyzed to phosphates during the DNA synthesis catalyzed by DNA polymerases. A general mechanism of DNA polymerization is illustrated in Figure 5. The terminal 3’ oxygen of the growing strand acts as a
nucleophile to attack the a-phosphorus atom of the incoming dNTP to forms a P-0 covalent bond, accompanied by the release of pyrophosphate that is in turn hydrolyzed to phosphates. Based on the reaction mechanism, this invention provides modified nucleotide substrates, which affect the kinetics of the polymerase enzymatic reactions in ways different from the naturally occurring nucleotides, generating distinguishable electric signals in the nanostructure that can be used to differentiate individual nucleotides in the target DNA template so that the target DNA can be sequenced. [0028] The DNA polymerases used in this invention include those that have been classified by structural homology into the families of A, B, C, D, X, Y, and RT. For example, those in Family A include T7 DNA polymerase and Bacillus
stearothermophilus Pol I; those in Family B includeT4 DNA polymerase, Phi29 DNA polymerase, and RB69; those in Family C include the E. coli DNA Polymerase III. The RT (reverse transcriptase) family of DNA polymerases include, for example, retrovirus reverse transcriptases and eukaryotic telomerases.
DETAILED DESCRIPTION
[0029] In some embodiments, a polymerase is attached to the nanostructure, fed with a duplex composed of DNA primer and a target to be sequenced, and followed by a mixture of nucleoside triphosphates or dNTPs. In the presence of metal ions, the DNA polymerase incorporates the dNTPs into the DNA primer according to the Watson-Crick pairing rule, and each incorporating step evokes an electric spike that is recorded in the sensor.
[0030] The above said nucleoside triphosphate mixtures include: • 0-4 of naturally occurring nucleoside triphosphates (Figure 6).
• 0-4 of naturally occurring nucleoside g-substituted triphosphates (Figure 7). The substituents are either electron donating or electron withdrawing groups that affect the activities of DNA polymerases,8 and also may affect the hydrolysis of pyrophosphate to phosphates, resulting in altered reaction rates and, in turn, the electric signals.
• 0-4 of b,g-C analogies of naturally occurring nucleoside triphosphates (Figure 8). In the analogy, the X moiety substitutes for the b,g-bridging O of the naturally occurring nucleoside triphosphate, which alters the stereoelectronic properties of the bisphosphonate (BP) leaving group without affecting the base pairing. As a result, these triphosphate analogies modulate the incorporation rates of the DNA polymerase, which is affected by the leaving groups.9-11 Since the incorporation is a kinetically controlled process, the corresponding electric signals can be modulated accordingly.
• 0-4 of a-thio-dNTPs (Figure 9). These modified triphosphates are
incorporated into DNA primers by DNA polymerase. The Sp-diastereomers of deoxyribonucleoside and ribonucleoside 5'-0-(1 -thio-triphosphates) are analogs of the naturally occurring nucleotides and are incorporated readily into nucleic acids by DNA or RNA polymerases.1213
• 0-4 of a-borano-dNTPs (Figure 10). The a-borano-dNTPs and a-borano- NTPs are good to excellent substrates for DNA and RNA polymerases, allowing for ready enzymatic syntheses of DNA and RNA.14· 15
0-4 of a-borano-a-thio-dNTPs (Figure 1 1 ).16 • 0-4 of a-seleno-dNTP (Figure 12). These modified dNTPs can be incorporated into DNA. However, the DNA polymerization with a-seleno- dNTPs is slower than with the native dNTPs. The a-Seleno-dNTPs suppress the primer self-extension in the lack of a DNA template.17 · 0-4 of deoxyribonucleoside a-R-phosphonyl-b, g-diphosphate (Figure 13).
These substrates produce uncharged nucleic acid backbone18, which facilitates the further distinction between the electric signals generated from the incorporation of different substrates.
• 0-4 of b,g-C-a-Z-dNTP analogs (Figure 14), which facilitate the further distinction between the electric signals generated from the incorporation of different substrates.
• 0-4 of g-R-a-Z-dNTP analogies (Figure 15), which facilitate the further distinction between the electric signals generated from incorporation of different substrates. [0031] In some embodiments, the said nucleoside triphosphates include modified sugars. Figure 16 shows one form of the modifications, in which the oxygen in the sugar ring is replaced by another atom. These atoms have different electron negativities, which would affect the pKa of the neighbor 3’-OH, and in turn its nucleophilicity. For example, 2'-deoxy-4'-thioribonucleoside 5'-triphosphate (dSNTPs), where X = S, R = H, Base = A, C, G, T, with unmodified triphosphate, can be used as substrates of DNA polymerases.19 dSNTPs have shown different reaction rates and efficiencies from the corresponding native dNTPs.
[0032] In some embodiments, the said nucleoside triphosphates include the ribose sugar (Figure 16, R= OH). An RNA dependent RNA polymerase (RdRP) is attached to the DNA nanostructure device for RNA sequencing. The enzyme is polio virus RdRP and others.
[0033] In some embodiments, the said nucleoside triphosphates have the nucleoside units including artificial genetic polymer xeno nucleic acids (XNA), a set of nucleic acid polymers with their backbone structures distinct from those found in nature, which is capable of specifically base pairing with DNA nucleobases (Figure 17). Some of XNAs have their sugar units flexible or rigid conformations, and others have different configurations and structures from their naturally occurring counterparts. These make their binding to targets in the enzyme differently from those naturally occurring counterparts. Also, some XNAs carry an electron-donating or withdrawing group that make its neighbor OH more or less nucleophilic, compared to the naturally occurring counterpart. For example, TNA can be incorporated into a DNA primer by a laboratory evolved polymerase that derives from a replicative B-family polymerase isolated from the archaeal hyperthermophilic species Thermococcus kodakarensis (Kod).20 21 These XNA substrates are useful to distinguish a specific DNA nucleotide from the rest of them in a DNA target.
[0034] In some embodiments, the RNA polymerase attached to the biopolymer nanostructure sensor for RNA sequencing includes, but not limited to, viral RNA polymerases such as T7 RNA polymerase; Eukaryotic RNA polymerases such as RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V; and Archaea RNA polymerase. The RNA polymerase attached to sensor is fed with a mixture of canonical ribonucleoside triphosphates for reading out RNA sequences.
[0035] In some embodiments, the said mixture contains 0-4 of canonical ribonucleoside triphosphates.
• 0-4 of ribonucleotides with their triphosphates modified (Figure 16, R = OH)
• 0-4 of modified ribonucleoside triphosphates with a generalized structure as shown in Figure 16, where R = OH. · 0-4 of XNA triphosphates (Figure 17).
[0036] This invention further provides modified bases to further tune both DNA and RNA polymerase for their reactivities. To maintain the fidelity of polymerases, we remain the WC edges unchanged for the modified bases, as shown in Figure 18. These compounds have a common feature of the preserved Watson-Crick hydrogen bonding edges for inserting a correct incoming nucleotide to interact with the template following the Watson-Crick base pairing rule and hydrogen bonding acceptor sites for a polymerase to interact with the base pair from the minor groove.2223 Thus, the modifications do not disturb the fidelity of the enzyme.
[0037] In some embodiments, the said nucleoside triphosphates are composed of the pyrimidine bases with their 5- positions modified with a series of electron- withdrawing groups, electron donor groups, as well as ethyl, ethylene, and acetylene, to which various functional groups are attached (Figure 19). These modifications allow us to tune the transition state of the enzymatic reaction.
[0038] In some embodiments, the said nucleoside triphosphates are composed of the purines bases with their 7- positions modified with a series of electron-withdrawing groups, electron donor groups, as well as ethyl, ethylene, and acetylene, to which various functional groups are attached (Figure 20). These modifications allow us to tune the transition state of the enzymatic reaction. [0039] In some embodiments, the nucleoside triphosphates are composed of the said modified bases, modified sugars or sugar analogies, and modified triphosphates or triphosphate analogies.
[0040] In some embodiments, a plurality of nanostructure sensors are used to read the nucleic acid sequences in parallel. A plurality of nanostructure sensors can be fabricated in an array format with the number of nanostructure sensors from 10 to 109 on a solid surface or in a well, preferably 103 to 107 or more preferably 104 to 106.
[0041] All of the nanostructure sensors in the said array is configured with one type of nucleic acid polymerase or different types of nucleic acid polymerases. [0042] The target sample can be double or single-stranded, linear, or circular DNA.
The target sample can also be double or single-stranded, linear, or circular RNA. The primer for the sequencing can be DNA, RNA, conjugates of DNA and RNA, or DNA containing modified nucleosides.
[0043] A polymerase can be attached to a biopolymer nanostructure sensor at a predefined location or locations using the attachments chemistries provided in the previous provisional patent applications (ref. US 62/794,096, US 62/812,736, US 62/833,870, and US62/803,100). In many embodiments, a DNA nanostructure is functionalized with organic functional groups at the predefined DNA nucleoside or nucleosides. Whereas the DNA polymerase is bioengineered to contain unnatural amino acids that bear the function against those in the DNA nanostructure for the click reaction.
[0044] In some embodiments, the biopolymer nanostructure in all the above descriptions is replaced by a solid nanowire made of material selected from the group of platinum (Pt), palladium (Pd), Tungsten (W), gold (Au), copper (Cu), titanium (Ti), Tantalum (Ta), Chromium (Cr), TiN, TiNx, TaN, TaNx, silver (Ag), aluminum (Al), and other metals, preferably Pt, Pd, Au, Ti, and TiN. The nanowire is 3nm to 10pm in length, preferably 20nm to 1 pm; 5nm to 50nm in width, preferably 5nm to 20nm; and 3nm to 50nm in thickness, preferably 4nm to 10nm. All the nucleoside triphosphate and ribonucleoside triphosphate designs, modifications, variations, natural or unnatural, their interaction with polymerase, their methods of use, and principles of distinguishing individual nucleotides apply to the polymerase-nanowire coupled DNA/RNA sequencing system. In some other embodiments, the nanowire is a carbon nanotube or a graphene sheet, single layer or multilayer, with dimension similar to the nanowire. [0045] In some embodiments, the nanostructure in all the above descriptions is replaced by a molecular wire, such as those disclosed in patent applications,
WO2018208505, US20180305727A1 , and WO2018136148A1 . All the nucleoside triphosphate and ribonucleoside triphosphate designs, modifications, variations, natural or unnatural, their interaction with polymerase, their methods of use and principles of distinguishing individual nucleotides apply to the polymerase-molecular wire coupled DNA/RNA sequencing system.
[0046] In some embodiments, a DNA polymerase is directly attached to the two electrodes, bridging the nanogap between the two electrodes and allowing electrons or electric current to pass through, such as those disclosed in patent applications
WO2018208505, US20180305727A1 and WO2018136148A1 . For the purpose of
DNA/RNA sequencing, the above mentioned nucleoside triphosphate and
ribonucleoside triphosphate designs, modifications, variations, natural or unnatural, their interaction with polymerase, their methods of use and principles of distinguishing individual nucleotides apply to the polymerase only DNA/RNA sequencing system. [0047] In some embodiments, all the above-mentioned nanogap bridging
configurations, such as the biopolymer nanostructure, the molecular wire, the nanowire and the polymerase directly contacting the nanogap electrodes, can be combined with a gate electrode to form a FET type polymerase sequencing system, such as those disclosed in the provisional patent application US62/833,870. Although the mechanism of polymerase conformational change affecting the electrical signal passing through the nanogap is somehow different, the nucleoside triphosphate and ribonucleoside triphosphate designs, modifications, variations, natural or unnatural, and their interaction with polymerase, their methods of use and principles of distinguishing individual nucleotides also apply to the FET type polymerase DNA/RNA sequencing system.
[0048] GENERAL REMARKS:
Patents or patent applications are incorporated into where they are mentioned in the text. The cited journal publications are listed in Cited Literature.
Unless defined otherwise, all technical and scientific terms used herein take on the meaning commonly understood by one of the ordinary skills in the art to which this invention belongs. While the present invention has been illustrated by a description of various embodiments and while these embodiments have been described in
considerable detail, it is not the intention of the applicants to restrict or in any way limit the scope of the applications. Additional advantages and modifications will readily appear to those skilled in the art. The invention in its broader aspects is therefore not limited to the specific details, representative device, apparatus and method, and illustrative example shown and described. Accordingly, departures may be made from such details without departing from the spirit of applicant’s general inventive concept.
CITED REFERENCES: 1. Olsen, T. J.; Choi, Y.; Sims, P. C.; Gul, O. T.; Corso, B. L; Dong, C.; Brown, W. A.; Collins, P. G.; Weiss, G. A., Electronic Measurements of Single-Molecule Processing by DNA Polymerase I (Klenow Fragment). J. Am. Chem. Soc. 2013, 135, 7855-7860.
2. Pugliese, K. M.; Gul, O. T.; Choi, Y.; Olsen, T. J.; Sims, P. C.; Collins, P. G.; Weiss, G. A., Processive Incorporation of Deoxynucleoside Triphosphate Analogs by Single-Molecule DNA Polymerase I (Klenow Fragment) Nanocircuits. J Am Chem Soc 2015, 137, 9587-9594.
3. Roettger, M. P.; Bakhtina, M.; Tsai, M.-D., Mismatched and Matched dNTP Incorporation by DNA Polymerase Proceed via Analogous Kinetic Pathways.
Biochemistry 2008, 47, 9718-9727.
4. Bakhtina, M.; Roettger, M. P.; Kumar, S.; Tsai, M.-D., A Unified Kinetic
Mechanism Applicable to Multiple DNA Polymerases. Biochemistry 2007, 46, 5463- 5472.
5. Arias-Gonzalez, J. R., A DNA-centered explanation of the DNA polymerase translocation mechanism. Scientific reports 2017 , 7, 7566.
6. Kottur, J.; Nair, D. T., Pyrophosphate hydrolysis is an intrinsic and critical step of the DNA synthesis reaction. Nucleic Acids Research 2018, 46, 5875-5885.
7. Raper, A. T.; Reed, A. J.; Suo, Z., Kinetic Mechanism of DNA Polymerases: Contributions of Conformational Dynamics and a Third Divalent Metal Ion. Chem Rev 2018, 778, 6000-6025.
8. Arzumanov, A. A.; Semizarov, D. G.; Victorova, L. S.; Dyatkina, N. B.; Krayevsky, A. A., g-Phosphate-substituted 2'-Deoxynucleoside 5'-Tri phosphates as Substrates for DNA Polymerases. THE JOURNAL OF BIOLOGICAL CHEMISTRY 1996, 271, 24389- 24394.
9. Sucato, C. A.; Upton, T. G.; Kashemirov, B. A.; Osuna, J.; Oertell, K.; Beard, W.
A.; Wilson, S. H.; Florian, J.; Warshel, A.; McKenna, C. E.; Goodman, M. F., DNA Polymerase b Fidelity: Halomethylene-Modified Leaving Groups in Pre-Steady-State Kinetic Analysis Reveal Differences at the Chemical Transition State. Biochemistry
2008, 47, 870-879.
10. Oertell, K.; Chamberlain, B. T.; Wu, Y.; Ferri, E.; Kashemirov, B. A.; Beard, W. A.; Wilson, S. H.; McKenna, C. E.; Goodman, M. F., Transition State in DNA
Polymerase b Catalysis: Rate-Limiting Chemistry Altered by Base-Pair Configuration. Biochemistry 2014, 53, 1842-1848.
1 1 . Oertell, K.; Florian, J.; Haratipour, P.; Crans, D. C.; Kashemirov, B. A.; Wilson, S. H.; McKenna, C. E.; Goodman, M. F., A Transition-State Perspective on Y-Family DNA Polymerase h Fidelity in Comparison with X-Family DNA Polymerases l and b.
Biochemistry 2019, 58, 1764-1773.
12. Eckstein, F., NUCLEOSIDE PHOSPHOROTHIOATES. Ann. Rev. Biochem. 1985, 54, 367-402.
13. GISH, G.; ECKSTEIN, F., DNA and RNA Sequence Determination Based on Phosphorothioate Chemistry. Science 240, 1520-1521 .
14. Russell, C.; Roy, S.; Ganguly, S.; Qian, X.; Caruthers, M. H.; Nilsson, M., Formation of Silver Nanostructures by Rolling Circle Amplification Using
Boranephosphonate-Modified Nucleotides. Analytical Chemistry 2015, 87, 6660-6666.
15. SHAW, B. R.; DOBRIKOV, M.; WANG, X.; WAN, J.; HE, K.; LIN, J.-L; LI, P.; RAIT, V.; SERGUEEVA, Z. A.; SERGUEEV, D., Reading, Writing, and Modulating
Genetic Information with Boranophosphate Mimics of Nucleotides, DNA, and RNA. Annals of the New York Academy of Sciences 2003, 1002, 12-29.
16. Lin, J.; Ramsay Shaw, B., Synthesis of a novel triphosphate analogue:
nucleoside a-borano, a--thiotri phosphate. Chemical Communications 2000, (21 ), 21 15-
21 16. 17. Hu, B.; Wang, Y.; Sun, S.; Yan, W.; Zhang, C.; Luo, D.; Deng, H.; Hu, L. R.; Huang, Z., Synthesis of Selenium-Triphosphates (dNTPaSe) for More Specific DNA Polymerization. Angewandte Chemie International Edition 2019, 58, 7835-7839.
18. Arangundy-Franklin, S.; Taylor, A. I.; Porebski, B. T.; Genna, V.; Peak-Chew, S.; Vaisman, A.; Woodgate, R.; Orozco, M.; Holliger, P., A synthetic genetic polymer with an uncharged backbone chemistry based on alkyl phosphonate nucleic acids. Nature Chemistry 20 9, 77, 533-542.
19. Kojima, T.; Furukawa, K.; Maruyama, H.; Inoue, N.; Tarashima, N.; Matsuda, A.; Minakawa, N., PCR amplification of 4'-thioDNA using 2'-deoxy-4'-thionucleoside 5'- triphosphates. ACS Synth Biol 2013, 2, 529-36.
20. Dunn, M. R.; Otto, C.; Fenton, K. E.; Chaput, J. C., Improving Polymerase Activity with Unnatural Substrates by Sampling Mutations in Homologous Protein Architectures. ACS Chemical Biology 2016, 77, 1210-1219.
21 . Chim, N.; Shi, C.; Sau, S. P.; Nikoomanzar, A.; Chaput, J. C., Structural basis for TNA synthesis by an engineered TNA polymerase. Nature Communications 2017, 8,
1810.
22. Freudenthal, B. D.; Beard, W. A.; Wilson, S. H., Structures of dNTP intermediate states during DNA polymerase active site assembly. Structure 2012, 20, 1829-1837.
23. Xia, S.; Konigsberg, W. H., RB69 DNA Polymerase Structure, Kinetics, and Fidelity. Biochemistry 2014, 53, 2752-2767.

Claims

WHAT IS CLAIMED:
1. A system for identification, characterization, or sequencing of a biopolymer comprising,
(a) a non-conductive substrate;
(b) a nanogap formed by a first electrode and a second electrode placed next to each other on the non-conductive substrate;
(c) a nanostructure configured to have a dimension comparable to the nanogap and to bridge the nanogap by attaching one end to the first electrode and another end to the second electrode through a chemical bond;
(d) a DNA or an RNA polymerase attached to the nanostructure and configured to perform a biopolymer synthesis reaction;
(e) a reaction mixture that facilitates the biopolymer synthesis reaction;
(f) a bias voltage that is applied between the first electrode and the second electrode;
(g) a device that records a current fluctuation through the nanostructure
resulting from a distortion within the nanostructure caused by a conformation change initiated by the polymerase attached to the nanostructure; and (h) a software for data analysis that identifies the biopolymer or a subunit of the biopolymer; wherein the biopolymer is either a DNA molecule, a RNA molecule, or a oligonucleotide, or a combination thereof, and either double or single stranded, linear or circular, natural, modified or synthesized, and a combination thereof.
2. The system of claim 1 , wherein the non-conductive substrate comprises the
following: silicon, silicon oxide, silicon nitride, glass, hafnium dioxide, other metal oxides, any non-conductive polymer film, silicon with silicon oxide or silicon nitride or other non-conductive coatings, glass with silicon nitride coating, other non- conductive organic materials, and/or any non-conductive inorganic materials.
3. The system of claim 1 , wherein the nanostructure is one of the following or a
combination thereof:
(a) a DNA nanostructure, made of deoxyribonucleic acid, either natural, modified or synthesized;
(b) an RNA nanostructure, made of ribonucleic acid, either natural, modified or synthesized;
(c) a peptide nanostructure, made of amino acid, either natural, modified or synthesized; and
(d) a molecular wire made of any conductive biopolymer or biopolymers, either natural, modified or synthesized.
4. The system of claim 1 , wherein the nanostructure comprises a solid nanowire made of a metal material selected from the group consisting of platinum (Pt), palladium (Pd), Tungsten (W), gold (Au), copper (Cu), titanium (Ti), Tantalum (Ta), Chromium (Cr), TiN, TiNx, TaN, TaNx, silver (Ag), aluminum (Al), and other metals, and a combination thereof.
5. The system of claim 1 , wherein the nanostructure comprises a carbon nanotube or a graphene sheet, either a single layer or multilayer or a combination thereof.
6. The system of claim 1 , wherein the nanostructure is the polymerase wherein the polymerase is directly attached to the two electrodes, bridging the nanogap, and allowing the electronic current to pass through.
7. The system of claim 1 , wherein the DNA polymerase is selected from the group consisting of DNA polymerase families A, B, C, D, X, Y, and RT, comprising T7 DNA polymerase, Phi29 DNA polymerase, Taq polymerase, DNA polymerase Y, DNA Polymerase Pol I, Pol II, Pol III, Pol IV, and Pol V, Pol a (alpha), Pol b (beta), Pol s (sigma), Pol l (lambda), Pol d (delta), Pol e (epsilon), Pol m (mu), Pol I (iota), Pol k (kappa), pol h (eta), and terminal deoxynucleotidyl transferase, telomerase, either native, mutated, expressed, or synthesized, and a combination thereof.
8. The system of claim 1 , wherein the RNA polymerase is selected from the group consisting of viral RNA polymerases, comprising T7 RNA polymerase; and
Eukaryotic RNA polymerases, comprising RNA polymerase I, RNA polymerase II, RNA polymerase III, RNA polymerase IV, and RNA polymerase V; and Archaea RNA polymerase, either native, mutated, expressed, or synthesized, and a combination thereof.
9. The system of claim 1 , further comprising a third electrode, configured to function as a gate electrode, wherein together with the first and the second electrodes, they form a FET type nanogap device.
10. The system of claim 1 , wherein the reaction mixture comprises at least one of the following nucleoside triphosphate mixtures or a combination thereof: (a) at least a naturally occurring nucleoside triphosphates;
(b) at least a naturally occurring nucleoside g-substituted triphosphates, comprising either electron-donating or electron-withdrawing groups;
(c) at least a b,g-C analogs of naturally occurring nucleoside triphosphates with the X moiety substituting for the b,g-bridging O of the naturally occurring nucleoside triphosphate;
(d) at least a a-thio-dNTPs or a-thio-NTPs;
(e) at least a a-borano-dNTPs or a-borano-NTPs;
(f) at least a a-borano-a-thio-dNTPs or a-borano-a-thio-NTPs; (g) at least a a-seleno-dNTPs or a-seleno-NTPs;
(h) at least a deoxyribonucleoside a-R-phosphonyl-b, g-diphosphate;
(i) at least a b,g-C-a-Z-dNTP analogies or b,g-C-a-Z-NTR analogies; and
(j) at least a g-R-a-Z-dNTP analogies or g-R-a-Z-NTP analogies, wherein the dNTP is configured for DNA synthesis and the NTP is configured for RNA synthesis. The system of claim 1 , wherein the reaction mixture comprises at least one of the following or a combination thereof:
(a) a dNTP or a NTP that comprises a modified sugar, wherein the oxygen in the sugar ring is replaced by an atom that has a different electron negativity;
(b) a dNTP or a NTP that comprises a nucleoside unit comprising an artificial genetic polymer xeno nucleic acid (XNA);
(c) a dNTP or a NTP that comprises a pyrimidine base with the 5- position modified with a molecule selected from the group consisting of an electron-withdrawing group, an electron-donor groups, an ethyl group, an ethylene group, an acetylene group, and a combination thereof, to which a functional group is attached; and
(d) a dNTP or a NTP that comprises a purine base with the 7- position
modified with a molecule selected from the group consisting of an electron-withdrawing group, an electron-donor group, an ethyl group, an ethylene, and an acetylene group, and a combination thereof, to which a functional group is attached; and wherein the dNTP is configured for DNA synthesis and the NTP is configured for RNA synthesis.
12. The system of claim 1 comprises a plurality of nanogap sensors, wherein each
nanogap sensor comprises a pair of electrodes, a nanostructure, a polymerase, a reaction mixture, and any feature associated with a single nanogap.
13. The system of claim 12, wherein the plurality of nanogap sensors comprise an array of about 10 to about 1 billion nanogaps, preferably between about 10,000 to about 1 million nanogaps.
14. A method for identification, characterization, or sequencing of a biopolymer comprising,
(a) providing a non-conductive substrate;
(b) providing a first electrode and a second electrode, and placing them next to each other to form a nanogap on the non-conductive substrate;
(c) providing a nanostructure configured to have a dimension comparable to the nanogap and to bridge the nanogap by attaching one end to the first electrode and another end to the second electrode through a chemical bond
(d) providing a DNA or RNA polymerase attached to the nanostructure and
configured to perform a biopolymer synthesis reaction;
(e) providing a reaction mixture that facilitates the biopolymer synthesis
reaction;
(f) applying a bias voltage between the first electrode and the second electrode;
(g) recording the current fluctuation through the nanostructure resulting from a distortion within the nanostructure caused by a conformation change initiated by the polymerase attached to the nanostructure; and (h) providing a software for data analysis that identifies the biopolymer or a subunit of the biopolymer; and wherein the biopolymer is either a DNA molecule, a RNA molecule, or a oligonucleotide, or a combination thereof, and either double or single stranded, linear or circular, natural, modified or synthesized, and a combination thereof.
15. The method of claim 14, wherein the non-conductive substrate comprises the
following: silicon, silicon oxide, silicon nitride, glass, hafnium dioxide, other metal oxides, any non-conductive polymer film, silicon with silicon oxide or silicon nitride or other non-conductive coatings, glass with silicon nitride coating, other non- conductive organic materials, and/or any non-conductive inorganic materials.
16. The method of claim 14, wherein the nanostructure is one of the following or a
combination thereof:
(a) a DNA nanostructure, made of deoxyribonucleic acid, either natural,
modified or synthesized;
(b) an RNA nanostructure, made of ribonucleic acid, either natural, modified or synthesized;
(c) a peptide nanostructure, made of amino acid, either natural, modified or synthesized; and
(d) a molecular wire, made of any conductive biopolymer or biopolymers, either natural, modified or synthesized.
17. The method of claim 14, wherein the nanostructure comprises a solid nanowire
made of a metal material selected from the group consisting of platinum (Pt), palladium (Pd), Tungsten (W), gold (Au), copper (Cu), titanium (Ti), Tantalum (Ta), Chromium (Cr), TiN, TiNx, TaN, TaNx, silver (Ag), aluminum (Al), and other metals, and a combination thereof.
18. The method of claim 14, wherein the nanostructure comprises a carbon nanotube or a graphene sheet, either a single layer or multilayer or a combination thereof.
19. The method of claim 14, wherein the nanostructure is the polymerase wherein the polymerase is directly attached to the two electrodes, bridging the nanogap, and allowing the electronic current to pass through.
20. The method of claim 14, wherein the DNA polymerase is selected from the group consisting of DNA polymerase families A, B, C, D, X, Y, and RT, comprising T7 DNA polymerase, Phi29 DNA polymerase, Tag polymerase, DNA polymerase Y, DNA Polymerase Pol I, Pol II, Pol III, Pol IV, and Pol V, Pol a (alpha), Pol b (beta), Pol s (sigma), Pol l (lambda), Pol d (delta), Pol e (epsilon), Pol m (mu), Pol I (iota), Pol k
(kappa), pol h (eta), and terminal deoxynucleotidyl transferase, telomerase, either native, mutated, expressed, or synthesized, and a combination thereof.
21 .The method of claim 14, wherein the RNA polymerase is selected from the group consisting of viral RNA polymerases comprising T7 RNA polymerase; and
Eukaryotic RNA polymerases comprisingRNA polymerase I, RNA polymerase II,
RNA polymerase III, RNA polymerase IV, and RNA polymerase V; and Archaea RNA polymerase, either native, mutated, expressed, or synthesized, and a combination thereof.
22. The method of claim 14, further comprising providing a third electrode, configured to function as a gate electrode, wherein together with the first and the second electrodes, they form a FET type nanogap device.
23. The method of claim 14, wherein the reaction mixture comprises at least one of the following nucleoside triphosphate mixtures or a combination thereof:
(a) at least a naturally occurring nucleoside triphosphates; (b) at least a naturally occurring nucleoside g-substituted triphosphates, containing either electron-donating or electron-withdrawing groups;
(c) at least a b,g-C analogs of naturally occurring nucleoside triphosphates with the X moiety substituting for the b,g-bridging O of the naturally occurring nucleoside triphosphate;
(d) at least a a-thio-dNTPs or a-thio-NTPs;
(e) at least a a-borano-dNTPs or a-borano-NTPs;
(f) at least a a-borano-a-thio-dNTPs or a-borano-a-thio-NTPs;
(g) at least a a-seleno-dNTPs or a-seleno-NTPs; (h) at least a deoxyribonucleoside a-R-phosphonyl-b, g-di phosphate;
(i) at least a b,g-C-a-Z-dNTP analogies or b,g-C-a-Z-NTR analogies; and
(j) at least a g-R-a-Z-dNTP analogies or g-R-a-Z-NTP analogies; and wherein the dNTP is configured for DNA synthesis and the NTP is configured for RNA synthesis. 24. The method of claim 14, wherein the reaction mixture contains at least one of the following or a combination thereof:
(a) a dNTP or a NTP that comprises a modified sugar wherein the oxygen in the sugar ring is replaced by an atom that has different electron negativity.;
(b) a dNTP or a NTP that comprises a nucleoside unit comprising an artificial genetic polymer xeno nucleic acid (XNA); (c) a dNTP or a NTP that comprises a pyrimidine base with the 5- position modified with a molecule selected from the group consisting of an electron-withdrawing group, an electron-donor groups, an ethyl group, an ethylene group, and an acetylene group, and a combination thereof, to which a functional group is attached; and
(d) a dNTP or a NTP that comprises a purine base with the 7- position
modified with a molecule selected from the group consisting of an electron-withdrawing group, an electron-donor group, an ethyl group, an ethylene group, and an acetylene group, to which a functional group is attached, wherein the dNTP is configured for DNA synthesis and the NTP is configured for RNA synthesis.
PCT/US2020/034759 2019-05-27 2020-05-27 Methods to identify components in nucleic acid sequences WO2020243207A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
EP20814921.1A EP3976814A4 (en) 2019-05-27 2020-05-27 Methods to identify components in nucleic acid sequences
JP2021570366A JP2022535746A (en) 2019-05-27 2020-05-27 Methods of identifying constituents in nucleic acid sequences
CN202080053393.6A CN114555832A (en) 2019-05-27 2020-05-27 Method for identifying components in nucleic acid sequences
US17/595,758 US20220251638A1 (en) 2019-05-27 2020-05-27 Methods to Identify Components in Nucleic Acid Sequences
KR1020217042311A KR20220012920A (en) 2019-05-27 2020-05-27 Methods for identifying elements within a nucleic acid sequence

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201962853119P 2019-05-27 2019-05-27
US62/853,119 2019-05-27
US201962861675P 2019-06-14 2019-06-14
US62/861,675 2019-06-14

Publications (1)

Publication Number Publication Date
WO2020243207A1 true WO2020243207A1 (en) 2020-12-03

Family

ID=73552680

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2020/034759 WO2020243207A1 (en) 2019-05-27 2020-05-27 Methods to identify components in nucleic acid sequences

Country Status (6)

Country Link
US (1) US20220251638A1 (en)
EP (1) EP3976814A4 (en)
JP (1) JP2022535746A (en)
KR (1) KR20220012920A (en)
CN (1) CN114555832A (en)
WO (1) WO2020243207A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2023202006B1 (en) * 2022-08-15 2023-07-13 Ella Chernis System, Apparatus and Process
US12031981B2 (en) 2018-05-09 2024-07-09 Arizona Board Of Regents On Behalf Of Arizona State University Method for electronic detection and quantification of antibodies

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SG11202112851RA (en) 2019-06-20 2021-12-30 Univ Arizona State Direct electrical readout of nucleic acid sequences
KR20220147602A (en) 2020-02-28 2022-11-03 아리조나 보드 오브 리젠츠 온 비하프 오브 아리조나 스테이트 유니버시티 How to sequence biopolymers

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130327644A1 (en) * 2012-06-08 2013-12-12 Pacific Biosciences Of California, Inc. Modified base detection with nanopore sequencing
US20160108002A1 (en) * 2013-05-30 2016-04-21 Arizona Board Of Regents Acting For And On Behalf Of Arizona State University Universal reader molecule for recognition tunneling
US20160265047A1 (en) * 2014-12-18 2016-09-15 Edico Genome, Inc. Graphene fet devices, systems, and methods of using the same for sequencing nucleic acids
WO2017189930A1 (en) * 2016-04-27 2017-11-02 Quantum Biosystems Inc. Systems and methods for measurement and sequencing of bio-molecules

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5037343B2 (en) * 2004-06-10 2012-09-26 ジーイー・ヘルスケア・バイオサイエンス・コーポレイション Nucleic acid analysis method
US9238836B2 (en) * 2012-03-30 2016-01-19 Pacific Biosciences Of California, Inc. Methods and compositions for sequencing modified nucleic acids
CA2971268A1 (en) * 2014-12-18 2016-06-23 The Regents Of The University Of California Detection of nucleic acid polymerase conformational changes using a nanotube
IL255446B2 (en) * 2015-05-12 2024-01-01 Illumina Inc Field-effect apparatus and methods for sequencing nucelic acids
EP3332033B1 (en) * 2015-08-06 2021-04-21 Pacific Biosciences of California, Inc. Single-molecule nanofet sequencing systems and methods
WO2017123416A1 (en) * 2016-01-14 2017-07-20 Roswell Biotechnologies, Inc. Molecular sensors and related methods
KR20180105699A (en) * 2016-01-28 2018-09-28 로스웰 바이오테크놀로지스 인코포레이티드 Methods and apparatus for measuring analytes using large scale molecular electronic device sensor arrays
WO2018200687A1 (en) * 2017-04-25 2018-11-01 Roswell Biotechnologies, Inc. Enzymatic circuits for molecular sensors

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130327644A1 (en) * 2012-06-08 2013-12-12 Pacific Biosciences Of California, Inc. Modified base detection with nanopore sequencing
US20160108002A1 (en) * 2013-05-30 2016-04-21 Arizona Board Of Regents Acting For And On Behalf Of Arizona State University Universal reader molecule for recognition tunneling
US20160265047A1 (en) * 2014-12-18 2016-09-15 Edico Genome, Inc. Graphene fet devices, systems, and methods of using the same for sequencing nucleic acids
WO2017189930A1 (en) * 2016-04-27 2017-11-02 Quantum Biosystems Inc. Systems and methods for measurement and sequencing of bio-molecules

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3976814A4 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12031981B2 (en) 2018-05-09 2024-07-09 Arizona Board Of Regents On Behalf Of Arizona State University Method for electronic detection and quantification of antibodies
AU2023202006B1 (en) * 2022-08-15 2023-07-13 Ella Chernis System, Apparatus and Process
WO2024036363A1 (en) * 2022-08-15 2024-02-22 Ethan Samuel Volavola Vidal System, apparatus and process

Also Published As

Publication number Publication date
JP2022535746A (en) 2022-08-10
KR20220012920A (en) 2022-02-04
EP3976814A1 (en) 2022-04-06
EP3976814A4 (en) 2023-07-19
US20220251638A1 (en) 2022-08-11
CN114555832A (en) 2022-05-27

Similar Documents

Publication Publication Date Title
US20220251638A1 (en) Methods to Identify Components in Nucleic Acid Sequences
US20220162679A1 (en) Closed nucleic acid structures
Houlihan et al. Exploring the chemistry of genetic information storage and propagation through polymerase engineering
EP3620533B1 (en) Closed nucleic acid structures
Pinheiro et al. The XNA world: progress towards replication and evolution of synthetic genetic polymers
US10752944B2 (en) Circularized templates for sequencing
US10253352B2 (en) Methods for determining sequence profiles
Kim et al. Single-molecule study of DNA polymerization activity of HIV-1 reverse transcriptase on DNA templates
Lee et al. Non-natural nucleotides as probes for the mechanism and fidelity of DNA polymerases
Bhamra et al. Base-pairing preferences, physicochemical properties and mutational behaviour of the DNA lesion 8-nitroguanine
US20200063194A1 (en) Comprehensive single molecule enhanced detection of modified cytosines
Stengel et al. Ambivalent incorporation of the fluorescent cytosine analogues tC and tCo by human DNA polymerase α and Klenow fragment
Jung et al. Nucleotide analogues as probes for DNA polymerases
EP2744916A1 (en) Multimodal methods for simultaneous detection and quantification of multiple nucleic acids in a sample
Pichon et al. Controlled enzymatic synthesis of oligonucleotides
US20080096765A1 (en) Methods and compositions for correcting misincorporation in a nucleic acid synthesis reaction

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20814921

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2021570366

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20217042311

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2020814921

Country of ref document: EP

Effective date: 20220103