GB2447679A - Scanning probe microscopy-based polynucleotide sequencing and detection - Google Patents
Scanning probe microscopy-based polynucleotide sequencing and detection Download PDFInfo
- Publication number
- GB2447679A GB2447679A GB0705367A GB0705367A GB2447679A GB 2447679 A GB2447679 A GB 2447679A GB 0705367 A GB0705367 A GB 0705367A GB 0705367 A GB0705367 A GB 0705367A GB 2447679 A GB2447679 A GB 2447679A
- Authority
- GB
- United Kingdom
- Prior art keywords
- polynucleotides
- scanning
- base
- spm
- modified
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 108091033319 polynucleotide Proteins 0.000 title claims abstract description 91
- 102000040430 polynucleotide Human genes 0.000 title claims abstract description 91
- 239000002157 polynucleotide Substances 0.000 title claims abstract description 91
- 238000004621 scanning probe microscopy Methods 0.000 title claims abstract description 44
- 238000012163 sequencing technique Methods 0.000 title claims abstract description 38
- 238000001514 detection method Methods 0.000 title claims abstract description 20
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 64
- 238000000034 method Methods 0.000 claims abstract description 62
- -1 nucleotide triphosphates Chemical class 0.000 claims abstract description 32
- 239000000758 substrate Substances 0.000 claims abstract description 21
- 239000001226 triphosphate Substances 0.000 claims abstract description 20
- 235000011178 triphosphate Nutrition 0.000 claims abstract description 20
- 102000039446 nucleic acids Human genes 0.000 claims description 33
- 108020004707 nucleic acids Proteins 0.000 claims description 33
- 150000007523 nucleic acids Chemical class 0.000 claims description 33
- 239000000523 sample Substances 0.000 claims description 25
- 230000015572 biosynthetic process Effects 0.000 claims description 24
- 230000000295 complement effect Effects 0.000 claims description 24
- 238000003786 synthesis reaction Methods 0.000 claims description 24
- 238000004630 atomic force microscopy Methods 0.000 claims description 21
- 238000004574 scanning tunneling microscopy Methods 0.000 claims description 20
- 229920000642 polymer Polymers 0.000 claims description 15
- 238000009396 hybridization Methods 0.000 claims description 14
- 125000005647 linker group Chemical group 0.000 claims description 13
- 150000001413 amino acids Chemical class 0.000 claims description 12
- 239000010445 mica Substances 0.000 claims description 11
- 229910052618 mica group Inorganic materials 0.000 claims description 11
- KAESVJOAVNADME-UHFFFAOYSA-N Pyrrole Chemical compound C=1C=CNC=1 KAESVJOAVNADME-UHFFFAOYSA-N 0.000 claims description 10
- YTPLMLYBLZKORZ-UHFFFAOYSA-N Thiophene Chemical compound C=1C=CSC=1 YTPLMLYBLZKORZ-UHFFFAOYSA-N 0.000 claims description 10
- 229920000867 polyelectrolyte Polymers 0.000 claims description 9
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 8
- 238000001179 sorption measurement Methods 0.000 claims description 8
- 108010038807 Oligopeptides Proteins 0.000 claims description 7
- 102000015636 Oligopeptides Human genes 0.000 claims description 7
- PAYRUJLWNCNPSJ-UHFFFAOYSA-N Aniline Chemical compound NC1=CC=CC=C1 PAYRUJLWNCNPSJ-UHFFFAOYSA-N 0.000 claims description 6
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 claims description 5
- 238000004458 analytical method Methods 0.000 claims description 5
- 150000001875 compounds Chemical class 0.000 claims description 5
- 238000004651 near-field scanning optical microscopy Methods 0.000 claims description 5
- 229920001184 polypeptide Polymers 0.000 claims description 5
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 5
- 238000001115 scanning electrochemical microscopy Methods 0.000 claims description 5
- 229930192474 thiophene Natural products 0.000 claims description 5
- 125000000217 alkyl group Chemical group 0.000 claims description 4
- 125000003118 aryl group Chemical group 0.000 claims description 3
- 125000003636 chemical group Chemical group 0.000 claims description 3
- 229910002804 graphite Inorganic materials 0.000 claims description 3
- 239000010439 graphite Substances 0.000 claims description 3
- 230000007935 neutral effect Effects 0.000 claims description 3
- 229910052723 transition metal Inorganic materials 0.000 claims description 3
- 150000003624 transition metals Chemical class 0.000 claims description 3
- 125000003342 alkenyl group Chemical group 0.000 claims description 2
- HSFWRNGVRCDJHI-UHFFFAOYSA-N alpha-acetylene Natural products C#C HSFWRNGVRCDJHI-UHFFFAOYSA-N 0.000 claims description 2
- 229910052751 metal Inorganic materials 0.000 claims description 2
- 239000002184 metal Substances 0.000 claims description 2
- 239000005547 deoxyribonucleotide Substances 0.000 claims 2
- 229920000037 Polyproline Polymers 0.000 claims 1
- 108091028664 Ribonucleotide Proteins 0.000 claims 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 claims 1
- 125000002534 ethynyl group Chemical group [H]C#C* 0.000 claims 1
- JZRYQZJSTWVBBD-UHFFFAOYSA-N pentaporphyrin i Chemical compound N1C(C=C2NC(=CC3=NC(=C4)C=C3)C=C2)=CC=C1C=C1C=CC4=N1 JZRYQZJSTWVBBD-UHFFFAOYSA-N 0.000 claims 1
- 108010026466 polyproline Proteins 0.000 claims 1
- 239000002336 ribonucleotide Substances 0.000 claims 1
- 125000002652 ribonucleotide group Chemical group 0.000 claims 1
- 239000002773 nucleotide Substances 0.000 abstract description 46
- 239000002777 nucleoside Substances 0.000 abstract description 22
- 238000006243 chemical reaction Methods 0.000 abstract description 14
- 125000003835 nucleoside group Chemical group 0.000 abstract description 12
- 108020005187 Oligonucleotide Probes Proteins 0.000 abstract description 4
- 239000002751 oligonucleotide probe Substances 0.000 abstract description 4
- 108020004414 DNA Proteins 0.000 description 32
- 239000000126 substance Substances 0.000 description 21
- 201000001997 microphthalmia with limb anomalies Diseases 0.000 description 19
- 101500012027 Viscum album Beta-galactoside-specific lectin 1 chain A isoform 1 Proteins 0.000 description 17
- 235000001014 amino acid Nutrition 0.000 description 10
- 238000013459 approach Methods 0.000 description 10
- 238000010348 incorporation Methods 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- 125000006850 spacer group Chemical group 0.000 description 10
- 108091034117 Oligonucleotide Proteins 0.000 description 8
- 238000011160 research Methods 0.000 description 8
- 238000012552 review Methods 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 150000003833 nucleoside derivatives Chemical class 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 5
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 4
- YLQBMQCUIZJEEH-UHFFFAOYSA-N Furan Chemical compound C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 4
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 4
- 125000004122 cyclic group Chemical group 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 239000000232 Lipid Bilayer Substances 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 3
- 125000005677 ethinylene group Chemical group [*:2]C#C[*:1] 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 238000010647 peptide synthesis reaction Methods 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 238000001338 self-assembly Methods 0.000 description 3
- 125000001424 substituent group Chemical group 0.000 description 3
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- XMWRBQBLMFGWIX-UHFFFAOYSA-N C60 fullerene Chemical class C12=C3C(C4=C56)=C7C8=C5C5=C9C%10=C6C6=C4C1=C1C4=C6C6=C%10C%10=C9C9=C%11C5=C8C5=C8C7=C3C3=C7C2=C1C1=C2C4=C6C4=C%10C6=C9C9=C%11C5=C5C8=C3C3=C7C1=C1C2=C4C6=C2C9=C5C3=C12 XMWRBQBLMFGWIX-UHFFFAOYSA-N 0.000 description 2
- 239000004215 Carbon black (E152) Substances 0.000 description 2
- 229920000858 Cyclodextrin Polymers 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- 108010017826 DNA Polymerase I Proteins 0.000 description 2
- 102000004594 DNA Polymerase I Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 229960002684 aminocaproic acid Drugs 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 239000012298 atmosphere Substances 0.000 description 2
- 239000002041 carbon nanotube Substances 0.000 description 2
- 229910021393 carbon nanotube Inorganic materials 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 229940097362 cyclodextrins Drugs 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- RMBPEFMHABBEKP-UHFFFAOYSA-N fluorene Chemical compound C1=CC=C2C3=C[CH]C=CC3=CC2=C1 RMBPEFMHABBEKP-UHFFFAOYSA-N 0.000 description 2
- 229910003472 fullerene Inorganic materials 0.000 description 2
- 238000012252 genetic analysis Methods 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 229930195733 hydrocarbon Natural products 0.000 description 2
- 150000002430 hydrocarbons Chemical class 0.000 description 2
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000012212 insulator Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 239000002105 nanoparticle Substances 0.000 description 2
- 238000001426 native polyacrylamide gel electrophoresis Methods 0.000 description 2
- NIHNNTQXNPWCJQ-UHFFFAOYSA-N o-biphenylenemethane Natural products C1=CC=C2CC3=CC=CC=C3C2=C1 NIHNNTQXNPWCJQ-UHFFFAOYSA-N 0.000 description 2
- 125000006353 oxyethylene group Chemical group 0.000 description 2
- 230000000704 physical effect Effects 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 239000011148 porous material Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 239000013545 self-assembled monolayer Substances 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000001308 synthesis method Methods 0.000 description 2
- 125000002264 triphosphate group Chemical group [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 2
- 230000005641 tunneling Effects 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- KMEMIMRPZGDOMG-UHFFFAOYSA-N 2-cyanoethoxyphosphonamidous acid Chemical group NP(O)OCCC#N KMEMIMRPZGDOMG-UHFFFAOYSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical group O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 description 1
- 206010065042 Immune reconstitution inflammatory syndrome Diseases 0.000 description 1
- 102000009617 Inorganic Pyrophosphatase Human genes 0.000 description 1
- 108010009595 Inorganic Pyrophosphatase Proteins 0.000 description 1
- 239000012901 Milli-Q water Substances 0.000 description 1
- 229920002518 Polyallylamine hydrochloride Polymers 0.000 description 1
- 229920002873 Polyethylenimine Polymers 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 108010001244 Tli polymerase Proteins 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 239000002156 adsorbate Substances 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 125000002843 carboxylic acid group Chemical group 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 125000006165 cyclic alkyl group Chemical group 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 239000000412 dendrimer Substances 0.000 description 1
- 229920000736 dendritic polymer Polymers 0.000 description 1
- 238000000151 deposition Methods 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 125000001891 dimethoxy group Chemical group [H]C([H])([H])O* 0.000 description 1
- 229910001873 dinitrogen Inorganic materials 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000004668 electrochemical scanning tunneling microscopy Methods 0.000 description 1
- 239000012039 electrophile Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 125000005678 ethenylene group Chemical group [H]C([*:1])=C([H])[*:2] 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical class O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 1
- 125000001072 heteroaryl group Chemical group 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- QYFRTHZXAGSYGT-UHFFFAOYSA-L hexaaluminum dipotassium dioxosilane oxygen(2-) difluoride hydrate Chemical compound O.[O--].[O--].[O--].[O--].[O--].[O--].[O--].[O--].[O--].[F-].[F-].[Al+3].[Al+3].[Al+3].[Al+3].[Al+3].[Al+3].[K+].[K+].O=[Si]=O.O=[Si]=O.O=[Si]=O.O=[Si]=O.O=[Si]=O.O=[Si]=O QYFRTHZXAGSYGT-UHFFFAOYSA-L 0.000 description 1
- 230000036571 hydration Effects 0.000 description 1
- 238000006703 hydration reaction Methods 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229910052741 iridium Inorganic materials 0.000 description 1
- GKOZUEZYRPOHIO-UHFFFAOYSA-N iridium atom Chemical compound [Ir] GKOZUEZYRPOHIO-UHFFFAOYSA-N 0.000 description 1
- 238000012804 iterative process Methods 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 238000007481 next generation sequencing Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000012038 nucleophile Substances 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 238000004375 physisorption Methods 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229920000371 poly(diallyldimethylammonium chloride) polymer Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 150000004032 porphyrins Chemical class 0.000 description 1
- JKANAVGODYYCQF-UHFFFAOYSA-N prop-2-yn-1-amine Chemical class NCC#C JKANAVGODYYCQF-UHFFFAOYSA-N 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 125000000561 purinyl group Chemical group N1=C(N=C2N=CNC2=C1)* 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 150000003254 radicals Chemical class 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000001454 recorded image Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000011896 sensitive detection Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000011451 sequencing strategy Methods 0.000 description 1
- 238000002444 silanisation Methods 0.000 description 1
- 239000002109 single walled nanotube Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 238000012876 topography Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6816—Hybridisation assays characterised by the detection means
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6834—Enzymatic or biochemical coupling of nucleic acids to a solid phase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
A method for the sequencing of target polynucleotides comprises the steps of (i) performing the polymerase reaction to extend suitable primers hybridised to the target polynucleotides using labelled nucleotide triphosphates; (ii) scanning flat substrates containing ultrahigh densities of the labelled single stranded or double stranded polynucleotides obtained in step (i) with scanning probe microscopy (SPM); (iii) analysing the images recorded from scanning to obtain at once the complete sequences of all the polynucleotides immobilised on the substrate. Methods for the detection with SPM of polynucleotides using oligonucleotide probes prepared from the labelled nucleosides and immobilised on a flat surface are also provided. The nucleotides are typically labeled with a moiety suitable for SPM detection attached via a non-cleavable linker to the base.
Description
TITLE OF THE INVENTION: SCANNING PROBE MICROSCOPY-BASED POLYNUCLEOTIDES
DETECTION AND SEQUENCING.
DESCRIPTION OF THE INVENTION
FIELD OF THE INVENTION
This invention relates to the detection and the sequencing of polynucleotides, In particular, the invention describes modified labelled nucleotides and nucleosides and methods for detecting and analysing the sequence of polynucleotides using scanning probe microscopy (SPM) and sequencing by primer extension (SPREX).
BACKGROUND TO THE INVENTION
The Sanger DNA sequencing approach introduced in 1977 revolutionised biological science and allowed the sequencing of nucleic acids such as DNA and RNA.
The Sanger methods are based on chain termination and rely on the use of labelled dideoxy derivatives of the four nucleotide triphosphates which are incorporated into a nascent polynucleotide chain in a polymerase reaction. Upon incorporation, the dideoxy derivatives terminate the polymerase reaction. The labelled polynucleotide fragments obtained are size fractionated using gel electrophoresis and analysed to determine the order of bases.
Developments of automated fluorescent DNA sequencers based on this approach and rapid increases in instrument throughput enabled the completion of a blue print of the Human Genome.
Despite the progress made, Sanger based methods are slow, labour intensive and expensive, It still costs an estimated $10 millions US dollars to sequence a mammalian genome. There is a need to develop new sequencing technologies to reduce the cost of sequencing a mammalian genome to $100,000 US dollars and ultimately $1000 or less. The attainment of this goal will enable the sequencing of each person's genome which will lead to individualised approaches for diagnosing, treating and preventing disease.
The so-called next generation sequencing technologies introduced in the past few years are non-electrophoretic and are mainly based either on sequencing by synthesis (SBS) or on the use of nanopores associated with various detection systems.
The concept of sequencingbysynthesis (SBS) involves the detection of the identity of each nucleotide immediately after its incorporation into a growing strand of DNA in a polymerase reaction.
One approach of SBS relies on the use of modified nucleotides as reversible terminators, in which a different fluorophore with a distinct fluorescent emission is linked to each of the 4 bases through a cleavable linker and the 3-OH group is capped by a small chemical moiety. DNA polymerase incorporates only a single nucleotide analogue complementary to the base on a DNA template covalently linked to a surface. After incorporation, the unique fluorescence emission is detected to identify the incorporated nucleotide and the fluorophore is subsequently removed
I
chemically or photochemically. The 3-OH group is then chemically regenerated, which allows the next cycle of the polymerase reaction to proceed. SBS is performed on arrays of single polynucleot ides or on clusters of identical polynucleotides obtained through a localised amplification of the polynucleotide to be sequenced (WO 2005-065814) Another approach of SBS relies on pyrosequencing which is a real-time sequencing strategy based on the release of pyrophosphate during enzymatic DNA synthesis. A first nucleotide triphosphate is introduced into the polymerase reaction mixture; when it is the correct complement to the target strand, its incorporation results into the release of pyrophosphate that is converted by an enzymatic reaction to a chemiluminescent signal that is detectable. The other three nucleotides are then added independently in an iterative process.
The most successful approach to date developed by 454 Life Sciences uses the amplification of a DNA fragment immobilized on a bead from a single fragment to several million identical copies. This amplification is necessary to generate sufficient identical DNA to obtain a strong signal from the sequencing reaction.
Although these methods are now commercially available, it is not believed that, they are likely to deliver the $1000 US dollars genome. In fact the estimated cost of sequencing a human genome is still around $ I million US dollars.
Nanopores approaches.
A number of research groups rely on nanopore-based sequencing methods to deliver ultra fast and inexpensive genome sequences. The underlying principle of nanopore sequencing is that a single-stranded polynucleotide molecule is electrophoretically driven through a nano- scale pore in such a way that the bases traverse the pore sequentially.
The detection mechanisms incorporated into the nanopore uses the distinct electrical and physical properties of each of the bases. Theoretically very long reads of polynucleotide sequences can be achieved in extremely short time scales.
Despite their great potential for improvement in speed, read length and sensitivity, currently none of these methods has been shown to achieve single nucleotide resolution of pOlynucleotides. An overview of these sequencing approaches can be found on the internet site of the National Human Genome Research Institute (NHGRI) at www.genome.gov.
Scanning probe microscopy (SPM) approaches.
The atomic force microscopy (AFM) was invented in 1986 by Binnig. This technology used a tip attached to the end of a flexible cantilever. The distance between the AFM tip and a target surface is controlled by a piezoelectric device. Scanning the tip across the surface causes the cantilever to deflect. This deflection is due to interactions between the tip and the surface. The deflection is measured by a laser reflected from the surface of the cantilever. This process generates a topographic image of the target surface. The resolution by AFM depends in part on the radius and shape of the tip.
AFM was used to scan a closely packed plasmid DNA adsorbed to a cationic lipid bilayer surface. High-resolution images of DNA double helix with the expected pitch of 3.4 nanometers were obtained (FEBS Letters 1995, 371, 279-282; Journal of Physical Chemistiy B 1997, 101, 441-449). This remarkable result triggered speculations that single nucleotide resolution of polynucleotide might be achieved using more refined tips and this would be the basis of DNA and RNA sequencing using AFM. Although measurable improvements in AFM resolution have been achieved using single-walled carbon nanotubes tips (Nature Biotechnology 2000, 18, 760-763), sequencing by AFM remains an elusive goal.
Lindsay of Arizona State University has proposed a new sequencing technology using Atomic Force Microscopy (AFM) in combination with naturally occurring ring-shaped sugar molecules called cyclodextrins. The method relies on the ring molecules, when coupled to the AFM probe, to serve as sensors to read the sequence of the DNA bases. The cyclodextrins are just big enough to slide a strand of DNA through.
Lindsay proposes to attach the reactive groups on the ring to the sensitivity of an AFM tip, which would thread an anchored DNA molecule into the ring and pull it through, recording the subtle variations resulting from the friction of the different DNA bases with the ring. The resulting data will be translated into the precise sequence of the DNA molecule. This technology has to overcome important technical hurdles to achieve its potential and no data has been published yet.
Rouvain in US 20040214177 describes a closely related sequencing approach in which a device comprises a location for the placement of at least one cyclic molecule (e.g. rotaxane) and a linear polymer (e.g. DNA) is threaded through said cyclic molecule; the tip of an SPM is attached to the cyclic molecule or the linear polymer and a signal resulting from the interaction of the cyclic molecule and each unit of the polymer is produced and read to identify the sequence of the polymer.
Another DNA sequencing method proposed by the Mechatronics Research Laboratory at the Massachusetts Institute of Technology uses an atomic force microscope to measure the specific binding interaction of nucleotides. A single molecule of DNA is denatured and immobilized on an atomically flat surface, and a force probe functionalised with a nucleotide is scanned along the molecule to detect locations of the nucleotide's complement. This method is under development and has not yet demonstrated single nucleotide resolution useful for DNA sequencing.
Scanning tunneling microscopy (STM) was invented by Binnig before the AFM. In STM, a metallic tip is brought very close to a conductive substrate and by applying a voltage between both conductive media; a tunnelling current flows between the two electrodes. The direction of the tunnelling depends on the bias polarity. The exponential distance dependence of the tunnelling current leads to excellent control of the distance between the tip and the surface enabling the achievement of very high resolution on atomically flat conductive substrate. For imaging purposes, the tip and substrate are scanned precisely relative to one another and the current is monitored as a function of the lateral position. The contrast in STM images reflects both topography and electronic effects.
The STM was discovered unexpectedly, to give high resolution images of biological macromolecules such as DNA on mica (an insulator) in humid air (Guckenberger et al Science, 1994, 266, 1538-1540). The STM resolution on humid DNA molecules appears to be revealing the major groove of DNA double helix which has a pitch of 3.4 nm. This level of resolution which corresponds to approximately i 0 base-pairs does not allow direct sequencing of non modified DNA and polyflucleotides using STM.
Researchers have proposed the use of nanolabels or nanocodes to enable polynucleotide detection and sequencing using STM.
Yamakawa etal (US 20050147981) proposes the use of a series of labelled oligonucleot ides including each a known nucleotide sequence and a molecular nanocode to bind to a nucleic acid of unknown sequence. The nanocodes are selected from the group consisting of carbon nanotubes, fullerenes, submicrometer metallic barcodes, nanoparticles and quantum dots. The detection of the nanocodes using Scanning probe microscopy (SPM) allows the determination of the sequence of the target nucleic acid from the sequences of the labelled oligonucleotides. Some knowledge of the sequence of the target nucleic acid is necessary to choose labelled oligonucleoie for the experiment.
A substrate with pathways to align labelled DNA molecules for sequencing by scanning tunnelling microscopy (STM) has also been disclosed, by Sargent et at in PCI International Application WO 96 24,689. In this method, the nucleic acid molecule is modified with a base specific label by analog incorporation during synthesis or by complementation The nucleotide sequence is determined by orienting single-straflde nucleic acid molecules on a surface having one or more linear alignment paths, the path is scanned using SPM and the presence of base specific labels along the length of the molecule are recorded. The process is repeated in other paths with labels specific for each base, and the data sets thus provided are combined to give the complete nucleotide sequence.
Henderson et at (US 6716578) propose a method for determining the order of nucleic acid segments from a target nucleic acid. The method comprises tagging of sequencespecpfic sites of the target nucleic acid with a sequence specific tag, scanning the target nucleic acid using a scanning probe microscope, and analysing the scan to determine the order of nucleic acid segments. This method does not provide single flucleotide resolution and is not easily applicable to the sequencing of unknown nucleic acids.
The SPM-based sequencing technologies of prior art are difficult to implement and have not yet been able to demonstrate individual base resolution necessary for sequencing pOlynucleotides.
There is still a need for a technology with the potential to revolutionise genetic analysis by dramatically improving the speed and reducing the cost of a range of genetic analysis applications, including whole-genome de novo sequencing and resequencing and expression profiling.
We propose herein an invention that combines sequencing by primer extension (SPREX) and scanning probe microscopy in a novel way compared to prior art.
SUMMARY OF THE INVENTION
The invention relates to novel labelled nucleoside phosphoramidites and nucleotide triphosphates and methods for polyflucleotides sequencing using Sequencing by Primer Extension (SPREX) and scanning probe microscopy (SPM). The invention relates also to methods for detecting nucleic acids.
The nucleic acids and polynucleotides referred to herein are naturally occurring or synthetic, they include but are not limited to genomic DNA, cDNA and mRNA.
Viewed from a first aspect, the invention describes the chemical structures of labelled nucleosicie phosphoramidites and nucleotide triphosphates. The labelled nucleoside phosphoramidites may be used in the synthesis of oligonucleot,des using automated sequencers. The labelled nucleotide triphosphates may be incorporated, in solution or on a solid support, to a nascent strand complementary to the target polynucleotides by naturally occurring or genetically modified polymerases. In contrast to sequencing-by-synthesis methods in prior art, the whole complementary strand is synthesised in one step. The nature of the base incorporated is not identified after single incorporation, but instead the sequence of all the bases is determined at once after the complete synthesis of the complementary strand.
The molecular labels (MLA) are attached to the base of the nucleotide or nucleoside through a spacer.
The molecular label may have a contour length from a few angstroms to 50 nm more preferably from 1 to 10 nm. Their chemical structure is based on but not limited to an alkyl or alkenyl chain, a polypeptide, a modified polypeptide, a neutral or charged oligomer or polymer. The preferred example of MLA is a polyelectrolyte chain positively or negatively charged; among polyelectrolytes, rigid and rod-like are preferred.
Another preferred example of MLA is a conjugated, conducting or semi-conducting oligomer or polymer chain based on but not limited to pyrrole, thiophene, phenylene ethynylene, phenylene vinylene, aniline, pyridine and acetylene moieties.
Other m electron rich compounds including but not limited to phthalocyanines, subphthalocyanines, porphyrin are also examples of MLA.
Electro-active moieties including but not limited to ferrocenyl compounds and transition metal complexes are also examples of MLA included in the invention.
Conjugated, rigid and rod-like chemical moieties are the most preferred examples of MLAs.
The MLAs are designed to favour the stable physisorption or chemisorption of the modified oligonucleotides on the substrates prior to scanning.
It will be apparent after examining the entire invention that certain agents that are related may be substituted for the molecular labels described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the present invention.
Viewed from a second aspect, the invention describes the synthesis of complementary strands to target polynucleotides. These complementary strands are obtained by primer extension using the labelled nucleotides and a suitable polymerase. The double stranded or single stranded labelled polynucleotides obtained are then immobilised on a flat surface. In some embodiments the target polynucleotides are immobilised on a flat substrate prior to the synthesis of their complementary strands incorporating the labelled nucleotides.
Viewed from a third aspect, the invention describes modifications of atomically flat surfaces that may be used for polynucleotides immobilisation and scanning probe microscopy.
Viewed from a fourth aspect, the invention provides unprecedented ultra high densities of immobilised polynucleotides that may enable the immobilisation and analysis of a whole genome on a single chip. The scanning of the chip surface with scanning probe microscopy is then used to determine the sequence of all the immobilised polynucleotides.
Viewed from a fifth aspect, the invention describes the sensitive detection of polynucleotides using oligonucleotides probes incorporating the labelled nucleosides described herein in hybridisation assays associating SPM.
In summary:
1) In the case of polynucleoticie sequencing, the invention may comprise the following steps: i) synthesis of complementary strands to target single stranded polynucleotides using the modified nucleotides triphosphates of the invention; ii) immobilisation at a very high density of the double or single stranded newly synthesised polynucleotides onto a flat surface (this second step is omitted when the first step is carried on polynucleotides already immobilised to a surface); iii) determination of the base sequences of all the immobilised polynucleotides on the substrate using scanning probe microscopy techniques including but not limited to Atomic Force Microscopy (AFM), Scanning Tunnelling Microscopy (STM), Scanning Electrochemical Microscopy (SECM), Scanning Near field Optical Microscopy (SNOM).
2) In the case of nucleic acids detection, the invention may comprise the following steps: I) synthesis of single stranded oligonucleotide probes containing at least two modified nucleotides, ii) contacting the probes with unknown polynucleotides in conditions favouring hybridisatior,, iii) immobilisation of the products of the reaction on a flat surface, iv) scanning of the flat substrate using scanning probe microscopy and analysis of the image obtained to identify hybridisation events. In an alternative, the single stranded probes may be immobilised to the flat substrate prior to the hybridisation experiment with unknown polynucleotides. The identification of hybridisation events is based on the fact that the binding of a complementary strand to the probe triggers structural changes that can be readily visualized using SPM.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1: Figure Ia shows a schematic representation of the structures of the labelled nucleotide triphosphates that may be used according to the present invention, the example of an amino propargyl group is chosen between the base and the spacer but other groups known to those skilled in the art may be used. Figure 1 b illustrates the schema of figure Ia in the case of deoxy nucleotide triphosphate (dNTPs). The spacer and the molecular label (MLA) are not drawn to scale.
Figure 2: Figure 2a shows a schematic representation of the structures of the labelled nucleotide phosphorarnidites that may be used according to the present invention, the example of an amino propargyl group is chosen between the base and the spacer but other groups known to those skilled in the art may be used. Figure 2b illustrates the schema of figure Ia in the case of deoxy nucleotide phosphoramidites. The spacer and the molecular label (MLA) are not drawn to scale.
Figure 3: Figure 3 shows examples of molecular label (MLA) structures based on phenylene ethynylene, phenylene, phenanthroline ethynyfene, phenylenevinylene. X is any reactive chemical group known in the art including but not limited to amine, carboxylic acid.
Figure 4: Figure 4 shows examples of MLA structures based on thiophene, pyrrole and furan. X is any reactive chemical group known in the art including but not limited to amine, carboxylic acid.
Figure 5: Figure 5 shows examples of MLA structures based on aniline and fluorene.
Figure 6: Fig 6 shows examples of MLA structures based on oligopeptides and incorporating natural and non-natural amino acids.
Figure 7: Fig 7 shows examples of modified nucleotides nucleotides triphosphates bearing an oligothiophenebased MLA.
Figure 8: Fig 8 shows other examples of modified nucleotides nucleotides triphosphates bearing an oligothiophene-based MLA.
Figure 9: Fig 9 shows examples of modified nucleotides nucleotides triphosphates bearing an oligophenyIeneethynylenebas MLA.
Figure 10: Fig 10 shows other examples of modified nucleotides nucleotides triphosphates bearing an oligophenyleneethynylenebased MLA.
DETAILED DESCRIPTION OF THE INVENTION
The invention describes methods for polynucleotide sequencing using Sequencing by Primer Extension (SPREX) and scanning probe microscopy (SPM) and methods for detecting nucleic acids. The invention describes also modified nucleotides that can be used for the aforementioned polynucleotide sequencing and nucleic acid detection.
The invention provides for an advance over the current sequencing by synthesis methods in that it eliminates time-consuming repetitive and expensive steps inherent to sequencing by synthesis in which the identity of the base has to be determined after each incorporation and the fluorophore and the blocking group on 3' position of the sugar has to be cleaved before the next incorporation. In prior art, the use of fluorescent labels limits the density of the polynucleotides on the array and the read length of the polynucleoticje and ultimately affect the throughput. The inventions allows for the preparation of unprecedented ultra high density arrays of polynucleotides and the determination of all their base sequence at once.
By allowing single base identification, the invention provides also an advance over the scanning probe microscopy methods that have yet to achieve single-nucleotide resolution.
The nucleotide or nucleoside molecules of the invention have each a base that is linked to a molecular label (MLA) via a non-cleavable linker.
The base may be a purine, or a pyrimidine. The base can be a deazapurine. The molecule may have a ribose or deoxyribose sugar moiety. Although the nucleotide and nucleoside examples provided herein show the 2'-deoxyribose sugar moieties found in DNA, the invention applies also to nucleotides and nucleosides containing the ribose sugar moiety found in RNA.
Labelled nucleoside phosphoramjd,tes of the invention that are suitable for oligonucleotide synthesis using automatic synthesisers have at the 5' position a dimethoxytrityl (DM1) protecting group and at the 3' position a 2-cyanoethyl-N, N diisopropyl phosphoramidite moiety.
It will be apparent to those skilled in the art that other reactive groups may be used to obtain the same result. Those reactive groups are also within the scope of the invention.
Labelled nucleotides of the invention that may be substrate to a polymerase have a triphosphate moiety at their 5' position and a free OH at their 3' position.
Figure 1 and Figure 2 give a schematic view of the modified nucleotides and nucleosides of the invention. In these examples 2'-deoxyribose is used but it should be noted that an hydroxyl group may be introduced at the 2' position of the sugar to include also the case of nbose.
Other synthetic nucleotide or nucleoside derivatives having modified base moieties and/or modified sugar moieties can be used in conjunction with the MLA described herein provided that they are capable of undergoing Watson-Crick base pairing. Such derivatives are described for example by Uhlman et al., Chemical Reviews 90: 543-584, 1990.
A preferred embodiment of the invention makes use of the more common purine and pyrimidine bases. In this instance, the linker connecting the base and the MLA is preferentially attached via the 7-position of the purine or the preferred deazapurine analogue, via an 8-modified punne, via an N-6 modified adenosine or an N-2 modified guanine. For pyrimidines, the attachment of the linker is preferably via the 5-position on cytidine, thymidine or uracil and the N-4 position on cytosine. -Although the invention will be further described with an emphasis on DNA, it should be noted that the descriptions will also be applicable to RNA, PNA, and other nucleic acids.
A variety of linkers may be used provided that they have the following characteristics.
They should hold the MLA at a sufficient distance from the nucleotide so as not to interfere with the activity of the enzyme and the Watson Crick base pairing. They should also favour the stable immobilisation of the polynucleotides and may help the ordering of the MLA on the surface used for scanning probe microscopy.
Suitable linkers may include, but are not limited to, saturated or unsaturated alkyl chains, linear or cyclic alkyl chains, oxyethylene units and aromatic or heteroaromatic moieties.
Linkers may also be prepared using standard peptide synthesis techniques with any natural and synthetic amino acid building blocks. For example, commercially available 6-aminohexanoic acid may be incorporated in the linker.
Combinations of the above moieties may be used within one linker. The incorporation of oxyethylene units andlor of hydrophilic or charged naturally occurring or synthetic amino acids may enhance solubility in water.
The linker may be attached to the base through an aminopropyl, an aminopropenyl or an aminopropynyl moiety; the amino group provides a functionality that may react with the linker to form an amide bond for instance. The aminopropynyl moiety is preferred.
The methods of the present invention are different from prior art because they make use of non conventional molecular labels (MLA) that are not required to be fluorescent for the nucleosides and nucleotides.
Modified nucleotides or nucleosides are commonly labelled with fluorescent or luminescent groups to allow the detection of the polynucleotides incorporating them.
Examples of such modified nucleotides can be found in W0200401 8493, US6573374, US 20050170367.
Nucleotides may also be labelled with bulky groups selected from the group consisting of nanoparticles, carbon nanotubes, fullerenes, quantum dots and dendrimers as described in US 20050026163, US 20040219596, US 20050147981.
The labels in prior art do not allow the identification at once of all the bases in a polynucleotide strand. In the case of fluorescent or luminescent labels, the diffraction of light significantly reduces the density of detectable labels per surface unit and therefore limits the throughput. The large size of the bulky labels and their random orientation on a scanning surface do not allow their use for single-nucleotide detection and identification in a polynucleotide.
The MLA of the invention are designed to overcome the limitations of the labels of prior art and allow the identification at once of all the bases in all the polynucleotide molecules immobilised on a flat surface using SPM. In certain embodiments, the MLAs described herein allow unprecedented high densities of polynucleotides to be immobilised and sequenced.
The MLAs of the invention are also designed to enable the ultrasensitive detection of polynucleotides and nucleic acids.
In the study of linear macromolecular brush copolymers synthesised by atom transfer radical polymerisation, Matyjaszewskj et al (Chemical Review 2001, 101, 292 1-2990) discovered that if the grafted chains of the polymer brush were sufficiently long, they could be individually visualised by atomic force microscopy (AFM).
Minko et at (J. Am. Chem. Soc. 2005, 127, 15688-15689) reported recently the visualisation of single poly(2-vinylpyrid,ne) molecules at the solid-liquid interface using AFM.
It follows that polymer chains grafted on polynucleotides may be visualised individually byAFM.
There are numerous reports of studies of the adsorption of molecules at the solid-air and solid liquid interfaces using STM. Molecular and sub-molecular resolution are routinely achieved with it-electron-rich conjugated molecules including but not limited to oligothiophene (Langmuir 2003, 19, 3350-3356), oligophenylene ethynylene ( Langmuir 2004,20,8892-8896) oligophenylene vinylene (Journal of Physical ChemistryB, 2005, 109, 4290-4302), phthalocyanines (Langmuir 2006, 22, 723-728 ).
Molecular resolution has also been achieved with long chain hydrocarbon molecules using STM.
The self-assembly properties of it-conjugated systems are well recognised (Chemical Reviews 2005, 105, 1491-1546).
The design of the MLAs of the invention is based on the recognition that molecular and sub-molecular resolutions are achieved with certain molecules under certain conditions when using SPM.
The present invention teaches the combination of these moieties that possess self-assembly and/or good immobilisation properties at surfaces and that can be detected by SPM at the molecular level with nucleotides and nucleoside to generate labelled oligonucleotides and polynucleotides useful for sequencing and detection of nucleic acids.
The first embodiments of the MLAs of the invention are oligopeptides and polypept ides containing naturally occurring or synthetic aminoacids. In prior art, oligopeptides were only used as spacers between the base and a fluorophore but not as detectable labels.
MLAs of varying length may be prepared using standard peptide synthesis techniques with any amino acid building blocks. Their chemical and physical properties may be easily tailored by the appropriate choice of aminoacids. The aminoacids are chosen to enhance solubility, stable adsorption on the flat substrate and sub-molecular resolution by SPM.
Figure 6 gives examples of MLAs based on oligopeptides. These examples show that different moieties and aminoacids may be combined to favour solubility, structural, electronic, self-assembly and adsorption properties. The examples are given as an illustration and not a limitation of the various structures of MLAs that may be obtained with aminoacids.
As a general rule, any polymer or oligomer chain from 3 angstroms to 50 nm in length, more preferably from I to 10 nm in length detectable by SPM may be used as MLA. A preferred example is polyelectrolyte and oligoelectrolyte chains; among these rigid and rod-like are most preferred.
Other embodiments of the MLAs of the invention are monomer, oligomer or polymer chain containing it-electron-rich, conjugated or conducting moieties including but not limited to pyrrole, thiophene, phenyleneethynylene, phenylenevinylene, fluorene,thienylenevinylene, aniline, pyridine, acetylene, phthalocyanines and aromatic polycyclic compounds. These moieties may bear substituents to enhance their solubility in water, their stable adsorption and self-ordering at the solid-air or solid-liquid interfaces.
Conjugated, rigid and rod-like chemical moieties are the most preferred examples of MLAs.
Figure 3, Figure 4 Figure 5 give examples of chemical stwctures of these MLAs.
Different MLAs may be obtained by varying the length, chemical nature and structure of the above moieties. Other modifications apparent to one skilled in the art may be introduced as well after considering the entirety of the invention.
It will also be apparent to one skilled in the art that other molecules that are it-electron rich and/or conjugated may be substituted for the examples of MLA described above while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the present invention.
As a general point, monomers and oligomers are preferred to polymers because unlike the latter their properties can be easily tailored.
Although molecules containing it-electron-rich moieties are the preferred MLAs, saturated long chain hydrocarbon molecules that allow a strong adsorption on HOPG and sub-molecular resolution by STM may also be used as molecular labels.
To allow sequencing, each of the four bases bears a specific MLA enabling its unambiguous identification. The labelled nucleotides and nucleosides may be made with all identical spacers and different MLA Alternatively, combinations of different spacers and different MLA may be used.
The modified nucleotides and nucleosides bearing MLA are readily synthesised by one skilled in the art of organic synthesis.
A number of the substructures of the target modified nucleotides and nucleosides are available commercially. Modified nucleoside phosphoramidites and nucleotide triphosphates with an attached amino terminated spacer can be bought from a number of companies such as Eurogentec, Glen Research and others.
FmocNHNNO Example of modified nucleotide commercially available
NC-O 1N
Some of the it-rich monomers, oligomers and polymer are also available commercially. A number of oligothiophenes, oligopyrrole and oligofuran bearing various substituents may be bought from Organic Electronic Chemicals.
R 1f$I11R8 X, Y, Z=S, NH, NR or OR R1-R8 = H, alkyl, Ar, OR, SR or any other substituent n = 0-6 Examples of conjugated heterocycles of Thiophene, Pyrrole and Furan commercially available.
The synthesis of the other MLAs of the invention are known from one skilled in the art.
For a review of some of the synthesis see: Chemical Reviews 1999, 99, 1863-1933; Chemical Reviews 2000, 100, 1605-1644; Chemical Reviews 2000, 100, 253 7-2574; Chemical Reviews 2005, 105, 1197-1279; Langmuir 2005, 21, 7860-7865; Tetrahedron, 2004, 60, 6285-6294.
The oligopeptide-based MLA are readily synthesised by automated solid phase peptide synthesis from commercial amino acids.
The MLA may have a reactive moiety (for instance a carboxylic acid group) allowing its attachment to the rest of the modified nucleotide or nucleoside.
The skilled person will appreciate how to covalently link the different substructures to obtain the labelled nucleotides and nucleosides of the invention.
Scheme 1 shows an example of the synthesis of dUTP bearing a MLA. Uridine is shown as an example but the synthesis outline may be applied to the other bases.
The first step of the synthesis is the attachment of a linker (in this case 6-aminohexanoic acid) to the chosen MLA (a substituted oligothiophene in this case 1) which is in the form of a reactive N-hydroxy succinimide (NHS) ester. The second step is the activation of the carboxylic acid of the product of the first step. The last step is the covalent coupling of a known propargylamine derivative of dUTP 2 to the linker-MLA adduct.
Compounds 1 and 2 as well as the aminohexanoic acid, are commercially available. n is an integer from I to 50 more preferably from 1 to 20, even more preferably from 1 to 6.
This synthesis example is provided by way of illustration, it will be obvious to those skilled in the art that various changes and modifications can be made to achieve the synthesis of the modified nucleotides and nucleosides bearing the MLA from non commercial substructures that are readily prepared from commercial building blocks.
In situ scanning tunnelling microscopy of adsorbed inorganic transition metal complexes and metallo-phthalocyanines allow molecular and sub-molecular resolution of the adsorbates as described for instance in Lan gmuir 2004, 20, 3 159-3165 and Langmuir 2006, 22, 2105-2111. These metal complexes as wel as other red-ox active compounds known to the skilled person are used as molecular labels in some embodiments of the invention.
Scheme I / + + Nç / N HO1NH2 \+
-N-N /
-N -
acid adivation H2N
HO HO HO
Hoo\ 0
P P
II II
-- 0
OH \+ 2
N /\ NL
HJL
S HOHO HO
-N---HOO\o,O
OH
Figure 7, Figure 8, Figure 9 and Figure 10 give some examples of modified nucleotide triphosphates of the invention.
In these figures, the replacement of the triphosphate moiety by a dimethoxy tntyl group (DM1) and the attachment of a cyanoethyl phosphoramidite moiety to the 3' hydroxyl yield the corresponding nucleoside phosphoramidites useful for the synthesis of oligonucleotide.
The invention provides a method of sequencing by primer extension (SPREX) of target polynucleotides using the modified nucleotides triphosphates described above and suitable polymerases.
The target polynucleotides sequenced may include genomic DNA, cDNA, RNA and other naturally occurring or synthetic nucleic acids.
When the target polynucleotide is genomic DNA, it is purified using standard methods.
The genomic DNA may be first amplified by PCR or directly fragmented using suitable enzymes or other forms of fragmentation (i. e. chemical or mechanical). One of the strands of the DNA fragment is either ligated to a hairpin oligonucleotide which provides a self-priming moiety or is ligated to an oligonucleotide with a sequence complementary to the primers to be used. Single stranded DNA fragments are then generated by denaturation. The details of these techniques as well as other techniques capable of achieving the same results will be apparent to the skilled person.
To carry out the primer extension reaction the first step is usually to anneal a primer sequence to the target polynucleotide. Alternatively, the primer and the target polynucleotide may be part of the same molecule when the target polynucleotide was ligated to a hairpin loop structure using a ligation reagent such as a ligase enzyme.
The primer sequence is recognised by the polymerase enzyme and acts as an initiation point for the extension of the complementary strand. Preferably, all the 4 modified nucleotides (A, C, T, G) are then brought into contact with the target polynucleotide simultaneously, to allow the synthesis of the complementary strand.
Other conditions necessary for carrying out the polymerase reaction, including temperature, pH, buffer compositions etc., will be apparent to those skilled in the art.
Many different polymerase enzymes may be used for primer extension, and it will be apparent to the person of ordinary skill which is most appropriate to use. Preferred enzymes include but are not limited to Taq polymerase, Vent (exo-) polymerase, Pwo, DNA polymerase I, polymerase Ill, Klenow fragment, Iii polymerase and ThermalAce polymerase. Examples of such appropriate polymerases are disclosed in Nucleic Acids Research, 2003, 31, 2360-2365; Nucleic Acids Research, 2003, 31, 2636-2646, Journal of the American Chemical Society 2005, 127, 15071-15082; Proc. Natl. Acad. Sc USA, 1996 (93), pp 5281-5285, Nucleic Acids Research, 1999 (27), pp 2454- 2553 and Acids Research, 2002 (30), pp 605-613.
Other polymerases genetically modified to improve their incorporation.of the modified nucleotides may also be used.
Primer extension of a large number of polynudeotides may be performed in solution.
In the case of amphiphilic nucleotides triphosphates containing hydrophilic and hydrophobic substructures (for example a hydrophilic spacer and an hydrophobic molecular label), mixture of water and organic solvent may be used. Primer extension may also be performed in emulsions and vesicles. In one embodiment, the inner medium of the droplet is aqueous and contain the target polynucleotide, a polymerase. The hydrophilic part of the nucleotide triphosphate containing the base is solubilised in the aqueous medium (and is available for the polymerase reaction) while its hydrophobic part is solubilised in the continuous organic medium around the aqueous droplet.
The duplexes obtained by any of the above protocols or other protocols known in the art may be purified using methods such as gel filtration and alcohol precipitation.
Others purification methods will be apparent to the skilled person. The purified duplexes bearing the MLA are then immobilised on an atomically flat surface at a very high density before scanning with SPM. The complementary strands are immobilised either as duplex or single strands.
The ultra high density of polynucleotide immobilised is an important feature of the invention. It is estimated that an entire genome could be immobilised on a flat surface a few centimetres square and analysed. The density achieved here would be several order of magnitude larger than the densities achieved in prior art.
Alternatively the target polynucleotides may be immobilised on the flat surface via any suitable covalent or non covalent linkage of which many are known in the art prior to primer extension. In this case the unincorporated labelled nucleotides can be washed off along with others impurities before scanning.
For example, the target polynucleotides already ligated to a hairpin may be attached covalently to the surface by the reaction of a chemical functionality on the hairpin (i. e.
a nucleophile) that is complementary to a chemical functionality (electrophile) on the surface. Alternatively the hairpin may bear a biotin moiety that will interact with streptavidin molecules adsorbed on the surface. Other immobilisation strategies will be apparent to the skilled person.
In the case the target polynucleotide has portions complementary to the primers to be used, the latter may be first immobilised on the surface as described for the hairpin followed by the hybndisation of the target polynucleotide to the surface bound primers.
Examples of the immobilisation surfaces include but are not limited to freshly cleaved muscovite mica, chemically modified or not, graphite, ultra flat metallic surfaces grown on mica (e.g. Au (111)), highly oriented pyrolitic graphite (HOPG). The immobilisation surface may be modified first to be neutral, positively or negatively charged using methods including but not limited to silanization, self assembled monolayers (SAMS), lipid bilayer, Langmuir-Blodgett films, polyelectrolytes deposition, and metal ions.
In one embodiment, the polynucleotides may be co-immobilised on the surface with suitable molecules chosen for example to impart stability, enhance detection of the MLA or to introduce any other useful property.
In other embodiments, intercalators such as ethidium bromide may be used to increase the length of the nucleic acid strand and augment the spatial separation of the MLA decorating the polynucleotide.
Scheme 2 Polymerase Genomic DNA fragments ligated to hairpin MLA-dNTPs in polymerase buffer 1) primers extension: synthesis of complementary strands to target polynucleotides I I 2) purification of the duplex obtained (native PAGE (15%), alcohol precipitation) 3) Ultra high density immobilisatlon of the polynucleotides labelled with MLAs on a flat surface (e.g. mica, HOPG) 4) Scanning of the surface using SPM and recording of a topographic' image
I I SMALL PORTION OF THE
RECORDED IMAGE
polynucleotide backbone MLAs Scheme 2 shows by way of illustration, not limitation the steps involved in the sequencing of target genomic DNA.
In the case of nucleic acids detection, the first step is the synthesis of single stranded oligonucleotide probes containing several of the labelled nucleotides described herein.
The oligonucleoticle probes have sequences complementary to the target sequences to be detected. The target sequences are from any synthetic or naturally occurring nucleic acids (e.g. nucleic acids from pathogens).
In the second step, the single stranded probes are contacted with target nucleic acids mixtures in conditions favouring hybridisation.
The third step is the immobilisation of the products of the reaction on a flat surface using methods known in the art.
The fourth step is the scanning of the flat substrate using scanning probe microscopy and analysis of the images obtained to identify hybridisation events.
In an alternative, the single stranded probes may be immobilised on the flat substrate prior to the hybridisation experiment with target polynucleotides.
The distribution of the MLA on the probe is such that hybridisation to a complementary polynucleotide induces structural changes in the appearance of the probe that are
detectable by SPM
For example,the identification of hybridisation events may be based on the observation of the ordering of the MLA of each probe. In the absence of hybridisation, single stranded oligonucleotides are easily bent so the attached MLA may be disordered but upon hybridisation with a complementary strand and formation of a duplex an ordering of the MLA may be observed.
The nucleic acid detection may be practised using a very high number of probes.
Each probe has a specific coding system allowing its unambiguous identification. The identification code may be obtained by varying the number and relative position of MLA on the oligonucleotide probes or by using MLA of various lengths or chemical nature. The examples given are for the purpose of explanation and not limitation, others ways of tagging the probes in a specific way using the MLA will be evident to the skilled person.
Scanning probe microscopy techniques used to scan the samples include but are not limited to STM, ECSTM, SECM, AFM, and SNOM. The samples are scanned at the solid/air or solid/liquid interfaces in ambient conditions or at low temperatures using commercially available or custom-built instruments. A wealth of experimental conditions and set-ups apparent to the skilled person may be used. An illustration of experimental conditions that may be used is given in Example 4.
Example I
Treatment of mica with positively charged polyelectrolytes.
A 3g/mI solution in water of polyallylamine hydrochloride from Aldrich is prepared. A drop of this solution is deposited on a freshly cleaved mica surface so as to cover the whole surface. Incubation is performed in a humid chamber for 30 minutes then the mica surface is rinsed gently with Milli-Q water. The treated mica surface is dried in a nitrogen stream and used immediately for polynucleotides immobilisation.
The same protocol is used for other positively charged polyelectrolytes including but not limited to linear and branched polyethyleneimine, poly(diallyl dimethyl ammonium chloride).
Example 2
Preparation of lipid bilayer on a mica surface.
A 0.5 mg/mI solution of dipalmitoyl-trimethyl-ammoniumpropane in 20 mM NaCI at pH 6 iss prepared. This solution is sonicated repeatedly to obtain small unhlamellar vesicles. Clean bilayers are formed when a droplet of the vesicle solution is allowed to incubate overnight on a freshly cleaved mica surface which is then heated at 70 C for minutes.
Example 3
Primer extension.
A sample of genomic DNA purified from a blood sample is subjected to one of several known methods to fragment it into 1000 bp polynucleotide portions. One strand of each polynucleotide is ligated to a hairpin polynucleotide by a ligase enzyme and the other strand is removed after the ligation to yield the target polynucleotide to be sequenced. The details of this method are known to those skilled in the art.
In this example labelled 2'-deoxy nucleotides triphosphates bearing oligothiophene-based MLA of the type described in Figure 7b are used: n = 2 for A, n = 3 for T, n = 4 for G, n = 5 for C. Thermostable inorganic pyrophosphatase (2 U), Vent (exo-) DNA polymerase (10 U), the target polynucleotides (diluted to a total concentration of 10 pM), and the four labelled dNTPs (final concentration 250 jtM each) are mixed in 20.tl of the polymerase reaction buffer (provided by the supplier of the DNA polymerase). The extension reaction is performed by heating the mixture to the optimal temperature of the enzyme in a thermocycler for 1 hour. The reaction is stopped by adding a 60 jii of a 80% formamide solution containing 20 mM EDTA and heating at 99 C for 10 minutes.
The duplexes obtained are purified on native polyacrylamide gel electrophoresis PAGE (15%) 4 C, extracted from the gel and purified by precipitation with ethanol twice.
Example 4
The pOlynucleotides duplexes obtained in Example 3 are dissolved in an appropriate buffer (e. g. 10 mM IRIS, 1mM EDTA). The concentration of polynucleotides in solution is chosen to obtain the desired high density coverage of the surface. The typical concentrations of polynucleotides used are in the range I ng/ml. A volume of this solution sufficient to cover the pre-treated mica described in Example 1 is allowed to incubate on top of the surface for one hour. The surface is then gently rinsed with MilIi-Q water and dried under nitrogen gas. The surface is then scanned using Hydration Scanning Tunneling Microscopy (HSTM) in humid air. This technique is based on the electrical conductivity of molecularly thin water layers which adsorb to the sample surfaces in a humid atmosphere. It allows reliable imaging of biological specimens and even insulators as long as they are hydrophilic.
HSTM is carried out in a humid atmosphere using a low-current scanning tunneling microscope (e.g. Rochester Hills, MI (RHK)). Mechanically cut and electrochimically etched PtIIr (platinum/Iridium) tips are used. The surface is scanned either in a constant height mode or in a constant current mode. Settings for the tunneling current the bias voltage used range from 0.05 to I nA and from -0.1 to -0.9V respectively.
A topographic image of the surface is obtained as schematically illustrated in Scheme 2 in which only a very small portion of the surface (with dimensions in the nanometer range) is represented. An analysis of the image by measuring the length and ordering of the surface features provides the sequences of all the polynucleotides immobilised on the surface.
Claims (25)
- TITLE OF THE INVENTION: SCANNING PROBE MICROSCOPY-BASED POLYNUCLEOTIDESDETECTION AND SEQUENCING.What is claimed is 1. A method for determining the sequence of polynucleotides comprising the following steps: (i) Synthesis in a single step and in solution of complementary strands to target single stranded polynucleotides using suitable primers, polymerases and the modified nucleotides described therein.(ii) Immobilisation at a very high density of the double or single stranded newly synthesised polynucleotides onto a flat surface.(iii) Scanning of the flat substrate containing a very high density of the synthesised polynucleotide using scanning probe microscopy techniques including but not limited to Atomic Force Microscopy (AFM), Scanning Tunnelling Microscopy (STM), Electrochemical STM (ECSTM), Scanning Electrochemical Microscopy (SECM),Scanning Near field Optical Microscopy (SNOM).(iv) Determination of the base sequences of all the immobilised polynucleotides on the substrate by analysing the scanning image revealing the labels of the incorporated modified nucleotides.
- 2. A method for detecting nucleic acids based on the following steps: (i) Synthesis of polynucleotide probes using at least two modified nucleotides described herein.(ii) Contacting the probes with unknown nucleic acids in conditions favouring hybridisation in solution or on a flat surface.(iii) lmmobilisation of the products of the solution hybridisation on a flat surface.(iv) Scanning of the flat surface using Scanning Probe Microscopy (SPM) and identifying the hybndised probes by analysis of the image obtained by scanning.
- 3. A method according to claim 1, wherein the synthesis of complementary strands is performed on target polynucleotides already immobilised at high density on a flat substrate.
- 4. The method of claim 1, wherein the modified nucleotides are deoxyri bonucleotide triphosphates or ribonucleotide tnphosphates
- 5. The method of claim 2, wherein the modified nucleotides are deoxyribonucleotide phosphoramidjtes or nbonucleotide phosphoramjdjtes.V
- 6. The method of claim 2, wherein the synthetic polynucleotide probes are immobilised on a flat surface prior to hybridisation with target nucleic acids.
- 7. A method according to claims 1 and 2, wherein the modified nucleotides incorporate non-natural bases and/or non natural sugar moieties.
- 8. A method according to claims 1 and 2, wherein the modified nucleotide has a base that has an attached molecular label with a length from 3 angstroms to 100 nanometers and more preferably from I to 10 nanometers.
- 9. A method according to claim 8, wherein the base is attached to the molecular label via a non-cleavable linker.
- I0.A method according to claims 1 and 2, wherein the modified nucleotide has a base that is linked to a molecular label that is an alkyl, alkenyl or aryl chain between 6 and 50 repeat units and preferably between 10 and 30 repeat units.
- 11. A method according to claims I and 2, wherein the modified nucleotide has a base that is linked to a molecular label that is an oligopeptide or a polypeptide incorporating natural and non-natural amino acids, an oligo pseudo peptide or poly-pseudo peptide incorporating natural and non-natural amino acids.
- 12. The method of claim 11, wherein the oligopeptide, the polypeptide, the oligo pseudo peptide and poly pseudo form a polyproline II helix (PPII).
- 13. A method according to claims 1 and 2, wherein the modified nucleotide has a base that is linked to a molecular label that is a neutral or charged oligomer or polymer.
- 14.A method according to claims I and 2, wherein the modified nucleotide has a base that is linked to a molecular label that is a polyelectrolyte more preferably a rigid and rod-like polyelectrolyte
- 15.A method according to claims 13 and 14, wherein the neutra' or charged oligomer or polymer or polyelectrolyte contains aromatic and/or nelectron rich moieties.
- 16. A method according to claims 1 and 2, wherein the modified nucleotide has a base that is linked to a molecular label that is a conjugated or conducting or semi-conducting oligomer or polymer chain based on but not limited to the following moieties pyrrole, thiophene, phenylene ethynylene, phenylene vinylene, fluorene, aniline, pyndine and acetylene.
- 17.A method according to claims land 2, wherein the modified nucleotide has a base that is linked to a molecular label comprising an electro-active moiety including but not limited to ferrocenyl compounds, transition metal complexes, metal lo-phthalocyanine and porphine.
- 18.A method according to claims 1 and 2, wherein the molecular label further reacts with chemical groups on the flat surface on adsorption of the labelled polynucleotides.
- 19.A method according to claims I and 2, wherein the molecular label on the modified nucleotide allows a stable adsorption of the labelled polynucleotides on the flat substrate used.
- 20. A method according to claims 1 and 2, wherein the molecular label on the modified nucleotide can be visualised by SPM.
- 21.A method according to claims 1 and 2, wherein the molecular labels of at least two consecutives bases of the modified polynucleotide can be resolved by SPM.
- 22. A method according to claim 1, wherein each of the four bases of the modified nucleotides has a distinctive molecular label allowing its identification by SPM.
- 23.A method according to claim I, wherein all the molecular labels on a synthesised complementary strand can be visualised by SPM.
- 24.A method according to claims 1 and 2, wherein the flat surface or substrate is selected from a group comprising but not limited to mica, highly oriented pyrolitic graphite (HOPG), ultraflat metallic surfaces grown on HOPG such as but not limited to Au(111) and Cu(111).
- 25.A method according to claim 24, wherein the flat surface or substrate is chemically modified prior to the adsorption of polynucleotides.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0705367A GB2447679A (en) | 2007-03-21 | 2007-03-21 | Scanning probe microscopy-based polynucleotide sequencing and detection |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0705367A GB2447679A (en) | 2007-03-21 | 2007-03-21 | Scanning probe microscopy-based polynucleotide sequencing and detection |
Publications (2)
Publication Number | Publication Date |
---|---|
GB0705367D0 GB0705367D0 (en) | 2007-04-25 |
GB2447679A true GB2447679A (en) | 2008-09-24 |
Family
ID=38008783
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB0705367A Withdrawn GB2447679A (en) | 2007-03-21 | 2007-03-21 | Scanning probe microscopy-based polynucleotide sequencing and detection |
Country Status (1)
Country | Link |
---|---|
GB (1) | GB2447679A (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0410618A2 (en) * | 1989-07-24 | 1991-01-30 | Arizona Board Of Regents | Method for visualizing the base sequence of nucleic acid polymers |
WO1996024689A1 (en) * | 1995-02-07 | 1996-08-15 | Sargent Jeannine P | Method and apparatus for determining the sequence of polynucleotides |
WO2002022889A2 (en) * | 2000-09-11 | 2002-03-21 | President And Fellows Of Harvard College | Direct haplotyping using carbon nanotube probes |
US20030232346A1 (en) * | 2002-06-17 | 2003-12-18 | Xing Su | Nucleic acid sequencing by signal stretching and data integration |
WO2004038037A2 (en) * | 2002-09-20 | 2004-05-06 | Intel Corporation | Controlled alignment of nano-barcodes encoding specific information for scanning probe microscopy (spm) reading |
WO2005066368A2 (en) * | 2003-12-31 | 2005-07-21 | Intel Corporation | Methods and compositions for detecting nucleic acids using scanning probe microscopy and nanocodes |
-
2007
- 2007-03-21 GB GB0705367A patent/GB2447679A/en not_active Withdrawn
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0410618A2 (en) * | 1989-07-24 | 1991-01-30 | Arizona Board Of Regents | Method for visualizing the base sequence of nucleic acid polymers |
WO1996024689A1 (en) * | 1995-02-07 | 1996-08-15 | Sargent Jeannine P | Method and apparatus for determining the sequence of polynucleotides |
WO2002022889A2 (en) * | 2000-09-11 | 2002-03-21 | President And Fellows Of Harvard College | Direct haplotyping using carbon nanotube probes |
US20030232346A1 (en) * | 2002-06-17 | 2003-12-18 | Xing Su | Nucleic acid sequencing by signal stretching and data integration |
WO2004038037A2 (en) * | 2002-09-20 | 2004-05-06 | Intel Corporation | Controlled alignment of nano-barcodes encoding specific information for scanning probe microscopy (spm) reading |
WO2005066368A2 (en) * | 2003-12-31 | 2005-07-21 | Intel Corporation | Methods and compositions for detecting nucleic acids using scanning probe microscopy and nanocodes |
Non-Patent Citations (2)
Title |
---|
DNA Sequence (1996), Vol 6, p 199-209, "An improved method for the synthesis of...", Bridgman & Petersen * |
Medical Engineering & Physics (2006); Vol 28, pp 956-962, "AFM images of short oligonucleotides on...", Humenik et al * |
Also Published As
Publication number | Publication date |
---|---|
GB0705367D0 (en) | 2007-04-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220064741A1 (en) | High throughput nucleic acid sequencing by expansion | |
CN106687574B (en) | Compositions, systems and methods for detecting events using tethers anchored to or near nanoparticles | |
CN107835858B (en) | Compositions, systems, and methods for sequencing polynucleotides using tethers anchored to polymerases adjacent to nanopores | |
US7476786B2 (en) | Controlled alignment of nano-barcodes encoding specific information for scanning probe microscopy (SPM) reading | |
EP1960550B1 (en) | Probe for nucleic acid sequencing and methods of use | |
US7705222B2 (en) | Controlled alignment of nano-barcodes encoding specific information for scanning probe microscopy (SPM) | |
DK2576818T3 (en) | A method for DNA sequencing by hybridization | |
EP1247815A2 (en) | Modified oligonucleotides and uses thereof | |
CN102365367A (en) | High throughput nucleic acid sequencing by expansion and related methods | |
US20090118140A1 (en) | Method and system for assembly of macromolecules and nanostructures | |
GB2447679A (en) | Scanning probe microscopy-based polynucleotide sequencing and detection | |
EP3110965B1 (en) | Method of identifying a nucleotide at a defined position and determining the sequence of a target polynucleotide using an electro-switchable biosensor | |
Di Giusto et al. | Special-purpose modifications and immobilized functional nucleic acids for biomolecular interactions | |
KR20150071876A (en) | Method for sequencing nucleic acids using atomic force microscope | |
JP2022524982A (en) | Devices and methods for biopolymer identification | |
US20240158849A1 (en) | Compositions and methods for sequencing using polymers with metal-coated regions and exposed regions | |
JP5100236B2 (en) | Single-stranded nucleotide multimer extension immobilization substrate and method for producing the same | |
Lunn | Exploiting DNA surfaces for sensing and nanomaterial applications | |
Cauchi | Towards a Microfluidics-Based Nucleic Acid Biosensor Using Immobilized Quantum Dot–DNA Conjugates for FRET Detection of Target Oligonucleotides |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WAP | Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1) |