EP3436602A1 - Nanopore protein conjugates and uses thereof - Google Patents
Nanopore protein conjugates and uses thereofInfo
- Publication number
- EP3436602A1 EP3436602A1 EP17715423.4A EP17715423A EP3436602A1 EP 3436602 A1 EP3436602 A1 EP 3436602A1 EP 17715423 A EP17715423 A EP 17715423A EP 3436602 A1 EP3436602 A1 EP 3436602A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- nanopore
- protein
- hemolysin
- alpha
- dna
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 238
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 223
- 101000844752 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) DNA-binding protein 7d Proteins 0.000 claims abstract description 110
- 239000000178 monomer Substances 0.000 claims abstract description 87
- 238000012163 sequencing technique Methods 0.000 claims abstract description 50
- 230000004568 DNA-binding Effects 0.000 claims abstract description 48
- 238000000034 method Methods 0.000 claims abstract description 46
- 150000007523 nucleic acids Chemical class 0.000 claims description 56
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 claims description 46
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 claims description 46
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 44
- 102000039446 nucleic acids Human genes 0.000 claims description 35
- 108020004707 nucleic acids Proteins 0.000 claims description 35
- 238000006467 substitution reaction Methods 0.000 claims description 33
- 239000012528 membrane Substances 0.000 claims description 23
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 22
- 239000003228 hemolysin Substances 0.000 claims description 20
- 239000000523 sample Substances 0.000 claims description 12
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 7
- 101100545099 Bacillus subtilis (strain 168) yxiH gene Proteins 0.000 claims description 7
- 102220612391 Mitogen-activated protein kinase kinase kinase 1_N17R_mutation Human genes 0.000 claims description 7
- 102220359228 c.35C>G Human genes 0.000 claims description 7
- 102220102504 rs878854150 Human genes 0.000 claims description 7
- 101710092462 Alpha-hemolysin Proteins 0.000 abstract description 173
- 108020004414 DNA Proteins 0.000 abstract description 66
- 102000053602 DNA Human genes 0.000 abstract description 27
- 238000006243 chemical reaction Methods 0.000 abstract description 13
- 238000000429 assembly Methods 0.000 abstract description 12
- 230000000712 assembly Effects 0.000 abstract description 12
- 238000001712 DNA sequencing Methods 0.000 abstract description 4
- 235000018102 proteins Nutrition 0.000 description 195
- 235000001014 amino acid Nutrition 0.000 description 60
- 229940024606 amino acid Drugs 0.000 description 56
- 150000001413 amino acids Chemical class 0.000 description 56
- 125000005647 linker group Chemical group 0.000 description 31
- 230000000694 effects Effects 0.000 description 28
- 239000000203 mixture Substances 0.000 description 27
- 239000011148 porous material Substances 0.000 description 26
- 210000004027 cell Anatomy 0.000 description 25
- 239000002773 nucleotide Substances 0.000 description 23
- 125000003729 nucleotide group Chemical group 0.000 description 23
- 108090000765 processed proteins & peptides Proteins 0.000 description 23
- 102000037865 fusion proteins Human genes 0.000 description 18
- 108020001507 fusion proteins Proteins 0.000 description 18
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 18
- 239000005090 green fluorescent protein Substances 0.000 description 17
- 102000004196 processed proteins & peptides Human genes 0.000 description 17
- 230000027455 binding Effects 0.000 description 15
- 108091006146 Channels Proteins 0.000 description 13
- 239000000499 gel Substances 0.000 description 13
- 230000014509 gene expression Effects 0.000 description 13
- 229920001184 polypeptide Polymers 0.000 description 13
- 238000000746 purification Methods 0.000 description 13
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 12
- 238000010828 elution Methods 0.000 description 11
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 11
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 10
- 101710096438 DNA-binding protein Proteins 0.000 description 9
- 150000003839 salts Chemical class 0.000 description 9
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 108091033380 Coding strand Proteins 0.000 description 7
- 239000013598 vector Substances 0.000 description 7
- 229910017052 cobalt Inorganic materials 0.000 description 6
- 239000010941 cobalt Substances 0.000 description 6
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 description 6
- 239000000463 material Substances 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 230000004481 post-translational protein modification Effects 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 108091081021 Sense strand Proteins 0.000 description 5
- 125000000539 amino acid group Chemical group 0.000 description 5
- 239000013604 expression vector Substances 0.000 description 5
- 238000005194 fractionation Methods 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 238000002156 mixing Methods 0.000 description 5
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 5
- 239000004065 semiconductor Substances 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 108010006464 Hemolysin Proteins Proteins 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 102200068053 rs587777039 Human genes 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- 239000011534 wash buffer Substances 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 108091092195 Intron Proteins 0.000 description 3
- 150000008575 L-amino acids Chemical class 0.000 description 3
- 239000000232 Lipid Bilayer Substances 0.000 description 3
- 241000191967 Staphylococcus aureus Species 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical compound CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000002869 basic local alignment search tool Methods 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 230000002759 chromosomal effect Effects 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000010494 dissociation reaction Methods 0.000 description 3
- 230000005593 dissociations Effects 0.000 description 3
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 3
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical class O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 150000003904 phospholipids Chemical class 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 108020001580 protein domains Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- OYIFNHCXNCRBQI-UHFFFAOYSA-N 2-aminoadipic acid Chemical compound OC(=O)C(N)CCCC(O)=O OYIFNHCXNCRBQI-UHFFFAOYSA-N 0.000 description 2
- RDFMDVXONNIGBC-UHFFFAOYSA-N 2-aminoheptanoic acid Chemical compound CCCCCC(N)C(O)=O RDFMDVXONNIGBC-UHFFFAOYSA-N 0.000 description 2
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical class NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 150000008574 D-amino acids Chemical class 0.000 description 2
- 230000006820 DNA synthesis Effects 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 239000007995 HEPES buffer Substances 0.000 description 2
- JUQLUIFNNFIIKC-YFKPBYRVSA-N L-2-aminopimelic acid Chemical compound OC(=O)[C@@H](N)CCCCC(O)=O JUQLUIFNNFIIKC-YFKPBYRVSA-N 0.000 description 2
- AGPKZVBTJJNPAG-UHNVWZDZSA-N L-allo-Isoleucine Chemical compound CC[C@@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-UHNVWZDZSA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- YPIGGYHFMKJNKV-UHFFFAOYSA-N N-ethylglycine Chemical compound CC[NH2+]CC([O-])=O YPIGGYHFMKJNKV-UHFFFAOYSA-N 0.000 description 2
- 108010065338 N-ethylglycine Proteins 0.000 description 2
- AKCRVYNORCOYQT-YFKPBYRVSA-N N-methyl-L-valine Chemical compound CN[C@@H](C(C)C)C(O)=O AKCRVYNORCOYQT-YFKPBYRVSA-N 0.000 description 2
- KSPIYJQBLVDRRI-UHFFFAOYSA-N N-methylisoleucine Chemical compound CCC(C)C(NC)C(O)=O KSPIYJQBLVDRRI-UHFFFAOYSA-N 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- 108010013381 Porins Proteins 0.000 description 2
- 102000017033 Porins Human genes 0.000 description 2
- 102000002067 Protein Subunits Human genes 0.000 description 2
- 108010001267 Protein Subunits Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 108010077895 Sarcosine Proteins 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- -1 amides) Chemical class 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 238000005341 cation exchange Methods 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- VEVRNHHLCPGNDU-MUGJNUQGSA-O desmosine Chemical compound OC(=O)[C@@H](N)CCCC[N+]1=CC(CC[C@H](N)C(O)=O)=C(CCC[C@H](N)C(O)=O)C(CC[C@H](N)C(O)=O)=C1 VEVRNHHLCPGNDU-MUGJNUQGSA-O 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 230000005669 field effect Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000004255 ion exchange chromatography Methods 0.000 description 2
- RGXCTRIQQODGIZ-UHFFFAOYSA-O isodesmosine Chemical compound OC(=O)C(N)CCCC[N+]1=CC(CCC(N)C(O)=O)=CC(CCC(N)C(O)=O)=C1CCCC(N)C(O)=O RGXCTRIQQODGIZ-UHFFFAOYSA-O 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 229910044991 metal oxide Inorganic materials 0.000 description 2
- 150000004706 metal oxides Chemical class 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 108091005573 modified proteins Proteins 0.000 description 2
- 102000035118 modified proteins Human genes 0.000 description 2
- 102000044158 nucleic acid binding protein Human genes 0.000 description 2
- 108700020942 nucleic acid binding protein Proteins 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000000087 stabilizing effect Effects 0.000 description 2
- YSMODUONRAFBET-WHFBIAKZSA-N threo-5-hydroxy-L-lysine Chemical compound NC[C@@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-WHFBIAKZSA-N 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- BJBUEDPLEOHJGE-UHFFFAOYSA-N (2R,3S)-3-Hydroxy-2-pyrolidinecarboxylic acid Natural products OC1CCNC1C(O)=O BJBUEDPLEOHJGE-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- JHTPBGFVWWSHDL-UHFFFAOYSA-N 1,4-dichloro-2-isothiocyanatobenzene Chemical compound ClC1=CC=C(Cl)C(N=C=S)=C1 JHTPBGFVWWSHDL-UHFFFAOYSA-N 0.000 description 1
- OGNSCSPNOLGXSM-UHFFFAOYSA-N 2,4-diaminobutyric acid Chemical compound NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 1
- GMKMEZVLHJARHF-UHFFFAOYSA-N 2,6-diaminopimelic acid Chemical compound OC(=O)C(N)CCCC(N)C(O)=O GMKMEZVLHJARHF-UHFFFAOYSA-N 0.000 description 1
- XABCFXXGZPWJQP-UHFFFAOYSA-N 3-aminoadipic acid Chemical compound OC(=O)CC(N)CCC(O)=O XABCFXXGZPWJQP-UHFFFAOYSA-N 0.000 description 1
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 241000403668 Actinomyces virus Av1 Species 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- 108010011170 Ala-Trp-Arg-His-Pro-Gln-Phe-Gly-Gly Proteins 0.000 description 1
- 241001136792 Alle Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- JBRZTFJDHDCESZ-UHFFFAOYSA-N AsGa Chemical compound [As]#[Ga] JBRZTFJDHDCESZ-UHFFFAOYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 241000701844 Bacillus virus phi29 Species 0.000 description 1
- 241000095992 Clostridium phage phiCPV4 Species 0.000 description 1
- 108050006400 Cyclin Proteins 0.000 description 1
- 108010076804 DNA Restriction Enzymes Proteins 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 229910001218 Gallium arsenide Inorganic materials 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Chemical group OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- LCWXJXMHJVIJFK-UHFFFAOYSA-N Hydroxylysine Natural products NCC(O)CC(N)CC(O)=O LCWXJXMHJVIJFK-UHFFFAOYSA-N 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical group OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- 235000019687 Lamb Nutrition 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- PQNASZJZHFPQLE-LURJTMIESA-N N(6)-methyl-L-lysine Chemical compound CNCCCC[C@H](N)C(O)=O PQNASZJZHFPQLE-LURJTMIESA-N 0.000 description 1
- OLNLSTNFRUFTLM-BYPYZUCNSA-N N-ethyl-L-asparagine Chemical compound CCN[C@H](C(O)=O)CC(N)=O OLNLSTNFRUFTLM-BYPYZUCNSA-N 0.000 description 1
- OLNLSTNFRUFTLM-UHFFFAOYSA-N N-ethylasparagine Chemical compound CCNC(C(O)=O)CC(N)=O OLNLSTNFRUFTLM-UHFFFAOYSA-N 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 108700028353 OmpC Proteins 0.000 description 1
- 108700006385 OmpF Proteins 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 101710129178 Outer plastidial membrane protein porin Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 102100036691 Proliferating cell nuclear antigen Human genes 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 230000006819 RNA synthesis Effects 0.000 description 1
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 1
- 101000844743 Saccharolobus shibatae DNA-binding protein 7a Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 244000191761 Sida cordifolia Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 102100026940 Small ubiquitin-related modifier 1 Human genes 0.000 description 1
- 101710081623 Small ubiquitin-related modifier 1 Proteins 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 241000205098 Sulfolobus acidocaldarius Species 0.000 description 1
- 101500011469 Sulfolobus acidocaldarius (strain ATCC 33909 / DSM 639 / JCM 8929 / NBRC 15157 / NCIMB 11770) DNA-binding protein 7a Proteins 0.000 description 1
- 101500011470 Sulfolobus acidocaldarius (strain ATCC 33909 / DSM 639 / JCM 8929 / NBRC 15157 / NCIMB 11770) DNA-binding protein 7b Proteins 0.000 description 1
- 101000844753 Sulfolobus acidocaldarius (strain ATCC 33909 / DSM 639 / JCM 8929 / NBRC 15157 / NCIMB 11770) DNA-binding protein 7d Proteins 0.000 description 1
- 101000844750 Sulfolobus acidocaldarius (strain ATCC 33909 / DSM 639 / JCM 8929 / NBRC 15157 / NCIMB 11770) DNA-binding protein 7e Proteins 0.000 description 1
- 241000205095 Sulfolobus shibatae Species 0.000 description 1
- 241000205091 Sulfolobus solfataricus Species 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical group O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 1
- 101710183280 Topoisomerase Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical group O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 102100037820 Voltage-dependent anion-selective channel protein 1 Human genes 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 125000002355 alkine group Chemical group 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 150000003862 amino acid derivatives Chemical class 0.000 description 1
- 229960002684 aminocaproic acid Drugs 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229940000635 beta-alanine Drugs 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 238000006352 cycloaddition reaction Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 1
- 229960003964 deoxycholic acid Drugs 0.000 description 1
- KXGVEGMKQFWNSR-UHFFFAOYSA-N deoxycholic acid Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 KXGVEGMKQFWNSR-UHFFFAOYSA-N 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Chemical group 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- QJHBJHUKURJDLG-UHFFFAOYSA-N hydroxy-L-lysine Natural products NCCCCC(NO)C(O)=O QJHBJHUKURJDLG-UHFFFAOYSA-N 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000026045 iodination Effects 0.000 description 1
- 238000006192 iodination reaction Methods 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 108040007791 maltose transporting porin activity proteins Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 238000006011 modification reaction Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 125000004043 oxo group Chemical group O=* 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 125000002743 phosphorus functional group Chemical group 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000013823 prenylation Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 102220042668 rs116429842 Human genes 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000001447 template-directed synthesis Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 125000000341 threoninyl group Chemical class [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- BJBUEDPLEOHJGE-IMJSIDKUSA-N trans-3-hydroxy-L-proline Chemical compound O[C@H]1CC[NH2+][C@@H]1C([O-])=O BJBUEDPLEOHJGE-IMJSIDKUSA-N 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 125000004417 unsaturated alkyl group Chemical group 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/483—Physical analysis of biological material
- G01N33/487—Physical analysis of biological material of liquid biological material
- G01N33/48707—Physical analysis of biological material of liquid biological material by electrical means
- G01N33/48721—Investigating individual macromolecules, e.g. by translocation through nanopores
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/305—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Micrococcaceae (F)
- C07K14/31—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Micrococcaceae (F) from Staphylococcus (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/80—Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/195—Assays involving biological materials from specific organisms or of a specific nature from bacteria
- G01N2333/305—Assays involving biological materials from specific organisms or of a specific nature from bacteria from Micrococcaceae (F)
- G01N2333/31—Assays involving biological materials from specific organisms or of a specific nature from bacteria from Micrococcaceae (F) from Staphylococcus (G)
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N27/00—Investigating or analysing materials by the use of electric, electrochemical, or magnetic means
- G01N27/26—Investigating or analysing materials by the use of electric, electrochemical, or magnetic means by investigating electrochemical variables; by using electrolysis or electrophoresis
- G01N27/416—Systems
- G01N27/447—Systems using electrophoresis
- G01N27/44756—Apparatus specially adapted therefor
- G01N27/44791—Microapparatus
Definitions
- the present disclosure relates generally to methods and compositions for nanopore-based nucleotide sequencing, and more particularly to nanopore protein monomers that are conjugated to DNA binding proteins to form nanopore protein conjugates. Also provided are nanopore protein assemblies that are configured from the nanopore protein conjugates.
- the processivity of a DNA polymerase i.e., the ability of a polymerase to remain bound to the template or substrate and perform DNA synthesis, is critical to the function of nanopore-based sequencing reactions.
- sequencing activity of the nanopore assembly ceases, thereby slowing and disrupting the sequencing reaction until the DNA polymerase can re-bind the template strand.
- the DNA polymerase may not re-bind the template strand, in which case the sequencing reaction for the dissociated template strand remains incomplete.
- the dissociated template DNA strand may migrate away from the polymerase and the nanopore assembly, thus preventing the polymerase from re-binding the template strand.
- nanopore protein conjugates that include a nanopore protein monomer and a DNA binding domain of a DNA binding protein.
- the nanopore protein monomer includes, for example, an a-hemolysin (a-HL) domain or variant thereof, while the DNA binding domain includes, for example, an Sso7d domain or Sso7d-like domain.
- a-HL domain includes an amino acid sequence at least 75% identical to the amino acid sequence set forth as SEQ ID NO: 1
- the Sso7d domain includes an amino acid sequence having at least 75% sequence identity to the amino acid sequence set forth as SEQ ID NO: 2.
- the a-HL domain is a variant domain, and includes a substitution at a position corresponding to position 1, 2, 3, 4, 9, 12, 17,
- the substitution including one or more positive charges.
- the substitution may be an HI 44 A, T12K, T12R, N17K, or N17R substitution.
- the a-HL domain includes a sequence having at least 80%, 90%, 95%, 98%), or more sequence identity to SEQ ID NO: 4.
- the nanopore protein conjugate includes an amino acid sequence having at least 80%>, 90%>, 95%, 98%, or more sequence identity to SEQ ID NO: 5.
- a heptameric nanopore assembly that includes at least one of the nanopore protein conjugates described herein, such as a protein conjugate having an amino acid set forth as SEQ ID NO: 5.
- a DNA-manipulating or modifying enzyme such as a DNA polymerase, is joined to the nanopore monomer of the heptameric nanopore assembly, such as via a SpyTag/SpyCatcher linkage.
- a nanopore assembly system for nucleic acid sequencing.
- the system includes a nanopore assembly including a plurality of oligomerized nanopore protein monomers.
- the nanopore assembly is disposed within a membrane.
- a first monomer of the plurality of monomers for example, is a protein conjugate including a DNA binding domain.
- the DNA binding domain is joined to the first monomer of the nanopore assembly, such as via a covalent linkage.
- a second of the plurality of monomers of the nanopore assembly for example, is joined to a DNA polymerase.
- a sensing electrode is positioned adjacent to or in proximity to the membrane.
- each of the plurality of nanopore protein monomers of the nanopore assembly system is an a-hemolysin monomer, thereby forming a heptameric assembly.
- the a-HL monomer is at least 75% identical to the amino acid sequence set forth as SEQ ID NO: 1
- the DNA binding domain of the protein conjugate comprises an Sso7d domain having the a sequence that is at least 75% identical to the amino acid sequence set forth as SEQ ID NO:2.
- the protein conjugate of the first monomer is at least 75% identical to the amino acid sequence set forth as SEQ ID NO:5.
- the DNA binding domain of the protein conjugate comprises an Sso7d domain having the a sequence that is at least 75% identical to the amino acid sequence set forth as SEQ ID NO:2.
- the protein conjugate of the first monomer is at least 75% identical to the amino acid sequence set forth as SEQ ID NO:5.
- the DNA binding domain of the protein conjugate comprises an Sso7d domain having the a sequence that is at least 75% identical to the amino acid sequence set forth as SEQ ID NO:2.
- DNA polymerase is joined to the second monomer via SpyTag/SpyCatcher linkage.
- a method for detecting a target molecule includes providing a chip that includes a nanopore as described herein, the nanopore being is disposed within a membrane. A sensing electrode is positioned adjacent or in proximity to the membrane. The nanopore is then contacted with a nucleic acid molecule, the nucleic acid molecule being associated with a reporter molecule having an address region and a probe region. The reporter molecule is associated with the nucleic acid molecule at the probe region and the reporter molecule is coupled to a target molecule.
- the method further includes sequencing the address region while the nucleic acid molecule is in contact with the nanopore to determine a nucleic acid sequence of said address region.
- the method also includes identifying, with the aid of a computer processor, the target molecule based upon a nucleic acid sequence of the sequenced address region.
- Figure 1 is an image of an SDS-PAGE gel showing purification of an a- HL/Sso7d protein conjugate, in accordance with certain example embodiments.
- Serial elution fractions imaged using a Bio-RadTM stain-free gel system are shown. More particularly, lane 1 shows molecular weight markers; lane 2 show the lysate; lane 3 shows the pellet; lane 4 shows the supernatant; lane 5 shows Talon FT (the affinity resin); lane 6 shows Elution of 5 ⁇ ; lane 7 shows Elution of 10 ⁇ ; and, lane 8 shows Elution of 15 ⁇ .
- the purified a-HL/Sso7d conjugate protein is shown at around the expected 45 kD m.w. in lanes 6, 7, and 8 (arrow).
- Figure 2A is an image of an SDS-PAGE gel showing the identification of heptamers having a-HL/SpyTag and a-HL/Sso7d monomers, in accordance with certain example embodiments.
- the gel was imaged using a Bio-RadTM stain-free gel system. Serai elution fractions are shown.
- lane 1 shows molecular weight markers
- lane 2 shows Spycatcher-GFP alone
- lane 3 shows Spycatcher-GFP + monomeric a-HL
- lane 4 shows Spycatcher-GFP + a-HL nanopore that does not contain a monomer-subunit with spytag
- lane 5 shows Spycatcher-GFP + a-HL nanopore with a single monomer-subunit conjugated to a spytag
- lane 6 shows Spycatcher-GFP + a-HL nanopore with one to two monomer- subunits conjugated to a spytag
- lane 7 shows Spycatcher-GFP + a-HL nanopore with one to three monomers-subunits conjugated to a spytag
- lane 8 shows Spycatcher-GFP + low levels of a-HL nanopore with two or more monomer- subunits conjugated to a spytag
- lane 9 shows Spycatcher-GFP + low levels of a- HL nanopore with two or more monomer-subunits conjug
- the elution fraction shown in lane 5 was determined to have a 1 :6 a-HL/SpyTag:a-HL/Sso7d ratio.
- Figure 2B is an image of the SDS-PAGE gel of Figure 2A, but viewed with a fluorescence filter to review GFP (green fluoresce protein) fluorescence, in accordance with certain example embodiments. More particularly, binding of SpyCatcher-GFP to the a-HL/SpyTag of the heptamers from the various elution fractions reveals the presence of the ⁇ -HL/SpyTag, such as in lanes 5, 6, and 7. Notably, no a-HL/SpyTag is present in the fraction of lane 4, as this heptamer, having the furthest migration, is expected to be devoid of a-HL/SpyTag.
- GFP green fluoresce protein
- the heptamer contains a-HL/SpyTag:a-HL/Sso7d at a ratio of 0:7 (i.e., no a-HL/SpyTag).
- lane 5 contains the fraction that migrated the furthest and that displays fluorescence, thus indicating the presence of the 1 :6 a-HL/SpyTag:a-HL/Sso7d heptamer.
- Figure 3A and 3B are graphs showing analysis of control a-HL nanopores and nanopores having a 1 :6 a-HL/SpyTag:a-HL/Sso7d ratio, in accordance with certain example embodiments.
- Figure 3A shows the difference between when the polymerase ceased sequencing activity and when the pore ceased its activity for control a-HL nanopores.
- Figure 3B shows the difference between when the polymerase ceased sequencing activity and when the pore ceased its activity for the nanopores having the 1 :6 a-HL/SpyTag:a-HL/Sso7d ratio.
- Figure 3A shows the difference between when the polymerase ceased sequencing activity and when the pore ceased channel activity.
- Figure 4 A and 4B are graphs showing sequencing end times for control and control a-HL nanopores and nanopores having a 1 :6 a-HL/SpyTag:a-HL/Sso7d ratio, in accordance with certain example embodiments. More particularly, Figure 4A shows sequencing end time, i.e., the amount of time the polymerase of the nanopore actively sequences a template, for control a-HL nanopores. Figure 4B shows sequencing end time for nanopores having a 1 :6 a-HL/SpyTag:a-HL/Sso7d ratio. As can be seen by comparing Figure 4A with
- compositions for improving DNA polymerase processivity during nanopore-based DNA sequencing are provided.
- the compositions include a nanopore protein conjugate, such as a fusion protein, having a DNA binding protein that is linked to a monomer of a nanopore assembly. Tethered to another monomer of the nanopore assembly is a DNA-manipulating or modifying enzyme, such as a DNA polymerase.
- a DNA-manipulating or modifying enzyme such as a DNA polymerase.
- the DNA polymerase for example, is held to the assembly via the tether while the DNA binding domain of the nanopore protein conjugate is available to interact with a DNA template strand.
- the interaction of the DNA binding domain with the template DNA strand improves the polymerase processivity. That is, as the tethered DNA polymerase processes a template DNA strand, it is believed that the DNA binding domain linked to the nanopore assembly monomer binds the template DNA strand, thereby keeping the template DNA strand in close proximity to the nanopore assembly and hence near the tethered DNA polymerase during sequencing.
- the template DNA strand dissociates from the DNA polymerase, it is believed that the close proximity of the template DNA strand to nanopore assembly allows the polymerase to re -bind the template strand, thus permitting the DNA polymerase to continue its sequencing activity. In other words, the interaction of the DNA binding domain with the DNA template strand at the nanopore assembly is believed to maintain the DNA template strand in high local concentration near the DNA polymerase so that the effects of DNA polymerase dissociation from the template strand is minimized.
- the nanopore assembly domain of the nanopore conjugate protein is an alpha-hemolysin (a-HL) monomer or variant thereof, thus forming an a-HL/DNA-binding conjugate protein.
- the DNA binding domain for example, is available to bind template DNA strands as described herein.
- the a-HL monomer domain of the conjugate protein is available to oligomerize with other a-hemolysin monomers, including additional a-HL/DNA-binding conjugate proteins, to form a multi-subunit nanopore.
- the nanopore may be a heptamer that includes six ⁇ -HL/DNA-binding conjugate proteins and one a-HL monomer that is used to attach the DNA polymerase to the monomer (and hence to the nanopore).
- the nanopore heptamer includes seven oligomerized a-HL monomers, six of which include a DNA binding domain (and hence are capable of binding a DNA template strand) and one of which that is tethered to the DNA polymerase in the nanopore assembly.
- the DNA binding domain that is joined to a monomer of the nanopore assembly to form the ⁇ -HL/DNA-binding conjugate protein is an Sso7d protein or fragment thereof.
- the Sso7d protein or fragment thereof can be linked to an a-HL monomer to form an a-HL/Sso7d fusion protein.
- the Sso7d protein binds to double-stranded DNA without marked sequence preference. Such lack of sequence preference is advantageous, for example, for use with the nanopore-based sequencing methods described herein because the sequence of the DNA undergoing sequencing is usually unknown.
- the time between when the polymerase stops processing a template DNA strand and when the nanopore ceases its activity can be significantly reduced.
- the processivity the tethered DNA polymerase can be advantageously increased during nanopore-based sequencing.
- Such increases in processivity may be useful, for example, when carrying out nanopore-based sequencing in higher salt conditions.
- the methods and compositions described herein may be used to maintain a high level of polymerase processivity in higher salt concentrations, thereby allowing more accurate signal detection (due to the higher salt levels).
- polymerase processivity is not sacrificed as the expense of better signal detection across the nanopore.
- amino acid sequences are written left to right in amino to carboxy orientation, respectively.
- Ranges can be expressed herein as from “about” one particular value, and/or to "about” another particular value. When such a range is expressed, another aspect includes from the one particular value of the range and/or to the other particular value of the range. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. Similarly, when values are expressed as approximations, by use of the antecedent "about,” it will be understood that the particular value forms another aspect. In certain example embodiments, the term “about” is understood as within a range of normal tolerance in the art, for example within 2 standard deviations of the mean.
- alpha-hemolysin As used herein, "alpha-hemolysin,” “a-hemolysin,” “a -HL,” “a-HL,” and “hemolysin” are used interchangeably and refer to the monomeric protein that self-assembles into a heptameric water-filled transmembrane channel (i.e., nanopore). Depending on context, the term may also refer to the transmembrane channel formed by seven monomeric proteins.
- the alpha-hemolysin is a "modified alpha-hemolysin," meaning that alpha- hemolysin originated from another (i.e., parental) alpha-hemolysin and contains one or more amino acid alterations (e.g., amino acid substitution, deletion, or insertion) compared to the parental alpha-hemolysin.
- a modified alpha-hemolysin of the invention is originated or modified from a naturally-occurring or wild-type alpha-hemolysin.
- a modified alpha-hemolysin is originated or modified from a recombinant or engineered alpha-hemolysin including, but not limited to, chimeric alpha- hemolysin, fusion alpha-hemolysin or another modified alpha-hemolysin.
- a modified alpha-hemolysin has at least one changed phenotype compared to the parental alpha-hemolysin.
- the alpha-hemolysin arises from a "variant hemolysin gene" or is a “variant hemolysin,” which means, respectively, that the nucleic acid sequence of the alpha- hemolysin gene from Staphylococcus aureus has been altered by removing, adding, and/or manipulating the coding sequence or the amino acid sequence of the expressed protein has been modified consistent with the invention described herein.
- the term "amino acid,” in its broadest sense refers to any compound and/or substance that can be incorporated into a polypeptide chain.
- an amino acid has the general structure H 2 N— C(H)(R)— COOH.
- an amino acid is a naturally-occurring amino acid.
- an amino acid is a synthetic amino acid; in some embodiments, an amino acid is a D-amino acid; in some embodiments, an amino acid is an L-amino acid.
- Standard amino acid refers to any of the twenty standard L-amino acids commonly found in naturally occurring peptides.
- Nonstandard amino acid refers to any amino acid, other than the standard amino acids, regardless of whether it is prepared synthetically or obtained from a natural source.
- a "synthetic amino acid” or “non-natural amino acid” encompasses chemically modified amino acids, including but not limited to salts, amino acid derivatives (such as amides), and/or substitutions.
- Amino acids including carboxy- and/or amino-terminal amino acids in peptides, can be modified by methylation, amidation, acetylation, and/or substitution with other chemical without adversely affecting their activity. Amino acids may participate in a disulfide bond.
- amino acid is used interchangeably with "amino acid residue,” and may refer to a free amino acid and/or to an amino acid residue of a peptide. It will be apparent from the context in which the term is used whether it refers to a free amino acid or a residue of a peptide. It should be noted that all amino acid residue sequences are represented herein by formulae whose left and right orientation is in the conventional direction of amino -terminus to carboxy- terminus.
- the term "complementary” refers to the broad concept of sequence complementarity between regions of two polynucleotide strands or between two nucleotides through base-pairing. It is known that an adenine nucleotide is capable of forming specific hydrogen bonds ("base pairing") with a nucleotide which is thymine or uracil. Similarly, it is known that a cytosine nucleotide is capable of base pairing with a guanine nucleotide.
- a base pair (bp) refers to a partnership of adenine (A) with thymine (T), or of cytosine (C) with guanine (G) in a double stranded DNA molecule.
- cellular expression or "cellular gene expression” generally refer to the cellular processes by which a biologically active polypeptide is produced from a DNA sequence and exhibits a biological activity in a cell.
- gene expression involves the processes of transcription and translation, but can also involve post-transcriptional and post-translational processes that can influence a biological activity of a gene or gene product. These processes include, for example, RNA synthesis, processing, and transport, as well as polypeptide synthesis, transport, and post-translational modification of polypeptides. Additionally, processes that affect protein-protein interactions within the cell can also affect gene expression as defined herein.
- conjugate refers to the product of coupling or joining of two or more materials, the resulting product having at least two distinct elements, such as at least two domains.
- the coupled materials may be the same or may be different. Such a coupling may be via one or more linking groups.
- a "protein conjugate,” for example, results from the coupling of two or more amino acid sequences.
- a conjugate of two proteins, for example, results in a single protein that has a domain corresponding to each of the individually joined proteins.
- DNA refers to a molecule comprising at least one deoxyribonucleotide residue.
- a "deoxyribonucleotide,” is a nucleotide without a hydroxyl group and instead a hydrogen at the 2' position of a ⁇ -D- deoxyribofuranose moiety.
- the term encompasses double stranded DNA, single stranded DNA, DNAs with both double stranded and single stranded regions, isolated DNA such as partially purified DNA, essentially pure DNA, synthetic DNA, recombinantly produced DNA, as well as altered DNA, or analog DNA, that differs from naturally occurring DNA by the addition, deletion, substitution, and/or modification of one or more nucleotides.
- DNA binding domain refers to the region of a protein that bind DNA molecule, such as a DNA template strand.
- DNA molecule such as a DNA template strand.
- Sso7d polypeptide when conjugated to a nanopore monomer protein as described herein, constitutes a DNA binding domain of the protein conjugate.
- domain refers to a unit of a protein or protein complex, comprising a polypeptide subsequence, a complete polypeptide sequence, or a plurality of polypeptide sequences where that unit has a defined function.
- the function is understood to be broadly defined and can be ligand binding, catalytic activity or can have a stabilizing effect on the structure of the protein.
- An "expression cassette” or “expression vector” is a nucleic acid construct generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a target cell.
- the recombinant expression cassette can be incorporated into a plasmid, chromosome, mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment.
- the recombinant expression cassette portion of an expression vector includes, among other sequences, a nucleic acid sequence to be transcribed and a promoter.
- a "gene" includes a coding strand and a non-coding strand.
- coding strand and “sense strand” are used interchangeably, and refer to a nucleic acid sequence that has the same sequence of nucleotides as an m NA from which the gene product is translated.
- the coding/sense strand includes thymidine residues instead of the uridine residues found in the corresponding mRNA.
- the coding/sense strand can also include additional elements not found in the mRNA including, but not limited to promoters, enhancers, and introns.
- template strand As used interchangeably and refer to a nucleic acid sequence that is complementary to the coding/sense strand.
- a "heterologous" nucleic acid construct or sequence has a portion of the sequence which is not native to the cell in which it is expressed.
- Heterologous with respect to a control sequence refers to a control sequence (i.e. promoter or enhancer) that does not function in nature to regulate the same gene the expression of which it is currently regulating.
- heterologous nucleic acid sequences are not endogenous to the cell or part of the genome in which they are present, and have been added to the cell, by infection, transfection, transformation, microinjection, electroporation, or the like.
- a “heterologous” nucleic acid construct may contain a control sequence/DNA coding sequence combination that is the same as, or different from a control sequence/DNA coding sequence combination found in the native cell.
- host cell it is meant a cell that contains a vector and supports the replication, and/or transcription or transcription and translation (expression) of the expression construct.
- Host cells can be prokaryotic cells, such as E. coli or Bacillus subtilus, or eukaryotic cells such as yeast, plant, insect, amphibian, or mammalian cells. In general, host cells are prokaryotic, e.g., E. coli.
- An "isolated" molecule is a nucleic acid molecule that is separated from at least one other molecule with which it is ordinarily associated, for example, in its natural environment.
- An isolated nucleic acid molecule includes a nucleic acid molecule contained in cells that ordinarily express the nucleic acid molecule, but the nucleic acid molecule is present extrachromasomally or at a chromosomal location that is different from its natural chromosomal location.
- join refers to any method known in the art for functionally connecting proteins and/or protein domains.
- one protein domain may be linked to another protein domain via a covalent bond, such as in a recombinant fusion protein, with or without intervening sequences or domains.
- Example covalent linkages may be formed, for example, through SpyCatcher/SpyTag interactions, cysteine-maleimide conjugation, or azide-alkyne click chemistry, as well as other means known in the art.
- label refers to a detectable compound or composition that is conjugated or coupled directly or indirectly to another molecule to facilitate detection of that molecule.
- Specific, non-limiting examples of labels include fluorescent tags, chemiluminescent tags, haptens, enzymatic linkages, and radioactive isotopes.
- a label includes, for example, a moiety via which an oligonucleotide can be detected or purified.
- mutation refers to a change introduced into a parental sequence, including, but not limited to, substitutions, insertions, deletions (including truncations).
- the consequences of a mutation include, but are not limited to, the creation of a new character, property, function, phenotype or trait not found in the protein encoded by the parental sequence.
- a mutation in a DNA sequence may lead to a change in the amino acid sequence of the protein resulting from transcription/translation of the DNA sequence.
- nanopore generally refers to a pore, channel, or passage formed or otherwise provided in a membrane.
- a membrane may be an organic membrane, such as a lipid bilayer, or a synthetic membrane, such as a membrane formed of a polymeric material.
- the membrane may be a polymeric material.
- the nanopore may be disposed adjacent or in proximity to a sensing circuit or an electrode coupled to a sensing circuit, such as, for example, a complementary metal-oxide semiconductor (CMOS) or field effect transistor (FET) circuit.
- CMOS complementary metal-oxide semiconductor
- FET field effect transistor
- a nanopore has a characteristic width or diameter on the order of 0.1 nanometers (nm) to about lOOOnm.
- Some nanopores are proteins.
- nucleic acid molecule includes R A, DNA and cDNA molecules. It will be understood that, as a result of the degeneracy of the genetic code, a multitude of nucleotide sequences encoding a given protein such as alpha- hemolysin and/or variants thereof may be produced. The present invention contemplates every possible variant nucleotide sequence, encoding variant alpha- hemolysin, all of which are possible given the degeneracy of the genetic code.
- nucleotide is used herein as recognized in the art to include natural bases (standard), and modified bases well known in the art. Such bases are generally located at the position of a nucleotide sugar moiety. Nucleotides generally comprise a base, sugar, and a phosphate group.
- phospholipid refers to a hydrophobic molecule comprising at least one phosphorus group.
- a phospholipid can comprise a phosphorus-containing group and saturated or unsaturated alkyl group, optionally substituted with OH, COOH, oxo, amine, or substituted or unsubstituted aryl groups.
- a "polymerase” refers to an enzyme that performs template-directed synthesis of polynucleotides.
- the term, as used herein, also refers to a domain of the polymerase that has catalytic activity. Generally, the enzyme will initiate synthesis at the 3 '-end of the primer annealed to a polynucleotide template sequence, and will proceed toward the 5' end of the template strand.
- a "DNA polymerase” catalyzes the polymerization of deoxynucleotides.
- the term “processivity” refers to the ability of a nucleic acid modifying enzyme to remain bound to the template or substrate and perform multiple modification reactions. Processivity is generally measured by the number of catalytic events that take place per binding event.
- the term “promoter” refers to a nucleic acid sequence that functions to direct transcription of a downstream gene. The promoter will generally be appropriate to the host cell in which the target gene is being expressed. The promoter together with other transcriptional and translational regulatory nucleic acid sequences (also termed “control sequences”) are necessary to express a given gene. In general, the transcriptional and translational regulatory sequences include, but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences.
- purified means that a molecule is present in a sample at a concentration of at least 95% by weight, or at least 98% by weight of the sample in which it is contained.
- purifying generally refers to subjecting transgenic nucleic acid or protein containing cells to biochemical purification and/or column chromatography.
- purified does not require absolute purity. Rather, this term is intended as a relative term.
- a purified or “substantially pure” protein preparation is one in which the protein referred to is more pure than the protein in its natural environment within a cell or within a production reaction chamber (as appropriate).
- sequence identity refers to the similarity between two nucleic acid sequences, or two amino acid sequences, and is expressed in terms of the similarity between the sequences, otherwise referred to as sequence identity. Sequence identity is frequently measured in terms of percentage identity (or similarity or homology); the higher the percentage, the more similar the two sequences are. For example, 80%> homology means the same thing as 80%> sequence identity determined by a defined algorithm, and accordingly a homologue of a given sequence has greater than 80%> sequence identity over a length of the given sequence.
- Example levels of sequence identity include, for example, 80, 85,
- sequence identity to a given sequence, e.g., the coding sequence for any one of the inventive polypeptides, as described herein.
- NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al. J. Mol. Biol. 215:403-410, 1990) is available from several sources, including the National Center for Biotechnology Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with the sequence analysis programs that include, for example, the suite of BLAST programs, such as BLASTN, BLASTX, and TBLASTX, BLASTP and TBLASTN.
- Sequence searches are typically carried out using the BLASTN program when evaluating a given nucleic acid sequence relative to nucleic acid sequences in the GenBank DNA Sequences and other public databases.
- the BLASTX program is preferred for searching nucleic acid sequences that have been translated in all reading frames against amino acid sequences in the GenBank Protein Sequences and other public databases. Both BLASTN and BLASTX are run using default parameters of an open gap penalty of 11.0, and an extended gap penalty of 1.0, and utilize the BLOSUM-62 matrix. (See, e.g., Altschul, S. F., et al, Nucleic Acids Res.
- a preferred alignment of selected sequences in order to determine "% identity" between two or more sequences is performed using for example, the CLUSTAL-W program in MacVector version 13.0.7, operated with default parameters, including an open gap penalty of 10.0, an extended gap penalty of 0.1, and a BLOSUM 30 similarity matrix.
- “significance” or "significant” relates to a statistical analysis of the probability that there is a non-random association between two or more entities.
- a relationship is "significant” or has “significance”
- statistical manipulations of the data can be performed to calculate a probability, expressed as a "p-value.” Those p-values that fall below a user-defined cutoff point are regarded as significant. In one example, a p-value less than or equal to 0.05, in another example less than 0.01, in another example less than 0.005, and in yet another example less than 0.001, are regarded as significant.
- the term "tag” refers to a detectable moiety that may be atoms or molecules, or a collection of atoms or molecules.
- a tag may provide an optical, electrochemical, magnetic, or electrostatic (e.g., inductive, capacitive) signature, which signature may be detected with the aid of a nanopore.
- a nucleotide is attached to the tag it is called a "Tagged Nucleotide.”
- the tag may be attached to the nucleotide via the phosphate moiety.
- time to thread means the time it takes the polymerase-tag complex or a nucleic acid strand to thread the tag into the barrel of the nanopore.
- the term “variant” refers to a modified protein which displays altered characteristics when compared to the parental protein, e.g., altered ionic conductance.
- the term “vector” refers to a nucleic acid construct designed for transfer between different host cells.
- An "expression vector” refers to a vector that has the ability to incorporate and express heterologous DNA fragments in a foreign cell. Many prokaryotic and eukaryotic expression vectors are commercially available. Selection of appropriate expression vectors is within the knowledge of those having skill in the art.
- wild-type refers to a gene or gene product which has the characteristics of that gene or gene product when isolated from a naturally-occurring source.
- Thrl7Arg+Glu34Ser or T17R+E34S representing mutations in positions 30 and 34 substituting alanine and glutamic acid for asparagine and serine, respectively.
- T17R/K or T17R or T17K.
- Nanopore Protein Conjugates Provided herein are compositions that include nanopore protein conjugates.
- the conjugates include a nanopore protein monomer that is joined to a DNA binding domain of a DNA binding protein.
- the resultant nanopore protein conjugate includes a nanopore protein monomer domain and a DNA binding domain.
- Such protein conjugates can be used, for example, to form nanopore pore assemblies having improved sequencing yield and nanopore lifetime as described herein.
- the nanopore protein monomer of the nanopore protein conjugate may include any nanopore protein that, when combined with other proteins - and when positioned in a substrate, such as a membrane - allows the passage of a molecule through the substrate.
- the nanopore may allow passage of a molecule that would otherwise not be able to pass through that substrate.
- Examples of nanopores include proteinaceous or protein based pores or synthetic pores.
- a nanopore may have an inner diameter of 1-10 nm or 1-5 nm or 1-3 nm.
- protein pores include for example, alpha- homolysin, voltage-dependent mitochondrial porin (VDAC), OmpF, OmpC, MspA and LamB (maltoporin) ⁇ see (Rhee, M. et al., Trends in Biotechnology, 25(4) (2007): 174-181).
- VDAC voltage-dependent mitochondrial porin
- OmpF OmpF
- OmpC OmpC
- MspA MspA
- LamB maltoporin
- the pore protein may be a modified protein, such as a modified natural protein or synthetic protein.
- the DNA binding domain of the nanopore protein conjugate can include any DNA binding domain that binds a DNA, such as a double-stranded DNA template strand.
- the DNA binding domain is sequence non-specific. That is, the DNA binding domain of the nanopore protein conjugate can bind a variety of different DNA sequences, such as template DNA strands with different nucleotide sequences, without binding specificity for the strand to which the DNA binding domain interacts. As such, the DNA binding domain binds to double-stranded nucleic acid in a sequence-independent manner, such that binding does not exhibit a gross preference for a particular nucleotide sequence.
- double-stranded nucleic acid binding proteins exhibit a 10- fold or higher affinity for double-stranded versus single-stranded nucleic acids.
- the double-stranded nucleic acid binding proteins in certain example embodiments are preferably thermostable.
- examples of such proteins include, but are not limited to, the Archaeal small basic DNA binding protein Sso7d (discussed below; see, e.g., Choli et al., Biochimica et Biophysica Acta 950: 193-203, 1988; Baumann et al., Structural Biol. 1 :808-819, 1994; and Gao et al, Nature Struc. Biol. 5:782-786,
- the nanopore protein monomer and/or the DNA binding protein of the conjugate protein may include one or more post- translational modifications.
- modification may include, for example, phosphate (phosphorylation), carbohydrate (glycosylation), ADP-ribosyl (ADP ribosylation), fatty acid (prenylation, which includes but is not limited to: myristoylation and palmitylation), ubiquitin (ubiquitination) and sentrin (sentrinization; a ubiquitination-like protein modification).
- post-translational modification include methylation, actylation, hydroxylation, iodination and flavin linkage.
- the amino acids forming all or a part of nanopore protein conjugate may be stereoisomers. Additionally or alternatively, the amino acids forming all or a part of the nanopore protein conjugate described herein may be modifications of naturally occurring amino acids, non-naturally occurring amino acids, post-translationally modified amino acids, enzymatically synthesized amino acids, derivatized amino acids, constructs or structures designed to mimic amino acids, and the like.
- the amino acids forming the peptides of the present invention may be one or more of the 20 common amino acids found in naturally occurring proteins, or one or more of the modified and unusual amino acids. In certain example embodiments, the amino acids may be D- or L- amino acids.
- the amino acid sequence of the conjugate protein may also include one or more modified and/or unusual amino acid.
- modified and unusual amino acids include but are not limited to, 2-Aminoadipic acid (Aad), 3-Aminoadipic acid (Baad), ⁇ -Amino-propionic acid (Bala, ⁇ -alanine), 2-Aminobutyric acid (Abu, piperidinic acid), 4-Aminobutyric acid (4Abu), 6-Aminocaproic acid (Acp), 2-Aminoheptanoic acid (Ahe), 2- Aminoisobutyric acid (Aib), 3-Aminoisobutyric acid (Baib), 2-Aminopimelic acid
- the nanopore protein conjugate includes a linker sequence that links the nanopore protein monomer domain the
- the linker may covalently join the nanopore protein monomer domain to the DNA binding domain.
- the linker may include any number of amino acids that join the nanopore protein monomer domain to the DNA binding domain, while sill preserving the independent function of the two domains. That is, the linker will not interfere with the ability of the nanopore protein monomer to oligomerize with other nanopore protein monomer domain to form a pore.
- the linker sequence will not interfere the ability of the DNA binding domain of the nanopore protein conjugate to bind DNA.
- the linker sequence may include, for example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 amino acids. In certain example embodiments, the linker is less than about 10 amino acids, such as 1-5 amino acids. In certain example embodiments, the linker sequence is a -GLSA- linker sequence (SEQ ID NO: 7).
- the nanopore monomer portion of the nanopore protein conjugate is an alpha-hemolysin monomer.
- the resultant nanopore protein conjugate includes an alpha- hemolysin domain (i.e., the alpha-hemolysin monomer portion of the conjugate) and a DNA binding domain, as described herein.
- Alpha-hemolysin is a 293 amino acid polypeptide secreted by Staphylococcus aureus as a water-soluble monomer that assembles into lipid bilayers to form a heptameric pore.
- the heptamer for example, is stable in sodium dodecyl sulfate (SDS) at up to 65° C.
- alpha-hemolysin domain of the nanopore protein conjugate provided herein has the amino acid sequence set forth as SEQ ID NO: 1 (wild type alpha-hemolysin).
- the alpha-hemolysin domain of the nanopore protein conjugate has an amino acid sequence that is 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the sequence set forth as SEQ ID NO: 1.
- the alpha-hemolysin domain of the nanopore protein conjugate has the amino acid sequence set forth as SEQ ID NO: 3 (mature, wild type alpha-hemolysin).
- the alpha- hemolysin domain of the nanopore protein conjugate has an amino acid sequence that is 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the sequence set forth as SEQ ID NO: 3.
- the alpha-hemolysin domain of the nanopore protein conjugate has the amino acid sequence set forth as SEQ ID NO: 4 (mature, parental wild type alpha-hemolysin; AAA26598). In certain example embodiments, the alpha-hemolysin domain of the nanopore protein conjugate has an amino acid sequence that is 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the sequence set forth as SEQ ID NO: 4.
- the alpha-hemolysin domain of the nanopore protein conjugate is a specific alpha-hemolysin variant.
- Such variants for example, have been shown to have improved time-to-thread (see, e.g., U.S. Pat.
- the alpha-hemolysin variant may have at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more sequence identity to SEQ ID NO: 4, but comprise a substitution at a position corresponding to position 12 or 17 of SEQ ID NO: 3.
- the alpha-hemolysin variant may have at least 60%>, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more sequence identity to SEQ ID NO: 4, but comprises a substitution at a position corresponding to position 1, 2, 3, 4, 9, 12, 17, 35, 47, 106, 128, 129, 130, 131, 144, 149, and/or 287.
- the variant further comprises an HI 44 A substitution.
- the substitution comprises one or more positive charges.
- the variant comprises a substitution at a position corresponding to one or more of residues T12 and/or N17.
- the variant comprises a substitution selected from T12K, T12R, N17K, N17R and combinations thereof.
- the variant comprises a K or R substitution corresponding to position 1, 2, 3, 4, 9, 35, 47, 106, 128, 129, 130, 131, 144, 149, and/or 287 of SEQ ID NO:4.
- the alpha-hemolysin variant comprises a substitution at a position corresponding to a residue selected from the group consisting of T12R or K, and/or N17R or K in alpha-hemolysin from Staphylococcus aureus (SEQ ID NO: 1).
- the substitution is T12K.
- the substitution is T12R.
- the substitution is N17K. In certain example embodiments, the substitution is N17R. In certain example embodiments, the variant alpha-hemolysin having an altered characteristic as compared to a parental alpha-hemolysin (e.g., AAA26598) comprises H144A and at least one additional mutation selected from T12K/R, N17K/R, or combinations thereof.
- a parental alpha-hemolysin e.g., AAA26598
- the variant alpha-hemolysin having an altered characteristic as compared to a parental alpha-hemolysin includes one or more of the amino acid sequences set forth as SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, or SEQ ID NO: 11.
- the variant alpha-hemolysin having an altered characteristic as compared to a parental alpha- hemolysin includes an amino acid sequence that is 60%, 65%, 70%>, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identical to one or more of the amino acid sequences set forth as SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, or SEQ ID NO: 11, provided that the amino acid substitution identified therein is preserved.
- the amino acid substitution allows the addition of heterologous molecules, such as polyethylene glycol (PEG).
- the substitution is a non-native amino acid that is basic or positively charged at a pH from about 5 to about 8.5. Additionally or alternatively, the substitution allows the introduction of a post-translational modification, such as described herein.
- the nanopore protein conjugate includes the DNA binding protein Sso7d.
- Sso7d is a small (about 7,000 kd MW), basic chromosomal protein from the hyperthermophilic archaeabacteria Sulfolobus solfataricus.
- the protein is lysine-rich and has a high thermal, acid and chemical stability.
- the Sso7d protein binds double-stranded DNA in a sequence-independent manner and when bound, increases the TM of DNA by up to 40°C under some conditions (McAfee et al., Biochemistry 34: 10063-10077, 1995).
- the Sso7d protein and its homologs are typically believed to be involved in packaging genomic DNA and stabilizing genomic DNA at elevated temperatures.
- the resultant nanopore protein conjugate includes a nanopore monomer domain (i.e., the nanopore monomer protein) that is linked to an Sso7d domain (i.e., the Sso7d DNA binding protein).
- the Sso7d domain of the nanopore protein conjugate is available to bind DNA, such as template DNA, when part of the nanopore assembly.
- the DNA binding domain when the DNA binding domain of the nanopore protein conjugate is a Sso7d protein, the DNA binding domain includes the amino acid sequence set forth as SEQ ID NO: 2 (the amino acid sequence of Sso7d).
- the Sso7d domain of the nanopore protein conjugate includes an amino acid sequence that has at least 60%, 65%, 70%>, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more sequence identity to the amino acid sequence set forth as SEQ ID NO: 2.
- the DNA binding domain of the nanopore protein conjugate includes an Sso7d like protein sequence.
- Sso7d-like proteins also referred to as Sso7 proteins
- Such protein include, for example, Sac7a, Sac7b, Sac7d, and Sac7e, from the hyperthermophilic archacabacteria S.
- Ssh7a and SsbJb Sulfolobus shibatae. These proteins have an identity with Sso7d that ranges from about 78%) to about 98%>.
- Other Sso7d-like proteins that may be used in accordance with the methods and compositions described herein include RiboP3 and Sto7e.
- Sso7 domains that may be used to form the nanopore protein conjugates described herein and may be identified by the methods described in U.S. Pat. No. 8,445,249.
- the Sso7d domain may include one or more amino acid substitutions or post-translational modifications, as further described herein.
- the nanopore protein conjugate includes an alpha-hemolysin domain that is joined to an Sso7d domain. That is, any of the alpha-hemolysin proteins described herein, including any of the alpha- hemolysin variants, can be linked to any of the Sso7d or Sso7d-like proteins described herein to form the nanopore protein conjugate.
- the resultant nanopore protein conjugate for example, thus has an alpha-hemolysin domain and an Sso7d domain.
- the alpha-hemolysin domain may be linked directly to the Sso7d domain, for example, or an interviewing sequence may be present linking the two domains.
- the linkage of the alpha-hemolysin domain and an Sso7d domain is a covalent linkage, with or without an intervening sequence such as a linker sequence.
- a linker sequence such as a linker sequence.
- SEQ ID NO: 1, SEQ ID NO: 3, or SEQ ID NO: 4 can be joined with the Sso7d sequence set forth as SEQ ID NO: 2 to form a nanopore protein conjugate in accordance with the methods and compositions described herein.
- an alpha-hemolysin protein having 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%o, 97%), 98%), or 99% or more sequence identity to one or more of the amino acid sequences set forth as SEQ ID NO: 1, SEQ ID NO: 3, or SEQ ID NO: 4 can be joined with an Sso7d protein having an amino acid sequence that is 60%>, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identical to SEQ ID NO: 2 to form the nanopore protein conjugate.
- the resultant protein conjugate will have an alpha-hemolysin domain and an Sso7d domain.
- the alpha-hemolysin domain can bind with other alpha-hemolysin proteins to forma the heptamer as described herein, while the Sso7d domain is available to bind to a DNA strand, such as template DNA.
- the alpha-hemolysin domain is joined to the to the Sso7d domain by a linker sequence as described herein.
- the linker sequence may include any number of amino acids that join the alpha- hemolysin domain and the Sso7d domain together while sill preserving the independent function of the two domains. That is, the linker will not interfere the ability of the alpha-hemolysin domain to oligomerize with other alpha-hemolysin proteins to form a nanopore. Likewise, the linker sequence will not interfere the ability of the Sso7d domain of the nanopore protein conjugate to bind DNA.
- the linker sequence of the alpha-hemolysin/Sso7d conjugate protein may include, for example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or amino acids. In certain example embodiments, the linker is less than about 10 amino acids, such as 1-5 amino acids. In certain example embodiments, the linker sequence is a -GLSA- linker sequence (SEQ ID NO: 7). In certain example embodiments, the linker may be flexible. In other embodiments, the linker may be rigid. In other embodiments, the linker may comprise modified amino acids or non-peptide structures.
- the alpha-hemolysin/Sso7d protein conjugate has the amino acid sequence acid set forth as SEQ ID NO: 5.
- the liker is a -GLSA- linker sequence that can be located at residues 295-298.
- the alpha-hemolysin/Sso7d protein conjugate has an amino acid sequence that is 60%, 65%, 70%>, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identical to one or more of the amino acid sequences set forth as SEQ ID NO: 5.
- the resultant nanopore protein conjugate has an alpha-hemolysin domain (for binding to other alpha-hemolysin proteins) and an Sso7d domain (for binding DNA, such as template strand DNA).
- nucleic acid sequence that encodes any of the nanopore protein conjugates described herein.
- the nucleic acid sequence encoding the alpha-hemolysin/Sso7d protein conjugate may have at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more sequence identity to SEQ ID NO: 6.
- a vector that includes the nucleic acid sequence that encodes any of the nanopore protein conjugates described herein.
- the vector includes a nucleic acid that is 60%>, 65%, 70%, 75%, 80%,
- sequence identity to SEQ ID NO: 6.
- sequence includes modifications, such as a sequence encoding a His-Tag ⁇ See SEQ ID NO: 12).
- the methods and compositions described herein provide a nanopore assembly that can be used, for example, in a DNA sequencing reaction.
- the nanopore assembly is typically a multimeric protein structure embedded in a substrate, such as a membrane.
- At least one of the protein subunits of the nanopore assembly includes a nanopore protein conjugate as described herein, although - depending on the type of pore - multiple of the subunits of the nanopore may be a nanopore protein conjugate as described herein.
- At least one of the nanopore protein subunits (and in some cases more) of the nanopore assembly includes a DNA binding domain and a nanopore monomer domain - the nanopore monomer domain being the portion of the monomeric subunit that interacts with other nanopore subunits to form the multimeric pore.
- the DNA binding domain (or domains, depending on the number of protein conjugates used in the assembly) is available to bind a DNA template strand in accordance with the methods described herein.
- each subunit of the multimeric nanopore is a nanopore protein conjugate as described herein, whereas in other example embodiments only a portion of the subunits of the nanopore are nanopore protein conjugates. That is, the nanopore assembly includes at least one protein conjugate as described herein, but it may include multiple nanopore protein conjugates as described herein.
- the nanopore protein conjugate of the nanopore assembly can be any of the nanopore protein conjugates described herein.
- the nanopore assembly is an oligomer of seven alpha-hemolysin monomers (i.e., a heptameric nanopore assembly).
- the monomeric subunits of the heptameric nanopore assembly can be identical copies of the same polypeptide or they can be different polypeptides, so long as the ratio totals seven subunits and at least one of the subunits includes a protein conjugate as described herein.
- the nanopore assembly can include six nanopore protein conjugates, each of which having an alpha-hemolysin domain linked to a DNA binding domain as described herein, and one alpha-hemolysin that is configured to link to a DNA polymerase (for a total of seven oligomerized alpha-hemolysin subunits).
- the alpha-hemolysin domain of each of the subunits can be the same, or the alpha-hemolysins can be a mixture of alpha-hemolysin monomers and variants as described herein.
- one subunit of the heptameric, alpha- hemolysin nanopore assembly may be a nanopore protein conjugate having an alpha-hemolysin domain linked to a DNA binding domain, while the remaining six subunits are not nanopore protein conjugates as described herein.
- the remaining six subunits can be alpha-hemolysin proteins or variants thereof that interact with each other - and the single nanopore protein conjugate - to form the heptamer with a single nanopore protein conjugate.
- an alpha-hemolysin nanopore assembly is formed that includes six alpha-hemolysin proteins and one nanopore protein conjugate having an alpha-hemolysin domain linked to a DNA binding domain.
- the heptameric, alpha-hemolysin nanopore assembly may include 2, 3, 4, 5, 6, or 7 nanopore protein conjugates, thereby providing 2, 3, 4, 5, 6, or 7 DNA binding domains, respectively.
- At least one of the subunits of the heptameric, alpha-hemolysin nanopore assembly is a nanopore protein conjugate that includes an alpha-hemolysin domain or variant thereof linked to an Sso7d or Sso7d-like domain as described herein.
- the resulting nanopore assembly includes 1, 2, 3, 4, 5, 6, or 7 alpha-hemolysin/Sso7d protein conjugates.
- the heptameric, alpha-hemolysin assembly may include six alpha-hemolysin/Sso7d protein conjugates and one alpha-hemolysin monomer that is not linked to Sso7d.
- one or more of the 1, 2, 3, 4, 5, 6, or 7 alpha-hemolysin/Sso7d protein conjugates of the heptameric assembly has an amino acid sequence that is 60%, 65%, 70%>, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identical to the sequence set forth as SEQ ID NO: 5.
- the heptameric, alpha-hemolysin assembly may include a mixture of one or more alpha-hemolysin/Sso7d protein conjugates.
- a particular heptameric assembly may include one or more alpha-hemolysin/Sso7d nanopore protein conjugates, one or more alpha- hemolysin/Sso7d-like protein conjugates, and one or more alpha-hemolysin protein monomers without a DNA binding domain, the resultant nanopore assembly having a total of seven subunits arising from the mixture.
- nanopore protein conjugate proteins as described herein such as the alpha- hemolysin/Sso7d nanopore
- the nanopore assembly may be assembled by any method known in the art.
- the nanopore assembly described herein may be assembled according to the methods described in WO2014/074727, which provides a method for forming multimeric proteins having a defined number of modified subunits (see Figure 27 of WO2014/074727).
- the method includes providing multiple first subunits 2705 and providing multiple second subunits 2710, where the second subunits are modified when compared with the first subunits.
- the first subunits are wild- type (e.g., purified from native sources or produced recombinantly).
- the second subunits can be modified in any suitable way.
- the second subunits have a protein (e.g., a polymerase) attached (e.g., as a fusion protein).
- the modified subunits can comprise a chemically reactive moiety (e.g., an azide or an alkyne group suitable for forming a linkage).
- the method further comprises performing a reaction (e.g., a Click chemistry cycloaddition) to attach an entity (e.g., a polymerase) to the chemically reactive moiety.
- the methods of WO2014/074727 can further include contacting the first subunits with the second subunits 2715 in a first ratio to form a plurality of proteins 2720 having the first subunits and the second subunits.
- one part modified alpha-hemolysin subunits having a reactive group suitable for attaching a polymerase can be mixed with six parts alpha- hemolysin/Sso7d protein conjugate subunits (i.e., with the first ratio being 1 :6, or one part alpha-hemolysin/polymerase attachment group to six parts alpha- hemolysin/Sso7d protein conjugate).
- the ratio may be one part alpha-hemolysin/SpyTag fusion peptide to six parts alpha- hemolysin/Sso7d protein conjugate, the combination of which forms a heptameric, alpha-hemolysin nanopore assembly that is configured to bind a DNA polymerase.
- the ratio may be one part alpha-hemolysin/SpyTag fusion peptide to 2, 3, 4, 5, or 6, parts alpha-hemolysin/Sso7d protein conjugate, where any non-protein conjugates are alpha-hemolysin monomers or variants thereof and the resultant protein is a heptameric, alpha-hemolysin nanopore assembly that is configured to bind a DNA polymerase.
- the multiple proteins can have multiple ratios of the first subunits to the second subunits.
- the mixed subunits can form several nanopores having a distribution of stoichiometries of modified to un-modified subunits (e.g., 1 :6, 2:5, 3 :4).
- the alpha-hemolysin portion of any of the ratios can be wild type alpha-hemolysin, for example, or any alpha-hemolysin variant as described herein.
- the nanopores are formed by simply mixing the subunits.
- a detergent e.g., deoxycholic acid
- a detergent can trigger the alpha-hemolysin monomer to adopt the pore conformation.
- the nanopores can also be formed using a lipid (e.g., l ,2-diphytanoyl-sn-glycero-3-phosphocholine (DPhPC) or 1 ,2-di-0-phytanyl-sn- glycero-3-phosphocholine (DoPhPC)) and moderate temperature (e.g., less than about 100 °C).
- DPhPC l ,2-diphytanoyl-sn-glycero-3-phosphocholine
- DoPhPC 1 ,2-di-0-phytanyl-sn- glycero-3-phosphocholine
- moderate temperature e.g., less than about 100 °C
- the resulting proteins can have a mixed stoichiometry (e.g., of the wild type and mutant proteins). For example, the stoichiometry of such proteins can follow a formula which is dependent upon the ratio of the concentrations of the two proteins used in the pore forming reaction.
- the method can further include fractionating the mixture of proteins to enrich proteins that have a second ratio of the first subunits to the second subunits.
- nanopore proteins can be isolated that have one and only one modified subunit (e.g., a second ratio of 1 :6).
- any second ratio is suitable.
- a distribution of second ratios can also be fractionated such as enriching proteins that have either one or two modified subunits.
- the total number of subunits forming the protein is not always 7 (e.g., a different nanopore can be used or an alpha-hemolysin nanopore can form having six subunits) as depicted in Figure 27 of WO2014/074727.
- proteins having only one modified subunit are enriched.
- the second ratio is 1 second subunit per (n-1) first subunits where n is the number of subunits comprising the protein.
- the first ratio can be the same as the second ratio, however this is not required. In some cases, proteins having mutated monomers can form less efficiently than those not having mutated subunits. If this is the case, the first ratio can be greater than the second ratio (e.g., if a second ratio of 1 mutated to 6 non-mutated subunits are desired in a nanopore, forming a suitable number of 1 :6 proteins may require mixing the subunits at a ratio greater than 1 :6). [00110] Proteins having different second ratios of subunits can behave differently (e.g., have different retention times) in a separation.
- the proteins are fractionated using chromatography, such as ion exchange chromatography or affinity chromatography. Since the first and second subunits can be identical apart from the modification, the number of modifications on the protein can serve as a basis for separation. In certain example embodiments, either the first or second subunits have a purification tag (e.g., in addition to the modification) to allow or improve the efficiency of the fractionation. In certain example embodiments, a poly-histidine tag (His-tag), a streptavidin tag (Strep-tag), or other peptide tag is used. In some instances, the first and second subunits each comprise different tags and the fractionation step fractionates on the basis of each tag. In the case of a His-tag, a charge is created on the tag at low pH (Histidine residues become positively charged below the pKa of the side chain).
- chromatography such as ion exchange chromatography or affinity chromatography. Since the first and second subunits can be identical apart from the modification, the number of modifications on
- ion exchange chromatography can be used to separate oligomers which have 0, 1 , 2, 3, 4, 5, 6, or 7 of the "charge-tagged" alpha- hemolysin subunits.
- this charge tag can be a string of any amino acids which carry a uniform charge.
- Figure 28 and Figure 29 of WO2014/074727 show examples of fractionation of nanopores based on a His-tag.
- Figure 28 shows a plot of ultraviolet absorbance at 280 nanometers, ultraviolet absorbance at 260 nanometers, and conductivity. The peaks correspond to nanopores with various ratios of modified and unmodified subunits.
- Figure 29 of WO2014/074727 shows fractionation of alpha-hemolysin nanopores and mutants thereof using both His-tag and Strep -tags.
- an entity e.g., a polymerase
- the protein can be a nanopore monomer, such as an alpha-hemolysin monomer, and the entity can be a polymerase.
- a DNA polymerase fusion protein having a SpyCatcher sequence may be combined with an alpha-hemolysin fusion protein having a SpyTag domain, thereby resulting in an alpha-hemolysin monomer linked to the DNA polymerase. See, for example, Li et al, J Mol Biol. 2014 Jan 23; 426(2):309- 17.
- the resultant alpha-hemolysin/polymerase protein can then be used, along with one or more of the protein conjugates described herein, to form the nanopore assembly.
- the method further includes inserting the proteins having the second ratio subunits into a bilayer.
- a nanopore can comprise multiple subunits as described herein.
- a polymerase can be attached to one of the subunits and at least one and less than all of the subunits comprise a first purification tag.
- the nanopore is alpha-hemolysin or a variant thereof as described herein.
- all of the subunits comprise a first purification tag or a second purification tag.
- the first purification tag can be a poly-histidine tag (e.g., on the subunit having the polymerase attached).
- the nanopore assembly includes - in addition to at least one of the nanopore protein conjugates described herein - a
- DNA-manipulating or modifying enzyme that is linked to a nanopore monomer of the nanopore assembly.
- a polymerase such as a DNA polymerase
- the polymerase can be attached to the nanopore before or after the nanopore is incorporated into the membrane.
- the polymerase can be attached to a nanopore monomer, such as an alpha-hemolysin monomer, before or after the monomer is incorporated into the multimeric nanopore assembly.
- the nanopore and polymerase are a fusion protein (i.e., single polypeptide chain).
- DNA polymerase capable of synthesizing DNA during a DNA synthesis reaction may be used in accordance with the methods and compositions described herein.
- Exemplary DNA polymerases include, but are not limited to, phi29 (Bacillus bacteriophage ⁇ 29), pol6 (Clostridium phage phiCPV4; GenBank: AFH27113.1) or pol7 (Actinomyces phage Av-1; GenBank: ABR67671.1).
- attached to the nanopore assembly is a DNA-manipulating or modifying enzyme, such as a ligase, nuclease, phosphatase, kinase, transferase, or topoisomerase.
- a polymerase for example, can be attached to the nanopore assembly in any suitable way known in the art. See, for example, PCT/US2013/068967
- the polymerase is attached to a nanopore monomer of a multimeric nanopore, such as to an alpha-hemolysin monomer of the heptameric, alpha-hemolysin nanopore.
- the full nanopore heptamer is then assembled, such as in a ratio of one monomer with an attached polymerase to six nanopore protein conjugates.
- the nanopore heptamer can then be inserted into the membrane.
- a method for attaching a polymerase to a nanopore involves attaching a linker molecule to one of the alpha-hemolysin monomers or mutating a alpha-hemolysin monomer to have an attachment site and then assembling the full nanopore heptamer (e.g., at a ratio of one monomer with linker and/or attachment site to six alpha-hemolysin/DNA binding protein conjugates no linker and/or attachment site).
- a polymerase can then be attached to the attachment site or attachment linker (e.g., in bulk, before inserting into the membrane).
- the polymerase can also be attached to the attachment site or attachment linker after the (e.g., heptamer) nanopore is formed in the membrane.
- the polymerase can be attached to the nanopore assembly with any suitable chemistry (e.g., covalent bond and/or linker).
- the polymerase is attached to the nanopore with molecular staples.
- molecular staples comprise three amino acid sequences (denoted linkers A, B and C).
- Linker A can extend from a hemolysin monomer
- Linker B can extend from the polymerase
- Linker C then can bind Linkers A and B (e.g., by wrapping around both Linkers A and B) and thus the polymerase to the nanopore.
- Linker C can also be constructed to be part of Linker A or Linker B, thus reducing the number of linker molecules.
- the SpyTag/SpyCatcher system which spontaneously forms covalent isopeptide linkages under physiological conditions, may be used to join an alpha-hemolysin monomer to the polymerase. See, for example, Li et al, J Mol Biol. 2014 Jan 23; 426(2):309-17.
- an alpha- hemolysin fusion protein can be expressed having a SpyTag domain.
- the DNA Polymerase to be joined to the alpha-hemolysin may be separately expressed as fusion protein having a SpyCatcher domain.
- the SpyTag and SpyCatcher proteins interact to form the alpha-hemolysin monomer that is linked to a DNA polymerase via a covalent isopeptide linkage.
- the polymerase may be attached to a nanopore monomer before the nanopore monomer is incorporated into a nanopore assembly.
- the purified alpha-hemolysin/SpyTag fusion protein is mixed with purified polymerase/SpyCatcher fusion protein, thus allowing the SpyTag and Spy Catcher proteins bind each other to form an alpha- hemolysin/polymerase monomer.
- the monomer can then be incorporated into the nanopore assembly as described herein to form a heptameric assembly.
- the polymerase is attached to the nanopore assembly after formation of the nanopore assembly.
- the fusion protein is incorporated into the nanopore assembly, along with one or more nanopore protein conjugates, as described herein to form a heptameric nanopore assembly.
- the polymerase/SpyCatcher fusion protein is then mixed with the heptameric assembly, thus allowing the SpyTag and SpyCatcher proteins bind each other, which in turn results in binding of the polymerase to the nanopore assembly.
- nanopore assembly may be configured, for example, to have only a single SpyTag, which therefore allows the attachment of a single polymerase/SpyCatcher.
- alpha-hemolysin for example, mixing the alpha- hemolysin/SpyTag proteins with the alpha-hemolysin/Sso7d conjugate proteins results in heptamers having 0, 1, 2, 3, 4, 5, 6, or 7 alpha-hemolysin/SpyTag subunits. Yet because of the different number of SpyTag sequences (0, 1, 2, 3, 4,
- the heptamers have different charges.
- the heptamers can be separated by methods known in the art, such as via elution with cation exchange chromatography. The eluted fractions can then be examined to determine which fraction includes an assembly with a single SpyTag.
- the different heptamer fraction can be separated based on molecular weight, such as via SDS-PAGE. A reagent can then be used to confirm the presence of SpyTag associated with each fraction. For example, a SpyCatcher- GFP (green fluorescent protein) can be added to the factions before separation via SDS-PAGE.
- the fraction with a single SpyTag can be identified, as evidenced by the furthest band migration and the presence of GFP fluorescence in the SDS-PAGE gel corresponding to the band.
- a fraction containing seven alpha-hemolysin/Sso7d conjugate proteins and zero SpyTag fusion proteins will migrate the furthest, but will not fluoresce when mixed with SpyCatcher-GFP because of the absence of the SpyTag bound to the heptamers.
- the faction containing a single SpyTag will both migrate the next furthest (compared to other fluorescent bands) and will fluoresce.
- the polymerase/SpyCatcher fusion protein can then be added to this fraction, thereby linking the polymerase to the nanopore assembly.
- a nanopore assembly tethered to a single DNA polymerase and including at least one nanopore protein conjugate as described herein can be achieved.
- the nanopore assembly to which the polymerase is attached includes an alpha-hemolysin/Sso7d protein conjugate or an alpha-hemolysin/Sso7d-like protein conjugate as described herein.
- the heptameric nanopore may include at least one alpha-hemolysin/Sso7d protein conjugate as described herein, a single alpha-hemolysin monomer that is joined to a DNA Polymerase, and multiple alpha-hemolysin proteins or variants thereof for a total of seven subunits.
- the alpha-hemolysin nanopore assembly includes six alpha-hemolysin/Sso7d protein conjugates as described herein and one alpha-hemolysin that is joined to a DNA Polymerase (for a total of seven subunits).
- the alpha-hemolysin domain of the six alpha-hemolysin/Sso7d protein conjugates can be the same or be an alpha- hemolysin variant as described herein.
- the alpha- hemolysin nanopore assembly may include 1, 2, 3, 4, 5, or 6 alpha- hemolysin/Sso7d protein conjugates as described herein and one alpha-hemolysin that is joined to a DNA Polymerase (for a total of seven subunits).
- the nanopore assembly includes six alpha-hemolysin/Sso7d protein conjugates having a sequence that is at least is 60%, 65%, 70%>, 75%>, 80%>, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or more identical to the sequence set forth as SEQ ID NO: 5 and one alpha-hemolysin protein (or variant thereof) that is linked to a DNA Polymerase.
- the nanopore assembly described herein may be formed or otherwise embedded in a membrane disposed adjacent to a sensing electrode of a sensing circuit, such as an integrated circuit.
- the integrated circuit may be an application specific integrated circuit (ASIC).
- the integrated circuit is a field effect transistor or a complementary metal-oxide semiconductor (CMOS).
- CMOS complementary metal-oxide semiconductor
- the sensing circuit may be situated in a chip or other device having the nanopore, or off of the chip or device, such as in an off-chip configuration.
- the semiconductor can be any semiconductor, including, without limitation, Group IV
- Group III-V semiconductors e.g., gallium arsenide. See, for example, WO 2013/123450, for the apparatus and device set-up for sensing a nucleotide or tag.
- Pore based sensors can be used for electro-interrogation of single molecules.
- a pore based sensor can include a nanopore of the present disclosure formed in a membrane that is disposed adjacent or in proximity to a sensing electrode.
- the sensor can include a counter electrode.
- the membrane includes a trans side (i.e., side facing the sensing electrode) and a cis side (i.e., side facing the counter electrode).
- a method for detecting a target molecule includes, for example, preparing a chip that includes a nanopore as described herein.
- the nanopore is a nanopore assembly including a nanopore monomer and DNA binding domain, such as an alpha-hemolysin monomer joined to an Sso7d domain.
- the nanopore is then disposed within a membrane.
- a sensing electrode is then positioned adjacent or in proximity to the membrane such that the electrode can detect a signal arising from the nanopore assembly.
- the nanopore is then contacted with a nucleic acid molecule, such as a DNA strand that is to be sequenced.
- the nucleic acid molecule is associated with a reporter molecule having an address region and a probe region.
- the reporter molecule is associated with the nucleic acid molecule at the probe region and the reporter molecule is coupled to a target molecule.
- the method further includes sequencing the address region while the nucleic acid molecule is in contact with the nanopore to determine a nucleic acid sequence of said address region.
- the method also includes identifying, with the aid of a computer processor, the target molecule based upon a nucleic acid sequence of the sequenced address region.
- the nanopore assembly sequencing activity can be improved.
- the difference in time between when the polymerase ceases sequencing activity and when the nanopore ceases its channel activity i.e, the time when the pore last had an open channel
- this timeframe can be reduced by about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60% or more as compared to such controls.
- the sequencing end time i.e., the amount of time the polymerase of the nanopore actively sequences a template
- the sequence end time can be increased (and hence improved) as compared to control assemblies lacking the nanopore protein conjugates.
- the sequence end time can be increased by about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60% or more compared to control assemblies lacking the nanopore protein conjugates.
- Example 1 Expression & Recovery of a-HL/Sso7d Protein Conjugate
- This example illustrates the expression and recovery of protein from bacterial host cells, e.g., E. coli.
- the gene encoding a-HL/Sso7d listed in SEQ ID NO: 12 was synthesized by Genscript and inserted into a pET26b vector using standard DNA restriction enzyme digestion and ligation. Plasmid DNA was transformed into DE3BL21 E. coli competent cells using standard heat-shock protocols and grown on LB agar plates supplemented with Kanamycin. Bacterial colonies were selected and sequenced to verify the integrity of the gene. Bacterial cultures were started from glycerol stocks and grown overnight in 5 mL cultures of LB media supplemented with the appropriate antibiotic. These cultures were then expanded in autoinduction MagicMedia (Invitrogen) supplemented with antibiotics and allowed to expand at 25 C for 16-24 hours. Cell pellets were harvested using centrifugation at 2,200 x g for 15 minutes and frozen at -80 C until further use.
- autoinduction MagicMedia Invitrogen
- pellets were thawed and solubilized in 5 mL of lysis buffer (50 mM Tris-HCl, pH 8.0, 300 mM NaCl, 100 mM KP04, 10 mM Imidazole) for every gram of cell pellet and supplemented with EDTA-free protease inhibitor tablets and DNasel (Sigma-AldrichTM).
- lysis buffer 50 mM Tris-HCl, pH 8.0, 300 mM NaCl, 100 mM KP04, 10 mM Imidazole
- EDTA-free protease inhibitor tablets and DNasel Sigma-AldrichTM.
- Cells were lysed using a tip sonicator (Fisher Scientific ) set to 90% max power and pulsed for 1 second on, 4 seconds off for two minutes. Cell debris was removed using centrifugation at 20,000 x g for 45 minutes.
- the supernatant was applied to a cobalt affinity column and washed with 2 CV of lysis buffer, 2 CV of wash buffer (50 mM Tris-HCl, pH 8.0, 500 mM NaCl, 10 mM Imidazole), 10 CV of high salt wash buffer (50 mM Tris-HCl, pH 8.0, 1 M NaCl, 10 mM Imidazole), 2 CV of wash buffer, and eluted using wash buffer supplemented with 150 mM imidazole.
- wash buffer 50 mM Tris-HCl, pH 8.0, 500 mM NaCl, 10 mM Imidazole
- high salt wash buffer 50 mM Tris-HCl, pH 8.0, 1 M NaCl, 10 mM Imidazole
- wash buffer 50 mM Tris-HCl, pH 8.0, 1 M NaCl, 10 mM Imidazole
- FIG. 1 Purification of the monomeric a-HL/Sso7d protein is shown in FIG. 1. Briefly, serial elution of the purified protein was subject to SDS-PAGE Gel electrophoresis. The gels where then imaged using the Bio-Rad stain- free gel system. The purified a-HL/Sso7d is shown at around the expected 45 kD m.w. in lanes 6, 7, and 8 (FIG. 1).
- This example describes the assembly of a nanopore comprising six a-HL/Sso7d protein conjugates subunits and one wild-type a-HL subunit having a
- wild-type a-HL was expressed with SpyTag and a HisTag as described for a-HL/Sso7d in Example 1.
- the recombinant protein a-HL/SpyTag protein was purified on a cobalt affinity column using a cobalt elution buffer (200mM NaCl, 150mM imidazole, 50mM tris, pH 8).
- a-HL/Sso7d protein was expressed as described in Example 1 with a HisTag and purified on a cobalt affinity column using a cobalt elution buffer (200mM NaCl, 150mM imidazole, 50mM tris, pH 8).
- the proteins were stored at 4°C if used within 5 days, otherwise 8% trehalose was added and stored at -80°C.
- the a-HL/SpyTag to desired a-HL/Sso7d protein solutions were mixed together at a 1 :9 ratio to form a mixture of heptamers. It is expected that such a mixture will result in various fractions that include varying ratios of a-HL/SpyTag and a-HL/Sso7d protein (0:7; 1 :6, 2:5, 3:4, etc.), where the SpyTag component is present as 0, 1, 2, 3, 4, 5, 6, or seven monomeric subunits of the heptamer.
- Diphytanoylphosphatidylcholine (DPhPC) lipid was solubilized in either 50mM Tris, 200mM NaCl, pH 8 or 150mM KC1, 30mM HEPES, pH 7.5 to a final concentration of 50mg/ml and added to the mixture of a-HL monomers to a final concentration of 5mg/ml.
- the mixture of the a-HL monomers was incubated at 37°C for at least 60min.
- n-Octyl-P-D-Glucopyranoside (POG) was added to a final concentration of 5% (weight/volume) to solubilize the resulting lipid-protein mixture.
- the sample was centrifuged to clear protein aggregates and left over lipid complexes and the supernatant was collected for further purification.
- the mixture of heptamers was then subjected to cation exchange purification and the elution fractions collected. For each fraction, two samples were prepared for SDS-PAGE. The first sample included 15 uL of a-HL eluate alone and the second sample was combined with 3 ug of SpyCatcher-GFP. The samples were then incubated and sheltered from light and at room temperature for 1-16 hours. Following incubation, 5 uL of 4x Laemmli SDS-PAGE buffer (Bio- Rad) was added to each sample. The samples and a PrecisionPlusTM Stain-Free protein ladder were then loaded onto a 4-20% Mini-PROTEAN Stain-Free protein precast gel (Bio-Rad). The gels were ran at 200 mV for 30 minutes. The gels were then imaged using a Stain-Free filter.
- 4x Laemmli SDS-PAGE buffer Bio- Rad
- HL purification above can be analyzed for the ratio of a-HL/SpyTag:a-HL/Sso7d.
- the presence of SpyCatcher-GFP attachment can be observed using a GFP-fluorescence filter when imaging the SDS-PAGE gels.
- the polymerase e.g., phi29 DNA Polymerase
- a protein nanopore e.g. alpha-hemolysin
- 1 :6 a-HL/SpyTag:a-HL/Sso7d via the SpyTag and SpyCatcher system. See, for example, Li et al, J Mol Biol. 2014 Jan 23;426(2):309-17.
- the Sticky phi29 Polymerase SpyCatcher HisTag was expressed according to Example 1 and purified using a cobalt affinity column.
- the SpyCatcher/polymerase and the oligomerized 1 :6 a-HL/SpyTag:a-HL/Sso7d heptamers were incubated at a 1 : 1 molar ratio overnight at 4°C to facilitate binding of the SpyCatcher/polymerase to the 1 :6 a-HL/SpyTag:a-HL/Sso7d heptamers.
- the activity of the resultant 1 :6 a-HL/Polymerase:a-HL/Sso7d nanopore assemblies were then evaluated as described in Example 4.
- This example shows the activity of the nanopores as provided by Example 3 (i.e., 1 :6 a-HL/Polymerase:a-HL/Sso7d nanopores).
- Heptameric nanopore assemblies including wild-type alpha-hemolysin monomers and with a single phi29 DNA Polymerase attached thereto were prepared as controls according to Examples 1-3.
- the time it takes to capture a tagged molecule by the DNA polymerase attached to the nanopore was determined using alternating voltages, i.e., squarewaves. Data from the time-to-capture experiments was then extrapolated to determine the difference between when the polymerase ceased sequencing activity and when the pore ceased its activity (i.e, the time when the pore last had an open channel) (FIGS. 3A-B). In other words, the lifetime of the polymerase was compared to the lifetime of the pore (FIGS. 3A-B).
- the sequencing end time i.e., the amount of time the polymerase of the nanopore actively sequences a template, was also determined from the time-to-capture (FIGS. 4A-4B).
- a-HL/Sso7d nanopores for the activity assay, bilayers were formed and pores were inserted as described in PCT/US14/61853 filed 23 October 2014.
- the nanopore device (or sensor) used to detect a molecule (and/or sequence a nucleic acid) was set-up as described in WO2013123450.
- Genia Sequencing device is used with a Genia Sequencing Chip.
- the electrodes are conditioned and phospholipid bilayers are established on the chip as explained in PCT/US2013/026514.
- Genia's sequencing complex is inserted to the bilayers following the protocol described in PCT/US2013/026514 (published as WO2013/123450).
- the time-to-thread data shown in this patent was collected using a buffer system comprised of 20mM HEPES pH 7.5, 300mM KC1, 3uM tagged nucleotide, 3mM Ca2+, with a voltage applied of +/- lOOmV with a duty cycle of 5Hz. After the data was collected it was analyzed for squarewaves that showed the capture of a tagged nucleotide (threaded level) which lasted to the end of the positive portion of the squarewave, and was followed by another tag capture on the subsequent squarewave. The time-to-thread was measured by determining how long the second squarewave reported unobstructed open channel current.
- time-to-thread parameter would be calculated from squarewaves 2-10 (the first squarewave does not factor into the calculation because the polymerase did not have a tag bound to it in the previous squarewave).
- time-to-thread numbers were then collected for all of the pores in the experiment and statistical parameters extracted from them (such as a mean, median, standard deviation etc.).
- FIGS. 3A-3B Table 1, Table 2, FIGS. 4A-4B, Table 3, and Table 4.
- the time between when the polymerase ceased sequencing activity and when the pore ceased its activity was substantially reduced with the use of the 1 :6 a-HL/Polymerase:a-HL/Sso7d nanopores (versus control).
- the mean time was reduced from roughly 225 seconds to 114 seconds (see also FIGS. 3A-3B).
- Table 1 Time between when polymerase ceases sequencing activity and when nanopore ceases channel activity for 1:6 a-HL/Polymerase:a-HL/WT control nanopores. Data are provided in seconds.
- the sequencing end time was substantially increased with the use of the 1 :6 a-HL/Polymerase:a-HL/Sso7d nanopores (versus control). As shown in Table 3 and 4, for example, the mean sequencing end time was increased from roughly 1502 seconds to 1907 seconds (see also FIGS. 4A-4B).
- Table 3 Sequencing end time data for 1:6 a-HL/Polymerase:a-HL/WT control nanopores. Data are provided in seconds.
- a-HL/Polymerase a-HL/Sso7d protein conjugate nanopores. Data are provided in seconds.
- SEQ ID NO: 5 (a-HL/Sso7d Protein Conjufiate; Linker Underlined)
- KELLQMLEKQ KK 312 SEQ ID NO: 6 (a-HL/Sso7d Codin Sequence)
- YYPRNSIDTK EYMSTLTYGF NGNVTGDDTG KIGGLIGANV SIGHTLKYVQ 150 PDFKTILESP TDKKVGWKVI FNNMVNQNWG PYDRDSWNPV YGNQLFMKTR 200
- PQFEK 305 SEQ ID NO: 9 (N17R ot-HL amino acids)
- NGSMKAADNF LDPNKASSLL SSGFSPDFAT VITMDRKASK QQTNIDVIYE 250 RVRDDYQLHW TSTNWKGTNT KDKWTDRSSE RYKIDWEKEE MTNGLSAWSH 300 PQFEK 305 SEQ ID NO: 11 (T12R ot-HL amino acids)
- SEQ ID NO: 12 (q-HL/Sso7d Codinfi Sequence with His-Tafi)
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Biomedical Technology (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Immunology (AREA)
- Analytical Chemistry (AREA)
- Urology & Nephrology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- General Physics & Mathematics (AREA)
- Food Science & Technology (AREA)
- Pathology (AREA)
- Nanotechnology (AREA)
- Genetics & Genomics (AREA)
- Hematology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Peptides Or Proteins (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662316236P | 2016-03-31 | 2016-03-31 | |
PCT/EP2017/057433 WO2017167811A1 (en) | 2016-03-31 | 2017-03-29 | Nanopore protein conjugates and uses thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
EP3436602A1 true EP3436602A1 (en) | 2019-02-06 |
Family
ID=58488970
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP17715423.4A Pending EP3436602A1 (en) | 2016-03-31 | 2017-03-29 | Nanopore protein conjugates and uses thereof |
Country Status (3)
Country | Link |
---|---|
US (1) | US11150233B2 (en) |
EP (1) | EP3436602A1 (en) |
WO (1) | WO2017167811A1 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109563140B (en) | 2016-04-21 | 2022-10-28 | 豪夫迈·罗氏有限公司 | Alpha-hemolysin variants and uses thereof |
WO2019157424A1 (en) * | 2018-02-12 | 2019-08-15 | P&Z Biological Technology Llc | Nanopore assemblies and uses thereof |
CN111801344A (en) * | 2018-02-15 | 2020-10-20 | 豪夫迈·罗氏有限公司 | Nanopore protein conjugates for detection and analysis of analytes |
WO2019166458A1 (en) * | 2018-02-28 | 2019-09-06 | F. Hoffmann-La Roche Ag | Alpha-hemolysin variants and uses thereof |
JP2023549796A (en) * | 2020-11-13 | 2023-11-29 | ナンジン、ユニバーシティ | Programmable Nanoreactor for Probabilistic Sensing (PNRSS) |
CN114732898B (en) * | 2022-04-01 | 2023-05-09 | 中国人民解放军军事科学院军事医学研究院 | Fixed-point covalent binding method of CpG adjuvant and antigen |
CN117886907A (en) * | 2022-10-14 | 2024-04-16 | 北京普译生物科技有限公司 | PHT nanopore mutant protein and application thereof |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7666645B2 (en) | 2002-10-23 | 2010-02-23 | Bio-Rad Laboratories, Inc. | Sso7-polymerase conjugate proteins |
US7238485B2 (en) | 2004-03-23 | 2007-07-03 | President And Fellows Of Harvard College | Methods and apparatus for characterizing polynucleotides |
US20060014183A1 (en) * | 2004-06-10 | 2006-01-19 | Pfundheller Henrik M | Extendable probes |
US7932034B2 (en) * | 2006-12-20 | 2011-04-26 | The Board Of Trustees Of The Leland Stanford Junior University | Heat and pH measurement for sequencing of DNA |
US9670243B2 (en) * | 2010-06-02 | 2017-06-06 | Industrial Technology Research Institute | Compositions and methods for sequencing nucleic acids |
WO2012009578A2 (en) * | 2010-07-14 | 2012-01-19 | The Curators Of The University Of Missouri | Nanopore-facilitated single molecule detection of nucleic acids |
US10443096B2 (en) | 2010-12-17 | 2019-10-15 | The Trustees Of Columbia University In The City Of New York | DNA sequencing by synthesis using modified nucleotides and nanopore detection |
CA2864125C (en) | 2012-02-16 | 2021-08-31 | Genia Technologies, Inc. | Methods for creating bilayers for use with nanopore sensors |
WO2014028311A2 (en) * | 2012-08-15 | 2014-02-20 | President And Fellows Of Harvard College | Polynucleotide-binding domains as a means of cell labeling, cell organization and polymer sequencing |
US9605309B2 (en) | 2012-11-09 | 2017-03-28 | Genia Technologies, Inc. | Nucleic acid sequencing using tags |
WO2014100481A2 (en) * | 2012-12-20 | 2014-06-26 | Electornic Biosciences Inc. | Modified alpha hemolysin polypeptides and methods of use |
EP3212810B1 (en) * | 2014-10-31 | 2020-01-01 | Genia Technologies, Inc. | Alpha-hemolysin variants with altered characteristics |
CN108137656A (en) * | 2015-09-24 | 2018-06-08 | 豪夫迈·罗氏有限公司 | Alpha hemolysin variant |
CN109563140B (en) * | 2016-04-21 | 2022-10-28 | 豪夫迈·罗氏有限公司 | Alpha-hemolysin variants and uses thereof |
-
2017
- 2017-03-29 EP EP17715423.4A patent/EP3436602A1/en active Pending
- 2017-03-29 WO PCT/EP2017/057433 patent/WO2017167811A1/en active Application Filing
-
2018
- 2018-09-28 US US16/146,053 patent/US11150233B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
WO2017167811A1 (en) | 2017-10-05 |
US20190079067A1 (en) | 2019-03-14 |
US11150233B2 (en) | 2021-10-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11150233B2 (en) | Nanopore protein conjugates and uses thereof | |
US11613778B2 (en) | Long lifetime alpha-hemolysin nanopores | |
US10968480B2 (en) | Alpha-hemolysin variants and uses thereof | |
US11479584B2 (en) | Alpha-hemolysin variants with altered characteristics | |
US11261488B2 (en) | Alpha-hemolysin variants | |
US20200385433A1 (en) | Alpha-hemolysin variants and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: UNKNOWN |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20181031 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) | ||
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
17Q | First examination report despatched |
Effective date: 20191107 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R079 Free format text: PREVIOUS MAIN CLASS: C12Q0001680000 Ipc: G01N0033487000 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: C12Q 1/6869 20180101ALI20240325BHEP Ipc: G01N 33/487 20060101AFI20240325BHEP |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: GRANT OF PATENT IS INTENDED |