CN113480620A - Mutant of porin monomer, protein pore and application thereof - Google Patents
Mutant of porin monomer, protein pore and application thereof Download PDFInfo
- Publication number
- CN113480620A CN113480620A CN202110950708.3A CN202110950708A CN113480620A CN 113480620 A CN113480620 A CN 113480620A CN 202110950708 A CN202110950708 A CN 202110950708A CN 113480620 A CN113480620 A CN 113480620A
- Authority
- CN
- China
- Prior art keywords
- mutant
- seq
- protein
- ala
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 202
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 189
- 239000011148 porous material Substances 0.000 title claims abstract description 182
- 108010013381 Porins Proteins 0.000 title claims abstract description 163
- 239000000178 monomer Substances 0.000 title claims abstract description 157
- 102000007739 porin activity proteins Human genes 0.000 title claims abstract 24
- 150000001413 amino acids Chemical class 0.000 claims abstract description 123
- 230000035772 mutation Effects 0.000 claims abstract description 70
- 239000012491 analyte Substances 0.000 claims abstract description 57
- 102000040430 polynucleotide Human genes 0.000 claims description 130
- 108091033319 polynucleotide Proteins 0.000 claims description 130
- 239000002157 polynucleotide Substances 0.000 claims description 130
- 102000039446 nucleic acids Human genes 0.000 claims description 80
- 108020004707 nucleic acids Proteins 0.000 claims description 80
- 150000007523 nucleic acids Chemical class 0.000 claims description 79
- 108020004414 DNA Proteins 0.000 claims description 72
- 238000000034 method Methods 0.000 claims description 72
- 125000003729 nucleotide group Chemical group 0.000 claims description 65
- 239000002773 nucleotide Substances 0.000 claims description 64
- 239000012528 membrane Substances 0.000 claims description 48
- 229910052799 carbon Inorganic materials 0.000 claims description 45
- 229910052770 Uranium Inorganic materials 0.000 claims description 43
- 229910052717 sulfur Inorganic materials 0.000 claims description 39
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 29
- 238000006467 substitution reaction Methods 0.000 claims description 25
- 238000005259 measurement Methods 0.000 claims description 24
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 19
- 238000003780 insertion Methods 0.000 claims description 19
- 230000037431 insertion Effects 0.000 claims description 19
- 238000012217 deletion Methods 0.000 claims description 17
- 230000037430 deletion Effects 0.000 claims description 17
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 17
- 229910052739 hydrogen Inorganic materials 0.000 claims description 16
- 229910052700 potassium Inorganic materials 0.000 claims description 15
- 239000013598 vector Substances 0.000 claims description 14
- 241000589516 Pseudomonas Species 0.000 claims description 13
- 229920001184 polypeptide Polymers 0.000 claims description 13
- 102000014914 Carrier Proteins Human genes 0.000 claims description 12
- 108091008324 binding proteins Proteins 0.000 claims description 12
- 229910052740 iodine Inorganic materials 0.000 claims description 12
- 239000003814 drug Substances 0.000 claims description 7
- 229920000642 polymer Polymers 0.000 claims description 7
- 102220469367 Apolipoprotein C-I_T71S_mutation Human genes 0.000 claims description 6
- 102220470964 Carcinoembryonic antigen-related cell adhesion molecule 5_Y68F_mutation Human genes 0.000 claims description 6
- 108091034117 Oligonucleotide Proteins 0.000 claims description 6
- 102200026990 rs137852764 Human genes 0.000 claims description 6
- 102220094204 rs876659508 Human genes 0.000 claims description 6
- 102200131327 rs879255262 Human genes 0.000 claims description 6
- 229940079593 drug Drugs 0.000 claims description 5
- 239000004172 quinoline yellow Substances 0.000 claims description 5
- 239000004408 titanium dioxide Substances 0.000 claims description 5
- 229910052757 nitrogen Inorganic materials 0.000 claims description 4
- 239000000356 contaminant Substances 0.000 claims description 3
- 229940039227 diagnostic agent Drugs 0.000 claims description 3
- 239000000032 diagnostic agent Substances 0.000 claims description 3
- 230000007613 environmental effect Effects 0.000 claims description 3
- 239000002360 explosive Substances 0.000 claims description 3
- 150000004676 glycans Chemical class 0.000 claims description 3
- 229910017053 inorganic salt Inorganic materials 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 229910021645 metal ion Inorganic materials 0.000 claims description 3
- 229920001282 polysaccharide Polymers 0.000 claims description 3
- 239000005017 polysaccharide Substances 0.000 claims description 3
- 230000001939 inductive effect Effects 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 102220474385 Solute carrier family 13 member 3_S75A_mutation Human genes 0.000 claims 1
- 238000012512 characterization method Methods 0.000 abstract description 5
- 238000001514 detection method Methods 0.000 abstract description 4
- 235000018102 proteins Nutrition 0.000 description 166
- 102000017033 Porins Human genes 0.000 description 139
- 235000001014 amino acid Nutrition 0.000 description 105
- 102000053602 DNA Human genes 0.000 description 71
- 238000012163 sequencing technique Methods 0.000 description 61
- 108060004795 Methyltransferase Proteins 0.000 description 42
- 239000010410 layer Substances 0.000 description 19
- 229920002477 rna polymer Polymers 0.000 description 16
- 108091006146 Channels Proteins 0.000 description 15
- 238000005516 engineering process Methods 0.000 description 15
- 230000008569 process Effects 0.000 description 15
- 230000005945 translocation Effects 0.000 description 13
- 102000004190 Enzymes Human genes 0.000 description 11
- 108090000790 Enzymes Proteins 0.000 description 11
- -1 Hexitol Nucleic Acids Chemical class 0.000 description 11
- 150000003839 salts Chemical class 0.000 description 11
- 238000012360 testing method Methods 0.000 description 11
- 239000000872 buffer Substances 0.000 description 10
- 238000007672 fourth generation sequencing Methods 0.000 description 10
- 230000003993 interaction Effects 0.000 description 10
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 9
- 108010050848 glycylleucine Proteins 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 8
- 239000002585 base Substances 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 108010003137 tyrosyltyrosine Proteins 0.000 description 8
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 7
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 7
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 7
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 7
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 7
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 7
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 7
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 7
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 7
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 7
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 7
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 7
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 7
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 7
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 7
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 7
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 7
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 7
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 7
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 7
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 7
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 7
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 7
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 7
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 7
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 7
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 7
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 7
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 7
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 7
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 7
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 7
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 7
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 7
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 7
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 7
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 7
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 7
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 7
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 7
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 7
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 7
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 7
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 7
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 7
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 7
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 7
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 7
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 7
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 7
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 7
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 7
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 7
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 7
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 7
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 7
- SLOYNOMYOAOUCX-BVSLBCMMSA-N Trp-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SLOYNOMYOAOUCX-BVSLBCMMSA-N 0.000 description 7
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 7
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 7
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 7
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 7
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 7
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 7
- 108010041407 alanylaspartic acid Proteins 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 230000008878 coupling Effects 0.000 description 7
- 238000010168 coupling process Methods 0.000 description 7
- 238000005859 coupling reaction Methods 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 7
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 7
- 108010053037 kyotorphin Proteins 0.000 description 7
- 108010000761 leucylarginine Proteins 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 108010009298 lysylglutamic acid Proteins 0.000 description 7
- 108010071207 serylmethionine Proteins 0.000 description 7
- 239000007787 solid Substances 0.000 description 7
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 7
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 6
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 6
- 238000001914 filtration Methods 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 5
- 102220539443 Nitric oxide synthase, brain_S75A_mutation Human genes 0.000 description 5
- 108091093037 Peptide nucleic acid Proteins 0.000 description 5
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 5
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 5
- 108020004682 Single-Stranded DNA Proteins 0.000 description 5
- 239000003153 chemical reaction reagent Substances 0.000 description 5
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 5
- 238000001493 electron microscopy Methods 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 5
- 235000000346 sugar Nutrition 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 239000007864 aqueous solution Substances 0.000 description 4
- 239000002800 charge carrier Substances 0.000 description 4
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 150000003904 phospholipids Chemical class 0.000 description 4
- 239000001103 potassium chloride Substances 0.000 description 4
- 235000011164 potassium chloride Nutrition 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 102000035160 transmembrane proteins Human genes 0.000 description 4
- 108091005703 transmembrane proteins Proteins 0.000 description 4
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 3
- 238000001712 DNA sequencing Methods 0.000 description 3
- 239000007995 HEPES buffer Substances 0.000 description 3
- 239000000232 Lipid Bilayer Substances 0.000 description 3
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 3
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 125000003118 aryl group Chemical group 0.000 description 3
- 230000004888 barrier function Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical class O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 102000035118 modified proteins Human genes 0.000 description 3
- 108091005573 modified proteins Proteins 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- NCMVOABPESMRCP-SHYZEUOFSA-N 2'-deoxycytosine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 NCMVOABPESMRCP-SHYZEUOFSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 102000034573 Channels Human genes 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 102220585783 Glutamate receptor ionotropic, delta-2_F76D_mutation Human genes 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerol Natural products OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 2
- 102220535112 Inhibin beta E chain_R62T_mutation Human genes 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 102220539439 Nitric oxide synthase, brain_S75G_mutation Human genes 0.000 description 2
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 2
- 102220475857 Phosphoglycerate kinase 1_D63A_mutation Human genes 0.000 description 2
- 102220559091 Potassium voltage-gated channel subfamily E member 1_K69H_mutation Human genes 0.000 description 2
- 241001216848 Pseudomonas taeanensis Species 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- 241001052560 Thallis Species 0.000 description 2
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 150000003841 chloride salts Chemical class 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- UTLVQENMPIUJSJ-QTARTGBFSA-N dT10 Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)CO)[C@@H](O)C1 UTLVQENMPIUJSJ-QTARTGBFSA-N 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000005684 electric field Effects 0.000 description 2
- 239000003792 electrolyte Substances 0.000 description 2
- 238000000635 electron micrograph Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- KWIUHFFTVRNATP-UHFFFAOYSA-N glycine betaine Chemical compound C[N+](C)(C)CC([O-])=O KWIUHFFTVRNATP-UHFFFAOYSA-N 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 238000000691 measurement method Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 239000000276 potassium ferrocyanide Substances 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 2
- 102200150035 rs121909265 Human genes 0.000 description 2
- 102220322567 rs147013097 Human genes 0.000 description 2
- 102220072405 rs191342808 Human genes 0.000 description 2
- 102200037714 rs2655655 Human genes 0.000 description 2
- 102200005931 rs375912738 Human genes 0.000 description 2
- 102220243337 rs876659508 Human genes 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000001542 size-exclusion chromatography Methods 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 229920001059 synthetic polymer Polymers 0.000 description 2
- XOGGUFAVLNCTRS-UHFFFAOYSA-N tetrapotassium;iron(2+);hexacyanide Chemical compound [K+].[K+].[K+].[K+].[Fe+2].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-] XOGGUFAVLNCTRS-UHFFFAOYSA-N 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 229920000428 triblock copolymer Polymers 0.000 description 2
- MQAYPFVXSPHGJM-UHFFFAOYSA-M trimethyl(phenyl)azanium;chloride Chemical compound [Cl-].C[N+](C)(C)C1=CC=CC=C1 MQAYPFVXSPHGJM-UHFFFAOYSA-M 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- FQERWQCDIIMLHB-UHFFFAOYSA-N 1-ethyl-3-methyl-1,2-dihydroimidazol-1-ium;chloride Chemical compound [Cl-].CC[NH+]1CN(C)C=C1 FQERWQCDIIMLHB-UHFFFAOYSA-N 0.000 description 1
- HWPZZUQOWRWFDB-UHFFFAOYSA-N 1-methylcytosine Chemical compound CN1C=CC(N)=NC1=O HWPZZUQOWRWFDB-UHFFFAOYSA-N 0.000 description 1
- PIINGYXNCHTJTF-UHFFFAOYSA-N 2-(2-azaniumylethylamino)acetate Chemical group NCCNCC(O)=O PIINGYXNCHTJTF-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- 208000035657 Abasia Diseases 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- 101710092462 Alpha-hemolysin Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- 241000701844 Bacillus virus phi29 Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 1
- 229920000858 Cyclodextrin Polymers 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- MYMOFIZGZYHOMD-UHFFFAOYSA-N Dioxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical group OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 108091093094 Glycol nucleic acid Proteins 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- 102000004310 Ion Channels Human genes 0.000 description 1
- 108090000862 Ion Channels Proteins 0.000 description 1
- 150000007649 L alpha amino acids Chemical class 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- SVJRVFPSHPGWFF-DCAQKATOSA-N Lys-Cys-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVJRVFPSHPGWFF-DCAQKATOSA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 1
- OKIZCWYLBDKLSU-UHFFFAOYSA-M N,N,N-Trimethylmethanaminium chloride Chemical compound [Cl-].C[N+](C)(C)C OKIZCWYLBDKLSU-UHFFFAOYSA-M 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108091093078 Pyrimidine dimer Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 102220577380 Ras-related protein Rab-22A_Q64L_mutation Human genes 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- 108091046915 Threose nucleic acid Proteins 0.000 description 1
- 101710183280 Topoisomerase Proteins 0.000 description 1
- 108091011223 Transmembrane protein 121 Proteins 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 229910052783 alkali metal Inorganic materials 0.000 description 1
- 229910001514 alkali metal chloride Inorganic materials 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 150000001576 beta-amino acids Chemical class 0.000 description 1
- 229960003237 betaine Drugs 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- HGCIXCUEYOPUTN-UHFFFAOYSA-N cis-cyclohexene Natural products C1CCC=CC1 HGCIXCUEYOPUTN-UHFFFAOYSA-N 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 208000030381 cutaneous melanoma Diseases 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 150000001945 cysteines Chemical class 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 230000009144 enzymatic modification Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 238000009963 fulling Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 1
- 125000003147 glycosyl group Chemical group 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 238000002847 impedance measurement Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 239000011147 inorganic material Substances 0.000 description 1
- 239000000138 intercalating agent Substances 0.000 description 1
- 239000002608 ionic liquid Substances 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 235000018977 lysine Nutrition 0.000 description 1
- 150000002669 lysines Chemical class 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000011368 organic material Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 230000007030 peptide scission Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 150000004713 phosphodiesters Chemical class 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000037332 pore function Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000002250 progressing effect Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 239000013635 pyrimidine dimer Substances 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000013341 scale-up Methods 0.000 description 1
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 201000003708 skin melanoma Diseases 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000005641 tunneling Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/21—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Pseudomonadaceae (F)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Abstract
The invention belongs to the technical field of characterization of target analyte characteristics, and particularly provides a mutant of a porin monomer, a protein pore comprising the mutant, and application of the mutant to detection of a target analyte, wherein the amino acid of the mutant of the porin monomer comprises a sequence shown in SEQ ID NO. 1 or a sequence at least 99%, 98%, 97%, 96%, 95%, 90%, 80%, 70%, 60% or 50% identical thereto, and the amino acid of the mutant of the porin monomer comprises a mutation at one or more positions corresponding to T71, S75 and F76 of SEQ ID NO. 1.
Description
Technical Field
The invention belongs to the technical field of characterization of target analyte characteristics, and particularly relates to a mutant of a pore protein monomer, a protein pore containing the mutant and application of the mutant in detection of a target analyte.
Background
With the research on the structure and sequence of nucleic acid, the nucleic acid sequencing technology is continuously developed, becomes the core field of life science research, and plays a great role in promoting the technical development in the fields of biology, chemistry, electricity, life science, medicine and the like. The research of a novel rapid, accurate, low-cost, high-precision and high-throughput nucleic acid sequencing technology by using the nanopore is one of the hot spots of the subsequent human genome project.
Nanopore (Nanopore) sequencing technology, also known as fourth generation sequencing technology, is a gene sequencing technology that takes a single-stranded nucleic acid molecule as a sequencing unit, utilizes a Nanopore capable of providing an ion current channel, allows the single-stranded nucleic acid molecule to pass through the Nanopore under electrophoretic drive, reduces the current of the Nanopore when nucleic acid passes through the Nanopore, and reads sequence information in real time for different generated signals.
The nanopore sequencing is mainly characterized in that: the reading length is long, the accuracy rate is high, and most error regions occur in the homopolymeric oligonucleotide region. Nanopore sequencing can not only realize natural DNA and RNA sequencing, but also directly acquire base modification information of DNA and RNA, for example, methylated cytosine can be directly read without need of bisulfite (bisufite) treatment on genome in advance like a second generation sequencing method, which greatly promotes the direct study of epigenetic correlation phenomenon at genome level. As a novel platform, the nanopore detection technology has the advantages of low cost, high flux, no mark and the like.
Nanopore analysis techniques originated from the invention of the Coulter counter and the recording technique of single-channel currents. In 1976, Neher and Sakamann who obtained Nobel prize in physiology and medicine use the patch clamp technology to measure membrane potential and study membrane protein and ion channels, thus promoting the practical application process of nanopore sequencing technology. In 1996, Kasiaanowicz et al proposed a new idea of DNA sequencing using α -hemolysin, which is a landmark marker for single molecule sequencing of biological nanopores. Subsequently, the research on biological nanopores such as MspA porins and phage Phi29 connectors is reported, and the research on nanopore analysis technology is enriched. Li et al in 2001 opened a new era of solid-state nanopore research. Solid state nanopore sequencing has been slow progressing, limited by advances in the semiconductor and materials industries.
One of the key points of nanopore sequencing technology lies in the design of a special biological nanopore, a reading head structure formed in a constriction zone in the nanopore can cause the blockage of channel current when a single-stranded nucleic acid (such as ssDNA) molecule passes through the nanopore, so that the current intensity flowing through the nanopore is influenced transiently (the amplitude of current change influenced by each base is different), and finally, high-sensitivity electronic equipment detects the changes so as to identify the passed base. Currently, protein pores are used as nanopores for sequencing, and the pore proteins mainly use escherichia coli as a source.
At present, the nanopore protein is single, and a substitute nanopore protein needs to be developed to realize a nanopore sequencing technology. The porin is also closely related to sequencing precision, and the porin also relates to mode change of interaction with the rate control protein, so that the stability of an interaction interface of the porin and the rate control protein is further optimized, and the consistency and the stability of sequencing data are positively influenced. The accuracy of nanopore sequencing technology is also in need of improvement, and therefore, there is a need to develop improved nanopore proteins to further improve the resolution of nanopore sequencing.
Disclosure of Invention
To solve the above problems, it is an object of embodiments of the present invention to provide an alternative mutant of a porin monomer, a protein pore comprising the same, and uses thereof.
In a first aspect, embodiments of the invention provide a mutant of a porin monomer, wherein the amino acid of the mutant of a porin monomer comprises or consists of the sequence set forth in SEQ ID No. 1 or a sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 80%, 70%, 60% or 50% identity thereto, and the amino acid of the mutant of a porin monomer comprises a mutation at one or more of the positions T71, S75, F76 corresponding to SEQ ID No. 1;
one or more of T71, S75, F76 are specifically T71, S75, F76, T71 and S75, S75 and F76, T71 and S76, or T71 and S75 and F76.
Preferably, the amino acids of the mutant of porin monomers comprise mutations at one or more positions corresponding to 62-175, 62-104, 68-175, 64-79, 71-76, or 69-76 of SEQ ID NO: 1.
Preferably, the amino acids of the mutant of porin monomers comprise:
(1) (ii) having insertions, deletions and/or substitutions of amino acids at one or more positions corresponding to K69, P70, T71, P72, A73, S74, S75, and F76 of SEQ ID NO 1; (2) (ii) has an amino acid insertion, deletion and/or substitution at one or more positions corresponding to Q64, T65, G66, Q67, Y68, K69, P70, T71, P72, A73, S74, S75, F76, S77, T78, and S79 of SEQ ID NO 1; (3) (ii) has insertions, deletions and/or substitutions of amino acids at one or more positions corresponding to R62, D63, Y68, K69, T71, S75, F76, and E104 of SEQ ID NO 1; or (4) having an insertion, deletion and/or substitution of an amino acid at one or more positions corresponding to Y68, K69, P70, T71, P72, A73, S74, S75, F76, E171 and D175 of SEQ ID NO: 1.
In one embodiment, the amino acid mutation of the mutant of porin monomers is selected from the group consisting of:
(a) amino acid KPTPASSF corresponding to 69-76 of SEQ ID NO. 1 is mutated to M1M2M3M4M5M6M7M8Wherein M is10 to 3 selected from R, K, H; m 20 to 1 selected from P; m 30 to 10 selected from S, G, C, U, T, M, A, V, L, I; m 40 to 1 selected from P; m 50 to 5 selected from A, G, V, L, I; m 60 to 5 selected from S, C, U, T, M; m 70 to 10 selected from A, T, G, V, L, I, S, C, U, M;M 80 to 7 selected from Q, D, E, N, K, H, R;
(b) amino acids QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 were mutated to M9M10M11M12M1 3M14M15M16M17M18M19M20M21M22M23M24Wherein M is90 to 5 selected from L, G, A, V, I; m 100 to 5 selected from T, S, C, U, M; m 110 to 5 selected from G, A, V, L, I; m 120 to 4 selected from Q, D, E, N; m 130 to 3 selected from Y, F, W; m 140 to 3 selected from R, H, K; m 150 to 1 selected from P; m 160 to 5 selected from S, C, U, T, M; m 170 to 1 selected from P; m 180 to 5 selected from A, G, V, L, I; m 190 to 5 selected from S, C, U, T, M; m 200 to 5 selected from A, G, V, L, I; m 210 to 9 selected from N, D, E, Q, L, G, A, V, I; m 220 to 5 selected from S, C, U, T, M; m 230 to 5 selected from T, S, C, U, M; m 240 to 5 selected from A, G, V, L, I;
(c) the mutation corresponding to the 62 th position of SEQ ID NO. 1 is 0 to 5 of S, C, U, T, M; the mutation at position 63 is 0 to 5 of V, G, A, L, I; the mutation at position 68 is 0 to 2 of F, W; the mutation at position 69 is 0 to 2 of R, H; the mutation at position 71 is 0 to 4 of S, C, U, M; the mutation at position 75 is 0 to 5 of A, G, V, L, I; the mutation at position 76 is 0 to 4 of Q, D, E, N; the mutation at position 104 is 0 to 5 of V, G, A, L, I; and
(d) amino acids YKPTPASSF corresponding to positions 68-76 of SEQ ID NO. 1 were mutated to M25M26M27M28M29M30M31M32M33And the 171 mutation is 0 to 3 of N, D, Q; mutation at position 175 is 0 to 3 of N, E, Q, wherein M 250 to 3 selected from F, Y, W; m26Selected from R, H, K0 to 3 of (a); m 270 to 1 selected from P; m 280 to 5 selected from S, C, U, T, M; m 290 to 1 selected from P; m 300 to 5 selected from A, G, V, L, I; m 310 to 5 selected from S, C, U, T, M; m 320 to 5 selected from A, G, V, L, I; m 330 to 4 selected from Q, D, E, N.
In one embodiment, the amino acid mutation of the mutant of porin monomers is selected from the group consisting of:
(a) amino acid KPTPASSF corresponding to 69-76 of SEQ ID NO. 1 is mutated to M1M2M3M4M5M6M7M8Wherein M is1Selected from R, K or H; m2Is selected from P; m3Selected from S, G, C, U, T, M, A, V, L or I; m4Is selected from P; m5Selected from A, G, V, L or I; m6Selected from S, C, U, T or M; m7Selected from A, T, G, V, L, I, S, C, U or M; m8Selected from Q, D, E, N, K, H or R;
(b) amino acids QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 were mutated to M9M10M11M12M1 3M14M15M16M17M18M19M20M21M22M23M24Wherein M is9Selected from L, G, A, V or I; m10Selected from T, S, C, U or M; m11Selected from G, A, V, L or I; m12Selected from Q, D, E or N; m13Selected from Y, F or W; m14Selected from R, H or K; m15Is selected from P; m16Selected from S, C, U, T or M; m17Is selected from P; m18Selected from A, G, V, L or I; m19Selected from S, C, U, T or M; m20Selected from A, G, V, L or I; m21Selected from N, D, E, Q, L, G, A, V or I; m22Selected from S, C, U, T or M; m23Selected from T, S, C, U or M; m24Selected from A, G, V, L or I;
(c) r62 corresponding to SEQ ID NO. 1 is mutated to R62S, R62C, R62U, R62T or R62M; d63 is mutated to D63V, D63G, D63A, D63L or D63I; y68 to Y68F or Y68W; k69 is mutated to K69R or K69H; the T71 mutation is T71S, T71C, T71U or T71M; the S75 mutation is S75A, S75G, S75V, S75L or S75I; f76 to F76Q, F76D, F76E, or F76N; the E104 mutation is E104V, E104G, E104A, E104L, or E104I; and
(d) amino acids YKPTPASSF corresponding to positions 68-76 of SEQ ID NO. 1 were mutated to M25M26M27M28M29M30M31M32M33And the E171 mutation is E171N, E171D, or E171Q; the D175 mutation is D175N, D175E or D175Q, wherein M25Selected from F, Y or W; m26Selected from R, H or K; m27Is selected from P; m28Selected from S, C, U, T or M; m29Is selected from P; m30Selected from A, G, V, L or I; m31Selected from S, C, U, T or M; m32Selected from A, G, V, L or I; m33Selected from Q, D, E, or N.
In one embodiment, the amino acid mutation of the mutant of porin monomers is selected from the group consisting of:
(a) amino acid KPTPASSF corresponding to 69-76 bits of SEQ ID NO. 1 is mutated into RPSPASAQ;
(b) amino acid KPTPASSF corresponding to 69-76 bits of SEQ ID NO. 1 is mutated into KPGPASTK;
(c) amino acid QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 was mutated to LTGQYRPSPASANSTA;
(d) amino acid QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 was mutated to LTGQYRPSPASALSTA;
(e) R62S, D63V, Y68F, K69R, T71S, S75A, F76Q, and E104V corresponding to SEQ ID NO 1; and
(f) amino acids YKPTPASSF corresponding to positions 68-76 of SEQ ID NO. 1 were mutated to FRPSPASAQ and to E171N and D175N of SEQ ID NO. 1.
In a second aspect, embodiments of the invention provide a protein pore comprising at least one mutant of a porin monomer.
In a third aspect, embodiments of the present invention provide a complex for characterising a target analyte, characterised in that: the protein pore and the rate-controlling protein combined with the protein pore.
In a fourth aspect, embodiments of the invention provide nucleic acids encoding a mutant, protein pore, or complex of a porin monomer.
In a fifth aspect, embodiments of the invention provide vectors or genetically engineered host cells comprising the nucleic acids.
In a sixth aspect, embodiments of the present invention provide the use of a mutant of a porin monomer, a protein well, complex, nucleic acid, vector or host cell thereof, in detecting the presence, absence or one or more characteristics of a target analyte, or in the manufacture of a product for detecting the presence, absence or one or more characteristics of a target analyte.
In a seventh aspect, the embodiments of the present invention provide a method for producing a protein pore or a polypeptide thereof, comprising transforming the host cell with the vector, and inducing the host cell to express the protein pore or the polypeptide thereof.
In an eighth aspect, embodiments of the present invention provide a method for determining the presence, absence or one or more characteristics of a target analyte, comprising:
a. contacting a target analyte with a protein pore, complex, or protein pore in a complex such that the target analyte moves relative to the protein pore; and
b. obtaining one or more measurements while the target analyte is moving relative to the protein pore, thereby determining the presence, absence or one or more characteristics of the target analyte.
In one embodiment, the method comprises: the target analyte interacts with the protein pores present in the membrane such that the target analyte moves relative to the protein pores.
In one embodiment, the target analyte is a nucleic acid molecule.
In one embodiment, a method for determining the presence, absence or one or more characteristics of a target analyte comprises coupling the target analyte to a membrane; and the target analyte interacts with the protein pores present in the membrane such that the target analyte moves relative to the protein pores.
In a ninth aspect, embodiments of the present invention provide a kit for determining the presence, absence or one or more characteristics of a target analyte, comprising a mutant of said porin monomer, said protein pore, said complex, said nucleic acid, or said vector or host, and components of said membrane.
In a tenth aspect, embodiments of the present invention provide a device for determining the presence, absence or one or more characteristics of a target analyte, comprising said protein pore or said complex, and said membrane.
In one embodiment, the target analyte comprises a polysaccharide, a metal ion, an inorganic salt, a polymer, an amino acid, a peptide, a protein, a nucleotide, an oligonucleotide, a polynucleotide, a dye, a drug, a diagnostic agent, an explosive, or an environmental contaminant;
preferably, the target analyte comprises a polynucleotide,
more preferably, the polynucleotide comprises DNA or RNA; and/or, the one or more characteristics are selected from (i) the length of the polynucleotide; (ii) identity of the polynucleotides; (iii) the sequence of the polynucleotide; (iv) (iv) the secondary structure of the polynucleotide and (v) whether the polynucleotide is modified; and/or, the rate controlling protein in the complex comprises a polynucleotide binding protein.
Drawings
The drawings described are only schematic and are non-limiting.
Fig. 1 illustrates the basic working principle of a nanopore according to one embodiment.
Figure 2 shows a schematic diagram of DNA sequencing according to one embodiment.
FIG. 3 shows the corresponding pore blocking signal when a nucleotide passes through a protein pore according to one embodiment.
Fig. 4A, 4B and 4C illustrate a wild-type protein pore channel surface structure and a bandmap model according to one embodiment. Fig. 4A is a side view of the surface structure model, fig. 4B is a top view of the ribbon structure model, and fig. 4C is the ribbon structure model.
Fig. 5A and 5B show amino acid model diagrams of a mutant pore 1 according to one embodiment, with fig. 5A being a top view and fig. 5B being a side view.
Fig. 6 shows the size information of the respective portions of the relief hole 1 according to an embodiment.
Fig. 7 shows a monomeric amino acid model diagram of porin mutant 1 according to an embodiment, (b) is (a) the core amino acid composition of the constriction zone (eye loop) shown enlarged.
Fig. 8 shows negative staining electron microscopy results for porin mutant 1 according to one embodiment.
Fig. 9A shows a cryo-electron micrograph of porin mutant 1 according to an embodiment fig. 9B shows the 2D classification results.
Fig. 10A and 10B show local refinement fourier shell correlation number (FSC) results for porin mutant 1, according to an embodiment, a: rlin FSC unshielded map; b: an rIn FSC phase random mask map; c: an rIn FSC correction map; d: rlin FSC mask map.
FIG. 11 shows a porin mutant 1 cryoelectron microscope three-dimensional reconstruction in accordance with an embodimentElectron density map of resolution.
FIG. 13 shows the structure of the DNA construct BS7-4C3-SE1 according to one embodiment.
FIG. 14 shows the structure of the DNA construct BS7-4C3-PLT according to one embodiment.
FIG. 15A shows the opening current and its gating characteristics at a voltage of 180mV for a mutant opening 1 according to one embodiment.
FIG. 15B shows a nucleic acid via scenario with a mutant well 1 at +180mV voltage, according to one embodiment.
FIGS. 16A and 16B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of DNA construct BS7-4C3-PLT through mutation well 1, according to one embodiment.
FIG. 17 is an enlarged view of a single signal region of the embodiment of FIG. 16B.
Fig. 18A shows the opening current and its gating characteristics at a voltage of ± 180mV for a mutant opening 2 according to an embodiment.
FIG. 18B shows the nucleic acid via situation for mutant well 2 at +180mV voltage, according to one embodiment.
FIGS. 19A and 19B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of DNA construct BS7-4C3-PLT through mutant pore 2, according to one embodiment.
Fig. 20 is an enlarged view of a region of a single signal in the embodiment of fig. 19A and B.
Fig. 21A shows the opening current and its gating characteristics at a voltage of ± 180mV for a mutant opening 3 according to an embodiment.
FIG. 21B shows a nucleic acid via scenario with a mutant pore 3 at +180mV voltage, according to one embodiment.
FIGS. 22A and 22B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of DNA construct BS7-4C3-PLT through mutation pore 3, according to one embodiment.
FIG. 23 is an enlarged view of a single signal in the embodiment of FIG. 22A.
Fig. 24A shows the opening current and its gating characteristics at a voltage of ± 180mV for the mutant opening 4 according to an embodiment.
FIG. 24B shows a nucleic acid via scenario with a mutant pore 4 at +180mV voltage, according to one embodiment.
FIGS. 25A and 25B show exemplary current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of DNA construct BS7-4C3-PLT through mutation pore 4, according to one embodiment.
Fig. 26 is an enlarged view of a region of a single signal in the embodiment of fig. 25A and 25B.
FIG. 27 shows the opening current and its gating characteristics at a voltage of 180mV for the mutant opening 5 in accordance with one embodiment.
FIG. 28 shows an example current trajectory when helicase Mph-MP1-E105C/A362C controls translocation of DNA construct BS7-4C3-PLT through mutation pore 5, according to one embodiment.
FIG. 29 is an enlarged area display of a single signal of the embodiment of FIG. 28.
Fig. 30A shows the opening current and its gating characteristics at a voltage of ± 180mV for the mutant opening 6 according to an embodiment.
FIG. 30B shows a nucleic acid via scenario with a mutant well 6 at +180mV voltage, according to one embodiment.
FIGS. 31A and 31B show exemplary current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of DNA construct BS7-4C3-PLT through mutation pore 6, according to one embodiment.
FIG. 32 is an enlarged view of a single signal region of the embodiment of FIG. 31A.
FIG. 33 shows the result of SDS-PAGE electrophoretic detection of mutant 1 according to one embodiment.
Figure 34 shows a mutant 1 protein size exclusion chromatogram according to one embodiment.
Detailed Description
It is understood that the unused applications of the disclosed products and methods may be adapted according to the specific needs of the art. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments of the invention only, and is not intended to be limiting.
Also, as used in this specification and the claims, the singular forms "a", "an", and "the" include plural referents unless the context clearly dictates otherwise. For example, reference to "a nucleotide" includes two or more nucleotides and reference to "a helicase" includes two or more helicases.
As used herein, the term "comprising" means that any of the listed elements must be included, and that other elements may also optionally be included. "consisting of" means not including all the elements not listed. Embodiments defined by each of these terms are within the scope of the invention.
As used herein, a "nucleotide sequence", "DNA sequence" or "nucleic acid molecule" refers to a polymeric form of nucleotides of any length (ribonucleotides or deoxyribonucleotides). The term refers only to the primary structure of the molecule. Thus, the term includes double-and single-stranded DNA and RNA.
The term "nucleic acid" as used herein refers to a single-or double-stranded covalently linked sequence of nucleotides in which the 3 'and 5' ends of each nucleotide are linked by a phosphodiester linkage. A nucleotide may consist of a deoxyribonucleotide base or a ribonucleotide base. Nucleic acids may include DNA and RNA, and may be synthetically prepared in vitro or isolated from natural sources. Nucleic acids may further include modified DNA or RNA, e.g., methylated DNA or RNA, or RNA that has been post-translationally modified, e.g., 5 '-capping with 7-methylguanosine, 3' -end processing, e.g., cleavage and polyadenylation, and splicing. Nucleic acids may also include synthetic nucleic acids (XNA), such as Hexitol Nucleic Acids (HNA), cyclohexene nucleic acids (CeNA), Threose Nucleic Acids (TNA), Glycerol Nucleic Acids (GNA), Locked Nucleic Acids (LNA) and Peptide Nucleic Acids (PNA). The size of a nucleic acid (or polynucleotide) is typically expressed in terms of the number of base pairs (bp) of a double-stranded polynucleotide, or in the case of a single-stranded polynucleotide, the number of nucleotides (nt). 1 kilobase or nt equals one kilobase pair (kb). Polynucleotides less than about 40 nucleotides in length are commonly referred to as "oligonucleotides" and may comprise primers for use in DNA manipulation, e.g., by Polymerase Chain Reaction (PCR).
Polynucleotides, such as nucleic acids, are macromolecules comprising two or more nucleotides. The polynucleotide or nucleic acid may comprise any combination of any nucleotides. The nucleotides may be naturally occurring or synthetic. One or more nucleotides in the polynucleotide may be oxidized or methylated. One or more nucleotides in the polynucleotide may be damaged. For example, the polynucleotide may comprise a pyrimidine dimer. This dimer is often associated with damage caused by ultraviolet light and is the major cause of cutaneous melanoma. One or more nucleotides in the polynucleotide may be modified, for example with a conventional label or tag. The polynucleotide may comprise one or more nucleotides that are abasic (i.e., lack nucleobases), or lack nucleobases and sugars (i.e., are C3).
The nucleotides in the polynucleotide may be linked to each other in any manner. The nucleotides are typically linked by their sugar and phosphate groups, as in nucleic acids. The nucleotides may be linked by their nucleobases, as in the guanine dimers.
The polynucleotide may be single-stranded or double-stranded. At least a portion of the polynucleotide is preferably double stranded. The polynucleotide may be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). The polynucleotide may comprise an RNA strand that is hybridized to a DNA strand. The polynucleotide may be any synthetic nucleic acid known in the art, such as Peptide Nucleic Acid (PNA), Glycerol Nucleic Acid (GNA), Threose Nucleic Acid (TNA), Locked Nucleic Acid (LNA) or other synthetic polymers having nucleotide side chains. The PNA backbone is composed of repeating N- (2-aminoethyl) -glycine units linked by peptide bonds. The GNA backbone is composed of repeating ethylene glycol units linked by phosphodiester bonds. The TNA skeleton is composed of resuscitated glycosyl connected together through phosphodiester bonds. LNAs are formed from the ribonucleic acids described above, with an additional bridging structure connecting the 2 'oxygen and the 4' carbon in the ribose moiety. The Bridged Nucleic Acid (BNA) is a modified RNA nucleotide. They may also be referred to as restricted or inaccessible RNA13BNA monomers that may contain a 5-, 6-or even 7-membered bridging structure and have an "immobilized" C3 '-internal sugar folding structure (C3' -endo sugar tucking). The bridging structure is synthetically introduced into the 2 ', 4' -position of the ribose to produce the 2 ', 4' -BNA monomer.
The polynucleotide is most preferably ribonucleic acid (RNA) or deoxyribonucleic acid (DNA). The polynucleotide may be of any length. For example, the polynucleotide may be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400, or at least 500 nucleotides or nucleotide pairs in length. The polynucleotide may be 1000 or more nucleotides or nucleotide pairs, 5000 or more nucleotides or nucleotide pairs or 100000 or more nucleotides or nucleotide pairs in length.
Any number of polynucleotides may be studied. For example, the methods of the embodiments can involve characterizing 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 50, 100, or more polynucleotides. If two or more polynucleotides are characterized, they may be the case for different polynucleotides or the same polynucleotide.
Polynucleotides may be naturally occurring or synthetic. For example, the method can be used to verify the sequence of the prepared oligonucleotides. The method is typically performed in vitro.
In the context of the present disclosure, the term "amino acid" is used in its broadest sense and is meant to include amine (NH) -containing compounds2) And a Carboxyl (COOH) functional group and a side chain (e.g., R group) specific to each amino acid. In some embodiments, an amino acid refers to a naturally occurring L α -amino acid or residue. The commonly used single and three letter abbreviations for naturally occurring amino acids are used herein: a ═ Ala; c ═ Cys; d ═ Asp; e is Glu; f ═ Phe; g ═ Gly; h ═ His; i ═ Ile; k ═ Lys; l ═ Leu; m is Met; n ═ Asn; p ═ Pro; q ═ Gln; r ═ Arg; s is Ser; t ═ Thr; v is Val; w ═ Trp; and Y-Tyr (leining e r, a.l. (1975) BioChemis try, 2 nd edition, pages 71-92, Worth Publishers, New York). The generic term "amino acid" also includes D-amino acids, retro-inverso amino acids, and chemically modified amino acids (such as amino acid analogs), naturally occurring amino acids that are not normally incorporated into proteins (such as norleucine), and chemically synthesized compounds (such as β -amino acids) that have properties known in the art to be characteristic of amino acids. For example, included within the definition of amino acid are analogs or mimetics of phenylalanine or proline that allow the same conformational restriction of a peptide compound as a native Phe or Pro. Such analogs and mimetics are referred to herein as "functional equivalents" of the corresponding amino acids. Roberts and Vellaccio, The Peptides: Analysis, Synthesis, Biology, Gross and Meiehofer editions, Vol.5, page 341, Academic Press, Inc., N.Y.1983, which are incorporated herein by reference, list additional examples of amino acids.
The terms "protein," "polypeptide," and "peptide" are further used interchangeably herein to refer to polymers of amino acid residues as well as variants and synthetic analogs of amino acid residues. Thus, these terms apply to amino acid polymers in which one or more amino acid residues is a synthetic non-naturally occurring amino acid, such as a chemical analog of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The polypeptide may also undergo maturation or post-translational modification processes, which may include, but are not limited to: glycosylation, proteolytic cleavage, lipidation, signal peptide cleavage, propeptide cleavage, phosphorylation, etc.
"homologues" of a protein encompass peptides, oligopeptides, polypeptides, proteins and enzymes having amino acid substitutions, deletions and/or insertions relative to the unmodified or wild-type protein in question and having similar biological and functional activity as the unmodified protein from which they are derived. As used herein, the term "amino acid identity" refers to the degree to which the sequences are identical on an amino acid-amino acid basis over a comparison window. Thus, the "percent sequence identity" is calculated by: the two optimally aligned sequences are compared over a comparison window, the number of positions in the two sequences at which the same amino acid residue occurs (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, gin, Cys, and Met) is determined to yield the number of matched positions, the number of matched positions is divided by the total number of positions in the comparison window (i.e., the window size), and the result is multiplied by 100 to yield the percentage of sequence identity.
Sequence identity may also be a fragment or portion of a full-length polynucleotide or polypeptide. Thus, a sequence may have only 50% overall sequence identity to a full-length reference sequence, but the sequence of a particular region, domain or subunit may have 80%, 90% or up to 99% sequence identity to a reference sequence.
The term "wild-type" refers to a gene or gene product isolated from a naturally occurring source. The wild-type gene is the gene most commonly observed in a population and is therefore arbitrarily designed as the "normal" or "wild-type" form of the gene. Conversely, the terms "modified," "mutation," or "variant" refer to a gene or gene product that exhibits a sequence modification (e.g., substitution, truncation, or insertion), post-translational modification, and/or functional property (e.g., altered characteristics) as compared to the wild-type gene or gene product. Note that naturally occurring mutants can be isolated; these mutants are identified by the fact that they have altered characteristics compared to the wild-type gene or gene product. Methods for introducing or substituting naturally occurring amino acids are well known in the art. For example, methionine (M) can be replaced with arginine (R) by replacing the codon for methionine (ATG) with the codon for arginine (CGT) at the relevant position in the polynucleotide encoding the mutated monomer. Methods of introducing or substituting non-naturally occurring amino acids are also well known in the art. For example, non-naturally occurring amino acids can be introduced by including synthetic aminoacyl-trnas in the IVTT system for expressing mutant monomers. Alternatively, non-naturally occurring amino acids may be introduced by expressing a mutant monomer in Pseudomonas taeniae (MS-3), which is auxotrophic for particular amino acids in the presence of synthetic (i.e., non-naturally occurring) analogs of those particular amino acids. If the mutated monomers are generated using partial peptide synthesis, they may also be generated by naked ligation. Conservative substitutions replace amino acids with other amino acids having similar chemical structures, similar chemical properties, or similar side chain volumes. The amino acids introduced may have a polarity, hydrophilicity, hydrophobicity, basicity, acidity, neutrality or charge similar to the amino acids they replace. Alternatively, a conservative substitution may introduce another aromatic or aliphatic amino acid in place of a pre-existing aromatic or aliphatic amino acid. Conservative amino acid changes are well known in the art and may be selected based on the properties of the 20 major amino acids defined in table 1 below. In the case of amino acids with similar polarity, this can also be determined with reference to the hydrophilicity scale for the amino acid side chains in table 2.
TABLE 1 chemical Properties of amino acids
TABLE 2 hydrophilicity Scale
Side chains | Hydrophilicity |
Ile,I | 4.5 |
Val,V | 4.2 |
Leu,L | 3.8 |
Phe,F | 2.8 |
Cys,C | 2.5 |
Met,M | 1.9 |
Ala,A | 1.8 |
Gly,G | -0.4 |
Thr,T | -0.7 |
Ser,S | -0.8 |
Trp,W | -0.9 |
Tyr,Y | -1.3 |
Pro,P | -1.6 |
His,H | -3.2 |
Glu,E | -3.5 |
Gln,Q | -3.5 |
Asp,D | -3.5 |
Asn,N | -3.5 |
Lys,K | -3.9 |
Arg,R | -4.5 |
It is well known that conservative substitutions of amino acids with similar properties to each other, as shown in Table 3, do not generally affect the activity of the peptide sequence.
TABLE 3 conservative amino acid substitutions
The mutated or modified protein, monomer or peptide may also be chemically modified at any site in any manner. The mutated or modified monomer or peptide is preferably chemically modified by attachment of the molecule to one or more cysteines (cysteine linkage), attachment of the molecule to one or more lysines, attachment of the molecule to one or more unnatural amino acids, enzymatic modification of an epitope or modification of the terminus. Suitable methods for making such modifications are well known in the art. Mutants of modified proteins, monomers or peptides may be chemically modified by attachment of any molecule. For example, mutants of modified proteins, monomers or peptides may be chemically modified by attachment of dyes or fluorophores. In some embodiments, the mutant or modified monomer or peptide is chemically modified with a molecular adaptor that facilitates interaction between the pore comprising the monomer or peptide and the target nucleotide or target polynucleotide sequence. The molecular adaptor is preferably a cyclic molecule, a cyclodextrin, a substance capable of hybridizing, a DNA binding agent or intercalator, a peptide or peptide analogue, a synthetic polymer, an aromatic planar molecule, a positively charged small molecule or a small molecule capable of hydrogen bonding.
The presence of the adapter improves the host-guest chemistry of the pore and nucleotide or polynucleotide sequence, thereby improving the sequencing capability of the pore formed by the mutated monomer. The principles of host-guest chemistry are well known in the art. The adaptors have an effect on the physical or chemical properties of the pore, which improves the interaction of the pore with the nucleotide or polynucleotide sequence. The adapter may alter the charge of the barrel or channel of the pore, or specifically interact or bind with a nucleotide or polynucleotide sequence, thereby facilitating its interaction with the pore.
A "protein pore" is a transmembrane protein structure that defines a channel or pore that allows molecules and ions to translocate from one side of the membrane to the other. The translocation of ionic species through the pore may be driven by a potential difference applied to either side of the pore. A "nanopore" is a protein pore in which the most part of the channel through which a molecule or ion passesSmall diameter is in the nanometer range (10)-9Rice). In some embodiments, the protein pore may be a transmembrane protein pore. The transmembrane protein structure of a protein pore may be monomeric or oligomeric in nature. Typically, a pore comprises a plurality of polypeptide subunits arranged around a central axis, thereby forming a protein-lined channel extending substantially perpendicular to the membrane in which the nanopore resides. The number of polypeptide subunits is not limited. Typically, the number of subunits is from 5 to 30, suitably from 6 to 10. Alternatively, the number of subunits is not defined as in the case of perfringolysin (perfringolysin) or related large membrane pores. The portion of the protein subunit within the nanopore that forms the protein lining channel typically comprises a secondary structural motif that may include one or more transmembrane β -barrel and/or α -helical portions.
In one embodiment, the protein pore comprises one or more pore protein monomers. Each porin monomer may be from pseudomonas taeniae. In one embodiment, the protein pore comprises a mutant of one or more pore protein monomers (i.e., a monomer mutated in one or more pore proteins).
In one embodiment, the porin is from a wild-type protein, wild-type homolog, or mutant thereof of kingdom biologies. The mutant may be a modified porin or a porin mutant. Modifications in the mutants include, but are not limited to, any one or more of the modifications disclosed herein or combinations of such modifications. In one embodiment, the wild-type protein of kingdom biologics is a protein from Pseudomonas tapestris.
In one embodiment, porin homologue refers to a polypeptide having at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50% complete sequence identity to the protein set forth in SEQ ID NO. 1.
In one embodiment, a porin homologue refers to a polynucleotide having at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50% complete sequence identity to the polynucleotide encoding the protein set forth in SEQ ID NO. 2. The polynucleotide sequence may comprise a sequence that differs from SEQ ID NO. 2 based on the degeneracy of the genetic code.
Polynucleotide sequences can be derived and replicated using methods standard in the art. Chromosomal DNA encoding the wild-type porin may be extracted from pore-producing organisms such as Pseudomonas taeniae. The gene encoding the pore subunit can be amplified using PCR including specific primers. The amplified sequence may then be subjected to site-directed mutagenesis. Suitable methods for site-directed mutagenesis are known in the art and include, for example, combinatorial chain reactions. The constructed polynucleotides encoding the embodiments can be prepared using techniques well known in the art, such as those described in Sambrook, J.and Russell, D. (2001). Molecular Cloning A Laboratory Manual,3rd edition.Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y..
The resulting polynucleotide sequence may then be integrated into a recombinant replicable vector, such as a cloning vector. The vectors can be used to replicate the polynucleotides in compatible host cells. Thus a polynucleotide sequence may be prepared by introducing the polynucleotide into a replicable vector, introducing the vector into a compatible host cell, and growing the host cell under conditions which cause the vector to replicate. The vector may be recovered from the host cell.
Fundamental working principle of nanopore or protein pore
In one embodiment, within the electrolyte filled chamber 100, an insulating film 102 with nanoscale pores divides the chamber into 2 cells, as shown in fig. 1, and when a voltage is applied to the electrolyte chamber, ions or other small molecule species pass through the pores under the force of an electric field, creating a stable detectable ionic current. Different types of biomolecules can be detected by grasping the size and surface characteristics of the nanopore, the applied voltage and the solution conditions.
Because the four bases of adenine (A), guanine (G), cytosine (C) and thymine (T) which form DNA have different molecular structures and sizes, when single-stranded DNA (ssDNA) passes through a nano-scale pore under the drive of a speed-control enzyme and an electric field, the change amplitude of current caused by the difference of chemical properties of different bases when the single-stranded DNA passes through a nano-scale pore or a protein pore is different, and thus the sequence information of the detected nucleic acid such as DNA is obtained.
FIG. 2 shows a schematic 200 of DNA sequencing. As shown in fig. 2, in a typical nanopore/protepore sequencing experiment, the nanopore is the only channel for ions to pass through on both sides of the phospholipid membrane. Rate controlling proteins, such as polynucleotide binding proteins, act as motor proteins for nucleic acid molecules, such as DNA, pulling DNA strands to pass through the nanopore/protein pore in single nucleotide steps. Whenever one nucleotide passes through the nanopore/protein pore, the corresponding pore blocking signal is recorded (fig. 3). By analyzing the current signals associated with these sequences by a corresponding algorithm, sequence information of nucleic acid molecules such as DNA can be deduced back.
In the examples, porins are screened from different species in nature (mainly bacteria and archaea) by bioinformatics means and evolutionary points of view. In one embodiment, the porin is from any organism, preferably from Pseudomonas taeniae. By sequence analysis, porins have a complete functional domain. And (3) utilizing a structural biology means to predict and analyze a porin 3D structural model, and selecting a channel protein with a proper reading head architecture form. And then, modifying, testing and optimizing the candidate channel protein (or porin) by means of genetic engineering, protein directed evolution, computer-aided protein design and the like, and obtaining a plurality of homologous protein mutants, preferably six homologous protein mutants (different homologous protein frameworks) with different signal characteristics and signal distribution modes through several iterations.
The porins of the examples are applicable to fourth generation sequencing technologies. In one embodiment, the porin is a nanopore. In one embodiment, the porin may be applied to solid-state wells for sequencing.
In one embodiment, a new protein backbone is employed, forming a new constriction zone (read head zone) structure, thereby providing a completely new mode of action during sequencing. The porins of the examples have good skip-edge distribution and efficiency of recombination with phospholipid membranes.
In one embodiment, genetic mutation of a wild-type porin monomer modifies a mutant that forms a porin monomer. In one embodiment, the amino acids of the mutant of the porin monomer comprise the sequence set forth in SEQ ID No. 1 or a sequence having at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, or 50% identity thereto, and the amino acids of the mutant of the porin monomer have mutations corresponding to one or more of positions 62-79, 104, 170, and 175 of SEQ ID No. 1.
In one embodiment, the mutation comprises an insertion, deletion and/or substitution of an amino acid. In one embodiment, the mutations at one or more of positions 62-79, 104, 170 and 175 of SEQ ID NO. 1 are amino acid insertions, deletions and/or substitutions at one or more of positions 62-79, 104, 170 and 175 of SEQ ID NO. 1.
In one embodiment, the mutant of the porin monomer has an amino acid corresponding to one or more of positions (1)69-76, (2)64-79, (3)62, 63, 68, 69, 71, 75, 76, and 104, or (4)68-76 and 171 and 175 of SEQ ID No. 1.
In one embodiment, the mutant of the porin monomer has amino acids corresponding to one or more of positions (1)69-76, (2)64-79, (3)62, 63, 68, 69, 71, 75, 76, and 104, or (4)68-76 and 171 and 175 of SEQ ID No. 1 with insertions, deletions, and/or substitutions of amino acids.
In one embodiment, the mutant of the porin monomer has amino acids with mutations only at positions 69-76 corresponding to SEQ ID NO:1 (i.e. K69, P70, T71, P72, a73, S74, S75, F76), or with insertions, deletions and/or substitutions of amino acids at one or more positions.
In one embodiment, the mutant of the porin monomer has amino acids with mutations only at positions corresponding to positions 64-79 of SEQ ID NO:1 (i.e. Q64, T65, G66, Q67, Y68, K69, P70, T71, P72, a73, S74, S75, F76, S77, T78 and S79), or with insertions, deletions and/or substitutions of amino acids at one or more positions.
In one embodiment, the mutant of the porin monomer has amino acids with mutations only at positions corresponding to R62, D63, Y68, K69, T71, S75, F76Q, and E104 of SEQ ID No. 1, or with insertions, deletions and/or substitutions of amino acids at one or more positions.
In one embodiment, the mutant of the porin monomer has amino acids with mutations only at positions corresponding to positions 68-76 of SEQ ID NO:1 (i.e. Y68, K69, P70, T71, P72, a73, S74, S75, and F76), E171, and D175, or with insertions, deletions, and/or substitutions of amino acids at one or more positions.
In one embodiment, the position corresponding to SEQ ID NO. 1 is such that the numbering of the sequence of SEQ ID NO. 1 is used regardless of whether the relative position is unchanged by amino acid insertion or deletion or by using an identity sequence such that the numbering of the sequence is changed. For example, Q64 corresponding to SEQ ID NO:1 can be mutated to Q64L, and even if the SEQ ID NO:1 sequence number is changed or a sequence having the identity as defined herein with SEQ ID NO:1 is used, the amino acid Q corresponding to position 64 of SEQ ID NO:1 (even if position 64 is not in another sequence) can be mutated to L, and still be within the scope of the present invention.
In one embodiment, the amino acids of the mutant of the porin monomer consist of the sequence set forth in SEQ ID No. 1, or a sequence having at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 85%, 80%, 75%, or 70%, 65%, 60%, 55%, or 50% identity thereto, and the amino acids of the mutant of the porin monomer have mutations corresponding to one or more of positions 62-79, 104, 170, and 175 of SEQ ID No. 1.
In one embodiment, the sequence of SEQ ID NO 1 of the porin monomer is from Pseudomonas taeniae. The nucleotide sequence of the amino acid of the SEQ ID NO. 1 is SEQ ID NO. 2.
In one embodiment, amino acids KPTPASSF at positions 69-76 are mutated to M1M2M3M4M5M6M7M8Wherein M is1Selected from R, K or H; m2Is selected from P; m3Selected from S, G, C,U, T, M, A, V, L or I; m4Is selected from P; m5Selected from A, G, V, L or I; m6Selected from S, C, U, T or M; m7Selected from A, T, G, V, L, I, S, C, U or M; m8Selected from Q, D, E, N, K, H or R.
In one embodiment, amino acids QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 are mutated to M9M10M11M12M13M14M15M16M17M18M19M20M21M22M23M24Wherein M is9Selected from L, G, A, V or I; m10Selected from T, S, C, U or M; m11Selected from G, A, V, L or I; m12Selected from Q, D, E or N; m13Selected from Y, F or W; m14Selected from R, H or K; m15Is selected from P; m16Selected from S, C, U, T or M; m17Is selected from P; m18Selected from A, G, V, L or I; m19Selected from S, C, U, T or M; m20Selected from A, G, V, L or I; m21Selected from N, D, E, Q, L, G, A, V or I; m22Selected from S, C, U, T or M; m23Selected from T, S, C, U or M; m24Selected from A, G, V, L or I.
In one embodiment, R62 corresponding to SEQ ID No. 1 is mutated to R62S, R62C, R62U, R62T or R62M; d63 is mutated to D63V, D63G, D63A, D63L or D63I; y68 to Y68F or Y68W; k69 is mutated to K69R or K69H; the T71 mutation is T71S, T71C, T71U or T71M; the S75 mutation is S75A, S75G, S75V, S75L or S75I; f76 to F76Q, F76D, F76E, or F76N; the E104 mutation is E104V, E104G, E104A, E104L, or E104I.
In one embodiment, amino acids YKPTPASSF corresponding to positions 68-76 of SEQ ID NO. 1 are mutated to M25M26M27M28M29M30M31M32M33And the E171 mutation is E171N, E171D, or E171Q; the D175 mutation is D175N, D175E or D175Q, wherein M25Selected from F, Y or W; m26Selected from R, H or K; m27Is selected from P; m28Selected from S,C. U, T or M; m29Is selected from P; m30Selected from A, G, V, L or I; m31Selected from S, C, U, T or M; m32Selected from A, G, V, L or I; m33Selected from Q, D, E, or N.
In one embodiment, the mutant of a porin monomer, wherein the amino acid mutation is selected from the group consisting of:
(a) amino acid KPTPASSF corresponding to 69-76 bits of SEQ ID NO. 1 is mutated into RPSPASAQ;
(b) amino acid KPTPASSF corresponding to 69-76 bits of SEQ ID NO. 1 is mutated into KPGPASTK;
(c) amino acid QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 was mutated to LTGQYRPSPASANSTA;
(d) amino acid QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 was mutated to LTGQYRPSPASALSTA;
(e) R62S, D63V, Y68F, K69R, T71S, S75A, F76Q, and E104V corresponding to SEQ ID NO 1; and
(f) amino acids YKPTPASSF corresponding to positions 68-76 of SEQ ID NO. 1 were mutated to FRPSPASAQ and to E171N and D175N of SEQ ID NO. 1.
In one embodiment, the amino acid sequence of the mutant of the porin monomer comprises or consists of SEQ ID NO 24, SEQ ID NO 26, SEQ ID NO 27, SEQ ID NO 28, SEQ ID NO 29 or SEQ ID NO 30.
In one embodiment, the protein pore comprises a mutant of at least one porin monomer (or porin mutated monomer). In one embodiment, the protein pore comprises mutants of at least two, three, four, five, six, seven, eight, nine or ten or more pore protein monomers. In one embodiment, the protein pore comprises at least two mutants of porin monomers, which may be the same or different. In one embodiment, the protein pore comprises a mutant of two or more pore protein monomers, preferably the mutants of two or more monomers are identical. In one embodiment, the constriction zone pore diameter of the protein pore is 0.7nm to 2.2nm, 0.9nm to 1.6nm, 1.4 to 1.6nm, or
Use of a mutant of a porin monomer or a protein well comprising the same for detecting the presence, absence or one or more characteristics of a target analyte. In one embodiment, mutants of porin monomers or protein pores are used to detect the sequence of a nucleic acid molecule, or to characterize a polynucleotide sequence, e.g., sequence a polynucleotide sequence, as they can distinguish different nucleotides with high sensitivity. Mutants of porin monomers or protein wells comprising them can distinguish between four nucleotides in DNA and RNA, even methylated and unmethylated nucleotides, with unexpectedly high resolution. Mutant of porin monomers or protein wells showed almost complete separation of all four DNA/RNA nucleotides. Deoxycytidine monophosphate (dCMP) and methyl-dCMP are further distinguished based on the residence time in the protein pore and the current flowing through the protein pore.
Mutants of porin monomers or protein pores can also distinguish between different nucleotides under a range of conditions. In particular, mutants of the porin monomers or protein pores distinguish nucleotides under conditions that facilitate nucleic acid characterization, such as sequencing. By varying the applied potential, salt concentration, buffer, temperature and the presence of additives such as urea, betaine and DTT, the extent to which mutants of porin monomers or protein wells distinguish between different nucleotides can be controlled. This allows mutants of porin monomers or protein pore functions to be finely controlled, especially in sequencing. Mutants of porin monomers or protein pores may also be used to identify polynucleotide polymers by interaction with one or more monomers rather than on nucleotide-based nucleotides.
A mutant of a porin monomer or protein pore may be isolated, substantially isolated, purified, or substantially purified. The mutant of the porin monomer or protein pore of the examples is isolated or purified if it is completely free of any other components, such as liposomes or other protein pores/porins. A mutant of a porin monomer or protein pore is substantially isolated if it is mixed with a carrier or diluent that does not interfere with its intended use. For example, a mutant of a porin monomer or a protein pore is substantially isolated or substantially purified if it is present in a form comprising less than 10%, less than 5%, less than 2%, or less than 1% of other components such as triblock copolymers, liposomes, or other protein pores/porins. Alternatively, a mutant of a porin monomer or protein pore may be present in the membrane.
For example, the membrane is preferably an amphiphilic layer. The amphiphilic layer is a layer formed of amphiphilic molecules, for example, phospholipids, which have hydrophilicity and lipophilicity. The amphiphilic molecules may be synthetic or naturally occurring. The amphiphilic layer may be a monolayer or a bilayer. The amphiphilic layer is generally planar. The amphiphilic layer may be curved. The amphiphilic layer may be supported. The membrane may be a lipid bilayer. The lipid bilayer is formed by two opposing layers of lipid. The two layers of lipids are arranged such that their hydrophobic tail groups face each other to form a hydrophobic interior. The hydrophilic head groups of the lipid face outward toward the aqueous environment on each side of the bilayer. The membrane includes a solid layer. The solid-state layer may be formed of organic and inorganic materials. If the membrane comprises a solid layer, the pores are typically present in the amphiphilic membrane or in a layer comprised within the solid layer, e.g. in holes, wells, gaps, channels, trenches or slits within the solid layer.
Characterization of analytes
Embodiments provide a method of determining the presence, absence or one or more characteristics of a target analyte. The method involves contacting the target analyte with a mutant or protein well of a pore protein monomer such that the target analyte moves relative to, e.g., through, the mutant or protein well of the pore protein monomer and taking one or more measurements as the target analyte moves relative to the mutant or protein well of the pore protein monomer, thereby determining the presence, absence or one or more characteristics of the target analyte. The target analyte may also be referred to as a template analyte or analyte of interest.
The target analyte is preferably a polysaccharide, metal ion, inorganic salt, polymer, amino acid, peptide, polypeptide, protein, nucleotide, oligonucleotide, polynucleotide, dye, drug, diagnostic agent, explosive or environmental contaminant. The methods can involve determining the presence, absence, or one or more characteristics of two or more target analytes of the same class, e.g., two or more proteins, two or more nucleotides, or two or more drugs. Alternatively, the method can involve determining the presence, absence, or one or more characteristics of two or more different classes of target analytes, e.g., one or more proteins, one or more nucleotides, and one or more drugs.
The method comprises contacting the target analyte with a mutant of a porin monomer or a protein well such that the target analyte moves through the mutant of the porin monomer or the protein well. The protein pore typically comprises at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 monomers from which the porin is mutated, e.g., 7, 8, 9, or 10 monomers. The protein pores comprise the same monomer or different porin monomers, preferably comprising 8 or 9 of the same monomer. One or more of the monomers, for example 2, 3, 4, 5, 6, 7, 8, 9 or 10, are preferably chemically modified as discussed above. In one embodiment, the amino acids of each monomer include SEQ ID NO 1 and the above-described mutants thereof. In one embodiment, the amino acid of each monomer consists of SEQ ID NO 1 and its above mutants.
The methods of the embodiments can measure two, three, four, or five or more characteristics of the polynucleotide. The one or more characteristics are preferably selected from (i) the length of the polynucleotide, (ii) the identity of the polynucleotide, (iii) the sequence of the polynucleotide, (iv) the secondary structure of the polynucleotide, and (v) whether or not the polynucleotide is modified. In one embodiment, any combination of (i) to (v) may be measured.
For (i), the length of the polynucleotide can be measured, for example, by determining the number of interactions or the duration of interactions between the polynucleotide and the mutant/protein pore of the protein monomer.
For (ii), the identity of the polynucleotide may be measured in a number of ways, which may be measured in conjunction with or without measurement of the polynucleotide sequence. The former is simpler; the polynucleotide is sequenced and then identified. The latter can be done in several different ways. For example, the presence of a particular motif in a polynucleotide can be measured (without measuring the remaining sequence of the polynucleotide). Alternatively, measurement of a particular electrical and/or optical signal in the method may identify the polynucleotide as being from a particular source.
For (iii), the sequence of the polynucleotide may be determined as previously described. Suitable sequencing methods, particularly those using electrical measurement methods, are described in Stoddart D et al, ProC Natl Acad Sci, 12; 106(19)7702-7, Lieberman KR et al, J Am Chem SoC.2010; 132(50)17961-72, and in International application W02000/28312.
For (iv), secondary structure can be measured using a variety of methods. For example, if the method involves an electrical measurement method, the secondary structure may be measured using changes in dwell time or changes in current flowing through the hole. This allows for distinguishing regions of single-stranded and double-stranded polynucleotides.
For (v), the presence or absence of any modification can be measured. The method preferably comprises determining whether the polynucleotide has been modified by methylation, oxidation, damage, by one or more proteins or by one or more labels, tags or by the absence or absence of nucleobases and sugars. Specific modifications will result in specific interactions with the pore, which can be measured using the methods described below. For example, methylcytosine can be distinguished from cytosine based on the current flowing through the pore during its interaction with each nucleotide.
The target polynucleotide is contacted with a mutant/protein pore of a protein monomer, for example a protein monomer as in the examples. Mutants/protein pores of the protein monomer are usually present in the membrane. Suitable membranes are as described hereinbefore. The method may be carried out using any device suitable for studying membrane/protein pores or mutant systems of porin monomers, in which a mutant of a protein monomer/protein pore is present in a membrane. The method may be performed using any device suitable for trans-membranous pore sensing. For example, the device comprises a chamber containing an aqueous solution and a barrier dividing the chamber into two parts. The barrier typically has a hole in which a membrane containing a pore is formed. Or the barrier forms a membrane in which mutants/protein pores of the protein monomer are present. The process may be carried out using the apparatus described in International application No. PCT/GB08/000562(WO 2008/102120).
Various different types of measurements may be made. This includes, but is not limited to, electrical and optical measurements. Electrical measurements include voltage measurements, capacitance measurements, current measurements, impedance measurements, tunneling measurements (Ivanov AP et al, Nano lett.2011jan 12; 11(I):279-85) and FET measurements (international application TO 2005/124888). Optical measurements can be combined with electrical measurements (Soni GV et al, Rev Sci Instrum.2010Jan; 81(1) 014301). The measurement may be a transmembrane current measurement, for example a measurement of the ionic current flowing through the pore. In one embodiment, the electrical or optical measurements may employ conventional electrical or optical measurements.
Electrical measurements can be made using the techniques described in Stoddart D et al, ProC Natl Acad Sci, 12; 106(19)7702-7, Lieberman KR et al, J Am Chem SoC.2010; 132(50)17961-72 and the standard single-channel recording device of international application WO 2000/28312. Alternatively, the electrical measurements may be made using a multichannel system, for example as described in international application W02009/077734 and international application WO 2011/067559.
The method is preferably carried out using an electrical potential applied across the membrane. The applied potential may be a voltage potential. Alternatively, the applied potential may be a chemical potential. An example of this is the use of a salt gradient across the membrane, for example an amphiphilic molecule layer. Salt gradients are disclosed in Holden et al, J Am Chem soc.2007jul 11; 129(27) 8650-5. In some cases, the current flowing through the mutant/protein pore of the protein monomer as the polynucleotide moves relative to the mutant/protein pore of the protein monomer is used to estimate or determine the sequence of the polynucleotide. This is strand sequencing.
The method may comprise measuring the current flowing through the pore as the polynucleotide moves relative to the pore. The apparatus used in the method may therefore also comprise a circuit capable of applying an electrical potential and measuring an electrical signal across the membrane and the pores. The method may be performed using patch-clamp or voltage clamp.
May comprise measuring the current flowing through the pore as the polynucleotide moves relative to the pore. Suitable conditions for measuring ion flow through a transmembrane protein pore are known in the art and are disclosed in the examples. The method is typically performed by applying a voltage across the membrane and the pores. The voltage used is typically from +5V to-5V, for example from +4V to-4V, from +3V to-3V or from +2V to-2V. The voltages used are generally from-600 mV to +600V or-400 mV to +400 mV. The voltage used is preferably in a range having a lower limit selected from-400 mV, -300mV, -200mV, -150mV, -100mV, -50mV, -20mV and 0mV and an upper limit independently selected from +10mV, +20mV, +50mV, +100mV, +150mV, +200mV, +300nA P +400 mV. The voltage used is more preferably in the range of 100mV to 240mV and most preferably in the range of 120mV to 220 mV. By using an increased applied potential, the recognition of different nucleotides by the wells can be increased.
The process is typically carried out in the presence of any charge carrier, for example a metal salt such as an alkali metal salt, a halide salt such as a chloride salt, for example an alkali metal chloride salt. The charge carrier may comprise an ionic liquid or an organic salt, such as tetramethylammonium chloride, trimethylphenylammonium chloride, phenyltrimethylammonium chloride or 1-ethyl-3-methylimidazole chloride. In the above exemplary apparatus, the salt is present in an aqueous solution in the chamber. Potassium chloride (KCl), sodium chloride (NaCl), cesium chloride (CsCl) or a mixture of potassium ferrocyanide and potassium ferricyanide is generally used. KCl, NaCl and mixtures of potassium ferrocyanide and potassium ferricyanide are preferred. The charge carriers may be asymmetric on the membrane. For example, the type and/or concentration of charge carriers may be different on each side of the membrane.
The concentration of the salt may be saturated. The concentration of the salt may be 3M or less, and is usually 0.1 to 2.5M, 0.3 to 1.9M, 0.5 to 1.8M, 0.7 to 1.7M, 0.9 to 1.6M or 1M to 1.4M. The concentration of the salt is preferably 150mM to 1M. The process is preferably carried out using a salt concentration of at least 0.3M, such as at least 0.4M, at least 0.5M, at least 0.6M, at least 0.8M, at least 1.0M, at least 1.5M, at least 2.0M, at least 2.5M or at least 3.0M. High salt concentrations provide a high signal-to-noise ratio and allow the presence of nucleotides to be identified in the context of normal current fluctuations to be indicated by the current.
The method is typically carried out in the presence of a buffer. In the above exemplary device, the buffer is present in an aqueous solution in the chamber. Any buffer may be used in the methods of the invention. Typically, the buffer is a phosphate buffer. Other suitable buffers are HEPES or Tris-HCl buffers. The process is typically carried out at a pH of 4.0 to 12.0, 4.5 to 10.0, 5.0 to 9.0, 5.5 to 8.8, 6.0 to 8.7, 7.0 to 8.8, or 7.5 to 8.5. The pH used is preferably about 7.5.
The process may be carried out at a temperature of from 0 ℃ to 100 ℃, from 15 ℃ to 95 ℃, from 16 ℃ to 90 ℃, from 17 ℃ to 85 ℃, from 18 ℃ to 80 ℃, from 19 ℃ to 70 ℃ or from 20 ℃ to 60 ℃. The process is typically carried out at room temperature. The process is optionally carried out at a temperature that supports enzyme function, for example about 37 ℃.
In one embodiment, a method for determining the presence, absence or one or more characteristics of a target analyte (e.g., a polynucleotide) comprises coupling the target analyte to a membrane; and the target analyte interacts with (e.g., contacts) the protein pore present in the membrane such that the target analyte moves relative to (e.g., through) the protein pore. In one embodiment, the current through the protein pore is measured as the target analyte moves relative to the protein pore, thereby determining the presence, absence or one or more characteristics (e.g., sequence of polynucleotides) of the target analyte.
Polynucleotide binding proteins
The characterization methods of the embodiments preferably comprise contacting the polynucleotide with a polynucleotide binding protein such that the protein controls movement of the polynucleotide relative to, e.g., through, a mutant/protein pore of a protein monomer.
More preferably, the method comprises (a) contacting the polynucleotide with the mutant/protein pore of the protein monomer and the polynucleotide binding protein such that the protein controls movement of the polynucleotide relative to the mutant/protein pore of the protein monomer, e.g., through the mutant/protein pore of the protein monomer, and (b) taking one or more measurements as the polynucleotide moves relative to the mutant/protein pore of the protein monomer, wherein the measurements are indicative of one or more characteristics of the polynucleotide, thereby characterizing the polynucleotide.
More preferably, the method comprises (a) contacting the polynucleotide with the mutant/protein pore of the protein monomer and the polynucleotide binding protein such that the protein controls movement of the polynucleotide relative to the mutant/protein pore of the protein monomer, e.g., through the mutant/protein pore of the protein monomer, and (b) measuring a current through the mutant/protein pore of the protein monomer as the polynucleotide moves relative to the mutant/protein pore of the protein monomer, wherein the current is indicative of one or more characteristics of the polynucleotide, thereby characterizing the polynucleotide.
The polynucleotide binding protein may be any protein that is capable of binding to a polynucleotide and controlling its movement through a pore. Polynucleotide binding proteins typically interact with and modify at least one property of a polynucleotide. Proteins may be modified by cleaving polynucleotides to form individual nucleotides or short strands of nucleotides (e.g., dinucleotides or trinucleotides). A protein may modify a polynucleotide by orienting it or moving it to a specific location, i.e., controlling its movement.
The polynucleotide binding protein is preferably derived from a polynucleotide processing enzyme. A polynucleotide processive enzyme is a polypeptide that is capable of interacting with a polynucleotide and modifying at least one property of the polynucleotide. The enzyme may modify the polynucleotide by cleaving it to form individual nucleotides or short strands of nucleotides (e.g., dinucleotides or trinucleotides). The enzyme may modify the polynucleotide by orienting it or moving it to a specific location. The polynucleotide-processing enzyme need not exhibit enzymatic activity as long as it is capable of binding a polynucleotide and controlling its movement through the pore. For example, the enzyme may be modified to remove its enzymatic activity, or may be used under conditions that prevent its use as an enzyme.
The polynucleotide processing enzyme is preferably a polymerase, exonuclease, helicase and topoisomerase, e.g., gyrase. In one embodiment, the enzyme is preferably a helicase, such as Hel308Mbu, Hel308Csy, Hel308Tga, Hel308Mhu, Tral Eco, XPD Mbu, Dda, or variants thereof. Any helicase may be used in the examples.
In one embodiment, any number of helicases may be used. For example, I, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more helicases may be used. In some embodiments, a different number of helicases may be used.
The methods of the embodiments preferably comprise contacting the polynucleotide with two or more helicases. The two or more helicases are typically the same helicase. The two or more helicases may be different helicases.
The two or more helicases may be any combination of the helicases described above. The two or more helicases may be two or more Dda helicases. The two or more helicases may be one or more Dda helicases and one or more TrwC helicases. The two or more helicases may be different variants of the same helicase.
The two or more helicases are preferably linked to each other. The two or more helicases are more preferably covalently linked to each other. Helicases may be ligated in any order and using any method.
Reagent kit
The invention also provides a kit for characterizing a target analyte (e.g., a target polynucleotide). The kit contains the components of the wells and membranes of the examples. The film is preferably formed from the components. Pores are preferably present in the membrane. The kit may comprise the components of any of the membranes disclosed above (e.g., an amphiphilic layer or a triblock copolymer membrane). The kit may further comprise a polynucleotide binding protein. Any of the polynucleotide binding proteins discussed above may be used.
In one embodiment, the membrane is an amphiphilic layer, a solid state layer, or a lipid bilayer.
The kit may further comprise one or more anchors for coupling the polynucleotide to the membrane.
The kit is preferably for characterizing double-stranded polynucleotides and preferably comprises a Y adaptor and a hairpin loop adaptor.
The Y adaptor preferably has one or more helicases ligated, and the hairpin loop adaptor preferably has one or more molecular brakes ligated. The Y adaptor preferably comprises one or more first anchors for coupling the polynucleotide to the membrane, the hairpin loop adaptor preferably comprises one or more second anchors for coupling the polynucleotide to the membrane, and the strength of coupling of the hairpin loop adaptor to the membrane is preferably greater than the strength of coupling of the Y adaptor to the membrane.
The kit may additionally comprise one or more other reagents or instruments that enable performance of any of the embodiments mentioned above. Such reagents or instruments include one or more of the following: suitable buffers (aqueous solutions), means for obtaining a sample from an individual (e.g., a container or instrument containing a needle), means for amplifying and/or expressing a polynucleotide, or a voltage or patch clamp device. The reagents may be present in the kit in a dry form such that the fluid sample re-suspends the reagents. The kit may also optionally contain instructions to enable use of the kit with the methods of the invention or details as to which organism may use the method.
Equipment (or device)
The invention also provides a device for characterizing a target analyte (e.g., a target polynucleotide). The device comprises a mutant/protein pore of single or multiple protein monomers, and single or multiple membranes. Mutants/pores of the protein monomer are preferably present in the membrane. The number of pores and membranes is preferably equal. Preferably, there is a single hole in each membrane.
The apparatus preferably further comprises instructions for implementing the method of the embodiments. The device may be any conventional device for analyte analysis, for example an array or chip. Any of the embodiments discussed in connection with the method of the embodiment are equally applicable to the device. The device may also include any of the features present in the kits described herein. The apparatus used in the examples may be embodied as a QNome-9604.
The above mentioned prior art is incorporated herein by reference in its entirety.
The following examples are given to illustrate the invention without limiting it.
Example 1
In an embodiment, the wild-type porin is from Pseudomonas taeniae, and the amino acid sequence of the wild-type porin is SEQ ID NO. 1, and the nucleotide sequence encoding this amino acid sequence is represented by SEQ ID NO. 2. Mutant 1 of porin monomer is wild type porin with mutation at position 69-76 corresponding to SEQ ID NO. 1, specifically KPTPASSF at position 69-76 is replaced by RPSPASAQ. The protein pore of mutant 1 comprising a porin monomer is mutant pore 1. The amino acid sequence of the mutant 1 of the protein monomer is shown as SEQ ID NO. 24, and the nucleic acid sequence is shown as SEQ ID NO. 25.
Example 2
In an example, the wild-type porin is from Pseudomonas taeniae, and the amino acid sequence of the wild-type porin is SEQ ID NO. 1, and the nucleotide sequence of the sequence encoding this amino acid is represented by SEQ ID NO. 2. Mutant 2 of porin monomer is wild-type porin with mutations at positions 69-76 corresponding to SEQ ID NO. 1, specifically KPTPASSF at positions 69-76 is replaced with KPGPASTK. The protein pore of mutant 2, which includes a porin monomer, is mutant pore 2. The amino acid sequence of the mutant 2 of the protein monomer is shown as SEQ ID NO. 26.
Example 3
In an example, the wild-type porin is from Pseudomonas taeniae, and the amino acid sequence of the wild-type porin is SEQ ID NO. 1, and the nucleotide sequence of the sequence encoding this amino acid is represented by SEQ ID NO. 2. Mutant 3 of porin monomer is wild-type porin with mutations at positions 64-79 corresponding to SEQ ID NO. 1, in particular QTGQYKPTPASSFSTS-LTGQYRPSPASANSTA-position 64-79. The protein pore of mutant 3, which includes a porin monomer, is mutant pore 3. The amino acid sequence of mutant 3 of the protein monomer is shown as SEQ ID NO. 27.
Example 4
In an example, the wild-type porin is from Pseudomonas taeniae, and the amino acid sequence of the wild-type porin is SEQ ID NO. 1, and the nucleotide sequence of the sequence encoding this amino acid is represented by SEQ ID NO. 2. Mutant 4 of porin monomer is wild-type porin with mutations at positions 64-79 corresponding to SEQ ID NO. 1, in particular QTGQYKPTPASSFSTS-LTGQYRPSPASALSTA at positions 64-79. The protein pore of mutant 4 comprising a porin monomer is mutant pore 4. The amino acid sequence of the mutant 4 of the protein monomer is shown as SEQ ID NO. 28.
Example 5
In an example, the wild-type porin is from Pseudomonas taeniae, and the amino acid sequence of the wild-type porin is SEQ ID NO. 1, and the nucleotide sequence of the sequence encoding this amino acid is represented by SEQ ID NO. 2. Mutant 5 of porin monomer is wild-type porin mutated at the following position corresponding to SEQ ID NO: 1: R62S, D63V, Y68F, K69R, T71S, S75A, F76Q, and E104V. The protein pore of mutant 5 comprising a porin monomer is mutant pore 5. The amino acid sequence of the mutant 5 of the protein monomer is shown as SEQ ID NO. 29.
Example 6
In an example, the wild-type porin is from Pseudomonas taeniae, and the amino acid sequence of the wild-type porin is SEQ ID NO. 1, and the nucleotide sequence of the sequence encoding this amino acid is represented by SEQ ID NO. 2. Mutant 6 of porin monomer is wild-type porin with mutations at positions 68-76, 171 and 175 corresponding to SEQ ID NO 1, in particular YKPTPASSF substitution at positions 68-76 to FRPSPASAQ, E substitution at position 171 to N and D substitution at position 175 to N. The protein pore of mutant 6, which includes a porin monomer, is mutant pore 6. The amino acid sequence of the mutant 6 of the protein monomer is shown as SEQ ID NO. 30.
Example 7
The wild-type porin is subjected to homologous modeling by SWISS MODEL, and the amino acid of a wild-type porin monomer is shown in SEQ ID NO. 1. FIG. 4A is a side view 400 of a model of predicted protein structure, where the darker part shows a single protein monomer 402. Fig. 4B is a top view 404 of the ribbon structure model, wherein the darker portion shows a protein monomer 406. Fig. 4C is a representation 408 of the ribbon structure with darker portions of protein monomers 410.
FIG. 6 shows the sizes of the portions of mutant 1 of porin monomers, with the diameter of the constriction zone channel between mutants of two porin monomers (i.e.two mutated porin monomers) 602 and 604 being at most the diameter of the pore channelSecondly, it isMinimum diameter ofThe head-to-tail distance between two mutant porin monomers is respectivelyAndmutant porin monomer full length The distance between the head of the mutant porin monomer and the pore canal of the constriction zone is as high asThe height of the channel part of the constriction zone is shown in figure 6And
figure 7 shows the monomeric amino acid model of mutant 1 of porin, shown enlarged as the amino acid composition of the constriction zone structure, i.e. Gln76, Ser74 and Ser 71.
Example 8
Negative staining electron microscopy results for mutant 1 of porin are shown in fig. 8, as seen by negative staining EM results: mutant 1 particles were uniform with little aggregation, and many apparently correct protein particles were seen.
Cryo-electron micrographs and 2D classification results of mutant 1 of porin are shown in fig. 9A and 9B, where the 2D results only show a better classification.
FIGS. 10A and 10B show the results of local refinement of Fourier shell correlation coefficient (FSC), and it can be seen that the resolution of different regions of mutant 1 of porin and the resolution of single particle reconstruction of the final cryoelectron microscope are shown asThe reconstruction resolution is determined based on the gold standard FSC 0.143 standard and the high resolution noise substitution. FIG. 11 shows mutant 1 cryo-electron microscopy three-dimensional reconstruction of porinsElectron density map of resolution. FIG. 12 shows mutant 1 of porin proteinElectron density map of (1). The map shows the channel eye-loop region and is overlaid on the final refined model.
Cryo-electron microscopy data for porin mutant 1 are shown in table 4.
TABLE 4 cryo-electron microscopy data for mutant 1 of porin
Example 9 preparation of DNA constructs
Two DNA constructs, BS7-4C3-SE1 and BS7-4C3-PLT, respectively, were made. The structure of BS7-4C3-SE1 is shown in FIG. 13, and the sequence information is shown below:
a:30*C3
5'-TTTTT TTTTT-3' (i.e. SEQ ID NO:3)
c, rate controlling protein
d:4*C18
e:5’-AATGT ACTTC GTTCA GTTAC GTATTGCT-3' (i.e. SEQ ID NO:4)
g cholesterol label
i: 5'-AAAAA AAAAA AAAAA AAAAA AAAAA AAAAA AAAAA AAAAA AAGCA ATACG TAACT GAACG AAGTA CATTA AAAAA AAAAA AAAAA AAAA-3' (i.e., SEQ ID NO:7)
5'-ATCCT TTTTT TTTTT TTTTT TTTT-3' (i.e., SEQ ID NO:8)
k: 5'-AATGT ACTTC GTTCA GTTAC GTATT GCTTT TTTTT TTTTT TTTTT TTT-3' (i.e., SEQ ID NO:9)
l:dSpacer
m: 5'-TTTTT TTTTT TTTTT TTTTT-3' (i.e., SEQ ID NO:10)
The structure of BS7-4C3-PLT is shown in FIG. 14, and the sequence information is as follows:
a:30*C3
5'-TTTTT TTTTT-3' (i.e. SEQ ID NO:11)
c, rate controlling protein
d:4*C18
e: 5'-AATGT ACTTC GTTCA GTTAC GTATT GCT-3' (i.e., SEQ ID NO:12)
f 5 'P-GC AATAC GTAAC TGAAC GAAGT TCACTATCGCATTCTCATGA-3' (i.e. SEQ ID NO:13)
g cholesterol label
h:5'-TCATG AGAAT GCGAT AGTGA-3' (i.e., SEQ ID NO:14)
i 5 '-AAAAAAAAAAAAAAAAAAAAAAAAAAAA (i.e., SEQ ID NO:15)/dSpacer/AAAAAAAAAA (i.e., SEQ ID NO:16)/dSpacer/AAAAAAAAAAAAAATCTCTGAATCTCTGAATCTCTGAATCTCTAAAAAAAAAAAAGAAAAAAAAAAAACAAAAAAAAAAAATAAAAAAAAAAAAAGCAATACGTAACTGAACGAAGTACATTAAAAAAAAAA (i.e., SEQ ID NO:17) -3'
j: 5'-ATCCTTTTTTTTTTAATGTACTTCGTTCAGTTACGTATTGCT-3' (i.e. SEQ ID NO:18)
k 5 'P-TTTTTTTTTTTTATTTTTTTTTTTTGTTTTTTTTTTTTCTTTTTTTTTTTTAGAGATTCAGAGATTCAGAGATTCAGAGATTTTTTTTTTTTTT (i.e., SEQ ID NO:19)/dSpacer/TTTTTTTTTTTT (i.e., SEQ ID NO:20)/iSPC3/TTTTTTTTTTTTTTTTTTTTTTTTTTTT (i.e., SEQ ID NO:21) -3'
C3, C18, dSpacer and iSpC3 are marker (marker) sequences introduced indicating the resolution characteristics of pore sequencing.
In this example, the c-tachykinin protein in FIGS. 13 and 14 is helicase Mph-MP1-E105C/A362C (with mutation E105C/A362C), the amino acid sequence is SEQ ID NO:22, and the nucleic acid sequence is SEQ ID NO: 23.
Example 10
The mutant hole 1 is used as a protein hole and is detected by adopting a technical method of single-hole sequencing. After insertion of a single porin with the amino acid sequence mutant 1 into the phospholipid bilayer, buffer (625mM KCl, 10mM HEPES pH 8.0, 50mM MgCl)2) Flow through the system to remove any excess mutant 1 nanopores. The DNA constructs BS7-4C3-SE1 (data not shown) or BS7-4C3-PLT (1-2 nM final concentration) were added to the mutant 1 nanopore assay system and after mixing, buffer (625mM KCl, 10mM HEPES pH 8.0, 50mM MgCl2) Flows throughThis system to remove any excess of the DNA construct BS7-4C3-SE1 or BS7-4C 3-PLT. A premix of helicase (Mph-MP1-E105C/A362C, 15nM final concentration), fuel (ATP 3mM final concentration) was then added to the single mutant 1 nanopore assay system and the sequencing of mutant 1 pore proteins was monitored at +180mV voltage.
The mutant wells 1 were opened at a voltage of + -180 mV. FIG. 15A shows the opening current and its gating characteristics for mutant hole 1 at 180 mV. FIG. 15B shows single stranded nucleic acid via scenario for mutant well 1 at +180mV voltage. The nucleic acid may be passed through a pore. The downward line shows the nucleic acid pore signal after addition of single-stranded nucleic acid.
And sequencing the DNA construct BS7-4C3-PLT by using a single-hole sequencing technology through the mutant hole 1, and adding a nucleic acid sequencing signal appearing in a sequencing system after hole embedding. FIGS. 16A and 16B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of the DNA construct BS7-4C3-PLT through the mutation pore 1. Based on this signal characteristic, the mutant well 1 can be used for nucleic acid sequencing.
Fig. 17 is an enlarged view of the portion of fig. 16B showing the current trajectory. The graph with the dotted line frame and the arrow (middle graph) is the result of the filtering process of the original signal (y-axis coordinate of the two traces is current (pA) and x-axis coordinate is time (s)). The dotted arrow indicates a portion showing the result of enlargement of the current trace. The area of this single signal is shown in an enlarged scale, further demonstrating that mutant well 1 can be used for nucleic acid sequencing.
Example 11
Similar to example 10, example 11 uses the bump hole 2 for the empty test and the via hole test.
FIG. 18A shows the opening current at a voltage of + -180 mV for mutant hole 2 and its gating characteristics. FIG. 18B shows single stranded nucleic acid via scenario for mutant well 2 at +180mV voltage. The nucleic acid may be passed through a pore. The downward line shows the nucleic acid pore signal after addition of single-stranded nucleic acid.
And (3) sequencing the DNA construct BS7-4C3-PLT by using a single-hole sequencing technology through the mutant hole 2, and adding a nucleic acid sequencing signal appearing in a sequencing system after hole embedding. FIGS. 19A and 19B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of the DNA construct BS7-4C3-PLT through the mutation pore 2. According to the signal characteristics, the sequencing resolution, stability, signal consistency and other related characteristics of the mutation hole 2 can be obtained. The hole has clear steps, obvious jump distribution and high-precision sequencing capability. From the signal characteristics, the sequencing signal is highly consistent.
Fig. 20 shows the result of enlargement of a part of the current trace. The graph with the dotted line frame and the arrow shows the result of the filtering process of the original signal (the y-axis coordinate of the two traces is current (pA) and the x-axis coordinate is time (s)). The dotted arrow indicates a portion showing the result of enlargement of the current trace. The area of this single signal is shown in magnified form, indicating that the mutant well has high resolution for nucleic acid sequencing.
Example 12
Similar to example 10, example 12 uses the bump hole 3 for the empty test and the via hole test.
FIG. 21A shows the opening current and its gating characteristics for mutant hole 3 at 180 mV. FIG. 21B shows single stranded nucleic acid via scenario for mutant well 3 at +180mV voltage. The nucleic acid may be passed through a pore. The downward line shows the nucleic acid pore signal after addition of single-stranded nucleic acid.
And sequencing the DNA construct BS7-4C3-PLT through the mutant hole 3 by adopting a single-hole sequencing technical method, and adding a nucleic acid sequencing signal appearing in a sequencing system after hole embedding. FIGS. 22A and 22B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of the DNA construct BS7-4C3-PLT through the mutation pore 3. Based on this signal characteristic, mutant well 3 can be used for nucleic acid sequencing.
Fig. 23 is an enlarged view of a current trajectory shown in part 22A. The graph with the dotted line frame and the arrow shows the result of the filtering process of the original signal (the y-axis coordinate of the two traces is current (pA) and the x-axis coordinate is time (s)). The dotted arrow indicates a portion showing the result of enlargement of the current trace. The area of this single signal is shown in an enlarged scale, further demonstrating that mutant pore 3 can be used for nucleic acid sequencing.
Example 13
Similar to example 10, example 13 uses the bump hole 4 for the empty test and the via hole test.
FIG. 24A shows the opening current at a voltage of + -180 mV for mutant hole 4 and its gating characteristics. FIG. 24B shows single stranded nucleic acid via scenario for mutant well 4 at +180mV voltage. The nucleic acid may be passed through a pore.
And (3) sequencing the DNA construct BS7-4C3-PLT through the mutant pore 4 by adopting a single-pore sequencing technical method, and adding a nucleic acid sequencing signal appearing in a sequencing system after pore embedding. FIGS. 25A and 25B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of the DNA construct BS7-4C3-PLT through the mutation pore 4. Based on this signal characteristic, the mutant wells 4 can be used for nucleic acid sequencing.
Fig. 26 is an enlarged view of the current trace partially shown in the embodiment of fig. 25A and 25B. The graph with the dotted line frame and the arrow shows the result of the filtering process of the original signal (the y-axis coordinate of the two traces is current (pA) and the x-axis coordinate is time (s)). The dotted arrow indicates a portion showing the result of enlargement of the current trace. The area of this single signal is shown in an enlarged scale, further demonstrating that mutant pore 4 can be used for nucleic acid sequencing.
Example 14
Similar to example 10, example 14 uses the bump hole 5 for the empty test and the via test.
FIG. 27 shows the opening current and its gating characteristics for mutant opening 5 at 180 mV.
And (3) sequencing the DNA construct BS7-4C3-PLT through the mutant pore 5 by adopting a single-pore sequencing technical method, and adding a nucleic acid sequencing signal appearing in a sequencing system after pore embedding. FIG. 28 shows exemplary current traces when helicase Mph-MP1-E105C/A362C controls translocation of the DNA construct BS7-4C3-PLT through the mutation pore 5. Based on this signal characteristic, mutant wells 5 can be used for nucleic acid sequencing.
Fig. 29 is an enlarged view of the current trace shown in part in the embodiment of fig. 28. The graph with the dotted line frame and the arrow shows the result of the filtering process of the original signal (the y-axis coordinate of the two traces is current (pA) and the x-axis coordinate is time (s)). The dotted arrow indicates a portion showing the result of enlargement of the current trace. The area of this single signal is shown in an enlarged scale, further demonstrating that mutant pore 5 can be used for nucleic acid sequencing.
Example 15
Similar to example 10, example 15 uses the bump hole 6 for the empty test and the via hole test.
FIG. 30A shows the opening current and its gating characteristics for mutant opening 6 at 180 mV. FIG. 30B shows single stranded nucleic acid via scenario for mutant via 6 at +180mV voltage. The nucleic acid may be passed through a pore. The downward line shows the nucleic acid pore signal after addition of single-stranded nucleic acid.
And (3) sequencing the DNA construct BS7-4C3-PLT through the mutant hole 6 by adopting a single-hole sequencing technical method, and adding a nucleic acid sequencing signal appearing in a sequencing system after hole embedding. FIGS. 31A and 31B show example current trajectories when helicase Mph-MP1-E105C/A362C controls translocation of the DNA construct BS7-4C3-PLT through the mutation pore 6. According to the signal characteristics, the sequencing resolution, stability, signal consistency and other related characteristics of the mutation hole 6 can be obtained. The hole has clear steps, obvious jump distribution and high-precision sequencing capability. From the signal characteristics, the sequencing signal is highly consistent.
Fig. 32 is an enlarged view of a current trajectory shown in part of fig. 31A. The graph with the dotted line frame and the arrow shows the result of the filtering process of the original signal (the y-axis coordinate of the two traces is current (pA) and the x-axis coordinate is time (s)). The dotted arrow indicates a portion showing the result of enlargement of the current trace. The area of this single signal is shown in magnified form, indicating that the mutant well has high resolution for nucleic acid sequencing.
Example 16
The recombinant plasmid containing mutant 1 nucleic acid sequence (SEQ ID NO:25) of porin monomer is transformed into BL21(DE3) competent cells by a heat shock method, 0.5ml of LB culture medium is added to be cultured for 1h at 30 ℃, then a proper amount of bacterial liquid is taken and coated on an ampicillin resistant solid LB plate, overnight culture is carried out at 37 ℃, a single colony is picked the next day and inoculated into 50ml of liquid LB culture medium containing ampicillin resistance to be cultured overnight at 37 ℃. Transferred to ampicillin-resistant TB liquid medium at an inoculum size of 1% for scale-up culture, cultured at 37 ℃ and 220rpm, and continuously measured for OD 600. When OD600 ═ 2.0-2.2, the culture broth in TB medium was cooled to 16 ℃ and expression was induced by addition of Isopropylthiogalactoside (IPTG) to reach a final concentration of 0.015 mM. After the induction expression is carried out for 20-24h, the thalli are collected by centrifugation. The thalli is crushed under high pressure after being resuspended by crushing buffer solution, purified by a Ni-NTA affinity chromatography method, and a target elution sample is collected. Mutant 2-6 of porin monomers was purified as above.
Illustratively, the protein purification results of mutant 1 are shown in FIG. 33, and SDS-PAGE electrophoresis of the separated different fractions is shown in lanes 1-4. Figure 34 shows an example of mutant 1 protein Size Exclusion Chromatography (SEC) (25mL superose-6GE healthcare, x-axis label ═ elution volume (mL), Y-axis label ═ absorbance (mAu)).
SEQUENCE LISTING
<110> Chengdu carbon technology Co., Ltd
Mutant of <120> porin monomer, protein pore and application thereof
<130> SPI213793-23
<160> 30
<170> PatentIn version 3.5
<210> 1
<211> 283
<212> PRT
<213> Pseudomonas taeanensis
<400> 1
Met Asn Arg Leu Leu Ala Met Ala Met Leu Leu Ala Ala Leu Gln Gly
1 5 10 15
Cys Ala Leu Arg Glu Pro Met Pro Ala Glu Leu Glu Asn Ala Thr Pro
20 25 30
Thr Leu Thr Pro Arg Ala Ser Thr Tyr Tyr Asp Leu Leu Ala Leu Pro
35 40 45
Arg Pro Lys Gly Arg Leu Val Ala Ala Val Tyr Gly Phe Arg Asp Gln
50 55 60
Thr Gly Gln Tyr Lys Pro Thr Pro Ala Ser Ser Phe Ser Thr Ser Val
65 70 75 80
Thr Gln Gly Ala Ala Ser Met Leu Val Asp Ala Leu Gln Ala Ser Gly
85 90 95
Trp Phe Arg Val Leu Glu Arg Glu Gly Leu Gln Asn Ile Leu Thr Glu
100 105 110
Arg Lys Ile Ile Arg Ala Ser Gln Asn Lys Pro Asn Thr Pro Val Asn
115 120 125
Ile Gln Ala Asp Leu Pro Ser Leu Gln Ala Ala Asn Ile Leu Leu Glu
130 135 140
Gly Gly Val Ile Ala Tyr Asp Thr Asn Val Arg Ser Gly Gly Glu Gly
145 150 155 160
Ala Ala Tyr Leu Gly Ile Ser Leu Ser Gln Glu Tyr Arg Val Asp Gln
165 170 175
Val Ser Val Asn Leu Arg Ala Val Asp Val Arg Ser Gly Gln Val Leu
180 185 190
Ala Asn Val Met Thr Ser Lys Thr Ile Tyr Ser Val Gly Arg Asn Ala
195 200 205
Gly Ile Phe Lys Phe Ile Glu Phe Lys Glu Leu Leu Glu Ala Glu Ala
210 215 220
Gly Tyr Thr Thr Asn Glu Pro Ala Gln Leu Cys Val Leu Ser Ala Ile
225 230 235 240
Glu Ala Ala Val Ala His Leu Val Ala Gln Gly Ile Glu Arg Arg Leu
245 250 255
Trp Gln Ala Ala Asp Glu Ser Ser Phe Asp Lys Ser Ser Leu Pro Lys
260 265 270
Tyr Met Ser Gln Ala Arg Ala Asp Pro Leu Pro
275 280
<210> 2
<211> 852
<212> DNA
<213> Pseudomonas taeanensis
<400> 2
atgaacagac tactagcgat ggcgatgttg ctggccgccc tgcaaggctg cgccttgcgc 60
gaacccatgc cggcggagct ggaaaacgcg actccgaccc tgacgccgcg ggcctcgacc 120
tattacgact tgctggccct gccgcggccc aaaggccgcc tggttgccgc tgtgtacggc 180
ttccgcgacc agactgggca gtacaagccg acgcctgcca gttcgttctc caccagcgtg 240
actcagggcg ccgccagcat gctggtcgac gcgttgcagg ccagcggctg gttccgggtg 300
ctggagcgcg agggcctgca gaatatcctg accgaacgca agatcattcg cgcctcgcag 360
aacaagccca ataccccggt gaatatccag gccgatctgc cctcgttgca ggccgccaac 420
atccttctgg aaggcggggt gatcgcctac gacactaatg tacgtagcgg tggtgagggg 480
gctgcctacc tgggtatcag cctgtctcag gagtatcggg tggatcaggt ttcggtgaac 540
ctgcgtgcgg tggatgtgcg cagcggtcag gtactggcca acgtcatgac ctccaagacc 600
atctattcgg tcgggcgcaa tgccggcata ttcaaattca tcgagttcaa ggagctgttg 660
gaggccgagg cgggctacac gaccaatgaa ccggcgcagc tgtgtgtgct ctcggctatc 720
gaagccgccg tggctcatct ggtggcccag ggcatcgagc ggcgcctgtg gcaggctgct 780
gacgaaagct ccttcgataa gtcgtcactg cccaagtaca tgagccaggc gcgggccgat 840
ccactgcctt ag 852
<210> 3
<211> 10
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 3
<210> 4
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 4
aatgtacttc gttcagttac gtattgct 28
<210> 5
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 5
gcaatacgta actgaacgaa gttcactatc gcattctcat ga 42
<210> 6
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 6
<210> 7
<211> 89
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 7
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagcaatacg taactgaacg 60
aagtacatta aaaaaaaaaa aaaaaaaaa 89
<210> 8
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 8
atcctttttt tttttttttt tttt 24
<210> 9
<211> 48
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 9
aatgtacttc gttcagttac gtattgcttt tttttttttt tttttttt 48
<210> 10
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-SE1
<400> 10
<210> 11
<211> 10
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 11
<210> 12
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 12
aatgtacttc gttcagttac gtattgct 28
<210> 13
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 13
gcaatacgta actgaacgaa gttcactatc gcattctcat ga 42
<210> 14
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 14
<210> 15
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 15
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 28
<210> 16
<211> 12
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 16
<210> 17
<211> 132
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 17
aaaaaaaaaa aaaatctctg aatctctgaa tctctgaatc tctaaaaaaa aaaaagaaaa 60
aaaaaaaaca aaaaaaaaaa ataaaaaaaa aaaaagcaat acgtaactga acgaagtaca 120
ttaaaaaaaa aa 132
<210> 18
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 18
atcctttttt ttttaatgta cttcgttcag ttacgtattg ct 42
<210> 19
<211> 94
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 19
tttttttttt ttattttttt tttttgtttt ttttttttct tttttttttt tagagattca 60
gagattcaga gattcagaga tttttttttt tttt 94
<210> 20
<211> 12
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 20
<210> 21
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> part of BS7-4C3-PLT
<400> 21
tttttttttt tttttttttt tttttttt 28
<210> 22
<211> 441
<212> PRT
<213> Artificial Sequence
<220>
<223> helicase
<400> 22
Met Ile Thr Ile Asp Gln Leu Thr Glu Gly Gln Phe Asp Ser Leu Gln
1 5 10 15
Arg Ala Lys Val Leu Ile Gln Glu Ala Thr Lys Asn Asp Gly Asn Trp
20 25 30
Asn His Arg Thr Lys His Leu Thr Ile Asn Gly Pro Ala Gly Thr Gly
35 40 45
Lys Thr Thr Met Met Lys Phe Leu Val Ser Trp Leu Arg Asp Glu Gly
50 55 60
Ile Thr Gly Val Ala Leu Ala Ala Pro Thr His Ala Ala Lys Lys Val
65 70 75 80
Leu Ala Asn Ala Val Gly Glu Glu Val Ser Thr Ile His Ser Ile Leu
85 90 95
Lys Ile Asn Pro Thr Thr Tyr Glu Cys Lys Gln Phe Phe Glu Gln Ser
100 105 110
Ala Pro Pro Asp Leu Ser Lys Ile Arg Ile Leu Ile Cys Glu Glu Cys
115 120 125
Ser Phe Tyr Asp Ile Lys Leu Phe Glu Ile Leu Met Asn Ser Ile Gln
130 135 140
Pro Trp Thr Ile Ile Ile Gly Ile Gly Asp Arg Ala Gln Leu Arg Pro
145 150 155 160
Ala Asp Asp Lys Gly Ile Ser Arg Phe Phe Thr Asp Gln Arg Phe Glu
165 170 175
Gln Thr Tyr Leu Thr Glu Ile Lys Arg Ser Asn Met Pro Ile Ile Glu
180 185 190
Val Ala Thr Glu Ile Arg Asn Gly Gly Trp Ile Arg Glu Asn Ile Ile
195 200 205
Asp Asp Leu Gly Val Lys Gln Asp Lys Ser Val Ser Glu Phe Met Thr
210 215 220
Asn Tyr Phe Lys Val Val Lys Ser Ile Asp Asp Leu Tyr Glu Thr Arg
225 230 235 240
Met Tyr Ala Tyr Thr Asn Asn Ser Val Asp Thr Leu Asn Lys Ile Ile
245 250 255
Arg Lys Lys Leu Tyr Glu Thr Glu Gln Asp Phe Ile Val Gly Glu Pro
260 265 270
Ile Val Met Gln Glu Pro Leu Ile Arg Asp Ile Asn Tyr Glu Gly Lys
275 280 285
Arg Phe Gln Glu Ile Val Phe Asn Asn Gly Glu Tyr Leu Glu Val Ser
290 295 300
Glu Ile Lys Pro Met Glu Ser Val Leu Lys Cys Arg Asn Ile Asp Tyr
305 310 315 320
Gln Leu Val Leu His Tyr Tyr Gln Leu Lys Val Lys Ser Ile Asp Thr
325 330 335
Gly Glu Ser Gly Leu Ile Asn Thr Ile Ser Asp Lys Asn Glu Leu Asn
340 345 350
Lys Phe Tyr Met Phe Leu Gly Lys Val Cys Gln Asp Tyr Lys Ser Gly
355 360 365
Thr Ile Lys Ala Phe Trp Asp Asp Phe Trp Lys Ile Lys Asn Asn Tyr
370 375 380
His Arg Val Lys Pro Leu Pro Val Ser Thr Ile His Lys Gly Gln Gly
385 390 395 400
Ser Thr Val Asp Asn Ser Phe Leu Tyr Thr Pro Cys Ile Thr Lys Tyr
405 410 415
Ala Glu Pro Asp Leu Ala Ser Gln Leu Leu Tyr Val Gly Val Thr Arg
420 425 430
Ala Arg His Asn Val Asn Phe Val Gly
435 440
<210> 23
<211> 1326
<212> DNA
<213> Artificial Sequence
<220>
<223> helicase
<400> 23
atgatcacca tcgaccagct gaccgaaggt cagttcgact ctctgcagcg tgctaaagtt 60
ctgatccagg aagctaccaa aaacgacggt aactggaacc accgtaccaa acacctgacc 120
atcaacggtc cggctggtac cggtaaaacc accatgatga aattcctggt ttcttggctg 180
cgtgacgaag gtatcaccgg tgttgctctg gctgctccga cccacgctgc taaaaaagtt 240
ctggctaacg ctgttggtga agaagtttct accatccact ctatcctgaa aatcaacccg 300
accacctacg aatgcaaaca gttcttcgaa cagtctgctc cgccggacct gtctaaaatc 360
cgtatcctga tctgcgaaga atgctctttc tacgacatca aactgttcga aatcctgatg 420
aactctatcc agccgtggac catcatcatc ggtatcggtg accgtgctca gctgcgtccg 480
gctgacgaca aaggtatctc tcgtttcttc accgaccagc gtttcgaaca gacctacctg 540
accgaaatca aacgttctaa catgccgatc atcgaagttg ctaccgaaat ccgtaacggt 600
ggttggattc gtgaaaacat catcgacgac ctgggtgtta aacaggacaa atctgtttct 660
gaatttatga ccaactactt caaagttgtt aaatctatcg acgacctgta cgaaacccgt 720
atgtacgctt acaccaacaa ctctgttgac accctgaaca aaatcatccg taaaaaactg 780
tacgaaaccg aacaggactt catcgttggt gaaccgatcg ttatgcagga accgctgatc 840
cgtgacatca actacgaagg taaacgtttc caggaaatcg ttttcaacaa cggtgaatac 900
ctggaagttt ctgaaatcaa accgatggaa tctgttctga aatgccgtaa catcgactac 960
cagctggttc tgcactacta ccagctgaaa gttaaatcta tcgacaccgg tgaatctggt 1020
ctgatcaaca ccatctctga caaaaacgaa ctgaacaaat tctacatgtt cctgggtaaa 1080
gtttgccagg actacaaatc tggtaccatc aaagcgttct gggacgactt ctggaaaatc 1140
aaaaacaact accaccgtgt taaaccgctg ccggtttcta ccatccacaa aggtcagggt 1200
tctaccgttg acaactcttt cctgtacacc ccgtgcatca ccaaatacgc tgaaccggac 1260
ctggcttctc agctgctgta cgttggtgtt acccgtgctc gtcacaacgt taacttcgtt 1320
ggttaa 1326
<210> 24
<211> 283
<212> PRT
<213> Artificial Sequence
<220>
<223> mutant 1
<400> 24
Met Asn Arg Leu Leu Ala Met Ala Met Leu Leu Ala Ala Leu Gln Gly
1 5 10 15
Cys Ala Leu Arg Glu Pro Met Pro Ala Glu Leu Glu Asn Ala Thr Pro
20 25 30
Thr Leu Thr Pro Arg Ala Ser Thr Tyr Tyr Asp Leu Leu Ala Leu Pro
35 40 45
Arg Pro Lys Gly Arg Leu Val Ala Ala Val Tyr Gly Phe Arg Asp Gln
50 55 60
Thr Gly Gln Tyr Arg Pro Ser Pro Ala Ser Ala Gln Ser Thr Ser Val
65 70 75 80
Thr Gln Gly Ala Ala Ser Met Leu Val Asp Ala Leu Gln Ala Ser Gly
85 90 95
Trp Phe Arg Val Leu Glu Arg Glu Gly Leu Gln Asn Ile Leu Thr Glu
100 105 110
Arg Lys Ile Ile Arg Ala Ser Gln Asn Lys Pro Asn Thr Pro Val Asn
115 120 125
Ile Gln Ala Asp Leu Pro Ser Leu Gln Ala Ala Asn Ile Leu Leu Glu
130 135 140
Gly Gly Val Ile Ala Tyr Asp Thr Asn Val Arg Ser Gly Gly Glu Gly
145 150 155 160
Ala Ala Tyr Leu Gly Ile Ser Leu Ser Gln Glu Tyr Arg Val Asp Gln
165 170 175
Val Ser Val Asn Leu Arg Ala Val Asp Val Arg Ser Gly Gln Val Leu
180 185 190
Ala Asn Val Met Thr Ser Lys Thr Ile Tyr Ser Val Gly Arg Asn Ala
195 200 205
Gly Ile Phe Lys Phe Ile Glu Phe Lys Glu Leu Leu Glu Ala Glu Ala
210 215 220
Gly Tyr Thr Thr Asn Glu Pro Ala Gln Leu Cys Val Leu Ser Ala Ile
225 230 235 240
Glu Ala Ala Val Ala His Leu Val Ala Gln Gly Ile Glu Arg Arg Leu
245 250 255
Trp Gln Ala Ala Asp Glu Ser Ser Phe Asp Lys Ser Ser Leu Pro Lys
260 265 270
Tyr Met Ser Gln Ala Arg Ala Asp Pro Leu Pro
275 280
<210> 25
<211> 849
<212> DNA
<213> Artificial Sequence
<220>
<223> mutant 1
<400> 25
atgaataggt tgttagcaat ggctatgcta ttggcggctt tgcaaggttg tgcactgcgt 60
gaaccgatgc cggcagaatt ggagaacgcg accccgaccc tgacgccacg tgcgtctacg 120
tactacgacc tgctggctct cccgcgtccg aagggtcgcc tggtggcggc ggtgtatggt 180
tttcgcgatc agactggtca gtaccgtccg tccccggcgt cagcacaaag caccagcgtc 240
acccagggtg cagcgtctat gctggttgac gcattacaag cgtccggctg gttccgcgtg 300
ctggagcgcg aaggcctgca aaacatcctt acggaacgta aaatcattcg tgcatctcag 360
aacaaaccga ataccccagt taatatccaa gcggatctgc cttcgttgca ggcggcgaac 420
attctgctcg agggtggcgt gattgcatac gacaccaatg ttcgttctgg cggtgaaggc 480
gctgcgtacc tgggtatcag cctgtcccaa gaatatcgcg ttgaccaggt tagcgttaac 540
ctgcgtgcgg ttgatgtccg tagcggccaa gtactggcga acgtgatgac ctcgaaaacc 600
atttatagcg tgggtcgtaa tgccggcatt tttaagttca tcgaattcaa agagttgttg 660
gaggcagaag cgggttatac cacaaacgag ccggcccaac tttgcgttct gagcgcaatc 720
gaggcggctg tggcccacct ggtcgcgcag ggcatcgagc gccgtctgtg gcaggctgcc 780
gatgagagct cctttgataa gagcagcttg ccgaagtaca tgagccaggc gagagccgac 840
ccgctgccg 849
<210> 26
<211> 283
<212> PRT
<213> Artificial Sequence
<220>
<223> mutant 2
<400> 26
Met Asn Arg Leu Leu Ala Met Ala Met Leu Leu Ala Ala Leu Gln Gly
1 5 10 15
Cys Ala Leu Arg Glu Pro Met Pro Ala Glu Leu Glu Asn Ala Thr Pro
20 25 30
Thr Leu Thr Pro Arg Ala Ser Thr Tyr Tyr Asp Leu Leu Ala Leu Pro
35 40 45
Arg Pro Lys Gly Arg Leu Val Ala Ala Val Tyr Gly Phe Arg Asp Gln
50 55 60
Thr Gly Gln Tyr Lys Pro Gly Pro Ala Ser Thr Lys Ser Thr Ser Val
65 70 75 80
Thr Gln Gly Ala Ala Ser Met Leu Val Asp Ala Leu Gln Ala Ser Gly
85 90 95
Trp Phe Arg Val Leu Glu Arg Glu Gly Leu Gln Asn Ile Leu Thr Glu
100 105 110
Arg Lys Ile Ile Arg Ala Ser Gln Asn Lys Pro Asn Thr Pro Val Asn
115 120 125
Ile Gln Ala Asp Leu Pro Ser Leu Gln Ala Ala Asn Ile Leu Leu Glu
130 135 140
Gly Gly Val Ile Ala Tyr Asp Thr Asn Val Arg Ser Gly Gly Glu Gly
145 150 155 160
Ala Ala Tyr Leu Gly Ile Ser Leu Ser Gln Glu Tyr Arg Val Asp Gln
165 170 175
Val Ser Val Asn Leu Arg Ala Val Asp Val Arg Ser Gly Gln Val Leu
180 185 190
Ala Asn Val Met Thr Ser Lys Thr Ile Tyr Ser Val Gly Arg Asn Ala
195 200 205
Gly Ile Phe Lys Phe Ile Glu Phe Lys Glu Leu Leu Glu Ala Glu Ala
210 215 220
Gly Tyr Thr Thr Asn Glu Pro Ala Gln Leu Cys Val Leu Ser Ala Ile
225 230 235 240
Glu Ala Ala Val Ala His Leu Val Ala Gln Gly Ile Glu Arg Arg Leu
245 250 255
Trp Gln Ala Ala Asp Glu Ser Ser Phe Asp Lys Ser Ser Leu Pro Lys
260 265 270
Tyr Met Ser Gln Ala Arg Ala Asp Pro Leu Pro
275 280
<210> 27
<211> 283
<212> PRT
<213> Artificial Sequence
<220>
<223> mutant 3
<400> 27
Met Asn Arg Leu Leu Ala Met Ala Met Leu Leu Ala Ala Leu Gln Gly
1 5 10 15
Cys Ala Leu Arg Glu Pro Met Pro Ala Glu Leu Glu Asn Ala Thr Pro
20 25 30
Thr Leu Thr Pro Arg Ala Ser Thr Tyr Tyr Asp Leu Leu Ala Leu Pro
35 40 45
Arg Pro Lys Gly Arg Leu Val Ala Ala Val Tyr Gly Phe Arg Asp Leu
50 55 60
Thr Gly Gln Tyr Arg Pro Ser Pro Ala Ser Ala Asn Ser Thr Ala Val
65 70 75 80
Thr Gln Gly Ala Ala Ser Met Leu Val Asp Ala Leu Gln Ala Ser Gly
85 90 95
Trp Phe Arg Val Leu Glu Arg Glu Gly Leu Gln Asn Ile Leu Thr Glu
100 105 110
Arg Lys Ile Ile Arg Ala Ser Gln Asn Lys Pro Asn Thr Pro Val Asn
115 120 125
Ile Gln Ala Asp Leu Pro Ser Leu Gln Ala Ala Asn Ile Leu Leu Glu
130 135 140
Gly Gly Val Ile Ala Tyr Asp Thr Asn Val Arg Ser Gly Gly Glu Gly
145 150 155 160
Ala Ala Tyr Leu Gly Ile Ser Leu Ser Gln Glu Tyr Arg Val Asp Gln
165 170 175
Val Ser Val Asn Leu Arg Ala Val Asp Val Arg Ser Gly Gln Val Leu
180 185 190
Ala Asn Val Met Thr Ser Lys Thr Ile Tyr Ser Val Gly Arg Asn Ala
195 200 205
Gly Ile Phe Lys Phe Ile Glu Phe Lys Glu Leu Leu Glu Ala Glu Ala
210 215 220
Gly Tyr Thr Thr Asn Glu Pro Ala Gln Leu Cys Val Leu Ser Ala Ile
225 230 235 240
Glu Ala Ala Val Ala His Leu Val Ala Gln Gly Ile Glu Arg Arg Leu
245 250 255
Trp Gln Ala Ala Asp Glu Ser Ser Phe Asp Lys Ser Ser Leu Pro Lys
260 265 270
Tyr Met Ser Gln Ala Arg Ala Asp Pro Leu Pro
275 280
<210> 28
<211> 283
<212> PRT
<213> Artificial Sequence
<220>
<223> mutant 4
<400> 28
Met Asn Arg Leu Leu Ala Met Ala Met Leu Leu Ala Ala Leu Gln Gly
1 5 10 15
Cys Ala Leu Arg Glu Pro Met Pro Ala Glu Leu Glu Asn Ala Thr Pro
20 25 30
Thr Leu Thr Pro Arg Ala Ser Thr Tyr Tyr Asp Leu Leu Ala Leu Pro
35 40 45
Arg Pro Lys Gly Arg Leu Val Ala Ala Val Tyr Gly Phe Arg Asp Leu
50 55 60
Thr Gly Gln Tyr Arg Pro Ser Pro Ala Ser Ala Leu Ser Thr Ala Val
65 70 75 80
Thr Gln Gly Ala Ala Ser Met Leu Val Asp Ala Leu Gln Ala Ser Gly
85 90 95
Trp Phe Arg Val Leu Glu Arg Glu Gly Leu Gln Asn Ile Leu Thr Glu
100 105 110
Arg Lys Ile Ile Arg Ala Ser Gln Asn Lys Pro Asn Thr Pro Val Asn
115 120 125
Ile Gln Ala Asp Leu Pro Ser Leu Gln Ala Ala Asn Ile Leu Leu Glu
130 135 140
Gly Gly Val Ile Ala Tyr Asp Thr Asn Val Arg Ser Gly Gly Glu Gly
145 150 155 160
Ala Ala Tyr Leu Gly Ile Ser Leu Ser Gln Glu Tyr Arg Val Asp Gln
165 170 175
Val Ser Val Asn Leu Arg Ala Val Asp Val Arg Ser Gly Gln Val Leu
180 185 190
Ala Asn Val Met Thr Ser Lys Thr Ile Tyr Ser Val Gly Arg Asn Ala
195 200 205
Gly Ile Phe Lys Phe Ile Glu Phe Lys Glu Leu Leu Glu Ala Glu Ala
210 215 220
Gly Tyr Thr Thr Asn Glu Pro Ala Gln Leu Cys Val Leu Ser Ala Ile
225 230 235 240
Glu Ala Ala Val Ala His Leu Val Ala Gln Gly Ile Glu Arg Arg Leu
245 250 255
Trp Gln Ala Ala Asp Glu Ser Ser Phe Asp Lys Ser Ser Leu Pro Lys
260 265 270
Tyr Met Ser Gln Ala Arg Ala Asp Pro Leu Pro
275 280
<210> 29
<211> 283
<212> PRT
<213> Artificial Sequence
<220>
<223> mutant 5
<400> 29
Met Asn Arg Leu Leu Ala Met Ala Met Leu Leu Ala Ala Leu Gln Gly
1 5 10 15
Cys Ala Leu Arg Glu Pro Met Pro Ala Glu Leu Glu Asn Ala Thr Pro
20 25 30
Thr Leu Thr Pro Arg Ala Ser Thr Tyr Tyr Asp Leu Leu Ala Leu Pro
35 40 45
Arg Pro Lys Gly Arg Leu Val Ala Ala Val Tyr Gly Phe Ser Val Gln
50 55 60
Thr Gly Gln Phe Arg Pro Ser Pro Ala Ser Ala Gln Ser Thr Ser Val
65 70 75 80
Thr Gln Gly Ala Ala Ser Met Leu Val Asp Ala Leu Gln Ala Ser Gly
85 90 95
Trp Phe Arg Val Leu Glu Arg Val Gly Leu Gln Asn Ile Leu Thr Glu
100 105 110
Arg Lys Ile Ile Arg Ala Ser Gln Asn Lys Pro Asn Thr Pro Val Asn
115 120 125
Ile Gln Ala Asp Leu Pro Ser Leu Gln Ala Ala Asn Ile Leu Leu Glu
130 135 140
Gly Gly Val Ile Ala Tyr Asp Thr Asn Val Arg Ser Gly Gly Glu Gly
145 150 155 160
Ala Ala Tyr Leu Gly Ile Ser Leu Ser Gln Glu Tyr Arg Val Asp Gln
165 170 175
Val Ser Val Asn Leu Arg Ala Val Asp Val Arg Ser Gly Gln Val Leu
180 185 190
Ala Asn Val Met Thr Ser Lys Thr Ile Tyr Ser Val Gly Arg Asn Ala
195 200 205
Gly Ile Phe Lys Phe Ile Glu Phe Lys Glu Leu Leu Glu Ala Glu Ala
210 215 220
Gly Tyr Thr Thr Asn Glu Pro Ala Gln Leu Cys Val Leu Ser Ala Ile
225 230 235 240
Glu Ala Ala Val Ala His Leu Val Ala Gln Gly Ile Glu Arg Arg Leu
245 250 255
Trp Gln Ala Ala Asp Glu Ser Ser Phe Asp Lys Ser Ser Leu Pro Lys
260 265 270
Tyr Met Ser Gln Ala Arg Ala Asp Pro Leu Pro
275 280
<210> 30
<211> 283
<212> PRT
<213> Artificial Sequence
<220>
<223> mutant 6
<400> 30
Met Asn Arg Leu Leu Ala Met Ala Met Leu Leu Ala Ala Leu Gln Gly
1 5 10 15
Cys Ala Leu Arg Glu Pro Met Pro Ala Glu Leu Glu Asn Ala Thr Pro
20 25 30
Thr Leu Thr Pro Arg Ala Ser Thr Tyr Tyr Asp Leu Leu Ala Leu Pro
35 40 45
Arg Pro Lys Gly Arg Leu Val Ala Ala Val Tyr Gly Phe Arg Asp Gln
50 55 60
Thr Gly Gln Phe Arg Pro Ser Pro Ala Ser Ala Gln Ser Thr Ser Val
65 70 75 80
Thr Gln Gly Ala Ala Ser Met Leu Val Asp Ala Leu Gln Ala Ser Gly
85 90 95
Trp Phe Arg Val Leu Glu Arg Glu Gly Leu Gln Asn Ile Leu Thr Glu
100 105 110
Arg Lys Ile Ile Arg Ala Ser Gln Asn Lys Pro Asn Thr Pro Val Asn
115 120 125
Ile Gln Ala Asp Leu Pro Ser Leu Gln Ala Ala Asn Ile Leu Leu Glu
130 135 140
Gly Gly Val Ile Ala Tyr Asp Thr Asn Val Arg Ser Gly Gly Glu Gly
145 150 155 160
Ala Ala Tyr Leu Gly Ile Ser Leu Ser Gln Asn Tyr Arg Val Asn Gln
165 170 175
Val Ser Val Asn Leu Arg Ala Val Asp Val Arg Ser Gly Gln Val Leu
180 185 190
Ala Asn Val Met Thr Ser Lys Thr Ile Tyr Ser Val Gly Arg Asn Ala
195 200 205
Gly Ile Phe Lys Phe Ile Glu Phe Lys Glu Leu Leu Glu Ala Glu Ala
210 215 220
Gly Tyr Thr Thr Asn Glu Pro Ala Gln Leu Cys Val Leu Ser Ala Ile
225 230 235 240
Glu Ala Ala Val Ala His Leu Val Ala Gln Gly Ile Glu Arg Arg Leu
245 250 255
Trp Gln Ala Ala Asp Glu Ser Ser Phe Asp Lys Ser Ser Leu Pro Lys
260 265 270
Tyr Met Ser Gln Ala Arg Ala Asp Pro Leu Pro
275 280
Claims (21)
1. A mutant of a porin monomer, wherein the amino acid of the mutant of a porin monomer comprises the sequence set forth in SEQ ID No. 1 or a sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 80%, 70%, 60% or 50% identity thereto, and the amino acid of the mutant of a porin monomer comprises a mutation at one or more positions corresponding to T71, S75, F76 of SEQ ID No. 1.
2. A mutant of a porin monomer as claimed in claim 1, said mutant of a porin monomer having an amino acid comprising a mutation at one or more positions corresponding to 62-175, 62-104, 68-175, 64-79, 71-76, or 69-76 of SEQ ID NO 1.
3. A mutant of a porin monomer as claimed in claim 1 or 2, the amino acids of said mutant of a porin monomer comprising:
(1) (ii) having insertions, deletions and/or substitutions of amino acids at one or more positions corresponding to K69, P70, T71, P72, A73, S74, S75, and F76 of SEQ ID NO 1; (2) (ii) has an amino acid insertion, deletion and/or substitution at one or more positions corresponding to Q64, T65, G66, Q67, Y68, K69, P70, T71, P72, A73, S74, S75, F76, S77, T78, and S79 of SEQ ID NO 1; (3) (ii) has insertions, deletions and/or substitutions of amino acids at one or more positions corresponding to R62, D63, Y68, K69, T71, S75, F76, and E104 of SEQ ID NO 1; or (4) having an insertion, deletion and/or substitution of an amino acid at one or more positions corresponding to Y68, K69, P70, T71, P72, A73, S74, S75, F76, E171 and D175 of SEQ ID NO: 1.
4. A mutant of a porin monomer as claimed in any preceding claim wherein said sequence set forth in SEQ ID NO 1 is derived from Pseudomonas taeniae.
5. A mutant of a porin monomer as claimed in any one of the preceding claims, wherein said mutant of a porin monomer has an amino acid mutation selected from the group consisting of:
(a) amino acid KPTPASSF corresponding to 69-76 of SEQ ID NO. 1 is mutated to M1M2M3M4M5M6M7M8Wherein M is10 to 3 selected from R, K, H; m20 to 1 selected from P; m30 to 10 selected from S, G, C, U, T, M, A, V, L, I; m40 to 1 selected from P; m50 to 5 selected from A, G, V, L, I; m60 to 5 selected from S, C, U, T, M; m70 to 10 selected from A, T, G, V, L, I, S, C, U, M; m80 to 7 selected from Q, D, E, N, K, H, R;
(b) amino acids QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 were mutated to M9M10M11M12M13M14M1 5M16M17M18M19M20M21M22M23M24Wherein M is90 to 5 selected from L, G, A, V, I; m100 to 5 selected from T, S, C, U, M; m110 to 5 selected from G, A, V, L, I; m120 to 4 selected from Q, D, E, N;M130 to 3 selected from Y, F, W; m140 to 3 selected from R, H, K; m150 to 1 selected from P; m160 to 5 selected from S, C, U, T, M; m170 to 1 selected from P; m180 to 5 selected from A, G, V, L, I; m190 to 5 selected from S, C, U, T, M; m200 to 5 selected from A, G, V, L, I; m210 to 9 selected from N, D, E, Q, L, G, A, V, I; m220 to 5 selected from S, C, U, T, M; m230 to 5 selected from T, S, C, U, M; m240 to 5 selected from A, G, V, L, I;
(c) the mutation corresponding to the 62 th position of SEQ ID NO. 1 is 0 to 5 of S, C, U, T, M; the mutation at position 63 is 0 to 5 of V, G, A, L, I; the mutation at position 68 is 0 to 2 of F, W; the mutation at position 69 is 0 to 2 of R, H; the mutation at position 71 is 0 to 4 of S, C, U, M; the mutation at position 75 is 0 to 5 of A, G, V, L, I; the mutation at position 76 is 0 to 4 of Q, D, E, N; the mutation at position 104 is 0 to 5 of V, G, A, L, I; and
(d) amino acids YKPTPASSF corresponding to positions 68-76 of SEQ ID NO. 1 were mutated to M25M26M27M28M29M30M31M32M33And the 171 mutation is 0 to 3 of N, D, Q; mutation at position 175 is 0 to 3 of N, E, Q, wherein M250 to 3 selected from F, Y, W; m260 to 3 selected from R, H, K; m270 to 1 selected from P; m280 to 5 selected from S, C, U, T, M; m290 to 1 selected from P; m300 to 5 selected from A, G, V, L, I; m310 to 5 selected from S, C, U, T, M; m320 to 5 selected from A, G, V, L, I; m330 to 4 selected from Q, D, E, N.
6. A mutant of a porin monomer as claimed in any one of the preceding claims, wherein said mutant of a porin monomer has an amino acid mutation selected from the group consisting of:
(a) amino acid KPTPASSF corresponding to 69-76 bits of SEQ ID NO. 1 is mutated into RPSPASAQ;
(b) amino acid KPTPASSF corresponding to 69-76 bits of SEQ ID NO. 1 is mutated into KPGPASTK;
(c) amino acid QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 was mutated to LTGQYRPSPASANSTA;
(d) amino acid QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO. 1 was mutated to LTGQYRPSPASALSTA;
(e) R62S, D63V, Y68F, K69R, T71S, S75A, F76Q, and E104V corresponding to SEQ ID NO 1; and
(f) amino acids YKPTPASSF corresponding to positions 68-76 of SEQ ID NO. 1 were mutated to FRPSPASAQ and to E171N and D175N of SEQ ID NO. 1.
7. A mutant of a porin monomer as claimed in any one of the preceding claims, wherein the amino acid sequence of said mutant of a porin monomer comprises or consists of SEQ ID NO 24, SEQ ID NO 26, SEQ ID NO 27, SEQ ID NO 28, SEQ ID NO 29 or SEQ ID NO 30.
8. A protein pore comprising at least one mutant of a porin monomer of any one of the preceding claims.
9. The protein pore according to claim 8, wherein the protein pore comprises at least two mutants of the porin monomer.
11. A complex for characterizing a target analyte, characterized by: comprising a protein pore according to any one of claims 8 to 10 and rate-controlling protein bound thereto.
12. A nucleic acid encoding a mutant of a porin monomer of any one of claims 1-7, a protein pore of any one of claims 8-10, or a complex of claim 11.
13. The nucleic acid of claim 12, wherein the nucleotide sequence of the porin monomer is the sequence set forth in SEQ ID NO 2.
14. A vector or a genetically engineered host cell comprising the nucleic acid of any one of claims 12-13.
15. Use of a mutant of a porin monomer of any one of claims 1 to 7, a protein well of any one of claims 8 to 10, a complex of claim 11, a nucleic acid of any one of claims 12 to 13, or a vector or host cell of claim 14, in or for the manufacture of a product for detecting the presence, absence or one or more characteristics of a target analyte.
16. A method of producing a protein pore or polypeptide thereof, comprising transforming a host cell according to claim 14 with a vector comprising claim 14, and inducing the host cell to express a protein pore or polypeptide thereof according to any one of claims 8 to 10.
17. A method for determining the presence, absence or one or more characteristics of a target analyte, comprising:
a. contacting a target analyte with the protein pore of any one of claims 8-10, the complex of claim 11, or the complex of claim 11 such that the target analyte moves relative to the protein pore; and
b. obtaining one or more measurements while the target analyte is moving relative to the protein pore, thereby determining the presence, absence or one or more characteristics of the target analyte.
18. The method of claim 17, wherein the method comprises:
the target analyte interacts with the protein pores present in the membrane such that the target analyte moves relative to the protein pores.
19. A kit for determining the presence, absence or one or more characteristics of a target analyte comprising a mutant of a porin monomer of any one of claims 1 to 7, a protein well of any one of claims 8 to 10, a complex of claim 11, a nucleic acid of any one of claims 12 to 13, or a vector or host of claim 14, and a component of a membrane of claim 18.
20. A device for determining the presence, absence or one or more characteristics of a target analyte comprising a protein pore according to any one of claims 8 to 10 or a complex according to claim 11, and a membrane according to claim 18.
21. The use, method, kit or device of any one of claims 15-20, wherein the target analyte comprises a polysaccharide, a metal ion, an inorganic salt, a polymer, an amino acid, a peptide, a protein, a nucleotide, an oligonucleotide, a polynucleotide, a dye, a drug, a diagnostic agent, an explosive, or an environmental contaminant;
preferably, the target analyte comprises a polynucleotide,
more preferably, the polynucleotide comprises DNA or RNA; and/or, the one or more characteristics are selected from (i) the length of the polynucleotide; (ii) identity of the polynucleotides; (iii) the sequence of the polynucleotide; (iv) (iv) the secondary structure of the polynucleotide and (v) whether the polynucleotide is modified; and/or, the rate controlling protein in the complex comprises a polynucleotide binding protein.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110950708.3A CN113480620B (en) | 2021-08-18 | 2021-08-18 | Mutant of porin monomer, protein hole and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110950708.3A CN113480620B (en) | 2021-08-18 | 2021-08-18 | Mutant of porin monomer, protein hole and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113480620A true CN113480620A (en) | 2021-10-08 |
CN113480620B CN113480620B (en) | 2024-02-06 |
Family
ID=77945885
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110950708.3A Active CN113480620B (en) | 2021-08-18 | 2021-08-18 | Mutant of porin monomer, protein hole and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113480620B (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113754743A (en) * | 2021-10-12 | 2021-12-07 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
CN113773373A (en) * | 2021-10-12 | 2021-12-10 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
CN113896776A (en) * | 2021-10-12 | 2022-01-07 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
CN113912683A (en) * | 2021-10-12 | 2022-01-11 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
WO2023060418A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore, and application thereof |
WO2023060421A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and use thereof |
WO2023060422A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and use thereof |
WO2023060419A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and use thereof |
WO2023060420A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore, and use thereof |
WO2023123370A1 (en) * | 2021-12-31 | 2023-07-06 | 深圳华大生命科学研究院 | Nanopore protein and related use thereof in sequencing |
WO2024078621A1 (en) * | 2022-10-14 | 2024-04-18 | 北京普译生物科技有限公司 | Pht nanopore mutant protein and use thereof |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106459159A (en) * | 2014-05-02 | 2017-02-22 | 牛津纳米孔技术公司 | Mutant pores |
AU2015310913A1 (en) * | 2014-09-01 | 2017-03-16 | Oxford Nanopore Technologies Limited | Mutant CsgG pores |
CN110914290A (en) * | 2017-06-30 | 2020-03-24 | 弗拉芒区生物技术研究所 | Novel protein pores |
CN111164097A (en) * | 2019-09-29 | 2020-05-15 | 北京齐碳科技有限公司 | Mnep monomer variant and application thereof |
CN111164096A (en) * | 2019-09-29 | 2020-05-15 | 北京齐碳科技有限公司 | Mmup monomer variant and application thereof |
CN111499705A (en) * | 2019-01-30 | 2020-08-07 | 深圳市梅丽纳米孔科技有限公司 | Mutant NfpAB nanopore, test system, manufacturing method and application |
-
2021
- 2021-08-18 CN CN202110950708.3A patent/CN113480620B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106459159A (en) * | 2014-05-02 | 2017-02-22 | 牛津纳米孔技术公司 | Mutant pores |
CN106574300A (en) * | 2014-05-02 | 2017-04-19 | 牛津纳米孔技术公司 | Method of improving the movement of a target polynucleotide with respect to a transmembrane pore |
AU2015310913A1 (en) * | 2014-09-01 | 2017-03-16 | Oxford Nanopore Technologies Limited | Mutant CsgG pores |
CN110914290A (en) * | 2017-06-30 | 2020-03-24 | 弗拉芒区生物技术研究所 | Novel protein pores |
CN111499705A (en) * | 2019-01-30 | 2020-08-07 | 深圳市梅丽纳米孔科技有限公司 | Mutant NfpAB nanopore, test system, manufacturing method and application |
CN111164097A (en) * | 2019-09-29 | 2020-05-15 | 北京齐碳科技有限公司 | Mnep monomer variant and application thereof |
CN111164096A (en) * | 2019-09-29 | 2020-05-15 | 北京齐碳科技有限公司 | Mmup monomer variant and application thereof |
Non-Patent Citations (1)
Title |
---|
NCBI: "curli production assembly/transporter CsgG [Pseudomonas taeanensi", 《REFERENCE SEQUENCE: WP_025166647.1》 * |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023060422A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and use thereof |
WO2023060420A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore, and use thereof |
CN113896776A (en) * | 2021-10-12 | 2022-01-07 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
CN113912683A (en) * | 2021-10-12 | 2022-01-11 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
WO2023060418A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore, and application thereof |
WO2023060421A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and use thereof |
CN113773373A (en) * | 2021-10-12 | 2021-12-10 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
WO2023060419A1 (en) * | 2021-10-12 | 2023-04-20 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and use thereof |
CN113754743A (en) * | 2021-10-12 | 2021-12-07 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein pore and application thereof |
CN113773373B (en) * | 2021-10-12 | 2024-02-06 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein hole and application thereof |
CN113754743B (en) * | 2021-10-12 | 2024-02-02 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein hole and application thereof |
CN113912683B (en) * | 2021-10-12 | 2024-02-06 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein hole and application thereof |
CN113896776B (en) * | 2021-10-12 | 2024-02-06 | 成都齐碳科技有限公司 | Mutant of porin monomer, protein hole and application thereof |
WO2023123370A1 (en) * | 2021-12-31 | 2023-07-06 | 深圳华大生命科学研究院 | Nanopore protein and related use thereof in sequencing |
WO2024078621A1 (en) * | 2022-10-14 | 2024-04-18 | 北京普译生物科技有限公司 | Pht nanopore mutant protein and use thereof |
Also Published As
Publication number | Publication date |
---|---|
CN113480620B (en) | 2024-02-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113480620B (en) | Mutant of porin monomer, protein hole and application thereof | |
CN113754743B (en) | Mutant of porin monomer, protein hole and application thereof | |
CN113896776B (en) | Mutant of porin monomer, protein hole and application thereof | |
CN113773373B (en) | Mutant of porin monomer, protein hole and application thereof | |
KR102222191B1 (en) | Mutant pore | |
CN113912683B (en) | Mutant of porin monomer, protein hole and application thereof | |
CN104126018B (en) | Enzymatic process | |
EP2768977B1 (en) | Method of characterizing a target polynucleotide using a pore and a hel308 helicase | |
EP3259281B1 (en) | Hetero-pores | |
CN106459159A (en) | Mutant pores | |
KR20140125874A (en) | Analysis of measurements of a polymer | |
CN107636168A (en) | Method | |
CN113651876B (en) | Mutant of porin monomer, protein hole and application thereof | |
CN113735948B (en) | Mutant of porin monomer, protein hole and application thereof | |
WO2023060419A1 (en) | Mutant of porin monomer, protein pore and use thereof | |
WO2023019470A1 (en) | Mutant of pore protein monomer, protein pore, and use thereof | |
WO2023060418A1 (en) | Mutant of porin monomer, protein pore, and application thereof | |
WO2023050031A1 (en) | Mutant of porin monomer, protein pore and use thereof | |
WO2023060421A1 (en) | Mutant of porin monomer, protein pore and use thereof | |
WO2023060422A1 (en) | Mutant of porin monomer, protein pore and use thereof | |
WO2023060420A1 (en) | Mutant of porin monomer, protein pore, and use thereof | |
WO2023019471A1 (en) | Mutant of porin monomer, protein pore and application thereof | |
CN115960182A (en) | Mutant of porin monomer, protein pore and application thereof | |
EP3511424B1 (en) | Compositions and methods for improving nanopore sequencing | |
CN115698331A (en) | Method for selectively characterizing polynucleotides using a detector |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40057010 Country of ref document: HK |
|
GR01 | Patent grant | ||
GR01 | Patent grant |