WO2021253410A1 - 一种Pif1-like解旋酶及其应用 - Google Patents
一种Pif1-like解旋酶及其应用 Download PDFInfo
- Publication number
- WO2021253410A1 WO2021253410A1 PCT/CN2020/097126 CN2020097126W WO2021253410A1 WO 2021253410 A1 WO2021253410 A1 WO 2021253410A1 CN 2020097126 W CN2020097126 W CN 2020097126W WO 2021253410 A1 WO2021253410 A1 WO 2021253410A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- helicase
- pif1
- seq
- acid
- phenylalanine
- Prior art date
Links
- 108060004795 Methyltransferase Proteins 0.000 title claims abstract description 237
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 114
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 114
- 239000002157 polynucleotide Substances 0.000 claims abstract description 114
- 239000011148 porous material Substances 0.000 claims abstract description 82
- 238000000034 method Methods 0.000 claims abstract description 44
- 235000001014 amino acid Nutrition 0.000 claims description 199
- 229940024606 amino acid Drugs 0.000 claims description 194
- 150000001413 amino acids Chemical class 0.000 claims description 185
- 102000053602 DNA Human genes 0.000 claims description 110
- 108020004414 DNA Proteins 0.000 claims description 110
- 238000006467 substitution reaction Methods 0.000 claims description 64
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 54
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 47
- 229960004441 tyrosine Drugs 0.000 claims description 46
- 239000004472 Lysine Substances 0.000 claims description 44
- 235000018977 lysine Nutrition 0.000 claims description 44
- 239000004475 Arginine Substances 0.000 claims description 42
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 42
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 41
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 40
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 40
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 39
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 39
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 37
- 239000002773 nucleotide Substances 0.000 claims description 37
- 125000003729 nucleotide group Chemical group 0.000 claims description 37
- 229960001230 asparagine Drugs 0.000 claims description 36
- 235000009582 asparagine Nutrition 0.000 claims description 36
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 35
- 229960005190 phenylalanine Drugs 0.000 claims description 35
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 34
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 34
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 33
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 31
- 235000018417 cysteine Nutrition 0.000 claims description 24
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 23
- -1 aromatic amino acids Chemical class 0.000 claims description 22
- 239000002253 acid Substances 0.000 claims description 21
- 229960001153 serine Drugs 0.000 claims description 21
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 20
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 18
- 239000004473 Threonine Substances 0.000 claims description 18
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 18
- 229960002898 threonine Drugs 0.000 claims description 18
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 16
- 230000027455 binding Effects 0.000 claims description 16
- 150000007523 nucleic acids Chemical class 0.000 claims description 16
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 15
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 15
- 239000013604 expression vector Substances 0.000 claims description 15
- 238000005259 measurement Methods 0.000 claims description 15
- 102000039446 nucleic acids Human genes 0.000 claims description 15
- 108020004707 nucleic acids Proteins 0.000 claims description 15
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 14
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 14
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 14
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 14
- 229960000310 isoleucine Drugs 0.000 claims description 14
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 14
- 239000004474 valine Substances 0.000 claims description 14
- 229960004295 valine Drugs 0.000 claims description 14
- 230000004048 modification Effects 0.000 claims description 13
- 238000012986 modification Methods 0.000 claims description 13
- 235000000346 sugar Nutrition 0.000 claims description 13
- 229960003767 alanine Drugs 0.000 claims description 12
- 229910052739 hydrogen Inorganic materials 0.000 claims description 11
- 125000002467 phosphate group Chemical class [H]OP(=O)(O[H])O[*] 0.000 claims description 11
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 10
- 239000007787 solid Substances 0.000 claims description 10
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 9
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 9
- 235000004279 alanine Nutrition 0.000 claims description 9
- 235000003704 aspartic acid Nutrition 0.000 claims description 9
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 9
- 235000013922 glutamic acid Nutrition 0.000 claims description 9
- 239000004220 glutamic acid Substances 0.000 claims description 9
- 239000004471 Glycine Substances 0.000 claims description 8
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 claims description 8
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 7
- QLUTZSRKFWJIGM-QMMMGPOBSA-N (2r)-2-azaniumyl-3-[(2-nitrophenyl)methylsulfanyl]propanoate Chemical compound OC(=O)[C@@H](N)CSCC1=CC=CC=C1[N+]([O-])=O QLUTZSRKFWJIGM-QMMMGPOBSA-N 0.000 claims description 6
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 5
- 229930182817 methionine Natural products 0.000 claims description 5
- 230000003287 optical effect Effects 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 4
- 102220519288 Conserved oligomeric Golgi complex subunit 3_H75N_mutation Human genes 0.000 claims description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 4
- 102220535078 Lysophospholipase_H75Q_mutation Human genes 0.000 claims description 4
- 230000009881 electrostatic interaction Effects 0.000 claims description 4
- 239000001257 hydrogen Substances 0.000 claims description 4
- 230000003993 interaction Effects 0.000 claims description 4
- 102220209728 rs1057523598 Human genes 0.000 claims description 4
- 102200078878 rs490262 Human genes 0.000 claims description 4
- 102220056395 rs730880037 Human genes 0.000 claims description 4
- 239000004149 tartrazine Substances 0.000 claims description 4
- DTERQYGMUDWYAZ-SSDOTTSWSA-N (2r)-6-acetamido-2-azaniumylhexanoate Chemical compound CC(=O)NCCCC[C@@H]([NH3+])C([O-])=O DTERQYGMUDWYAZ-SSDOTTSWSA-N 0.000 claims description 3
- POGSZHUEECCEAP-ZETCQYMHSA-N (2s)-2-amino-3-(3-amino-4-hydroxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(N)=C1 POGSZHUEECCEAP-ZETCQYMHSA-N 0.000 claims description 3
- ZHUOMTMPTNZOJE-VIFPVBQESA-N (2s)-2-amino-3-(3-cyanophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(C#N)=C1 ZHUOMTMPTNZOJE-VIFPVBQESA-N 0.000 claims description 3
- PEMUHKUIQHFMTH-QMMMGPOBSA-N (2s)-2-amino-3-(4-bromophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(Br)C=C1 PEMUHKUIQHFMTH-QMMMGPOBSA-N 0.000 claims description 3
- KWIPUXXIFQQMKN-VIFPVBQESA-N (2s)-2-amino-3-(4-cyanophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(C#N)C=C1 KWIPUXXIFQQMKN-VIFPVBQESA-N 0.000 claims description 3
- PHUOJEKTSKQBNT-NSHDSACASA-N (2s)-2-amino-3-(4-prop-2-enoxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OCC=C)C=C1 PHUOJEKTSKQBNT-NSHDSACASA-N 0.000 claims description 3
- RKXGYYMTJRQUQQ-NSHDSACASA-N (2s)-2-amino-3-(4-propan-2-ylsulfanylcarbonylphenyl)propanoic acid Chemical compound CC(C)SC(=O)C1=CC=C(C[C@H](N)C(O)=O)C=C1 RKXGYYMTJRQUQQ-NSHDSACASA-N 0.000 claims description 3
- LJHYWUVYIKCPGU-VIFPVBQESA-N (2s)-2-amino-3-[4-(carboxymethyl)phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(CC(O)=O)C=C1 LJHYWUVYIKCPGU-VIFPVBQESA-N 0.000 claims description 3
- AWYGHHYADLYNRW-RGURZIINSA-N (2s)-2-amino-3-[4-[(2-amino-3-sulfanylpropanoyl)amino]phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(NC(=O)C(N)CS)C=C1 AWYGHHYADLYNRW-RGURZIINSA-N 0.000 claims description 3
- NLFOHNAFILVHGM-AWEZNQCLSA-N (2s)-2-amino-3-[4-[(2-nitrophenyl)methoxy]phenyl]propanoic acid Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1OCC1=CC=CC=C1[N+]([O-])=O NLFOHNAFILVHGM-AWEZNQCLSA-N 0.000 claims description 3
- QEQAKQQRJFWPOR-JTQLQIEISA-N (2s)-2-amino-4-(7-hydroxy-2-oxochromen-4-yl)butanoic acid Chemical compound C1=C(O)C=CC2=C1OC(=O)C=C2CC[C@H](N)C(O)=O QEQAKQQRJFWPOR-JTQLQIEISA-N 0.000 claims description 3
- RPLCQQYRZLXMKL-ZETCQYMHSA-N (2s)-2-amino-6-(2-azidoethoxycarbonylamino)hexanoic acid Chemical compound OC(=O)[C@@H](N)CCCCNC(=O)OCCN=[N+]=[N-] RPLCQQYRZLXMKL-ZETCQYMHSA-N 0.000 claims description 3
- FQFIGHYUHCWZRZ-JTQLQIEISA-N (2s)-2-amino-6-(cyclopentanecarbonylamino)hexanoic acid Chemical compound OC(=O)[C@@H](N)CCCCNC(=O)C1CCCC1 FQFIGHYUHCWZRZ-JTQLQIEISA-N 0.000 claims description 3
- KRFMMSZGIQEBIJ-QMMMGPOBSA-N (2s)-2-amino-6-(prop-2-ynoxycarbonylamino)hexanoic acid Chemical compound OC(=O)[C@@H](N)CCCCNC(=O)OCC#C KRFMMSZGIQEBIJ-QMMMGPOBSA-N 0.000 claims description 3
- WORDWOPJMYWZSB-NSHDSACASA-N (2s)-2-amino-6-[(2-nitrophenyl)methoxycarbonylamino]hexanoic acid Chemical compound OC(=O)[C@@H](N)CCCCNC(=O)OCC1=CC=CC=C1[N+]([O-])=O WORDWOPJMYWZSB-NSHDSACASA-N 0.000 claims description 3
- LJLCMMNKTZXLLX-DTWKUNHWSA-N (2s)-2-amino-6-[[(2r)-oxolane-2-carbonyl]amino]hexanoic acid Chemical compound OC(=O)[C@@H](N)CCCCNC(=O)[C@H]1CCCO1 LJLCMMNKTZXLLX-DTWKUNHWSA-N 0.000 claims description 3
- SDZGVFSSLGTJAJ-ZETCQYMHSA-N (2s)-2-azaniumyl-3-(2-nitrophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1[N+]([O-])=O SDZGVFSSLGTJAJ-ZETCQYMHSA-N 0.000 claims description 3
- NEMHIKRLROONTL-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-azidophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N=[N+]=[N-])C=C1 NEMHIKRLROONTL-QMMMGPOBSA-N 0.000 claims description 3
- ZXSBHXZKWRIEIA-JTQLQIEISA-N (2s)-3-(4-acetylphenyl)-2-azaniumylpropanoate Chemical compound CC(=O)C1=CC=C(C[C@H](N)C(O)=O)C=C1 ZXSBHXZKWRIEIA-JTQLQIEISA-N 0.000 claims description 3
- CMUHFUGDYMFHEI-UHFFFAOYSA-N -2-Amino-3-94-aminophenyl)propanoic acid Natural products OC(=O)C(N)CC1=CC=C(N)C=C1 CMUHFUGDYMFHEI-UHFFFAOYSA-N 0.000 claims description 3
- LUTLAXLNPLZCOF-UHFFFAOYSA-N 1-Methylhistidine Natural products OC(=O)C(N)(C)CC1=NC=CN1 LUTLAXLNPLZCOF-UHFFFAOYSA-N 0.000 claims description 3
- XLOULZPUVVVWES-UHFFFAOYSA-N 2-amino-3-(8-hydroxyquinolin-3-yl)propanoic acid Chemical compound OC1=CC=CC2=CC(CC(N)C(O)=O)=CN=C21 XLOULZPUVVVWES-UHFFFAOYSA-N 0.000 claims description 3
- RZGAAVNNBWANKD-UHFFFAOYSA-N 2-amino-3-[[5-(dimethylamino)naphthalen-1-yl]sulfonylamino]propanoic acid Chemical compound C1=CC=C2C(N(C)C)=CC=CC2=C1S(=O)(=O)NCC(N)C(O)=O RZGAAVNNBWANKD-UHFFFAOYSA-N 0.000 claims description 3
- JINGUCXQUOKWKH-UHFFFAOYSA-N 2-aminodecanoic acid Chemical compound CCCCCCCCC(N)C(O)=O JINGUCXQUOKWKH-UHFFFAOYSA-N 0.000 claims description 3
- YZXUCQCJZKJMIR-UHFFFAOYSA-N 2-azaniumyl-3-[4-(trifluoromethoxy)phenyl]propanoate Chemical compound OC(=O)C(N)CC1=CC=C(OC(F)(F)F)C=C1 YZXUCQCJZKJMIR-UHFFFAOYSA-N 0.000 claims description 3
- CKGCFBNYQJDIGS-UHFFFAOYSA-N 2-azaniumyl-6-(phenylmethoxycarbonylamino)hexanoate Chemical compound OC(=O)C(N)CCCCNC(=O)OCC1=CC=CC=C1 CKGCFBNYQJDIGS-UHFFFAOYSA-N 0.000 claims description 3
- JVPFOKXICYJJSC-UHFFFAOYSA-N 2-azaniumylnonanoate Chemical compound CCCCCCCC(N)C(O)=O JVPFOKXICYJJSC-UHFFFAOYSA-N 0.000 claims description 3
- UQTZMGFTRHFAAM-ZETCQYMHSA-N 3-iodo-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(I)=C1 UQTZMGFTRHFAAM-ZETCQYMHSA-N 0.000 claims description 3
- FBTSQILOGYXGMD-LURJTMIESA-N 3-nitro-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C([N+]([O-])=O)=C1 FBTSQILOGYXGMD-LURJTMIESA-N 0.000 claims description 3
- CMUHFUGDYMFHEI-QMMMGPOBSA-N 4-amino-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N)C=C1 CMUHFUGDYMFHEI-QMMMGPOBSA-N 0.000 claims description 3
- XWHHYOYVRVGJJY-QMMMGPOBSA-N 4-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-QMMMGPOBSA-N 0.000 claims description 3
- PZNQZSRPDOEBMS-QMMMGPOBSA-N 4-iodo-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(I)C=C1 PZNQZSRPDOEBMS-QMMMGPOBSA-N 0.000 claims description 3
- OUYCCCASQSFEME-MRVPVSSYSA-N D-tyrosine Chemical compound OC(=O)[C@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-MRVPVSSYSA-N 0.000 claims description 3
- 229930195709 D-tyrosine Natural products 0.000 claims description 3
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 claims description 3
- FFFHZYDWPBMWHY-UHFFFAOYSA-N L-Homocysteine Natural products OC(=O)C(N)CCS FFFHZYDWPBMWHY-UHFFFAOYSA-N 0.000 claims description 3
- FBWIRBFZWNIGJC-LURJTMIESA-N L-dihomomethionine zwitterion Chemical compound CSCCCC[C@H](N)C(O)=O FBWIRBFZWNIGJC-LURJTMIESA-N 0.000 claims description 3
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical compound OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 claims description 3
- SFSJZXMDTNDWIX-YFKPBYRVSA-N L-homomethionine Chemical compound CSCCC[C@H](N)C(O)=O SFSJZXMDTNDWIX-YFKPBYRVSA-N 0.000 claims description 3
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 claims description 3
- BRMWTNUJHUMWMS-LURJTMIESA-N N(tele)-methyl-L-histidine Chemical compound CN1C=NC(C[C@H](N)C(O)=O)=C1 BRMWTNUJHUMWMS-LURJTMIESA-N 0.000 claims description 3
- GEYBMYRBIABFTA-VIFPVBQESA-N O-methyl-L-tyrosine Chemical compound COC1=CC=C(C[C@H](N)C(O)=O)C=C1 GEYBMYRBIABFTA-VIFPVBQESA-N 0.000 claims description 3
- PGNICAOCNIVZRV-NSHDSACASA-N [2-[[(5s)-5-amino-5-carboxypentyl]carbamoyloxymethyl]phenyl]-diazonioazanide Chemical compound OC(=O)[C@@H](N)CCCCNC(=O)OCC1=CC=CC=C1[N-][N+]#N PGNICAOCNIVZRV-NSHDSACASA-N 0.000 claims description 3
- JCZLABDVDPYLRZ-AWEZNQCLSA-N biphenylalanine Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1C1=CC=CC=C1 JCZLABDVDPYLRZ-AWEZNQCLSA-N 0.000 claims description 3
- 229960004502 levodopa Drugs 0.000 claims description 3
- TVIDEEHSOPHZBR-AWEZNQCLSA-N para-(benzoyl)-phenylalanine Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1C(=O)C1=CC=CC=C1 TVIDEEHSOPHZBR-AWEZNQCLSA-N 0.000 claims description 3
- JPZXHKDZASGCLU-LBPRGKRZSA-N β-(2-naphthyl)-alanine Chemical compound C1=CC=CC2=CC(C[C@H](N)C(O)=O)=CC=C21 JPZXHKDZASGCLU-LBPRGKRZSA-N 0.000 claims description 3
- AKVBCGQVQXPRLD-SSDOTTSWSA-N (2r)-2-azaniumyloctanoate Chemical compound CCCCCC[C@@H]([NH3+])C([O-])=O AKVBCGQVQXPRLD-SSDOTTSWSA-N 0.000 claims description 2
- HIAVWJOQCVNAQC-LBPRGKRZSA-N (2s)-2-amino-3-(naphthalen-2-ylamino)propanoic acid Chemical compound C1=CC=CC2=CC(NC[C@H](N)C(O)=O)=CC=C21 HIAVWJOQCVNAQC-LBPRGKRZSA-N 0.000 claims description 2
- HYSPNOMZFGNKBR-QMMMGPOBSA-N (2s)-2-amino-3-[(4,5-dimethoxy-2-nitrophenyl)methoxy]propanoic acid Chemical compound COC1=CC(COC[C@H](N)C(O)=O)=C([N+]([O-])=O)C=C1OC HYSPNOMZFGNKBR-QMMMGPOBSA-N 0.000 claims description 2
- VVQIIIAZJXTLRE-QMMMGPOBSA-N (2s)-2-amino-6-[(2-methylpropan-2-yl)oxycarbonylamino]hexanoic acid Chemical compound CC(C)(C)OC(=O)NCCCC[C@H](N)C(O)=O VVQIIIAZJXTLRE-QMMMGPOBSA-N 0.000 claims description 2
- HBMWPJLCTYKAGL-YFKPBYRVSA-N (2s)-2-amino-6-sulfanylhexanoic acid Chemical compound OC(=O)[C@@H](N)CCCCS HBMWPJLCTYKAGL-YFKPBYRVSA-N 0.000 claims description 2
- GTVVZTAFGPQSPC-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-nitrophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C([N+]([O-])=O)C=C1 GTVVZTAFGPQSPC-QMMMGPOBSA-N 0.000 claims description 2
- CYHRSNOITZHLJN-NSHDSACASA-N (2s)-2-azaniumyl-3-(4-propan-2-ylphenyl)propanoate Chemical compound CC(C)C1=CC=C(C[C@H](N)C(O)=O)C=C1 CYHRSNOITZHLJN-NSHDSACASA-N 0.000 claims description 2
- NKQWPQYVVKUEPV-QMMMGPOBSA-N (2s)-2-hydroxy-6-[(2-methylpropan-2-yl)oxycarbonylamino]hexanoic acid Chemical compound CC(C)(C)OC(=O)NCCCC[C@H](O)C(O)=O NKQWPQYVVKUEPV-QMMMGPOBSA-N 0.000 claims description 2
- XKZCXMNMUMGDJG-AWEZNQCLSA-N (2s)-3-[(6-acetylnaphthalen-2-yl)amino]-2-aminopropanoic acid Chemical compound C1=C(NC[C@H](N)C(O)=O)C=CC2=CC(C(=O)C)=CC=C21 XKZCXMNMUMGDJG-AWEZNQCLSA-N 0.000 claims description 2
- JVGVDSSUAVXRDY-MRVPVSSYSA-N (R)-3-(4-hydroxyphenyl)lactic acid Chemical compound OC(=O)[C@H](O)CC1=CC=C(O)C=C1 JVGVDSSUAVXRDY-MRVPVSSYSA-N 0.000 claims description 2
- 239000004230 Fast Yellow AB Substances 0.000 claims description 2
- 102220542423 Gliomedin_K97H_mutation Human genes 0.000 claims description 2
- LCGISIDBXHGCDW-VKHMYHEASA-N L-glutamine amide Chemical compound NC(=O)[C@@H](N)CCC(N)=O LCGISIDBXHGCDW-VKHMYHEASA-N 0.000 claims description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 claims description 2
- VIIDDSZOUXRNDY-IUCAKERBSA-N OC(=O)[C@@H](N)CCCCNC(=O)[C@@H]1CCCN1 Chemical compound OC(=O)[C@@H](N)CCCCNC(=O)[C@@H]1CCCN1 VIIDDSZOUXRNDY-IUCAKERBSA-N 0.000 claims description 2
- 229910019142 PO4 Inorganic materials 0.000 claims description 2
- 102220520962 Serine/threonine-protein phosphatase 2A regulatory subunit B'' subunit alpha_K97Q_mutation Human genes 0.000 claims description 2
- 239000004234 Yellow 2G Substances 0.000 claims description 2
- 125000005336 allyloxy group Chemical group 0.000 claims description 2
- 125000004432 carbon atom Chemical group C* 0.000 claims description 2
- 239000004172 quinoline yellow Substances 0.000 claims description 2
- 102220497826 Tumor protein 63_S94T_mutation Human genes 0.000 claims 2
- BUGBHKTXTAQXES-UHFFFAOYSA-N Selenium Chemical compound [Se] BUGBHKTXTAQXES-UHFFFAOYSA-N 0.000 claims 1
- 125000001626 borono group Chemical group [H]OB([*])O[H] 0.000 claims 1
- 125000001511 cyclopentyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 claims 1
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethanethiol Chemical compound CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 claims 1
- 235000021317 phosphate Nutrition 0.000 claims 1
- 239000001294 propane Substances 0.000 claims 1
- 229910052711 selenium Inorganic materials 0.000 claims 1
- 239000011669 selenium Substances 0.000 claims 1
- 238000012163 sequencing technique Methods 0.000 abstract description 8
- 230000008569 process Effects 0.000 abstract description 3
- 125000003275 alpha amino acid group Chemical group 0.000 description 17
- 241000187480 Mycobacterium smegmatis Species 0.000 description 12
- 239000012528 membrane Substances 0.000 description 12
- 230000035772 mutation Effects 0.000 description 12
- 239000000758 substrate Substances 0.000 description 12
- 239000000872 buffer Substances 0.000 description 11
- 230000000295 complement effect Effects 0.000 description 10
- 108010013381 Porins Proteins 0.000 description 9
- 230000001276 controlling effect Effects 0.000 description 9
- 102000007739 porin activity proteins Human genes 0.000 description 9
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 125000006850 spacer group Chemical group 0.000 description 8
- 239000000232 Lipid Bilayer Substances 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 230000005945 translocation Effects 0.000 description 7
- 102000014914 Carrier Proteins Human genes 0.000 description 6
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 6
- 108091008324 binding proteins Proteins 0.000 description 6
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 6
- 150000001945 cysteines Chemical class 0.000 description 6
- GYOZYWVXFNDGLU-XLPZGREQSA-N dTMP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 GYOZYWVXFNDGLU-XLPZGREQSA-N 0.000 description 6
- 229910052721 tungsten Inorganic materials 0.000 description 6
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 229920001184 polypeptide Polymers 0.000 description 5
- 108090000765 processed proteins & peptides Proteins 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 108090000623 proteins and genes Proteins 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 229910052727 yttrium Inorganic materials 0.000 description 5
- 239000007995 HEPES buffer Substances 0.000 description 4
- 239000001103 potassium chloride Substances 0.000 description 4
- 235000011164 potassium chloride Nutrition 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- VLSDXINSOMDCBK-BQYQJAHWSA-N (E)-1,1'-azobis(N,N-dimethylformamide) Chemical compound CN(C)C(=O)\N=N\C(=O)N(C)C VLSDXINSOMDCBK-BQYQJAHWSA-N 0.000 description 3
- KHWCHTKSEGGWEX-RRKCRQDMSA-N 2'-deoxyadenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 KHWCHTKSEGGWEX-RRKCRQDMSA-N 0.000 description 3
- NCMVOABPESMRCP-SHYZEUOFSA-N 2'-deoxycytosine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 NCMVOABPESMRCP-SHYZEUOFSA-N 0.000 description 3
- LTFMZDNNPPEQNG-KVQBGUIXSA-N 2'-deoxyguanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 LTFMZDNNPPEQNG-KVQBGUIXSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 108010006464 Hemolysin Proteins Proteins 0.000 description 3
- 102000015696 Interleukins Human genes 0.000 description 3
- 108010063738 Interleukins Proteins 0.000 description 3
- 102000004895 Lipoproteins Human genes 0.000 description 3
- 108090001030 Lipoproteins Proteins 0.000 description 3
- 101710126949 Lysin Proteins 0.000 description 3
- 241000588653 Neisseria Species 0.000 description 3
- 101710203389 Outer membrane porin F Proteins 0.000 description 3
- 101710203388 Outer membrane porin G Proteins 0.000 description 3
- 101710132190 Porin B Proteins 0.000 description 3
- 101710132227 Porin D Proteins 0.000 description 3
- DJJCXFVJDGTHFX-UHFFFAOYSA-N Uridinemonophosphate Natural products OC1C(O)C(COP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-UHFFFAOYSA-N 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000003054 catalyst Substances 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 238000010382 chemical cross-linking Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 235000012000 cholesterol Nutrition 0.000 description 3
- 239000003431 cross linking reagent Substances 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 238000012921 fluorescence analysis Methods 0.000 description 3
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 3
- 235000013928 guanylic acid Nutrition 0.000 description 3
- 239000003228 hemolysin Substances 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- 229910001629 magnesium chloride Inorganic materials 0.000 description 3
- 108010014203 outer membrane phospholipase A Proteins 0.000 description 3
- 235000018102 proteins Nutrition 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 102220214673 rs1060502033 Human genes 0.000 description 3
- DJJCXFVJDGTHFX-XVFCMESISA-N uridine 5'-monophosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-XVFCMESISA-N 0.000 description 3
- 101150011953 wza gene Proteins 0.000 description 3
- VPRPVNNXDCMVQT-JTQLQIEISA-N (2s)-2-amino-3-(4-ethylsulfanylcarbonylphenyl)propanoic acid Chemical compound CCSC(=O)C1=CC=C(C[C@H](N)C(O)=O)C=C1 VPRPVNNXDCMVQT-JTQLQIEISA-N 0.000 description 2
- VTERJWKRLSSHIC-QMMMGPOBSA-N (2s)-2-amino-3-[(2-nitrophenyl)methoxy]propanoic acid Chemical compound OC(=O)[C@@H](N)COCC1=CC=CC=C1[N+]([O-])=O VTERJWKRLSSHIC-QMMMGPOBSA-N 0.000 description 2
- BZTDTCNHAFUJOG-UHFFFAOYSA-N 6-carboxyfluorescein Chemical compound C12=CC=C(O)C=C2OC2=CC(O)=CC=C2C11OC(=O)C2=CC=C(C(=O)O)C=C21 BZTDTCNHAFUJOG-UHFFFAOYSA-N 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 102220587362 Cellular tumor antigen p53_S94T_mutation Human genes 0.000 description 2
- 108091006146 Channels Proteins 0.000 description 2
- IVOMOUWHDPKRLL-KQYNXXCUSA-N Cyclic adenosine monophosphate Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-KQYNXXCUSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108091093094 Glycol nucleic acid Proteins 0.000 description 2
- 102100022536 Helicase POLQ-like Human genes 0.000 description 2
- 101000899334 Homo sapiens Helicase POLQ-like Proteins 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 108091046915 Threose nucleic acid Proteins 0.000 description 2
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 2
- 150000001345 alkine derivatives Chemical class 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 229920001400 block copolymer Polymers 0.000 description 2
- ZOOGRGPOEVQQDX-KHLHZJAASA-N cyclic guanosine monophosphate Chemical compound C([C@H]1O2)O[P@](O)(=O)O[C@@H]1[C@H](O)[C@H]2N1C(N=C(NC2=O)N)=C2N=C1 ZOOGRGPOEVQQDX-KHLHZJAASA-N 0.000 description 2
- 125000001887 cyclopentyloxy group Chemical group C1(CCCC1)O* 0.000 description 2
- JSRLJPSBLDHEIO-SHYZEUOFSA-N dUMP Chemical compound O1[C@H](COP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 JSRLJPSBLDHEIO-SHYZEUOFSA-N 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000007672 fourth generation sequencing Methods 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 239000004615 ingredient Substances 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000002052 molecular layer Substances 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229910052700 potassium Inorganic materials 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- QZAYGJVTTNCVMB-UHFFFAOYSA-N serotonin Chemical compound C1=C(O)C=C2C(CCN)=CNC2=C1 QZAYGJVTTNCVMB-UHFFFAOYSA-N 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- KZNICNPSHKQLFF-UHFFFAOYSA-N succinimide Chemical compound O=C1CCC(=O)N1 KZNICNPSHKQLFF-UHFFFAOYSA-N 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 230000036962 time dependent Effects 0.000 description 2
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 1
- TUMGFDAVJXBHMU-BYPYZUCNSA-N (2s)-2-amino-5-sulfanylpentanoic acid Chemical compound OC(=O)[C@@H](N)CCCS TUMGFDAVJXBHMU-BYPYZUCNSA-N 0.000 description 1
- YZJSUQQZGCHHNQ-BYPYZUCNSA-N (2s)-6-amino-2-azaniumyl-6-oxohexanoate Chemical compound OC(=O)[C@@H](N)CCCC(N)=O YZJSUQQZGCHHNQ-BYPYZUCNSA-N 0.000 description 1
- QRZUPJILJVGUFF-UHFFFAOYSA-N 2,8-dibenzylcyclooctan-1-one Chemical compound C1CCCCC(CC=2C=CC=CC=2)C(=O)C1CC1=CC=CC=C1 QRZUPJILJVGUFF-UHFFFAOYSA-N 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 108010011485 Aspartame Proteins 0.000 description 1
- RODZBUUMGRIALV-SSDOTTSWSA-N C1(O)=CC=C(C[C@@H](C)O)C=C1 Chemical compound C1(O)=CC=C(C[C@@H](C)O)C=C1 RODZBUUMGRIALV-SSDOTTSWSA-N 0.000 description 1
- WWZKQHOCKIZLMA-UHFFFAOYSA-N Caprylic acid Natural products CCCCCCCC(O)=O WWZKQHOCKIZLMA-UHFFFAOYSA-N 0.000 description 1
- UDMBCSSLTHHNCD-UHFFFAOYSA-N Coenzym Q(11) Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(O)=O)C(O)C1O UDMBCSSLTHHNCD-UHFFFAOYSA-N 0.000 description 1
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 1
- 230000007067 DNA methylation Effects 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 101150113153 PIF1 gene Proteins 0.000 description 1
- YGYAWVDWMABLBF-UHFFFAOYSA-N Phosgene Chemical compound ClC(Cl)=O YGYAWVDWMABLBF-UHFFFAOYSA-N 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 230000006093 RNA methylation Effects 0.000 description 1
- 102000001218 Rec A Recombinases Human genes 0.000 description 1
- 108010055016 Rec A Recombinases Proteins 0.000 description 1
- 229910021607 Silver chloride Inorganic materials 0.000 description 1
- 229930182558 Sterol Natural products 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- LNQVTSROQXJCDD-UHFFFAOYSA-N adenosine monophosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(CO)C(OP(O)(O)=O)C1O LNQVTSROQXJCDD-UHFFFAOYSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 239000000605 aspartame Substances 0.000 description 1
- IAOZJIPTCAWIRG-QWRGUYRKSA-N aspartame Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)OC)CC1=CC=CC=C1 IAOZJIPTCAWIRG-QWRGUYRKSA-N 0.000 description 1
- 229960003438 aspartame Drugs 0.000 description 1
- 235000010357 aspartame Nutrition 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- UHOVQNZJYSORNB-UHFFFAOYSA-N benzene Substances C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 1
- GONOPSZTUGRENK-UHFFFAOYSA-N benzyl(trichloro)silane Chemical compound Cl[Si](Cl)(Cl)CC1=CC=CC=C1 GONOPSZTUGRENK-UHFFFAOYSA-N 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 239000002041 carbon nanotube Substances 0.000 description 1
- 229910021393 carbon nanotube Inorganic materials 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Natural products NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- ROSVUDLPLHNVHN-UHFFFAOYSA-N dibenzocyclooctynol Chemical compound C1#CCCC2=CC=CC=C2C2=C1C=CC=C2O ROSVUDLPLHNVHN-UHFFFAOYSA-N 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 230000007831 electrophysiology Effects 0.000 description 1
- 238000002001 electrophysiology Methods 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 230000005669 field effect Effects 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000000198 fluorescence anisotropy Methods 0.000 description 1
- 238000002795 fluorescence method Methods 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 229960002449 glycine Drugs 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 238000002847 impedance measurement Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- FUZZWVXGSFPDMH-UHFFFAOYSA-N n-hexanoic acid Natural products CCCCCC(O)=O FUZZWVXGSFPDMH-UHFFFAOYSA-N 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 229950004354 phosphorylcholine Drugs 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 239000004810 polytetrafluoroethylene Substances 0.000 description 1
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 125000002572 propoxy group Chemical group [*]OC([H])([H])C(C([H])([H])[H])([H])[H] 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 229930195734 saturated hydrocarbon Natural products 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 1
- 235000016491 selenocysteine Nutrition 0.000 description 1
- 229940055619 selenocysteine Drugs 0.000 description 1
- 229940076279 serotonin Drugs 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- HKZLPVFGJNLROG-UHFFFAOYSA-M silver monochloride Chemical compound [Cl-].[Ag+] HKZLPVFGJNLROG-UHFFFAOYSA-M 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 150000003432 sterols Chemical class 0.000 description 1
- 235000003702 sterols Nutrition 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 229960002317 succinimide Drugs 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 230000005641 tunneling Effects 0.000 description 1
- 229930195735 unsaturated hydrocarbon Natural products 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y306/00—Hydrolases acting on acid anhydrides (3.6)
- C12Y306/04—Hydrolases acting on acid anhydrides (3.6) acting on acid anhydrides; involved in cellular and subcellular movement (3.6.4)
- C12Y306/04012—DNA helicase (3.6.4.12)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/35—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Mycobacteriaceae (F)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Definitions
- the present invention relates to the technical fields of gene sequencing, molecular detection and clinical detection, in particular to a modified Pif1-like helicase, a construct containing Pif1-like helicase, and its use in characterizing target polynucleotides or controlling targets Application in the movement of polynucleotides through pores.
- Nanopore sequencing technology refers to a gene sequencing technology that uses a single nucleic acid molecule as a measurement unit and uses nanopores to read its sequence information in real time and continuously.
- Electrophoresis drives the polynucleotide through the nanopore, and since the polynucleotide passes through the nanopore, the current passing through the nanopore can be reduced. Each passing nucleotide or series of nucleotides obtains a characteristic current, and the record of the current level corresponds to the polynucleotide sequence.
- strand sequencing methods (such as the use of a helicase to control the movement of polynucleotides through a pore), a single polynucleotide chain passes through the pore and enables identification of nucleotides.
- the advantage of this sequencing technology is that it is simple to build a library and does not require amplification; it has a fast reading speed, which can reach a reading speed of tens of thousands of bases per hour for a single molecule; the read length is very long, usually thousands of bases; it is possible It can directly measure RNA and DNA methylation. These are beyond the reach of the existing second-generation sequencing technology.
- nanopore sequencing technology also has thorny problems that need to be solved.
- the translocation of polynucleotides through the nanopore is so fast that the current level of a single nucleotide is too short to be distinguished.
- the molecular motor that controls the movement of the polynucleotide may be free from the polynucleotide. This allows the polynucleotide to be quickly pulled through the pore in an uncontrolled manner in the direction of the applied field.
- patent WO2013057495A3 discloses a new method for characterizing target polynucleotides. The method includes controlling the passage of target polynucleotides through Hel308 helicase or molecular motors. The movement of the hole.
- Patent US20150065354A1 discloses a method for characterizing target polynucleotides using XPD helicase, which includes controlling the movement of target polynucleotides through pores by XPD helicase.
- Patent CN107109380A discloses a modified enzyme, which is a modified Dda helicase that can control the movement of the target polynucleotide through the pore.
- the present invention provides a new modified Pif1-like helicase.
- the modified Pif1-like helicase can be maintained for a longer time. It binds to polynucleotides and controls the movement of polynucleotides through the pores.
- the Pif1-like helicase of the present invention is a useful tool for controlling the movement of polynucleotides in the process of chain sequencing. It can make the polynucleotides follow or counter the electric field caused by the applied voltage in a controlled and stepwise manner. Perform movement to control the speed at which the polynucleotide passes through the nanopore and obtain a recognizable current level.
- the Pif1-like helicase of the present invention will still not recover from the polynucleoside Acid-free, that is, it is particularly effective for controlling the movement of 500, 1000, 5000, 10000, 20000, 50000, 100000 or more polynucleotides. Based on the advantages of the Pif1-like helicase of the present invention in the field of chain sequencing to control the smooth movement of polynucleotides from the pore and reduce slippage or irregular movement, it can promote more accurate nucleotides and longer read lengths.
- a Pif1-like helicase including at least one cysteine introduced into the tower domain, pin domain and/or 1A domain (RecA type motor) of the Pif1-like helicase Amino acid residues and/or at least one unnatural amino acid, wherein the helicase retains its ability to control the movement of the polynucleotide.
- it includes introducing at least one cysteine residue and/or at least one unnatural amino acid in any of the following groups:
- 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or more cysteine residues can be introduced into the tower domain, pin domain and/or 1A domain , Or introduce 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or more unnatural amino acids, or introduce 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 or more cysteine residues and unnatural amino acids.
- the introduction of cysteine residues and/or at least one unnatural amino acid makes the binding of the Pif1-like helicase to the polynucleotide more stable and enhances the ability to control its movement.
- the Pif1-like helicase is selected from Pba-PM2, Aph-Acj61, Aph-PX29, Avi-Aeh1, Sph-CBH8, Eph-Pei26, Aph-AM101, PphPspYZU05, Eph-EcS1, Eph- Cronus2 or Mph-MP1.
- the wild-type Pif1-like helicase is shown in Table 1.
- Table 2 identifies the residues that constitute each domain in each Pif1-like helicase
- the Pif1-like helicase is selected from Mph-MP1, Sph-CBH8, Eph-Pei26 or PphPspYZU05.
- the Pif1-like helicase includes a variant of SEQ ID NO: 11, in which the tower domain (residues E264-P278 and N296-P389) and/or the pin domain (residues K97-A113) ) And/or 1A domain (residues M1-L96 and P114-K184) introduced at least one cysteine residue and/or at least one unnatural amino acid.
- the Pif1-like helicase includes a variant of SEQ ID NO: 1, in which the tower domain (residues E264-P278 and N296-A394) and/or the pin domain (residues K89-E105) ) And/or 1A domain (residues M1-L88 and M106-V181) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 2, in which the tower domain (residues E265-P279 and N297-A392) and/or the pin domain (residues K89-D105) ) And/or 1A domain (residues M1-L88 and I106-M180) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 3, in which the tower domain (residues T266-P280 and N298-S403) and/or the pin domain (residues K89-A109) ) And/or 1A domain (residues M1-L88 and K110-V182) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 4, in which the tower domain (residues T266-P280 and N298-S404) and/or the pin domain (residues K89-A109) ) And/or 1A domain (residues M1-L88 and K110-V182) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 5, in which the tower domain (residues E260-P274 and N292-A391) and/or the pin domain (residues K86-E102) ) And/or 1A domain (residues M1-L84 and M103-K177) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 6, in which the tower domain (residues E266-P280 and N298-A396) and/or the pin domain (residues K91-E107 ) And/or 1A domain (residues M1-L90 and M108-M183) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 7, in which the tower domain (residues T276-P290 and N308-P402) and/or the pin domain (residues K100-D116) ) And/or 1A domain (residues M1-L99 and D117-M191) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 8, in which the tower domain (residues D274-P288 and N306-A404) and/or the pin domain (residues K95-E112) ) And/or 1A domain (residues M1-L95 and I113-K187) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 9, in which the tower domain (residues E260-P274 and N292-A391) and/or the pin domain (residues K86-E102 ) And/or 1A domain (residues M1-L85 and M103-K177) introduced at least one cysteine residue and/or at least one unnatural amino acid;
- the Pif1-like helicase includes a variant of SEQ ID NO: 10, in which the tower domain (residues E265-P279 and H297-A393) and/or the pin domain (residues K88-E104 ) And/or 1A domain (residues M1-L87 and I105-K180) introduced at least one cysteine residue and/or at least one unnatural amino acid.
- the tower domain (residues E265-P279 and H297-A393) and/or the pin domain (residues K88-E104 ) And/or 1A domain (residues M1-L87 and I105-K180) introduced at least one cysteine residue and/or at least one unnatural amino acid.
- the Pif1-like helicase includes a variant of SEQ ID NO: 11, which includes (i) E105C and/or A362C; (ii) E104C and/or K360C; (iii) E104C and/or A362C ; (Iv) E104C and/or Q363C; (v) E104C and/or K366C; (vi) E105C and/or M356C; (vii) E105C and/or K360C; (viii) E104C and/or M356C; (ix) E105C And/or Q363C; (x) E105C and/or K366C; (xi) F108C and/or M356C; (xii) F108C and/or K360C; (xiii) F108C and/or A362C; (xiv) F108C and/or Q363C; (xv) F108C and/or K366C; (xvi) K134C
- the Pif1-like helicase includes a variant of any one of SEQ ID NO: 1-10, which is defined by any one of (i) to (liv) in SEQ ID NO: 11
- the position corresponding to the position includes a cysteine residue.
- the amino acid sequence of the Pif1-like helicase is the amino acid sequence shown in SEQ ID NO: 1 to 11 or has at least 30%, at least 40%, or at least 50% of the amino acid sequence shown in SEQ ID NO: 1 to 11. %, 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.9% homology and the ability to control the movement of polynucleotides.
- the amino acid corresponding to the 105th and/or 362th position of SEQ ID NO: 11 introduces a cysteine residue, for example, the 97th position of SEQ ID NO: 2 Position and/or position 363, position 96 and/or 371 of SEQ ID NO: 3, position 94 and/or 361 of SEQ ID NO: 5, position 99 and/or position of SEQ ID NO: 6 / Or the 366th position, the 104th and/or the 375th position of SEQ ID NO: 8, etc.
- the introduced cysteine and cysteine are connected to each other,
- the non-natural amino acids of serotonin and the natural amino acids on the helicase are connected to each other.
- cysteines and/or unnatural amino acids can be linked to each other.
- 3, 4, 5, 6, 7, 8 or more cysteines and/or unnatural amino acids can be linked to each other.
- One or more cysteines can be linked to one or more cysteines.
- One or more cysteines can be linked to one or more unnatural amino acids such as Faz.
- One or more unnatural amino acids such as Faz can be linked to one or more unnatural amino acids such as Faz.
- One or more cysteines can be linked to one or more natural amino acids on the helicase.
- One or more unnatural amino acids such as Faz can be linked to one or more natural amino acids on the helicase.
- connection can be in any manner, including direct connection or indirect connection.
- connection can be a short-term contact or a permanent connection. More preferably, the connection may be a non-covalent connection or a covalent connection.
- the covalent connection can be connected by a chemical crosslinking agent, linear molecule or catalyst.
- the chemical crosslinking agent includes, but is not limited to, maleimide, active ester, succinimide, azide, alkyne (such as dibenzocyclooctynol (DIBO or DBCO), difluorocycloalkyne Hydrocarbons and linear alkynes), etc.; the length of the chemical crosslinking agent can vary from one carbon (phosgene connector) to multiple angstroms.
- the linear molecules include but are not limited to polyethylene glycol (PEGs), polypeptides, polysaccharides, deoxyribonucleic acid (DNA), peptide nucleic acid (PNA), threose nucleic acid (TNA), glycerol nucleic acid (GNA), saturated and Unsaturated hydrocarbon, polyamide.
- the catalyst includes, but is not limited to, TMAD, etc., which can make between cysteine residues, between unnatural amino acids, between cysteine residues and unnatural amino acids, between unnatural amino acids and natural amino acids, Or a catalyst that creates a covalent bond between a cysteine residue and a natural amino acid.
- the Pif1-like helicase is further modified to remove one or more cysteine residues.
- the Pif1-like helicase further includes at least one or more natural cysteine substituted. More preferably, alanine, serine or valine is substituted for cysteine.
- the Pif1-like helicase includes a variant of SEQ ID NO: 5, and the one or more substituted natural cysteine residues are one of C109, C114, C136 or C414 Or more.
- the Pif1-like helicase includes a variant of any one of SEQ ID NO: 1, 2, 3, 4, 6, 7, 8, 9, 10, and 11, and the one or more The substituted natural cysteine residues correspond to one or more of C109, C114, C136 or C414 in SEQ ID NO: 5.
- Table 3 shows the amino acid positions in SEQ ID NOs: 1, 2, 3, 4, 6, 7, 8, 9, 10, and 11 corresponding to C109, C114, C136, and C414 in SEQ ID NO: 5.
- the Pif1-like helicase is further modified to reduce the negative charge on its surface.
- the Pif1-like helicase further includes a substitution that increases the net positive charge.
- the substitutions that increase the net positive charge include substitution or modification of negatively charged amino acids, polar or non-polar amino acids, or negatively charged amino acids, polar or non-polar amino acids on the surface. A positively charged amino acid is introduced in the adjacent position.
- the substitutions that increase the net positive charge include substitution of positively charged amino acids for negatively charged amino acids, uncharged amino acids, aromatic amino acids, polar or non-polar amino acids.
- the substitutions that increase the net positive charge include substitution of uncharged amino acids for negatively charged amino acids, aromatic amino acids, polar or non-polar amino acids.
- Suitable positively charged amino acids include, but are not limited to, histidine (H), lysine (K) and/or arginine (R).
- Uncharged amino acids have no net charge.
- Suitable uncharged amino acids include, but are not limited to, cysteine (C), serine (S), threonine (T), methionine (M), asparagine (N) or glutamine (Q) .
- Non-polar amino acids have non-polar side chains.
- Non-polar amino acids include, but are not limited to, glycine (G), alanine (A), proline (P), isoleucine (I), leucine (L) or valine (V).
- Aromatic amino acids have aromatic side chains. Suitable aromatic amino acids include, but are not limited to, histidine (H), phenylalanine (F), tryptophan (W) or tyrosine (Y).
- the positively charged amino acids, uncharged amino acids, polar, non-polar amino acids or aromatic amino acids may be natural or unnatural amino acids, which may be artificially synthesized or modified natural amino acids .
- substitutions include, but are not limited to, arginine (R) for glutamic acid (E), lysine (K) for glutamic acid (E), and asparagine (N) for glutamic acid ( E), substituting lysine (K) for aspartic acid (D), and substituting arginine (R) for aspartic acid (D).
- the Pif1-like helicase includes a variant of SEQ ID NO: 11 and the one or more negatively charged amino acids are D5, E9, E24, E87, I65, S58, D209 or D216.
- the one or more negatively charged amino acids are D5, E9, E24, E87, I65, S58, D209 or D216.
- Any number of these amino acids can be neutralized, such as 1, 2, 3, 4, 5, 6, 7 or 8 of these amino acids. Can neutralize any combination.
- the Pif1-like helicase includes a variant of any one of SEQ ID NO: 1 to 10, and the one or more negatively charged amino acids correspond to D5, E9 in SEQ ID NO: 11. , E24, E87, I65, S58, D209 or D216 one or more.
- the amino acids in SEQ ID NOs: 1 to 10 corresponding to D5, E9, E24, E87, I65, S58, D209, and D216 in SEQ ID NO: 11 can be determined using the alignment in FIG. 11.
- Preferred substitutions also include a variant of SEQ ID NO: 1, and the one or more negatively charged amino acids also include S171.
- Preferred substitutions also include variants of SEQ ID NO: 9, and the one or more negatively charged amino acids also include S173.
- the unnatural amino acid is selected from 4-azido-L-phenylalanine (Faz), 4-acetyl-L-phenylalanine, 3-acetyl-L-phenylalanine, 4-acetoacetyl-L-phenylalanine, O-allyl-L-tyrosine, 3-(phenylselenoalkyl)-L-alanine, O-2-propyn-1-yl -L-tyrosine, 4(dihydroxyboronyl)-L-phenylalanine, 4-[(ethylsulfanyl)carbonyl]-L-phenylalanine, (2S)-2-amino- 3- ⁇ 4-[(Propan-2-ylsulfanyl)carbonyl]phenyl ⁇ propionic acid, (2S)-2-amino-3- ⁇ 4-[(2-amino-3-sulfanylpropionyl )Amino]phenyl ⁇ propionic acid,
- the Pif1-like helicase further includes:
- At least one amino acid that interacts with one or more nucleotides in single-stranded DNA (ssDNA) or double-stranded DNA (dsDNA) is substituted; and/or,
- the Pif1-like helicase has the ability to control the movement of polynucleotides.
- At least one of the sugars and/or bases that interact with one or more nucleotides in single-stranded or double-stranded DNA is replaced with an amino acid containing a larger side chain (R group) Of amino acids.
- the Pif1-like helicase includes replacing at least one sugar and/or base with one or more nucleotides of single-stranded or double-stranded DNA with an amino acid containing a larger side chain (R group).
- Base interacting amino acids Any number of amino acids can be substituted, for example, 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, or 6 or more Amino acids.
- Each amino acid can interact with a base, a sugar, or a base and a sugar.
- Protein modeling can be used to identify amino acids that interact with the sugars and/or bases of one or more nucleotides in single-stranded or double-stranded DNA.
- the Pif1-like helicase comprises a variant of SEQ ID NO: 11, wherein the at least one sugar and/or base is linked to one or more nucleotides in single-stranded or double-stranded DNA
- the interacting amino acid is at least one of P73, H93, N99, F109, I280, A161, F130, D132, D162, D163, E277, K415, Q291, H396, Y244 or P100.
- These numbers correspond to the relevant positions in SEQ ID NO: 11, and compared to SEQ ID NO: 11, it may be necessary to change when one or more amino acids have been inserted or deleted in the variant. As mentioned above, those skilled in the art can determine the corresponding positions in the variant.
- the Pif1-like helicase is a variant of any one of SEQ ID NO: 1 to 10, wherein the at least one sugar is associated with one or more nucleotides in single-stranded or double-stranded DNA.
- the base interacting amino acid is at least one of P73, H93, N99, F109, I280, A161, F130, D132, D162, D163, E277, K415, Q291, H396, Y244 or SEQ ID NO: 11 The amino acid corresponding to P100.
- Table 4 shows the SEQ ID NOs corresponding to P73, H93, N99, F109, I280, A161, F130, D132, D162, D163, E277, K415, Q291, H396, Y244 and P100 in SEQ ID NO: 11: Amino acids from 1 to 10.
- the larger side chain (R group) preferably (a) contains an increased number of carbon atoms (b) has an increased length (c) has an increased molecular volume and/or (d) has an increased van der Waals volume.
- Larger side chains (R groups) are preferably (a); (b); (c); (d); (a) and (b); (a) and (c); (a) and (d); (b) and (c); (b) and (d); (c) and (d); (a), (b) and (c); (a), (b) and (d); (a) ), (c) and (d); (b), (c) and (d); or (a), (b), (c) and (d).
- Standard methods in the art can be used to measure each of (a) to (d).
- the larger side chain (R group) increases the (i) electrostatic interaction between the at least one amino acid and one or more nucleotides in the single-stranded or double-stranded DNA; ii) Hydrogen bonding and/or (iii) cation-pi ((cation- ⁇ )) interaction.
- positively charged amino acids such as arginine (R), histidine (H), and lysine (K) have R groups that increase electrostatic interaction.
- amino acids such as asparagine (N), serine (S), glutamine (Q), threonine (T), and histidine (H) have R groups that increase hydrogen bonding. group.
- aromatic amino acids such as phenylalanine (F), tryptophan (W), tyrosine (Y) or histidine (H) have increased cation-pi (cation- ⁇ ) Interacting R groups.
- the amino acid containing the larger side chain (R) may be an unnatural amino acid.
- the unnatural amino acid can be any of the ones discussed below.
- the amino acids of the larger side chain (R group) are not alanine (A), cysteine (C), glycine (G), selenocysteine (U), methionine (M), aspartic acid (D) or glutamic acid (E).
- the Pif1-like helicase contains one or more of the following substitutions:
- Histidine (H) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q) or asparagine (N); or (iii) phenylpropanine Amino acid (F), tyrosine (Y) or tryptophan (W) substitution. Histidine (H) is more preferably substituted with (a) N, Q or W or (b) Y, F, Q or K.
- Asparagine (N) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q) or histidine (H); or (iii) phenylpropanine Amino acid (F), tyrosine (Y) or tryptophan (W) substitution. Asparagine (N) is more preferably substituted by R, H, W or Y.
- Proline (P) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N), threonine (T) Or histidine (H); (iii) tyrosine (Y), phenylalanine (F) or tryptophan (W); or (iv) leucine (L), valine (V) Or isoleucine (I) substitution.
- Proline (P) is more preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N), threonine (T) or Histidine (H); (iii) Phenylalanine (F) or tryptophan (W) or (iv) Leucine (L), Valine (V) or Isoleucine (I) substitution .
- Proline (P) is more preferably substituted by (a) F, (b) L, V, I, T or F or (c) W, F, Y, H, I, L or V.
- Valine (V) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H) ; (Iii) Phenylalanine (F), tyrosine (Y) or tryptophan (W); or (iv) isoleucine (I) or leucine (L) substitution.
- Valine (V) is more preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H); (iii) Tyrosine (Y) or tryptophan (W); or (iv) isoleucine (I) or leucine (L) substitution.
- Valine (V) is more preferably substituted by I or H or I, L, N, W or H.
- Phenylalanine (F) is preferably (i) arginine (R) or lysine (K); (ii) histidine (H); or (iii) tyrosine (Y) or color Amino acid (W) substitution. Phenylalanine (F) is more preferably substituted with (a) W, (b) W, Y or H, (c) W, R or K or (d) K, H, W or R.
- Glutamine (Q) is preferably (i) arginine (R) or lysine (K) or (iii) phenylalanine (F), tyrosine (Y) or tryptophan (W )replace.
- Alanine (A) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H) ; (Iii) Phenylalanine (F), tyrosine (Y) or tryptophan (W) or (iv) isoleucine (I) or leucine (L) substitution.
- Serine (S) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H); ( iii) Phenylalanine (F), tyrosine (Y) or tryptophan (W); or (iv) isoleucine (I) or leucine (L) substitution.
- Serine (S) is preferably substituted by K, R, W or F.
- Lysine (K) is preferably substituted by (i) arginine (R) or (iii) tyrosine (Y) or tryptophan (W).
- Arginine (R) is preferably substituted by (iii) Tyrosine (Y) or Tryptophan (W).
- Methionine (M) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H ) Or (iii) phenylalanine (F), tyrosine (Y) or tryptophan (W) substitution.
- L) Leucine (L) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q) or asparagine (N) or (iii) phenylalanine Acid (F), tyrosine (Y) or tryptophan (W) substitution.
- Aspartic acid (D) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H ); or (iii) phenylalanine (F), tyrosine (Y) or tryptophan (W) substitution. Aspartic acid (D) is more preferably substituted by H, Y or K.
- Glutamic acid (E) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H) Or (iii) phenylalanine (F), tyrosine (Y) or tryptophan (W) substitution.
- Isoleucine (I) is preferably (i) arginine (R) or lysine (K); (ii) glutamine (Q), asparagine (N) or histidine (H ); (iii) Phenylalanine (F), tyrosine (Y) or tryptophan (W) or (iv) leucine (L) substitution.
- Tyrosine (Y) is preferably substituted by (i) arginine (R) or lysine (K); or (ii) tryptophan (W). Tyrosine (Y) is more preferably substituted by W or R.
- the Pif1-like helicase more preferably comprises a variant of SEQ ID NO: 11, which comprises:
- the helicase of the present invention is preferably a helicase in which at least one amino acid that interacts with one or more phosphate groups of one or more nucleotides in ssDNA or dsDNA is substituted. Any number of amino acids can be substituted, for example, 1 or more, 2 or more, 3 or more, 4 or more, 5 or more, or 6 or more Amino acids.
- the nucleotides in ssDNA each contain three phosphate groups. Each amino acid that is substituted can interact with any number of phosphate groups at a time, for example, one, two, or three phosphate groups at a time. Amino acids that interact with one or more phosphate groups can be identified using protein modeling.
- the substitution preferably increases (i) electrostatic interaction between the at least one amino acid and one or more phosphate groups in the ssDNA or dsDNA; (ii) hydrogen bonding and/or (iii) cation-pi (cation- ⁇ ) )interaction.
- the preferred substitutions of (i), (ii) and (iii) are discussed using the labels (i), (ii) and (iii).
- the substitution preferably increases the net positive charge of the site.
- Methods known in the art can be used to measure the net charge at any site.
- the isoelectric point can be used to define the net charge of an amino acid.
- the net charge is usually measured at about 7.5.
- the substitution is preferably to replace the negatively charged amino acid with a positively charged, uncharged, non-polar or aromatic amino acid.
- a negatively charged amino acid is an amino acid with a net negative charge.
- Negatively charged amino acids include, but are not limited to, aspartic acid (D) and glutamic acid (E).
- a positively charged amino acid is an amino acid with a net positive charge.
- the positively charged amino acids can be naturally occurring or non-naturally occurring.
- the positively charged amino acids can be synthetic or modified.
- modified amino acids with a net positive charge can be specifically designed for use in the present invention.
- Many different types of modifications to amino acids are well known in the art.
- Preferred naturally occurring positively charged amino acids include, but are not limited to, histidine (H), lysine (K), and arginine (R).
- Uncharged amino acids, non-polar amino acids, or aromatic amino acids may be naturally occurring or non-naturally occurring. It can be synthetic or modified. Uncharged amino acids have no net charge. Suitable uncharged amino acids include, but are not limited to, cysteine (C), serine (S), threonine (T), methionine (M), asparagine (N) and glutamine (Q ). Non-polar amino acids have non-polar side chains. Suitable non-polar amino acids include, but are not limited to, glycine (G), alanine (A), proline (P), isoleucine (I), leucine (L) and valine (V) . Aromatic amino acids have aromatic side chains. Suitable aromatic amino acids include, but are not limited to, histidine (H), phenylalanine (F), tryptophan (W), and tyrosine (Y).
- the Pif1-like helicase preferably comprises a variant of SEQ ID NO: 11, wherein at least one amino acid that interacts with one or more phosphate groups of one or more nucleotides in ssDNA or dsDNA is H75, T91, At least one of S94, K97, N246, N247, N284, K288, N297, T394 or K397.
- SEQ ID NO: 11 At least one of S94, K97, N246, N247, N284, K288, N297, T394 or K397.
- the Pif1-like helicase preferably comprises a variant of SEQ ID NO: 1 to 10, wherein the at least one interacts with one or more phosphate groups of one or more nucleotides in ssDNA or dsDNA
- the amino acid of is at least one amino acid corresponding to H75, T91, S94, K97, N246, N247, N284, K288, N297, T394 or K397 in SEQ ID NO: 11.
- Table 5 shows the amino acids in SEQ ID NO: 1 to 10 corresponding to H75, T91, S94, K97, N246, N247, N284, K288, N297, T394, and K397 in SEQ ID NO: 11.
- the Pif1-like helicase comprises any one or more of the following:
- Histidine is (i) arginine (R) or lysine (K); (ii) asparagine (N), serine (S), glutamine (Q) or threonine Acid (T); or (iii) phenylalanine (F), tryptophan (W) or tyrosine (Y) substitution;
- Threonine is (i) arginine (R), histidine (H) or lysine (K); (ii) asparagine (N), serine (S), glutamine Amide (Q) or histidine (H); or (iii) phenylalanine (F), tryptophan (W), tyrosine (Y) or histidine (H) substitution;
- Serine is (i) arginine (R), histidine (H) or lysine (K); (ii) asparagine (N), glutamine (Q), threonine (T ) Or histidine (H); or (iii) phenylalanine (F), tryptophan (W), tyrosine (Y) or histidine (H) substitution;
- Asparagine (N) is (i) arginine (R), histidine (H) or lysine (K); (ii) serine (S), glutamine (Q), threonine Acid (T) or histidine (H); or (iii) phenylalanine (F), tryptophan (W), tyrosine (Y) or histidine (H) substitution; and/or,
- Lysine (K) is (i) arginine (R) or histidine (H); (ii) asparagine (N), serine (S), glutamine (Q), threonine Acid (T) or histidine (H); or (iii) phenylalanine (F), tryptophan (W), tyrosine (Y) or histidine (H) substitution.
- the Pif1-like helicase is a variant of SEQ ID NO: 11, which includes one or more of (a) to (k), wherein:
- the helicase of the present invention is also such a helicase, wherein a part of the helicase that interacts with the transmembrane pore contains one or more modifications, preferably one or more substitutions, and it is further preferred that the helicase includes At least one amino acid that interacts with the transmembrane pore is substituted.
- the part of the helicase that interacts with the transmembrane pore is usually the part of the helicase that interacts with the transmembrane pore when the helicase is used to control the movement of polynucleotides through the pore.
- the portion When helicases are used to control the movement of polynucleotides through a pore, the portion usually contains amino acids that interact or come into contact with the pore. When an electric potential is applied, when the helicase is bound or linked to a polynucleotide that is moving through the pore, the portion usually contains amino acids that interact or contact with the pore.
- the part that interacts with the transmembrane pore preferably includes a variant of SEQ ID NO: 11, where the position includes one or more amino acids on E196, W202, N199 or G201 being substituted, for example, 2, 3, 4 Or 5.
- the part that interacts with the transmembrane pore preferably includes a variant of any one of SEQ ID NO: 1 to 10, which includes SEQ ID NO: 11 in (a) E196; (b) W202; (c); N199; or (d) At least one or more substitutions corresponding to the G201 position.
- Table 6 shows the amino acids in SEQ ID NO: 1 to 10 corresponding to E196, W202, N199, and G201 in SEQ ID NO: 11.
- the Pif1-like helicase is a variant of SEQ ID NO. 11, which includes substitutions at the following positions:
- -F109/E196/H75 for example, F109W/E196L/H75N, F109W/E196L/H75Q, F109W/E196L/H75K or F109W/E196L/H75F;
- -F109/E196/T91 for example, F109W/E196L/T91K, F109W/E196L/T91Q or F109W/E196L/T91N;
- F109/S94/E196 for example, F109W/S94H/E196L, F109W/S94T/E196L, F109W/S94R/E196L, F109W/S94Q/E196L, F109W/S94N/E196L, or F109W/S94K/E196L;
- F109/N99/E196 for example, F109W/N99R/E196L, F109W/N99H/E196L, F109W/N99W/E196L or F109W/N99Y/E196L;
- -F109/P100/E196 for example, F109W/P100L/E196L, F109W/P100V/E196L, F109W/P100I/E196L or
- -F109/D132/E196 for example, F109W/D132H/E196L, F109W/D132Y/E196L or F109W/D132K/E196L;
- -F109/A161/E196 for example, F109W/A161I/E196L, F109W/A161L/E196L, F109W/A161N/E196L, F109W/A161W/E196L or F109W/A161H/E196L;
- F109/D163/E196 for example, F109W/D163W/E196L, F109W/D163F/E196L, F109W/D163Y/E196L, F109W/D163H/E196L, F109W/D163I/E196L, F109W/D163L/E196L or F109W/D163V/E196L ;
- F109/Y244/E196 for example, F109W/Y244W/E196L, F109W/Y244Y/E196L or F109W/Y244H/E196L;
- -F109/E196/I280 for example, F109W/E196L/I280K, F109W/E196L/I280H, F109W/E196L/I280W or F109W/E196L/I280R;
- -F109/E196/Q291 for example, F109W/E196L/Q291K, F109W/E196L/Q291R, F109W/E196L/Q291W or F109W/E196L/Q291F;
- F109/N297/E196 for example, F109W/N297Q/E196L, F109W/N297K/E196L or F109W/N297H/E196L;
- -F109/T394/E196 for example, F109W/T394K/E196L, F109W/T394H/E196L or F109W/T394N/E196L;
- F109/H396/E196 for example, F109W/H396Y/E196L, F109W/H396F/E196L, F109W/H396Q/E196L or F109W/H396K/E196L;
- -F109/K397/E196 for example, F109W/K397R/E196L, F109W/K397H/E196L or F109W/K397Y/E196L; or,
- Pif1-like helicase is an enzyme having an amino acid sequence that is changed from that of a wild-type helicase and retains polynucleotide binding activity.
- the variant of any one of SEQ ID NO: 1 to 11 is an enzyme having the following amino acid sequence: the amino acid sequence is changed from the amino acid sequence of any one of SEQ ID NO: 1 to 11 and retains the polynucleoside Acid binding activity.
- the polynucleotide binding activity can be determined using methods known in the art. Suitable methods include, but are not limited to, fluorescence anisotropy method, tryptophan fluorescence method, and electrophoretic migration shift assay (EMSA). For example, the ability of the variant to bind to a single-stranded polynucleotide can be determined as described in the Examples.
- the variant preferably has at least 20% identity with the sequence. More preferably, based on amino acid identity, the variant polypeptide may have at least 70%, at least 75%, at least 80%, or at least 85% of the entire sequence of the amino acid sequence of any one of SEQ ID NO: 1 to 11, At least 90% and more preferably at least 95%, 97% or 99% identity. It can have at least 70%, such as at least 80%, at least 85%, at least 90%, or at least 95% amino acids over a length of 100 or more, such as 150, 200, 300, 400, or 500 or more consecutive amino acids Identity (strict homology).
- a construct which comprises at least one or more Pif1-like helicases of the present invention.
- the construct further comprises a polynucleotide binding part.
- the construct has the ability to control the movement of polynucleotides.
- the polynucleotide binding portion may be a portion that binds to the base of the polynucleotide, and/or a portion that binds to the sugar of the polynucleotide, and/or a portion that binds to the phosphate of the polynucleotide.
- the Pif1-like helicase and polynucleotide binding part constituting the construct can be prepared separately and then directly connected.
- the construct can be directly prepared by genetic fusion, for example, the nucleotides encoding the Pif1-like helicase and the polynucleotide binding part are connected, and then transferred into host cells for expression and purification.
- the polynucleotide binding portion is a polypeptide capable of binding to a polynucleotide, including but not limited to eukaryotic single-chain binding protein, bacterial single-chain binding protein, archaea single-chain binding protein, and viral single-chain binding protein.
- eukaryotic single-chain binding protein including but not limited to eukaryotic single-chain binding protein, bacterial single-chain binding protein, archaea single-chain binding protein, and viral single-chain binding protein.
- the polynucleotide binding portion includes but is not limited to any one shown in Table 7:
- the third aspect of the present invention provides a nucleic acid that encodes the Pif1-like helicase or the construct of the present invention.
- an expression vector is provided, and the expression vector comprises the nucleic acid of the present invention.
- the nucleic acid is operably linked to a regulatory element in an expression vector, wherein the regulatory element is preferably a promoter.
- the promoter is selected from T7, trc, lac, ara or ⁇ L.
- the expression vector includes but is not limited to plasmid, virus or phage.
- a host cell in the fifth aspect of the present invention, contains the nucleic acid or the expression vector of the present invention.
- the host cell includes but is not limited to Escherichia coli.
- the host cell is selected from BL21 (DE3), JM109 (DE3), B834 (DE3), TUNER, C41 (DE3), Rosetta2 (DE3), Origami, Origami B, etc. .
- a method for preparing Pif1-like helicase includes providing a wild-type Pif1-like helicase, and then performing processing on the wild-type Pif1-like helicase. Modification to obtain the Pif1-like helicase of the present invention.
- a method for preparing Pif1-like helicase includes culturing and inducing expression of the host cell of the present invention, and obtaining Pif1-like helicase after purification.
- the method includes obtaining a nucleic acid sequence encoding a Pif1-like helicase according to the amino acid sequence of the Pif1-like helicase of the present invention, and converting it after digestion and ligation into an expression vector In Escherichia coli, induce expression and purification to obtain Pif1-like helicase.
- the eighth aspect of the present invention provides a method for controlling the movement of a polynucleotide, the method comprising contacting the Pif1-like helicase or the construct of the present invention with the polynucleotide.
- said controlling the movement of the polynucleotide is controlling the movement of the polynucleotide through the pore.
- the pores are nanopores, and the nanopores are transmembrane pores.
- the pores can be natural or man-made, including but not limited to protein pores, polynucleotide pores or solid pores.
- the transmembrane pores are selected from biological pores, solid pores, or pores hybridized between biological and solid state.
- the pores include, but are not limited to, those derived from Mycobacterium smegmatis porin A, Mycobacterium smegmatis porin B, Mycobacterium smegmatis porin C, and mycobacterium smegmatis porin.
- Mycobacterial porin D hemolysin, lysin, interleukin, outer membrane porin F, outer membrane porin G, outer membrane phospholipase A, WZA or Neisseria autotransport lipoprotein, etc.
- the method may include one or more Pif1-like helicases to jointly control the movement of the polynucleotide.
- the ninth aspect of the present invention provides a method for characterizing a target polynucleotide, the method comprising:
- any number of Pif1-like helicases of the present invention can be used in the method.
- it may be one or more, more preferably 1, 2, 3, 4, 5, 6, 7, 8, 9 or more.
- the two or more Pif1-like helicases of the present invention may be the same or different. It can also contain wild-type Pif1-like helicase or other types of helicases.
- two or more helicases can be connected or just arranged by being respectively bound to the polynucleotide to perform the function of controlling the movement of the polynucleotide.
- the rate at which the target polynucleotide passes through the pore is controlled by the Pif1-like helicase or construct, so as to obtain a recognizable and stable current level for Determine the characteristics of the target polynucleotide.
- steps I) and II) are repeated one or more times.
- the method further includes the step of applying a potential difference across the hole contacting the helicase or the construct and the target polynucleotide.
- the pore is a structure that allows hydrated ions to flow from one side of the membrane to the other layer of the membrane under the driving of an applied electric potential.
- the pores are nanopores, and the nanopores are transmembrane pores.
- the transmembrane pore provides a channel for the movement of the target polynucleotide.
- the membrane can be any membrane existing in the prior art, and is preferably an amphiphilic molecular layer, that is, a layer formed of amphiphilic molecules having at least one hydrophilic part and at least one lipophilic or hydrophobic part, such as phospholipids. , Amphoteric molecules can be synthetic or naturally occurring. Further preferably, the membrane is a lipid bilayer membrane.
- the target polynucleotide can be attached to the membrane using any known method.
- the membrane is an amphiphilic molecular layer, such as a lipid bilayer
- the polynucleotide is preferably connected to the membrane via a polypeptide present in the membrane or via a hydrophobic anchor present in the membrane.
- the hydrophobic anchors are preferably lipids, fatty acids, sterols, carbon nanotubes or amino acids.
- the pores are selected from the group consisting of biological pores, solid pores, or pores hybridized between biological and solid state.
- the pores include, but are not limited to, those derived from Mycobacterium smegmatis porin A, Mycobacterium smegmatis porin B, Mycobacterium smegmatis porin C, and mycobacterium smegmatis porin.
- Mycobacterial porin D hemolysin, lysin, interleukin, outer membrane porin F, outer membrane porin G, outer membrane phospholipase A, WZA or Neisseria autotransport lipoprotein, etc.
- the Pif1-like helicase moves in the 5'-3' direction along the DNA, but the orientation of the DNA in the hole (depending on which end of the DNA is captured) means Enzymes can be used to move DNA out of the hole against the direction of the applied field, or move DNA into the hole in the direction of the applied field.
- the target polynucleotide is single-stranded, double-stranded, or at least part of it is double-stranded.
- the target polynucleotide can be modified by means of tags, spacers, methylation, oxidation or damage.
- the target polynucleotide is double-stranded.
- the double-stranded part constitutes a Y adapter structure, and the Y adapter structure includes a leader sequence that is preferentially screwed into the pore.
- the length of the target polynucleotide may be 10-100000 or more.
- the length of the target polynucleotide can be at least 10, at least 50, at least 100, at least 200, at least 300, at least 400, at least 500, at least 1,000, at least 2,000, at least 5,000, at least 10,000, at least 50,000, or at least 100,000, etc.
- the helicase is incorporated into the internal nucleotide of the single-stranded polynucleotide.
- the one or more characteristics are selected from the source, length, identity, sequence, secondary structure of the target polynucleotide, or whether the target polynucleotide is modified.
- the one or more features are carried out by electrical measurement and/or optical measurement.
- electrical and/or optical signals are generated by electrical measurement and/or optical measurement, and each nucleotide corresponds to a signal level, and then the electrical signals and/or optical signals are converted into nucleotide characteristics.
- the electrical measurement includes, but is not limited to, current measurement, impedance measurement, tunnel measurement, wind tunnel measurement or field effect transistor (FET) measurement, etc.
- FET field effect transistor
- the electrical signal of the present invention is selected from the measured values of current, voltage, tunneling, resistance, potential, conductivity or lateral electrical measurement.
- the electrical signal is a current passing through the hole.
- the characterization further includes the application of an improved Viterbi algorithm.
- the tenth aspect of the present invention provides a product for characterizing a target polynucleotide, the product comprising the Pif1-like helicase of the present invention, the construct, the nucleic acid, the Expression vector or said host cell, and pore.
- the product contains multiple Pif1-like helicases and/or multiple constructs.
- the product contains multiple holes.
- the pores are nanopores, and the nanopores are transmembrane pores.
- the transmembrane pores are selected from biological pores, solid pores, or pores hybridized between biological and solid state.
- the pores include, but are not limited to, those derived from Mycobacterium smegmatis porin A, Mycobacterium smegmatis porin B, Mycobacterium smegmatis porin C, and mycobacterium smegmatis porin.
- Mycobacterial porin D hemolysin, lysin, interleukin, outer membrane porin F, outer membrane porin G, outer membrane phospholipase A, WZA or Neisseria autotransport lipoprotein, etc.
- the product contains multiple Pif1-like helicases or multiple constructs, and multiple pores.
- the product is selected from kits, devices or sensors.
- the kit also includes a chip containing a lipid bilayer.
- the pores span the lipid bilayer.
- the kit of the present invention contains one or more lipid bilayers, and each lipid bilayer contains one or more of the pores.
- the kit of the present invention also includes reagents or devices for performing characterization of target polynucleotides.
- the reagents include buffers and tools required for PCR amplification.
- the eleventh aspect of the present invention provides the Pif1-like helicase, the construct, the nucleic acid, the expression vector, the host cell or the product of the present invention To characterize the target polynucleotide or to control the movement of the target polynucleotide through the pore.
- the twelfth aspect of the present invention provides a kit for characterizing target polynucleotides, said kit comprising the Pif1-like helicase of the present invention, the construct or the nucleic acid, The expression vector or the host cell, and pores.
- the thirteenth aspect of the present invention provides a device for characterizing target polynucleotides, said device comprising the Pif1-like helicase of the present invention, the construct or the nucleic acid, the The expression vector or the host cell, and the hole.
- the device includes a sensor that supports the plurality of holes and can transmit signals for the interaction of the holes and the polynucleotide, and at least one memory for storing the target polynucleotide, and the necessary information during the characterization process. Solution.
- the device includes a plurality of Pif1-like helicases and/or a plurality of constructs, and a plurality of holes.
- the fourteenth aspect of the present invention provides a sensor for characterizing target polynucleotides, the sensor comprising a complex formed between the pore and the Pif1-like helicase of the present invention or the construct Things.
- the pore is brought into contact with the helicase or construct in the presence of the target polynucleotide, and a potential is applied across the pore.
- the potential is selected from voltage potential or chemical potential.
- the fifteenth aspect of the present invention provides a method for forming a sensor for characterizing a target polynucleotide, comprising forming a complex between the pore and the Pif1-like helicase or the construct of the present invention , Thereby forming a sensor that characterizes the target polynucleotide.
- the sixteenth aspect of the present invention provides two or more helicases linked to a polynucleotide, wherein at least one of the two or more helicases is the Pif1 of the present invention -like helicase.
- the seventeenth aspect of the present invention provides a Pif1-like helicase oligomer.
- the Pif1-like helicase oligomer comprises one or more Pif1-like helicases of the present invention. Enzyme.
- the Pif1-like helicase oligomer may also include wild-type Pif1-like helicase or other types of helicase.
- the other types of helicase can be Hel308 helicase, XPD helicase, Dda helicase, TraI helicase or TrwC helicase, etc.
- the Pif1-like helicase oligomer contains more than two Pif1-like helicases of the present invention, wherein the Pif1-like helicases may be different or the same .
- the "Pif1-like helicase" of the present invention is modified, and the modification is relative to the wild-type or natural helicase.
- the "Pif1-like helicase", “construct” or “pore” of the present invention can be modified to facilitate identification or purification, for example, by adding histidine residues (His tag), aspartic acid Acid residues (asp tag), streptavidin tag, Flag tag, SUMO tag, GST tag or MBP tag, or by adding a signal sequence to promote their secretion from the cell, the polypeptide in the cell does not naturally contain the Signal sequence.
- An alternative way to introduce genetic tags is to attach tags to natural or artificial sites on Pif1-like helicases, pores, or constructs through chemical reactions.
- nucleotide in the present invention includes but is not limited to: adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), cytosine Nucleoside monophosphate (CMP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyguanosine monophosphate (dGMP), deoxythymidine monophosphate (dTMP), deoxyuridine monophosphate (dUMP) and deoxycytidine monophosphate (dCMP).
- the nucleotide is selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP or dCMP.
- the "conservative amino acid substitutions" in the present invention include, but are not limited to: substitutions between alanine and serine, glycine, threonine, valine, proline or glutamic acid; and/or aspartame
- substitutions between glycine and glycine, asparagine or glutamic acid and/or the substitution between serine and glycine, asparagine or threonine; and/or the substitution between leucine and isoleucine or The substitution between valine; and/or the substitution between valine and leucine, isoleucine; and/or the substitution between tyrosine and phenylalanine; and/or, Replacement between lysine and arginine.
- the above-mentioned substitution basically does not change the activity of the amino acid sequence of the present invention.
- the "two or more” mentioned in the present invention includes two, three, four, five, six, seven, eight or more and so on.
- the "plurality” mentioned in the present invention includes but is not limited to two or more, three or more, four or more, five or more, six or more, seven or more, eight or more or more, and so on.
- the "at least one" in the present invention includes but is not limited to one or more, two or more, three or more, four or more, five or more, six or more, seven or more, eight or more or more, and so on.
- non-natural amino acid in the present invention is an amino acid that is not naturally present in Pif1-like helicase.
- it includes, but is not limited to, 4-azido-L-phenylalanine (Faz), 4-acetyl-L-phenylalanine, 3-acetyl-L-phenylalanine, 4-acetyl Acetyl-L phenylalanine, O-allyl-L-tyrosine, 3-(phenylselenoalkyl)-L-alanine, O-2-propyn-1-yl-L- Tyrosine, 4(dihydroxyboronyl)-L-phenylalanine, 4-[(ethylsulfanyl)carbonyl]-L-phenylalanine, (2S)-2-amino-3- ⁇ 4-[(Propan-2-ylsulfanyl)carbonyl]phenyl ⁇ propionic acid, (2S)-2-amino-3- ⁇ 4-
- FIG. 1 Schematic diagram of fluorescence analysis for detecting Pif1-like helicase enzyme activity.
- the fluorescent substrate chain has a 5'-end ssDNA overhang, and a 50-base hybridized dsDNA portion.
- the upper part of the main chain (B) has a black hole quencher (BHQ-1) base (E) at the 3'end, and the hybrid complementary strand (D) has a carboxyfluorescein at the 5'end (C). It also includes a 0.5 ⁇ M capture chain (F) complementary to the shorter chain (D) of the fluorescent substrate.
- BHQ-1 black hole quencher
- D hybrid complementary strand
- C carboxyfluorescein
- the helicase (200nM) added to the substrate is connected to the 5'end portion of the fluorescent substrate along the After the main strand moves and the complementary strand is unwound, the excess capture strand preferentially anneals to the complementary strand DNA to prevent the initial substrate from re-annealing with lost fluorescence.
- the capture strand (A) after adding an excess of the capture strand (A) that is completely complementary to the main chain, part of the ununtwisted dsDNA will have a strand untwisting effect due to the presence of excess A, and finally all the dsDNA will be untwisted, and the fluorescence value will reach Highest.
- Figure 2 Fluorescence analysis to detect the ability of Pif1-like helicase to unwind hybridized dsDNA, specifically the graph of the time-dependent dsDNA unwinding ratio in a buffer containing 400mM NaCl.
- Figure 3 shows the results of gel assays for the ability of different Pif1-like helicases to bind DNA.
- Lane 1 is the pre-built dsDNA (SEQ ID NO: 18 hybridized with SEQ ID NO: 19 with 5'FAM modification).
- Lanes 2-6 respectively contain Aph Acj61, Aph PX29, Sph CBH8, PphPspYZU05, and Mph MP1 pre-linked to dsDNA.
- Lanes 7-11 respectively contain Aph Acj61-D97C/A363C, Aph PX29-D96C/A371C, Sph CBH8-A94C/A361C/C136A, PphPspYZU05-D104C/A375C/C146A and Mph MP1-E105C/A362C pre-linked to dsDNA.
- Band A corresponds to SEQ ID NO: 19 which hybridizes with SEQ ID NO: 18.
- the regions marked 1A, 2A, 3A, 4A, and 5A correspond to SEQ ID NO: 19 that hybridizes with SEQ ID NO: 18 with 1, 2, 3, 4, and 5 helicases, respectively.
- Figure 4 A diagram of the DNA construct used in the examples, in which SEQ ID NO: 13 (marked as B), its 5'end is connected to 20 iSpC3 spacers (marked A), and its 3'end is connected to 4 iSpC3 spacer (marked as C), the spacer (marked as C) is connected to the 5'end of SEQ ID NO: 14 (marked as D), and the 3'end of SEQ ID NO: 14 (marked as D) is connected To SEQ ID NO: 17 or SEQ ID NO: 24 (marked as E), the SEQ ID NO: 15 (marked as F) region of the construct and SEQ ID NO: 16 (marked as G, which has a 3'cholesterol series) Strand) hybridization.
- Figure 10 shows the purified Mph MP1 (SEQ ID NO: 11) SDS-PAGE gel electrophoresis image. Among them, M is Marker (Kd), and lane 1 is the electrophoresis result of Mph MP1 helicase.
- Figure 11 shows the correspondence between the amino acid sequences of SEQ Nos: 1 to 11.
- the recombinant plasmid containing the Pif1-like helicase sequence (a variant of amino acid sequence SEQ ID NO: 1-11, and its corresponding nucleotide sequence SEQ ID NO: 25-35) was transformed into BL21 (DE3) sensing by heat shock After spreading the ampicillin-resistant solid LB plate with the resuscitated bacteria solution, culture it overnight at 37°C. Pick a single colony and inoculate it in 100ml of liquid LB medium containing ampicillin resistance and cultivate it at 37°C. The 1% inoculum was transferred to ampicillin-resistant LB liquid medium for expansion culture, cultured at 37°C and 200 rpm, and its OD600 value was continuously measured.
- the culture solution in the LB medium was cooled to 18° C., and isopropyl thiogalactoside (IPTG) was added to induce expression, so that the final concentration reached 1 mM. After 12-16h, collect the bacteria at 18°C. The bacteria were crushed under high pressure, purified by the FPLC method, and samples were collected.
- IPTG isopropyl thiogalactoside
- Figure 10 shows an SDS-PAGE gel electrophoresis image of purified Mph MP1 (variant of SEQ ID NO: 11).
- Fluorescence analysis was used to detect enzyme activity to illustrate the ability of Pif1-like helicase to untie hybrid dsDNA.
- the fluorescent substrate strand (final concentration 100 nM) has a 5'-end ssDNA overhang, and a 50-base hybridized dsDNA portion.
- Including the upper part of the main chain has a black hole quencher (BHQ-1) base (SEQ ID NO: 20--BHQ-3') at the 3'end, and the hybrid complementary strand has a carboxyfluorescein at the 5'end ( 5'FAM-SEQ ID NO: 21).
- BHQ-1 black hole quencher
- 5'FAM-SEQ ID NO: 21 carboxyfluorescein at the 5'end
- a 0.5 ⁇ M capture strand (SEQ ID NO: 22) complementary to the shorter strand of the fluorescent substrate.
- SEQ ID NO: 22 0.5 ⁇ M capture strand
- the Pif1-like helicase (200nM) added to the substrate is connected to the 5'end portion of the fluorescent substrate along the After the main strand moves and the complementary strand is unwound, the excess capture strand preferentially anneals to the complementary strand DNA to prevent the initial substrate from re-annealing with lost fluorescence.
- Figure 2 shows the time-dependent dsDNA unwinding ratio in a buffer containing 400 mM NaCl (10 mM Hepes pH 8.0, 5 mM ATP, 5 mM MgCl2, 100 nM fluorescent substrate DNA, 0.5 ⁇ M capture DNA).
- An exemplary gel assay is used to measure the ability of the modified Pif1-like helicase of the present invention to bind DNA.
- the annealed DNA complex (SEQ ID NO: 18 and SEQ ID NO: 19 with 5'FAM modification) were hybridized with Aph Acj61, Aph PX29, Sph CBH8, Pph at a ratio of (1:1, volume/volume) PspYZU05, Mph MP1, Aph Acj61-D97C/A363C, Aph PX29-D96C/A371C, Sph CBH8-A94C/A361C/C136A, Pph PspYZU05-D104C/A375C/C146A, Mph PE1-E105C/A362C at pH 10mM/A362C , 400mM potassium chloride mixed, the final concentration of Pif1-like helicase is (600nM) and DNA is (30nM).
- Pif1-like helicase binds to DNA for 1 hour at room temperature.
- TMAD was added to each sample to a final concentration of 5 ⁇ M, and incubated at room temperature for 1 hour. Load the sample on a 4-20% TBE gel and run the gel at 160V for 1.5 hours. Then the gel was observed under blue fluorescence for DNA bands.
- Figure 3 shows the effect of Pif1-like helicase on DNA binding ability before and after modification.
- Lanes 2 to 6 show that a large amount of DNA is not bound by Pif1-like helicase during electrophoresis and no obvious binding bands are observed.
- Lanes 7-11 show the binding bands of different numbers of enzymes bound to DNA.
- Lanes 9 and 10 show that up to 5 Pif1-like helicases can bind to the single-stranded part of SEQ ID NO: 18. This indicates that Pif1-like helicase significantly enhanced the firmness of binding to DNA after modification.
- Mph MP1-E105C/A362C (SEQ ID NO: 11 with mutation E105C/A362C) as an example to verify how Pif1-like helicase controls the movement of the entire DNA chain through a single MspA nanopore (SEQ ID NO: 12).
- SEQ ID NO: 13 Its 5'end is connected to 20 iSpC3 spacers, and its 3'end is connected to 4 iSpC3 spacers, which are connected to SEQ ID NO: At the 5'end of 14, the 3'end of SEQ ID NO: 14 is connected to SEQ ID NO: 17, and the SEQ ID NO: 15 region of the DNA construct and SEQ ID NO: 16 (it has a 3'cholesterol tether) Hybrid).
- the DNA polynucleotide and Pif1-like helicase are added to 70 ⁇ L of buffer in the cis-compartment of the electrophysiology room to initiate the helicase-DNA complex in the Capture of nanopores. If necessary by adding the divalent metal to the cis compartment (5mM MgCl 2) and NTP (2.86 ⁇ M ATP) ATP activates helicase activity. The experiment was carried out at a constant potential of +180mV.
- Sph CBH8-A94C/A361C/C136A (SEQ ID NO: 5 with A94C/A361C/C136A mutation) and Eph Pei26-D99C/A366C/C141A (SEQ ID NO: 6 with D99C/A366C/C141A mutation)
- Pph PspYZU05-D104C/A375C/C146A (SEQ ID NO: 8 with D104C/A375C and C146A mutations) as examples to verify how Pif1-like helicase controls the movement of the entire DNA chain through a single MspA nanopore.
- DNA construct B as shown in Figure 4: SEQ ID NO: 13 Its 5'end is connected to 20 iSpC3 spacers, and its 3'end is connected to 4 iSpC3 spacers, which is connected to SEQ ID NO: At the 5'end of 14, the 3'end of SEQ ID NO: 14 is connected to SEQ ID NO: 24, and the SEQ ID NO: 15 region of the DNA construct is connected to SEQ ID NO: 16 (which has a 3'cholesterol tether) Hybridization.
- This DNA construct B is similar to the construct used in Example 4, except that the region marked E corresponds to SEQ ID NO: 24.
- DNA construct B For DNA construct B, observe the DNA movement controlled by Pif1-like helicase, and the DNA movement controlled by Sph CBH8-A94C/A361C/C136A, Eph Pei26-D99C/A366C/C141A or Pph PspYZU05-D104C/A375C/C146A The results are shown in Figure 7-9.
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Immunology (AREA)
- Analytical Chemistry (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Oscillators With Electromechanical Resonators (AREA)
- Steroid Compounds (AREA)
- Hybrid Cells (AREA)
Abstract
Description
Pif1-like解旋酶 | SEQ ID NO | 1A结构域 | 2A结构域 |
Pba-PM2 | 1 | M1-L88,M106-V181 | E264-P278,N296-A394 |
Aph-Acj61 | 2 | M1-L88,I106-M180 | E265-P279,N297-A392 |
Aph-PX29 | 3 | M1-L88,K110-V182 | T266-P280,N298-S403 |
Avi-Aeh1 | 4 | M1-L88,K110-V182 | T266-P280,N298-S404 |
Sph-CBH8 | 5 | M1-L84,M103-K177 | E260-P274,N292-A391 |
Eph-Pei26 | 6 | M1-L90,M108-M183 | E266-P280,N298-A396 |
Aph-AM101 | 7 | M1-L99,D117-M191 | T276-P290,N308-P402 |
PphPspYZU05 | 8 | M1-L95,I113-K187 | D274-P288,N306-A404 |
Eph-EcS1 | 9 | M1-L85,M103-K177 | E260-P274,N292-A391 |
Eph-Cronus2 | 10 | M1-L87,I105-K180 | E265-P279,H297-A393 |
Mph-MP1 | 11 | M1-L96,P114-K184 | E264-P278,N296-P389 |
Claims (38)
- 一种Pif1-like解旋酶,其特征在于,包括在Pif1-like解旋酶的塔结构域、销结构域和/或1A结构域引入至少一个半胱氨酸残基和/或至少一个非天然氨基酸,其中所述Pif1-like解旋酶保留其控制多核苷酸移动的能力。
- 根据权利要求1所述的Pif1-like解旋酶,其中所述Pif1-like解旋酶包括:(a)SEQ ID NO:1的变体,其中在塔结构域的E264-P278和N296-A394,和/或销结构域的K89-E105,和/或1A结构域的M1-L88和M106-V181中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(b)SEQ ID NO:2的变体,其中在塔结构域的E265-P279和N297-A392,和/或销结构域的K89-D105,和/或1A结构域的M1-L88和I106-M180中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(c)SEQ ID NO:3的变体,其中在塔结构域的T266-P280和N298-S403,和/或销结构域的K89-A109,和/或1A结构域的M1-L88和K110-V182中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(d)SEQ ID NO:4的变体,其中在塔结构域的T266-P280和N298-S404,和/或销结构域的K89-A109,和/或1A结构域的M1-L88和K110-V182中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(e)SEQ ID NO:5的变体,其中在塔结构域的E260-P274和N292-A391,和/或销结构域的K86-E102,和/或1A结构域的M1-L84和M103-K177中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(f)SEQ ID NO:6的变体,其中在塔结构域的E266-P280和N298-A396,和/或销结构域的K91-E107,和/或1A结构域的M1-L90和M108-M183中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(g)SEQ ID NO:7的变体,其中在塔结构域的T276-P290和N308-P402,和/或销结构域的K100-D116,和/或1A结构域的M1-L99和D117-M191中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(h)SEQ ID NO:8的变体,其中在塔结构域的D274-P288和N306-A404,和/或销结构域的K95-E112,和/或1A结构域的M1-L95和I113-K187中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(i)SEQ ID NO:9的变体,其中在塔结构域的E260-P274和N292-A391,和/或销结构域的K86-E102,和/或1A结构域的M1-L85和M103-K177中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;(j)SEQ ID NO:10的变体,其中在塔结构域的E265-P279和H297-A393,和/或销结构域的K88-E104,和/或1A结构域的M1-L87和I105-K180中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸;或,(k)SEQ ID NO:11的变体,其中在塔结构域的E264-P278和N296-P389,和/或销结构域的K97-A113,和/或1A结构域的M1-L96和P114-K184中引入了至少一个半胱氨酸残基和/或至少一个非天然氨基酸。
- 根据权利要求2所述的Pif1-like解旋酶,其中所述解旋酶包括:(a)SEQ ID NO:11的变体,其包括(i)E105C和/或A362C;(ii)E104C和/或K360C;(iii)E104C和/或A362C;(iv)E104C和/或Q363C;(v)E104C和/或K366C;(vi)E105C和/或M356C;(vii)E105C和/或K360C;(viii)E104C和/或M356C;(ix)E105C和/或Q363C;(x)E105C和/或K366C;(xi)F108C和/或M356C;(xii)F108C和/或K360C;(xiii)F108C和/或A362C;(xiv)F108C和/或Q363C;(xv)F108C和/或K366C;(xvi)K134C和/或M356C;(xvii)K134C和/或K360C;(xviii)K134C和/或A362C;(xix)K134C和/或Q363C;(xx)K134C和/或K366C;(xxi)(i)到(xx)任一个和G359C;(xxii)(i)到(xx)任一个和Q111C;(xxiii)(i)到(xx)任一个和I138C;(xxiv)(i)到(xx)任一个和Q111C和I138C;(xxv)E105C和/或F377C;(xxvi)Y103L,E105Y,N352N,A362C和Y365N;(xxvii)E105Y和A362C;(xxviii)A362C;(xxix)Y103L,E105C,N352N,A362Y和Y365N;(xxx)Y103L,E105C和A362Y;(xxxi)E105C和/或A362C,和I280A;(xxxii)E105C和/或L358C;(xxxiii)E104C和/或G359C;(xxxiv)E104C和/或A362C;(xxxv)K106C和/或W378C;(xxxvi)T102C和/或N382C;(xxxvii)T102C和/或W378C;(xxxviii)E104C和/或Y355C;(xxxix)E104C和/或N382C;(xl)E104C和/或K381C;(xli)E104C和/或K379C;(xlii)E104C和/或D376C; (xliii)E104C和/或W378C;(xliv)E104C和/或W374C;(xlv)E105C和/或Y355C;(xlvi)E105C和/或N382C;(xlvii)E105C和/或K381C;(xlviii)E105C和/或K379C;(xlix)E105C和/或D376C;(l)E105C和/或W378C;(li)E105C和/或W374C;(lii)E105C和A362Y;(liii)E105C,G359C和A362C;或(liv)I2C,E105C和A362C;或者,(b)SEQ ID NO:1到10中任一个的变体,其包括在对应于SEQ ID NO:11的(i)到(liv)中任一个所限定的位置处的半胱氨酸残基。
- 根据权利要求1-3任一所述的Pif1-like解旋酶,所述Pif1-like解旋酶的氨基酸序列为SEQ ID NO:1到11所示氨基酸序列或与SEQ ID NO:1到11所示氨基酸序列具有至少30%、至少40%、至少50%、60%、至少70%、至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、至少99%或至少99.9%的同源性并具有控制多核苷酸移动的能力。
- 根据权利要求1-4任一所述的Pif1-like解旋酶,其特征在于,所述引入的半胱氨酸与半胱氨酸之间相互连接,引入的非天然氨基酸与非天然氨基酸之间相互连接,引入的半胱氨酸与非天然氨基酸之间相互连接,引入的半胱氨酸与天然氨基酸之间相互连接,或者引入的非天然氨基酸与天然氨基酸之间相互连接。
- 根据权利要求1-5任一所述的Pif1-like解旋酶,其特征在于,所述Pif1-like解旋酶还包括至少一个或多个的半胱氨酸被取代;优选为丙氨酸、丝氨酸或缬氨酸取代半胱氨酸。
- 根据权利要求1-6任一所述的Pif1-like解旋酶,其特征在于,所述的Pif1-like解旋酶包括:(a)SEQ ID NO:5的变体,并且所述一个或多个被取代的天然半胱氨酸残基为C109、C114、C136或C414中的一个或多个;或者,(b)SEQ ID NO:1、2、3、4、6、7、8、9、10和11中任一个的变体,并且所述一个或多个被取代的天然半胱氨酸残基对应于SEQ ID NO:5中C109、C114、C136或C414中的一个或多个。
- 根据权利要求1-7任一所述的Pif1-like解旋酶,其特征在于,所述的Pif1-like解旋酶还包括对表面带负电的氨基酸、极性或非极性氨基酸进行取代或修饰;进一步优选的,所述的取代包括带正电的氨基酸、不带电荷的氨基酸取代带负电的氨基酸、不带电荷的氨基酸、芳香族氨基酸、极性或非极性氨基酸。
- 根据权利要求8所述的Pif1-like解旋酶,其特征在于,其中所述Pif1-like解旋 酶包括:a)SEQ ID NO:11的变体并且所述一个或多个带负电荷的氨基酸为D5,E9,E24,E87,I65,S58,D209或D216中的一个或多个;或者,b)SEQ ID NO:1到10中任一个的变体,其中,所述一个或多个带负电荷的氨基酸对应于SEQ ID NO:11中D5,E9,E24,E87,I65,S58,D209或D216中的一个或多个。
- 根据权利要求9所述的Pif1-like解旋酶,其特征在于,其中所述Pif1-like解旋酶还包括:a)SEQ ID NO:1的变体,并且所述一个或多个带负电荷的氨基酸还包括S171;或者,b)SEQ ID NO:9的变体,并且所述一个或多个带负电荷的氨基酸还包括S173。
- 根据权利要求1-10任一所述的Pif1-like解旋酶,其特征在于,其中所述非天然氨基酸选自4-叠氮基-L-苯丙氨酸(Faz),4-乙酰基-L-苯丙氨酸,3-乙酰基-L-苯丙氨酸,4-乙酰乙酰基-L苯丙氨酸,O-烯丙基-L-酪氨酸,3-(苯基硒烷基)-L-丙氨酸,O-2-丙炔-1-基-L-酪氨酸,4(二羟基硼基)-L-苯丙氨酸,4-[(乙基硫烷基)羰基]-L-苯丙氨酸,(2S)-2-氨基-3-{4-[(丙烷-2-基硫烷基)羰基]苯基}丙酸,(2S)-2-氨基-3-{4-[(2-氨基-3-硫烷基丙酰基)氨基]苯基}丙酸,O-甲基-L-酪氨酸,4-氨基-L-苯丙氨酸,4-氰基-L-苯丙氨酸,3-氰基-L-苯丙氨酸,4-氟-L-苯丙氨酸,4-碘-L-苯丙氨酸,4-溴-L-苯丙氨酸,O-(三氟甲基)酪氨酸,4-硝基L-苯丙氨酸,3-羟基-L-酪氨酸,3-氨基-L-酪氨酸,3-碘-L-酪氨酸,4-异丙基-L-苯丙氨酸,3-(2-萘基)-L-丙氨酸,4-苯基-L-苯丙氨酸,(2S)-2-氨基-3-(萘-2-基氨基)丙酸,6-(甲基硫烷基)正亮氨酸,6-氧-L-赖氨酸,D-酪氨酸,(2R)-2-羟基-3-(4-羟基苯基)丙酸,(2R)-2氨基辛酸酯3-(2,2′-二吡啶-5-基)-D-丙氨酸,2-氨基-3-(8-羟基-3-喹啉基)丙酸,4-苯甲酰-L-苯丙氨酸,S-(2-硝基苄基)半胱氨酸,(2R)-2-氨基-3-[(2-硝基苄基)硫烷基]丙酸,(2S)-2-氨基-3-[(2-硝基苄基)氧基]丙酸,O-(4,5-二甲氧基-2-硝基苄基)-L-丝氨酸,(2S)-2-氨基-6-({[(2-硝基苄基)氧基]羰基}氨基)己酸,O-(2-硝基苄基)-L-酪氨酸,2-硝基苯丙氨酸,4-[(E)-苯基二氮烯基]-L-苯丙氨酸,4-[3-(三氟甲基)-3H-二吖丙啶基-3基]-D-苯丙氨酸,2-氨基-3-[[5-(二甲基氨基)-1-萘基]磺酰基氨基]丙酸,(2S)-2-氨基4-(7-羟基-2-氧-2H-色烯-4-基)丁酸,(2S)-3-[(6-乙酰基萘-2-基)氨基]-2-氨基丙酸,4(羧基甲基)苯丙氨酸,3-硝基-L-酪氨酸,O-硫基-L-酪氨酸,(2R)-6-乙酰氨基-2-氨基己酸酯,1-甲基组氨酸,2-氨基壬酸,2-氨基癸酸,L-同质半胱氨酸,5-硫烷基正 缬氨酸,6-硫烷基-L-正亮氨酸,5-(甲基硫烷基)-L-正缬氨酸,N6-{[(2R,3R)-3-甲基-3,4-二氢-2H-吡咯2-基]羰基}-L-赖氨酸,N6-[(苄基氧基)羰基]赖氨酸,(2S)-2-氨基-6-[(环戊基羰基)氨基]己酸,N6-[(环戊基氧基)羰基]-L-赖氨酸,(2S)-2-氨基-6-{[(2R)-四氢呋喃-2-基羰基]氨基}己酸,(2S)-2-氨基-8-[(2R,3S)-3-乙炔基四氢呋喃-2-基]-8-氧基辛酸,N6-(叔丁氧基羰基)-L-赖氨酸,(2S)-2-羟基-6-({[(2-甲基-2-丙烷基)氧基]羰基}氨基)己酸,N6-[(烯丙氧基)羰基]赖氨酸,(2S)-2-氨基-6-({[(2-叠氮苄基)氧基]羰基}氨基)己酸,N6L-脯氨酰基-L-赖氨酸,(2S)-2-氨基-6-{[(丙-2-炔-1-基氧基)羰基]氨基}己酸或N6-[(2叠氮乙氧基)羰基]-L-赖氨酸。
- 根据权利要求1-11任一所述的Pif1-like解旋酶,其特征在于,所述的Pif1-like解旋酶还包括:(a)至少一个与单链DNA或双链DNA中一个或多个核苷酸相互作用的氨基酸被取代;和/或,(b)至少一个与跨膜孔相互作用的氨基酸被取代,其中,所述Pif1-like解旋酶具有控制多核苷酸移动的能力。
- 根据权利要求12所述的Pif1-like解旋酶,其中,在(a)中,用包含较大侧链的氨基酸取代至少一个与单链或双链DNA中一个或多个核苷酸的糖和/或碱基相互作用的氨基酸。
- 根据权利要求13所述的Pif1-like解旋酶,其中,所述Pif1-like解旋酶包括:(a)SEQ ID NO:11的变体,其中,所述至少一个与单链或双链DNA中一个或多个核苷酸的糖和/或碱基相互作用的氨基酸是P73,H93,N99,F109,I280,A161,F130,D132,D162,D163,E277,K415,Q291,H396,Y244或P100中的至少一个;或者,(b)SEQ ID NO:1到10中任一的变体,其中,所述至少一个与单链或双链DNA中一个或多个核苷酸的糖和/或碱基相互作用的氨基酸对应于SEQ ID NO:11中的P73,H93,N99,F109,I280,A161,F130,D132,D162,D163,E277,K415,Q291,H396,Y244或P100中的至少一个。
- 根据权利要求13或14所述的Pif1-like解旋酶,其中,所述较大侧链包括增加数目的碳原子,具有增加的长度,增加的分子体积和/或具有增加的范德华体积。
- 根据权利要求13-15任一所述的Pif1-like解旋酶,其中,所述较大侧链增加了所述至少一个氨基酸与所述单链或双链DNA中一个或多个核苷酸之间的(i)静电相互作用;(ii)氢键和/或(iii)阳离子-pi相互作用。
- 根据权利要求13-16任一所述的Pif1-like解旋酶,其中,所述较大侧链的氨基酸不是丙氨酸(A)、半胱氨酸(C)、甘氨酸(G)、硒代半胱氨酸(U)、甲硫氨酸(M)、天冬氨酸(D)或谷氨酸(E)。
- 根据权利要求12-17任一所述的Pif1-like解旋酶,其中,A)组氨酸(H)被(i)精氨酸(R)或赖氨酸(K);(ii)谷氨酰胺(Q)或天冬酰胺(N);或(iii)苯丙氨酸(F)、酪氨酸(Y)或色氨酸(W)取代;B)天冬酰胺(N)被(i)精氨酸(R)或赖氨酸(K);(ii)谷氨酰胺(Q)或组氨酸(H);或(iii)苯丙氨酸(F)、酪氨酸(Y)或色氨酸(W)取代;C)脯氨酸(P)被(i)精氨酸(R)或赖氨酸(K);(ii)谷氨酰胺(Q)、天冬酰胺(N)、苏氨酸(T)或组氨酸(H);(iii)酪氨酸(Y)、苯丙氨酸(F)或色氨酸(W);或(iv)亮氨酸(L)、缬氨酸(V)或异亮氨酸(I)取代;D)苯丙氨酸(F)被(i)精氨酸(R)或赖氨酸(K);(ii)组氨酸(H);或(iii)酪氨酸(Y)或色氨酸(W)取代;E)天冬氨酸(D)被(i)精氨酸(R)或赖氨酸(K);(ii)谷氨酰胺(Q)、天冬酰胺(N)或组氨酸(H);或(iii)苯丙氨酸(F)、酪氨酸(Y)或色氨酸(W)取代;F)缬氨酸(V)被(i)精氨酸(R)或赖氨酸(K);(ii)谷氨酰胺(Q)、天冬酰胺(N)或组氨酸(H);(iii)苯丙氨酸(F)、酪氨酸(Y)或色氨酸(W);或(iv)异亮氨酸(I)或亮氨酸(L)取代;G)丝氨酸(S)被(i)精氨酸(R)或赖氨酸(K);(ii)谷氨酰胺(Q)、天冬酰胺(N)或组氨酸(H);(iii)苯丙氨酸(F)、酪氨酸(Y)或色氨酸(W);或(iv)异亮氨酸(I)或亮氨酸(L)取代;和/或,H)酪氨酸(Y)被(i)精氨酸(R)或赖氨酸(K);或(ii)色氨酸(W)取代。
- 根据权利要求12-19任一所述的Pif1-like解旋酶,其中,在(a)中,至少一个与单链DNA或双链DNA中一个或多个核苷酸的一个或多个磷酸基团相互作用的氨基 酸被取代。
- 根据权利要求20所述的Pif1-like解旋酶,其中,所述Pif1-like解旋酶包括:(a)SEQ ID NO:11的变体,其中,所述至少一个与ssDNA或dsDNA中一个或多个核苷酸的一个或多个磷酸基团相互作用的氨基酸是H75,T91,S94,K97,N246,N247,N284,K288,N297,T394或K397中的至少一个;或者,(b)SEQ ID NO:1到10中任一的变体,其中,所述至少一个与ssDNA或dsDNA中一个或多个核苷酸的一个或多个磷酸基团相互作用的氨基酸是与SEQID NO:11中的H75,T91,S94,K97,N246,N247,N284,K288,N297,T394或K397相对应的至少一个氨基酸。
- 根据权利要求21所述的Pif1-like解旋酶,其中,a)组氨酸(H)被(i)精氨酸(R)或赖氨酸(K);(ii)天冬酰胺(N)、丝氨酸(S)、谷氨酰胺(Q)或苏氨酸(T);或(iii)苯丙氨酸(F)、色氨酸(W)或酪氨酸(Y)取代;b)苏氨酸(T)被(i)精氨酸(R)、组氨酸(H)或赖氨酸(K);(ii)天冬酰胺(N)、丝氨酸(S)、谷氨酰胺(Q)或组氨酸(H);或(iii)苯丙氨酸(F)、色氨酸(W)、酪氨酸(Y)或组氨酸(H)取代;c)丝氨酸被(i)精氨酸(R)、组氨酸(H)或赖氨酸(K);(ii)天冬酰胺(N)、谷氨酰胺(Q)、苏氨酸(T)或组氨酸(H);或(iii)苯丙氨酸(F)、色氨酸(W)、酪氨酸(Y)或组氨酸(H)取代;d)天冬酰胺(N)被(i)精氨酸(R)、组氨酸(H)或赖氨酸(K);(ii)丝氨酸(S)、谷氨酰胺(Q)、苏氨酸(T)或组氨酸(H);或(iii)苯丙氨酸(F)、色氨酸(W)、酪氨酸(Y)或组氨酸(H)取代;和/或,e)赖氨酸(K)被(i)精氨酸(R)或组氨酸(H);(ii)天冬酰胺(N)、丝氨酸(S)、谷氨酰胺(Q)、苏氨酸(T)或组氨酸(H);或(iii)苯丙氨酸(F)、色氨酸(W)、酪氨酸(Y)或组氨酸(H)取代。
- 根据权利要求20-22任一所述的Pif1-like解旋酶,其中,所述Pif1-like解旋酶是SEQ ID NO:11的变体,其包含(a)至(k)中的一个或多个,其中,(a)H75N,H75Q,H75K或H75F;(b)T91K,T91Q或T91N;(c)S94H,S94N,S94K,S94T,S94R或S94Q;(d)K97Q,K97H或K97Y;(e)N246H或N246Q;(f)N247Q或N247H;(g)N284H或N284Q;(h)K288Q或K288H;(i)N297Q,N297K或N297H;(j)T394K,T394H或T394N;或(k)K397R,K397H或K397Y。
- 根据权利要求12-23任一所述的Pif1-like解旋酶,其中,所述解旋酶包括至 少一个与跨膜孔相互作用的氨基酸被取代:(a)SEQ ID NO:11的变体,其中,所述Pif1-like解旋酶包含在(a)E196(b)W202(c)N199或(d)G201上的一个或多个取代;或者,(b)SEQ ID NO:1到10中任一的变体,其中,所述Pif1-like解旋酶包含在(a)E196(b)W202(c)N199或(d)G201相对应的至少一个取代。
- 根据权利要求1-24任一所述的Pif1-like解旋酶,其中,所述Pif1-like解旋酶是SEQ ID NO:11的变体,其包含在以下位点上的取代:-F109/E196/H75,例如,F109W/E196L/H75N,F109W/E196L/H75Q,F109W/E196L/H75K或者F109W/E196L/H75F;-F109/E196/T91,例如,F109W/E196L/T91K,F109W/E196L/T91Q或者F109W/E196L/T91N;-F109/S94/E196,例如,F109W/S94H/E196L,F109W/S94T/E196L,F109W/S94R/E196L,F109W/S94Q/E196L,F109W/S94N/E196L或者F109W/S94K/E196L;F109/N99/E196,例如,F109W/N99R/E196L,F109W/N99H/E196L,F109W/N99W/E196L或者F109W/N99Y/E196L;-F109/S94/E196/I280,例如,F109W/S94H/E196L/I280K;-F109/P100/E196,例如,F109W/P100L/E196L,F109W/P100V/E196L,F109W/P100I/E196L或者F109W/P100T/E196L;-F109/D132/E196,例如,F109W/D132H/E196L,F109W/D132Y/E196L或者F109W/D132K/E196L;-F109/A161/E196,例如,F109W/A161I/E196L,F109W/A161L/E196L,F109W/A161N/E196L,F109W/A161W/E196L或者F109W/A161H/E196L;-F109/D163/E196,例如,F109W/D163W/E196L,F109W/D163F/E196L,F109W/D163Y/E196L,F109W/D163H/E196L,F109W/D163I/E196L,F109W/D163L/E196L或者F109W/D163V/E196L;-F109/Y244/E196,例如,F109W/Y244W/E196L,F109W/Y244Y/E196L或者F109W/Y244H/E196L;-F109/N246/E196,例如,F109W/N246H/E196L或者F109W/N246Q/E196L;-F109/E196/I280,例如,F109W/E196L/I280K,F109W/E196L/I280H,F109W/E196L/I280W或者F109W/E196L/I280R;-F109/E196/Q291,例如,F109W/E196L/Q291K,F109W/E196L/Q291R,F109W/E196L/Q291W或者F109W/E196L/Q291F;-F109/N297/E196,例如,F109W/N297Q/E196L,F109W/N297K/E196L或者F109W/N297H/E196L;-F109/T394/E196,例如,F109W/T394K/E196L,F109W/T394H/E196L或者F109W/T394N/E196L;-F109/H396/E196,例如,F109W/H396Y/E196L,F109W/H396F/E196L,F109W/H396Q/E196L或者F109W/H396K/E196L;-F109/K397/E196,例如,F109W/K397R/E196L,F109W/K397H/E196L或者F109W/K397Y/E196L;或者,-F109/Y416/E196,例如,F109W/Y416W/E196L或者F109W/Y416R/E196L。
- 一种构建体,其特征在于,所述的构建体包含至少一个权利要求1-25任一所述的Pif1-like解旋酶。
- 根据权利要求26所述的构建体,其特征在于,所述的构建体还包含多核苷酸结合部分。
- 一种核酸,其特征在于,所述的核酸编码权利要求1-25任一所述的Pif1-like解旋酶或权利要求26-27任一所述的构建体。
- 一种表达载体,其特征在于,所述的表达载体包含权利要求28所述的核酸。
- 一种宿主细胞,其特征在于,所述的宿主细胞包含权利要求28所述的核酸或权利要求29所述的表达载体。
- 一种控制多核苷酸移动的方法,其特征在于,所述的方法包括将权利要求1-25任一所述的Pif1-like解旋酶或权利要求26-27任一所述的构建体与多核苷酸接触。
- 一种表征目标多核苷酸的方法,其特征在于,所述的方法包括:I)将权利要求1-25任一所述的Pif1-like解旋酶或权利要求26-27任一所述的构建体,与目标多核苷酸以及孔接触,使得Pif1-like解旋酶或构建体控制目标多核苷酸穿过孔的移动;并II)获取目标多核苷酸中的核苷酸与所述孔相互作用时的一个或多个特征,以表征所述目标多核苷酸。
- 根据权利要求32所述的方法,其特征在于,所述的方法还包括横跨与所述解旋酶或构建体,和目标多核苷酸接触的孔施加势差的步骤;优选的,所述的孔选自生物孔、固态孔或生物与固态杂交的孔。
- 根据权利要求32或33所述的方法,其特征在于,所述的目标多核苷酸为单链、双链或至少一部分是双链的。
- 根据权利要求32-34任一所述的方法,其特征在于,所述的一个或多个特征选自目标多核苷酸的来源、长度、同一性、序列、二级结构或目标多核苷酸是否被修饰;优选的,所述的一个或多个特征通过电测量和/或光学测量进行。
- 一种表征目标多核苷酸的产品,其特征在于,所述的产品包含权利要求1-25任一所述的Pif1-like解旋酶、权利要求26-27任一所述的构建体、权利要求28所述的核酸、权利要求29所述的表达载体或权利要求30所述的宿主细胞,和孔。
- 根据权利要求36所述的产品,其特征在于,所述的产品选自试剂盒、装置或传感器。
- 权利要求1-25任一所述的Pif1-like解旋酶、权利要求26-27任一所述的构建体、权利要求28所述的核酸、权利要求29所述的表达载体、权利要求30所述的宿主细胞或权利要求36-37任一所述的产品在表征目标多核苷酸或控制目标多核苷酸穿过孔的移动中的应用。
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022551797A JP2023515597A (ja) | 2020-06-19 | 2020-06-19 | Pif1様ヘリカーゼ及びその使用 |
KR1020227026798A KR20220124741A (ko) | 2020-06-19 | 2020-06-19 | Pif1 유사 헬리카제 및 이의 용도 |
PCT/CN2020/097126 WO2021253410A1 (zh) | 2020-06-19 | 2020-06-19 | 一种Pif1-like解旋酶及其应用 |
EP20941459.8A EP4063382A4 (en) | 2020-06-19 | 2020-06-19 | PIF1-LIKE HELICASE AND ITS USE |
BR112022014144A BR112022014144A2 (pt) | 2020-06-19 | 2020-06-19 | Helicase do tipo pif1, constructo, ácido nucleico, vetor de expressão, célula hospedeira, método para controlar o movimento de um polinucleotídeo, método de caracterização de um polinucleotídeo alvo, produto para caracterizar um polinucleotídeo alvo e uso da helicase do tipo pif1 |
IL294925A IL294925A (en) | 2020-06-19 | 2020-06-19 | pif1-like helicase and applications therefor |
CN202080073171.0A CN114599666A (zh) | 2020-06-19 | 2020-06-19 | 一种Pif1-like解旋酶及其应用 |
US17/590,447 US20220290223A1 (en) | 2020-06-19 | 2022-02-01 | Pif1-like helicase and use thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/097126 WO2021253410A1 (zh) | 2020-06-19 | 2020-06-19 | 一种Pif1-like解旋酶及其应用 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/590,447 Continuation-In-Part US20220290223A1 (en) | 2020-06-19 | 2022-02-01 | Pif1-like helicase and use thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021253410A1 true WO2021253410A1 (zh) | 2021-12-23 |
Family
ID=79268961
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/097126 WO2021253410A1 (zh) | 2020-06-19 | 2020-06-19 | 一种Pif1-like解旋酶及其应用 |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP4063382A4 (zh) |
JP (1) | JP2023515597A (zh) |
KR (1) | KR20220124741A (zh) |
CN (1) | CN114599666A (zh) |
BR (1) | BR112022014144A2 (zh) |
WO (1) | WO2021253410A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116334030A (zh) * | 2023-05-06 | 2023-06-27 | 深圳市梅丽纳米孔科技有限公司 | 一种经修饰的CfM HL4解旋酶及其应用 |
EP4198125A3 (en) * | 2021-12-17 | 2023-06-28 | Qitan Technology Ltd., Beijing | Pif1-like helicase and use thereof |
WO2024056038A1 (zh) * | 2022-09-16 | 2024-03-21 | 北京普译生物科技有限公司 | 一种经修饰的CaPif1解旋酶及其应用 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117384878A (zh) * | 2023-07-21 | 2024-01-12 | 北京普译生物科技有限公司 | 一种经修饰的CtPif1解旋酶及其应用 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001030986A2 (en) * | 1999-10-22 | 2001-05-03 | Bristol-Myers Squibb Co. | Human pif1 dna helicase and uses thereof |
WO2013057495A2 (en) | 2011-10-21 | 2013-04-25 | Oxford Nanopore Technologies Limited | Enzyme method |
US20150065354A1 (en) | 2011-12-29 | 2015-03-05 | Oxford Nanopore Technologies Limited | Method for characterising a polynucelotide by using a xpd helicase |
CN107109380A (zh) | 2014-10-07 | 2017-08-29 | 牛津纳米孔技术公司 | 经修饰的酶 |
CN109219665A (zh) * | 2016-05-25 | 2019-01-15 | 牛津纳米孔技术公司 | 方法 |
CN111164096A (zh) * | 2019-09-29 | 2020-05-15 | 北京齐碳科技有限公司 | 一种Mmup单体变体及其应用 |
CN111235245A (zh) * | 2013-08-16 | 2020-06-05 | 牛津纳米孔技术公司 | 方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2879261C (en) * | 2012-07-19 | 2022-12-06 | Oxford Nanopore Technologies Limited | Modified helicases |
CN117947149A (zh) * | 2013-10-18 | 2024-04-30 | 牛津纳米孔科技公开有限公司 | 经修饰的酶 |
CN113930406B (zh) * | 2021-12-17 | 2022-04-08 | 北京齐碳科技有限公司 | 一种Pif1-like解旋酶及其应用 |
-
2020
- 2020-06-19 EP EP20941459.8A patent/EP4063382A4/en active Pending
- 2020-06-19 CN CN202080073171.0A patent/CN114599666A/zh active Pending
- 2020-06-19 BR BR112022014144A patent/BR112022014144A2/pt unknown
- 2020-06-19 WO PCT/CN2020/097126 patent/WO2021253410A1/zh active Application Filing
- 2020-06-19 JP JP2022551797A patent/JP2023515597A/ja active Pending
- 2020-06-19 KR KR1020227026798A patent/KR20220124741A/ko unknown
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001030986A2 (en) * | 1999-10-22 | 2001-05-03 | Bristol-Myers Squibb Co. | Human pif1 dna helicase and uses thereof |
WO2013057495A2 (en) | 2011-10-21 | 2013-04-25 | Oxford Nanopore Technologies Limited | Enzyme method |
US20150065354A1 (en) | 2011-12-29 | 2015-03-05 | Oxford Nanopore Technologies Limited | Method for characterising a polynucelotide by using a xpd helicase |
CN111235245A (zh) * | 2013-08-16 | 2020-06-05 | 牛津纳米孔技术公司 | 方法 |
CN107109380A (zh) | 2014-10-07 | 2017-08-29 | 牛津纳米孔技术公司 | 经修饰的酶 |
CN109219665A (zh) * | 2016-05-25 | 2019-01-15 | 牛津纳米孔技术公司 | 方法 |
CN111164096A (zh) * | 2019-09-29 | 2020-05-15 | 北京齐碳科技有限公司 | 一种Mmup单体变体及其应用 |
Non-Patent Citations (4)
Title |
---|
CAO, YING ET AL.: "Research Progress and Application of Nanopore Sequencing Technology", CHINESE JOURNAL OF BIOTECHNOLOGY, vol. 36, no. 5, 25 May 2020 (2020-05-25), pages 811 - 819, XP055882525 * |
CHEN WEI-FEI, DAI YANG-XUE, DUAN XIAO-LEI, LIU NA-NV, SHI WEI, LI NA, LI MING, DOU SHOU-XING, DONG YU-HUI, RETY STEPHANE, XI XU-GU: "Crystal structures of the BsPif1 helicase reveal that a major movement of the 2B SH3 domain is required for DNA unwinding", NUCLEIC ACIDS RESEARCH, vol. 44, no. 6, 7 April 2016 (2016-04-07), GB , pages 2949 - 2961, XP055882522, ISSN: 0305-1048, DOI: 10.1093/nar/gkw033 * |
NOBUAKI KONO; KAZUHARU ARAKAWA: "Nanopore sequencing: Review of potential applications in functional genomics", DEVELOPMENT GROWTH AND DIFFERENTIATION, vol. 61, no. 5, 29 April 2019 (2019-04-29), US , pages 316 - 326, XP071144236, ISSN: 0012-1592, DOI: 10.1111/dgd.12608 * |
See also references of EP4063382A4 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP4198125A3 (en) * | 2021-12-17 | 2023-06-28 | Qitan Technology Ltd., Beijing | Pif1-like helicase and use thereof |
WO2024056038A1 (zh) * | 2022-09-16 | 2024-03-21 | 北京普译生物科技有限公司 | 一种经修饰的CaPif1解旋酶及其应用 |
CN116334030A (zh) * | 2023-05-06 | 2023-06-27 | 深圳市梅丽纳米孔科技有限公司 | 一种经修饰的CfM HL4解旋酶及其应用 |
CN116334030B (zh) * | 2023-05-06 | 2024-01-16 | 深圳市梅丽纳米孔科技有限公司 | 一种经修饰的CfM HL4解旋酶及其应用 |
Also Published As
Publication number | Publication date |
---|---|
KR20220124741A (ko) | 2022-09-14 |
BR112022014144A2 (pt) | 2022-12-27 |
JP2023515597A (ja) | 2023-04-13 |
CN114599666A (zh) | 2022-06-07 |
EP4063382A1 (en) | 2022-09-28 |
EP4063382A9 (en) | 2022-11-16 |
EP4063382A4 (en) | 2023-08-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021253410A1 (zh) | 一种Pif1-like解旋酶及其应用 | |
US11525125B2 (en) | Modified helicases | |
US11965183B2 (en) | Modified enzymes | |
US20230212665A1 (en) | Enzyme stalling method | |
JP6776366B2 (ja) | 変異体ポア | |
KR102472805B1 (ko) | 돌연변이체 포어 | |
CN112646019B (zh) | 突变胞溶素孔 | |
EP4198125A2 (en) | Pif1-like helicase and use thereof | |
JP7499761B2 (ja) | 細孔 | |
CN112805393A (zh) | 一种解旋酶及其应用 | |
WO2022126304A1 (zh) | 一种经修饰的解旋酶及其应用 | |
EP4299746A1 (en) | Modified prp43 helicase and use thereof | |
US20220162568A1 (en) | Pore | |
US20220290223A1 (en) | Pif1-like helicase and use thereof | |
CN116334030B (zh) | 一种经修饰的CfM HL4解旋酶及其应用 | |
WO2023198911A2 (en) | Novel modified protein pores and enzymes | |
CN118256603A (zh) | 经修饰的酶 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20941459 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2020941459 Country of ref document: EP Effective date: 20220622 |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112022014144 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 20227026798 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2022551797 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 112022014144 Country of ref document: BR Kind code of ref document: A2 Effective date: 20220718 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 523442322 Country of ref document: SA |