CN116083393A - DNA assembly molecular machine and application thereof - Google Patents
DNA assembly molecular machine and application thereof Download PDFInfo
- Publication number
- CN116083393A CN116083393A CN202310140791.7A CN202310140791A CN116083393A CN 116083393 A CN116083393 A CN 116083393A CN 202310140791 A CN202310140791 A CN 202310140791A CN 116083393 A CN116083393 A CN 116083393A
- Authority
- CN
- China
- Prior art keywords
- dna
- seq
- domain
- amino acids
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000007702 DNA assembly Methods 0.000 title claims abstract description 73
- 230000000694 effects Effects 0.000 claims abstract description 36
- 238000003780 insertion Methods 0.000 claims abstract description 27
- 230000037431 insertion Effects 0.000 claims abstract description 27
- 210000003813 thumb Anatomy 0.000 claims abstract description 21
- 238000012163 sequencing technique Methods 0.000 claims abstract description 15
- 108060002716 Exonuclease Proteins 0.000 claims abstract description 13
- 102000013165 exonuclease Human genes 0.000 claims abstract description 13
- 210000003811 finger Anatomy 0.000 claims abstract description 11
- 238000001514 detection method Methods 0.000 claims abstract description 9
- 238000007672 fourth generation sequencing Methods 0.000 claims abstract description 7
- 150000001413 amino acids Chemical class 0.000 claims description 66
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 33
- 108020004414 DNA Proteins 0.000 claims description 23
- 230000003321 amplification Effects 0.000 claims description 19
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 19
- 238000000034 method Methods 0.000 claims description 17
- 230000035772 mutation Effects 0.000 claims description 17
- 238000006243 chemical reaction Methods 0.000 claims description 14
- 230000004544 DNA amplification Effects 0.000 claims description 10
- 239000013598 vector Substances 0.000 claims description 9
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 claims description 8
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 claims description 8
- 230000003197 catalytic effect Effects 0.000 claims description 8
- 238000007069 methylation reaction Methods 0.000 claims description 8
- 230000004048 modification Effects 0.000 claims description 8
- 238000012986 modification Methods 0.000 claims description 8
- 108020004707 nucleic acids Proteins 0.000 claims description 8
- 102000039446 nucleic acids Human genes 0.000 claims description 8
- 150000007523 nucleic acids Chemical class 0.000 claims description 8
- 238000005096 rolling process Methods 0.000 claims description 8
- 230000004543 DNA replication Effects 0.000 claims description 6
- -1 salt ions Chemical class 0.000 claims description 6
- 238000001712 DNA sequencing Methods 0.000 claims description 5
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 claims description 5
- 239000003153 chemical reaction reagent Substances 0.000 claims description 5
- 238000006073 displacement reaction Methods 0.000 claims description 5
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 claims description 4
- 239000001888 Peptone Substances 0.000 claims description 4
- 238000003559 RNA-seq method Methods 0.000 claims description 4
- 239000011575 calcium Substances 0.000 claims description 4
- 229910001424 calcium ion Inorganic materials 0.000 claims description 4
- 238000013412 genome amplification Methods 0.000 claims description 4
- 230000013595 glycosylation Effects 0.000 claims description 4
- 238000006206 glycosylation reaction Methods 0.000 claims description 4
- 230000011987 methylation Effects 0.000 claims description 4
- 239000002773 nucleotide Substances 0.000 claims description 4
- 125000003729 nucleotide group Chemical group 0.000 claims description 4
- 238000012070 whole genome sequencing analysis Methods 0.000 claims description 4
- 239000011701 zinc Substances 0.000 claims description 4
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 claims description 3
- 102000053602 DNA Human genes 0.000 claims description 3
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 claims description 3
- 229910052791 calcium Inorganic materials 0.000 claims description 3
- 239000010949 copper Substances 0.000 claims description 3
- 229910052725 zinc Inorganic materials 0.000 claims description 3
- 230000005730 ADP ribosylation Effects 0.000 claims description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 claims description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 claims description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 claims description 2
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 claims description 2
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 claims description 2
- 239000004396 Octenyl succinic acid modified gum arabic Substances 0.000 claims description 2
- 101100271190 Plasmodium falciparum (isolate 3D7) ATAT gene Proteins 0.000 claims description 2
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 claims description 2
- 230000006295 S-nitrosylation Effects 0.000 claims description 2
- 238000006640 acetylation reaction Methods 0.000 claims description 2
- 239000010941 cobalt Substances 0.000 claims description 2
- 229910017052 cobalt Inorganic materials 0.000 claims description 2
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 claims description 2
- 238000010276 construction Methods 0.000 claims description 2
- 229910052802 copper Inorganic materials 0.000 claims description 2
- 230000033444 hydroxylation Effects 0.000 claims description 2
- 238000005805 hydroxylation reaction Methods 0.000 claims description 2
- 230000003993 interaction Effects 0.000 claims description 2
- 230000029226 lipidation Effects 0.000 claims description 2
- 239000011777 magnesium Substances 0.000 claims description 2
- 229910052749 magnesium Inorganic materials 0.000 claims description 2
- WPBNNNQJVZRUHP-UHFFFAOYSA-L manganese(2+);methyl n-[[2-(methoxycarbonylcarbamothioylamino)phenyl]carbamothioyl]carbamate;n-[2-(sulfidocarbothioylamino)ethyl]carbamodithioate Chemical compound [Mn+2].[S-]C(=S)NCCNC([S-])=S.COC(=O)NC(=S)NC1=CC=CC=C1NC(=S)NC(=O)OC WPBNNNQJVZRUHP-UHFFFAOYSA-L 0.000 claims description 2
- 229910052759 nickel Inorganic materials 0.000 claims description 2
- 230000026731 phosphorylation Effects 0.000 claims description 2
- 238000006366 phosphorylation reaction Methods 0.000 claims description 2
- 229920001195 polyisoprene Polymers 0.000 claims description 2
- 239000001818 polyoxyethylene sorbitan monostearate Substances 0.000 claims description 2
- 239000001816 polyoxyethylene sorbitan tristearate Substances 0.000 claims description 2
- 229910052700 potassium Inorganic materials 0.000 claims description 2
- 239000011591 potassium Substances 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims description 2
- 230000003362 replicative effect Effects 0.000 claims description 2
- 229910052708 sodium Inorganic materials 0.000 claims description 2
- 239000011734 sodium Substances 0.000 claims description 2
- 230000035322 succinylation Effects 0.000 claims description 2
- 238000010613 succinylation reaction Methods 0.000 claims description 2
- 230000034512 ubiquitination Effects 0.000 claims description 2
- 238000010798 ubiquitination Methods 0.000 claims description 2
- 238000003745 diagnosis Methods 0.000 abstract description 4
- 229910021645 metal ion Inorganic materials 0.000 abstract description 4
- 238000000338 in vitro Methods 0.000 abstract description 3
- 238000011901 isothermal amplification Methods 0.000 abstract description 3
- 235000001014 amino acid Nutrition 0.000 description 52
- 229940024606 amino acid Drugs 0.000 description 52
- 108090000623 proteins and genes Proteins 0.000 description 26
- 102000004169 proteins and genes Human genes 0.000 description 24
- 235000018102 proteins Nutrition 0.000 description 23
- 210000004027 cell Anatomy 0.000 description 19
- 150000003839 salts Chemical class 0.000 description 8
- 239000000047 product Substances 0.000 description 7
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 229910001385 heavy metal Inorganic materials 0.000 description 4
- 238000012916 structural analysis Methods 0.000 description 4
- 230000005945 translocation Effects 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 230000015784 hyperosmotic salinity response Effects 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 239000011535 reaction buffer Substances 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000002515 oligonucleotide synthesis Methods 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- PPDNGMUGVMESGE-JTQLQIEISA-N (2s)-2-amino-3-(4-ethynylphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(C#C)C=C1 PPDNGMUGVMESGE-JTQLQIEISA-N 0.000 description 1
- JSXMFBNJRFXRCX-NSHDSACASA-N (2s)-2-amino-3-(4-prop-2-ynoxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OCC#C)C=C1 JSXMFBNJRFXRCX-NSHDSACASA-N 0.000 description 1
- NEMHIKRLROONTL-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-azidophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N=[N+]=[N-])C=C1 NEMHIKRLROONTL-QMMMGPOBSA-N 0.000 description 1
- ZXSBHXZKWRIEIA-JTQLQIEISA-N (2s)-3-(4-acetylphenyl)-2-azaniumylpropanoate Chemical compound CC(=O)C1=CC=C(C[C@H](N)C(O)=O)C=C1 ZXSBHXZKWRIEIA-JTQLQIEISA-N 0.000 description 1
- 125000004178 (C1-C4) alkyl group Chemical group 0.000 description 1
- 101150084750 1 gene Proteins 0.000 description 1
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 1
- 102220491082 ADP-ribosylation factor 6_D12A_mutation Human genes 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Natural products CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 102000006947 Histones Human genes 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- HBBGRARXTFLTSG-UHFFFAOYSA-N Lithium ion Chemical compound [Li+] HBBGRARXTFLTSG-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 102220469611 Protein unc-13 homolog A_D67A_mutation Human genes 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 241001052560 Thallis Species 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229960002684 aminocaproic acid Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009697 arginine Nutrition 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- IVRMZWNICZWHMI-UHFFFAOYSA-N azide group Chemical group [N-]=[N+]=[N-] IVRMZWNICZWHMI-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229940000635 beta-alanine Drugs 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 230000009849 deactivation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000004554 glutamine Nutrition 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 229910001416 lithium ion Inorganic materials 0.000 description 1
- 235000018977 lysine Nutrition 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 239000011572 manganese Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 235000006109 methionine Nutrition 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 102220172075 rs150884181 Human genes 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- UWHCKJMYHZGTIT-UHFFFAOYSA-N tetraethylene glycol Chemical compound OCCOCCOCCOCCO UWHCKJMYHZGTIT-UHFFFAOYSA-N 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 235000002374 tyrosine Nutrition 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1252—DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/07—Nucleotidyltransferases (2.7.7)
- C12Y207/07007—DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B50/00—Methods of creating libraries, e.g. combinatorial synthesis
- C40B50/06—Biochemical methods, e.g. using enzymes or whole viable microorganisms
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicinal Chemistry (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Immunology (AREA)
- Biomedical Technology (AREA)
- General Chemical & Material Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The invention discloses a DNA assembly molecular machine and application thereof, and relates to the technical field of molecular biology. The DNA assembly molecule machine comprises a palm domain, 2 terminal insertion domains, thumb domain, finger domain, and exonuclease activity domain. The DNA assembly molecule has good stability and is suitable for isothermal amplification. In addition, the DNA assembly molecule and the mutant thereof have DNA assembly functions under various metal ions and different temperatures, and can be applied to nanopore sequencing, single molecule sequencing, RCA library establishment, environment detection and in-vitro diagnosis.
Description
Technical Field
The invention relates to the technical field of molecular biology, in particular to a DNA assembly molecular machine and application thereof.
Background
The development of DNA sequencing technology has important significance for modern biological exploration, disease diagnosis and human tracing. With the progress of sequencing technology, the progress from indirect sequencing after amplification to in-situ single molecule sequencing has been advanced. Whether Pacbio tSMS single-molecule fluorescence sequencing technology or ONT electrophysiological sequencing technology, sequencing length, accuracy and efficiency are severely dependent on the molecular motor that controls DNA movement and assembly. Among them, pacbio is mainly utilized and the DNB-SEQ technology of Hua Dazhi relies on rolling circle amplification of phi29 polymerase, in which technology the phi29 polymerase is required to have higher thermostability and activity. In nanopore sequencing, the activity of the existing polymerase is too high, so that the DNA translocation through hole speed is too high, the DNA translocation perforation time is difficult to be increased from 10us to 100us, and the requirement on a DNA assembly molecular machine is that the DNA assembly molecular machine has higher salt tolerance and needs to bear the salt concentration of more than 300 mM.
The heat resistance and salt tolerance of the existing phi29 polymerase are respectively modified, for example, the heat stability of the commercial phi29 polymerase is improved through 73 point mutations in the patent WO2021248757A1, wherein the heat stability Tm value of the mutation combination M97T, Y K and E515S is improved from 48.5 ℃ to 50.8 ℃, so that the modification of the commercial phi29 polymerase is difficult, and the characteristics of the enzyme are difficult to change obviously. In patent CN201910778360 a novel salt tolerant polymerase is disclosed, which is resistant to sodium chloride at 300mM and becomes inactive after more than 500mM, compared to phi29 polymerase and Taq DNA polymerase, and which is prone to deactivation under calcium, zinc and lithium ion conditions. Therefore, further designing a novel DNA assembly molecular machine to improve salt tolerance, thermal stability, and tolerance to heavy metal ions is a technical bottleneck in the art to be solved.
In view of this, the present invention has been made.
Disclosure of Invention
The invention aims to provide a DNA assembly molecular machine and application thereof to solve the technical problems.
The invention is realized in the following way:
in a first aspect, the present invention provides a DNA assembly molecule machine consisting of 500-1000 amino acids comprising a palm domain, 2 terminal insertion domains, thumb domain, finger domain and exonuclease activity domain; wherein the palm domain consists of 150-220 amino acids, the thumb domain consists of 40-70 amino acids, the finger domain consists of 30-45 amino acids to form an alpha helix structure, and the exonuclease activity domain consists of 150-220 amino acids; the 2 terminal insertion domains comprise a terminal insertion domain 1 and a terminal insertion domain 2, wherein the terminal insertion domain 1 consists of 50-150 amino acids, and the terminal insertion domain 2 consists of 20-70 amino acids;
the DNA assembly molecular machine has the following functions:
the ability to interact with nucleotides through amino acid side chains; and the DNA assembly molecule machine has the ability to load and/or ligate DNA molecules.
The DNA assembly molecule may interact with the base of the free nucleotide through the amino acid side chain or with the phosphate backbone. In an alternative embodiment, the DNA assembly molecule has the function of loading and/or ligating primer strands, template strands.
The DNA assembly molecular machine consisting of the palm domain, the 2 terminal insertion domains, the thumb domain, the finger domain and the exonuclease activity domain has good stability and can be used for isothermal amplification. The DNA assembly molecule and the mutant thereof provided by the invention have the DNA assembly function under various metal ions and different temperatures, and can be applied to nanopore sequencing, single molecule sequencing, RCA library building, environment detection and in-vitro diagnosis. Particularly, the method has the amplification activity far superior to that of the existing DNA polymerase under the condition of high salt concentration, and can prolong the DNA translocation and perforation time by improving the salt concentration, thereby improving the accuracy of single-molecule sequencing and greatly reducing the loss of key sequencing information.
In an alternative embodiment, the palm domain consists of 150 to 200 amino acids, or 160 to 220 amino acids, or 180 to 220 amino acids.
In an alternative embodiment, the thumb domain consists of 40, 45, 50, 55, 60, 65 or 70 amino acids, forming a 30-45 angstrom long domain between the exonuclease active domain and the palm domain that prevents dissociation of enzyme from DNA during DNA assembly.
In an alternative embodiment, the finger domain is an alpha helix structure of 30, 32, 36, 38, 40 or 45 amino acids, the primary function of the segment being to bind DNA and assist in the function of exonuclease activity.
In an alternative embodiment, the exonuclease activity domain consists of 150, 160, 170, 180, 190, 200, 210 or 220 amino acids.
The terminal insertion domain comprises a terminal insertion domain 1 and a terminal insertion domain 2, wherein the terminal insertion domain 1 is composed of 50-150 amino acids, the terminal insertion domain 2 is composed of 20-70 amino acids, so that the template strand and the non-template strand are effectively separated, and effective strand displacement and continuity are facilitated.
In a preferred embodiment of the invention, the amino acid type of each domain of the DNA-assembling molecular machine is selected from at least one of the following: natural amino acids, unnatural amino acids, modified natural amino acids, modified unnatural amino acids.
The natural amino acid is selected from glycine, alanine, valine, leucine, isoleucine, phenylalanine, proline, serine, threonine, histidine, tryptophan, cysteine, asparagine, aspartic acid, glutamic acid, glutamine, lysine, tyrosine, methionine, arginine, cystine, or C1-C4 alkyl esters of the above amino acids.
The unnatural amino acid is selected from the group consisting of p-acetyl-phenylalanine, p-ethynyl-phenylalanine, p-propargyloxyphenylalanine, p-azido-phenylalanine, beta-alanine (beta Ala), 6-aminocaproic acid (Aca), and tetraethylene glycol (Teg, NH) with an amino group at one end and a carboxyl group at one end 2 -CH 2 CH 2 -O-CH 2 CH 2 -O-CH 2 CH 2 -O-CH 2 CH 2 -COOH), NAEK, lys-azido, or other containing one, two or three of biazidine, azide structures.
The above-mentioned modifications are selected from: at least one of glycosylation, phosphorylation, ubiquitination, S-nitrosylation, methylation, N-acetylation, succinylation, hydroxylation, lipidation, and polyisoprene.
In an alternative embodiment, the methylation is selected from N-methylation or O-methylation;
in an alternative embodiment, the glycosylation is selected from ADP ribosylation.
In a preferred embodiment of the invention, the palm domain has two amino acid sequences forming a β -sheet.
The two amino acid sequences forming the beta sheet are respectively: EWKFKMV and ATAT;
deletion of the amino acid sequence of the β -sheet (EWKFKMV) (βdel) results in reduced assembly activity.
In an alternative embodiment, the thumb domain has an amino acid sequence that forms an alpha helix: DPNDFTEEEIKRKNI.
In some embodiments, the DNA assembly molecules provided herein include sequences on the thumb domain that form an alpha helix that is particularly critical for DNA assembly, as compared to existing DNA polymerases, and deletion of the sequences on the thumb domain that form the alpha helix results in loss of DNA amplification activity of the DNA assembly molecule. This structure belongs to the key sequence of the DNA assembly molecule.
The inventors found that: the formation of an alpha helix structure on the thumb domain plays a key role in enhancing the DNA amplification activity of DNA assembly molecules by: this structure increases the binding capacity of the DNA to the enzyme. As set forth in SEQ ID NO:1-7 forms an alpha helix structure on the thumb domain, as set forth in any one of SEQ ID NOs: 8-9 does not form an alpha helix structure on the thumb domain.
In a preferred embodiment of the invention, the DNA assembly machine comprises:
(a) The method comprises the following steps A mutant obtained by mutating the amino acid sequence shown in SEQ ID NO.1, wherein at least one amino acid at the site of interaction between the mutant and DNA is substituted or modified;
and/or, SEQ ID NO:1, inserting one or more amino acids into the amino acid sequence shown in figure 1;
and/or, SEQ ID NO:1, deleting one or more amino acids in the amino acid sequence shown in figure 1;
and/or, SEQ ID NO:1, carrying out side chain modification on the amino acid sequence shown in the formula 1;
or (b): a mutant obtained by mutating the amino acid sequence shown in SEQ ID No.2,SEQ ID NO:3,SEQ ID NO:4,SEQ ID NO:5,SEQ ID NO:6,SEQ ID NO:7,SEQ ID NO:8 or SEQ ID No. 9, wherein the mutation position of the mutant is determined by: the mutant was compared with SEQ ID NO:1, the mutation position of the mutant corresponds to the sequence shown in SEQ ID NO:1, and a functional site of the sequence shown in 1; the mutant is substituted, deleted, inserted or modified with at least one of the amino acids at the mutation position.
The sequence alignment method can use Blast website (https:// Blast. Ncbi. Nlm. Nih. Gov/Blast. Cgi) to carry out Protein Blast alignment; the same results can be obtained using other sequence alignment methods or tools well known in the art.
It should be noted that, from SEQ ID NO:2-9, and the mutation position of the mutant obtained by mutating the sequence shown in any one of SEQ ID NO:1, or may be identical or different (e.g., SEQ ID NOS: 2,3 and 5), the specific location of the mutation being determined from the alignment of the sequences described above, provided that it corresponds to the sequence of SEQ ID NO:1 is the mutation site of the invention.
SEQ ID NO:1 is derived from Bacillus phage (Bacillus nage), and has a length of 606aa and a sequence similarity of 58% with phi29. The UniProt database is numbered A0A1X9SGT8.
SEQ ID NO:2-9 with reference to table 1.
TABLE 1 Gene list
In a preferred embodiment of the invention, the process of (a): the amino acid substituted or modified in the amino acid sequence shown in SEQ ID NO.1 is at least one of D12, E14, T15, Y60, H62, N63, F66, D67, F70, V94, S123, L124, D146, Y149, D170, D257, V258, S260, Y262, T365, I372, W374, K378, K379, R338, K392, L393, N396, S397, Y399, G400, F402, K428, E429, T443, D465, D467, K507, Y509 and K547;
(b) The method comprises the following steps The functional site is SEQ ID NO:1, D12, E14, T15, Y60, H62, N63, F66, D67, F70, V94, S123, L124, D146, Y149, D170, D257, V258, S260, Y262, T365, I372, W374, K378, K379, R338, K392, L393, N396, S397, Y399, G400, F402, K428, E429, T443, D465, D467, K507, Y509, and K547.
SEQ ID NO:1, and the functions of the functional sites of the sequence shown in Table 2 are as follows:
TABLE 2 Key functional site of SEQ ID NO:1 (A0A 1X9SGT 8)
The inventors found that mutation of D12 protein resulted in a decrease in protein assembly activity, while simultaneous mutation of D12A and D67A resulted in a loss of protein assembly activity. This is quite different from the effect of the existing phi29 protein on the enhancement of DNA amplification activity after D12 and D67 site mutation.
In an alternative embodiment, at least one of the following positions shown in SEQ ID NO.2 is mutated: d27 E29, T30, Y77, H79, F83, D84, F87, L113, S138, L139, D161, Y164, D185, D265, V266, S268, Y270, S372, I379, K381, V385, K386, R394, K398, L399, N402, A403, Y405, G406, F408, E435, E436, T450, R455, D472, D474, K512, Y514, and K550;
in an alternative embodiment, at least one of the following positions shown in SEQ ID NO.3 is mutated: d9 E11, T12, Y56, H58, F62, D63, F66, T93, S122, L123, E145, Y148, D169, D249, V250, S252, Y254, V357, I364, R366, I370, K371, V381, K385, L386, N389, S390, Y392, G393, F395, K421, E423, T436, R440, D458, D460, K500, Y502 and K532;
in an alternative embodiment, at least one of the following positions shown in SEQ ID NO.5 is mutated: d272 E274, T275, Y322, H324, F328, D329, F332, S371, S395, L396, D418, Y421, D442, D532, V533, S535, Y537, V667, K676, I680, K681, R689, K693, L694, N697, N698, Y700, G701, M703, R728, A729, T744, R748, D771, D773, K812, Y814 and K834.
The corresponding sites for the four typical key functional sites of the polymerase are shown in Table 3:
in a preferred embodiment of the invention, the DNA assembly machine comprises:
and SEQ ID NO:1-9, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or at least 99.9% homologous.
The DNA assembly molecule machinery of the present invention is modulated by agonists, which may be used include: co 0.01-100mM 2+ ,Ni 2+ ,Cu 2+ ,Mn 2+ ,Zn 2+ ,Ca 2+ At least one ion of (a) and (b). In particular, under the condition of calcium ion concentration, the DNA assembly molecule provided by the invention shows higher DNA amplification activity than other proteins (phi 29).
In a second aspect, the invention also provides a nucleic acid molecule encoding a DNA assembly molecule machine as described above.
In a third aspect, the invention also provides a vector or recombinant cell comprising the nucleic acid molecule described above.
The term "vector" is used herein in its most general sense and includes any intermediate vector for a nucleic acid that enables the nucleic acid to be introduced, for example, into a prokaryotic and/or eukaryotic cell, and where appropriate, integrated into the genome. Vectors of this type are preferably replicated and/or expressed in cells. The vector comprises a plasmid, phagemid, phage or viral genome. The term "plasmid" as used herein generally refers to a construct of extrachromosomal genetic material, typically circular DNA duplex, that can replicate independently of chromosomal DNA.
The term "recombinant cell" refers to any cell that can be transformed or transfected with an exogenous nucleic acid. The term "recombinant cell" according to the invention comprises prokaryotic (e.g. E.coli) or eukaryotic cells (e.g. mammalian cells, in particular human cells, yeast cells and insect cells). Mammalian cells, such as cells from humans, mice, hamsters, pigs, goats or primates, are particularly preferred. Cells may be derived from a plurality of tissue types and comprise primary cells and cell lines. The nucleic acid may be present in the host cell in a single copy or in two or more copies, and in one embodiment is expressed in a recombinant cell.
In an alternative embodiment, the recombinant cell is a eukaryotic cell;
in an alternative embodiment, the recombinant cell is a mammalian cell;
in an alternative embodiment, the recombinant cell is HEK293.
In a fourth aspect, the present invention also provides a method of DNA replication or amplification comprising replicating or amplifying DNA using the DNA assembly molecule machine described above as a DNA polymerase.
In a fifth aspect, the invention also provides the use of a DNA-assembling molecule machine in any one of the following applications:
1) As a DNA polymerase;
2) As RNA polymerase;
3) Controlling the movement of the DNA or RNA;
4) Preparing a single molecule sequencing reagent or a kit;
5) Preparing a nanopore sequencing reagent or a kit;
6) Assembling a molecular machine chain reaction;
7) Catalyzing DNA replication and/or catalyzing DNA amplification;
8) Catalytic rolling circle amplification and/or catalytic multiplex strand displacement amplification;
9) Performing DNA sequencing or RNA sequencing or whole genome sequencing;
10 RCA library construction;
11 Genome amplification coverage detection;
12 Preparing a kit product for catalyzing DNA replication and/or catalyzing DNA amplification;
13 Preparing a product for catalytic rolling circle amplification and/or catalytic multiplex strand displacement amplification;
14 Preparing a product for DNA sequencing or RNA sequencing or whole genome sequencing;
15 Preparing a product for RCA banking;
16 Preparation of a product for genome amplification coverage detection.
In a sixth aspect, the present invention also provides a method for assembling a DNA molecule using the DNA assembling molecular machine described above, comprising performing a reaction in a reaction system; the reaction system comprises 0.3M-1.5M salt ions.
The activity of the DNA assembly molecular machine provided by the invention is higher than phi29 when the salt ion concentration is higher than 300 nM. The DNA assembly molecular machine provided by the invention can be suitable for high-salt concentration environments, such as nanopore sequencing.
In an alternative embodiment, the salt ion is sodium, potassium, calcium, magnesium, manganese, nickel, cobalt, copper, or zinc.
In an alternative embodiment, the reaction system further comprises calcium ions.
The invention has the following beneficial effects:
compared with the existing DNA polymerase, the DNA assembly molecule provided by the invention has a sequence capable of forming an alpha helix structure on the thumb domain, and the alpha helix structure is particularly critical for improving the DNA amplification activity of the DNA assembly molecule. The DNA assembly molecule has good stability and is suitable for isothermal amplification.
In addition, the DNA assembly molecule and the mutant thereof provided by the invention have the DNA assembly function under various metal ions and different temperatures, and can be applied to nanopore sequencing, single molecule sequencing, RCA library building, environment detection and in-vitro diagnosis. Particularly, the method has the amplification activity far superior to that of the existing DNA polymerase under the condition of high salt concentration, and can prolong the DNA translocation and perforation time by improving the salt concentration, thereby improving the accuracy of single-molecule sequencing and greatly reducing the loss of key sequencing information.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a diagram showing the structure prediction of a protein having the amino acid sequence shown in SEQ ID NO. 1;
FIG. 2 is a diagram showing the structure prediction of a protein having the amino acid sequence shown in SEQ ID NO. 2;
FIG. 3 is a diagram showing the structure prediction of a protein having the amino acid sequence shown in SEQ ID NO. 3;
FIG. 4 is a diagram showing the structure prediction of a protein having the amino acid sequence shown in SEQ ID NO. 4;
FIG. 5 is a diagram showing the comparison of the structure of a protein having the amino acid sequence shown in SEQ ID NO.1 with that of the existing DNA assembling molecule machine phi 29;
FIG. 6 is a diagram showing the result of SDS-PAGE of proteins;
FIG. 7 is a graph showing the detection results of rolling circle amplification by each DNA assembly molecule machine;
FIG. 8 is a graph of the effect of heavy metal ions on amplification;
FIG. 9 is a graph showing the effect of NaCl concentration on amplification;
FIG. 10 is a graph showing the results of activity detection of each mutant.
Detailed Description
Reference now will be made in detail to embodiments of the invention, one or more examples of which are described below. Each example is provided by way of explanation, not limitation, of the invention. Indeed, it will be apparent to those skilled in the art that various modifications and variations can be made to the present invention without departing from the scope or spirit of the invention. For example, features illustrated or described as part of one embodiment can be used on another embodiment to yield still a further embodiment.
Unless otherwise indicated, practice of the present invention will employ conventional techniques of cell biology, molecular biology (including recombinant techniques), microbiology, biochemistry and immunology, which are within the ability of a person skilled in the art. This technique is well explained in the literature, as is the case for molecular cloning: laboratory Manual (Molecular Cloning: A Laboratory Manual), second edition (Sambrook et al, 1989); oligonucleotide Synthesis (Oligonucleotide Synthesis) (M.J.Gait et al, 1984); animal cell culture (Animal Cell Culture) (r.i. freshney, 1987); methods of enzymology (Methods in Enzymology) (Academic Press, inc.), experimental immunology handbook (Handbook of Experimental Immunology) (D.M.Weir and C.C.Blackwell, inc.), gene transfer vectors for mammalian cells (Gene Transfer Vectors for Mammalian Cells) (J.M.Miller and M.P.calos, inc., 1987), methods of contemporary molecular biology (Current Protocols in Molecular Biology) (F.M.Ausubel et al, inc., 1987), PCR: polymerase chain reaction (PCR: the Polymerase Chain Reaction, inc., 1994), and methods of contemporary immunology (Current Protocols in Immunology) (J.E.Coligan et al, 1991), each of which is expressly incorporated herein by reference.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more clear, the technical solutions of the embodiments of the present invention will be clearly and completely described below. The specific conditions are not noted in the examples and are carried out according to conventional conditions or conditions recommended by the manufacturer. The reagents or apparatus used were conventional products commercially available without the manufacturer's attention.
The features and capabilities of the present invention are described in further detail below in connection with the examples.
Example 1
The present example provides a DNA assembly molecular machine, the structure of which is analyzed.
The amino acid sequence of the DNA assembly molecular machine of this example is shown in SEQ ID NO.1, and structural analysis is performed by structure prediction software to obtain the structure shown in FIG. 1. Wherein amino acids 1-192 constitute the exonuclease active domain, amino acids 192-268 and 437-549 constitute the palm domain, amino acids 269-367 constitute the terminal insertion domain 1, 368-404 constitute the finger domain, 405-436 constitute the terminal insertion domain 2, and amino acids 550-606 constitute the thumb structure.
This structure is longer than the structure of the existing DNA assembly molecular machine phi29 (PDB ID:1 xhx) (see FIG. 5), in which the structure is one more alpha helix on the thumb structure and the beta sheet structure on the palm structure.
Example 2
The present example provides a DNA assembly molecular machine, the structure of which is analyzed.
The amino acid sequence of the DNA assembly molecular machine of this example is shown in SEQ ID NO.2, and structural analysis is performed by structure prediction software to obtain the structure shown in FIG. 2. Wherein amino acids 1-205 constitute the exonuclease active domain, amino acids 206-276 and 443-551 constitute the palm domain, amino acids 277-375 constitute the terminal insertion domain 1, amino acids 376-409 constitute the finger domain, amino acids 410-442 constitute the terminal insertion domain 2, and amino acids 552-598 constitute the thumb structure.
Example 3
The present example provides a DNA assembly molecular machine, the structure of which is analyzed.
The amino acid sequence of the DNA assembly molecular machine of this example is shown in SEQ ID NO.3, and the structure shown in FIG. 3 is obtained by structural analysis by the structure prediction software. Wherein amino acids 1-189 constitute the exonuclease active domain, amino acids 190-260 and 429-537 constitute the palm domain, amino acids 261-360 constitute the terminal insertion domain 1, amino acids 361-396 constitute the finger domain, amino acids 397-428 constitute the terminal insertion domain 2, and amino acids 538-578 constitute the thumb structure.
Example 4
The present example provides a DNA assembly molecular machine, the structure of which is analyzed.
The amino acid sequence of the DNA assembly molecular machine of this example is shown in SEQ ID NO.5, and structural analysis is performed by structure prediction software to obtain the structure shown in FIG. 4. Wherein amino acids 265-462 constitute the exonuclease active domain, amino acids 463-545 and 737-839 constitute the palm domain, amino acids 546-670 constitute the terminal insertion domain 1, amino acids 671-703 constitute the finger domain, amino acids 704-736 constitute the terminal insertion domain 2, and amino acids 840-915 constitute the thumb structure.
Example 5
Expression of the proteins of this example.
The synthesis of the sequence SEQ ID No:1,2,3,5, 6. Transferring the obtained sequence into a pET30a vector to obtain an expression vector of the DNA assembly molecular machine. The expression vector was transformed into E.coli competent cell BL21 (DE 3) (Beijing full gold biotechnology Co., ltd.) and the ampicillin resistant medium was cultured and sequenced. After confirming successful transformation, picking a monoclonal colony, and inoculating the monoclonal colony into 5mL of LB liquid medium for overnight culture; then transferring the culture medium into 1L of LB liquid medium, culturing for 4 hours until the OD value is 0.6, adding IPTG until the final concentration of IPTG is 500uM, and continuing shaking culture overnight. Centrifuging the culture solution, collecting thalli, and lysing cells by an ultrasonic cell grinder, wherein the power of ultrasonic waves is 80W, the ultrasonic waves are 1s, and the interval is 2s, and the total duration is 20min. Then His-tag column affinity chromatography is used for purification to obtain protein (namely DNA assembly molecular machine).
The resulting DNA assembly molecule machine was subjected to SDS-PAGE protein electrophoresis to verify the size of the DNA assembly molecule machine protein. The molecular weight of the DNA assembly molecule machine is 76kD.
SDS-PAGE of proteins shows the results of FIG. 6. In FIG. 6, lane 1 is the phi29 protein and lane 2 is the protein SEQ ID NO:1, lane 3 is SEQ ID NO:2, lane 4 is SEQ ID NO:3, lane 5 is SEQ ID NO:5, lane 6 is SEQ ID NO:6, the protein size is about 70 kDa.
Example 6
This example uses rolling circle amplification to detect the function of the DNA assembly molecule machine.
Rolling circle amplification was performed using each of the histones, phi29 and control groups prepared in example 5 as a DNA assembly molecule machine. Specifically, 200nM DNA assembly molecule machine, 500nM hairpin template and reaction buffer solution are mixed to obtain a reaction system. Wherein: the reaction buffer consists of 50mM Tris-HCl,10 mM MgCl 2 10mM (NH) 4 ) 2 SO 4 4mM DTT and 200uM dNTPs, the pH of the reaction buffer at 25℃being 7.5; the nucleotide sequence of the hairpin template is as follows:
a sequence:
5’-p-TTGGCATATCGTACGATATGCCACCACCACCACCACAACCACCACCACCAAGCGATACGCGT ATCGCTTA-3’;
b sequence:
5-AAAAAAACCTTCCTTTT-ATATGCCAATAAGCGATACG-3’。
and (3) placing the reaction system at the temperature of 30 ℃ for reaction for 2 hours to obtain a reaction product.
As shown in FIG. 7, the DNA assembly activity of the DNA assembly molecular machine prepared in example 7 was found to be lower than that of phi29 polymerase under the test conditions, although the DNA assembly activity was able to be extended to form a product strand using a template strand as a template.
Experimental example 1
An influence factor experiment was performed according to example 6.
The reaction system for detecting heavy metal ions comprises: 50mM Tris-HCl,100uM metal ion, 10mM (NH) 4 ) 2 SO 4 ,4mM DTT,0.25mM dNTP,1uM DNA,0.8uM protein.
Referring to FIG. 8, heavy metal ion impact analysis showed that A0A1X9SGT8 exhibited higher activity than other proteins at calcium ion concentration.
Further, the activity of various salt concentrations on A0A1X9SGT8 was measured as in example 6 and was inferior to commercial phi29 at concentrations below 300mM NaCl as shown in FIG. 9.
And when the salt ion concentration is higher than 300nM, the activity is stronger than that of Phi29 (namely, phi29 shown in the figure).
This illustrates that the DNA assembly molecule machine of the present invention can be adapted for use in high salt concentration environments.
Experimental example 2
The activity of the mutant of SEQ ID NO.1 was verified.
Each mutant of A0A1X9SGT8 was assayed for activity using the assay format of example 6. The results are shown in FIG. 10:
deletion of amino acids 571-585 (DPNDFTEEEIKRKNI), i.e., alpha del, respectively, the DNA assembly molecule machine loses DNA amplification activity, indicating that this domain is essential for DNA assembly in the DNA assembly molecule machine and is a critical sequence provided by the present invention.
The importance of this domain was also demonstrated by reduced assembly activity for deletion of amino acids 525-532 (EWKFKMV) (. Beta.del). After mutation at positions D12 and D67, activity was likewise lost. However, the phi29 protein in the prior patent has the activity enhanced after mutation at the two sites. This demonstrates the significant variability of the DNA molecule assembly machine of the present invention from the prior patent proteins.
In FIG. 10, DD refers to the simultaneous mutations D12 and D67.Control 100mM EDTA was added to the WT group.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (11)
1. A DNA assembly molecule machine comprising a palm domain, a 2 terminal insertion domain, a thumb domain, a finger domain, and an exonuclease activity domain; wherein the palm domain consists of 150-220 amino acids, the thumb domain consists of 40-70 amino acids, the finger domain consists of 30-45 amino acids to form an alpha-helix structure, and the exonuclease activity domain consists of 150-220 amino acids; the 2 terminal insertion domains include a terminal insertion domain 1 and a terminal insertion domain 2, the terminal insertion domain 1 consisting of 50-150 amino acids, the terminal insertion domain 2 consisting of 20-70 amino acids;
the DNA assembly molecular machine has the following functions:
the ability to interact with nucleotides through amino acid side chains; and the DNA assembly molecule machine has the ability to load and/or ligate DNA molecules.
2. The DNA-assembling molecular machine of claim 1, wherein the amino acid type of each domain of the DNA-assembling molecular machine is selected from at least one of the following: natural amino acids, unnatural amino acids, modified natural amino acids, modified unnatural amino acids;
wherein the modification is selected from: at least one of glycosylation, phosphorylation, ubiquitination, S-nitrosylation, methylation, N-acetylation, succinylation, hydroxylation, lipidation, and polyisoprene;
preferably, the methylation is selected from N-methylation or O-methylation;
preferably, the glycosylation is selected from ADP ribosylation.
3. The DNA-assembling molecular machine of any one of claims 1-2, wherein the palm domain has two amino acid sequences that form a β -sheet;
preferably, the two β -sheet forming amino acid sequences are:
EWKFKMV and ATAT;
preferably, the thumb domain comprises a sequence forming an alpha helix, and the amino acid sequence forming an alpha helix on the thumb domain is as follows: DPNDFTEEEIKRKNI.
4. The DNA assembly molecular machine of claim 1 or 2, wherein the DNA assembly molecular machine comprises:
(a) The method comprises the following steps A mutant obtained by mutating the amino acid sequence shown in SEQ ID NO.1, wherein at least one amino acid at the site of interaction between the mutant and DNA is substituted or modified;
and/or, SEQ ID NO:1, inserting one or more amino acids into the amino acid sequence shown in figure 1;
and/or, SEQ ID NO:1, deleting one or more amino acids in the amino acid sequence shown in figure 1;
and/or, SEQ ID NO:1, carrying out side chain modification on the amino acid sequence shown in the formula 1;
or (b): a mutant obtained by mutating the amino acid sequence shown in SEQ ID No.2,SEQ ID NO:3,SEQ ID NO:4,SEQ ID NO:5,SEQ ID NO:6,SEQ ID NO:7,SEQ ID NO:8 or SEQ ID No. 9, wherein the mutation position of the mutant is determined by: comparing the mutant with SEQ ID NO:1, the mutation position of the mutant corresponds to the sequence shown in SEQ ID NO:1, and a functional site of the sequence shown in 1; the mutant is substituted, deleted, inserted or modified with at least one of the amino acids at the mutation position.
5. The DNA-assembling molecular machine of claim 4, wherein the (a): the amino acid substituted or modified in the amino acid sequence shown in SEQ ID NO.1 is at least one of D12, E14, T15, Y60, H62, N63, F66, D67, F70, V94, S123, L124, D146, Y149, D170, D257, V258, S260, Y262, T365, I372, W374, K378, K379, R338, K392, L393, N396, S397, Y399, G400, F402, K428, E429, T443, D465, D467, K507, Y509 and K547;
the (b): the functional site is SEQ ID NO:1, D12, E14, T15, Y60, H62, N63, F66, D67, F70, V94, S123, L124, D146, Y149, D170, D257, V258, S260, Y262, T365, I372, W374, K378, K379, R338, K392, L393, N396, S397, Y399, G400, F402, K428, E429, T443, D465, D467, K507, Y509, and K547;
preferably, at least one of the following positions shown in SEQ ID NO.2 is mutated: d27 E29, T30, Y77, H79, F83, D84, F87, L113, S138, L139, D161, Y164, D185, D265, V266, S268, Y270, S372, I379, K381, V385, K386, R394, K398, L399, N402, A403, Y405, G406, F408, E435, E436, T450, R455, D472, D474, K512, Y514, and K550;
preferably, at least one of the following positions shown in SEQ ID NO.3 is mutated: d9 E11, T12, Y56, H58, F62, D63, F66, T93, S122, L123, E145, Y148, D169, D249, V250, S252, Y254, V357, I364, R366, I370, K371, V381, K385, L386, N389, S390, Y392, G393, F395, K421, E423, T436, R440, D458, D460, K500, Y502 and K532;
preferably, at least one of the following positions shown in SEQ ID NO.5 is mutated: d272 E274, T275, Y322, H324, F328, D329, F332, S371, S395, L396, D418, Y421, D442, D532, V533, S535, Y537, V667, K676, I680, K681, R689, K693, L694, N697, N698, Y700, G701, M703, R728, A729, T744, R748, D771, D773, K812, Y814 and K834.
6. The DNA assembly molecular machine of claim 1 or 2, wherein the DNA assembly molecular machine comprises:
and SEQ ID NO:1-9, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or at least 99.9% homologous.
7. A nucleic acid molecule encoding the DNA assembly molecule machine of any one of claims 1-6.
8. A vector or recombinant cell comprising the nucleic acid molecule of claim 7.
A method of dna replication or amplification, characterized by: comprising replicating or amplifying DNA using the DNA assembly molecule machine of any one of claims 1-6 as a DNA polymerase.
10. Use of the DNA-assembling molecule machine of any one of claims 1-6 in any one of the following applications:
1) As a DNA polymerase;
2) As RNA polymerase;
3) Controlling the movement of the DNA or RNA;
4) Preparing a single molecule sequencing reagent or a kit;
5) Preparing a nanopore sequencing reagent or a kit;
6) Assembling a molecular machine chain reaction;
7) Catalyzing DNA replication and/or catalyzing DNA amplification;
8) Catalytic rolling circle amplification and/or catalytic multiplex strand displacement amplification;
9) Performing DNA sequencing or RNA sequencing or whole genome sequencing;
10 RCA library construction;
11 Genome amplification coverage detection;
12 Preparing a kit product for catalyzing DNA replication and/or catalyzing DNA amplification;
13 Preparing a product for catalytic rolling circle amplification and/or catalytic multiplex strand displacement amplification;
14 Preparing a product for DNA sequencing or RNA sequencing or whole genome sequencing;
15 Preparing a product for RCA banking;
16 Preparation of a product for genome amplification coverage detection.
11. A method for assembling a DNA molecule using the DNA assembling molecular machine according to any one of claims 1 to 6, characterized in that it comprises performing a reaction in a reaction system; the reaction system comprises 0.3M-1.5M salt ions;
preferably, the salt ion is sodium, potassium, calcium, magnesium, manganese, nickel, cobalt, copper or zinc;
preferably, the reaction system further comprises calcium ions.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310140791.7A CN116083393A (en) | 2023-02-14 | 2023-02-14 | DNA assembly molecular machine and application thereof |
PCT/CN2023/080346 WO2024168961A1 (en) | 2023-02-14 | 2023-03-08 | Dna molecular assembler and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310140791.7A CN116083393A (en) | 2023-02-14 | 2023-02-14 | DNA assembly molecular machine and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116083393A true CN116083393A (en) | 2023-05-09 |
Family
ID=86211955
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310140791.7A Pending CN116083393A (en) | 2023-02-14 | 2023-02-14 | DNA assembly molecular machine and application thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN116083393A (en) |
WO (1) | WO2024168961A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024168961A1 (en) * | 2023-02-14 | 2024-08-22 | 南方科技大学 | Dna molecular assembler and application thereof |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11312944B2 (en) * | 2016-12-19 | 2022-04-26 | Quantum-Si Incorporated | Polymerizing enzymes for sequencing reactions |
CN116083393A (en) * | 2023-02-14 | 2023-05-09 | 南方科技大学 | DNA assembly molecular machine and application thereof |
-
2023
- 2023-02-14 CN CN202310140791.7A patent/CN116083393A/en active Pending
- 2023-03-08 WO PCT/CN2023/080346 patent/WO2024168961A1/en unknown
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024168961A1 (en) * | 2023-02-14 | 2024-08-22 | 南方科技大学 | Dna molecular assembler and application thereof |
Also Published As
Publication number | Publication date |
---|---|
WO2024168961A1 (en) | 2024-08-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP3026554B2 (en) | Thermostable enzyme | |
EP1224295B1 (en) | Method and compositions for improved polynucleotide synthesis | |
Sinha et al. | AdnAB: a new DSB-resecting motor–nuclease from mycobacteria | |
JP2017503521A (en) | Novel reverse transcriptase for use in high temperature nucleic acid synthesis | |
CN113005108A (en) | Modified transposases for improved insert sequence bias and increased DNA import tolerance | |
JP7256280B2 (en) | Phi29 DNA polymerase mutants with enhanced thermostability and their application in sequencing | |
JP6963238B2 (en) | DNA polymerase mutant | |
JP2018046876A (en) | Modified heat-resistant dna polymerase | |
EP3617308A1 (en) | Phi29 dna polymerase mutant with increased thermostability | |
CN116083393A (en) | DNA assembly molecular machine and application thereof | |
EP3959312A1 (en) | Isolated nucleic acid binding domains | |
JP7363063B2 (en) | Mutant DNA polymerase | |
JP4355830B2 (en) | Novel DNA replication factor | |
JP3891330B2 (en) | Modified thermostable DNA polymerase | |
JP2020182463A (en) | Nucleic acid amplification reagent | |
EP2687603A1 (en) | T7 promoter variants and methods of using the same | |
JP2015528281A (en) | Novel DNA polymerase with expanded substrate range | |
JP2021507694A (en) | DNA polymerase | |
JP4598275B2 (en) | High fidelity thermostable ligase and use thereof | |
JP2012143226A (en) | Cloning vector | |
WO2023143123A1 (en) | Terminal transferase variant for controllable synthesis of single-stranded dna and use thereof | |
US20230332118A1 (en) | Dna polymerase and dna polymerase derived 3'-5'exonuclease | |
WO2023134584A1 (en) | Udgx-based technology for detecting dna deoxyuracil on single-base resolution level | |
CN114561374B (en) | Thermophilic endonuclease mutant and preparation method and application thereof | |
JP2022550810A (en) | marine DNA polymerase I |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |