WO1998049274A1 - Thermostable dna polymerase and inteins of the thermococcus fumicolans species - Google Patents
Thermostable dna polymerase and inteins of the thermococcus fumicolans species Download PDFInfo
- Publication number
- WO1998049274A1 WO1998049274A1 PCT/FR1997/000761 FR9700761W WO9849274A1 WO 1998049274 A1 WO1998049274 A1 WO 1998049274A1 FR 9700761 W FR9700761 W FR 9700761W WO 9849274 A1 WO9849274 A1 WO 9849274A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- glu
- lys
- leu
- val
- arg
- Prior art date
Links
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 title claims abstract description 88
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 title claims abstract description 82
- 230000017730 intein-mediated protein splicing Effects 0.000 title claims abstract description 70
- 241000204103 Thermococcus fumicolans Species 0.000 title claims abstract description 45
- 241000203069 Archaea Species 0.000 claims abstract description 12
- 239000012634 fragment Substances 0.000 claims description 43
- 238000000034 method Methods 0.000 claims description 43
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 23
- 239000002773 nucleotide Substances 0.000 claims description 16
- 239000013598 vector Substances 0.000 claims description 15
- 125000003729 nucleotide group Chemical group 0.000 claims description 14
- 230000003321 amplification Effects 0.000 claims description 8
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 8
- 150000007523 nucleic acids Chemical group 0.000 claims description 8
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 6
- 230000008569 process Effects 0.000 claims description 6
- 230000002255 enzymatic effect Effects 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- 239000000284 extract Substances 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 108020004414 DNA Proteins 0.000 description 53
- 230000000694 effects Effects 0.000 description 42
- 108090000623 proteins and genes Proteins 0.000 description 42
- 239000000872 buffer Substances 0.000 description 26
- 102000004169 proteins and genes Human genes 0.000 description 20
- 102000004190 Enzymes Human genes 0.000 description 19
- 108090000790 Enzymes Proteins 0.000 description 19
- 229940088598 enzyme Drugs 0.000 description 19
- 238000003752 polymerase chain reaction Methods 0.000 description 19
- 235000018102 proteins Nutrition 0.000 description 19
- 150000001413 amino acids Chemical class 0.000 description 18
- 108060002716 Exonuclease Proteins 0.000 description 17
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 17
- 102000013165 exonuclease Human genes 0.000 description 17
- 238000012360 testing method Methods 0.000 description 16
- 241000588724 Escherichia coli Species 0.000 description 15
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 15
- 238000000746 purification Methods 0.000 description 15
- 241001467519 Pyrococcus sp. Species 0.000 description 14
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 12
- 238000010367 cloning Methods 0.000 description 12
- 239000000523 sample Substances 0.000 description 12
- 108010013835 arginine glutamate Proteins 0.000 description 11
- 210000004027 cell Anatomy 0.000 description 11
- 238000009396 hybridization Methods 0.000 description 11
- 230000006698 induction Effects 0.000 description 11
- 239000002609 medium Substances 0.000 description 11
- 108010050848 glycylleucine Proteins 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 239000000463 material Substances 0.000 description 10
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 108090000765 processed proteins & peptides Proteins 0.000 description 9
- 102000004196 processed proteins & peptides Human genes 0.000 description 9
- 235000002639 sodium chloride Nutrition 0.000 description 9
- 239000011780 sodium chloride Substances 0.000 description 9
- 238000000502 dialysis Methods 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 238000010348 incorporation Methods 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 108010073969 valyllysine Proteins 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- 241000205156 Pyrococcus furiosus Species 0.000 description 7
- 229940024606 amino acid Drugs 0.000 description 7
- 235000001014 amino acid Nutrition 0.000 description 7
- 108010054155 lysyllysine Proteins 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 108091008146 restriction endonucleases Proteins 0.000 description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 241000205160 Pyrococcus Species 0.000 description 6
- 230000029087 digestion Effects 0.000 description 6
- 238000000605 extraction Methods 0.000 description 6
- 238000010353 genetic engineering Methods 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 229910001629 magnesium chloride Inorganic materials 0.000 description 6
- 241000205188 Thermococcus Species 0.000 description 5
- 239000013504 Triton X-100 Substances 0.000 description 5
- 229920004890 Triton X-100 Polymers 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 230000007717 exclusion Effects 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 229940080469 phosphocellulose Drugs 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- 229920002101 Chitin Polymers 0.000 description 4
- 108010042407 Endonucleases Proteins 0.000 description 4
- 102000004533 Endonucleases Human genes 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 4
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 4
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 4
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 4
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 4
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 4
- 239000007983 Tris buffer Substances 0.000 description 4
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 4
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 230000009089 cytolysis Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 229960002897 heparin Drugs 0.000 description 4
- 229920000669 heparin Polymers 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 239000012071 phase Substances 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 4
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 3
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 3
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 3
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 3
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 3
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 3
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 3
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 3
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 3
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 3
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 3
- 108091092195 Intron Proteins 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 3
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 3
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 229920001213 Polysorbate 20 Polymers 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 229920002684 Sepharose Polymers 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 3
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 3
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 3
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 3
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 3
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 3
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 3
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 3
- 239000011543 agarose gel Substances 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 229960005091 chloramphenicol Drugs 0.000 description 3
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 244000005700 microbiome Species 0.000 description 3
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 3
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 3
- 230000016434 protein splicing Effects 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- IQFYYKKMVGJFEH-OFKYTIFKSA-N 1-[(2r,4s,5r)-4-hydroxy-5-(tritiooxymethyl)oxolan-2-yl]-5-methylpyrimidine-2,4-dione Chemical compound C1[C@H](O)[C@@H](CO[3H])O[C@H]1N1C(=O)NC(=O)C(C)=C1 IQFYYKKMVGJFEH-OFKYTIFKSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 2
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 2
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 2
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 2
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 108010017826 DNA Polymerase I Proteins 0.000 description 2
- 102000004594 DNA Polymerase I Human genes 0.000 description 2
- 238000007900 DNA-DNA hybridization Methods 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241001200922 Gagata Species 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 2
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 2
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 2
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 2
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 2
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 2
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 2
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 2
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 2
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 2
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 2
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 2
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 2
- 101000962056 Mytilus edulis Major extrapallial fluid protein Proteins 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- FKFCKDROTNIVSO-JYJNAYRXSA-N Phe-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O FKFCKDROTNIVSO-JYJNAYRXSA-N 0.000 description 2
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 2
- SBYVDRLQAGENMY-DCAQKATOSA-N Pro-Asn-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O SBYVDRLQAGENMY-DCAQKATOSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 2
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- 241000204969 Thermococcales Species 0.000 description 2
- 241001495444 Thermococcus sp. Species 0.000 description 2
- 241000589500 Thermus aquaticus Species 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 2
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 2
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 2
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 2
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 2
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 2
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 2
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 2
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 2
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 2
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 2
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 241000617156 archaeon Species 0.000 description 2
- 108010024668 arginyl-glutamyl-aspartyl-valine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 244000309466 calf Species 0.000 description 2
- 230000006037 cell lysis Effects 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 2
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 2
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 2
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- -1 nucleotide triphosphates Chemical class 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000005892 protein maturation Effects 0.000 description 2
- 239000000700 radioactive tracer Substances 0.000 description 2
- 239000012429 reaction media Substances 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 108010001055 thymocartin Proteins 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 239000001226 triphosphate Substances 0.000 description 2
- 235000011178 triphosphate Nutrition 0.000 description 2
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 1
- 102000000662 3'-5' exonuclease domains Human genes 0.000 description 1
- 108050008023 3'-5' exonuclease domains Proteins 0.000 description 1
- DQDKOANVADPVQR-UHFFFAOYSA-N 5-imino-3,4-dihydropyrrol-2-amine Chemical compound NC1=NC(=N)CC1 DQDKOANVADPVQR-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- XHTUGJCAEYOZOR-UBHSHLNASA-N Asn-Ser-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XHTUGJCAEYOZOR-UBHSHLNASA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- SDDAYZYYUJILPB-UHFFFAOYSA-N Asp-Leu-Val-Tyr Chemical compound OC(=O)CC(N)C(=O)NC(CC(C)C)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SDDAYZYYUJILPB-UHFFFAOYSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 241001301450 Crocidium multicaule Species 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- 108010054814 DNA Gyrase Proteins 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- 241000701533 Escherichia virus T4 Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 102100037091 Exonuclease V Human genes 0.000 description 1
- 239000004606 Fillers/Extenders Substances 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 1
- YYOCMTFVGKDNQP-IHRRRGAJSA-N His-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YYOCMTFVGKDNQP-IHRRRGAJSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- 101000881977 Homo sapiens Exonuclease V Proteins 0.000 description 1
- RCFDOSNHHZGBOY-ACZMJKKPSA-N Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(O)=O RCFDOSNHHZGBOY-ACZMJKKPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- YODLGZSPTHGVQX-VJANTYMQSA-N Leu-Asp-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N YODLGZSPTHGVQX-VJANTYMQSA-N 0.000 description 1
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- VIZLHGTVGKBBKO-AVGNSLFASA-N Met-Arg-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VIZLHGTVGKBBKO-AVGNSLFASA-N 0.000 description 1
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 241000186362 Mycobacterium leprae Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- BACYUWVYYTXETD-UHFFFAOYSA-N N-Lauroylsarcosine Chemical compound CCCCCCCCCCCC(=O)N(C)CC(O)=O BACYUWVYYTXETD-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 239000007990 PIPES buffer Substances 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- 241000432808 Pyrococcus abyssi GE5 Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 1
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000205101 Sulfolobus Species 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 101000865057 Thermococcus litoralis DNA polymerase Proteins 0.000 description 1
- 241000589596 Thermus Species 0.000 description 1
- 241000589499 Thermus thermophilus Species 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 1
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 1
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- STJXERBCEWQLKS-IHPCNDPISA-N Trp-Tyr-Cys Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 STJXERBCEWQLKS-IHPCNDPISA-N 0.000 description 1
- DVLHKUWLNKDINO-PMVMPFDFSA-N Trp-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DVLHKUWLNKDINO-PMVMPFDFSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- JXNRXNCCROJZFB-RYUDHWBXSA-N Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JXNRXNCCROJZFB-RYUDHWBXSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- 102000011731 Vacuolar Proton-Translocating ATPases Human genes 0.000 description 1
- 108010037026 Vacuolar Proton-Translocating ATPases Proteins 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 238000012197 amplification kit Methods 0.000 description 1
- 230000003698 anagen phase Effects 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- ZYWFEOZQIUMEGL-UHFFFAOYSA-N chloroform;3-methylbutan-1-ol;phenol Chemical compound ClC(Cl)Cl.CC(C)CCO.OC1=CC=CC=C1 ZYWFEOZQIUMEGL-UHFFFAOYSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 150000002148 esters Chemical group 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 238000005189 flocculation Methods 0.000 description 1
- 230000016615 flocculation Effects 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 239000006193 liquid solution Substances 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 229910000357 manganese(II) sulfate Inorganic materials 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 231100000956 nontoxicity Toxicity 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000020477 pH reduction Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010065320 prolyl-lysyl-glutamyl-lysine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 230000001915 proofreading effect Effects 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000002798 spectrophotometry method Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000005809 transesterification reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1252—DNA-directed DNA polymerase (2.7.7.7), i.e. DNA replicase
Definitions
- the present invention relates to a new thermostable DNA polymerase and its two inteins, originating from an archaebacterium of the species Thermococcus fumicolans.
- DNA polymerases are enzymes involved in the replication and repair of DNA in any living cell. Many DNA polymerases isolated from microorganisms such as E. coli are known today.
- DNA polymerase I DNA polymerase I
- phage T4 DNA polymerases have also been identified and purified and from thermophilic microorganisms such as Thermus aqua ticus (Taq polymerase, Chien, A. et a. J. Bact. 1976,
- Thermus thermophilus or species of the genus Bacillus (European patent application published under No. 699 760), Thermococcus (European patent application No. 455 430), Sulfolobus and Pyrococcus (patent application
- DNA polymerases The mechanism of action of DNA polymerases is relatively well known today and consists of replication of DNA identically according to a semi-conservative mode. The copied strand serves as a matrix and the four nucleotide triphosphates are the substrate for this polymerization. Enzymes with DNA polymerase activity are increasingly used today in vitro to work in molecular biology for various purposes such as cloning, error detection,
- This amplification, in vi tro, of deoxyribonucleic acid sequences calls upon the technique of the polymerase chain reaction (PCR) described in European patents Nos. 200 362 and 201 184.
- the principle of this technique is based on the carrying out successive cycles of extension of primers using the four nucleotide triphosphates as well as a DNA polymerase and a template DNA to be copied.
- the enzyme doubles the number of DNA strands available and between each cycle thermodenaturation is necessary in order to open the DNA double helix for the next cycle.
- the temperatures used for this thermodenaturation step are not compatible with the conservation of the activity of most of the known DNA polymerases, such as Klenow.
- thermophilic microorganisms are known today, it still remains difficult to obtain these thermostable enzymes with sufficient production yields.
- Molecular biology and genetic engineering overcome this drawback.
- the gene coding for DNA polymerase is cloned, sequenced and then recloned in an expression vector in order to produce the so-called recombinant protein, in a mesophilic host which is easier to cultivate such as E. coli or S. cerevisiae.
- This expression method in E. coli has in particular been described in the international patent application PCT published under No. WO 89/06691 for producing DNA polymerase from Thermus aquaticus.
- the DNA polymerase of the invention comes from an archaebacterium of the species Thermococcus fumicolans. In addition to its thermostable properties making it particularly effective in particular in a PCR process, this DNA
- ISAEP polymerase is remarkable in that it has two "protein introns”, also called “inteins”, at the level of its precursor polypeptide.
- the nucleotide sequence of its inteins is inserted into that of DNA polymerase, generally at the level of conserved sites involved, after translation, in catalytic reactions. These sequences are transcribed and translated at the same time as that of DNA polymerase and the autocatalytic splicing of the inteins then produces three enzymes: two inteins and one DNA polymerase.
- inteins are also found in other molecules such as vacuolar ATPase in S. cerevisiae (4), GyrA in Mycobacterium leprae (7), Rec A in Mycobacterium tuberculosis (5, 6).
- the inteins belong for the most part to the family of endonucleases of the "homing endonucleases” type since they cut DNA at a recognized site, at the very place where their nucleotide sequence is inserted.
- the development of biotechnologies both in research and in the fields of medicine or agri-food requires having various types of DNA polymerases capable of improving quantitatively and qualitatively techniques as diverse as cloning, detection , amplification of DNA sequences.
- the present invention aims precisely to offer a new thermostable DNA polymerase which is derived from a recently described species: Thermococcus fumi colans (8).
- This isolate was isolated from fragments of chimneys taken in the North Fidgian basin during the Franco-Japanese STARMER campaign in 1989.
- This species strict anaerobic, has an optimal growth temperature of 90 ° C, which is relatively high for a Thermococcus. Its optimum pH is 8.8, and its salinity level from 20 g / 1 to 40 g / 1.
- the subject of the invention is therefore a purified thermostable DNA polymerase of archaebacteria of
- the invention therefore relates to purified thermostable DNA polymerase of archaebacteria of the species Thermococcus fumicolans having a molecular weight of the order of 89,000 daltons as well as its enzymatically equivalent derivatives.
- the term “enzymatically equivalent derivatives” is understood to mean the polypeptides and proteins constituted by or comprising the amino acid sequence represented in the sequence list in the appendix under the number SEQ ID NO: 2 as soon as they exhibit the properties of the DNA polymerase of Thermococcus fumi col ans.
- the invention more particularly contemplates a DNA polymerase, the amino acid sequence of which is represented in the sequence list in the appendix under the number SEQ ID NO: 1 or a fragment thereof or an assembly of such fragments, such as the 774 amino acid sequence represented in the sequence list in the appendix under the number SEQ ID NO: 2.
- derivatives is also understood to mean the amino acid sequences modified above by insertion and / or deletion and / or substitution of one or more amino acids, provided that the properties of the DNA polymerase of T. fumicolans which result therefrom are not significantly changed.
- the invention also relates to a DNA sequence consisting of or comprising the sequence coding for a DNA polymerase of the invention.
- the DNA sequence represented in the annexed sequence list under the number SED ID NO: 1 represents such a sequence.
- the DNA coding for the DNA polymerase of T. fumicolans and its two inteins consists of nucleotides 357 to 5028. Nucleotides 1 to 356 correspond to the promoter region of this gene. Consequently, the subject of the invention is a DNA sequence constituted by or comprising the sequence between nucleotides 357 to 5028 of SED ID NO: 1, or a fragment thereof, or an assembly of such fragmen s .
- the invention relates more particularly to a DNA sequence constituted by or comprising nucleotides 357 to 1674 and 2755 to 3156 and 4324 to 5028 of the DNA sequence represented in the sequence list in the appendix under the number SED ID NO : 1.
- This sequence codes for the DNA polymerase of T. fumicolans whose sequence of 774 amino acids is represented in the sequence list in the appendix under the number SED ID NO: 2.
- the invention relates as much to DNA polymerase isolated and purified from the Thermococcus fumicolans strain as to DNA polymerase prepared by chemical synthesis, for example by ligation of polypeptide fragments, or also by genetic engineering methods. Within the framework of these genetic engineering methods, the invention also relates to a vector comprising a DNA sequence defined above, as well as a process for the production or expression in a cellular host of the thermostable DNAs of the invention. .
- a process for the production of a DNA polymerase according to the invention consists in:
- the cell host used in the above methods can be chosen from prokaryotes or eukaryotes and in particular from bacteria, yeasts, mammalian, plant or insect cells.
- the vector used is chosen according to the host to which it will be transferred.
- thermostable DNA polymerase of the invention is useful in particular in the methods of enzymatic amplification of nucleic acid sequences. Consequently, the subject of the invention is such methods using the thermostable DNA polymerase described above, as well as the amplification kits comprising, in addition to the reagents generally used, an adequate quantity of this DNA polymerase.
- the invention also relates to a purified thermostable intein of archaebacteria of the species Thermococcus fumicolans.
- the inteins are also defined as protein introns which are not spliced at the level of the messenger RNA but at the level
- RECTIFIED SHEET (RULE 91) ISA / EP protein maturation. They therefore relate to a single gene translated and transcribed in a single step, and constitute by-products of the maturation of the protein encoded by this gene (Xu, MG, Comb, DG., Paulus H., Noren CJ, Shao Y ., Perler, F., 1994, Protein splicing: an analysis of the branched intermediate and its resolution by succinimidine formation. EMBO J. 13, 5517-5522.)
- the inteins are restriction endonucleases which have the property of cutting DNA at the very place where their gene is inserted, and therefore they can be considered as selfish sequences.
- the inteins have in their sequence all the information necessary for their own splicing since they splice in E. coli.
- a first step in the formation of a linear intermediary which has an ester function is a first step in the formation of a linear intermediary which has an ester function.
- This reaction is dependent on the pH and the local environment of this bond (nature of the amino acids).
- This principle is used in cloning, expression or purification kits using inteins, because a change in the environment causes splicing or not. Indeed, it would be inhibited at pH 11 and activated at pH 7.5.
- the third step consists in a cyclization of 1 asparagine releasing the intein.
- the fourth step is the stabilization of the mature protein and the formation of a real peptide bond.
- thermosensitive mutants making it possible to block splicing.
- This possibility of controlling protein splicing by temperature can be used in cloning vectors with a sequence coding for the intein and around cloning sites. If the protein to be cloned and expressed is toxic to the host, it can be cloned into two fragments around the intein sequence. Thus, overall, the gene to be cloned is complete but it is interrupted by the sequence of the intein. During expression, the intein is found in the expressed protein, thus making it inactive. It is then possible by heating, at the end of the induction, to release the intein by autocatalytic splicing and thus find the cloned active protein.
- the inteins thus make it possible to carry out purifications and are used in kits according to the principle described below. Certain residues around the intein splicing site are modified.
- the expression of the recombinant protein is carried out at low temperature to block possible splicing too early.
- At the C-terminal of the intein is fixed a site having a strong affinity for chitin.
- the cloned protein is expressed as well as the intein and the chitin binding site.
- the purification is then carried out with the chitin, on affinity columns, which retain the chitin and also the intein and the cloned protein, the whole being part of the same pre-protein.
- the N-terminal end of the intein is then hydrolyzed with DTT or ⁇ -mercaptoethanol to release the cloned protein.
- thermostable restriction endonucleases which have as a recognition site the very place where their gene is inserted in the "host” sequence. They have a twice repeated nucleic sequence (LAGLIDADG) in the protein,
- RECTIFIED SHEET (RULE 91) ISA / EP more or less conserved sequence which corresponds to the active DNA recognition and cleavage site. These enzymes also seem to need g ++ for their activity. It should be noted that the two inteins of the invention are co-expressed in E. coli and are self-splicing. This means that they have no toxicity for the host, unlike one of the inteins of Thermococcus li toralis (9), and therefore their use in expression or purification kits is easy. .
- a first intein sequence according to the invention is represented in the annexed sequence list under the number SEQ ID NO: 3.
- This intein, called I-Tfu-1 has a molecular weight of 41,409 Da and a pi of 9.13, deduced from the 360 amino acid sequence of the sequence SEQ ID NO: 3.
- a second intein sequence according to the invention is represented in the annexed sequence list under the number SEQ ID NO: 4.
- This intein called I-Tfu-2, has a molecular weight of 44,765 Da and pi of 9 , 6, deduced from the 389 amino acid sequence of the sequence SEQ ID NO: 4.
- thermostable inteins of the invention are useful in particular in methods of restriction of nucleic acids and in the development of expression vectors making it possible to reduce the toxicity of the protein to be expressed by inserting one of the two sequences of inteins in the sequence of the protein to be expressed. This can be done without manipulation of the sequence of inteins if the cloning is carried out in E. coli, the expression techniques used having demonstrated their harmlessness for this host organism. Consequently, the subject of the invention is such processes using one or both of the two thermostable inteins described above, as well as the expression or purification kits containing one or the two sequences coding for said thermostable inteins.
- the invention also relates to a DNA sequence consisting of or comprising the sequence coding for an intein of the invention.
- nucleotides 1675 and 2754 in the sequence SED ID NO: 1 in the appendix This DNA sequence codes for the intein, the amino acid sequence of which is represented in the annexed sequence list under the number SED ID NO: 3.
- the invention relates both to these thermostable inteins isolated and purified from the Thermococcus fumicolans strain as to inteins prepared by chemical synthesis, for example by ligation of polypeptide fragments, or also by genetic engineering methods.
- the invention also relates to a vector comprising a DNA sequence defined above, as well as a process for the production or expression in a cellular host of DNA coding for the inteins of the invention. Such methods are identical to those reported previously for T DNA polymerase. fumicolans.
- thermostable DNA of the invention RECTIFIED SHEET (RULE 91) ISA / EP of the thermostable DNA of the invention, and referring to the accompanying drawings in which:
- FIG. 1 shows the DNA-DNA hybridization of the genomic DNA of T. fumicolans digested with various restriction enzymes and hybridized with the probes GE23ClaI-HindIII and GE23XhoI-SalI.
- FIG. 2 shows the cloning strategy, the gene structure of the DNA polymerase of T. fumiculans and the gene products.
- FIG. 3 shows the results of purification of the recombinant polymerase of T. fumi colans after a heparin sepharose column, visualized by SDS-PAGE.
- FIG. 4 represents the results of purification of the recombinant polymerase of T. fumiculans after a Blue column HTrap n ° 2, visualized by SDS-PAGE.
- FIG. 5 shows the purification results of the recombinant polymerase of T. fumi col years after a phosphocellulose column, visualized by SDS-PAGE.
- FIG. 6 represents the results of purification of the recombinant polymerase of T. fumicolans after a MonoQ column, visualized by SDS-PAGE.
- FIG. 7 shows the PCR results with DNA polymerase from T. fumicolans with the exclusion fractions from the MonoQ column.
- the strain Pyrococcus sp. GE 23 was isolated from chimneys of deep hydrothermal vents and was supplied by G Erauso (CNRS, Station Bitician de Roscoff, France).
- the Thermococcus fumicolans strain was obtained from the Marine Microbiology laboratory of G. Barbier (IFREMER-DRV-VP-CMM) in Brest, France.
- This strain, Thermococcus fumicolans was obtained by purification from fragments of hydrothermal vents collected in the North Fidgian basin during the Franco-Aponais campaign STARMER carried out in 1989 at 2000 meters depth.
- Pyrococcus sp.GE23 was cultured at 85 ° C in 2216S medium (DIFCO) at a pH of 6.5.
- Thermococcus fumi col ans described in 1996 (Godfroy et al.) Is cultivated under anaerobic conditions in a medium containing the following elements: peptone 2g / 1; yeast extracts 0.5 g / l; sea salt (Sigma) 30g / l; PIPES buffer 6.05g / l, elemental sulfur 10g / 1, rezasurin lmg / 1. The pH is adjusted to 8.5 with 5N sodium hydroxide at 20 ° C.
- the Escherichia coli SURE strain (Stratagene, La Jolla, Calif.) was cultured in LB medium with the appropriate antibiotic (s), at 37 ° C. with stirring. This strain was used as a host to receive the primary constructs from the pUC 18 or pBluescript vectors.
- the NovaBlue, BL21 (DE3) and BL21 (DE3) pLysS strains (Novagen, Madison, Wi.) Were cultured in 2xYT medium with the appropriate antibiotics at 37 ° C or 30 ° C. These strains were used as hosts for the expression plasmids.
- Thermococcus fumicolans was isolated by the modified Charbonnier method (3).
- the cells were resuspended in TE-Na IX buffer, then lysed at 40 ° C for three hours
- RECTIFIED SHEET (RULE 91) ISA / EP with a mixture of 1% N-Lauryl-sarcosine, 1% sodium dodecyl sulfate and 0.4 mg / ml of proteinase K. After lysis, centrifugation at 5000 g for 10 minutes makes it possible to remove cellular debris.
- the DNA is extracted by a treatment with Phenol-Chloroform-Isoamyl alcohol or PCI (25-24-1), then treated with RNAse at 5 ⁇ g / ml at 60 ° C for one hour. These steps are followed by two additional PCI extractions and a chlorophoric extraction.
- the DNA is then precipitated with absolute ethanol at -20 ° C, then centrifuged, air dried and taken up in TE-1x buffer. The concentration and purity of this DNA are measured by spectrophotometry at 230, 260 and 280nm with a GeneQuantlI device (Pharmacia, Upsalla, Sweden).
- the DNA was completely digested overnight at 37 ° C. by a series of restriction enzymes (BamHI, HindIII, EcoRI, EcoRV, PvuII, Sali , Xbal and Xhol) by simple and double digestions.
- DNA fragments are fractionated on 0.8% agarose gel in TBE-1X and transferred in vacuo to a Hybond-N + nylon membrane (Amersham, UK). These membranes were hybridized with DNA probes prepared by PCR with specific primers selected from the DNA polymerase genes of P. furi osus, T. li toral is and Pyrococcus sp. GE23. These probes are previously marked with 32P by "random primmg" in accordance with the manufacturer's recommendations (Megaprime, Amersham, UK).
- Tli I and Tli T covering respectively the regions delimited by base pairs 297 to 1768 and 4631 to 5378 respectively, as defined by Hodges et al. (9).
- Pyrococcus sp GE23 probes were used, one containing the 5 'part of the gene (Clal-HindIII fragment
- RECTIFIED SHEET (RULE 91) ISA / EP corresponding to sites pb 8 and pb 1353 of the coding section) and the other containing the terminal part of this same gene obtained by PCR (primers known as expression Ndel and Sali corresponding to sites pb -1 and pb 2318, then digestion by Xhol and Sali including the bases of Nos. 1879 to 2318). Positive fragments were identified by DNA-DNA hybridization (14). Only hybridizations with Pyrococcus sp. GE 23 provided positive signals at 55 ° C in less than 24 h of exposure. The probes from T. li toralis and P. furiosus gave no results, even at 50 ° C in a standard buffer without formamide.
- HindIII fragments of 1.9 kb were selected, then prepared by appropriate digestion of 100 ⁇ g of genomic DNA, purified in dialysis bags from agarose gels, and precipitated with absolute ethanol after PCI extraction. The fragments were ligated into a dephosphorylated pUC 18 / HindIII.
- the transformations of the host strains were carried out by electroporation (Gene Inc., Biorad).
- the screening of the recombinant clones was carried out by selection with ampicillin, alpha-complementation on X-Gal-IPTG substrate then hybridization of colonies according to standard techniques (12). The temperature of the colony hybridizations was 55 ° C.
- the plasmid DNA was isolated according to the method described by Birnboim and Doly (1), then purified by anion exchange chromatography in solid phase (Quiagen, Chatsworth, Calif.). The restriction fragments of the plasmids were purified on agarose gel by the GeneClean method (Bio 101, La Jolla, Calif.) For later cloning.
- Thermococcus fumicolans 16S and 23S rRNA was amplified by PCR using the following primers:
- the PCR product was cloned into the vector pUC18 for subsequent sequencing.
- the DNA sequences were obtained by the chain termination method (13) using an Applied Biosyste s automatic DNA analysis system.
- the two strands of the genes coding for DNA polymerase and the two inteins were sequenced using universal primers localized on vectors as well as internal primers.
- the 16S rDNA sequence was carried out on both strands, after cloning (SureClone, Pharmacia, Uppsala, Sweeden), using the Hot-Tub kit (Amersham, UK.) In order to remove the compressions.
- PCR was used to prepare the complete fragment of DNA polymerase and the two inteins in using primers containing the NdeI and BamHI restriction sites:
- the reaction mixture contained GOLDSTAR DNA polymerase (Eurogentec, B), the Taq Extender enzyme (containing Pfag from Stratagene), the extension buffer with the four dNTPs (each at 0.2mM) and the primers Tfu Dir and Tfu Rev at 50 pmol in a volume of 50 ⁇ l final.
- the amplification was carried out over 20 cycles: 1 min 94 ° C, 1 min at 54 ° C and 6 min at 72 ° C using a Stratagene 96-gradient thermocycler.
- the PCR fragments were then digested with the enzymes NdeI and BamHI and then ligated to the same sites of the vector, thus restoring the initiation codon.
- the construction thus obtained was named PARHS2TFU1.
- This construction was sequenced at the junction sites to verify its integrity with respect to the genomic DNA sequence.
- the expression tests were carried out according to the following protocol: selection of the recombinant clones in the E. coli Novablue strain, expression with the BL21 (DE3) pLysS strain in a 2xYT medium and induction for four hours at ImM of IPTG.
- the first induction tests were carried out on 5 ml cultures, whether induced or not. Night precultures are carried out without an inductor and restarted in a fresh medium in the morning (to the tenth), until the optical density (measured at 600nm) is 0.6, then either induced for 4 hours, or not induced and stopped after 2, 4 or 14 hours. Four ml of cultures are then centrifuged at 4 ° C, 5000 rpm for 10 minutes. The pellet is then taken up in a lysis buffer (10 mM Tris-HCl pH 7.5; 10 mM NaCl, 2 mM MgCl2). The cells thus taken up are then lysed, either with triton X-100 1% v / v, or lysozyme at 1 mg / ml of
- RECTIFIED SHEET (RULE 91) ISA / EP lysis and left on ice about 5 to 10 min.
- the cells are then thermodenatured by an exposure of 20 min at 72 ° C. This largely destroys the cells of the mesophilic host without destroying the recombinant proteins.
- the lysis product is then centrifuged for 20 min at 10 000 rpm at 4 ° C. The supernatant is recovered in order to test it in incorporation, in PCR or on gel.
- the incorporations are carried out according to two techniques: - With tritiated thymidine as tracer.
- the incorporation is carried out on activated calf thymus (SIGMA Aldrich, F) in the following reaction medium: Tris-HCl 50mM pH 8.8; DTT ImM; MgCl2 10 mM; KCl 10MM;
- the active fractions thus recovered are dialyzed for 5 h against a buffer C: 10 mM Tris-Ci pH 7.5; 0.5 mM EGTA; 5 mM MgCl2; 10 mM -mercaptoethanol; 0.2% Triton x100; 10% glycerol; 50 mM NaCl.
- the products resulting from dialysis are successively loaded onto an affinity column for the proteins binding to DNA (Pharmacia, Blue HiTrap) and eluted in NaCl gradient with the same buffers as above.
- the column is adjusted to 0.2 ml / min, the loading buffer A is composed of KP04 pH7 20mM, EDTA 0, lmM, DTT ImM, Glycerol 5%, Triton X-100 0.1% and the gradient (between 0% and 50% deB) is produced by KCl present in buffer B at 2M.
- the second dialysis is carried out with the following buffer: K2HP04 pH 7.5, 10 lMM, K2PO4 lOmM, KCl 25 mM, DTT ImM, Triton X-100 0.1%, Glycerol 10%, for 1 hour.
- the solution is then loaded onto a MonoQ column at 0.5 ml / min with a NaCl gradient of 0 to 20% (buffers used for heparin).
- the 3 '-5' exonuclease assays are quantified by the release of P labeled nucleotides.
- a first step makes it possible to carry out the labeling: the DNA of is digested with HindIII and then, the Klenow fragment copies the DNA from the free 3 '-OH ends, in a medium containing, in addition to the buffer, l DNA and the enzyme, 3 P dATP and 32 P dTTP, the dGTP and dCTP being cold. After one hour at 37 ° C, the four cold dNTPs are added in excess for half a hour. Kleno and dNTPs are removed by phenol extraction and precipitated with ethanol.
- the exonuclease tests are carried out in solutions containing the enzyme buffers, 0.02 mg / ml of labeled DNA, and incubated overnight at 72 ° C, 80 ° C and 95 ° C. Different buffers containing MgCl2 or MnS04 are tested. The same test is carried out with the Wind as a positive control. 101 of reaction solution are then deposited on DE81 paper (Watmann), dried and then counted before and after washing (3 times 5 min with a2HP04, 1 time with water and then with 95% ethanol) using the technique from Cerenkov.
- the labeling reaction uses the polynucleotide kinase to label the 5' substrate.
- the Southern blot hybridization revealed fragments of two types: a HindIII-HindIII fragment of 1.9 kb and an Xhol-Xhol fragment of 5 kb. These two fragments were revealed only with the probe
- RECTIFIED SHEET (RULE 91) ISAEP digested with Xhol. About 400 recombinants (E. Coli SURE) were screened with the Clal-HindIII probe. Of these 400 colonies, two gave a positive signal during hybridization. (n ° 9.26 and 12.79). The two clones were cultured in LB-Amp medium and their restriction profiles were identical, with an insert of 1.8 kb. The subsequent sequencing of one of these clones (designated 557MACa) and the sequence comparison (Megalign, DNASTAR program) have shown that it corresponds to the promoter region and to the first 1404 base pairs of a DNA gene. polymerase belonging to the family B (2).
- fragment obtained by digestion of PCR product This fragment was cloned as before. About 200 recombinant clones were screened and four of them gave a positive signal. The four clones have an identical profile after digestion with the restriction enzyme HindIII. One of them, 557MACc, was sequenced and the sequence comparison demonstrated that it was the end of the DNA polymerase gene previously identified.
- Thermococcus fumicolans is a new species of Thermococcales described by Godfroy et al (8).
- the second is inserted at the base n ° 3157 and ends at 4323. It is also distributed between the clone 557MACb and 557MACC.
- the coding section of the DNA polymerase gene therefore consists of three disjoint parts.
- the first part of the gene, carried by 557MACa comprises the zone coding for the 3'-5 'exonuclease, where, after translation, the motif FDIET is recognized.
- the second part, carried by 557MACb and the third part carried by 557MACC include the preserved sites SLYPSI, and YG.DTD.
- the two insertion sequences are located on conserved areas of DNA polymerase, the first at the DFR / SLYPSII site such as I-KOD-1 of Pyrococcus sp. KOD1 and the second at the D / TDG site as T-I-Tli-1. l i toral i s.
- I-Tfu-1 and I-Tfu-2 seem to represent two new "alleles” of "homing" archaebacterial endonucleases of classes A and C.
- I-Tfu-1 is the third known allele of intein inserting into the A site of Archaea DNA polymerase, while I-Tfu-2 is the second of its class.
- rTful00-2 clone Two other clones, also tested, are weakly inducible and express very little DNA polymerase. The rTful-1 and rTful00-2 clones are then tested in a 50 ml Erlenmeyer flask under the conditions described above. Only the inducible rTfulOO-2 clone has a constant volume expression of 50 ml. The rest of the work was therefore carried out on this clone.
- the culture of the clone Tful00-2 was carried out in a 16 liter fermenter, in the 2xYT medium supplemented with ampicillin and chloramphenicol.
- the fermenter has been prepared and put in temperature at 30 ° C with the culture medium.
- the preculture was returned to 30 ° C one hour before its transfer to the fermenter.
- the final culture volume is 15 liters.
- the pH of the culture was adjusted to 7 during the acidification phase and then left free during the alkaline phase.
- the bacteria were removed after four hours of induction, when the pH was 8.3.
- the culture was then centrifuged and the cells were divided into three batches. One of them, 20 g of paste, was taken up in 80 ml of lysis buffer for further processing indicated in the Materials and methods chapter.
- Fractions 54 to 60 are loaded at a rate of 0.25 ml / min. The elution makes it possible to recover 65 fractions of 5 ml with peaks of activity for fractions 36 to 56.
- the dosage of the activity shows a concentration of 3 to 5 units for fractions 36 to 40.
- FIG. 4 shows the result on SDS-PAGE gel. Following these three columns, the purity of the polymerase is significantly improved. Nevertheless, traces of DNA from E. coli attached to the polymerase and demonstrated by PCR remain. Two additional columns will therefore be implemented. 30ml of active fractions obtained previously are loaded onto a phosphocellulose column (this should fix the DNA and the polymerase differently).
- the activity of the polymerase is identified by its high 3 '-5' exonuclease activity at 72 ° C. on the DNA of Lambda digested with HindIII. The strongest activity is located between fractions 40 and 49 with a sharp peak at 46 and 47. On these fractions 40 to 49 the pBR322 DNA is intact after overnight at 37 ° C.
- Figure 5 shows us the results on gel with activities
- T buffer Tris HC1 pH 8.8: 600mM; KCl: 500mM; MgC12: 15mM; Tween 20: 0.1%
- FIG. 8 presents the results obtained with a reaction volume of 50 ⁇ l for quantities of DNA polymerase from Thermococcus fumicolans of 2.7 units. As it stands, the best results with Tfu are obtained with buffer R.
- thermostability measured according to the protocol described above (Materials and Methods), or better, the residual activity in incorporation at 72 ° C. after exposure of the enzyme to high temperatures for variable times, is given in table 5 below. below.
- thermostability which is lower than that of polymerases from more hyperthermophilic organisms such as Pyrococcus, is nonetheless very much greater than that of polymerases from Thermus and in particular all Taq.
- the thermostability of the purified recombinant enzyme, both for the polymerase domain and for the exonuclease, is in any case very much higher than all known requirements in PCR.
- GAG TTC GCC GAG GGG CCT ATT CTT ATG ATA AGC TAT GCC
- GAC GAG GAA 954 Glu Phe Ala Glu Gly Pro Ile Leu Met Ile Ser Tyr Ala Asp Glu Glu 155 160 165
- GTT GCC AAA AAA GAG GTT TAT CTT AGG GTT AGG GAA ATT ATG GAT GGC 2298 Val Ala Lys Lys Glu Val Tyr Leu Arg Val Arg Glu Ile Met Asp Gly 620 625 630
- NAME / KEY DNA polymerase of THERMOCOCCUS fumicolans Tfu (xi) DESCRIPTION OF THE SEQUENCES: SEQ ID NO: 2:
- Trp Glu Leu Ile Gly Leu Leu Val Gly Asp Gly Asn Trp Gly Gly Gin 145 150 155 160
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Molecular Biology (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention concerns a purified thermostable DNA polymerase, thermostable archae bacteria DNA polymerase of the Thermococcus fumicolans species having a molecular weight of the order of 89000 daltons and its thermostable inteins.
Description
ADN POLYMERASE THERMOSTABLE ET INTEINES DE L'ESPECE THERMOCOCCUS FUMICOLANS THERMOSTABLE DNA POLYMERASE AND INTEINS OF THE THERMOCOCCUS FUMICOLANS SPECIES
La présente invention concerne une nouvelle ADN polymerase thermostable ainsi que ses deux intéines, provenant d'une archaebactérie de l'espèce Thermococcus fumicolans .The present invention relates to a new thermostable DNA polymerase and its two inteins, originating from an archaebacterium of the species Thermococcus fumicolans.
Les ADN polymérases sont des enzymes impliquées dans la réplication et la réparation de 1 'ADN dans toute cellule vivante. On connaît aujourd'hui de nombreuses ADN polymérases isolées de micro-organismes tel que E. coliDNA polymerases are enzymes involved in the replication and repair of DNA in any living cell. Many DNA polymerases isolated from microorganisms such as E. coli are known today.
(ADN polymerase I) ou du phage T4. Des ADN polymérases ont aussi été identifiées et purifiées et à partir de micro-organismes thermophiles comme Thermus aqua ticus (Taq polymerase, Chien, A. et a l . J. Bact. 1976,(DNA polymerase I) or phage T4. DNA polymerases have also been identified and purified and from thermophilic microorganisms such as Thermus aqua ticus (Taq polymerase, Chien, A. et a. J. Bact. 1976,
127:1550-1557 ; Kaladin et al . Biokhymiya 1980, 45:644-127: 1550-1557; Kaladin et al. Biokhymiya 1980, 45: 644-
651), Thermus thermophilus , ou encore des espèces du genre Bacillus (demande de brevet Européen publiée sous le No. 699 760), Thermococcus (demande de brevet Européen No. 455 430), Sulfolobus et Pyrococcus (demande de brevet651), Thermus thermophilus, or species of the genus Bacillus (European patent application published under No. 699 760), Thermococcus (European patent application No. 455 430), Sulfolobus and Pyrococcus (patent application
Européen publiée sous le No. 547 359) . Parmi ces ADN polymérases issues d'archaebactéries on peut citer laEuropean published under No. 547 359). Among these DNA polymerases originating from archaebacteria, mention may be made of
Pfu , isolées de Pyrococcus furiosus (18), la Vent™ polymerase de Thermococcus li toralis (10), la 9°N de Pyrococcus sp. 9 °N (15) et la DeepVent™ de Pyrococcus GB-Pfu, isolated from Pyrococcus furiosus (18), Vent ™ polymerase from Thermococcus li toralis (10), 9 ° N from Pyrococcus sp. 9 ° N (15) and the DeepVent ™ from Pyrococcus GB-
D, les deux premières provenant de souches du littoral (Baie de Naples) , les deux suivantes de souches sous- marines profondes .D, the first two from littoral strains (Bay of Naples), the next two from deep underwater strains.
Le mécanisme d'action des ADN polymérases est aujourd'hui relativement bien connu et consiste en une réplication de l'ADN à l'identique selon un mode semi- conservatif. Le brin recopié sert de matrice et les quatre nucléotides triphosphates sont le substrat de cette polymérisation. Les enzymes ayant une activité ADN polymerase sont aujourd'hui de plus en plus utilisées in vi tro afin de travailler en biologie moléculaire dans divers buts tels le clonage, la détection d'erreurs, leThe mechanism of action of DNA polymerases is relatively well known today and consists of replication of DNA identically according to a semi-conservative mode. The copied strand serves as a matrix and the four nucleotide triphosphates are the substrate for this polymerization. Enzymes with DNA polymerase activity are increasingly used today in vitro to work in molecular biology for various purposes such as cloning, error detection,
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
séquençage, le marquage, et de façon générale, l'amplification de séquences d'acides nucléiques.RECTIFIED SHEET (RULE 91) ISA / EP sequencing, labeling, and generally speaking, amplification of nucleic acid sequences.
Cette amplification, in vi tro , de séquences d'acide désoxyribonucléique fait appel à la technique de la réaction de polymérisation en chaine (PCR) décrite dans les brevets Européens No. 200 362 et 201 184. Le principe de cette technique est basé sur la réalisation de cycles successifs d'extension d'amorces mettant en oeuvre les quatre nucléotides triphosphates ainsi qu ' une ADN polymerase et un ADN matrice à recopier. A chaque cycle, l'enzyme double le nombre de brins d'ADN disponibles et entre chaque cycle une thermodénaturation est nécessaire afin d'ouvrir la double hélice d'ADN pour le cycle suivant. Les températures utilisées pour cette étape de thermodénaturation ne sont pas compatibles avec la conservation de l'activité de la plupart des ADN polymérases connues, telle la Klenow. C'est ainsi que des nombreux efforts de recherche ont été réalisés afin de trouver des enzymes supportant ces températures . Cependant, si les micro-organismes thermophiles sont aujourd'hui connus, il reste encore difficile d'obtenir ces enzymes thermostables avec des rendements de production suffisants. La biologie moléculaire et le génie génétique permettent de palier cet inconvénient. Ainsi, une fois repéré dans le génome, le gène codant pour l'ADN polymerase est clone, séquence puis recloné dans un vecteur d'expression afin de produire la protéine dite alors recombinante, chez un hôte mésophile plus aisé à cultiver tel E. coli ou S. cerevisiae. Cette méthode d'expression chez E. coli a notamment été décrite dans la demande de brevet internationale PCT publiée sous le No. W0 89/06691 pour produire l'ADN polymerase de Thermus aquaticus .This amplification, in vi tro, of deoxyribonucleic acid sequences calls upon the technique of the polymerase chain reaction (PCR) described in European patents Nos. 200 362 and 201 184. The principle of this technique is based on the carrying out successive cycles of extension of primers using the four nucleotide triphosphates as well as a DNA polymerase and a template DNA to be copied. At each cycle, the enzyme doubles the number of DNA strands available and between each cycle thermodenaturation is necessary in order to open the DNA double helix for the next cycle. The temperatures used for this thermodenaturation step are not compatible with the conservation of the activity of most of the known DNA polymerases, such as Klenow. Many research efforts have been made to find enzymes that can withstand these temperatures. However, if thermophilic microorganisms are known today, it still remains difficult to obtain these thermostable enzymes with sufficient production yields. Molecular biology and genetic engineering overcome this drawback. Thus, once located in the genome, the gene coding for DNA polymerase is cloned, sequenced and then recloned in an expression vector in order to produce the so-called recombinant protein, in a mesophilic host which is easier to cultivate such as E. coli or S. cerevisiae. This expression method in E. coli has in particular been described in the international patent application PCT published under No. WO 89/06691 for producing DNA polymerase from Thermus aquaticus.
L'ADN polymerase de l'invention provient d'une archaebactérie de l'espèce Thermococcus fumicolans . Outre ses propriétés thermostables la rendant particulièrement efficace notamment dans une processus de PCR, cette ADNThe DNA polymerase of the invention comes from an archaebacterium of the species Thermococcus fumicolans. In addition to its thermostable properties making it particularly effective in particular in a PCR process, this DNA
FEUILLE RECTIFIEE (REGLE 91) ISAEP
polymerase est remarquable en ce qu'elle présente deux " introns protéiques", encore appelés "intéines", au niveau de son polypeptide précurseur.RECTIFIED SHEET (RULE 91) ISAEP polymerase is remarkable in that it has two "protein introns", also called "inteins", at the level of its precursor polypeptide.
La séquence nucléotidique de ses intéines est insérée dans celle de l'ADN polymerase, généralement au niveau de sites conservés impliqués, après traduction, dans les réactions catalytiques . Ces séquences sont transcrites et traduites en même temps que celle de l'ADN polymerase et l'épissage autocatalytique des intéines produit alors trois enzymes: deux intéines et une ADN polymerase .The nucleotide sequence of its inteins is inserted into that of DNA polymerase, generally at the level of conserved sites involved, after translation, in catalytic reactions. These sequences are transcribed and translated at the same time as that of DNA polymerase and the autocatalytic splicing of the inteins then produces three enzymes: two inteins and one DNA polymerase.
On trouve de telles intéines également au sein d'autres molécules telles l'ATPase vacuolaire chez S . cerevisiae (4), GyrA chez Mycobacterium leprae (7), Rec A chez Mycobacterium tuberculosis (5, 6). Les intéines font partie pour leur majorité de la famille des endonucléases de type "homing endonucléases" puisqu'elles coupent l'ADN en un site reconnu, à l'endroit même où leur séquence nucléotidique vient s'insérer. Le développement des biotechnologies tant dans la recherche que dans les domaines de la médecine ou de 1 ' agro-alimentaire, nécessite de disposer de divers types d'ADN polymérases susceptibles d'améliorer quantitativement et qualitativement des techniques aussi diverses que le clonage, la détection, l'amplification de séquences d'ADN. La présente invention vise précisément à offrir une nouvelle ADN polymerase thermostable qui est issue d'une espèce récemment décrite: Thermococcus fumi colans (8) . Cet isolât a été isolé à partir de fragments de cheminées prélevées dans le bassin Nord- Fidgien lors de la campagne franco-japonnaise STARMER en 1989. Cette espèce, anaérobie stricte, présente une température optimale de croissance de 90°C, ce qui est relativement élevé pour un Thermococcus . Son pH optimum est de 8,8, et son taux de salinité de 20 g/1 à 40 g/1.Such inteins are also found in other molecules such as vacuolar ATPase in S. cerevisiae (4), GyrA in Mycobacterium leprae (7), Rec A in Mycobacterium tuberculosis (5, 6). The inteins belong for the most part to the family of endonucleases of the "homing endonucleases" type since they cut DNA at a recognized site, at the very place where their nucleotide sequence is inserted. The development of biotechnologies both in research and in the fields of medicine or agri-food, requires having various types of DNA polymerases capable of improving quantitatively and qualitatively techniques as diverse as cloning, detection , amplification of DNA sequences. The present invention aims precisely to offer a new thermostable DNA polymerase which is derived from a recently described species: Thermococcus fumi colans (8). This isolate was isolated from fragments of chimneys taken in the North Fidgian basin during the Franco-Japanese STARMER campaign in 1989. This species, strict anaerobic, has an optimal growth temperature of 90 ° C, which is relatively high for a Thermococcus. Its optimum pH is 8.8, and its salinity level from 20 g / 1 to 40 g / 1.
L'invention a donc pour objet une ADN polymerase purifiée thermostable d'archaebactéries deThe subject of the invention is therefore a purified thermostable DNA polymerase of archaebacteria of
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
l'espèce Thermococcus fumi colans ayant un poids moléculaire de l'ordre de 89000 Da ainsi que ses intéines thermostables , dont le gène comportant les deux séquences codant pour lesdites intéines a été clone.RECTIFIED SHEET (RULE 91) ISA / EP the species Thermococcus fumi colans having a molecular weight of the order of 89,000 Da as well as its thermostable inteins, the gene of which comprising the two sequences coding for said inteins has been cloned.
Les travaux de recherche ayant permis d'identifier, de sequencer et d'étudier l'ADN polymerase de 1 ' invention ont été réalisée à partir de la souche Thermococcus fumicolans ST557 déposée à la Collection de l'Institut Pasteur (CIP) sous le numéro CIP 104680. Cette ADN polymerase sera dénommée dans ce qui suit Tfu . Sa séquence de 774 acides aminés est représentée dans la liste de séquences en annexe sous le numéro SEQ ID NO: 2. Un poids moléculaire de 89797 Da et un pi de 8.1 ont été déduits de cette séquence.The research work which made it possible to identify, sequence and study the DNA polymerase of the invention was carried out using the strain Thermococcus fumicolans ST557 deposited in the Collection of the Institut Pasteur (CIP) under the number CIP 104680. This DNA polymerase will be named in the following Tfu. Its sequence of 774 amino acids is represented in the sequence list in the appendix under the number SEQ ID NO: 2. A molecular weight of 89797 Da and a pI of 8.1 have been deduced from this sequence.
L'invention concerne donc l'ADN polymerase purifiée thermostable d'archaebactéries de l'espèce Thermococcus fumicolans ayant un poids moléculaire de l'ordre de 89 000 daltons ainsi que ses dérivés enzymatiquement équivalents . On entend par dérivés enzymatiquement équivalent, les polypeptides et protéines constitués par ou comprenant la séquence en acides aminés représentée dans la liste de séquences en annexe sous le numéro SEQ ID NO : 2 dès lors qu'ils présentent les propriétés de l'ADN polymerase de Thermococcus fumi col ans . A ce titre l'invention envisage plus particulièrement une ADN polymerase dont la séquence en acides aminés est représentée dans la liste de séquence en annexe sous le numéro SEQ ID NO:l ou un fragment de celle-ci ou encore un assemblage de tels fragments, comme la séquence de 774 acides aminés représentée dans la liste de séquences en annexe sous le numéro SEQ ID NO: 2.The invention therefore relates to purified thermostable DNA polymerase of archaebacteria of the species Thermococcus fumicolans having a molecular weight of the order of 89,000 daltons as well as its enzymatically equivalent derivatives. The term “enzymatically equivalent derivatives” is understood to mean the polypeptides and proteins constituted by or comprising the amino acid sequence represented in the sequence list in the appendix under the number SEQ ID NO: 2 as soon as they exhibit the properties of the DNA polymerase of Thermococcus fumi col ans. As such, the invention more particularly contemplates a DNA polymerase, the amino acid sequence of which is represented in the sequence list in the appendix under the number SEQ ID NO: 1 or a fragment thereof or an assembly of such fragments, such as the 774 amino acid sequence represented in the sequence list in the appendix under the number SEQ ID NO: 2.
En effet, la présence de deux intéines I-Tfu-1 et I-Tfu-2 dans la séquence numéro SEQ ID NO:l, sont susceptibles de conduire lors de la préparation par synthèse chimique ou par génie génétique, à des séquences d'ADN polymerase de T. fumiculans tronquées dont lesIndeed, the presence of two inteins I-Tfu-1 and I-Tfu-2 in the sequence number SEQ ID NO: 1, are likely to lead during the preparation by chemical synthesis or by genetic engineering, to sequences of Truncated T. fumiculans DNA polymerase, the
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
propriétés enzymatiques sont équivalente à celle de l'ADN polymerase de T. fumicolans purifiée.RECTIFIED SHEET (RULE 91) ISA / EP Enzymatic properties are equivalent to that of purified T. fumicolans DNA polymerase.
On entend aussi par dérivés, les séquences en acides aminés ci-dessus modifiées par insertion et/ou délétion et/ou substitution d'un ou plusieurs aminoacides , pour autant que les propriétés de l'ADN polymerase de T. fumicolans qui en résultent ne soient pas significativement modifiées.The term “derivatives” is also understood to mean the amino acid sequences modified above by insertion and / or deletion and / or substitution of one or more amino acids, provided that the properties of the DNA polymerase of T. fumicolans which result therefrom are not significantly changed.
L'invention concerne également une séquence d'ADN constituée par ou comprenant la séquence codant pour une ADN polymerase de l'invention.The invention also relates to a DNA sequence consisting of or comprising the sequence coding for a DNA polymerase of the invention.
La séquence d'ADN représenté dans la liste de séquences en annexe sous le numéro SED ID NO: 1 représente une telle séquence. L'ADN codant pour l'ADN polymerase de T. fumicolans et ses deux intéines est constituée par le nucleotides 357 à 5028. Les nucleotides 1 à 356 correspondent à la région promotrice de ce gène. En conséquence, l'invention a pour objet une séquence d'ADN constituée par ou comprenant la séquence comprise entre les nucleotides 357 à 5028 de la SED ID NO: 1, ou un fragment de celle-ci ou encore un assemblage de tels fragmen s .The DNA sequence represented in the annexed sequence list under the number SED ID NO: 1 represents such a sequence. The DNA coding for the DNA polymerase of T. fumicolans and its two inteins consists of nucleotides 357 to 5028. Nucleotides 1 to 356 correspond to the promoter region of this gene. Consequently, the subject of the invention is a DNA sequence constituted by or comprising the sequence between nucleotides 357 to 5028 of SED ID NO: 1, or a fragment thereof, or an assembly of such fragmen s .
1 ' invention se rapporte tout particulièrement à une séquence d'ADN constituée par ou comprenant les nucleotides 357 à 1674 et 2755 à 3156 et 4324 à 5028 de la séquence d'ADN représenté dans la liste de séquences en annexe sous le numéro SED ID NO : 1.The invention relates more particularly to a DNA sequence constituted by or comprising nucleotides 357 to 1674 and 2755 to 3156 and 4324 to 5028 of the DNA sequence represented in the sequence list in the appendix under the number SED ID NO : 1.
Cette séquence code pour l'ADN polymerase de T. fumicolans dont la séquence de 774 acides aminés est représentée dans la liste de séquences en annexe sous le numéro SED ID NO : 2.This sequence codes for the DNA polymerase of T. fumicolans whose sequence of 774 amino acids is represented in the sequence list in the appendix under the number SED ID NO: 2.
L'invention concerne autant l'ADN polymerase isolées et purifiées de la souche Thermococcus fumicolans que l'ADN polymerase préparées par synthèse chimique, par exemple par ligature de fragments polypeptidiques , ou encore par les méthodes du génie génétique.
Dans le cadre de ces méthodes du génie génétique, l'invention a aussi pour objet un vecteur comprenant une séquence d'ADN définie précédemment, ainsi qu'un procédé de production ou d'expression dans un hôte cellulaire des ADN thermostables de l'invention.The invention relates as much to DNA polymerase isolated and purified from the Thermococcus fumicolans strain as to DNA polymerase prepared by chemical synthesis, for example by ligation of polypeptide fragments, or also by genetic engineering methods. Within the framework of these genetic engineering methods, the invention also relates to a vector comprising a DNA sequence defined above, as well as a process for the production or expression in a cellular host of the thermostable DNAs of the invention. .
Un procédé de production d'une ADN polymerase conforme à l'invention consiste:A process for the production of a DNA polymerase according to the invention consists in:
- à transférer une séquence d'acide nucléique codant pour l'ADN polymerase ou un vecteur contenant ladite séquence dans un hôte cellulaire,transferring a nucleic acid sequence coding for DNA polymerase or a vector containing said sequence into a cell host,
- à cultiver l'hôte cellulaire obtenu à l'étape précédente dans des conditions permettant la production de l'ADN polymerase,to cultivate the cell host obtained in the previous step under conditions allowing the production of DNA polymerase,
- à isoler, par tous moyens appropriés, ladite ADN polymerase.- to isolate, by any appropriate means, said DNA polymerase.
L'hôte cellulaire mis en oeuvre dans les procédés précédents peut être choisi parmi les procaryotes ou les eucaryotes et notamment parmi les bactéries, les levures, les cellules de mammifères, de plantes ou d'insectes.The cell host used in the above methods can be chosen from prokaryotes or eukaryotes and in particular from bacteria, yeasts, mammalian, plant or insect cells.
Le vecteur utilisé est choisi en fonction de l'hôte dans lequel il sera transféré.The vector used is chosen according to the host to which it will be transferred.
L'ADN polymerase thermostable de l'invention est utile notamment dans les procédés d'amplification enzymatique de séquences d'acides nucléiques. En conséquence, l'invention a pour objet, de tels procédés mettant en oeuvre l'ADN polymerase thermostable décrite précédemment, ainsi que les kits d'amplification comprenant, outre les réactifs généralement utilisés, une quantité adéquate de cette ADN polymerase.The thermostable DNA polymerase of the invention is useful in particular in the methods of enzymatic amplification of nucleic acid sequences. Consequently, the subject of the invention is such methods using the thermostable DNA polymerase described above, as well as the amplification kits comprising, in addition to the reagents generally used, an adequate quantity of this DNA polymerase.
L'invention a également pour objet une intéine purifiée thermostable d'archaebactéries de l'espèce Thermococcus fumicolans. Comme indiqué précédemment, les intéines sont aussi définies comme des introns protéiques qui sont non pas épissés au niveau de l'ARN messager mais au niveauThe invention also relates to a purified thermostable intein of archaebacteria of the species Thermococcus fumicolans. As indicated previously, the inteins are also defined as protein introns which are not spliced at the level of the messenger RNA but at the level
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
d'une maturation proteique. Elles relèvent donc d'un seul gène traduit et transcrit en une seule étape, et constituent des sous produits de la maturation de la protéine codée par ce gène (Xu, M. G. , Comb, D.G ., Paulus H., Noren C.J., Shao Y., Perler, F., 1994, Protein splicing : an analysis of the branched intermediate and its resolution by succinimidine formation . EMBO J . 13, 5517-5522.)RECTIFIED SHEET (RULE 91) ISA / EP protein maturation. They therefore relate to a single gene translated and transcribed in a single step, and constitute by-products of the maturation of the protein encoded by this gene (Xu, MG, Comb, DG., Paulus H., Noren CJ, Shao Y ., Perler, F., 1994, Protein splicing: an analysis of the branched intermediate and its resolution by succinimidine formation. EMBO J. 13, 5517-5522.)
Les intéines sont des endonucléases de restriction qui ont la propriété de couper l'ADN à l'endroit même où s'insère leur gène, et en conséquence, elles peuvent être considérées comme des séquences égoïstes .The inteins are restriction endonucleases which have the property of cutting DNA at the very place where their gene is inserted, and therefore they can be considered as selfish sequences.
Les intéines possèdent dans leur séquence toute l'information nécessaire à leur propre épissage puisqu'elles s'épissent dans E. coli .The inteins have in their sequence all the information necessary for their own splicing since they splice in E. coli.
Il est possible de distinguer quatre grandes étapes de maturation proteique :It is possible to distinguish four main stages of protein maturation:
Une première étape de formation d'un intermédiaire linéaire qui possède une fonction ester.A first step in the formation of a linear intermediary which has an ester function.
Cette réaction est dépendante du pH et de l'environneme t local de cette liaison (nature des acides aminés) . Ce principe est utilisé dans les kits de clonage, d'expression ou de purification mettant en oeuvre des intéines, car un changement de l'environnement provoque ou non un épisssage. En effet, celui-ci serait inhibé à pH 11 et activé à pH 7,5.This reaction is dependent on the pH and the local environment of this bond (nature of the amino acids). This principle is used in cloning, expression or purification kits using inteins, because a change in the environment causes splicing or not. Indeed, it would be inhibited at pH 11 and activated at pH 7.5.
- Une deuxième étape de transestérif ication qui permet de transformer l'intermédiaire précédent et de déplacer l'équilibre de la première étape.- A second transesterification step which allows the previous intermediary to be transformed and to shift the balance of the first step.
La troisième étape consiste en une cyclisation de 1 ' asparagine libérant l'intéine.The third step consists in a cyclization of 1 asparagine releasing the intein.
- La quatrième étape est la stabilisation de la protéine mature et la formation d'une réelle liaison peptidique.- The fourth step is the stabilization of the mature protein and the formation of a real peptide bond.
Il est donc possible de construire des mutants thermosensibles permettant de bloquer l' épissageIt is therefore possible to construct thermosensitive mutants making it possible to block splicing.
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
proteique aux températures d'expression (30°C) et de 1 ' induire par chauffage .RECTIFIED SHEET (RULE 91) ISA / EP protein at expression temperatures (30 ° C) and induce it by heating.
Cette possibilité de contrôle de l' épissage proteique par la température peut-être utilisée dans des vecteurs de clonage avec une séquence codant pour 1 ' intéine et des sites de clonage autour. Si la protéine à cloner et à exprimer est toxique pour l'hôte, on peut la cloner en deux fragments autour de la séquence d' intéine. Ainsi, globalement, le gène à cloner est complet mais il est interrompu par la séquence de l' intéine. Lors de l'expression, 1 ' intéine se retrouve dans la protéine exprimée, la rendant ainsi inactive. Il est alors possible par chauffage, à la fin de l'induction, de libérer 1 ' intéine par épissage autocatalytique et ainsi retrouver la protéine clonée active .This possibility of controlling protein splicing by temperature can be used in cloning vectors with a sequence coding for the intein and around cloning sites. If the protein to be cloned and expressed is toxic to the host, it can be cloned into two fragments around the intein sequence. Thus, overall, the gene to be cloned is complete but it is interrupted by the sequence of the intein. During expression, the intein is found in the expressed protein, thus making it inactive. It is then possible by heating, at the end of the induction, to release the intein by autocatalytic splicing and thus find the cloned active protein.
Les intéines permettent ainsi de réaliser des purifications et sont utilisés dans des kits selon le principe décrit ci-après. Certains résidus autour de site d' épissage de 1 ' intéine sont modifiés. L'expression de la protéine recombinante est réalisée à basse température pour bloquer un éventuel épissage trop précoce. En C- terminal de l' intéine est fixé un site ayant une forte affinité pour la chitine. Lors de l'induction, la protéine clonée est exprimée ainsi que 1 ' intéine et le site de fixation à la chitine. La purification est alors réalisée avec la chitine, sur des colonnes d'affinité, qui retiennent la chitine et aussi 1 ' intéine et la protéine clonée, le tout faisant partie de la même pré- protéine. On hydrolyse alors l'extrémité N-terminale de 1 ' intéine avec du DTT ou du β-mercaptoéthanol pour libérer la protéine clonée.The inteins thus make it possible to carry out purifications and are used in kits according to the principle described below. Certain residues around the intein splicing site are modified. The expression of the recombinant protein is carried out at low temperature to block possible splicing too early. At the C-terminal of the intein is fixed a site having a strong affinity for chitin. During induction, the cloned protein is expressed as well as the intein and the chitin binding site. The purification is then carried out with the chitin, on affinity columns, which retain the chitin and also the intein and the cloned protein, the whole being part of the same pre-protein. The N-terminal end of the intein is then hydrolyzed with DTT or β-mercaptoethanol to release the cloned protein.
Les intéines sont aussi des endonucléases de restrictions thermostables, qui ont pour site de reconnaissance l'endroit même où s'insère leur gène dans la séquence "hôte" . Elles possèdent une séquence nucléique répétée deux fois (LAGLIDADG) dans la protéine,The inteins are also thermostable restriction endonucleases, which have as a recognition site the very place where their gene is inserted in the "host" sequence. They have a twice repeated nucleic sequence (LAGLIDADG) in the protein,
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
séquence plus ou moins conservée et qui correspond au site actif de reconnaissance et de coupure de l'ADN. Ces enzymes semblent aussi avoir besoin de g++ pour leur activité. II convient de remarquer que les deux intéines de 1 ' invention sont coexprimées dans E. coli et sont autoépissées . Ce qui signifie qu'elle n'ont pas de toxicité pour l'hôte, à la différence de l'une des intéines de Thermococcus li toralis (9), et en conséquence leur utilisation dans des kits d'expression ou de purification est aisée.RECTIFIED SHEET (RULE 91) ISA / EP more or less conserved sequence which corresponds to the active DNA recognition and cleavage site. These enzymes also seem to need g ++ for their activity. It should be noted that the two inteins of the invention are co-expressed in E. coli and are self-splicing. This means that they have no toxicity for the host, unlike one of the inteins of Thermococcus li toralis (9), and therefore their use in expression or purification kits is easy. .
Une première séquence d' intéine selon 1 ' invention est représentée dans la liste de séquences en annexe sous le numéro SEQ ID NO : 3. Cette intéine, dénommée I-Tfu-1, présente un poids moléculaire de 41 409 Da et un pi de 9,13, déduits de la séquence de 360 acides aminés de la séquence SEQ ID NO : 3.A first intein sequence according to the invention is represented in the annexed sequence list under the number SEQ ID NO: 3. This intein, called I-Tfu-1, has a molecular weight of 41,409 Da and a pi of 9.13, deduced from the 360 amino acid sequence of the sequence SEQ ID NO: 3.
Une seconde séquence d' intéine selon 1 ' invention est représentée dans la liste de séquences en annexe sous le numéro SEQ ID NO : 4. Cette intéine, dénommée I-Tfu-2, présente un poids moléculaire de 44 765 Da et pi de 9,6, déduits de la séquence de 389 acides aminés de la séquence SEQ ID NO: 4.A second intein sequence according to the invention is represented in the annexed sequence list under the number SEQ ID NO: 4. This intein, called I-Tfu-2, has a molecular weight of 44,765 Da and pi of 9 , 6, deduced from the 389 amino acid sequence of the sequence SEQ ID NO: 4.
Comme rappelé précédemment, les intéines thermostables de 1 ' invention sont utiles notamment dans les procédés de restriction d'acides nucléiques et dans la mise au point de vecteurs d'expression permettant de réduire la toxicité de la protéine à exprimer en insérant l'une des deux séquences d' intéines dans la séquence de la protéine à exprimer. Ceci peut se faire sans manipulation de la séquence des intéines si le clonage s'effectue chez E. coli , les techniques d'expression utilisées ayant démontré leur inocuité pour cet organisme hôte. En conséquence, l'invention a pour objet, de tels procédés mettant en oeuvre l'une des deux ou les deux intéines thermostables décrites précédemment, ainsi que les kits d'expression ou de purification contenant l'une
ou les deux séquences codant pour les dites intéines thermostables .As recalled previously, the thermostable inteins of the invention are useful in particular in methods of restriction of nucleic acids and in the development of expression vectors making it possible to reduce the toxicity of the protein to be expressed by inserting one of the two sequences of inteins in the sequence of the protein to be expressed. This can be done without manipulation of the sequence of inteins if the cloning is carried out in E. coli, the expression techniques used having demonstrated their harmlessness for this host organism. Consequently, the subject of the invention is such processes using one or both of the two thermostable inteins described above, as well as the expression or purification kits containing one or the two sequences coding for said thermostable inteins.
L'invention concerne également une séquence d'ADN constituée par ou comprenant la séquence codant pour une intéine de 1 ' invention .The invention also relates to a DNA sequence consisting of or comprising the sequence coding for an intein of the invention.
Une séquence d'ADN codant pour 1 ' intéine I-Tfu-A DNA sequence coding for the I-Tfu intein
1 est comprise entre les nucleotides 1675 et 2754 dans la séquence SED ID NO: 1 en annexe. Cette séquence d'ADN code pour 1 ' intéine dont la séquence en acides aminés est représentée dans la liste de séquences en annexe sous le numéro SED ID NO : 3.1 is between nucleotides 1675 and 2754 in the sequence SED ID NO: 1 in the appendix. This DNA sequence codes for the intein, the amino acid sequence of which is represented in the annexed sequence list under the number SED ID NO: 3.
Une séquence d'ADN codant pour 1 ' intéine I-Tfu-A DNA sequence coding for the I-Tfu intein
2 est comprise entre les nucleotides 3157 et 4323 dans la séquence SED ID NO: 1 en annexe. Cette séquence d'ADN code pour l' intéine dont la séquence en acides aminés est représentée dans la liste de séquence en annexe sous le numéro SED ID NO : 4.2 is between nucleotides 3157 and 4323 in the sequence SED ID NO: 1 in the appendix. This DNA sequence codes for the intein whose amino acid sequence is represented in the sequence list in the appendix under the number SED ID NO: 4.
L'invention concerne autant ces intéines thermostables isolées et purifiées de la souche Thermococcus fumicolans que des intéines préparées par synthèse chimique, par exemple par ligature de fragments polypeptidiques, ou encore par les méthodes du génie génétique. Dans le cadre de ces méthodes du génie génétique, l'invention a aussi pour objet un vecteur comprenant une séquence d'ADN définie précédemment, ainsi qu'un procédé de production ou d'expression dans un hôte cellulaire des ADN codant pour les intéines de l'invention. De tels procédés sont identiques à ceux rapportés précédemment pour l'ADN polymerase de T . fumicolans .The invention relates both to these thermostable inteins isolated and purified from the Thermococcus fumicolans strain as to inteins prepared by chemical synthesis, for example by ligation of polypeptide fragments, or also by genetic engineering methods. Within the framework of these methods of genetic engineering, the invention also relates to a vector comprising a DNA sequence defined above, as well as a process for the production or expression in a cellular host of DNA coding for the inteins of the invention. Such methods are identical to those reported previously for T DNA polymerase. fumicolans.
D'autres avantages et caractéristiques de 1 ' invention apparaîtront à la lecture des exemples qui suivent, donnés à titre non limitatifs et concernant le clonage, l'expression, la caractérisation et l'activitéOther advantages and characteristics of the invention will become apparent on reading the examples which follow, given without limitation and relating to cloning, expression, characterization and activity
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
de l'ADN thermostable de l'invention, et se référant aux dessins annexés dans lesquels :RECTIFIED SHEET (RULE 91) ISA / EP of the thermostable DNA of the invention, and referring to the accompanying drawings in which:
- La figure 1 représente l'hybridation ADN-ADN de l'ADN genomique de T. fumicolans digéré par diverses enzymes de restriction et hybride avec les sondes GE23ClaI-HindIII et GE23XhoI-SalI .- Figure 1 shows the DNA-DNA hybridization of the genomic DNA of T. fumicolans digested with various restriction enzymes and hybridized with the probes GE23ClaI-HindIII and GE23XhoI-SalI.
- La figure 2 représente la stratégie de clonage, la structure du gène de l'ADN polymerase de T. fumiculans et les produits du gène. - La figure 3 représente les résultats de purification de la polymerase recombinante de T. fumi colans après une colonne d'héparine sépharose, visualisés par SDS-PAGE.- Figure 2 shows the cloning strategy, the gene structure of the DNA polymerase of T. fumiculans and the gene products. - Figure 3 shows the results of purification of the recombinant polymerase of T. fumi colans after a heparin sepharose column, visualized by SDS-PAGE.
- La figure 4 représente les résultats de purification de la polymerase recombinante de T. fumiculans après une colonne Bleue HTrap n°2 , visualisé par SDS-PAGE.- Figure 4 represents the results of purification of the recombinant polymerase of T. fumiculans after a Blue column HTrap n ° 2, visualized by SDS-PAGE.
- La figure 5 représente les résultats de purification de la polymerase recombinante de T. fumi col ans après une colonne de phosphocellulose , visualisés par SDS-PAGE.- Figure 5 shows the purification results of the recombinant polymerase of T. fumi col years after a phosphocellulose column, visualized by SDS-PAGE.
- La figure 6 représente les résultats de purification de la polymerase recombinante de T. fumicolans après une colonne MonoQ, visualisés par SDS- PAGE.- Figure 6 represents the results of purification of the recombinant polymerase of T. fumicolans after a MonoQ column, visualized by SDS-PAGE.
- La figure 7 représente les résultats de PCR avec l'ADN polymerase de T. fumicolans avec les fractions d'exclusion de la colonne MonoQ.- Figure 7 shows the PCR results with DNA polymerase from T. fumicolans with the exclusion fractions from the MonoQ column.
I- Matériel et méthodes .I- Materials and methods.
1) Conditions de culture, plasmides et souches utilisées . Les souches Thermococcus li toralis (DSM 5474 T) et Pyrococcus furiosus (DSM 3638 T ) ont été obtenues auprès de la collection du Deutsche Sammlung von1) Culture conditions, plasmids and strains used. The strains Thermococcus li toralis (DSM 5474 T) and Pyrococcus furiosus (DSM 3638 T) were obtained from the collection of the Deutsche Sammlung von
Microorganismen (DSM) Braunschweig-Stocheim, Allemagne.Microorganismen (DSM) Braunschweig-Stocheim, Germany.
FEUILLE RECTIFIEE (REGLE 91) ISAEP
La souche Pyrococcus sp. GE 23 a été isolée de cheminées de sources hydrothermales profondes et a été fournie par G Erauso (CNRS, Station Biologique de Roscoff , France) . La souche Thermococcus fumicolans à été obtenue auprès du laboratoire de Microbiologie Marine de G. Barbier (IFREMER-DRV-VP-CMM) à Brest, France. Cette souche, Thermococcus fumicolans à été obtenue par purification à partir de fragments de cheminées hydrothermales receuillies dans le bassin nord-Fidgien lors de la campagne franco- aponaise STARMER effectuée en 1989 à 2000 mètres de profondeur.RECTIFIED SHEET (RULE 91) ISAEP The strain Pyrococcus sp. GE 23 was isolated from chimneys of deep hydrothermal vents and was supplied by G Erauso (CNRS, Station Biologique de Roscoff, France). The Thermococcus fumicolans strain was obtained from the Marine Microbiology laboratory of G. Barbier (IFREMER-DRV-VP-CMM) in Brest, France. This strain, Thermococcus fumicolans, was obtained by purification from fragments of hydrothermal vents collected in the North Fidgian basin during the Franco-Aponais campaign STARMER carried out in 1989 at 2000 meters depth.
Pyrococcus sp.GE23 a été cultivée à 85°C dans un milieu 2216S (DIFCO) à un pH de 6 , 5.Pyrococcus sp.GE23 was cultured at 85 ° C in 2216S medium (DIFCO) at a pH of 6.5.
Thermococcus fumi col ans , décrite en 1996 (Godfroy et al.) est cultivée en conditions anaérobies dans un milieu contenant les éléments suivants: peptone 2g/ 1; extraits de levure 0,5g/l; sel de mer (Sigma) 30g/l; tampon PIPES 6,05g/l, soufre élémentaire 10g/1, rézasurine lmg/1. Le pH est ajusté à 8,5 par de la soude 5N à 20°C.Thermococcus fumi col ans, described in 1996 (Godfroy et al.) Is cultivated under anaerobic conditions in a medium containing the following elements: peptone 2g / 1; yeast extracts 0.5 g / l; sea salt (Sigma) 30g / l; PIPES buffer 6.05g / l, elemental sulfur 10g / 1, rezasurin lmg / 1. The pH is adjusted to 8.5 with 5N sodium hydroxide at 20 ° C.
La souche de Escherichia coli SURE (Stratagene, La Jolla, Calif.) a été cultivée dans du milieu LB avec le ou les antibiotiques appropriés, à 37°C sous agitation. Cette souche a été utilisée comme hôte pour recevoir les constructions primaires à partir des vecteurs pUC 18 ou pBluescript. Les souches NovaBlue, BL21(DE3) et BL21(DE3)pLysS (Novagen, Madison, Wi . ) ont été cultivées dans du milieu 2xYT avec les antibiotiques appropriés à 37°C ou 30°C. Ces souches ont été utilisées commes hôtes pour les plasmides d'expression.The Escherichia coli SURE strain (Stratagene, La Jolla, Calif.) Was cultured in LB medium with the appropriate antibiotic (s), at 37 ° C. with stirring. This strain was used as a host to receive the primary constructs from the pUC 18 or pBluescript vectors. The NovaBlue, BL21 (DE3) and BL21 (DE3) pLysS strains (Novagen, Madison, Wi.) Were cultured in 2xYT medium with the appropriate antibiotics at 37 ° C or 30 ° C. These strains were used as hosts for the expression plasmids.
2) Isolement de l'ADN, hybridation et ADN recombinants .2) Isolation of DNA, hybridization and recombinant DNA.
L'ADN de haut poids moléculaire de Thermococcus fumicolans a été isolé par la méthode de Charbonnier modifiée (3) . Les cellules ont été resuspendues dans un tampon TE-Na IX, puis lysées à 40°C pendant trois heuresThe high molecular weight DNA of Thermococcus fumicolans was isolated by the modified Charbonnier method (3). The cells were resuspended in TE-Na IX buffer, then lysed at 40 ° C for three hours
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
avec un mélange de N-Lauryl-sarcosine 1%, dodecyl sulfate de sodium 1% et 0,4 mg/ml de protéinase K. Après la lyse, une centrifugation à 5000g pendant 10 minutes permet d'éliminer les débris cellulaires. L'ADN est extrait par un traitement au Phenol-Chloroforme-alcool Isoamylique ou PCI (25-24-1) , puis traité par la RNAse à 5μg/ml à 60°C pendant une heure. Ces étapes sont suivies de deux extractions additionelles par PCI et d'une extraction au chlorophorme . L'ADN est ensuite précipité à l'éthanol absolu à -20°C, puis centrifugé, séché à l'air et repris dans un tampon TE-lx. La concentration et la pureté de cet ADN sont mesurées par spectrophotométrie à 230, 260 et 280nm avec un appareil GeneQuantlI (Pharmacia, Upsalla, Sweden) . Pour la construction de la mini-banque genomique en pUC 18 (17), l'ADN a été totalement digéré pendant une nuit à 37°C par une série d'enzymes de restriction (BamHI, HindIII, EcoRI , EcoRV, PvuII, Sali, Xbal et Xhol) par simples et doubles digestions. Puis les fragments d'ADN sont fractionnés sur gel d'agarose à 0,8% dans de TBE-1X et transférés sous vide sur une membrane de nylon Hybond-N+ (Amersham, UK) . Ces membranes ont été hybridées avec des sondes d'ADN préparées par PCR avec des amorces spécifiques sélectionnées à partir des gènes d'ADN polymérases de P . furi osus , T . l i toral i s et Pyrococcus sp . GE23. Ces sondes sont préalablement marquées au 32P par "random primmg" conformément aux recommandations du fabricant (Megaprime, Amersham, UK) .RECTIFIED SHEET (RULE 91) ISA / EP with a mixture of 1% N-Lauryl-sarcosine, 1% sodium dodecyl sulfate and 0.4 mg / ml of proteinase K. After lysis, centrifugation at 5000 g for 10 minutes makes it possible to remove cellular debris. The DNA is extracted by a treatment with Phenol-Chloroform-Isoamyl alcohol or PCI (25-24-1), then treated with RNAse at 5 μg / ml at 60 ° C for one hour. These steps are followed by two additional PCI extractions and a chlorophoric extraction. The DNA is then precipitated with absolute ethanol at -20 ° C, then centrifuged, air dried and taken up in TE-1x buffer. The concentration and purity of this DNA are measured by spectrophotometry at 230, 260 and 280nm with a GeneQuantlI device (Pharmacia, Upsalla, Sweden). For the construction of the genomic mini-bank in pUC 18 (17), the DNA was completely digested overnight at 37 ° C. by a series of restriction enzymes (BamHI, HindIII, EcoRI, EcoRV, PvuII, Sali , Xbal and Xhol) by simple and double digestions. Then the DNA fragments are fractionated on 0.8% agarose gel in TBE-1X and transferred in vacuo to a Hybond-N + nylon membrane (Amersham, UK). These membranes were hybridized with DNA probes prepared by PCR with specific primers selected from the DNA polymerase genes of P. furi osus, T. li toral is and Pyrococcus sp. GE23. These probes are previously marked with 32P by "random primmg" in accordance with the manufacturer's recommendations (Megaprime, Amersham, UK).
Deux sondes de P. furiosus ont été utilisées, Pfu et PfuTwo P. furiosus probes were used, Pfu and Pfu
F, couvrant respectivement les régions délimitées par les paires de bases 8 à 2316 et 819 à 1915 de la section codante du gène de la polymerase comme défini par Uemori et al . (18) . Deux sondes de T. li toralis, Tli I et Tli T, couvrant respectivement les régions délimitées par les paires de bases 297 à 1768 et 4631 à 5378, comme défini par Hodges et al . (9) . Deux sondes de Pyrococcus sp GE23 ont été utilisées, l'une contenant la partie 5' du gène (fragment Clal-HindIIIF, covering respectively the regions delimited by base pairs 8 to 2316 and 819 to 1915 of the coding section of the polymerase gene as defined by Uemori et al. (18). Two T. li toralis probes, Tli I and Tli T, covering the regions delimited by base pairs 297 to 1768 and 4631 to 5378 respectively, as defined by Hodges et al. (9). Two Pyrococcus sp GE23 probes were used, one containing the 5 'part of the gene (Clal-HindIII fragment
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
correspondant aux sites pb 8 et pb 1353 de la section codante) et l'autre contenant la partie terminale de ce même gène obtenu par PCR (amorces dites d'expression Ndel et Sali correspondant aux sites pb -1 et pb 2318, puis digestion par Xhol et Sali comprenant les bases du n° 1879 à 2318) . Les fragments positifs ont été identifiés par hybridation ADN-ADN (14). Seules les hybridations avec les sondes de Pyrococcus sp . GE 23 ont fourni des signaux positifs à 55°C en moins de 24 h d'exposition. Les sondes issues de T. li toralis et de P. furiosus n'ont donné aucun résultat, même à 50°C dans un tampon standard sans formamide . A partir des hybridations avec les sondes de Pyrococcus sp. GE 23, des fragments HindIII de 1,9 kb ont été sélectionnés, puis préparés par une digestion appropriée de lOOμg d'ADN genomique, purifiés dans des sacs de dialyse à partir des gels d'agarose, et précipités à l'éthanol absolu après une extraction au PCI. Les fragments ont été ligaturés dans un pUC 18/HindIII déphosphorylé. Les transformations des souches hôtes ont été réalisées par électroporation (Gène Puiser, Biorad) . Le criblage des clones recombinants a été effectué par sélection à 1 ' ampicilline , alpha- complémentation sur substrat X-Gal-IPTG puis hybridation de colonies selon les techniques standards (12) . La température des hybridations de colonies était de 55°C avec les sondes de Pyrococcus sp GE23, dans un tampon standard sans formamide. L'ADN plasmidique a été isolé selon la méthode décrite par Birnboim et Doly (1) , puis purifié par chromatographie échangeuse d'anions en phase solide (Quiagen, Chatsworth, Calif.). Les fragments de restriction des plasmides ont été purifiés sur gel d'agarose par la méthode du GeneClean (Bio 101, La Jolla, Calif.) pour un clonage ultérieur.RECTIFIED SHEET (RULE 91) ISA / EP corresponding to sites pb 8 and pb 1353 of the coding section) and the other containing the terminal part of this same gene obtained by PCR (primers known as expression Ndel and Sali corresponding to sites pb -1 and pb 2318, then digestion by Xhol and Sali including the bases of Nos. 1879 to 2318). Positive fragments were identified by DNA-DNA hybridization (14). Only hybridizations with Pyrococcus sp. GE 23 provided positive signals at 55 ° C in less than 24 h of exposure. The probes from T. li toralis and P. furiosus gave no results, even at 50 ° C in a standard buffer without formamide. From hybridizations with the Pyrococcus sp. GE 23, HindIII fragments of 1.9 kb were selected, then prepared by appropriate digestion of 100 μg of genomic DNA, purified in dialysis bags from agarose gels, and precipitated with absolute ethanol after PCI extraction. The fragments were ligated into a dephosphorylated pUC 18 / HindIII. The transformations of the host strains were carried out by electroporation (Gene Puiser, Biorad). The screening of the recombinant clones was carried out by selection with ampicillin, alpha-complementation on X-Gal-IPTG substrate then hybridization of colonies according to standard techniques (12). The temperature of the colony hybridizations was 55 ° C. with the Pyrococcus sp GE23 probes, in a standard buffer without formamide. The plasmid DNA was isolated according to the method described by Birnboim and Doly (1), then purified by anion exchange chromatography in solid phase (Quiagen, Chatsworth, Calif.). The restriction fragments of the plasmids were purified on agarose gel by the GeneClean method (Bio 101, La Jolla, Calif.) For later cloning.
L'ARNr 16S et 23S de Thermococcus fumicolans a été amplifié par PCR en utilisant les amorces suivantes :Thermococcus fumicolans 16S and 23S rRNA was amplified by PCR using the following primers:
- amorce directe Aa: 5' TCCGGTTGATCCTGCCGGAA-3 '- direct primer Aa: 5 'TCCGGTTGATCCTGCCGGAA-3'
- amorce réverse 23Sa: 5 ' -CTTTCGGTCGCCCCTACT-3 '- 23Sa reverse primer: 5 '-CTTTCGGTCGCCCCTACT-3'
FEUILLE RECTIFIEE (REGLE 91) ISAEP
- étape initiale 3 minutes à 94°C suivi de 30 cycles (94°C, 1min/ 49°C, lmn/ 72°C, 2mn) et, élongation finale de 5 irai à 72°C.RECTIFIED SHEET (RULE 91) ISAEP - initial step 3 minutes at 94 ° C followed by 30 cycles (94 ° C, 1min / 49 ° C, lmn / 72 ° C, 2mn) and, final elongation of 5 will go to 72 ° C.
Le produit de PCR a été clone dans le vecteur pUC18 pour séquençage ultérieur.The PCR product was cloned into the vector pUC18 for subsequent sequencing.
3 ) Sé uencaσe des ADN.3) Se uencaσe of DNA.
Les séquences d'ADN ont été obtenues par la méthode de terminaison de chaine (13) en utilisant un système d'analyses automatique d'ADN Applied Biosyste s. Les deux brins des gènes codant pour l'ADN polymerase et les deux intéines ont été séquences en utilisant des amorces universelles localisées sur des vecteurs ainsi que des amorces internes . La séquence d'ADNr 16S a été réalisée sur les deux brins, après clonage (SureClone, Pharmacia, Uppsala, Sweeden) , en utilisant le kit Hot-Tub (Amersham, UK. ) afin de lever les compressions .The DNA sequences were obtained by the chain termination method (13) using an Applied Biosyste s automatic DNA analysis system. The two strands of the genes coding for DNA polymerase and the two inteins were sequenced using universal primers localized on vectors as well as internal primers. The 16S rDNA sequence was carried out on both strands, after cloning (SureClone, Pharmacia, Uppsala, Sweeden), using the Hot-Tub kit (Amersham, UK.) In order to remove the compressions.
L'analyse de séquence a été réalisée avec le logiciel DNASTAR (Madison, Wis . , USA) et le programme de Genetic Computer Group (University of Wisconsin Biotechnology Center, Wis., USA) accessible en ligne sur INFOBIOGEN. Les recherches informatisées de similitude ont été réalisées avec le programme BLAST, les alignements multiples avec CLUSTAL V, et les arbres phylogénétiques ont été établis selon la méthode dite de "Neighbour- oining" (11).Sequence analysis was performed with DNASTAR software (Madison, Wis., USA) and the program of Genetic Computer Group (University of Wisconsin Biotechnology Center, Wis., USA) available online on INFOBIOGEN. Computerized similarity searches were performed with the BLAST program, multiple alignments with CLUSTAL V, and phylogenetic trees were established using the so-called Neighbouring method (11).
4 ) Construction du recombinant exprimant l'ADN polymerase de Thermococcus fumicolans .4) Construction of the recombinant expressing the DNA polymerase of Thermococcus fumicolans.
L'ADN polymerase de Thermococcus fumicolans ainsi que ses deux intéines, ont été exprimées en même temps chez E. coli avec le vecteur d'expression PARHS2 qui appartient à la famille des systèmes d'expression T7 ( 16 ) acquis auprès d ' Eurogentec .The DNA polymerase of Thermococcus fumicolans as well as its two inteins, were expressed at the same time in E. coli with the expression vector PARHS2 which belongs to the family of expression systems T7 (16) acquired from Eurogentec.
La PCR a été utilisée pour préparer le fragment complet de l'ADN polymerase et des deux intéines en
utilisant des amorces contenant les sites de restriction Ndel et BamHI :PCR was used to prepare the complete fragment of DNA polymerase and the two inteins in using primers containing the NdeI and BamHI restriction sites:
- amorce Tfu Dir : 5 ' -TGG GGA TCC ATA TGA TCC TCG ATA CAG ACT ACA TC-3 ' - amorce Tfu Rev :- primer Tfu Dir: 5 '- TGG GGA TCC ATA TGA TCC TCG ATA CAG ACT ACA TC-3' - primer Tfu Rev:
5 ' -AAG CTT GGA TCC TCA TTT CTT CCC CAT TTT GAG CC-3 '5 '-AAG CTT GGA TCC TCA TTT CTT CCC CAT TTT GAG CC-3'
Le mélange réactionnel contenait l'ADN polymerase GOLDSTAR (Eurogentec, B) , l'enzyme Taq Extender (contenant la Pfu de Stratagene) , le tampon d'extension avec les quatre dNTP (chacun à 0,2mM) et les amorces Tfu Dir et Tfu Rev à 50 pmoles dans un volume de 50μl final. L'amplification a été effectuée sur 20 cycles : 1 mn 94°C, 1 mn à 54°C et 6 mn à 72°C en utilisant un thermocycleur Stratagene 96-gradient. Les fragments de PCR ont ensuite été digérés par les enzymes Ndel et BamHI puis ligaturés aux mêmes sites du vecteur, rétablissant ainsi le codon d'initiation. La construction ainsi obtenue a été nommée PARHS2TFU1. Cette construction a été séquencée aux sites de jonction afin de vérifier son intégrité par rapport à la séquence de l'ADN genomique. Les tests d'expression ont été réalisés selon le protocole suivant: sélection des clones recombinants dans la souche E. coli Novablue, expression avec la souche BL21 (DE3 )pLysS dans un milieu 2xYT et induction de quatre heures à ImM d'IPTG.The reaction mixture contained GOLDSTAR DNA polymerase (Eurogentec, B), the Taq Extender enzyme (containing Pfag from Stratagene), the extension buffer with the four dNTPs (each at 0.2mM) and the primers Tfu Dir and Tfu Rev at 50 pmol in a volume of 50 μl final. The amplification was carried out over 20 cycles: 1 min 94 ° C, 1 min at 54 ° C and 6 min at 72 ° C using a Stratagene 96-gradient thermocycler. The PCR fragments were then digested with the enzymes NdeI and BamHI and then ligated to the same sites of the vector, thus restoring the initiation codon. The construction thus obtained was named PARHS2TFU1. This construction was sequenced at the junction sites to verify its integrity with respect to the genomic DNA sequence. The expression tests were carried out according to the following protocol: selection of the recombinant clones in the E. coli Novablue strain, expression with the BL21 (DE3) pLysS strain in a 2xYT medium and induction for four hours at ImM of IPTG.
Les premiers essais d'induction ont été réalisés sur des cultures de 5 ml, induites ou non. Des précultures de nuit sont réalisées sans inducteur et relancées dans un milieu frais le matin (au dizième) , jusqu'à ce que la densité optique (mesurée à 600nm) soit de 0,6, puis soit induites pendant 4 heures, soit non induites et arrêtées au bout de 2 , 4 ou 14 heures . Quatre ml de cultures sont alors centrifugés à 4°C, 5000rpm pendant 10 minutes. Le culot est ensuite repris dans un tampon de lyse (Tris-HCl lOmM pH 7,5; NaCl lOmM, MgCl2 2 mM) . Les cellules ainsi reprises sont alors lysées , soit avec du triton X-100 1% v/v, soit du lysozyme à lmg/ml deThe first induction tests were carried out on 5 ml cultures, whether induced or not. Night precultures are carried out without an inductor and restarted in a fresh medium in the morning (to the tenth), until the optical density (measured at 600nm) is 0.6, then either induced for 4 hours, or not induced and stopped after 2, 4 or 14 hours. Four ml of cultures are then centrifuged at 4 ° C, 5000 rpm for 10 minutes. The pellet is then taken up in a lysis buffer (10 mM Tris-HCl pH 7.5; 10 mM NaCl, 2 mM MgCl2). The cells thus taken up are then lysed, either with triton X-100 1% v / v, or lysozyme at 1 mg / ml of
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
lyse et laissées sur glace environ 5 à 10 min. Les cellules sont ensuite thermodénaturées par une exposition de 20 min à 72°C. Ceci permet de détruire en grande partie les cellules de l'hôte mésophile sans détruire les protéines recombinantes . Le produit de lyse est ensuite centrifugé 20 min à 10 OOOrpm à 4°C. Le surnageant est récupéré afin de le tester en incorporation, en PCR ou sur gel. Les incorporations sont réalisées suivant deux techniques : - Avec de la thymidine tritiée comme traceur.RECTIFIED SHEET (RULE 91) ISA / EP lysis and left on ice about 5 to 10 min. The cells are then thermodenatured by an exposure of 20 min at 72 ° C. This largely destroys the cells of the mesophilic host without destroying the recombinant proteins. The lysis product is then centrifuged for 20 min at 10 000 rpm at 4 ° C. The supernatant is recovered in order to test it in incorporation, in PCR or on gel. The incorporations are carried out according to two techniques: - With tritiated thymidine as tracer.
L'incorporation est réalisée sur du thymus de veau activé (SIGMA Aldrich, F) dans le milieu de réaction suivant: Tris-HCl 50mM pH 8,8 ; DTT ImM ; MgCl2 lOmM ; KCl lOmM ;The incorporation is carried out on activated calf thymus (SIGMA Aldrich, F) in the following reaction medium: Tris-HCl 50mM pH 8.8; DTT ImM; MgCl2 10 mM; KCl 10MM;
BSA 0,4 mg/ml; chaque dNTP à 0,4 mM. - Avec du 32 P -dATP (Amersham) comme traceur.BSA 0.4 mg / ml; each dNTP at 0.4 mM. - With 32 P -dATP (Amersham) as a tracer.
L'incorporation est réalisée sur de l'ADN thymus de veau activé (Appligene) dans le milieu de réaction suivant:The incorporation is carried out on activated calf thymus DNA (Appligene) in the following reaction medium:
Tris-HCl pH 9 50mM; KCl 50mM; MgCl2 7mM; BSA 0,2mg/ml et50 mM Tris-HCl pH 9; 50 mM KCl; 7mM MgCl2; BSA 0.2mg / ml and
(NH4+)S04 (filtré) 16mM, avec un mélange des 4 dNTP à 500μM final chacun. Cette seconde méthode permet d'estimer avec précision le nombre d'unités d'enzyme.(NH4 +) S04 (filtered) 16mM, with a mixture of the 4 dNTPs at 500μM final each. This second method makes it possible to accurately estimate the number of units of enzyme.
5) Purification.5) Purification.
a) Culture.a) Culture.
Après des essais en petits volumes, les cultures destinées à l'expression de l'enzyme recombinante ont été réalisées comme suit: production d'un inoculum de 700 ml (milieu 2x YT complémenté en ampicilline et chloramphenicol) cultivé à 30°C jusqu'à DO = 0,8; ensemencement d'un fermenteur contenant 16 1 du même milieu; culture pendant 4 h jusqu'à DO= 0,6, puis induction à l'iPTG 1 mM et culture pendant 4 h. La biomasse résultante est centrifugée 20 mn à 6 000 rpm à 4°C (Centrifugeuse JOUAN) .After tests in small volumes, the cultures intended for the expression of the recombinant enzyme were carried out as follows: production of an inoculum of 700 ml (2x YT medium supplemented with ampicillin and chloramphenicol) cultivated at 30 ° C. until at DO = 0.8; inoculation of a fermenter containing 16 1 of the same medium; culture for 4 h until OD = 0.6, then induction with 1 mM iPTG and culture for 4 h. The resulting biomass is centrifuged for 20 min at 6000 rpm at 4 ° C (JOUAN centrifuge).
FEUILLE RECTIFIEE (REGLE 91) ISAEP
b) Lyse cellulaire et première étape de purification.RECTIFIED SHEET (RULE 91) ISAEP b) Cell lysis and first purification step.
20 g de biomasse sont repris dans 80 ml de tampon (20 mM Tris-Ci pH 7 , 5 ; 10 mM NaCl; 2 mM MgC12 ; 1 mM EGTA; 1% Triton xlOO; 2,2 mM PMSF) . Le mélange résultant, maintenu à 4°C maximum, est soniqué à 12 reprises successives (cycle de 15 s) jusqu'à obtention d'une solution liquide. Le surnageant est ensuite centrifugé 20 mn à 4°C à 20 000 rpm (SORVALL Ti45) . Le surnageant est récupéré et traité par la chaleur (70°C pendant 10 mn) afin de thermodénaturer l'essentiel des protéines natives de E. coli , puis centrifugé à nouveau.20 g of biomass are taken up in 80 ml of buffer (20 mM Tris-Ci pH 7.5, 10 mM NaCl; 2 mM MgC12; 1 mM EGTA; 1% Triton xlOO; 2.2 mM PMSF). The resulting mixture, kept at 4 ° C maximum, is sonicated 12 successive times (15 s cycle) until a liquid solution is obtained. The supernatant is then centrifuged for 20 min at 4 ° C at 20,000 rpm (SORVALL Ti45). The supernatant is recovered and treated with heat (70 ° C. for 10 min) in order to thermodenature most of the native proteins of E. coli, then centrifuged again.
c) Chromatoσraphie . Les 70 ml de surnageant issus de l'étape précédente sont chargés sur une colonne Pharmacia Héparine Sépharose (30 ml de résine) , après équilibration avec un tampon A (10 mM Tris-Ci pH 7 , 5 ; 0 , 5 mM EGTA; 5 mM MgCl2; 10 mM β-mercaptoéthanol ; 0,2% Triton xlOO et 10% glycerol. Un lavage est effectué avec un tampon B (idem tampon A + 2 M NaCl) à raison de 0,3 ml/mn sur un système FPLC Pharmacia. Les différentes fractions sont récupérées en gradient de NaCl .c) Chromatography. The 70 ml of supernatant from the previous step are loaded onto a Pharmacia Heparin Sepharose column (30 ml of resin), after equilibration with buffer A (10 mM Tris-Ci pH 7.5, 0.5 mM EGTA; 5 mM MgCl2; 10 mM β-mercaptoethanol; 0.2% Triton xlOO and 10% glycerol. Washing is carried out with buffer B (same buffer A + 2 M NaCl) at a rate of 0.3 ml / min on an FPLC system Pharmacia The different fractions are recovered in NaCl gradient.
Les fractions actives ainsi récupérées sont dialysées pendant 5 h contre un tampon C: 10 mM Tris-Ci pH 7,5; 0,5 mM EGTA; 5 mM MgCl2 ; 10 mM -mercaptoéthanol; 0,2% Triton xlOO; 10% glycerol; 50 mM NaCl. Les produits issus de dialyse sont successivement chargés sur une colonne d'affinité pour les protéines se liant à l'ADN (Pharmacia, Bleue HiTrap) et élues en gradient de NaCl avec les mêmes tampons que précédemment .The active fractions thus recovered are dialyzed for 5 h against a buffer C: 10 mM Tris-Ci pH 7.5; 0.5 mM EGTA; 5 mM MgCl2; 10 mM -mercaptoethanol; 0.2% Triton x100; 10% glycerol; 50 mM NaCl. The products resulting from dialysis are successively loaded onto an affinity column for the proteins binding to DNA (Pharmacia, Blue HiTrap) and eluted in NaCl gradient with the same buffers as above.
30ml de fractions actives obtenues précédemment sont chargées sur une colonne de phosphocellulose avec de la résine Pli de Whatmann ( volume :20ml; diamètre: 2,5cm). Ces fractions ont été dialysées 5 heures contre le tampon suivant : KP04 pH7 20mM, EDTA 0,lmM, DTT ImM, Glycerol 5%, Triton X-100 0,1% et KCl 0,1M. Le débit de30ml of active fractions obtained previously are loaded onto a phosphocellulose column with Pli resin from Whatmann (volume: 20ml; diameter: 2.5cm). These fractions were dialyzed for 5 hours against the following buffer: KP04 pH7 20mM, EDTA 0, 1mM, DTT ImM, Glycerol 5%, Triton X-100 0.1% and KCl 0.1M. The flow of
FEUILLE RECTIFIEE (REGLE 91) ISAEP
la colonne est réglé à 0,2ml /min, le tampon A de chargement est composé de KP04 pH7 20mM, EDTA 0,lmM, DTT ImM, Glycerol 5%, Triton X-100 0,1% et le gradient (entre 0% et 50% deB) est réalisé par le KCl présent dans tampon B à 2M.RECTIFIED SHEET (RULE 91) ISAEP the column is adjusted to 0.2 ml / min, the loading buffer A is composed of KP04 pH7 20mM, EDTA 0, lmM, DTT ImM, Glycerol 5%, Triton X-100 0.1% and the gradient (between 0% and 50% deB) is produced by KCl present in buffer B at 2M.
Ayant déjà mis en évidence que cette polymerase ne se fixe pas sur une MonoQ ou une ressource Q et ce, quel que soit le pH utilisé, nous avons essayé de la récupérer en exclusion en faisant l'hypothèse d'une fixation de l'ADN contaminant par la colonne.Having already demonstrated that this polymerase does not bind to a MonoQ or Q resource and this, whatever the pH used, we tried to recover it in exclusion by making the hypothesis of a DNA binding contaminant through the column.
Tout d'abord un essai a été réalisé en sortie de la deuxième Bleue HiTrap avec un aliquot de 5ml et dialyse selon la même méthode que pour un passage sur une Bleue HiTrap. Une deuxième tentative a été réalisée après la phosphocellulose et après deux dialyses des fractions les plus actives 45, 46 et 49. Les fractions sont tout d'abord chauffées 40 min à 85°C. Une première dialyse est alors réalisée contre le tampon KCl 0,lmM, K2HP04 1M pH7 , 5 pendant 3 heures. La deuxième dialyse est réalisée avec le tampon suivant : K2HP04 pH 7 , 5 lOmM, K2P04 lOmM, KCl 25mM, DTT ImM, Triton X-100 0,1%, Glycerol 10%, pendant 1 heure. La solution est alors chargée sur une colonne MonoQ à 0,5ml/min avec un gradient en NaCl de 0 à 20% (tampons utilisés pour l'Héparine) .First of all, a test was carried out at the end of the second HiTrap Blue with an aliquot of 5ml and dialysis according to the same method as for a passage on a HiTrap Blue. A second attempt was made after phosphocellulose and after two dialysis of the most active fractions 45, 46 and 49. The fractions are first heated 40 min at 85 ° C. A first dialysis is then carried out against the 0.1 KM KCl buffer, 1M K2HP04 pH7.5 for 3 hours. The second dialysis is carried out with the following buffer: K2HP04 pH 7.5, 10 lMM, K2PO4 lOmM, KCl 25 mM, DTT ImM, Triton X-100 0.1%, Glycerol 10%, for 1 hour. The solution is then loaded onto a MonoQ column at 0.5 ml / min with a NaCl gradient of 0 to 20% (buffers used for heparin).
6) Activités exonucléasiques .6) Exonuclease activities.
Les tests d' exonuclease 3 '-5' sont quantifiés par la libération de nucleotides marqués au P . A cette fin, une première étape permet de réaliser le marquage: l'ADN de est digéré par HindIII puis, le fragment de Klenow recopie l'ADN à partir des extrémités 3 ' -OH libres , dans un milieu contenant outre le tampon, l'ADN et l'enzyme, du 3 P dATP et du 32P dTTP, les dGTPet dCTP étant froids. Après une heure à 37°C, les quatre dNTP froids sont rajoutés en excès pour une demi-
heure. La Kleno et les dNTP sont éliminés par une extraction au phénol et précipités à l'éthanol.The 3 '-5' exonuclease assays are quantified by the release of P labeled nucleotides. To this end, a first step makes it possible to carry out the labeling: the DNA of is digested with HindIII and then, the Klenow fragment copies the DNA from the free 3 '-OH ends, in a medium containing, in addition to the buffer, l DNA and the enzyme, 3 P dATP and 32 P dTTP, the dGTP and dCTP being cold. After one hour at 37 ° C, the four cold dNTPs are added in excess for half a hour. Kleno and dNTPs are removed by phenol extraction and precipitated with ethanol.
Les tests exonucléasiques sont effectués dans des solutions contenant les tampons des enzymes, 0,02mg/ml d'ADN marqué, et incubées toute la nuit à 72°C, 80°C et 95°C. Différents tampons contenant du MgCl2 ou du MnS04 sont testés. Le même test est réalisé avec la Vent en témoin positif. 101 de solution de réaction sont alors déposés sur papier DE81 (Watmann) , séchés puis comptés avant et après lavage (3 fois 5 min avec du a2HP04, 1 fois à l'eau puis à l'éthanol à 95%) en utilisant la technique de Cerenkov.The exonuclease tests are carried out in solutions containing the enzyme buffers, 0.02 mg / ml of labeled DNA, and incubated overnight at 72 ° C, 80 ° C and 95 ° C. Different buffers containing MgCl2 or MnS04 are tested. The same test is carried out with the Wind as a positive control. 101 of reaction solution are then deposited on DE81 paper (Watmann), dried and then counted before and after washing (3 times 5 min with a2HP04, 1 time with water and then with 95% ethanol) using the technique from Cerenkov.
Pour le test d'actvité exonucléasique 5'-3', la réaction de marquage utilise la polynucléotide kinase afin de marquer le substrat en 5 ' .For the 5'-3 'exonuclease activity test, the labeling reaction uses the polynucleotide kinase to label the 5' substrate.
II - Résultats .II - Results.
1) Isolement du gène de l'ADN polymerase de Thermococcus fumicolans ainsi σue des deux gènes d' intéines .1) Isolation of the DNA polymerase gene from Thermococcus fumicolans as well as from the two intein genes.
L'ADN de Thermococcus fumicolans , digéré par une série d'enzymes de restriction, a été hybride à des sondes de P. furiosus et T. li toralis, préparées par PCR et aux sondes de Pyrococcus sp. GE23 obtenues lors du clonage de cette autre ADN polymerase (Dépôt de brevet n°96 08631 auprès de l'INPI) . Comme montré dans les figures la et lb, l'hybridation de type Southern blot a révélé des fragments de deux types : un fragment HindlII- HindIII de 1,9 kb et un fragment Xhol-Xhol de 5 kb. Ces deux fragments ont été révélés uniquement avec la sondeThe DNA of Thermococcus fumicolans, digested with a series of restriction enzymes, was hybridized to probes of P. furiosus and T. li toralis, prepared by PCR and to probes of Pyrococcus sp. GE23 obtained during the cloning of this other DNA polymerase (Patent deposit n ° 96 08631 with the INPI). As shown in FIGS. 1a and 1b, the Southern blot hybridization revealed fragments of two types: a HindIII-HindIII fragment of 1.9 kb and an Xhol-Xhol fragment of 5 kb. These two fragments were revealed only with the probe
Clal-HindIII du gène de Pyrococcus sp. GE23, marquée auClal-HindIII of the Pyrococcus sp. GE23, marked with
32 P avec une exposition de deux heures. Les deux fragments repérés ont ensuite été récupérés et purifiés comme décrit précédemment dans Matériel et Méthodes puis clones dans le vecteur pUC18 déphosphorylé et digéré par32 P with a two hour exposure. The two identified fragments were then recovered and purified as described previously in Materials and Methods then cloned into the vector pUC18 dephosphorylated and digested with
HindIII ou dans le vecteur pBluescript déphosphorylé etHindIII or in the dephosphorylated pBluescript vector and
FEUILLE RECTIFIEE (REGLE 91) ISAEP
digéré par Xhol . Environ 400 recombinants ( E. Coli SURE) ont été criblés avec la sonde Clal-HindIII . Sur ces 400 colonies, deux ont donné un signal positif lors de l'hybridation. (n° 9.26 et 12.79). Les deux clones ont été mis en culture en milieu LB-Amp et leurs profils de restriction étaient identiques, avec un insert de 1,8 kb. Le séquençage ultérieur de l'un de ces clones (désigné 557MACa) et la comparaison de séquence (Megalign, programme DNASTAR) ont montré qu'il correspond à la région promotrice et aux 1404 premières paires de base d'un gène d'une ADN polymerase appartenant à la famille B (2) .RECTIFIED SHEET (RULE 91) ISAEP digested with Xhol. About 400 recombinants (E. Coli SURE) were screened with the Clal-HindIII probe. Of these 400 colonies, two gave a positive signal during hybridization. (n ° 9.26 and 12.79). The two clones were cultured in LB-Amp medium and their restriction profiles were identical, with an insert of 1.8 kb. The subsequent sequencing of one of these clones (designated 557MACa) and the sequence comparison (Megalign, DNASTAR program) have shown that it corresponds to the promoter region and to the first 1404 base pairs of a DNA gene. polymerase belonging to the family B (2).
En ce qui concerne le fragment Xhol-Xhol de 5 kb, 700 recombinants ont été criblés sans aucun signal positif lors de l'hybridation. Cette absence de réussite pourrait être due à la présence d'une intéine au sein de ce fragment, le rendant alors instable dans un vecteur à haut nombre de copies de type pBluescript .With regard to the 5 kb Xhol-Xhol fragment, 700 recombinants were screened without any positive signal during hybridization. This lack of success could be due to the presence of an intein within this fragment, thus rendering it unstable in a vector with a high number of copies of pBluescript type.
Après 12 heures d'exposition, un deuxième fragment HindIII-HindIII de 2 kb a été repéré par hybridation de type Southern sur la même membrane que précédemment, en utilisant comme sonde marquée la fin du gène de Pyrococcus sp. GE23 entre les sites Xhol et SaliAfter 12 hours of exposure, a second HindIII-HindIII fragment of 2 kb was identified by Southern type hybridization on the same membrane as above, using the end of the gene for Pyrococcus sp. GE23 between the Xhol and Sali sites
(fragment obtenu par digestion de produit de PCR). Ce fragment a été clone comme précédemment. Environ 200 clones recombinants ont été criblés et quatre d'entre eux ont donné un signal positif. Les quatres clones ont un profil identique après digestion par l'enzyme de restriction HindIII. L'un d'entre eux, 557MACc, a été séquence et la comparaison de séquence a démontré qu'il s'agissait de la fin du gène de l'ADN polymerase précédemment identifiée.(fragment obtained by digestion of PCR product). This fragment was cloned as before. About 200 recombinant clones were screened and four of them gave a positive signal. The four clones have an identical profile after digestion with the restriction enzyme HindIII. One of them, 557MACc, was sequenced and the sequence comparison demonstrated that it was the end of the DNA polymerase gene previously identified.
Supposant que les fragments de gènes obtenus appartenaient à la même ADN polymerase, des oligonucléotides ont été utilisés afin d'amplifier la zone manquante. Ce fragment de PCR a ensuite été purifié sur gel comme décrit dans Matériel et Méthodes, et
utilisé comme sonde marquée radioactivement pour repérer sur la membrane le fragment manquant. Après deux heures de révélation un fragment HindIII-HindIII a été révélé, de 2 kb environ. Clone comme précédemment, 600 colonies ont alors été criblées, donnant quatre clones positifs. Les quatre clones avaient le même profil et l'un d'entre eux, nommé 557MACb a été séquence et les comparaisons de séquences ont démontré qu'il s'agit de la partie intermédiaire de l'ADN polymerase et ce fragment, délimité par deux sites HindIII, s'insère parfaitement entre 557MACa et 557MACc. L'ensemble de ces trois clones donne la séquence complète de l'ADN polymerase de T. fumicolans .Assuming that the gene fragments obtained belong to the same DNA polymerase, oligonucleotides were used in order to amplify the missing area. This PCR fragment was then purified on gel as described in Materials and Methods, and used as a radioactively labeled probe to locate the missing fragment on the membrane. After two hours of development, a HindIII-HindIII fragment was revealed, of approximately 2 kb. Clone as before, 600 colonies were then screened, giving four positive clones. The four clones had the same profile and one of them, named 557MACb, was sequenced and the sequence comparisons demonstrated that it is the intermediate part of the DNA polymerase and this fragment, delimited by two HindIII sites, fits perfectly between 557MACa and 557MACc. Together, these three clones give the complete DNA polymerase sequence of T. fumicolans.
2 ) Position phylogénétiσue de Theirmococcus fumicolans2) Phylogenetic position of Theirmococcus fumicolans
Thermococcus fumicolans est une nouvelle espèce de Thermococcales décrite par Godfroy et al (8) .Thermococcus fumicolans is a new species of Thermococcales described by Godfroy et al (8).
3 ) Séquences nucléotidiαues et polypeptidiques de l'ADN polymerase de Thermococcus fumicolans .3) Nucleotide and polypeptide sequences of the DNA polymerase of Thermococcus fumicolans.
a) Séquences nucléotidicruesa) Nucleotide sequences
Les trois fragments délimités par des sites HindIII ont été assemblés (figures 2 et 3) . Ils forment à eux trois un fragment de 5039 paires de bases. Le premier fragment, issu du clone 557MACa, contient le codon de départ ATG en position 457 pb . La phase ouverte de lecture est alors ininterrompue sur 4572 paires de bases jusqu'à rencontrer un codon STOP sur le fragment issu du clone 557MACc en position 5028 pb, six paires de bases avant le dernier site HindIII. Par alignement, avec les autres gènes d'ADN polymérases disponibles en banquesThe three fragments delimited by HindIII sites were assembled (Figures 2 and 3). The three of them form a fragment of 5039 base pairs. The first fragment, coming from the clone 557MACa, contains the starting codon ATG at position 457 bp. The open reading phase is then uninterrupted on 4572 base pairs until it encounters a STOP codon on the fragment derived from the clone 557MACc in position 5028 bp, six base pairs before the last HindIII site. By alignment, with the other DNA polymerase genes available in libraries
(Pfu, Tli, GB-D, KOD, ..), avec la méthode CLUSTAL V, complétée par un alignement manuel final nécessaire pour restituer les sites d' autoépissage des intéines, deux séquences d'insertions ont été mises en évidence.
- La première est insérée à la paire de base n° 1675 et se termine à 2754, étant ainsi répartie entre le fragment issu du clone 557MACa et celui du clone 557MACb.(Pfu, Tli, GB-D, KOD, ..), with the CLUSTAL V method, supplemented by a final manual alignment necessary to restore the sites of self-splicing of the inteins, two sequences of insertions were highlighted. - The first is inserted into base pair no. 1675 and ends at 2754, thus being distributed between the fragment originating from clone 557MACa and that of clone 557MACb.
- La deuxième est insérée à la base n° 3157 et se termine à 4323. Elle est, de même, répartie entre le clone 557MACb et 557MACC.- The second is inserted at the base n ° 3157 and ends at 4323. It is also distributed between the clone 557MACb and 557MACC.
Ces deux séquences d'insertions forment, avec le reste de la séquence codant pour l'ADN polymerase, un seul cadre de lecture.These two insertion sequences form, with the rest of the sequence coding for DNA polymerase, a single reading frame.
b) Séquences polypeptidiques .b) Polypeptide sequences.
La section codante du gène de l'ADN polymerase est donc constituée de trois parties disjointes. La première partie du gène, portée par 557MACa, comporte la zone codant pour 1 ' exonucléase 3'- 5', où après traduction, on reconnaît le motif FDIET. La deuxième partie, portée par 557MACb et la troisième partie portée par 557MACC comprennent les sites conservés SLYPSI, et YG.DTD. Les deux séquences d'insertion sont situées sur des zones conservées de l'ADN polymerase, la première au site DFR/SLYPSII comme I-KOD-1 de Pyrococcus sp . KOD1 et la deuxième au site D/TDG comme I-Tli-1 de T . l i toral i s . Ces deux protéines sont libérées par autoépissage proteique. Elles comportent les sites de type LAGLIDADG répétés deux fois à 100 paires de bases de distance environ. Les alignements avec les autres intéines déposées dans les banques de séquences permettent de les assimiler à des endonucléases de restriction de type archaebactérien, endonucléases qui coupent l'ADN à l'endroit précis où leur gène s'insère.The coding section of the DNA polymerase gene therefore consists of three disjoint parts. The first part of the gene, carried by 557MACa, comprises the zone coding for the 3'-5 'exonuclease, where, after translation, the motif FDIET is recognized. The second part, carried by 557MACb and the third part carried by 557MACC include the preserved sites SLYPSI, and YG.DTD. The two insertion sequences are located on conserved areas of DNA polymerase, the first at the DFR / SLYPSII site such as I-KOD-1 of Pyrococcus sp. KOD1 and the second at the D / TDG site as T-I-Tli-1. l i toral i s. These two proteins are released by protein self-splicing. They include LAGLIDADG type sites repeated twice at approximately 100 base pairs apart. The alignments with the other inteins deposited in the sequence banks make it possible to assimilate them to restriction endonucleases of the archaebacterial type, endonucleases which cut DNA at the precise place where their gene is inserted.
c) Comparaison avec les autres séquences d'ADN polymérases . L'alignement des différentes séquences polypeptidiques des ADN polymérases de Thermococcales disponibles en banque, P. abyssi strain GE5 , Pyrococcus
sp. GE23, Pyrococcus sp . KOD1 , Pyrococcus sp. GB-D, P. furi os us , Pyrococcus sp 9°N, T. l i toralis avec la séquence de T. fumicolans (sans intéines), réalisé avec le programme CLUSTAL utilisant la matrice PAM250, donne les niveaux de similarité entre ces diverses polymérases figurant au tableau 1 ci-dessous.c) Comparison with other DNA polymerase sequences. The alignment of the different polypeptide sequences of the DNA polymerases of Thermococcales available in the bank, P. abyssi strain GE5, Pyrococcus sp. GE23, Pyrococcus sp. KOD1, Pyrococcus sp. GB-D, P. furi os us, Pyrococcus sp 9 ° N, T. li toralis with the sequence of T. fumicolans (without inteins), carried out with the CLUSTAL program using the PAM250 matrix, gives the levels of similarity between these various polymerases listed in Table 1 below.
Tableau 1Table 1
Le niveau de similarité le plus proche est celui observé avec Pyrococcus sp . 9°N (90,6%), ce qui indique clairement que l'ADN polymerase de T. fumicolans est originale au niveau de sa séquence et constitue de ce fait une nouvelle ADN polymerase.The closest similarity level is that observed with Pyrococcus sp. 9 ° N (90.6%), which clearly indicates that the DNA polymerase of T. fumicolans is original in terms of its sequence and therefore constitutes a new DNA polymerase.
d) Comparaison des intéines avec les autres intéines disponibles dans les banques .d) Comparison of the inteins with the other inteins available in banks.
Les séquences disponibles en banque ont été alignées selon les mêmes méthodes que précédemment. Les comparaisons de l'ensemble des intéines démontrent que celles-ci sont réparties en trois groupes correspondant aux sites d'insertion des motifs A (R/SLYPSI), B (KILAN/S) et C (D/TDG) . L'analyse des niveaux de similarité et la recherche de relations phylogénétiques n'ont de sens que pour des intéines appartenant à la même classe, c'est à dire s ' insérant dans un motif donné. Les niveaux de similarités pour I-Tfu-1 (classe A) et I-Tfu-2 (classe C) avec leurs intéines homologues déjà décrites sont donnés respectivement aux tableaux 2 et 3 ci- dessous . Tableau 2The sequences available in the bank were aligned using the same methods as above. The comparisons of all the inteins show that these are divided into three groups corresponding to the sites of insertion of the motifs A (R / SLYPSI), B (KILAN / S) and C (D / TDG). The analysis of the levels of similarity and the search for phylogenetic relationships only make sense for inteins belonging to the same class, that is to say, fitting into a given motif. The similarity levels for I-Tfu-1 (class A) and I-Tfu-2 (class C) with their homologous inteins already described are given in Tables 2 and 3 respectively below. Table 2
Tableau 3Table 3
I-Tfu-1 et I-Tfu-2 semblent représenter deux nouveaux "allèles" des "homing" endonucléases archaebactériennes des classes A et C.I-Tfu-1 and I-Tfu-2 seem to represent two new "alleles" of "homing" archaebacterial endonucleases of classes A and C.
I-Tfu-1 est le troisième allèle connu d' intéine s ' insérant au site A des ADN polymerase d'Archaea, tandis que I-Tfu-2 est le second de sa classe.I-Tfu-1 is the third known allele of intein inserting into the A site of Archaea DNA polymerase, while I-Tfu-2 is the second of its class.
5 ) Expression, caractérisation et activité de l'ADN polymerase de T. fumicolans .5) Expression, characterization and activity of T. fumicolans DNA polymerase.
a) Clonaσe et expression. Un insert de 4595 pb obtenu par PCR longue- distance avec les amorces TfuDir et TfuRev et couvrant la totalité du gène de l'ADN polymerase de Thermococcus fumicolans, avec les deux intéines, a été clone aux sites Ndel et BamHI d'un vecteur pour transformer la souche E. coli Novablue. Des mini-préparations d'ADN plasmidique ont été réalisées sur une dizaine de clones transformants et ont toutes donné un signal positif à l'hybridation. Deux clones ont été retenus sur la base de leur profil après une digestion par Ndel et Sali ou par HindIII. Ces deux plasmides ont ensuite été transformés dans la souche E. coli BL21(DE3)pLysS.a) Clonaσe and expression. A 4595 bp insert obtained by long-distance PCR with the primers TfuDir and TfuRev and covering the entire DNA polymerase gene of Thermococcus fumicolans, with the two inteins, was cloned at the NdeI and BamHI sites of a vector for transform the E. coli Novablue strain. Mini-plasmid DNA preparations were carried out on ten transforming clones and all gave a positive signal to hybridization. Two clones were selected on the basis of their profile after digestion with Ndel and Sali or with HindIII. These two plasmids were then transformed into the E. coli BL21 (DE3) pLysS strain.
Des essais d'expression ont été réalisés en culture de 5 ml afin de déterminer les conditions optimales de culture et d'induction. Tout d'abord les essais de cultures ont été réalisés à 37°C, où l'on observe une lyse cellulaire trop importante, puis à 30°C où la culture ne lyse pratiquement pas. La culture est
réalisée en milieu 2xYT, où le rendement de production est meilleur et la lyse réduite, supplémenté avec de l'ampicilline (lOOμg/ml) et du chloramphenicol (15μg/ml) . L'expression est alors induite en phase exponentielle de croissance (DO600nm = 0,6 à 0,7) avec une concentration en IPTG de ImM, concentration qui s'est avérée optimale. Des échantillons sont prélevés avant induction puis 2 heures, 4 heures et 6 heures après induction, ainsi qu'après une nuit. Des prélèvemments sont aussi réalisés sur les cultures non induites après 6, 8, 12 et 24 heures de culture .Expression tests were carried out in 5 ml culture in order to determine the optimal culture and induction conditions. First of all, the culture tests were carried out at 37 ° C., where there is too much cell lysis, then at 30 ° C. where the culture practically does not lyse. Culture is carried out in a 2xYT medium, where the production yield is better and the lysis reduced, supplemented with ampicillin (100 μg / ml) and chloramphenicol (15 μg / ml). Expression is then induced in the exponential growth phase (OD600nm = 0.6 to 0.7) with an IPTG concentration of ImM, a concentration which has been found to be optimal. Samples are taken before induction, then 2 hours, 4 hours and 6 hours after induction, as well as overnight. Samples are also taken from non-induced cultures after 6, 8, 12 and 24 hours of culture.
Les échantillons sont alors traités comme indiqué au chapitre Matériel et Méthodes, afin de tester le niveau d'activité de l'ADN polymerase recombinante parThe samples are then treated as indicated in the Materials and Methods chapter, in order to test the level of activity of the recombinant DNA polymerase by
39 incorporation de thymidine tritiée ou de P dCTP. Cette première technique, qui permet une approche qualitative, nous donne deux types de comportement. Un des clones est non inductible et la meilleure activité est obtenue après une nuit de culture (clone rTful-1) . Un second clone est inductible et présente une activité maximale après quatre heures d'induction, activité qui diminue par la suite39 incorporation of tritiated thymidine or P dCTP. This first technique, which allows a qualitative approach, gives us two types of behavior. One of the clones is non-inducible and the best activity is obtained after a night of culture (rTful-1 clone). A second clone is inducible and exhibits maximum activity after four hours of induction, an activity which subsequently decreases
(clone rTful00-2). Deux autres clones, aussi testés sont faiblement inductibles et expriment très peu l'ADN polymerase . Les clones rTful-1 et rTful00-2 sont ensuite testés en erlenmeyer de 50 ml dans les conditions décrites précédemment. Seul le clone inductible rTfulOO-2 a une expression constante en volume de 50 ml. La suite des travaux a donc été réalisée sur ce clone .(rTful00-2 clone). Two other clones, also tested, are weakly inducible and express very little DNA polymerase. The rTful-1 and rTful00-2 clones are then tested in a 50 ml Erlenmeyer flask under the conditions described above. Only the inducible rTfulOO-2 clone has a constant volume expression of 50 ml. The rest of the work was therefore carried out on this clone.
b) Fermentation et extraction des cellules .b) Fermentation and extraction of cells.
La culture du clone Tful00-2 à été réalisée dans un fermenteur de 16 litres, dans le milieu 2xYT supplémenté en ampicilline et chloramphenicol. La préculture, de 750ml a été réalisée la veille et arrêtée en phase exponentielle (DO 600nm= 0,7-0,8) et laissée àThe culture of the clone Tful00-2 was carried out in a 16 liter fermenter, in the 2xYT medium supplemented with ampicillin and chloramphenicol. The 750 ml preculture was carried out the day before and stopped in the exponential phase (OD 600nm = 0.7-0.8) and left at
4°C pour la nuit. Le fermenteur a été préparé et mis en
température à 30°C avec le milieu de culture. La préculture a été remise à 30°C une heure avant son transfert dans le fermenteur. Le volume final de culture est de 15 litres. Les conditions sont les suivantes: température = 30°C ; agitation = 300 rpm. L'induction avec 1 mM d'IPTG a été réalisée à DO 600 = 0,58. Le pH de la culture a été régulé à 7 pendant la phase d'acidification puis laissé libre lors de la phase alcaline. Les bactéries ont été prélevées après quatre heures d'induction, alors que le pH était de 8,3. La culture a ensuite été centrifugée puis les cellules ont été divisées en trois lots. L'un d'eux, 20 g de pâte, a été repris dans 80 ml de tampon de lyse pour un traitement ultérieur indiqué au chapitre Matériel et méthodes .4 ° C for the night. The fermenter has been prepared and put in temperature at 30 ° C with the culture medium. The preculture was returned to 30 ° C one hour before its transfer to the fermenter. The final culture volume is 15 liters. The conditions are as follows: temperature = 30 ° C; agitation = 300 rpm. Induction with 1 mM IPTG was carried out at OD 600 = 0.58. The pH of the culture was adjusted to 7 during the acidification phase and then left free during the alkaline phase. The bacteria were removed after four hours of induction, when the pH was 8.3. The culture was then centrifuged and the cells were divided into three batches. One of them, 20 g of paste, was taken up in 80 ml of lysis buffer for further processing indicated in the Materials and methods chapter.
c) Purification de l'ADN polymerase de T. fumicolans .c) Purification of T. fumicolans DNA polymerase.
La purification a été .effectuée comme indiqué précédemment (Matériel et Méthodes) . Pour la colonne Héparine -Sépharose, un gradient de 3 à 50% de tampon B, correspondant aux volume 365 ml à 1363 ml, permet de récupérer 73 fractions de 6 ml . Le pic d'activité de la polymerase est obtenu pour une valeur de gradient de 0,5 M environ, et correspond aux fractions 55/56 (dosées à 10 et 12 unités respectivement) comme indiqué à la figure 7. Ces fractions, incubées à 37°C pendant la nuit en présence d'ADN de pBR322, dégradent l'ADN et présentent en conséquence des traces de nucléase de l'hôte, non visibles sur gel. Leur élimination, ou tout au moins une réduction substancielle de leur concentration a été réalisée par un passage sur colonne d'affinité (Bleue Hitrap) .The purification was carried out as indicated previously (Materials and Methods). For the Heparin-Sepharose column, a gradient of 3 to 50% of buffer B, corresponding to the volume 365 ml to 1363 ml, makes it possible to recover 73 fractions of 6 ml. The peak of polymerase activity is obtained for a gradient value of approximately 0.5 M, and corresponds to fractions 55/56 (assayed at 10 and 12 units respectively) as indicated in FIG. 7. These fractions, incubated at 37 ° C overnight in the presence of pBR322 DNA, degrade the DNA and consequently show traces of host nuclease, not visible on gel. Their elimination, or at least a substantial reduction in their concentration, was achieved by passing through an affinity column (Blue Hitrap).
Les fractions 54 à 60, regroupées et dialysées, sont chargées à raison de 0,25 ml/mn. L'élution permet de récupérer 65 fractions de 5 ml avec des pics d'activité pour les fractions 36 à 56. Le dosage de
l'activité fait apparaître une concentration de 3 à 5 unités pour les fractions 36 à 40. Ces fractions, mises en présence d'ADN à 37°C, pressentent une faible activité nucléasique. Néanmoins, l'activité sur le plasmide pBR322 àFractions 54 to 60, grouped and dialyzed, are loaded at a rate of 0.25 ml / min. The elution makes it possible to recover 65 fractions of 5 ml with peaks of activity for fractions 36 to 56. The dosage of the activity shows a concentration of 3 to 5 units for fractions 36 to 40. These fractions, when put in the presence of DNA at 37 ° C, sense a weak nucleic activity. Nevertheless, the activity on the plasmid pBR322 to
37°C toute la nuit montre une nette amélioration de la pureté de l'enzyme. Une deuxième colonne Bleue HiTrap a été réutilisée en prenant les fractions 36 à 44 et dialysées comme précédemment. 25ml sur les 30ml de fraction ont été injectés sur la colonne avec un débit de 0,25ml/min (5 ml étant gardés pour essayer une MonoQ) . Après cette deuxième colonne Bleue HiTrap, l'activité sur le plasmide pBR322 fermé et incubé toute la nuit avec des fractions de la colonne, est nulle. L'incubation d'une heure à 72°C de fractions avec l'ADN de Lambda digéré par HindIII montre une très nette dégradation, mettant en évidence l'activité exonucléase 3 '-5' associée à notre ADN polymerase. Le pic d'activité se situe entre les fractions 27 et 32. L'ADN polymerase plus pure est donc sortie plus tôt sur le gradient de NaCl. Sur les fractions les plus actives, un comptage a été réalisé donnant une activité supérieure à 5U/μl pour les fractions 29, 30 et 31. La figure 4 montre le résultat sur gel SDS-PAGE. Suite à ces trois colonnes, la pureté de la polymerase est nettement améliorée. Néanmoins, il reste des traces d'ADN de E. coli fixé à la polymerase et mises en évidence par PCR. Deux colonnes supplémentaires vont donc être mis en oeuvre. 30ml de fractions actives obtenues précédemment sont chargées sur une colonne de phosphocellulose (celle-ci devrait fixer différemment l'ADN et la polymerase). L'activité de la polymerase est repérée par son activité exonucléasique 3 '-5' élevée à 72°C sur l'ADN de Lambda digéré par HindIII. L'activité la plus forte se situe entre les fractions 40 et 49 avec un pic net en 46 et 47. Sur ces fractions 40 à 49 l'ADN de pBR322 est intact après une nuit à 37°C. La figure 5 nous montre les résultats sur gel avec des activités37 ° C overnight shows a marked improvement in the purity of the enzyme. A second HiTrap Blue column was reused, taking fractions 36 to 44 and dialyzed as before. 25ml of the 30ml fraction were injected onto the column with a flow rate of 0.25ml / min (5ml being kept to try a MonoQ). After this second Blue HiTrap column, the activity on the plasmid pBR322 closed and incubated overnight with fractions of the column is zero. The one hour incubation at 72 ° C of fractions with the DNA of Lambda digested with HindIII shows a very clear degradation, highlighting the exonuclease activity 3 '-5' associated with our DNA polymerase. The peak of activity is between fractions 27 and 32. The purer DNA polymerase therefore left earlier on the NaCl gradient. On the most active fractions, a count was made giving an activity greater than 5U / μl for fractions 29, 30 and 31. FIG. 4 shows the result on SDS-PAGE gel. Following these three columns, the purity of the polymerase is significantly improved. Nevertheless, traces of DNA from E. coli attached to the polymerase and demonstrated by PCR remain. Two additional columns will therefore be implemented. 30ml of active fractions obtained previously are loaded onto a phosphocellulose column (this should fix the DNA and the polymerase differently). The activity of the polymerase is identified by its high 3 '-5' exonuclease activity at 72 ° C. on the DNA of Lambda digested with HindIII. The strongest activity is located between fractions 40 and 49 with a sharp peak at 46 and 47. On these fractions 40 to 49 the pBR322 DNA is intact after overnight at 37 ° C. Figure 5 shows us the results on gel with activities
FEUILLE RECTIFIEE (REGLE 91) ISAEP
mesurées de 6U/μl pour la fraction 46, 4,5U/μl pour la 47 et 3U/μl pour la 48. Néanmoins il reste encore des traces d'ADN de E. coli .RECTIFIED SHEET (RULE 91) ISAEP measured at 6U / μl for fraction 46, 4.5U / μl for 47 and 3U / μl for 48. However, traces of DNA from E. coli still remain.
Ayant déjà mis en évidence que cette polymerase ne se fixe pas sur une MonoQ ou une ressource Q et ce quel que soit le pH utilisé, nous avons essayé de la récupérer en exclusion en espérant une fixation de l'ADN contaminant par la colonne.Having already demonstrated that this polymerase does not bind to a MonoQ or Q resource, regardless of the pH used, we tried to recover it by exclusion, hoping for fixation of the contaminating DNA by the column.
Tout d'abord un essai a été réalisé en sortie de la deuxième Bleue HiTrap avec 5ml dialyses Les résultats ont été décevants, les fractions d'exclusion étant peu actives en incorporation.First of all, a test was carried out at the end of the second HiTrap Blue with 5 ml dialysis. The results were disappointing, the exclusion fractions being not very active in incorporation.
Une deuxième tentative a été réalisée après la phosphocellulose et après deux dialyses des fractions les plus actives 45, 46 et 49. Les fractions sont tout d'abord chauffées 40 min à 85°C en raison d'une détection de contaminant dégradant le pBR322 à 72°C en une nuit. Aucune floculation n'est alors visible et l'extrait est mis sur glace. Un nouveau test démontre que la contamination semble réduite. Après les dialyses et le passage sur colonne, l'activité exonucléasique est mise en évidence sur les fractions d'exclusion 3 à 7. L'activité est dosée à 2U/μl et l'enzyme est visualisée sur la figure 10. Suite a cette dernière étape de purification, nous avons obtenu des résultats positifs en PCR et comparables à ceux obtenus pour la Vent.A second attempt was made after phosphocellulose and after two dialyses of the most active fractions 45, 46 and 49. The fractions are first heated 40 min at 85 ° C. due to the detection of a contaminant degrading pBR322 at 72 ° C overnight. No flocculation is then visible and the extract is put on ice. A new test shows that the contamination seems reduced. After the dialysis and the passage through a column, the exonuclease activity is demonstrated on the exclusion fractions 3 to 7. The activity is assayed at 2U / μl and the enzyme is displayed in FIG. 10. Following this last purification step, we obtained positive PCR results and comparable to those obtained for Wind.
d) Caractérisation des activité des fractions purifiées . L'activité de l'ADN polymerase des différentesd) Characterization of the activities of the purified fractions. DNA polymerase activity of different
32 fractions a été dosée par incorporation au P dATP selon le protocole décrit en Matériel et Méthodes.32 fractions were assayed by incorporation into P dATP according to the protocol described in Materials and Methods.
- Amplification de gènes in vi tro . Un fragment de 459 pb a été amplifié à partir d'un ADN genomique d'Archaebactérie ( Thermococcus sp. GE- Amplification of genes in vi tro. A 459 bp fragment was amplified from genomic DNA from Archaebacteria (Thermococcus sp. GE
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
8) avec des amorces spécifiques. Différents tampons ont été utilisés:RECTIFIED SHEET (RULE 91) ISA / EP 8) with specific primers. Different buffers were used:
- tampon lOx R: Tris HC1 pH 8,8: 300mM; KCl: 500mM; MgC12: 30mM; T een 20: 0,1% - tampon lOx H: Tris HC1 : pH 8,8: 300 mM; KCl:- Ox R buffer: Tris HC1 pH 8.8: 300mM; KCl: 500mM; MgC12: 30mM; T een 20: 0.1% - 10 × H buffer: Tris HC1: pH 8.8: 300 mM; KCl:
500mM; MgC12 : 15mM; Tween 20: 0,1%500mM; MgC12: 15mM; Tween 20: 0.1%
- tampon lOx T: Tris HC1 pH 8,8: 600mM; KCl: 500mM; MgC12 : 15mM; Tween 20: 0,1%- lOx T buffer: Tris HC1 pH 8.8: 600mM; KCl: 500mM; MgC12: 15mM; Tween 20: 0.1%
- tampon lOx S: Tris Hcl pH 8,8: 200mM; KCl: 250mM; MgC12 : 20mM; Tween 20: 0,1%- lOx S buffer: Tris Hcl pH 8.8: 200mM; KCl: 250mM; MgC12: 20mM; Tween 20: 0.1%
Trente cycles ont été réalisés, chacun comprenant une étape de dénaturation à 94°C pendant 30 sec., une étape d'hybridation des amorces à 51°C pendant 1 min. puis une étape d' élongation à 72°C pendant 2 min. La figure 8 présente les résultats obtenus avec un volume reactionnel de 50 μl pour des quantités d'ADN polymerase de Thermococcus fumicolans de 2,7 unités. En l'état actuel, les meilleurs résultats avec la Tfu sont obtenus avec le tampon R . La figure 9 représente les résultats de l'amplification d'un fragment de 1,6 kb avec la Tfu purifiée sur colonnes d'héparine puis de séphacryl-bleue, et un tampon reactionnel lOx ayant la composition suivante: Tris HC1 pH 8,8: 200mM; KCl: lOOmM; (NH4)2Sθ4: lOOmM; MgS04 : 20mM; Triton X-100 : 1%.Thirty cycles were carried out, each comprising a denaturation step at 94 ° C for 30 sec., A step of hybridization of the primers at 51 ° C for 1 min. then an elongation step at 72 ° C for 2 min. FIG. 8 presents the results obtained with a reaction volume of 50 μl for quantities of DNA polymerase from Thermococcus fumicolans of 2.7 units. As it stands, the best results with Tfu are obtained with buffer R. FIG. 9 represents the results of the amplification of a 1.6 kb fragment with the purified Tfu on heparin and then sephacryl-blue columns, and a 10 × reaction buffer having the following composition: Tris HC1 pH 8.8 : 200mM; KCl: 100 mM; (NH4) 2Sθ4: 100mM; MgSO4: 20mM; Triton X-100: 1%.
- Activité exonucléasique.- Exonuclease activity.
Les tests d'activité, selon le protocole détaillé en Matériels et Méthodes, ne révèlent pas d'activité exonucléasique 5 '-3' chez la Tfu. Ceci est en conformité avec la structure de l'enzyme déduite de l'analyse de la séquence polypeptidique qui ne fait pas apparaître de domaine fonctionnel exonucléase 5 '-3', contrairement à ce qui est observé pour DNA poli de E. coli et la Taq.The activity tests, according to the protocol detailed in Materials and Methods, do not reveal any 5 '-3' exonuclease activity in Tfu. This is in accordance with the structure of the enzyme deduced from the analysis of the polypeptide sequence which does not show a functional domain of exonuclease 5 '-3', contrary to what is observed for polished DNA of E. coli and the Taq.
Les tests d'activité, selon le protocole détaillé au chapitre Matériels et Méthodes, font
apparaître une activité exonucléasique 3 '-5' (activité proof-reading ou de correction d'erreurs) chez la Tfu, à un niveau sensiblement égal à celui de la Vent comme montré dans le tableau 4 ci-dessous. Le tableau 4 rapporte la mesure de l'actvité exonucléasique 3 '-5' de la Tfu en fonction de la concentration en dNTP, et en comparaison avec la Vent.The activity tests, according to the protocol detailed in the Materials and Methods chapter, a 3 '-5' exonuclease activity (proof-reading or error correction activity) appears in the Tfu, at a level substantially equal to that of the Wind as shown in table 4 below. Table 4 reports the measurement of the 3 '-5' exonuclease activity of Tfu as a function of the dNTP concentration, and in comparison with the Wind.
Tableau 4Table 4
Ces résultats sont en conformité avec la structure de l'enzyme déduite de l'analyse de la séquence polypeptidique qui révèle la présence d'un domaine exonucléasique 3 '-5' en position N-terminale, ainsi que la présence des motifs catalytiques caractéristiques de ce domaine. La Tfu est sensible à une concentration de l'ordre de 0 , 8 mM de dNTP, tandis que la Vent manifeste une sensibilité dès 0,5 mM. Cette activité est connue pour améliorer in vi tro la fidélité des polymérases utilisées en PCR. En outre, cette activité exonucléasique est confirmée par un test plus simple. Les fractions purifiées, dépourvues d'activité nucléasique à 37°C, mises en présence d'ADN de digéré par HindIII puis exposé à 72°C pendant la nuit, dégradent complètement cet ADN, mettant ainsi en évidence la présence de l'activité 3 '-5' exonucléase de la T f u ainsi que sa thermostabilité . These results are in accordance with the structure of the enzyme deduced from the analysis of the polypeptide sequence which reveals the presence of a 3 '-5' exonuclease domain in the N-terminal position, as well as the presence of the catalytic motifs characteristic of this domain. The Tfu is sensitive to a concentration of the order of 0.8 mM of dNTP, while the Wind manifests a sensitivity from 0.5 mM. This activity is known to improve in vi tro the fidelity of the polymerases used in PCR. In addition, this exonuclease activity is confirmed by a simpler test. The purified fractions, devoid of nuclease activity at 37 ° C., placed in the presence of DNA digested with HindIII and then exposed to 72 ° C. overnight, completely degrade this DNA, thus demonstrating the presence of activity 3 '-5' exonuclease of T fu and its thermostability.
- Thermostabilité.- Thermostability.
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
32RECTIFIED SHEET (RULE 91) ISA / EP 32
La thermostabilité mesurée selon le protocole décrit précédemment (Matériel et Méthodes), ou mieux, l'activité résiduelle en incorporation à 72°C après exposition de l'enzyme à des températures élevées pendant des temps variables, est donnée dans le tableau 5 ci- dessous .The thermostability measured according to the protocol described above (Materials and Methods), or better, the residual activity in incorporation at 72 ° C. after exposure of the enzyme to high temperatures for variable times, is given in table 5 below. below.
Tableau 5Table 5
Cette thermostabilité, inférieure à celle des polymérases issues d'organismes plus hyperthermophiles tels que les Pyrococcus , n'en demeure pas moins très largement supérieure à celles des polymérases issues des Thermus et en particulier toutes les Taq. La thermostabilité de l'enzyme recombinante purifiée, tant pour le domaine polymerase que pour l' exonucléase, est de toute manière très largement supérieure à tous les besoins connus en PCR.
This thermostability, which is lower than that of polymerases from more hyperthermophilic organisms such as Pyrococcus, is nonetheless very much greater than that of polymerases from Thermus and in particular all Taq. The thermostability of the purified recombinant enzyme, both for the polymerase domain and for the exonuclease, is in any case very much higher than all known requirements in PCR.
REFERENCES BIBLIOGRAPHIQUES .BIBLIOGRAPHICAL REFERENCES.
1. Birnboim, H. C, and J. Doly. 1979. A rapid extraction procédure for screening recombinant plasmid DNA. Nucleic Acids Research. 7:1503.1. Birnboim, H. C, and J. Doly. 1979. A rapid extraction procedure for recombinant screening plasmid DNA. Nucleic Acids Research. 7: 1503.
2. Braithwaite, D. K., and J. Ito. 1993. Compilation, alignment, and phylogenetic relationships of DNA polymérases. Nucleic Acids Research. 21 ( 4) : 787-802.2. Braithwaite, D. K., and J. Ito. 1993. Compilation, alignment, and phylogenetic relationships of DNA polymerases. Nucleic Acids Research. 21 (4): 787-802.
3. Charbonnier, F. 1993. Paris Sud. 4. Chong, S. R., Y. Shao, H. Paulus, J. Benner,3. Charbonnier, F. 1993. Paris Sud. 4. Chong, S. R., Y. Shao, H. Paulus, J. Benner,
F. B. Perler, and M. Q. Xu. 1996. Protein splicing involving the Saccharomyces cerevisiae VMA intein - The steps in the splicing pathway, side reactions leading to protein cleavage, and establishment of an in vitro splicing System. Journal of Biological Chemistry.F. B. Perler, and M. Q. Xu. 1996. Protein splicing involving the Saccharomyces cerevisiae VMA intein - The steps in the splicing pathway, side reactions leading to protein cleavage, and establishment of an in vitro splicing System. Journal of Biological Chemistry.
271(36) :22159-22168.271 (36): 22159-22168.
5. Davis, E. 0., S. G. Sedgwick, and J. Colston. 1991. Novel structure of Mycobacterium tuberculosis implies processing of the gène product. Journal of Bacteriology . 173 (18) :5653-5662.5. Davis, E. 0., S. G. Sedgwick, and J. Colston. 1991. Novel structure of Mycobacterium tuberculosis implies processing of the gene product. Journal of Bacteriology. 173 (18): 5653-5662.
6. Davis, E. 0., S. G. Sedgwick, and M. J. Colston. 1991. Novel structure of the recA locus of Mycobacterium tuberculosis implies processing of the gène product. Journal of Bacteriology. 173 (18) : 5653-5662. 7. Fsihi, H., V. Vincent, and S. T. Cole . 1996.6. Davis, E. 0., S. G. Sedgwick, and M. J. Colston. 1991. Novel structure of the recA locus of Mycobacterium tuberculosis implies processing of the gene product. Journal of Bacteriology. 173 (18): 5653-5662. 7. Fsihi, H., V. Vincent, and S. T. Cole. 1996.
Homing events in the gyrA gène of some mycobacteria. Proceedings of the National Academy of Sciences of the United States of America. 93 (8) :3410-3415.Homing events in the gyrA gene of some mycobacteria. Proceedings of the National Academy of Sciences of the United States of America. 93 (8): 3410-3415.
8. Godfroy, A., J. R. Meunier, J. Guézennec, F. Lesongeur, G. Raguénès, A. Rimbault, and G. Barbier.8. Godfroy, A., J. R. Meunier, J. Guézennec, F. Lesongeur, G. Raguénès, A. Rimbault, and G. Barbier.
1996. Thermococcus fumicolans sp . nov . , a new hyperthermophilic archaeon isolated from a deep-sea hydrothermal vent in the North Fiji Basin. International Journal of Systematic Bacteriology. 46(4) .1113-1119. 9. Hodges, R. A., F. B. Perler, C. J. Noren, and W. E. Jack. 1992. Protein Splicing Removes
Intervening Séquences in an Archaea DNA Polymerase. Nucleic Acids Res . 20 (23 ): 6153-6157.1996. Thermococcus fumicolans sp. Nov . , a new hyperthermophilic archaeon isolated from a deep-sea hydrothermal vent in the North Fiji Basin. International Journal of Systematic Bacteriology. 46 (4). 1113-1119. 9. Hodges, RA, FB Perler, CJ Noren, and WE Jack. 1992. Protein Splicing Removes Intervening Sequences in an Archaea DNA Polymerase. Nucleic Acids Res. 20 (23): 6153-6157.
10. Kong, H. M., R. B: Kucera, and W. E. Jack. 1993. Characterization of a DNA Polymerase from the Hyperthermophile Archaea Thermococcus litoralis - Vent DNA Polymerase, Steady State Kinetics, Thermal Stability, Processivity, Strand Displacement, and Exonucléase Activities. J Biol Chem. 268 (3 ): 1965-1975.10. Kong, H. M., R. B: Kucera, and W. E. Jack. 1993. Characterization of a DNA Polymerase from the Hyperthermophile Archaea Thermococcus litoralis - Vent DNA Polymerase, Steady State Kinetics, Thermal Stability, Processivity, Strand Displacement, and Exonuclease Activities. J Biol Chem. 268 (3): 1965-1975.
11. Saitou, N., and M. Nei . 1987. The neighbor- oining method: a new method for reconstructing phylogenetic trees . Mol. Biol. Evol . 4 (4) : 406-425.11. Saitou, N., and M. Nei. 1987. The neighbor-oining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4 (4): 406-425.
12. Sambrook, J., E. F. Fritsch, and T. Maniatis . 1989. Molecular Cloning. A Laboratory Manual, 2nd éd. Cold Spring Harbor Laboratory Press, Cold Spring Harbor .12. Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecular Cloning. A Laboratory Manual, 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor.
13. Sanger, F., S. Nicklen, and A. R. Coulson. 1977. DNA sequencing with chain terminating inhibitors. Proc . Natl . Acad. Sci . USA. 74:5467-5473.13. Sanger, F., S. Nicklen, and A. R. Coulson. 1977. DNA sequencing with chain terminating inhibitors. Proc. Natl. Acad. Sci. USA. 74: 5467-5473.
14. Southern, E. M. 1975. Détection of spécifie séquences among DNA fragments separated by gel electrophoresis . Journal of molecular biology. 98:503.14. Southern, E. M. 1975. Detection of specifies sequences among DNA fragments separated by gel electrophoresis. Journal of molecular biology. 98: 503.
15. Southworth, M. W. , H. Kong, R. B. Kucera, J. Ware, H. W. Jannasch, and F. B. Perler. 1996. Cloning of thermostable DNA polymérases from hyperthermophilic marine Archaea with emphasis on15. Southworth, M. W., H. Kong, R. B. Kucera, J. Ware, H. W. Jannasch, and F. B. Perler. 1996. Cloning of thermostable DNA polymerases from hyperthermophilic marine Archaea with emphasis on
Thermococcus sp . 9°N-7 and mutations affecting 3 ' -5 ' exonucléase activity. Proc. Ntl . Acad. Sci. USA. 93:5281- 5285.Thermococcus sp. 9 ° N-7 and mutations affecting 3 '-5' exonuclease activity. Proc. Ntl. Acad. Sci. USA. 93: 5281-5285.
16. Studier, F. W. , A. H. Rosenberg, F. J. Dunn, and J. W. Dubendorff. 1990. Use of T7 RNA polymerase to direct expression of cloned gènes. Methods Enzymol. 185:60-89.16. Studier, F. W., A. H. Rosenberg, F. J. Dunn, and J. W. Dubendorff. 1990. Use of T7 RNA polymerase to direct expression of cloned genes. Methods Enzymol. 185: 60-89.
17. Sutherland, K. J., C. M. Henneke, P. Towner, and D. W. Hough. 1990. Citrate synthase from the thermophilic archaebacterium Thermoplasma acidophilum.17. Sutherland, K. J., C. M. Henneke, P. Towner, and D. W. Hough. 1990. Citrate synthase from the thermophilic archaebacterium Thermoplasma acidophilum.
Cloning and sequencing of the gène. European Journal of Biochemistry. 194:839-844.
18. Uemori, T., Y. Ishino, H. Toh, K. Asada, and I. Kato. 1993. Organization and nucleotide séquence of the DNA polymerase gène from the archaeon Pyrococcus furiosus . Nucleic Acids Research. 21(2) :259- 265.
Cloning and sequencing of the gene. European Journal of Biochemistry. 194: 839-844. 18. Uemori, T., Y. Ishino, H. Toh, K. Asada, and I. Kato. 1993. Organization and nucleotide sequence of the DNA polymerase gene from the archaeon Pyrococcus furiosus. Nucleic Acids Research. 21 (2): 259-265.
LISTE DE SEQUENCESLIST OF SEQUENCES
;i) INFORMATION GÉNÉRALES :; i) GENERAL INFORMATION:
(i) DEPOSANT : APPLIGENE - ONCOR(i) DEPOSITOR: APPLIGENE - ONCOR
( ii) TITRE DE L' INVENTION: ADN POLYMERASE THERMOSTABLE D'ARCHAEBACTÉRIES DE L 'ESPECE THERMOCOCCUS fumicolans (iii) NOMBRE DE SEQUENCES: 4(ii) TITLE OF THE INVENTION: THERMOSTABLE DNA POLYMERASE OF ARCHAEBACTERIA OF THE THERMOCOCCUS fumicolans SPECIES (iii) NUMBER OF SEQUENCES: 4
(2) INFORMATION POUR LA SEQ ID NO : 1 :(2) INFORMATION FOR SEQ ID NO: 1:
(i) CARACTERISTIQUES DE LA SEQUENCE:(i) CHARACTERISTICS OF THE SEQUENCE:
(A) LONGUEUR: 5039 paires de bases(A) LENGTH: 5039 base pairs
(B) TYPE: nucleotide(B) TYPE: nucleotide
(C) NOMBRE DE BRIN: double(C) NUMBER OF STRANDS: double
(D) CONFIGURATION: linéaire (ii) TYPE DE MOLECULE: ADN(D) CONFIGURATION: linear (ii) TYPE OF MOLECULE: DNA
(ix) CARACTERISTIQUES(ix) CHARACTERISTICS
(A) NOM/CLE: séquence codante de l 'ADN polymerase de THERMOCOCCUS fumicolans Tfu(A) NAME / KEY: DNA polymerase sequence of THERMOCOCCUS fumicolans Tfu
(B) EMPLACEMENT: de 457 à 5028 ( ix ) CARACTERI STIQUES(B) LOCATION: from 457 to 5028 (ix) CHARACTERISTICS
(A) NOM/CLE: séquence codante de 1 ' intéine I-Tfu-1(A) NAME / KEY: coding sequence of the I-Tfu-1 intein
(B) EMPLACEMENT: de 1675 à 2754 ( ix ) CARACTERI STIQUES(B) LOCATION: from 1675 to 2754 (ix) CHARACTERISTICS
(A) NOM/CLE: séquence codante de 1 ' intéine I-Tfu-2(A) NAME / CLE: coding sequence of the I-Tfu-2 intein
(B) EMPLACEMENT: de 3157 à 4323 ( ix ) CARACTERI STIQUES(B) LOCATION: from 3157 to 4323 (ix) CHARACTERISTICS
(A) NOM/CLE: codon stop(A) NAME / KEY: stop codon
(B) EMPLACEMENT: de 5026 à 5028(B) LOCATION: 5026 to 5028
(xi) DESCRIPTION DE LA SEQUENCES: SEQ ID NO:l :(xi) DESCRIPTION OF THE SEQUENCES: SEQ ID NO: l:
AGCTTAAAGC GTCCGCCACT ACTTCCTGAA AGCTCACGCG GTAAAACAGC TCCATGCTCG 60AGCTTAAAGC GTCCGCCACT ACTTCCTGAA AGCTCACGCG GTAAAACAGC TCCATGCTCG 60
GCTCTTCGAT GGGAGGTTTA AAAAGGTGGT GGTGAGGTTT ATTAGGAAGA AGGCTCAACT 120GCTCTTCGAT GGGAGGTTTA AAAAGGTGGT GGTGAGGTTT ATTAGGAAGA AGGCTCAACT 120
AGAGACGGTG GGAGTATGGA AGAGGTCGAC AGGCTCGTGT TCAACTTTCC CCTCTTCAAA 180AGAGACGGTG GGAGTATGGA AGAGGTCGAC AGGCTCGTGT TCAACTTTCC CCTCTTCAAA 180
GATTACTGGG AAAAGGAGCG GTTCCTCAAG GTCGTTGGGC TTCTGGTGAG CCACCAGATA 240GATTACTGGG AAAAGGAGCG GTTCCTCAAG GTCGTTGGGC TTCTGGTGAG CCACCAGATA 240
ACGTTTGAGA AAGCTGCCGA GCTTCTGGAC ATGAGGCTCG AAGAGCTGGC GTTCCTCCTT 300ACGTTTGAGA AAGCTGCCGA GCTTCTGGAC ATGAGGCTCG AAGAGCTGGC GTTCCTCCTT 300
GACAAGCTCG GCGTTGAGTA CTCGCTTCTT GATGATGAAG AGGCCAGACT TGAGAGAGAA 360GACAAGCTCG GCGTTGAGTA CTCGCTTCTT GATGATGAAG AGGCCAGACT TGAGAGAGAA 360
GAGGCCAATA AGCTCATGGG GGAAATGAAG GGTGGAGCGT TTGTCTGATT CTTCTGAGCT 420GAGGCCAATA AGCTCATGGG GGAAATGAAG GGTGGAGCGT TTGTCTGATT CTTCTGAGCT 420
GTTATTGGTG TTTCACAGGC TGGGAGGTGG TGGATT ATG ATC CTC GAT ACA GAC 474GTTATTGGTG TTTCACAGGC TGGGAGGTGG TGGATT ATG ATC CTC GAT ACA GAC 474
Met Ile Leu Asp Thr Asp 1 5Met Ile Leu Asp Thr Asp 1 5
TAC ATC ACC GAA GAC GGA AGG CCC GTC ATC AGG GTG TTC AAG AAG GAG 522TAC ATC ACC GAA GAC GGA AGG CCC GTC ATC AGG GTG TTC AAG AAG GAG 522
Tyr Ile Thr Glu Asp Gly Arg Pro Val Ile Arg Val Phe Lys Lys Glu 10 15 20
AAC GGC GAG TTC AAA ATC GAG TAC GAC AGG GAC TTC GAG CCT TAC ATC 570 Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg Asp Phe Glu Pro Tyr Ile 25 30 35Tyr Ile Thr Glu Asp Gly Arg Pro Val Ile Arg Val Phe Lys Lys Glu 10 15 20 AAC GGC GAG TTC AAA ATC GAG TAC GAC AGG GAC TTC GAG CCT TAC ATC 570 Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg Asp Phe Glu Pro Tyr Ile 25 30 35
TAC GCT CTC CTG AAG GAC GAT TCC GCG ATC GAG GAC GTC AAG AAG ATA 618 Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile Glu Asp Val Lys Lys Ile 40 45 50TAC GCT CTC CTG AAG GAC GAT TCC GCG ATC GAG GAC GTC AAG AAG ATA 618 Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile Glu Asp Val Lys Lys Ile 40 45 50
ACT GCA AGC CGG CAC GGT ACC ACC GTC AGG GTC GTC AGG GCC GGG AAG 666 Thr Ala Ser Arg His Gly Thr Thr Val Arg Val Val Arg Ala Gly Lys 55 60 65 70ACT GCA AGC CGG CAC GGT ACC ACC GTC AGG GTC GTC AGG GCC GGG AAG 666 Thr Ala Ser Arg His Gly Thr Thr Val Arg Val Val Arg Ala Gly Lys 55 60 65 70
GTG AAG AAG AAG TTC CTC GGC AGG CCG ATA GAG GTC TGG AAG CTC TAC 714 Val Lys Lys Lys Phe Leu Gly Arg Pro Ile Glu Val Trp Lys Leu Tyr 75 80 85GTG AAG AAG AAG TTC CTC GGC AGG CCG ATA GAG GTC TGG AAG CTC TAC 714 Val Lys Lys Lys Phe Leu Gly Arg Pro Ile Glu Val Trp Lys Leu Tyr 75 80 85
TTC ACC CAT CCC CAG GAC GTT CCG GCA ATC AGG GAC AAA ATC AGG GAG 762 Phe Thr His Pro Gin Asp Val Pro Ala Ile Arg Asp Lys Ile Arg Glu 90 95 100TTC ACC CAT CCC CAG GAC GTT CCG GCA ATC AGG GAC AAA ATC AGG GAG 762 Phe Thr His Pro Gin Asp Val Pro Ala Ile Arg Asp Lys Ile Arg Glu 90 95 100
CAC CCT GCC GTG GTC GAC ATA TAT GAG TAC GAC ATA CCC TTT GCC AAG 810 His Pro Ala Val Val Asp Ile Tyr Glu Tyr Asp Ile Pro Phe Ala Lys 105 110 115CAC CCT GCC GTG GTC GAC ATA TAT GAG TAC GAC ATA CCC TTT GCC AAG 810 His Pro Ala Val Val Asp Ile Tyr Glu Tyr Asp Ile Pro Phe Ala Lys 105 110 115
CGC TAC CTC ATC GAT AAG GGC CTC ATC CCG ATG GAG GGC GAC GAG GAG 858 Arg Tyr Leu Ile Asp Lys Gly Leu Ile Pro Met Glu Gly Asp Glu Glu 120 125 130CGC TAC CTC ATC GAT AAG GGC CTC ATC CCG ATG GAG GGC GAC GAG GAG 858 Arg Tyr Leu Ile Asp Lys Gly Leu Ile Pro Met Glu Gly Asp Glu Glu 120 125 130
CTC AAG ATG CTC GCC TTC GAC ATC GAG ACG CTC TAC CAC GAG GGC GAG 906 Leu Lys Met Leu Ala Phe Asp Ile Glu Thr Leu Tyr His Glu Gly Glu 135 140 145 150CTC AAG ATG CTC GCC TTC GAC ATC GAG ACG CTC TAC CAC GAG GGC GAG 906 Leu Lys Met Leu Ala Phe Asp Ile Glu Thr Leu Tyr His Glu Gly Glu 135 140 145 150
GAG TTC GCC GAG GGG CCT ATT CTT ATG ATA AGC TAT GCC GAC GAG GAA 954 Glu Phe Ala Glu Gly Pro Ile Leu Met Ile Ser Tyr Ala Asp Glu Glu 155 160 165GAG TTC GCC GAG GGG CCT ATT CTT ATG ATA AGC TAT GCC GAC GAG GAA 954 Glu Phe Ala Glu Gly Pro Ile Leu Met Ile Ser Tyr Ala Asp Glu Glu 155 160 165
GGG GCG AGG GTA ATA ACC TGG AAG AAG ATC GAC CTT CCC TAC GTT GAC 1002 Gly Ala Arg Val Ile Thr Trp Lys Lys Ile Asp Leu Pro Tyr Val Asp 170 175 180GGG GCG AGG GTA ATA ACC TGG AAG AAG ATC GAC CTT CCC TAC GTT GAC 1002 Gly Ala Arg Val Ile Thr Trp Lys Lys Ile Asp Leu Pro Tyr Val Asp 170 175 180
GTC GTT TCA ACG GAG AAG GAG ATG ATA AAG CGC TTC CTG AAG GTT GTC 1050 Val Val Ser Thr Glu Lys Glu Met Ile Lys Arg Phe Leu Lys Val Val 185 190 195GTC GTT TCA ACG GAG AAG GAG ATG ATA AAG CGC TTC CTG AAG GTT GTC 1050 Val Val Ser Thr Glu Lys Glu Met Ile Lys Arg Phe Leu Lys Val Val 185 190 195
AAG GAG AAG GAC CCC GAT GTC CTC ATA ACC TAC AAC GGC GAC AAC TTC 1098 Lys Glu Lys Asp Pro Asp Val Leu Ile Thr Tyr Asn Gly Asp Asn Phe 200 205 210AAG GAG AAG GAC CCC GAT GTC CTC ATA ACC TAC AAC GGC GAC AAC TTC 1098 Lys Glu Lys Asp Pro Asp Val Leu Ile Thr Tyr Asn Gly Asp Asn Phe 200 205 210
GAC TTC GCT TAC CTC AAG AAG CGC TCC GAG AAG CTC GGC GTT AAG TTC 1146 Asp Phe Ala Tyr Leu Lys Lys Arg Ser Glu Lys Leu Gly Val Lys Phe 215 220 225 230GAC TTC GCT TAC CTC AAG AAG CGC TCC GAG AAG CTC GGC GTT AAG TTC 1146 Asp Phe Ala Tyr Leu Lys Lys Arg Ser Glu Lys Leu Gly Val Lys Phe 215 220 225 230
ATC CTC GGA AGG GAC GGC AGC GAG CCG AAG ATA CAG AGG ATG GGC GAC 1194 Ile Leu Gly Arg Asp Gly Ser Glu Pro Lys Ile Gin Arg Met Gly Asp 235 240 245ATC CTC GGA AGG GAC GGC AGC GAG CCG AAG ATA CAG AGG ATG GGC GAC 1194 Ile Leu Gly Arg Asp Gly Ser Glu Pro Lys Ile Gin Arg Met Gly Asp 235 240 245
CGC TTC GCC GTC GAG GTG AAG GGA AGA ATA CAC TTC GAC CTC TAC CCC 1242 Arg Phe Ala Val Glu Val Lys Gly Arg Ile His Phe Asp Leu Tyr Pro 250 255 260
GTC ATA AGA CAC ACC ATC AAC CTG CCC ACC TAC ACG CTG GAG GCC GTC 1290 Val Ile Arg His Thr Ile Asn Leu Pro Thr Tyr Thr Leu Glu Ala Val 265 270 275CGC TTC GCC GTC GAG GTG AAG GGA AGA ATA CAC TTC GAC CTC TAC CCC 1242 Arg Phe Ala Val Glu Val Lys Gly Arg Ile His Phe Asp Leu Tyr Pro 250 255 260 GTC ATA AGA CAC ACC ATC AAC CTG CCC ACC TAC ACG CTG GAG GCC GTC 1290 Val Ile Arg His Thr Ile Asn Leu Pro Thr Tyr Thr Leu Glu Ala Val 265 270 275
TAC GAG GCG ATT TTT GGG CAG CCA AAG GAG AAG GTC TAC GCT GAG GAG 1338 Tyr Glu Ala Ile Phe Gly Gin Pro Lys Glu Lys Val Tyr Ala Glu Glu 280 285 290TAC GAG GCG ATT TTT GGG CAG CCA AAG GAG AAG GTC TAC GCT GAG GAG 1338 Tyr Glu Ala Ile Phe Gly Gin Pro Lys Glu Lys Val Tyr Ala Glu Glu 280 285 290
ATA GCG CAG GCC TGG GAA ACG GGC GAG GGG CTT GAG CGC GTC GCG CGC 1386 Ile Ala Gin Ala Trp Glu Thr Gly Glu Gly Leu Glu Arg Val Ala Arg 295 300 305 310ATA GCG CAG GCC TGG GAA ACG GGC GAG GGG CTT GAG CGC GTC GCG CGC 1386 Ile Ala Gin Ala Trp Glu Thr Gly Glu Gly Leu Glu Arg Val Ala Arg 295 300 305 310
TAC TCG ATG GAG GAC GCC AAG GTA ACC TAC GAG CTG GGA AGG GAG TTC 1434 Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr Glu Leu Gly Arg Glu Phe 315 320 325TAC TCG ATG GAG GAC GCC AAG GTA ACC TAC GAG CTG GGA AGG GAG TTC 1434 Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr Glu Leu Gly Arg Glu Phe 315 320 325
TTC CCG ATG GAG GCC CAA CTT TCT CGG CTG GTC GGT CAG AGC TTC TGG 1482 Phe Pro Met Glu Ala Gin Leu Ser Arg Leu Val Gly Gin Ser Phe Trp 330 335 340TTC CCG ATG GAG GCC CAA CTT TCT CGG CTG GTC GGT CAG AGC TTC TGG 1482 Phe Pro Met Glu Ala Gin Leu Ser Arg Leu Val Gly Gin Ser Phe Trp 330 335 340
GAC GTC TCG CGC TCC AGC ACC GGC AAC CTC GTC GAG TGG TAC CTC CTC 1530 Asp Val Ser Arg Ser Ser Thr Gly Asn Leu Val Glu Trp Tyr Leu Leu 345 350 355GAC GTC TCG CGC TCC AGC ACC GGC AAC CTC GTC GAG TGG TAC CTC CTC 1530 Asp Val Ser Arg Ser Ser Thr Gly Asn Leu Val Glu Trp Tyr Leu Leu 345 350 355
AGG AAG GCC TAC GAG AGG AAC GAG CTG GCA CCG AAC AAG CCC TCC GGC 1578 Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala Pro Asn Lys Pro Ser Gly 360 365 370AGG AAG GCC TAC GAG AGG AAC GAG CTG GCA CCG AAC AAG CCC TCC GGC 1578 Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala Pro Asn Lys Pro Ser Gly 360 365 370
AGA GAA CTT GAG AGG CGC CGC GGG GGC TAC GCC GGC GGC TAC GTC AAG 1626 Arg Glu Leu Glu Arg Arg Arg Gly Gly Tyr Ala Gly Gly Tyr Val Lys 375 400 405 410AGA GAA CTT GAG AGG CGC CGC GGG GGC TAC GCC GGC GGC TAC GTC AAG 1626 Arg Glu Leu Glu Arg Arg Arg Gly Gly Tyr Ala Gly Gly Tyr Val Lys 375 400 405 410
GAG CCG GAG AGG GGA CTT TGG GAG AAC ATA GCT TAT TTA GAT TTT AGG 1674 Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile Ala Tyr Leu Asp Phe Arg 415 420 425GAG CCG GAG AGG GGA CTT TGG GAG AAC ATA GCT TAT TTA GAT TTT AGG 1674 Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile Ala Tyr Leu Asp Phe Arg 415 420 425
TGT CAT CCT GCC GAC ACT AAA GTC ATT GTC AAA GGG AAG GGC GTT GTA 1722 Cys His Pro Ala Asp Thr Lys Val Ile Val Lys Gly Lys Gly Val Val 430 435 440TGT CAT CCT GCC GAC ACT AAA GTC ATT GTC AAA GGG AAG GGC GTT GTA 1722 Cys His Pro Ala Asp Thr Lys Val Ile Val Lys Gly Lys Gly Val Val 430 435 440
AAC ATC AGC GAA GTT AGG GAG GGG GAC TAC GTT CTC GGC ATA GAC GGC 1770 Asn Ile Ser Glu Val Arg Glu Gly Asp Tyr Val Leu Gly Ile Asp Gly 445 450 455AAC ATC AGC GAA GTT AGG GAG GGG GAC TAC GTT CTC GGC ATA GAC GGC 1770 Asn Ile Ser Glu Val Arg Glu Gly Asp Tyr Val Leu Gly Ile Asp Gly 445 450 455
TGG CAG AAG GTT CAA AGG GTC TGG GAG TAT GAT TAC GAG GGA GAA CTC 1818 Trp Gin Lys Val Gin Arg Val Trp Glu Tyr Asp Tyr Glu Gly Glu Leu 460 465 470TGG CAG AAG GTT CAA AGG GTC TGG GAG TAT GAT TAC GAG GGA GAA CTC 1818 Trp Gin Lys Val Gin Arg Val Trp Glu Tyr Asp Tyr Glu Gly Glu Leu 460 465 470
GTA AAT ATA AAC GGC CTT AAG TGC ACA CCG AAC CAT AAG CTT CCG GTC 1866 Val Asn Ile Asn Gly Leu Lys Cys Thr Pro Asn His Lys Leu Pro Val 475 480 485 490GTA AAT ATA AAC GGC CTT AAG TGC ACA CCG AAC CAT AAG CTT CCG GTC 1866 Val Asn Ile Asn Gly Leu Lys Cys Thr Pro Asn His Lys Leu Pro Val 475 480 485 490
GTT AGG AGG ACT GAG AGG CAG ACT GCG ATA AGG GAC AGC CTT GCA AAG 1914 Val Arg Arg Thr Glu Arg Gin Thr Ala Ile Arg Asp Ser Leu Ala Lys 495 500 505GTT AGG AGG ACT GAG AGG CAG ACT GCG ATA AGG GAC AGC CTT GCA AAG 1914 Val Arg Arg Thr Glu Arg Gin Thr Ala Ile Arg Asp Ser Leu Ala Lys 495 500 505
TCT TTT CTC ACG AAA AAA GTT AAA GGT AAG CTG ATA ACC ACG CCT CTC 1962 Ser Phe Leu Thr Lys Lys Val Lys Gly Lys Leu Ile Thr Thr Pro Leu 510 515 520
TTT GAA AAA ATC GGG AAG ATC GAG CGA GAG GAC GTG CCA GAA GAG GAG 2010 Phe Glu Lys Ile Gly Lys Ile Glu Arg Glu Asp Val Pro Glu Glu Glu 525 530 535TCT TTT CTC ACG AAA AAA GTT AAA GGT AAG CTG ATA ACC ACG CCT CTC 1962 Ser Phe Leu Thr Lys Lys Val Lys Gly Lys Leu Ile Thr Thr Pro Leu 510 515 520 TTT GAA AAA ATC GGG AAG ATC GAG CGA GAG GAC GTG CCA GAA GAG GAG 2010 Phe Glu Lys Ile Gly Lys Ile Glu Arg Glu Asp Val Pro Glu Glu Glu 525 530 535
ATA CTC AAA GGA GAA CTC GCC GGA ATA ATC CTG GCT GAG GGC ACA CTC 2058 Ile Leu Lys Gly Glu Leu Ala Gly Ile Ile Leu Ala Glu Gly Thr Leu 540 545 550ATA CTC AAA GGA GAA CTC GCC GGA ATA ATC CTG GCT GAG GGC ACA CTC 2058 Ile Leu Lys Gly Glu Leu Ala Gly Ile Leu Ala Glu Gly Thr Leu 540 545 550
CTG AGA AAG GAT GTC GAG TAC TTT GAC TCT TCC AGA GGG AAG AAG AGA 2106 Leu Arg Lys Asp Val Glu Tyr Phe Asp Ser Ser Arg Gly Lys Lys Arg 555 560 565 570CTG AGA AAG GAT GTC GAG TAC TTT GAC TCT TCC AGA GGG AAG AAG AGA 2106 Leu Arg Lys Asp Val Glu Tyr Phe Asp Ser Ser Arg Gly Lys Lys Arg 555 560 565 570
GTA TCA CAC CAG TAC AGG GTT GAA ATA ACC GTT GGG GCG CAG GAG GAG 2154 Val Ser His Gin Tyr Arg Val Glu Ile Thr Val Gly Ala Gin Glu Glu 575 580 585GTA TCA CAC CAG TAC AGG GTT GAA ATA ACC GTT GGG GCG CAG GAG GAG 2154 Val Ser His Gin Tyr Arg Val Glu Ile Thr Val Gly Ala Gin Glu Glu 575 580 585
GAC TTC CAG AGG AGG ATC GTT TAC ATT TTC GAA CGC CTC TTT GGG GTA 2202 Asp Phe X Arg Arg Ile Val Tyr Ile Phe Glu Arg Leu Phe Gly Val 590 595 600GAC TTC CAG AGG AGG ATC GTT TAC ATT TTC GAA CGC CTC TTT GGG GTA 2202 Asp Phe X Arg Arg Ile Val Tyr Ile Phe Glu Arg Leu Phe Gly Val 590 595 600
ACT CCC AGT GTT TAC CGG AAA AAG AAC ACA AAC GCA ATA ACG TTC AAA 2250 Thr Pro Ser Val Tyr Arg Lys Lys Asn Thr Asn Ala Ile Thr Phe Lys 605 610 615ACT CCC AGT GTT TAC CGG AAA AAG AAC ACA AAC GCA ATA ACG TTC AAA 2250 Thr Pro Ser Val Tyr Arg Lys Lys Asn Thr Asn Ala Ile Thr Phe Lys 605 610 615
GTT GCC AAA AAA GAG GTT TAT CTT AGG GTT AGG GAA ATT ATG GAT GGC 2298 Val Ala Lys Lys Glu Val Tyr Leu Arg Val Arg Glu Ile Met Asp Gly 620 625 630GTT GCC AAA AAA GAG GTT TAT CTT AGG GTT AGG GAA ATT ATG GAT GGC 2298 Val Ala Lys Lys Glu Val Tyr Leu Arg Val Arg Glu Ile Met Asp Gly 620 625 630
ATT GAG AAC CTC CAC GCT CCT TCT GTG TTA AGG GGC TTT TTT GAA GGA 2346 Ile Glu Asn Leu His Ala Pro Ser Val Leu Arg Gly Phe Phe Glu Gly 635 640 645 650ATT GAG AAC CTC CAC GCT CCT TCT GTG TTA AGG GGC TTT TTT GAA GGA 2346 Ile Glu Asn Leu His Ala Pro Ser Val Leu Arg Gly Phe Phe Glu Gly 635 640 645 650
GAC GGA AGC GTC AAC AAG GTC CGG AAG ACA GTG GTA GTG AAT CAG GGC 2394 Asp Gly Ser Val Asn Lys Val Arg Lys Thr Val Val Val Asn Gin Gly 655 660 665GAC GGA AGC GTC AAC AAG GTC CGG AAG ACA GTG GTA GTG AAT CAG GGC 2394 Asp Gly Ser Val Asn Lys Val Arg Lys Thr Val Val Val Asn Gin Gly 655 660 665
ACC AAT AAT GAA TGG AAA ATT GAA GTG GTG TCA AAA CTC CTC AAC AAG 2442 Thr Asn Asn Glu Trp Lys Ile Glu Val Val Ser Lys Leu Leu Asn Lys 670 675 680ACC AAT AAT GAA TGG AAA ATT GAA GTG GTG TCA AAA CTC CTC AAC AAG 2442 Thr Asn Asn Glu Trp Lys Ile Glu Val Val Ser Lys Leu Leu Asn Lys 670 675 680
TTG GGG ATT CCG CAT AGA AGG TAC ACA TAC GAT TAC ACC GAA AGA GAA 2490 Leu Gly Ile Pro His Arg Arg Tyr Thr Tyr Asp Tyr Thr Glu Arg Glu 685 690 695TTG GGG ATT CCG CAT AGA AGG TAC ACA TAC GAT TAC ACC GAA AGA GAA 2490 Leu Gly Ile Pro His Arg Arg Tyr Thr Tyr Asp Tyr Thr Glu Arg Glu 685 690 695
AAA ACC ATG ACA ACG CAT ATA CTT GAG ATA GCC GGC AGG GAT GGG TTA 2538 Lys Thr Met Thr Thr His Ile Leu Glu Ile Ala Gly Arg Asp Gly Leu 700 705 710AAA ACC ATG ACA ACG CAT ATA CTT GAG ATA GCC GGC AGG GAT GGG TTA 2538 Lys Thr Met Thr Thr His Ile Leu Glu Ile Ala Gly Arg Asp Gly Leu 700 705 710
ATC CTT TTC CAG ACC ATT GTG GGA TTC ATA AGC ACT GAG AAG AAC ATG 2586 Ile Leu Phe Gin Thr Ile Val Gly Phe Ile Ser Thr Glu Lys Asn Met 715 720 725 730ATC CTT TTC CAG ACC ATT GTG GGA TTC ATA AGC ACT GAG AAG AAC ATG 2586 Ile Leu Phe Gin Thr Ile Val Gly Phe Ile Ser Thr Glu Lys Asn Met 715 720 725 730
GCG CTG GAG GAG GCA ATC AGG AAC AGG GAA GTG AAC CGC CTA GAA AAC 2634 Ala Leu Glu Glu Ala Ile Arg Asn Arg Glu Val Asn Arg Leu Glu Asn 735 740 745GCG CTG GAG GAG GCA ATC AGG AAC AGG GAA GTG AAC CGC CTA GAA AAC 2634 Ala Leu Glu Glu Ala Ile Arg Asn Arg Glu Val Asn Arg Leu Glu Asn 735 740 745
AAT GCC TTC TAT ACC CTA GCC GAC TTT ACG GCG AAG ACA GAG TAC TAC 2682 Asn Ala Phe Tyr Thr Leu Ala Asp Phe Thr Ala Lys Thr Glu Tyr Tyr 750 755 780
AAG GGC AAA GTT TAC GAC TTA ACC CTT GAG GGA ACG CCC TAT TAC TTC 2730 Lys Gly Lys Val Tyr Asp Leu Thr Leu Glu Gly Thr Pro Tyr Tyr Phe 785 790 795AAT GCC TTC TAT ACC CTA GCC GAC TTT ACG GCG AAG ACA GAG TAC TAC 2682 Asn Ala Phe Tyr Thr Leu Ala Asp Phe Thr Ala Lys Thr Glu Tyr Tyr 750 755 780 AAG GGC AAA GTT TAC GAC TTA ACC CTT GAG GGA ACG CCC TAT TAC TTC 2730 Lys Gly Lys Val Tyr Asp Leu Thr Leu Glu Gly Thr Pro Tyr Tyr Phe 785 790 795
GCC AAT GGC ATA CTG ACC CAC AAT TCG CTA TAT CCT TCG ATT ATA ATT 2778 Ala Asn Gly Ile Leu Thr His Asn Ser Leu Tyr Pro Ser Ile Ile Ile 800 805 810GCC AAT GGC ATA CTG ACC CAC AAT TCG CTA TAT CCT TCG ATT ATA ATT 2778 Ala Asn Gly Ile Leu Thr His Asn Ser Leu Tyr Pro Ser Ile Ile 800 805 810
TCC CAC AAC GTC TCC CCC GAT ACG CTC AAC CGC GAG GGC TGC GGG GAG 2826 Ser His Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Gly Glu 815 820 825 830TCC CAC AAC GTC TCC CCC GAT ACG CTC AAC CGC GAG GGC TGC GGG GAG 2826 Ser His Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Gly Glu 815 820 825 830
TAC GAC GAG GCT CCG CAG GTA GGG CAT CGC TTT TGT AAG GAC TTC CCC 2874 Tyr Asp Glu Ala Pro Gin Val Gly His Arg Phe Cys Lys Asp Phe Pro 835 840 845TAC GAC GAG GCT CCG CAG GTA GGG CAT CGC TTT TGT AAG GAC TTC CCC 2874 Tyr Asp Glu Ala Pro Gin Val Gly His Arg Phe Cys Lys Asp Phe Pro 835 840 845
GGC TTC ATC CCC AGC CTC CTC GGT GAC CTG CTC GAC GAG AGG CAG AAG 2922 Gly Phe Ile Pro Ser Leu Leu Gly Asp Leu Leu Asp Glu Arg Gin Lys 855 860 865GGC TTC ATC CCC AGC CTC CTC GGT GAC CTG CTC GAC GAG AGG CAG AAG 2922 Gly Phe Ile Pro Ser Leu Leu Gly Asp Leu Leu Asp Glu Arg Gin Lys 855 860 865
GTA AAG AAG CAC ATG AAG GCC ACG GTG GAC CCG ATA GAG AAG AAG CTC 2970 Val Lys Lys His Met Lys Ala Thr Val Asp Pro Ile Glu Lys Lys Leu 870 875 880GTA AAG AAG CAC ATG AAG GCC ACG GTG GAC CCG ATA GAG AAG AAG CTC 2970 Val Lys Lys His Met Lys Ala Thr Val Asp Pro Ile Glu Lys Lys Leu 870 875 880
CTC GAT TAC AGG CAG CGC GCA ATT AAA ATC CTC GCC AAC AGC TTC TAC 3018 Leu Asp Tyr Arg Gin Arg Ala Ile Lys Ile Leu Ala Asn Ser Phe Tyr 885 890 895CTC GAT TAC AGG CAG CGC GCA ATT AAA ATC CTC GCC AAC AGC TTC TAC 3018 Leu Asp Tyr Arg Gin Arg Ala Ile Lys Ile Leu Ala Asn Ser Phe Tyr 885 890 895
GGC TAC TAT GGC TAC GCA AAG GCC CGC TGG TAC TGC AAG GAG TGC GCC 3066 Gly Tyr Tyr Gly Tyr Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala 900 905 910 915GGC TAC TAT GGC TAC GCA AAG GCC CGC TGG TAC TGC AAG GAG TGC GCC 3066 Gly Tyr Tyr Gly Tyr Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala 900 905 910 915
GAG AGC GTT ACC GCC TGG GGC AGG CAG TAC ATT GAG ACC ACC ATG AGG 3114 Glu Ser Val Thr Ala Trp Gly Arg Gin Tyr Ile Glu Thr Thr Met Arg 920 925 930GAG AGC GTT ACC GCC TGG GGC AGG CAG TAC ATT GAG ACC ACC ATG AGG 3114 Glu Ser Val Thr Ala Trp Gly Arg Gin Tyr Ile Glu Thr Thr Met Arg 920 925 930
GAA ATA GAG GAA AAA TTT GGC TTT AAA GTG CTG TAC GCG GAT AGT GTT 3162 Glu Ile Glu Glu Lys Phe Gly Phe Lys Val Leu Tyr Ala Asp Ser Val 935 940 945GAA ATA GAG GAA AAA TTT GGC TTT AAA GTG CTG TAC GCG GAT AGT GTT 3162 Glu Ile Glu Glu Lys Phe Gly Phe Lys Val Leu Tyr Ala Asp Ser Val 935 940 945
ACA GGG GAC ACA GAG GTA ACC ATC AGA AGA AAC GGC AGG ATT GAG TTC 3210 Thr Gly Asp Thr Glu Val Thr Ile Arg Arg Asn Gly Arg Ile Glu Phe 950 955 960ACA GGG GAC ACA GAG GTA ACC ATC AGA AGA AAC GGC AGG ATT GAG TTC 3210 Thr Gly Asp Thr Glu Val Thr Ile Arg Arg Asn Gly Arg Ile Glu Phe 950 955 960
GTT CCA ATC GAG AAA CTC TTT GAG CGC GTT GAT CAC CGT GTT GGT GAG 3258 Val Pro Ile Glu Lys Leu Phe Glu Arg Val Asp His Arg Val Gly Glu 965 970 975GTT CCA ATC GAG AAA CTC TTT GAG CGC GTT GAT CAC CGT GTT GGT GAG 3258 Val Pro Ile Glu Lys Leu Phe Glu Arg Val Asp His Arg Val Gly Glu 965 970 975
AAG GAG TAC TGC GTT CTT GGA GGG GTT GAG GCA CTG ACA CTC GAC AAC 3306 Lys Glu Tyr Cys Val Leu Gly Gly Val Glu Ala Leu Thr Leu Asp Asn 980 985 990 995AAG GAG TAC TGC GTT CTT GGA GGG GTT GAG GCA CTG ACA CTC GAC AAC 3306 Lys Glu Tyr Cys Val Leu Gly Gly Val Glu Ala Leu Thr Leu Asp Asn 980 985 990 995
AGG GGC AGG CTC GTG TGG AAG AAG GTT CCG TAC GTC ATG AGA CAT AAA 3354 Arg Gly Arg Leu Val Trp Lys Lys Val Pro Tyr Val Met Arg His Lys 1000 1005 1010AGG GGC AGG CTC GTG TGG AAG AAG GTT CCG TAC GTC ATG AGA CAT AAA 3354 Arg Gly Arg Leu Val Trp Lys Lys Val Pro Tyr Val Met Arg His Lys 1000 1005 1010
ACG GAC AAA AGA ATC TAT AGG GTA TGG TTC ACC AAC TCT TGG TAC CTT 3402 Thr Asp Lys Arg Ile Tyr Arg Val Trp Phe Thr Asn Ser Trp Tyr Leu 1015 1020 1025
GAC GTG ACG GAG GAT CAC TCG CTA ATA GGC TAC CTG AAC ACA AGC AAA 3450 Asp Val Thr Glu Asp His Ser Leu Ile Gly Tyr Leu Asn Thr Ser Lys 1030 1035 1040ACG GAC AAA AGA ATC TAT AGG GTA TGG TTC ACC AAC TCT TGG TAC CTT 3402 Thr Asp Lys Arg Ile Tyr Arg Val Trp Phe Thr Asn Ser Trp Tyr Leu 1015 1020 1025 GAC GTG ACG GAG GAT CAC TCG CTA ATA GGC TAC CTG AAC ACA AGC AAA 3450 Asp Val Thr Glu Asp His Ser Leu Ile Gly Tyr Leu Asn Thr Ser Lys 1030 1035 1040
GTC AAA CCC GGA AAG CCC TTG AAA GAG CGT CTC GTC GAG GTC AAG CCA 3498 Val Lys Pro Gly Lys Pro Leu Lys Glu Arg Leu Val Glu Val Lys Pro 1045 1050 1055GTC AAA CCC GGA AAG CCC TTG AAA GAG CGT CTC GTC GAG GTC AAG CCA 3498 Val Lys Pro Gly Lys Pro Leu Lys Glu Arg Leu Val Glu Val Lys Pro 1045 1050 1055
GAA GAA TTG GGG GGT AAG GTC AAG TCT CTC ATT ACG CCC AAT CGG CCA 3546 Glu Glu Leu Gly Gly Lys Val Lys Ser Leu Ile Thr Pro Asn Arg Pro 1060 1065 1070 1075GAA GAA TTG GGG GGT AAG GTC AAG TCT CTC ATT ACG CCC AAT CGG CCA 3546 Glu Glu Leu Gly Gly Lys Val Lys Ser Leu Ile Thr Pro Asn Arg Pro 1060 1065 1070 1075
ATT GCC CGT ACC ATC AAG GCC AAC CCC ATT GCC GTC AAG CTC TGG GAG 3594 Ile Ala Arg Thr Ile Lys Ala Asn Pro Ile Ala Val Lys Leu Trp Glu 1080 1085 1090ATT GCC CGT ACC ATC AAG GCC AAC CCC ATT GCC GTC AAG CTC TGG GAG 3594 Ile Ala Arg Thr Ile Lys Ala Asn Pro Ile Ala Val Lys Leu Trp Glu 1080 1085 1090
TTA ATT GGC CTG CTG GTG GGA GAT GGC AAC TGG GGT GGA CAA TCG AAC 3642 Leu Ile Gly Leu Leu Val Gly Asp Gly Asn Trp Gly Gly Gin Ser Asn 1095 1100 1105TTA ATT GGC CTG CTG GTG GGA GAT GGC AAC TGG GGT GGA CAA TCG AAC 3642 Leu Ile Gly Leu Leu Val Gly Asp Gly Asn Trp Gly Gly Gin Ser Asn 1095 1100 1105
TGG GCC AAA TAC TAC GTT GGC CTC TCC TGT GGG CTG GAT AAA GCC GAA 3690 Trp Ala Lys Tyr Tyr Val Gly Leu Ser Cys Gly Leu Asp Lys Ala Glu 1110 1115 1120TGG GCC AAA TAC TAC GTT GGC CTC TCC TGT GGG CTG GAT AAA GCC GAA 3690 Trp Ala Lys Tyr Tyr Val Gly Leu Ser Cys Gly Leu Asp Lys Ala Glu 1110 1115 1120
ATA GAG AGA AAA GTC CTG AAC CCT TTA AGA GAG GCA AGC GTC ATC TCC 3738 Ile Glu Arg Lys Val Leu Asn Pro Leu Arg Glu Ala Ser Val Ile Ser 1125 1130 1135ATA GAG AGA AAA GTC CTG AAC CCT TTA AGA GAG GCA AGC GTC ATC TCC 3738 Ile Glu Arg Lys Val Leu Asn Pro Leu Arg Glu Ala Ser Val Ile Ser 1125 1130 1135
AAC TAC TAC GAC AAG AGC AAG AAG GGC GAC GTT TCC ATA CTC TCC AAG 3786 Asn Tyr Tyr Asp Lys Ser Lys Lys Gly Asp Val Ser Ile Leu Ser Lys 1140 1145 1150 1155AAC TAC TAC GAC AAG AGC AAG AAG GGC GAC GTT TCC ATA CTC TCC AAG 3786 Asn Tyr Tyr Asp Lys Ser Lys Lys Gly Asp Val Ser Ile Leu Ser Lys 1140 1145 1150 1155
TGG CTC GCC GGA TTC ATG GTC AAA TAC TTC AAA GAT GAA AAT GGG AAC 3834 Trp Leu Ala Gly Phe Met Val Lys Tyr Phe Lys Asp Glu Asn Gly Asn 1160 1165 1170TGG CTC GCC GGA TTC ATG GTC AAA TAC TTC AAA GAT GAA AAT GGG AAC 3834 Trp Leu Ala Gly Phe Met Val Lys Tyr Phe Lys Asp Glu Asn Gly Asn 1160 1165 1170
AAG GCC ATT CCC AGC TTC ATG TTC AAC CTT CCA AGG GAA TAC ATA GAG 3882 Lys Ala Ile Pro Ser Phe Met Phe Asn Leu Pro Arg Glu Tyr Ile Glu 1175 1180 1185AAG GCC ATT CCC AGC TTC ATG TTC AAC CTT CCA AGG GAA TAC ATA GAG 3882 Lys Ala Ile Pro Ser Phe Met Phe Asn Leu Pro Arg Glu Tyr Ile Glu 1175 1180 1185
GCC TTT CTA CGG GGG CTG TTT TCA GCG GAC GGA ACG GTA AGC TTG CGT 3930 Ala Phe Leu Arg Gly Leu Phe Ser Ala Asp Gly Thr Val Ser Leu Arg 1190 1195 1200GCC TTT CTA CGG GGG CTG TTT TCA GCG GAC GGA ACG GTA AGC TTG CGT 3930 Ala Phe Leu Arg Gly Leu Phe Ser Ala Asp Gly Thr Val Ser Leu Arg 1190 1195 1200
AGA GGA ATC CCA GAA ATT AGA CTG ACA AGC GTT AAC AGA GAG CTT AGT 3978 Arg Gly Ile Pro Glu Ile Arg Leu Thr Ser Val Asn Arg Glu Leu Ser 1205 1210 1215AGA GGA ATC CCA GAA ATT AGA CTG ACA AGC GTT AAC AGA GAG CTT AGT 3978 Arg Gly Ile Pro Glu Ile Arg Leu Thr Ser Val Asn Arg Glu Leu Ser 1205 1210 1215
GAT GCC GTG AGA AAG TTG CTG TGG CTG GTT GGG GTC TCC AAC TCA CTA 4026 Asp Ala Val Arg Lys Leu Leu Trp Leu Val Gly Val Ser Asn Ser Leu 1220 1225 1230 1235GAT GCC GTG AGA AAG TTG CTG TGG CTG GTT GGG GTC TCC AAC TCA CTA 4026 Asp Ala Val Arg Lys Leu Leu Trp Leu Val Gly Val Ser Asn Ser Leu 1220 1225 1230 1235
TTC ACC GAA ACC AAG CCA AAC CGG TAC CTG GAG AAA GAA AGT GGA ACG 4074 Phe Thr Glu Thr Lys Pro Asn Arg Tyr Leu Glu Lys Glu Ser Gly Thr 1240 1245 1250TTC ACC GAA ACC AAG CCA AAC CGG TAC CTG GAG AAA GAA AGT GGA ACG 4074 Phe Thr Glu Thr Lys Pro Asn Arg Tyr Leu Glu Lys Glu Ser Gly Thr 1240 1245 1250
CAT TCG ATT CAC GTG AGG ATA AAG AAC AAG CAT CGC TTT GCC GAT AGA 4122 His Ser Ile His Val Arg Ile Lys Asn Lys His Arg Phe Ala Asp Arg 1255 1260 1265
ATA GGC TTT CTC ATA GAC AGA AAA TCC ACC AAA CTC TCC GAG AAC CTG 4170 Ile Gly Phe Leu Ile Asp Arg Lys Ser Thr Lys Leu Ser Glu Asn Leu 1270 1275 1280CAT TCG ATT CAC GTG AGG ATA AAG AAC AAG CAT CGC TTT GCC GAT AGA 4122 His Ser Ile His Val Arg Ile Lys Asn Lys His Arg Phe Ala Asp Arg 1255 1260 1265 ATA GGC TTT CTC ATA GAC AGA AAA TCC ACC AAA CTC TCC GAG AAC CTG 4170 Ile Gly Phe Leu Ile Asp Arg Lys Ser Thr Lys Leu Ser Glu Asn Leu 1270 1275 1280
GGG GGA CAT ACA AAC AAG AAG AGG GCT TAC AAA TAT GAT TTT GAC TTG 4218 Gly Gly His Thr Asn Lys Lys Arg Ala Tyr Lys Tyr Asp Phe Asp Leu 1285 1290 1295GGG GGA CAT ACA AAC AAG AAG AGG GCT TAC AAA TAT GAT TTT GAC TTG 4218 Gly Gly His Thr Asn Lys Lys Arg Ala Tyr Lys Tyr Asp Phe Asp Leu 1285 1290 1295
GTA TAC CCC AGA AAA ATC GAA GAG ATA ACC TAC GAC GGC TAC GTC TAT 4266 Val Tyr Pro Arg Lys Ile Glu Glu Ile Thr Tyr Asp Gly Tyr Val Tyr 1300 1305 1310 1315GTA TAC CCC AGA AAA ATC GAA GAG ATA ACC TAC GAC GGC TAC GTC TAT 4266 Val Tyr Pro Arg Lys Ile Glu Glu Ile Thr Tyr Asp Gly Tyr Val Tyr 1300 1305 1310 1315
GAC ATC GAG GTT GAG GGA ACC CAC AGG TTC TTC GCC AAC GGA ATA CTC 4314 Asp Ile Glu Val Glu Gly Thr His Arg Phe Phe Ala Asn Gly Ile Leu 1320 1325 1330GAC ATC GAG GTT GAG GGA ACC CAC AGG TTC TTC GCC AAC GGA ATA CTC 4314 Asp Ile Glu Val Glu Gly Thr His Arg Phe Phe Ala Asn Gly Ile Leu 1320 1325 1330
GTT CAC AAC ACA GAC GGC TTT TTC GCA ACA ATC CCC GGA GCG GAC GCC 4362 Val His Asn Thr Asp Gly Phe Phe Ala Thr Ile Pro Gly Ala Asp Ala 1335 1340 1345GTT CAC AAC ACA GAC GGC TTT TTC GCA ACA ATC CCC GGA GCG GAC GCC 4362 Val His Asn Thr Asp Gly Phe Phe Ala Thr Ile Pro Gly Ala Asp Ala 1335 1340 1345
GAG ACG GTC AAA AAG AAG GCC AGG GAG TTC CTT AAC TAC ATT AAC CCC 4410 Glu Thr Val Lys Lys Lys Ala Arg Glu Phe Leu Asn Tyr Ile Asn Pro 1350 1355 1360GAG ACG GTC AAA AAG AAG GCC AGG GAG TTC CTT AAC TAC ATT AAC CCC 4410 Glu Thr Val Lys Lys Lys Ala Arg Glu Phe Leu Asn Tyr Ile Asn Pro 1350 1355 1360
AAG CTG CCC GGT CTC CTC GAA CTC GAG TAC GAG GGC TTC TAC AGG CGC 4458 Lys Leu Pro Gly Leu Leu Glu Leu Glu Tyr Glu Gly Phe Tyr Arg Arg 1365 1370 1375AAG CTG CCC GGT CTC CTC GAA CTC GAG TAC GAG GGC TTC TAC AGG CGC 4458 Lys Leu Pro Gly Leu Leu Glu Leu Glu Tyr Glu Gly Phe Tyr Arg Arg 1365 1370 1375
GGT TTC TTC GTA ACC AAG AAG AAG TAC GCG GTG ATA GAC GAG GAG GGC 4506 Gly Phe Phe Val Thr Lys Lys Lys Tyr Ala Val Ile Asp Glu Glu Gly 1380 1385 1390 1395GGT TTC TTC GTA ACC AAG AAG AAG TAC GCG GTG ATA GAC GAG GAG GGC 4506 Gly Phe Phe Val Thr Lys Lys Lys Tyr Ala Val Ile Asp Glu Glu Gly 1380 1385 1390 1395
AAG ATA ACG ACG CGC GGG CTT GAG ATC GTC CGG CGC GAC TGG AGT GAG 4554 Lys Ile Thr Thr Arg Gly Leu Glu Ile Val Arg Arg Asp Trp Ser Glu 1400 1405 1410AAG ATA ACG ACG CGC GGG CTT GAG ATC GTC CGG CGC GAC TGG AGT GAG 4554 Lys Ile Thr Thr Arg Gly Leu Glu Ile Val Arg Arg Asp Trp Ser Glu 1400 1405 1410
GTG GCT AAG GAG ACG CAG GCG AGG GTC TTG GAG GCC ATA CTG CGG CAC 4602 Val Ala Lys Glu Thr Gin Ala Arg Val Leu Glu Ala Ile Leu Arg His 1415 1420 1425GTG GCT AAG GAG ACG CAG GCG AGG GTC TTG GAG GCC ATA CTG CGG CAC 4602 Val Ala Lys Glu Thr Gin Ala Arg Val Leu Glu Ala Ile Leu Arg His 1415 1420 1425
GGT GAC GTC GAG GAG GCC GTG AGG ATT GTC AAG GAA GTG ACG GAA AAG 4650 Gly Asp Val Glu Glu Ala Val Arg Ile Val Lys Glu Val Thr Glu Lys 1430 1435 1440GGT GAC GTC GAG GAG GCC GTG AGG ATT GTC AAG GAA GTG ACG GAA AAG 4650 Gly Asp Val Glu Glu Ala Val Arg Ile Val Lys Glu Val Thr Glu Lys 1430 1435 1440
CTG AGC AAG TAC GAG GTT CCG CCA GAG AAG CTC GTC ATC CAC GAG CAG 4698 Leu Ser Lys Tyr Glu Val Pro Pro Glu Lys Leu Val Ile His Glu Gin 1445 1450 1455CTG AGC AAG TAC GAG GTT CCG CCA GAG AAG CTC GTC ATC CAC GAG CAG 4698 Leu Ser Lys Tyr Glu Val Pro Pro Glu Lys Leu Val Ile His Glu Gin 1445 1450 1455
ATT ACC AGG GAG CTG AAG GAC TAC AAG GCC ACC GGC CCG CAC GTG GCC 4746 Ile Thr Arg Glu Leu Lys Asp Tyr Lys Ala Thr Gly Pro His Val Ala 1460 1465 1470 1475ATT ACC AGG GAG CTG AAG GAC TAC AAG GCC ACC GGC CCG CAC GTG GCC 4746 Ile Thr Arg Glu Leu Lys Asp Tyr Lys Ala Thr Gly Pro His Val Ala 1460 1465 1470 1475
ATA GCG AAG CGC CTC GCC GCG AGG GGG ATT AAG GTT CGC CCT GGG ACA 4794 Ile Ala Lys Arg Leu Ala Ala Arg Gly Ile Lys Val Arg Pro Gly Thr 1480 1485 1490ATA GCG AAG CGC CTC GCC GCG AGG GGG ATT AAG GTT CGC CCT GGG ACA 4794 Ile Ala Lys Arg Leu Ala Ala Arg Gly Ile Lys Val Arg Pro Gly Thr 1480 1485 1490
GTC ATC AGC TAC ATC GTC CTG AAA GGT TCC GGC AGG ATA GGG GAC AGG 4842 Val Ile Ser Tyr Ile Val Leu Lys Gly Ser Gly Arg Ile Gly Asp Arg 1495 1500 1505
ACG ATA CCC TTC GAC GAG TTC GAC CCC ACG AAG CAC AGG TAC GAT GCG 4890 Thr Ile Pro Phe Asp Glu Phe Asp Pro Thr Lys His Arg Tyr Asp Ala 1510 1515 1520GTC ATC AGC TAC ATC GTC CTG AAA GGT TCC GGC AGG ATA GGG GAC AGG 4842 Val Ile Ser Tyr Ile Val Leu Lys Gly Ser Gly Arg Ile Gly Asp Arg 1495 1500 1505 ACG ATA CCC TTC GAC GAG TTC GAC CCC ACG AAG CAC AGG TAC GAT GCG 4890 Thr Ile Pro Phe Asp Glu Phe Asp Pro Thr Lys His Arg Tyr Asp Ala 1510 1515 1520
GAG TAC TAC ATC GAG AAC CAG GTT CTG CCG GCG GTG GAG AGA ATC CTC 4938 Glu Tyr Tyr Ile Glu Asn Gin Val Leu Pro Ala Val Glu Arg Ile Leu 1525 1530 1535GAG TAC TAC ATC GAG AAC CAG GTT CTG CCG GCG GTG GAG AGA ATC CTC 4938 Glu Tyr Tyr Ile Glu Asn Gin Val Leu Pro Ala Val Glu Arg Ile Leu 1525 1530 1535
AAG GCC TTC GGC TAC AAG AAG GAG GAT TTG CGC TAC CAG AAG ACG CGG 4986 Lys Ala Phe Gly Tyr Lys Lys Glu Asp Leu Arg Tyr Gin Lys Thr Arg 1540 1545 1550 1555AAG GCC TTC GGC TAC AAG AAG GAG GAT TTG CGC TAC CAG AAG ACG CGG 4986 Lys Ala Phe Gly Tyr Lys Lys Glu Asp Leu Arg Tyr Gin Lys Thr Arg 1540 1545 1550 1555
CAG GTT GGG CTG GGG GCG TGG CTC AAA ATG GGG AAG AAA TGA 5028CAG GTT GGG CTG GGG GCG TGG CTC AAA ATG GGG AAG AAA TGA 5028
Gin Val Gly Leu Gly Ala Trp Leu Lys Met Gly Lys LysGin Val Gly Leu Gly Ala Trp Leu Lys Met Gly Lys Lys
1560 1565 15681560 1565 1568
AGGCCAAGCT T 5039AGGCCAAGCT T 5039
(2) INFORMATION POUR LA SEQ ID NO : 2 :(2) INFORMATION FOR SEQ ID NO: 2:
(i) CARACTERISTIQUES DE LA SEQUENCE:(i) CHARACTERISTICS OF THE SEQUENCE:
(A) LONGUEUR: 774 acides aminés (ii) TYPE DE MOLECULE: protéine (ix) CARACTERISTIQUES(A) LENGTH: 774 amino acids (ii) TYPE OF MOLECULE: protein (ix) CHARACTERISTICS
(A) NOM/CLE: ADN polymerase de THERMOCOCCUS fumicolans Tfu (xi) DESCRIPTION DE LA SEQUENCES: SEQ ID NO : 2 :(A) NAME / KEY: DNA polymerase of THERMOCOCCUS fumicolans Tfu (xi) DESCRIPTION OF THE SEQUENCES: SEQ ID NO: 2:
Met Ile Leu Asp Thr Asp Tyr Ile Thr Glu Asp Gly Arg Pro Val Ile 1 5 10 15Met Ile Leu Asp Thr Asp Tyr Ile Thr Glu Asp Gly Arg Pro Val Ile 1 5 10 15
Arg Val Phe Lys Lys Glu Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg 20 25 30Arg Val Phe Lys Lys Glu Asn Gly Glu Phe Lys Ile Glu Tyr Asp Arg 20 25 30
Asp Phe Glu Pro Tyr Ile Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile 35 40 45Asp Phe Glu Pro Tyr Ile Tyr Ala Leu Leu Lys Asp Asp Ser Ala Ile 35 40 45
Glu Asp Val Lys Lys Ile Thr Ala Ser Arg His Gly Thr Thr Val Arg 50 55 60Glu Asp Val Lys Lys Ile Thr Ala Ser Arg His Gly Thr Thr Val Arg 50 55 60
Val Val Arg Ala Gly Lys Val Lys Lys Lys Phe Leu Gly Arg Pro Ile 65 70 75 80Val Val Arg Ala Gly Lys Val Lys Lys Lys Phe Leu Gly Arg Pro Ile 65 70 75 80
Glu Val Trp Lys Leu Tyr Phe Thr His Pro Gin Asp Val Pro Ala Ile 85 90 95Glu Val Trp Lys Leu Tyr Phe Thr His Pro Gin Asp Val Pro Ala Ile 85 90 95
Arg Asp Lys Ile Arg Glu His Pro Ala Val Val Asp Ile Tyr Glu Tyr 100 105 110Arg Asp Lys Ile Arg Glu His Pro Ala Val Val Asp Ile Tyr Glu Tyr 100 105 110
Asp Ile Pro Phe Ala Lys Arg Tyr Leu Ile Asp Lys Gly Leu Ile Pro 115 120 125Asp Ile Pro Phe Ala Lys Arg Tyr Leu Ile Asp Lys Gly Leu Ile Pro 115 120 125
Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe Asp Ile Glu Thr 130 135 140Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe Asp Ile Glu Thr 130 135 140
Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro Ile Leu Met Ile 145 150 155 160Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro Ile Leu Met Ile 145 150 155 160
Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val Ile Thr Trp Lys Lys Ile 165 170 175
Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Lys Glu Met Ile Lys 180 185 190Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val Ile Thr Trp Lys Lys Ile 165 170 175 Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Lys Glu Met Ile Lys 180 185 190
Arg Phe Leu Lys Val Val Lys Glu Lys Asp Pro Asp Val Leu Ile Thr 195 200 205Arg Phe Leu Lys Val Val Lys Glu Lys Asp Pro Asp Val Leu Ile Thr 195 200 205
Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys Lys Arg Ser Glu 210 215 220Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys Lys Arg Ser Glu 210 215 220
Lys Leu Gly Val Lys Phe Ile Leu Gly Arg Asp Gly Ser Glu Pro Lys 225 230 235 240Lys Leu Gly Val Lys Phe Ile Leu Gly Arg Asp Gly Ser Glu Pro Lys 225 230 235 240
Ile Gin Arg Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Arg Ile 245 250 255Ile Gin Arg Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Arg Ile 245 250 255
His Phe Asp Leu Tyr Pro Val Ile Arg His Thr Ile Asn Leu Pro Thr 260 265 270His Phe Asp Leu Tyr Pro Val Ile Arg His Thr Ile Asn Leu Pro Thr 260 265 270
Tyr Thr Leu Glu Ala Val Tyr Glu Ala Ile Phe Gly Gin Pro Lys Glu 275 280 285Tyr Thr Leu Glu Ala Val Tyr Glu Ala Ile Phe Gly Gin Pro Lys Glu 275 280 285
Lys Val Tyr Ala Glu Glu Ile Ala Gin Ala Trp Glu Thr Gly Glu Gly 290 295 300Lys Val Tyr Ala Glu Glu Ile Ala Gin Ala Trp Glu Thr Gly Glu Gly 290 295 300
Leu Glu Arg Val Ala Arg Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr 305 310 315 320Leu Glu Arg Val Ala Arg Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr 305 310 315 320
Glu Leu Gly Arg Glu Phe Phe Pro Met Glu Ala Gin Leu Ser Arg Leu 325 330 335Glu Leu Gly Arg Glu Phe Phe Pro Met Glu Ala Gin Leu Ser Arg Leu 325 330 335
Val Gly Gin Ser Phe Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 340 345 350Val Gly Gin Ser Phe Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 340 345 350
Val Glu Trp Tyr Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala 355 360 365Val Glu Trp Tyr Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala 355 360 365
Pro Asn Lys Pro Ser Gly Arg Glu Leu Glu Arg Arg Arg Gly Gly Tyr 370 375 380Pro Asn Lys Pro Ser Gly Arg Glu Leu Glu Arg Arg Arg Gly Gly Tyr 370 375 380
Ala Gly Gly Tyr Val Lys Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile 385 390 395 400Ala Gly Gly Tyr Val Lys Glu Pro Glu Arg Gly Leu Trp Glu Asn Ile 385 390 395 400
Ala Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser Ile Ile Ile Ser His 405 405 410Ala Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser Ile Ile Ile Ser His 405 405 410
Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Gly Glu Tyr Asp 415 420 425Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Gly Glu Tyr Asp 415 420 425
Glu Ala Pro GÏn Val Gly His Arg Phe Cys Lys Asp Phe Pro Gly Phe 430 435 440Glu Ala Pro GÏn Val Gly His Arg Phe Cys Lys Asp Phe Pro Gly Phe 430 435 440
Ile Pro Ser Leu Leu Gly Asp Leu Leu Asp Glu Arg Gin Lys Val Lys 445 450 455Ile Pro Ser Leu Leu Gly Asp Leu Leu Asp Glu Arg Gin Lys Val Lys 445 450 455
Lys His Met Lys Ala Thr Val Asp Pro Ile Glu Lys Lys Leu Leu Asp 460 465 470 475Lys His Met Lys Ala Thr Val Asp Pro Ile Glu Lys Lys Leu Leu Asp 460 465 470 475
Tyr Arg Gin Arg Ala Ile Lys Ile Leu Ala Asn Ser Phe Tyr Gly Tyr 480 485 490
Tyr Gly Tyr Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser 495 500 505Tyr Arg Gin Arg Ala Ile Lys Ile Leu Ala Asn Ser Phe Tyr Gly Tyr 480 485 490 Tyr Gly Tyr Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser 495 500 505
Val Thr Ala Trp Gly Arg Gin Tyr Ile Glu Thr Thr Met Arg Glu Ile 510 515 520Val Thr Ala Trp Gly Arg Gin Tyr Ile Glu Thr Thr Met Arg Glu Ile 510 515 520
Glu Glu Lys Phe Gly Phe Lys Val Leu Tyr Ala Asp Thr Asp Gly Phe 525 530 535Glu Glu Lys Phe Gly Phe Lys Val Leu Tyr Ala Asp Thr Asp Gly Phe 525 530 535
Phe Ala Thr Ile Pro Gly Ala Asp Ala Glu Thr Val Lys Lys Lys Ala 540 545 555 560Phe Ala Thr Ile Pro Gly Ala Asp Ala Glu Thr Val Lys Lys Lys Ala 540 545 555 560
Arg Glu Phe Leu Asn Tyr Ile Asn Pro Lys Leu Pro Gly Leu Leu Glu 565 570 575Arg Glu Phe Leu Asn Tyr Ile Asn Pro Lys Leu Pro Gly Leu Leu Glu 565 570 575
Leu Glu Tyr Glu Gly Phe Tyr Arg Arg Gly Phe Phe Val Thr Lys Lys 580 585 590Leu Glu Tyr Glu Gly Phe Tyr Arg Arg Gly Phe Phe Val Thr Lys Lys 580 585 590
Lys Tyr Ala Val Ile Asp Glu Glu Gly Lys Ile Thr Thr Arg Gly Leu 595 600 605Lys Tyr Ala Val Ile Asp Glu Glu Gly Lys Ile Thr Thr Arg Gly Leu 595 600 605
Glu Ile Val Arg Arg Asp Trp Ser Glu Val Ala Lys Glu Thr Gin Ala 610 615 620Glu Ile Val Arg Arg Asp Trp Ser Glu Val Ala Lys Glu Thr Gin Ala 610 615 620
Arg Val Leu Glu Ala Ile Leu Arg His Gly Asp Val Glu Glu Ala Val 625 630 635 640Arg Val Leu Glu Ala Ile Leu Arg His Gly Asp Val Glu Glu Ala Val 625 630 635 640
Arg Ile Val Lys Glu Val Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro 645 650 655Arg Ile Val Lys Glu Val Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro 645 650 655
Pro Glu Lys Leu Val Ile His Glu Gin Ile Thr Arg Glu Leu Lys Asp 660 665 670Pro Glu Lys Leu Val Ile His Glu Gin Ile Thr Arg Glu Leu Lys Asp 660 665 670
Tyr Lys Ala Thr Gly Pro His Val Ala Ile Ala Lys Arg Leu Ala Ala 675 680 685Tyr Lys Ala Thr Gly Pro His Val Ala Ile Ala Lys Arg Leu Ala Ala 675 680 685
Arg Gly Ile Lys Val Arg Pro Gly Thr Val Ile Ser Tyr Ile Val Leu 690 695 700Arg Gly Ile Lys Val Arg Pro Gly Thr Val Ile Ser Tyr Ile Val Leu 690 695 700
Lys Gly Ser Gly Arg Ile Gly Asp Arg Thr Ile Pro Phe Asp Glu Phe 705 710 715 720Lys Gly Ser Gly Arg Ile Gly Asp Arg Thr Ile Pro Phe Asp Glu Phe 705 710 715 720
Asp Pro Thr Lys His Arg Tyr Asp Ala Glu Tyr Tyr Ile Glu Asn Gin 725 730 735Asp Pro Thr Lys His Arg Tyr Asp Ala Glu Tyr Tyr Ile Glu Asn Gin 725 730 735
Val Leu Pro Ala Val Glu Arg Ile Leu Lys Ala Phe Gly Tyr Lys Lys 740 745 750Val Leu Pro Ala Val Glu Arg Ile Leu Lys Ala Phe Gly Tyr Lys Lys 740 745 750
Glu Asp Leu Arg Tyr Gin Lys Thr Arg Gin Val Gly Leu Gly Ala Trp 755 760 765Glu Asp Leu Arg Tyr Gin Lys Thr Arg Gin Val Gly Leu Gly Ala Trp 755 760 765
Leu Lys Met Gly Lys Lys 770 774
(2) INFORMATION POUR LA SEQ ID NO : 3 :Leu Lys Met Gly Lys Lys 770 774 (2) INFORMATION FOR SEQ ID NO: 3:
(i) CARACTERISTIQUES DE LA SEQUENCE:(i) CHARACTERISTICS OF THE SEQUENCE:
(A) LONGUEUR: 360 acides aminés (ii) TYPE DE MOLECULE: protéine ( ix ) CARACTERISTIQUES(A) LENGTH: 360 amino acids (ii) TYPE OF MOLECULE: protein (ix) CHARACTERISTICS
(A) NOM/CLE: intéine I-Tfu-1 (xi) DESCRIPTION DE LA SEQUENCES: SEQ ID NO: 3 :(A) NAME / KEY: intein I-Tfu-1 (xi) DESCRIPTION OF THE SEQUENCES: SEQ ID NO: 3:
Cys His Pro Ala Asp Thr Lys Val Ile Val Lys Gly Lys Gly Val Val 1 5 10 15Cys His Pro Ala Asp Thr Lys Val Ile Val Lys Gly Lys Gly Val Val 1 5 10 15
Asn Ile Ser Glu Val Arg Glu Gly Asp Tyr Val Leu Gly Ile Asp Gly 20 25 30Asn Ile Ser Glu Val Arg Glu Gly Asp Tyr Val Leu Gly Ile Asp Gly 20 25 30
Trp Gin Lys Val Gin Arg Val Trp Glu Tyr Asp Tyr Glu Gly Glu Leu 35 40 45Trp Gin Lys Val Gin Arg Val Trp Glu Tyr Asp Tyr Glu Gly Glu Leu 35 40 45
Val Asn Ile Asn Gly Leu Lys Cys Thr Pro Asn His Lys Leu Pro Val 50 55 60Val Asn Ile Asn Gly Leu Lys Cys Thr Pro Asn His Lys Leu Pro Val 50 55 60
Val Arg Arg Thr Glu Arg Gin Thr Ala Ile Arg Asp Ser Leu Ala Lys 65 70 75 80Val Arg Arg Thr Glu Arg Gin Thr Ala Ile Arg Asp Ser Leu Ala Lys 65 70 75 80
Ser Phe Leu Thr Lys Lys Val Lys Gly Lys Leu Ile Thr Thr Pro Leu 85 90 95Ser Phe Leu Thr Lys Lys Val Lys Gly Lys Leu Ile Thr Thr Pro Leu 85 90 95
Phe Glu Lys Ile Gly Lys Ile Glu Arg Glu Asp Val Pro Glu Glu Glu 100 105 110Phe Glu Lys Ile Gly Lys Ile Glu Arg Glu Asp Val Pro Glu Glu Glu 100 105 110
Ile Leu Lys Gly Glu Leu Ala Gly Ile Ile Leu Ala Glu Gly Thr Leu 115 120 125Ile Leu Lys Gly Glu Leu Ala Gly Ile Leu Ala Glu Gly Thr Leu 115 120 125
Leu Arg Lys Asp Val Glu Tyr Phe Asp Ser Ser Arg Gly Lys Lys Arg 130 135 140Leu Arg Lys Asp Val Glu Tyr Phe Asp Ser Ser Arg Gly Lys Lys Arg 130 135 140
Val Ser His Gin Tyr Arg Val Glu Ile Thr Val Gly Ala Gin Glu Glu 145 150 155 160Val Ser His Gin Tyr Arg Val Glu Ile Thr Val Gly Ala Gin Glu Glu 145 150 155 160
Asp Phe X Arg Arg Ile Val Tyr Ile Phe Glu Arg Leu Phe Gly Val 165 170 175Asp Phe X Arg Arg Ile Val Tyr Ile Phe Glu Arg Leu Phe Gly Val 165 170 175
Thr Pro Ser Val Tyr Arg Lys Lys Asn Thr Asn Ala Ile Thr Phe Lys 180 185 190Thr Pro Ser Val Tyr Arg Lys Lys Asn Thr Asn Ala Ile Thr Phe Lys 180 185 190
Val Ala Lys Lys Glu Val Tyr Leu Arg Val Arg Glu Ile Met Asp Gly 195 200 205Val Ala Lys Lys Glu Val Tyr Leu Arg Val Arg Glu Ile Met Asp Gly 195 200 205
Ile Glu Asn Leu His Ala Pro Ser Val Leu Arg Gly Phe Phe Glu Gly 210 215 220Ile Glu Asn Leu His Ala Pro Ser Val Leu Arg Gly Phe Phe Glu Gly 210 215 220
Asp Gly Ser Val Asn Lys Val Arg Lys Thr Val Val Val Asn Gin Gly 225 230 235 240Asp Gly Ser Val Asn Lys Val Arg Lys Thr Val Val Val Asn Gin Gly 225 230 235 240
Thr Asn Asn Glu Trp Lys Ile Glu Val Val Ser Lys Leu Leu Asn Lys 245 250 255Thr Asn Asn Glu Trp Lys Ile Glu Val Val Ser Lys Leu Leu Asn Lys 245 250 255
Leu Gly Ile Pro His Arg Arg Tyr Thr Tyr Asp Tyr Thr Glu Arg Glu 260 265 270
Lys Thr Met Thr Thr His I le Leu Glu I le Ala Gly Arg Asp Gly Leu 275 280 285Leu Gly Ile Pro His Arg Arg Tyr Thr Tyr Asp Tyr Thr Glu Arg Glu 260 265 270 Lys Thr Met Thr Thr His I Leu Glu I Ala Gly Arg Asp Gly Leu 275 280 285
Ile Leu Phe Gin Thr Ile Val Gly Phe Ile Ser Thr Glu Lys Asn Met 290 295 300Ile Leu Phe Gin Thr Ile Val Gly Phe Ile Ser Thr Glu Lys Asn Met 290 295 300
Ala Leu Glu Glu Ala Ile Arg Asn Arg Glu Val Asn Arg Leu Glu Asn 305 310 315 320Ala Leu Glu Glu Ala Ile Arg Asn Arg Glu Val Asn Arg Leu Glu Asn 305 310 315 320
Asn Ala Phe Tyr Thr Leu Ala Asp Phe Thr Ala Lys Thr Glu Tyr Tyr 325 330 335Asn Ala Phe Tyr Thr Leu Ala Asp Phe Thr Ala Lys Thr Glu Tyr Tyr 325 330 335
Lys Gly Lys Val Tyr Asp Leu Thr Leu Glu Gly Thr Pro Tyr Tyr Phe 340 345 350Lys Gly Lys Val Tyr Asp Leu Thr Leu Glu Gly Thr Pro Tyr Tyr Phe 340 345 350
Ala Asn Gly Ile Leu Thr His Asn 355 360Ala Asn Gly Ile Leu Thr His Asn 355 360
(2 ) INFORMATION POUR LA SEQ ID NO : 4 :(2) INFORMATION FOR SEQ ID NO: 4:
( i ) CARACTRERI STIQUES DE LA SEQUENCE :(i) CHARACTERISTICS OF THE SEQUENCE:
(A) LONGUEUR : 389 acides aminés ( ii ) TYPE DE MOLECULE : protéine ( ix ) CARACTERI STIQUES(A) LENGTH: 389 amino acids (ii) TYPE OF MOLECULE: protein (ix) CHARACTERISTICS
(A) NOM/CLE : intéine I-Tfu-2 (xi ) DESCRIPTION DE LA SEQUENCES : SEQ ID NO : 4 :(A) NAME / KEY: intein I-Tfu-2 (xi) DESCRIPTION OF THE SEQUENCES: SEQ ID NO: 4:
Ser Val Thr Gly Asp Thr Glu Val Thr I le Arg Arg Asn Gly Arg I le 1 5 10 15Ser Val Thr Gly Asp Thr Glu Val Thr I le Arg Arg Asn Gly Arg I le 1 5 10 15
Glu Phe Val Pro Ile Glu Lys Leu Phe Glu Arg Val Asp His Arg Val 20 25 30Glu Phe Val Pro Ile Glu Lys Leu Phe Glu Arg Val Asp His Arg Val 20 25 30
Gly Glu Lys Glu Tyr Cys Val Leu Gly Gly Val Glu Ala Leu Thr Leu 35 40 45Gly Glu Lys Glu Tyr Cys Val Leu Gly Gly Val Glu Ala Leu Thr Leu 35 40 45
Asp Asn Arg Gly Arg Leu Val Trp Lys Lys Val Pro Tyr Val Met Arg 50 55 60Asp Asn Arg Gly Arg Leu Val Trp Lys Lys Val Pro Tyr Val Met Arg 50 55 60
His Lys Thr Asp Lys Arg I le Tyr Arg Val Trp Phe Thr Asn Ser Trp 65 70 75 80His Lys Thr Asp Lys Arg I le Tyr Arg Val Trp Phe Thr Asn Ser Trp 65 70 75 80
Tyr Leu Asp Val Thr Glu Asp His Ser Leu Ile Gly Tyr Leu Asn Thr 85 90 95Tyr Leu Asp Val Thr Glu Asp His Ser Leu Ile Gly Tyr Leu Asn Thr 85 90 95
Ser Lys Val Lys Pro Gly Lys Pro Leu Lys Glu Arg Leu Val Glu Val 100 105 110Ser Lys Val Lys Pro Gly Lys Pro Leu Lys Glu Arg Leu Val Glu Val 100 105 110
Lys Pro Glu Glu Leu Gly Gly Lys Val Lys Ser Leu Ile Thr Pro Asn 115 120 125Lys Pro Glu Glu Leu Gly Gly Lys Val Lys Ser Leu Ile Thr Pro Asn 115 120 125
Arg Pro Ile Ala Arg Thr Ile Lys Ala Asn Pro Ile Ala Val Lys Leu 130 135 140Arg Pro Ile Ala Arg Thr Ile Lys Ala Asn Pro Ile Ala Val Lys Leu 130 135 140
Trp Glu Leu Ile Gly Leu Leu Val Gly Asp Gly Asn Trp Gly Gly Gin 145 150 155 160Trp Glu Leu Ile Gly Leu Leu Val Gly Asp Gly Asn Trp Gly Gly Gin 145 150 155 160
Ser Asn Trp Ala Lys Tyr Tyr Val Gly Leu Ser Cys Gly Leu Asp Lys 165 170 175
Ala Glu Ile Glu Arg Lys Val Leu Asn Pro Leu Arg Glu Ala Ser Val 180 185 190Ser Asn Trp Ala Lys Tyr Tyr Val Gly Leu Ser Cys Gly Leu Asp Lys 165 170 175 Ala Glu Ile Glu Arg Lys Val Leu Asn Pro Leu Arg Glu Ala Ser Val 180 185 190
Ile Ser Asn Tyr Tyr Asp Lys Ser Lys Lys Gly Asp Val Ser Ile Leu 195 200 205Ile Ser Asn Tyr Tyr Asp Lys Ser Lys Lys Gly Asp Val Ser Ile Leu 195 200 205
Ser Lys Trp Leu Ala Gly Phe Met Val Lys Tyr Phe Lys Asp Glu Asn 210 215 220Ser Lys Trp Leu Ala Gly Phe Met Val Lys Tyr Phe Lys Asp Glu Asn 210 215 220
Gly Asn Lys Ala Ile Pro Ser Phe Met Phe Asn Leu Pro Arg Glu Tyr 225 230 235 240Gly Asn Lys Ala Ile Pro Ser Phe Met Phe Asn Leu Pro Arg Glu Tyr 225 230 235 240
Ile Glu Ala Phe Leu Arg Gly Leu Phe Ser Ala Asp Gly Thr Val Ser 245 250 255Ile Glu Ala Phe Leu Arg Gly Leu Phe Ser Ala Asp Gly Thr Val Ser 245 250 255
Leu Arg Arg Gly Ile Pro Glu Ile Arg Leu Thr Ser Val Asn Arg Glu 260 265 270Leu Arg Arg Gly Ile Pro Glu Ile Arg Leu Thr Ser Val Asn Arg Glu 260 265 270
Leu Ser Asp Ala Val Arg Lys Leu Leu Trp Leu Val Gly Val Ser Asn 275 280 285Leu Ser Asp Ala Val Arg Lys Leu Leu Trp Leu Val Gly Val Ser Asn 275 280 285
Ser Leu Phe Thr Glu Thr Lys Pro Asn Arg Tyr Leu Glu Lys Glu Ser 290 295 300Ser Leu Phe Thr Glu Thr Lys Pro Asn Arg Tyr Leu Glu Lys Glu Ser 290 295 300
Gly Thr His Ser Ile His Val Arg Ile Lys Asn Lys His Arg Phe Ala 305 310 315 320Gly Thr His Ser Ile His Val Arg Ile Lys Asn Lys His Arg Phe Ala 305 310 315 320
Asp Arg Ile Gly Phe Leu Ile Asp Arg Lys Ser Thr Lys Leu Ser Glu 325 330 335Asp Arg Ile Gly Phe Leu Ile Asp Arg Lys Ser Thr Lys Leu Ser Glu 325 330 335
Asn Leu Gly Gly His Thr Asn Lys Lys Arg Ala Tyr Lys Tyr Asp Phe 340 345 350Asn Leu Gly Gly His Thr Asn Lys Lys Arg Ala Tyr Lys Tyr Asp Phe 340 345 350
Asp Leu Val Tyr Pro Arg Lys Ile Glu Glu Ile Thr Tyr Asp Gly Tyr 355 360 365Asp Leu Val Tyr Pro Arg Lys Ile Glu Glu Ile Thr Tyr Asp Gly Tyr 355 360 365
Val Tyr Asp Ile Glu Val Glu Gly Thr His Arg Phe Phe Ala Asn Gly 370 375 380Val Tyr Asp Ile Glu Val Glu Gly Thr His Arg Phe Phe Ala Asn Gly 370 375 380
Ile Leu Val His Asn 385 389
Ile Leu Val His Asn 385 389
Claims
REVENDICATIONS
1) ADN polymerase purifiée thermostable d ' archabactéries de l'espèce Thermococcus fumicolans ayant un poids moléculaire de l'ordre de 89 000 daltons et ses dérivés enzymatiquement équivalents .1) purified thermostable DNA polymerase of archabacteria of the species Thermococcus fumicolans having a molecular weight of the order of 89,000 daltons and its enzymatically equivalent derivatives.
2 ) ADN polymerase selon la revendication 1 dont la séquence en acides aminés est représentée dans la liste de séquence en annexe sous le numéro SEQ ID NO:l ou un fragment de celle-ci ou encore un assemblage de tels fragments .2) DNA polymerase according to claim 1, the amino acid sequence of which is represented in the annexed sequence list under the number SEQ ID NO: 1 or a fragment thereof or an assembly of such fragments.
3 ) ADN polymerase selon la revendication 2 dont la séquence en acides aminés est représentée dans la liste de séquence en annexe sous le numéro SEQ ID NO:2.3) DNA polymerase according to claim 2, the amino acid sequence of which is represented in the annexed sequence list under the number SEQ ID NO: 2.
4) Une séquence d'ADN constituée par ou comprenant la séquence codant pour une ADN polymerase selon quelconque des revendications 1 à 3.4) A DNA sequence constituted by or comprising the sequence coding for a DNA polymerase according to any one of claims 1 to 3.
5) Une séquence d'ADN selon la revendication 4 constituée par ou comprenant la séquence comprise entre les nucleotides 357 à 5028 de la séquence SED ID NO : 1, ou un fragment de celle-ci ou encore un assemblage de tels fragments .5) A DNA sequence according to claim 4 consisting of or comprising the sequence between nucleotides 357 to 5028 of the sequence SED ID NO: 1, or a fragment thereof or an assembly of such fragments.
6) Une séquence d'ADN selon l'une des revendications 4 à 5 constituée par ou comprenant les nucleotides 357 à 1674 et 2755 à 3156 et 4324 à 5028 de la séquence d'ADN représentée dans la liste de séquences en annexe sous le numéro SED ID NO : 1.6) A DNA sequence according to one of claims 4 to 5 consisting of or comprising nucleotides 357 to 1674 and 2755 to 3156 and 4324 to 5028 of the DNA sequence represented in the sequence list in the annex under the number SED ID NO: 1.
7) Un vecteur contenant la séquence d'ADN de l'une quelconque des revendications 4 à 6.
8) Un hôte transformé par un vecteur selon la revendication 7.7) A vector containing the DNA sequence of any one of claims 4 to 6. 8) A host transformed by a vector according to claim 7.
9) Procédé de préparation d'une ADN polymerase thermostable d'archaebactéries de l'espèce Thermococcus fumicolans , caractérisé en ce que l'on cultive l'hôte selon la revendication 8 dans des conditions permettant 1 ' expression de ladite ADN polymerase et en ce que 1 ' on extrait et récupère celle-ci par tout moyen approprié.9) Process for the preparation of a thermostable DNA polymerase of archaebacteria of the species Thermococcus fumicolans, characterized in that the host is cultivated according to claim 8 under conditions allowing the expression of said DNA polymerase and in that 1 extract and recover it by any suitable means.
10) Procédé d'amplification enzymatique d'une séquence d'acide nucléique caractérisé en ce que l'on met en oeuvre une ADN polymerase thermostable selon l'une quelconque des revendications 1 à 3.10) Method of enzymatic amplification of a nucleic acid sequence characterized in that a thermostable DNA polymerase according to any one of claims 1 to 3 is used.
11) Intéine purifiée thermostable d'archaebactéries de l'espèce Thermococcus fumicolans.11) Purified thermostable intein from archaebacteria of the species Thermococcus fumicolans.
FEUILLE RECTIFIEE (REGLE 91) ISA/EP
RECTIFIED SHEET (RULE 91) ISA / EP
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/FR1997/000761 WO1998049274A1 (en) | 1997-04-29 | 1997-04-29 | Thermostable dna polymerase and inteins of the thermococcus fumicolans species |
| PCT/FR1998/000868 WO1998049275A2 (en) | 1997-04-29 | 1998-04-29 | Archaebacterium intein of the thermococcus fumicolans species |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/FR1997/000761 WO1998049274A1 (en) | 1997-04-29 | 1997-04-29 | Thermostable dna polymerase and inteins of the thermococcus fumicolans species |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO1998049274A1 true WO1998049274A1 (en) | 1998-11-05 |
Family
ID=9502944
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/FR1997/000761 WO1998049274A1 (en) | 1997-04-29 | 1997-04-29 | Thermostable dna polymerase and inteins of the thermococcus fumicolans species |
| PCT/FR1998/000868 WO1998049275A2 (en) | 1997-04-29 | 1998-04-29 | Archaebacterium intein of the thermococcus fumicolans species |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/FR1998/000868 WO1998049275A2 (en) | 1997-04-29 | 1998-04-29 | Archaebacterium intein of the thermococcus fumicolans species |
Country Status (1)
| Country | Link |
|---|---|
| WO (2) | WO1998049274A1 (en) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002029016A3 (en) * | 2000-10-05 | 2002-06-27 | Qiagen Gmbh | Thermostable polymerase based on thermococcus pacificus |
| WO2006065971A3 (en) * | 2004-12-15 | 2007-01-04 | Primera Biosystem Inc | Compositions and methods for polynucleotide amplification and detection |
| US7829503B2 (en) | 2003-03-27 | 2010-11-09 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-fungal agents |
| US7939468B2 (en) | 2003-03-27 | 2011-05-10 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-proliferative agents |
| US8278085B2 (en) | 2003-07-02 | 2012-10-02 | Ptc Therapeutics, Inc. | RNA processing protein complexes and uses thereof |
| US8637288B2 (en) | 2000-02-17 | 2014-01-28 | Qiagen, Gmbh | Thermostable chimeric nucleic acid polymerases and uses thereof |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0878552A1 (en) | 1997-05-13 | 1998-11-18 | Erasmus Universiteit Rotterdam | Molecular detection of chromosome aberrations |
| FR2782325B1 (en) * | 1998-08-12 | 2002-05-24 | Proteus | METHOD OF IDENTIFYING POLYNUCLEOTIDE SEQUENCES AND / OR CORRESPONDING PROTEINS FROM A SAMPLE OF NUCLEIC ACIDS |
| JP2002513587A (en) | 1998-05-04 | 2002-05-14 | ダコ エー エス | Methods and probes for detecting chromosomal abnormalities |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0602899A2 (en) * | 1992-12-09 | 1994-06-22 | New England Biolabs, Inc. | Modified proteins containing controllable intervening sequences and methods of their production |
| EP0701000A2 (en) * | 1994-07-06 | 1996-03-13 | New England Biolabs, Inc. | Recombinant thermostable DNA polymerase from archaebacteria |
| JPH0870864A (en) * | 1994-09-02 | 1996-03-19 | Toyobo Co Ltd | New endonuclease 2 derived from superthrmophilic primordial microorganism |
-
1997
- 1997-04-29 WO PCT/FR1997/000761 patent/WO1998049274A1/en active Application Filing
-
1998
- 1998-04-29 WO PCT/FR1998/000868 patent/WO1998049275A2/en active Application Filing
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0602899A2 (en) * | 1992-12-09 | 1994-06-22 | New England Biolabs, Inc. | Modified proteins containing controllable intervening sequences and methods of their production |
| EP0701000A2 (en) * | 1994-07-06 | 1996-03-13 | New England Biolabs, Inc. | Recombinant thermostable DNA polymerase from archaebacteria |
| JPH0870864A (en) * | 1994-09-02 | 1996-03-19 | Toyobo Co Ltd | New endonuclease 2 derived from superthrmophilic primordial microorganism |
Non-Patent Citations (5)
| Title |
|---|
| GODFROY A. ET AL.: "Thermococcus fumicolans sp. nov.: a new hyperthermophilic archaeon isolated from a deep-sea hydrothermal vent in the north Fiji basin", INTERNATIONAL JOURNAL OF SYSTEMATIC BACTERIOLOGY, vol. 46, no. 4, October 1996 (1996-10-01), WASHINGTON US, pages 1113 - 1119, XP002049496 * |
| HODGES R. A. ET AL: "PROTEIN SPLICING REMOVES INTERVENING SEQUENCES IN AN ARCHAEA DNA POLYMERASE", NUCLEIC ACIDS RESEARCH, vol. 20, no. 23, 1992, pages 6153 - 6157, XP002033279 * |
| PATENT ABSTRACTS OF JAPAN vol. 096, no. 007 31 July 1996 (1996-07-31) * |
| PERLER F. B. ET AL: "INTERVENING SEQUENCES IN AN ARCHAEA DNA POLYMERASE GENE", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF USA, vol. 89, June 1992 (1992-06-01), pages 5577 - 5581, XP002033280 * |
| PIETROKOVSKI S: "A new intein in cyanobacteria and its significance for the spread of inteins", TRENDS IN GENETICS, vol. 12, no. 8, August 1996 (1996-08-01), pages 287-288, XP004037128 * |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8637288B2 (en) | 2000-02-17 | 2014-01-28 | Qiagen, Gmbh | Thermostable chimeric nucleic acid polymerases and uses thereof |
| WO2002029016A3 (en) * | 2000-10-05 | 2002-06-27 | Qiagen Gmbh | Thermostable polymerase based on thermococcus pacificus |
| US8124391B2 (en) | 2000-10-05 | 2012-02-28 | Qiagen Gmbh | Thermostable polymerases from Thermococcus pacificus |
| US7829503B2 (en) | 2003-03-27 | 2010-11-09 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-fungal agents |
| US7939468B2 (en) | 2003-03-27 | 2011-05-10 | Ptc Therapeutics, Inc. | Methods of identifying compounds that target tRNA splicing endonuclease and uses of said compounds as anti-proliferative agents |
| US8278085B2 (en) | 2003-07-02 | 2012-10-02 | Ptc Therapeutics, Inc. | RNA processing protein complexes and uses thereof |
| WO2006065971A3 (en) * | 2004-12-15 | 2007-01-04 | Primera Biosystem Inc | Compositions and methods for polynucleotide amplification and detection |
Also Published As
| Publication number | Publication date |
|---|---|
| WO1998049275A3 (en) | 1999-02-11 |
| WO1998049275A2 (en) | 1998-11-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Voorhorst et al. | Characterization of the celB gene coding for beta-glucosidase from the hyperthermophilic archaeon Pyrococcus furiosus and its expression and site-directed mutation in Escherichia coli | |
| JP3626226B2 (en) | Thermostable nucleic acid polymerase | |
| US5882904A (en) | Thermococcus barossii DNA polymerase mutants | |
| JP4446498B2 (en) | Overexpression and purification of truncated thermostable DNA polymerase by protein fusion | |
| US6699981B2 (en) | Method and compositions for improved polynucleotide synthesis | |
| JP2000508538A (en) | Biologically active fragments of Bacillus stearothermophilus DNA polymerase | |
| JPH05506364A (en) | Thermostable DNA polymerase 5→3' exonuclease mutation | |
| JPH08298991A (en) | Modified DNA polymerase | |
| JPH08168376A (en) | Thermally stable recombinant dna polymerase derived from archaebacteria | |
| KR19990076834A (en) | New DNA Polymerase | |
| JP2002505572A (en) | Polymerase enhancing factor (PEF) extract, PEF protein complex, isolated PEF protein, and methods for purification and isolation | |
| CN114008197A (en) | Metabolic engineering for simultaneous consumption of xylose and glucose to produce chemicals from second-generation sugars | |
| WO1998049274A1 (en) | Thermostable dna polymerase and inteins of the thermococcus fumicolans species | |
| JP3453397B2 (en) | Purified thermostable DNA polymerase obtained from Pyrococcus species | |
| CN113755465A (en) | Chimeric DNA polymerase and method for preparing same | |
| Neveling et al. | Purification of the pyruvate dehydrogenase multienzyme complex of Zymomonas mobilis and identification and sequence analysis of the corresponding genes | |
| FR2778412A1 (en) | Production of recombinant thermophilic alpha-amylase active at low pH | |
| US20070009924A1 (en) | Thermococcus zilligii DNA polymerases and variants thereof | |
| US20160298095A1 (en) | Nucleic acid molecules for increased protein production | |
| EP0920516A2 (en) | Heat-stable dna polymerase of archaeobacteria of genus pyrococcus sp. | |
| US8124391B2 (en) | Thermostable polymerases from Thermococcus pacificus | |
| US5861296A (en) | Purified thermostable inorganic pyprophosphatase obtainable from thermococcus litoralis | |
| RU2221867C2 (en) | Purified thermococcus gorgonarius thermostable dna polymerase , its preparing and using | |
| US20050084938A1 (en) | Method for plasmid preparation by conversion of open circular plasmid to supercoiled plasmid | |
| KR0136446B1 (en) | Mass production method of novel heat-resistant thiophyll DNA synthesis enzyme |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): CA JP US |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| NENP | Non-entry into the national phase |
Ref country code: JP Ref document number: 1998546641 Format of ref document f/p: F |
|
| 122 | Ep: pct application non-entry in european phase |




