KR20170132201A - 세균 감염의 치료를 위한 살세균제와 라이소좀향성 알칼리화제와의 조합물 - Google Patents
세균 감염의 치료를 위한 살세균제와 라이소좀향성 알칼리화제와의 조합물 Download PDFInfo
- Publication number
- KR20170132201A KR20170132201A KR1020177028879A KR20177028879A KR20170132201A KR 20170132201 A KR20170132201 A KR 20170132201A KR 1020177028879 A KR1020177028879 A KR 1020177028879A KR 20177028879 A KR20177028879 A KR 20177028879A KR 20170132201 A KR20170132201 A KR 20170132201A
- Authority
- KR
- South Korea
- Prior art keywords
- gly
- lys
- val
- ala
- thr
- Prior art date
Links
- 238000011282 treatment Methods 0.000 title claims abstract description 38
- 208000035143 Bacterial infection Diseases 0.000 title claims abstract description 17
- 208000022362 bacterial infectious disease Diseases 0.000 title claims abstract description 17
- 239000003795 chemical substances by application Substances 0.000 title claims description 55
- 230000003113 alkalizing effect Effects 0.000 title claims description 19
- 239000003899 bactericide agent Substances 0.000 title description 21
- 230000000844 anti-bacterial effect Effects 0.000 title description 6
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 97
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 89
- 229920001184 polypeptide Polymers 0.000 claims description 82
- 210000004027 cell Anatomy 0.000 claims description 81
- 230000001580 bacterial effect Effects 0.000 claims description 56
- 241000894006 Bacteria Species 0.000 claims description 53
- 230000003834 intracellular effect Effects 0.000 claims description 52
- WHTVZRBIWZFKQO-AWEZNQCLSA-N (S)-chloroquine Chemical compound ClC1=CC=C2C(N[C@@H](C)CCCN(CC)CC)=CC=NC2=C1 WHTVZRBIWZFKQO-AWEZNQCLSA-N 0.000 claims description 37
- 229960003677 chloroquine Drugs 0.000 claims description 37
- WHTVZRBIWZFKQO-UHFFFAOYSA-N chloroquine Natural products ClC1=CC=C2C(NC(C)CCCN(CC)CC)=CC=NC2=C1 WHTVZRBIWZFKQO-UHFFFAOYSA-N 0.000 claims description 37
- 210000002421 cell wall Anatomy 0.000 claims description 36
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 31
- 108091033319 polynucleotide Proteins 0.000 claims description 30
- 102000040430 polynucleotide Human genes 0.000 claims description 30
- 239000002157 polynucleotide Substances 0.000 claims description 30
- 102000004169 proteins and genes Human genes 0.000 claims description 30
- 108090000623 proteins and genes Proteins 0.000 claims description 30
- 108090000790 Enzymes Proteins 0.000 claims description 28
- 102000004190 Enzymes Human genes 0.000 claims description 28
- 238000000034 method Methods 0.000 claims description 27
- 108010062877 Bacteriocins Proteins 0.000 claims description 26
- 208000015181 infectious disease Diseases 0.000 claims description 24
- 230000026683 transduction Effects 0.000 claims description 23
- 238000010361 transduction Methods 0.000 claims description 23
- 239000003242 anti bacterial agent Substances 0.000 claims description 21
- 108700042778 Antimicrobial Peptides Proteins 0.000 claims description 16
- 102000044503 Antimicrobial Peptides Human genes 0.000 claims description 16
- 230000002085 persistent effect Effects 0.000 claims description 15
- 230000009089 cytolysis Effects 0.000 claims description 14
- 239000003910 polypeptide antibiotic agent Substances 0.000 claims description 14
- 239000004472 Lysine Substances 0.000 claims description 11
- 230000003115 biocidal effect Effects 0.000 claims description 11
- 230000001965 increasing effect Effects 0.000 claims description 10
- 241001515965 unidentified phage Species 0.000 claims description 10
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 claims description 8
- 241000191940 Staphylococcus Species 0.000 claims description 7
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 claims description 6
- -1 bar philomicin A1 Chemical compound 0.000 claims description 6
- 235000019270 ammonium chloride Nutrition 0.000 claims description 4
- 230000002080 lysosomotropic effect Effects 0.000 claims description 3
- 229930186147 Cephalosporin Natural products 0.000 claims description 2
- 239000004473 Threonine Substances 0.000 claims description 2
- 108010059993 Vancomycin Proteins 0.000 claims description 2
- WZPBZJONDBGPKJ-VEHQQRBSSA-N aztreonam Chemical compound O=C1N(S([O-])(=O)=O)[C@@H](C)[C@@H]1NC(=O)C(=N/OC(C)(C)C(O)=O)\C1=CSC([NH3+])=N1 WZPBZJONDBGPKJ-VEHQQRBSSA-N 0.000 claims description 2
- 229960003644 aztreonam Drugs 0.000 claims description 2
- 239000003782 beta lactam antibiotic agent Substances 0.000 claims description 2
- YZBQHRLRFGPBSL-RXMQYKEDSA-N carbapenem Chemical compound C1C=CN2C(=O)C[C@H]21 YZBQHRLRFGPBSL-RXMQYKEDSA-N 0.000 claims description 2
- 229940124587 cephalosporin Drugs 0.000 claims description 2
- 150000001780 cephalosporins Chemical class 0.000 claims description 2
- 229940124307 fluoroquinolone Drugs 0.000 claims description 2
- 229960000564 nitrofurantoin Drugs 0.000 claims description 2
- NXFQHRVNIOXGAQ-YCRREMRBSA-N nitrofurantoin Chemical compound O1C([N+](=O)[O-])=CC=C1\C=N\N1C(=O)NC(=O)C1 NXFQHRVNIOXGAQ-YCRREMRBSA-N 0.000 claims description 2
- 150000002960 penicillins Chemical class 0.000 claims description 2
- LJVAJPDWBABPEJ-PNUFFHFMSA-N telithromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)[C@@H](C)C(=O)O[C@@H]([C@]2(OC(=O)N(CCCCN3C=C(N=C3)C=3C=NC=CC=3)[C@@H]2[C@@H](C)C(=O)[C@H](C)C[C@@]1(C)OC)C)CC)[C@@H]1O[C@H](C)C[C@H](N(C)C)[C@H]1O LJVAJPDWBABPEJ-PNUFFHFMSA-N 0.000 claims description 2
- 229960003250 telithromycin Drugs 0.000 claims description 2
- 229960002898 threonine Drugs 0.000 claims description 2
- 229960003165 vancomycin Drugs 0.000 claims description 2
- MYPYJXKWCTUITO-LYRMYLQWSA-N vancomycin Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=C2C=C3C=C1OC1=CC=C(C=C1Cl)[C@@H](O)[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@H]3C(=O)N[C@H]1C(=O)N[C@H](C(N[C@@H](C3=CC(O)=CC(O)=C3C=3C(O)=CC=C1C=3)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)O2)=O)NC(=O)[C@@H](CC(C)C)NC)[C@H]1C[C@](C)(N)[C@H](O)[C@H](C)O1 MYPYJXKWCTUITO-LYRMYLQWSA-N 0.000 claims description 2
- MYPYJXKWCTUITO-UHFFFAOYSA-N vancomycin Natural products O1C(C(=C2)Cl)=CC=C2C(O)C(C(NC(C2=CC(O)=CC(O)=C2C=2C(O)=CC=C3C=2)C(O)=O)=O)NC(=O)C3NC(=O)C2NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(CC(C)C)NC)C(O)C(C=C3Cl)=CC=C3OC3=CC2=CC1=C3OC1OC(CO)C(O)C(O)C1OC1CC(C)(N)C(O)C(C)O1 MYPYJXKWCTUITO-UHFFFAOYSA-N 0.000 claims description 2
- 239000002132 β-lactam antibiotic Substances 0.000 claims description 2
- 229940124586 β-lactam antibiotics Drugs 0.000 claims description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 claims 2
- 239000002647 aminoglycoside antibiotic agent Substances 0.000 claims 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 claims 1
- 210000000680 phagosome Anatomy 0.000 claims 1
- 108020001580 protein domains Proteins 0.000 claims 1
- 229960004889 salicylic acid Drugs 0.000 claims 1
- 239000003814 drug Substances 0.000 abstract description 3
- 239000012634 fragment Substances 0.000 description 53
- PCDUALPXEOKZPE-DXCABUDRSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O PCDUALPXEOKZPE-DXCABUDRSA-N 0.000 description 42
- 108020004414 DNA Proteins 0.000 description 36
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 29
- 239000000203 mixture Substances 0.000 description 28
- 108010050848 glycylleucine Proteins 0.000 description 26
- 235000018102 proteins Nutrition 0.000 description 26
- 108010073969 valyllysine Proteins 0.000 description 26
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 25
- 108010015792 glycyllysine Proteins 0.000 description 24
- 229940088598 enzyme Drugs 0.000 description 23
- 239000013598 vector Substances 0.000 description 23
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 21
- 108010079364 N-glycylalanine Proteins 0.000 description 18
- 206010057190 Respiratory tract infections Diseases 0.000 description 18
- 150000001413 amino acids Chemical group 0.000 description 18
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 16
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 16
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 16
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 16
- 108010038745 tryptophylglycine Proteins 0.000 description 16
- 108010051110 tyrosyl-lysine Proteins 0.000 description 16
- 108010044940 alanylglutamine Proteins 0.000 description 15
- 108010047495 alanylglycine Proteins 0.000 description 15
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 14
- 235000001014 amino acid Nutrition 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- 229940088710 antibiotic agent Drugs 0.000 description 13
- 108010047857 aspartylglycine Proteins 0.000 description 13
- 230000000694 effects Effects 0.000 description 13
- 108010085325 histidylproline Proteins 0.000 description 13
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 13
- 108700004896 tripeptide FEG Proteins 0.000 description 13
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 12
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 12
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 12
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 12
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 12
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 12
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 12
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 12
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 12
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 12
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 12
- KHTIUAKJRUIEMA-HOUAVDHOSA-N Thr-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 KHTIUAKJRUIEMA-HOUAVDHOSA-N 0.000 description 12
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 12
- 230000012010 growth Effects 0.000 description 12
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 11
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 11
- 108010081551 glycylphenylalanine Proteins 0.000 description 11
- 108010037850 glycylvaline Proteins 0.000 description 11
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 10
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 10
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 10
- 229940024606 amino acid Drugs 0.000 description 10
- 108010040030 histidinoalanine Proteins 0.000 description 10
- 108010044292 tryptophyltyrosine Proteins 0.000 description 10
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 9
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 9
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 9
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 9
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 9
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 9
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 9
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 9
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 9
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 9
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 9
- CMPHFUWXKBPNRS-WDSOQIARSA-N His-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 CMPHFUWXKBPNRS-WDSOQIARSA-N 0.000 description 9
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 9
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 9
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 9
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 9
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 9
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 9
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 9
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 9
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 9
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 108010016616 cysteinylglycine Proteins 0.000 description 9
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 9
- 108010084389 glycyltryptophan Proteins 0.000 description 9
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 9
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 9
- LVPCJMUBOHOZHE-UHFFFAOYSA-N 4-amino-2-[[2-[[2-[(2-amino-3-methylbutanoyl)amino]-3-methylpentanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-4-oxobutanoic acid Chemical compound CC(C)C(N)C(=O)NC(C(C)CC)C(=O)NC(C(=O)NC(CC(N)=O)C(O)=O)CC1=CN=CN1 LVPCJMUBOHOZHE-UHFFFAOYSA-N 0.000 description 8
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 8
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 8
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 8
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 8
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 8
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 8
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 8
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 8
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 8
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 8
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 8
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 8
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 8
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 8
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 8
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 8
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 8
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 8
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 8
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 8
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 8
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 8
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 8
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 8
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 8
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 8
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 8
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 8
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 8
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 8
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 8
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 8
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 8
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 8
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 8
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 8
- NIOPEYHPOBWLQO-KBPBESRZSA-N Gly-Trp-Glu Chemical compound NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOPEYHPOBWLQO-KBPBESRZSA-N 0.000 description 8
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 8
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 8
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 8
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 8
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 8
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 8
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 8
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 8
- LEHPJMKVGFPSSP-ZQINRCPSSA-N Ile-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 LEHPJMKVGFPSSP-ZQINRCPSSA-N 0.000 description 8
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 8
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 8
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 8
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 8
- 108010065920 Insulin Lispro Proteins 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 8
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 8
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 8
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 8
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 8
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 8
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 8
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 8
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 8
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 8
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 8
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 8
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 8
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 8
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 8
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 8
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 8
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 8
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 8
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 8
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 8
- ZFVWWUILVLLVFA-AVGNSLFASA-N Phe-Gln-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N ZFVWWUILVLLVFA-AVGNSLFASA-N 0.000 description 8
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 8
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 8
- FKFCKDROTNIVSO-JYJNAYRXSA-N Phe-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O FKFCKDROTNIVSO-JYJNAYRXSA-N 0.000 description 8
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 8
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 8
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 8
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 8
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 8
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 8
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 8
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 8
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 8
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 8
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 8
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 8
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 8
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 8
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 8
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 8
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 8
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 8
- YRSOERSDNRSCBC-XIRDDKMYSA-N Trp-His-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CS)C(=O)O)N YRSOERSDNRSCBC-XIRDDKMYSA-N 0.000 description 8
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 8
- UPNRACRNHISCAF-SZMVWBNQSA-N Trp-Lys-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UPNRACRNHISCAF-SZMVWBNQSA-N 0.000 description 8
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 8
- WHJVRIBYQWHRQA-NQCBNZPSSA-N Trp-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 WHJVRIBYQWHRQA-NQCBNZPSSA-N 0.000 description 8
- ZKVANNIVSDOQMG-HKUYNNGSSA-N Trp-Tyr-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)NCC(=O)O)N ZKVANNIVSDOQMG-HKUYNNGSSA-N 0.000 description 8
- SJWLQICJOBMOGG-PMVMPFDFSA-N Trp-Tyr-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O)N SJWLQICJOBMOGG-PMVMPFDFSA-N 0.000 description 8
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 8
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 8
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 8
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 8
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 8
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 8
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 8
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 8
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 8
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 8
- 108010038633 aspartylglutamate Proteins 0.000 description 8
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 8
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 8
- 108010034507 methionyltryptophan Proteins 0.000 description 8
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 8
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 7
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 7
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 7
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 7
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 7
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 7
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 7
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 7
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 7
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 7
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 7
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 7
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 7
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 7
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 7
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 7
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 7
- FBCURAVMSXNOLP-JYJNAYRXSA-N His-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBCURAVMSXNOLP-JYJNAYRXSA-N 0.000 description 7
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 7
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 7
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 7
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 7
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 7
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 7
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 7
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 7
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 7
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 7
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 7
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 7
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 7
- YOFKMVUAZGPFCF-IHRRRGAJSA-N Phe-Met-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O YOFKMVUAZGPFCF-IHRRRGAJSA-N 0.000 description 7
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 7
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 7
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 7
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 7
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 7
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 7
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 7
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 7
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 7
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 7
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 7
- XYNFFTNEQDWZNY-ULQDDVLXSA-N Tyr-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N XYNFFTNEQDWZNY-ULQDDVLXSA-N 0.000 description 7
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 7
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 7
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 7
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 7
- 108010093581 aspartyl-proline Proteins 0.000 description 7
- 201000008968 osteosarcoma Diseases 0.000 description 7
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 7
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 6
- 241000972773 Aulopiformes Species 0.000 description 6
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 6
- 241000699670 Mus sp. Species 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 239000001963 growth medium Substances 0.000 description 6
- 230000002147 killing effect Effects 0.000 description 6
- 210000003712 lysosome Anatomy 0.000 description 6
- 230000001868 lysosomic effect Effects 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 244000052769 pathogen Species 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 230000003362 replicative effect Effects 0.000 description 6
- 235000019515 salmon Nutrition 0.000 description 6
- 238000004626 scanning electron microscopy Methods 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 5
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 5
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 5
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 5
- 244000063299 Bacillus subtilis Species 0.000 description 5
- 235000014469 Bacillus subtilis Nutrition 0.000 description 5
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 5
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 5
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 5
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 5
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 5
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 5
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 5
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 5
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 5
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 5
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 5
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 5
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 5
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 5
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 5
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 5
- 241000607142 Salmonella Species 0.000 description 5
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 5
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 5
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 5
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 5
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 5
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 5
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 5
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 5
- 108010038320 lysylphenylalanine Proteins 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 230000008685 targeting Effects 0.000 description 5
- 238000002560 therapeutic procedure Methods 0.000 description 5
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 5
- QVVDVENEPNODSI-BTNSXGMBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-5-(diaminomethylidene Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QVVDVENEPNODSI-BTNSXGMBSA-N 0.000 description 4
- KZDCMKVLEYCGQX-UDPGNSCCSA-N 2-(diethylamino)ethyl 4-aminobenzoate;(2s,5r,6r)-3,3-dimethyl-7-oxo-6-[(2-phenylacetyl)amino]-4-thia-1-azabicyclo[3.2.0]heptane-2-carboxylic acid;hydrate Chemical compound O.CCN(CC)CCOC(=O)C1=CC=C(N)C=C1.N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 KZDCMKVLEYCGQX-UDPGNSCCSA-N 0.000 description 4
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 4
- 241000606161 Chlamydia Species 0.000 description 4
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 4
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 4
- 108700003968 Human immunodeficiency virus 1 tat peptide (49-57) Proteins 0.000 description 4
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 4
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 4
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 4
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 4
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 4
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 4
- 206010031252 Osteomyelitis Diseases 0.000 description 4
- 241000191967 Staphylococcus aureus Species 0.000 description 4
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 4
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 4
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 230000002378 acidificating effect Effects 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 125000002091 cationic group Chemical group 0.000 description 4
- 206010014665 endocarditis Diseases 0.000 description 4
- 230000008029 eradication Effects 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 210000004379 membrane Anatomy 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 238000006386 neutralization reaction Methods 0.000 description 4
- 230000002688 persistence Effects 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 230000005945 translocation Effects 0.000 description 4
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 3
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 3
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 239000006145 Eagle's minimal essential medium Substances 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 3
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 3
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 3
- 206010041925 Staphylococcal infections Diseases 0.000 description 3
- 241000947772 Strawberry crinkle virus Species 0.000 description 3
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 3
- 230000032823 cell division Effects 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000034994 death Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 210000002919 epithelial cell Anatomy 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 239000004615 ingredient Substances 0.000 description 3
- 210000002540 macrophage Anatomy 0.000 description 3
- 208000015688 methicillin-resistant staphylococcus aureus infectious disease Diseases 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 150000007523 nucleic acids Chemical class 0.000 description 3
- 230000001717 pathogenic effect Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 2
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- 101800002011 Amphipathic peptide Proteins 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Chemical compound OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 208000031729 Bacteremia Diseases 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 101710150190 Beta-secretase 2 Proteins 0.000 description 2
- 241001445332 Coxiella <snail> Species 0.000 description 2
- 101000925662 Enterobacteria phage PRD1 Endolysin Proteins 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 2
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 2
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 2
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 2
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 2
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 2
- 102100038225 Lysosome-associated membrane glycoprotein 2 Human genes 0.000 description 2
- 101710116771 Lysosome-associated membrane glycoprotein 2 Proteins 0.000 description 2
- 108090000988 Lysostaphin Proteins 0.000 description 2
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 108010053775 Nisin Proteins 0.000 description 2
- NVNLLIYOARQCIX-MSHCCFNRSA-N Nisin Chemical compound N1C(=O)[C@@H](CC(C)C)NC(=O)C(=C)NC(=O)[C@@H]([C@H](C)CC)NC(=O)[C@@H](NC(=O)C(=C/C)/NC(=O)[C@H](N)[C@H](C)CC)CSC[C@@H]1C(=O)N[C@@H]1C(=O)N2CCC[C@@H]2C(=O)NCC(=O)N[C@@H](C(=O)N[C@H](CCCCN)C(=O)N[C@@H]2C(NCC(=O)N[C@H](C)C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCSC)C(=O)NCC(=O)N[C@H](CS[C@@H]2C)C(=O)N[C@H](CC(N)=O)C(=O)N[C@H](CCSC)C(=O)N[C@H](CCCCN)C(=O)N[C@@H]2C(N[C@H](C)C(=O)N[C@@H]3C(=O)N[C@@H](C(N[C@H](CC=4NC=NC=4)C(=O)N[C@H](CS[C@@H]3C)C(=O)N[C@H](CO)C(=O)N[C@H]([C@H](C)CC)C(=O)N[C@H](CC=3NC=NC=3)C(=O)N[C@H](C(C)C)C(=O)NC(=C)C(=O)N[C@H](CCCCN)C(O)=O)=O)CS[C@@H]2C)=O)=O)CS[C@@H]1C NVNLLIYOARQCIX-MSHCCFNRSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- YGYAWVDWMABLBF-UHFFFAOYSA-N Phosgene Chemical compound ClC(Cl)=O YGYAWVDWMABLBF-UHFFFAOYSA-N 0.000 description 2
- 208000035415 Reinfection Diseases 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- 241000193985 Streptococcus agalactiae Species 0.000 description 2
- 101710172711 Structural protein Proteins 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 2
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 2
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 2
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 210000003567 ascitic fluid Anatomy 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000006037 cell lysis Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 210000002889 endothelial cell Anatomy 0.000 description 2
- 238000000799 fluorescence microscopy Methods 0.000 description 2
- 239000000417 fungicide Substances 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 231100000518 lethal Toxicity 0.000 description 2
- 230000001665 lethal effect Effects 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 230000002132 lysosomal effect Effects 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000004309 nisin Substances 0.000 description 2
- 235000010297 nisin Nutrition 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000002741 palatine tonsil Anatomy 0.000 description 2
- 230000035515 penetration Effects 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000000241 respiratory effect Effects 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- BEJKOYIMCGMNRB-GRHHLOCNSA-N (2s)-2-amino-3-(4-hydroxyphenyl)propanoic acid;(2s)-2-amino-3-phenylpropanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1.OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BEJKOYIMCGMNRB-GRHHLOCNSA-N 0.000 description 1
- WPXFILQZNKUYQO-BZSNNMDCSA-N 2-[[(2s)-2-[[(2s)-1-[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 WPXFILQZNKUYQO-BZSNNMDCSA-N 0.000 description 1
- GXPCCSYVSYFRDU-LJWNLINESA-N 2-[[(2s)-2-[[(2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O)C1=CC=CC=C1 GXPCCSYVSYFRDU-LJWNLINESA-N 0.000 description 1
- 241000606750 Actinobacillus Species 0.000 description 1
- 241000186046 Actinomyces Species 0.000 description 1
- 241000606749 Aggregatibacter actinomycetemcomitans Species 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- OIRCZHKOHJUHAC-SIUGBPQLSA-N Ala-Val-Asp-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OIRCZHKOHJUHAC-SIUGBPQLSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 1
- 244000105975 Antidesma platyphyllum Species 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- WOPFJPHVBWKZJH-SRVKXCTJSA-N Arg-Arg-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O WOPFJPHVBWKZJH-SRVKXCTJSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- BQBPFMNVOWDLHO-XIRDDKMYSA-N Arg-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N BQBPFMNVOWDLHO-XIRDDKMYSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- RFNDQEWMNJMQHD-SZMVWBNQSA-N Arg-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RFNDQEWMNJMQHD-SZMVWBNQSA-N 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 1
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- 241000193738 Bacillus anthracis Species 0.000 description 1
- 241000193388 Bacillus thuringiensis Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 208000031462 Bovine Mastitis Diseases 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 241000589562 Brucella Species 0.000 description 1
- 206010006500 Brucellosis Diseases 0.000 description 1
- 241000606153 Chlamydia trachomatis Species 0.000 description 1
- 108010069514 Cyclic Peptides Proteins 0.000 description 1
- 102000001189 Cyclic Peptides Human genes 0.000 description 1
- LWYKPOCGGTYAIH-FXQIFTODSA-N Cys-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LWYKPOCGGTYAIH-FXQIFTODSA-N 0.000 description 1
- XSELZJJGSKZZDO-UBHSHLNASA-N Cys-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XSELZJJGSKZZDO-UBHSHLNASA-N 0.000 description 1
- 206010064687 Device related infection Diseases 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000207202 Gardnerella Species 0.000 description 1
- 241000207201 Gardnerella vaginalis Species 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 1
- UJWYPUUXIAKEES-CUJWVEQBSA-N His-Cys-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UJWYPUUXIAKEES-CUJWVEQBSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- ZFWISYLMLXFBSX-KKPKCPPISA-N Ile-Trp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N ZFWISYLMLXFBSX-KKPKCPPISA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- 206010024229 Leprosy Diseases 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- 241000186779 Listeria monocytogenes Species 0.000 description 1
- 206010024641 Listeriosis Diseases 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- QBGPXOGXCVKULO-BQBZGAKWSA-N Lys-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(O)=O QBGPXOGXCVKULO-BQBZGAKWSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- FMIIKPHLJKUXGE-GUBZILKMSA-N Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN FMIIKPHLJKUXGE-GUBZILKMSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- PWPBGAJJYJJVPI-PJODQICGSA-N Met-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 PWPBGAJJYJJVPI-PJODQICGSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- ZEVPMOHYCQFWSE-NAKRPEOUSA-N Met-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N ZEVPMOHYCQFWSE-NAKRPEOUSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 1
- 241000186362 Mycobacterium leprae Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 208000006816 Neonatal Sepsis Diseases 0.000 description 1
- 102000005348 Neuraminidase Human genes 0.000 description 1
- 108010006232 Neuraminidase Proteins 0.000 description 1
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 1
- 241000187654 Nocardia Species 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108010080032 Pediocins Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010013639 Peptidoglycan Proteins 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- 201000007100 Pharyngitis Diseases 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 1
- MCPXQHVVCPTRIM-HJOGWXRNSA-N Pro-Trp-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)[C@@H]1CCCN1 MCPXQHVVCPTRIM-HJOGWXRNSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 208000025747 Rheumatic disease Diseases 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 230000027151 SOS response Effects 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 206010040047 Sepsis Diseases 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- SBMNPABNWKXNBJ-BQBZGAKWSA-N Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO SBMNPABNWKXNBJ-BQBZGAKWSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241000607764 Shigella dysenteriae Species 0.000 description 1
- 208000031726 Spotted Fever Group Rickettsiosis Diseases 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- BGHVVGPELPHRCI-HZTRNQAASA-N Thr-Trp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N)O BGHVVGPELPHRCI-HZTRNQAASA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 1
- HYLNRGXEQACDKG-NYVOZVTQSA-N Trp-Asn-Trp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HYLNRGXEQACDKG-NYVOZVTQSA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 1
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 1
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 1
- LFGHEUIUSIRJAE-TUSQITKMSA-N Trp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LFGHEUIUSIRJAE-TUSQITKMSA-N 0.000 description 1
- ZHDQRPWESGUDST-JBACZVJFSA-N Trp-Phe-Gln Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ZHDQRPWESGUDST-JBACZVJFSA-N 0.000 description 1
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 1
- IYHRKILQAQWODS-VJBMBRPKSA-N Trp-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IYHRKILQAQWODS-VJBMBRPKSA-N 0.000 description 1
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 208000037386 Typhoid Diseases 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- ULUXAIYMVXLDQP-PMVMPFDFSA-N Tyr-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ULUXAIYMVXLDQP-PMVMPFDFSA-N 0.000 description 1
- AXKADNRGSUKLKI-WIRXVTQYSA-N Tyr-Trp-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 AXKADNRGSUKLKI-WIRXVTQYSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- AFWXOGHZEKARFH-ACRUOGEOSA-N Tyr-Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 AFWXOGHZEKARFH-ACRUOGEOSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- 241000607734 Yersinia <bacteria> Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- BUFLLCUFNHESEH-UHFFFAOYSA-N [5-(2-amino-6-oxo-3h-purin-9-yl)-4-hydroxy-2-[[hydroxy(phosphonooxy)phosphoryl]oxymethyl]oxolan-3-yl] phosphono hydrogen phosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(OP(O)(=O)OP(O)(O)=O)C1O BUFLLCUFNHESEH-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 210000001130 astrocyte Anatomy 0.000 description 1
- 210000004961 autolysosome Anatomy 0.000 description 1
- 229940097012 bacillus thuringiensis Drugs 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000002798 bone marrow cell Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 125000000837 carbohydrate group Chemical group 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 230000008618 cell wall macromolecule catabolic process Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- RKLXDNHNLPUQRB-TVJUEJKUSA-N chembl564271 Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]2C(C)SC[C@H](N[C@@H](CC(N)=O)C(=O)NC(=O)[C@@H](NC2=O)CSC1C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NC(=C)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NC(=O)C(=C\C)/NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H]2NC(=O)CNC(=O)[C@@H]3CCCN3C(=O)[C@@H](NC(=O)[C@H]3N[C@@H](CC(C)C)C(=O)NC(=O)C(=C)NC(=O)CC[C@H](NC(=O)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC=4C5=CC=CC=C5NC=4)CSC3)C(O)=O)C(C)SC2)C(C)C)C(C)SC1)C1=CC=CC=C1 RKLXDNHNLPUQRB-TVJUEJKUSA-N 0.000 description 1
- 229960002227 clindamycin Drugs 0.000 description 1
- KDLRVYVGXIQJDK-AWPVFWJPSA-N clindamycin Chemical compound CN1C[C@H](CCC)C[C@H]1C(=O)N[C@H]([C@H](C)Cl)[C@@H]1[C@H](O)[C@H](O)[C@@H](O)[C@@H](SC)O1 KDLRVYVGXIQJDK-AWPVFWJPSA-N 0.000 description 1
- 238000011284 combination treatment Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 208000001848 dysentery Diseases 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 208000028104 epidemic louse-borne typhus Diseases 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 239000012894 fetal calf serum Substances 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 235000009424 haa Nutrition 0.000 description 1
- 230000035876 healing Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 244000052637 human pathogen Species 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 201000001371 inclusion conjunctivitis Diseases 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 201000007119 infective endocarditis Diseases 0.000 description 1
- 208000000509 infertility Diseases 0.000 description 1
- 230000036512 infertility Effects 0.000 description 1
- 208000021267 infertility disease Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 230000010189 intracellular transport Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 210000002510 keratinocyte Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 210000002429 large intestine Anatomy 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000009593 lumbar puncture Methods 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 201000004792 malaria Diseases 0.000 description 1
- 210000005075 mammary gland Anatomy 0.000 description 1
- 208000004396 mastitis Diseases 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010067215 mersacidin Proteins 0.000 description 1
- JSWKNDSDVHJUKY-CYGWNLPQSA-N mersacidin Chemical compound C([C@@H](C(=O)N[C@@H]1[C@H](C)SC[C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)CNC(=O)CNC(=O)CNC(=O)[C@@H]2CCCN2C(=O)[C@H](CC(C)C)NC1=O)C(=O)N[C@@H]1[C@H](C)SC[C@H]2C(=O)N[C@H](C(N/C=C/S[C@@H](C)C(NC(=O)[C@H](CC(C)C)NC1=O)C(=O)NC(=C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2)=O)[C@H](C)CC)NC(=O)[C@H]1[C@@H](SC[C@H](N)C(=O)N1)C)C1=CC=CC=C1 JSWKNDSDVHJUKY-CYGWNLPQSA-N 0.000 description 1
- 230000001394 metastastic effect Effects 0.000 description 1
- 206010061289 metastatic neoplasm Diseases 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 229960000282 metronidazole Drugs 0.000 description 1
- VAOCPAMSLUNLGC-UHFFFAOYSA-N metronidazole Chemical compound CC1=NC=C([N+]([O-])=O)N1CCO VAOCPAMSLUNLGC-UHFFFAOYSA-N 0.000 description 1
- 108010012906 microbisporicin Proteins 0.000 description 1
- 210000000274 microglia Anatomy 0.000 description 1
- 210000004980 monocyte derived macrophage Anatomy 0.000 description 1
- 210000005087 mononuclear cell Anatomy 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 210000000963 osteoblast Anatomy 0.000 description 1
- 230000001582 osteoblastic effect Effects 0.000 description 1
- 230000036542 oxidative stress Effects 0.000 description 1
- 230000020477 pH reduction Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 239000012466 permeate Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 208000011354 prosthesis-related infectious disease Diseases 0.000 description 1
- 208000008128 pulmonary tuberculosis Diseases 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000001533 respiratory mucosa Anatomy 0.000 description 1
- 230000028617 response to DNA damage stimulus Effects 0.000 description 1
- 208000013223 septicemia Diseases 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 1
- 201000004284 spotted fever Diseases 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 210000002536 stromal cell Anatomy 0.000 description 1
- 238000013337 sub-cultivation Methods 0.000 description 1
- 108010082567 subtilin Proteins 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 108700029760 synthetic LTSP Proteins 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 206010044008 tonsillitis Diseases 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 206010044325 trachoma Diseases 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000004627 transmission electron microscopy Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 201000008827 tuberculosis Diseases 0.000 description 1
- 238000007492 two-way ANOVA Methods 0.000 description 1
- 201000008297 typhoid fever Diseases 0.000 description 1
- 206010061393 typhus Diseases 0.000 description 1
- 108700042752 tyrosyl-prolyl-leucyl-glycine Proteins 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 150000003952 β-lactams Chemical class 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
- A61K38/16—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- A61K38/43—Enzymes; Proenzymes; Derivatives thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/33—Heterocyclic compounds
- A61K31/395—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins
- A61K31/435—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins having six-membered rings with one nitrogen as the only ring hetero atom
- A61K31/47—Quinolines; Isoquinolines
- A61K31/4706—4-Aminoquinolines; 8-Aminoquinolines, e.g. chloroquine, primaquine
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/33—Heterocyclic compounds
- A61K31/395—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins
- A61K31/41—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins having five-membered rings with two or more ring hetero atoms, at least one of which being nitrogen, e.g. tetrazole
- A61K31/4164—1,3-Diazoles
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/33—Heterocyclic compounds
- A61K31/395—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins
- A61K31/41—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins having five-membered rings with two or more ring hetero atoms, at least one of which being nitrogen, e.g. tetrazole
- A61K31/425—Thiazoles
- A61K31/429—Thiazoles condensed with heterocyclic ring systems
- A61K31/43—Compounds containing 4-thia-1-azabicyclo [3.2.0] heptane ring systems, i.e. compounds containing a ring system of the formula, e.g. penicillins, penems
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/33—Heterocyclic compounds
- A61K31/395—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins
- A61K31/41—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins having five-membered rings with two or more ring hetero atoms, at least one of which being nitrogen, e.g. tetrazole
- A61K31/425—Thiazoles
- A61K31/429—Thiazoles condensed with heterocyclic ring systems
- A61K31/43—Compounds containing 4-thia-1-azabicyclo [3.2.0] heptane ring systems, i.e. compounds containing a ring system of the formula, e.g. penicillins, penems
- A61K31/431—Compounds containing 4-thia-1-azabicyclo [3.2.0] heptane ring systems, i.e. compounds containing a ring system of the formula, e.g. penicillins, penems containing further heterocyclic rings, e.g. ticarcillin, azlocillin, oxacillin
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/33—Heterocyclic compounds
- A61K31/395—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins
- A61K31/535—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins having six-membered rings with at least one nitrogen and one oxygen as the ring hetero atoms, e.g. 1,2-oxazines
- A61K31/5375—1,4-Oxazines, e.g. morpholine
- A61K31/5383—1,4-Oxazines, e.g. morpholine ortho- or peri-condensed with heterocyclic ring systems
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/33—Heterocyclic compounds
- A61K31/395—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins
- A61K31/54—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins having six-membered rings with at least one nitrogen and one sulfur as the ring hetero atoms, e.g. sulthiame
- A61K31/542—Heterocyclic compounds having nitrogen as a ring hetero atom, e.g. guanethidine or rifamycins having six-membered rings with at least one nitrogen and one sulfur as the ring hetero atoms, e.g. sulthiame ortho- or peri-condensed with heterocyclic ring systems
- A61K31/545—Compounds containing 5-thia-1-azabicyclo [4.2.0] octane ring systems, i.e. compounds containing a ring system of the formula:, e.g. cephalosporins, cefaclor, or cephalexine
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/70—Carbohydrates; Sugars; Derivatives thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/70—Carbohydrates; Sugars; Derivatives thereof
- A61K31/7042—Compounds having saccharide radicals and heterocyclic rings
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
- A61K38/16—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K45/00—Medicinal preparations containing active ingredients not provided for in groups A61K31/00 - A61K41/00
- A61K45/06—Mixtures of active ingredients without chemical characterisation, e.g. antiphlogistics and cardiaca
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2462—Lysozyme (3.2.1.17)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/55—Fusion polypeptide containing a fusion with a toxin, e.g. diphteria toxin
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/74—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Veterinary Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Pharmacology & Pharmacy (AREA)
- Epidemiology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Communicable Diseases (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
Abstract
본 발명은 의약 분야, 특히 세균 감염 및 이의 치료 분야에 관한 것이다.
Description
본 발명은 의약 분야, 특히 세균 감염 및 이의 치료 분야에 관한 것이다.
사람 병원체 스타필로코쿠스 아우레우스(Staphylococcus aureus)는 모든 사람 중 대략 1/3에서 콜로니화(colonize)되며 산업화된 사회에서 균혈증 및 감염성 심내막염의 주요 원인들 중 하나이다(1). 드러나는 항생제 내성 외에도, 지속되는 재발성 감염이 질병률 및 사망율에 지속적으로 가해지고 있다(2,3). 특히 골수염 또는 심장내막염 이후 재발률은 높으며 감염은 확실히 치유한 지 수년 후에도 재발할 수 있다(4). 감염 재발은 몇 가지 이유로 SCV(소 콜로니 변이체) 및/또는 비-복제 소집단(persister)과 관련되어 있다(5-8). 이들의 저지되거나 느린 성장 및 감소된 물질대사는 항생제가 효과적이지 않도록 한다(9-13). 더욱이, 이들은 종기내 및 숙주 세포내에서와 같이 특전이 있는 위치에 우선적으로 숨는다(14-16). 이들 특전이 있는 위치는 숙주 면역계로부터 및 세포외적으로 활성인 항생제로부터 이들을 차폐시킨다. 또한, 항생제는 효과적으로 종기를 침투하지 않으며 낮은 pH로 인하여 거의 활성적이지 않다(17-21). 따라서, '종양이 있는 부위(ubi pus ibi evacua)' - 고대에 제형화된 종기의 외과적 제거의 필수성은 고도로 활성인 항생제에도 불구하고 현재에도 여전히 적용된다.
종기와는 대조적으로, 세포내 세균은 기계적으로 제거될 수 없으며 흔히 현재 이용가능한 항생제에 의한 근절을 견딘다. 임상에서 분리된 대부분의 SCV는 아-배양(sub-cultivation)시 거대 콜로니 표현형으로 복귀한다. 이들 불안정한 특성으로 인하여, 전자 전달계에서 결함을 지닌 안정한 유전적으로 변형된 SCV 돌연변이체를 사용하여 SCV를 표적화하기 위한 새로운 전략을 발견하였으며 숙주 세포 라이소자임에 국재화하는 것으로 밝혀져 왔다(22). 그러나, 안정한 SCV 만이 임상 SCV를 부분적으로 반영한다. 이들은 거의 치명적이지 않으며(23) 매우 치명적이고 빠르게 성장하는 표현형으로 복귀하지 않는다. 결과적으로, 임상 SCV 및 소집단을 연구하기 위한 수단 및 궁극적으로 SCV 및 소집단의 효과적인 치료 방법에 대한 시급한 요구가 존재한다.
놀랍게도, 본 발명자들은 가역성 소집단 및 가역성 SCV이 낮은 pH에 의해 유도될 수 있으며 낮은 pH 적응된 세균은 구체적으로 라이소좀(낮은 pH 세포기관)내에서 세포내적으로 지속되었음을 확립하였다. 낮은 pH는 SCV를 유도하였으며 소집단은 pH가 상승되면 매우 치명적인 표현형으로 전환하였다. pH를 세포내적으로 상승시키는 것은 시험관내(in vitro) 및 생체내(in vivo)에서 낮은 pH 적응된 세균을 전환시켜 SCV 및 소집단이 일반적으로 세포내적으로 효과적이지 않은 항세균제에 대해 민감하도록 한다. 이는 피부, 호흡기 또는 편도 상피(인두편도염)를 포함하나 이에 한정되지 않는 세포내 구획에서 주로 또는 적어도 부분적으로 지속되는 감염, (소)유방염을 예방 및/또는 치료하거나, 임신한 여성에서 질 에스. 아갈락티아에(S. agalactiae) 감염(대식구내에서)을 근절하여 신생아 패혈증을 예방하거나, 에스. 뉴모니아에(S. pneumoniae)-유발된 단핵구-기원한 대식구 세포자멸사(apoptosis)를 예방/치료하거나, 폐결핵, 나병, 선회병(listeriosis), 장티푸스, 세균성 이질, 전염병, 브루셀라병, 발진티푸스, 로키산홍반열(Rocky Mountains spotted fever), 클라미디아(chlamydia), 트라코마(trachoma)를 치료하는 완전히 신규한 방법을 제공한다. 따라서, 제1 국면에서 본 발명은:
- 숙주 세포 및/또는 숙주 세포의 세포내 구획의 세포내 pH를 증가시키는 제제의 유효량을 투여하는 단계, 및
- 항세균제의 유효량을 투여하는 단계를 포함하여, 항미생물성 펩타이드을 치료하는 방법을 제공한다. 상기 방법은 본 발명에 따르는 방법으로 본원에서 언급된다.
바람직하게는, 본 발명의 구현예에서, pH에 있어서의 증가는 비-복제성 세포내 세균을 활성화시키고 항세균제는 활성화된 세포내 세균을 사멸시킨다.
본 발명의 구현예에서, 세균 감염은 바람직하게는 지속된 세균 감염이고 피부, 호흡 상피, 편도 조직, 유선을 포함하나, 이에 한정되지 않는 다양한 조직의 지속성인 에스. 아우레우스(S. aureus) 감염과 관련될 수 있다. 본 발명의 구현예에서, 세균 감염은 바람직하게는 스타필로코쿠스, 스트렙토코쿠스(예를 들면, 에스. 피오게네스(S. pyogenes), 에스. 아갈락티아에(S. agalactiae) 및 에스. 뉴모니아(S.pneumonia)), 악티노마이세스(Actinomyces), 노카르디아(Nocardia), 바실러스(Bacillus)(예를 들면, 비. 안트라키스(B. anthracis)), 콕시엘라(Coxiella)(예를 들면, 콕시엘라 부르네티이(Coxiella burnetii)), 리케챠(Rickettsia), 마이코박테리아(Mycobacteria)(예를 들면, 엠. 투베르쿨로시스(M. tuberculosis) 및 엠. 레프라에(M. leprae)), 레지오넬라(Legionella)(예를 들면, 엘. 뉴모필라(L. pneumophila)), 마이코플라즈마(Mycoplasma, 살모넬라(Salmonella)(예를 들면, 에스. 티피무리움(S. typhimurium)), 시겔라(Shigella)(예를 들면, 에스. 디센테리아에(S. dysenteriae)), 예르시니아(Yersinia)(예를 들면, 와이, 페스티스(Y. pestis)), 브루셀라(Brucella), 리스테리아(Listeria)(예를 들면, 엘. 모노사이토게네시스(L. monocytogenesis)), 악티노바실러스(Actinobacillus)(예를 들면, 에이. 악티노마이세테코미탄스(A. actinomycetemcomitans)), 가르드네렐라(Gardnerella)(예를 들면, 지. 바기날리스(G. vaginalis)), 클라미디아(Chlamydia)(예를 들면, 씨. 트라코마티스(C. trachomatis)), 및 클라미도필라(Chlamidophila)로 이루어진 세균 그룹으로부터 선택된 종에 의한 감염이고; 보다 바람직한 세균은 스타필로코쿠스(Staphylococcus)의 종이며; 스타필로코쿠스의 바람직한 종은 에스. 아우레우스이다.
본 발명의 구현예에서, 세균 감염을 유발하는 세균은 바람직하게는 세포질에서와 같이, 숙주 세포내에서 세포내에 존재하며; 세균 감염을 유발하는 보다 바람직한 세균은 바람직하게는 에스. 아우레우스의 세포내 구획내에 존재한다. 바람직하게는, 세균 감염은 MRSA를 포함하나, 이에 한정되지 않는 살세균제(예를 들면, 항생제)에 대해 내성인 세균의 적어도 하나의 소집단을 포함하는 세균의 집단을 포함한다. 본 발명의 구현예에서, 세포내 구획은 어떠한 세포내 구획일 수 있으며, 여기서 세균이 지속적으로 존재할 수 있고/있거나 지속적으로 존재하고; 바람직하게는 세포내 구획은 낮은 pH를 갖는 구획이다. 바람직한 세포내 구획은 엔도솜, 라이소좀, 파고솜, 파고라이소좀, 오토파고좀 및 오토라이소좀으로 이루어진 그룹으로부터 선택된 구획이고, 보다 바람직한 세포내 구획은 파고솜, 파고라이소좀의 라이소좀이며; 심지어 보다 바람직한 세포내 구획은 파고라이소좀이다.
치료가 요구되는 대상체내 본 발명의 구현예에서, 감염된 숙주 세포는 바람직하게는 진핵세포 숙주 세포이고; 진핵세포 숙주 세포는 바람직하게는 전문적 및 비-전문적 대식세포, 림프구 세포, 편도 조직 세포, 호흡기 상피 세포, 볼 상피 세포, 골수 세포, 골아 세포, 각질형성 세포, 근육 세포, 단핵구, 대식구, 수지 세포, 내피 세포, 상피 세포, 섬유아세포, 별아교세포, 및 미세아교 세포로 이루어진 그룹으로부터 선택되며; 보다 바람직한 진핵 숙주 세포는 전문적 또는 비-전문적 대식세포이다. 대상체는 사람 또는 동물을 포함하나, 이에 한정되지 않는, 지속성 세균 감염에 민감하거나 이로 고생하는 어떠한 대상체일 수 있다. 동물은 애완 동물일 수 있거나 사육 동물, 애견, 고양이, 가금류 등일 수 있다. 바람직한 대상체는 사람이고 신생아, 청소년, 성인 또는 노인 대상체일 수 있다.
본 발명의 구현예에서, 세포내 pH 및 또는 세포내 구획의 pH를 증가시키는 제제는 세포내 pH 및 또는 세포내 구획의 pH를 효과적으로 증가시키는 당해 분야의 숙련가에게 공지된 어떠한 제제일 수 있다.
바람직하게는, 본 발명의 구현예에서, pH 및 또는 세포내 구획의 pH를 증가시키는 제제는 알칼리화제, 바람직하게는 라이소좀향성 알칼리화제(lysosomotropic alkalizing agent)이며, 바람직하게는 클로로퀸, 바필로마이신 A1, 염화암모늄의 그룹으로부터 선택되고; 보다 바람직한 알칼리화제는 클로로퀸이다. 바람직한 알칼리화제는 숙주 세포로 도입할 수 있고 보다 바람직하게는 세포내 구획을 표적화하는 제제이며, 여기서 감염성 세균이 존재한다. 알칼리화제는 세포에 수동적으로 도입될 수 있거나 비히클(vehicle)이 사용될 수 있다. 비히클온 숙주 세포내로의 전달을 가능하도록 하는 당해 분야의 숙련가에게 공지된 어떠한 비히클일 수 있다.
본 발명의 구현예에서, 용어 "낮은 pH"는 pH 7.4보다 낮은 pH; 바람직하게는 낮은 pH는 약 pH 7.0, 6.8, 6.6, 6.4, 6.2, 6.0, 5.8, 5.6, 5.4, 5.2, 5.0, 4.8, 4.6, 4.4, 4.2, 4.0, 3.8, 3.6, 3.4, 3.2, 3.0 또는 약 2.8이다. 낮은 pH는 바람직하게는 약 pH 6.5보다 낮은 pH이다. 보다 바람직한 낮은 pH는 약 5.0이다.
본 발명의 구현예에서, 용어 pH를 상승시키는 및 pH를 증가시키는 것은 상호교환적으로 사용되며 pH를 약 0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0, 2.2, 2.4, 2.6, 2.8, 3.0, 3.2, 3.4, 3.6, 3.8, 4.0, 4.2, 4.4, 4.6 또는 약 4.8로 상승시키는 것으로 정의된다. 바람직하게는, pH는 약 pH 6 내지 약 7의 범위내로 상승되며; 바람직하게는 약 6.5 내지 약 7의 범위내로 상승된다.
pH 및 또는 세포내 구획의 pH를 증가시키는 제제의 내용에서 본 발명의 구현예에서 용어 "유효량"은 세포내 pH 및/또는 세포내 구획의 pH를 적어도 약 0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0, 2.2, 2.4, 2.6, 2.8, 3.0으로 증가시켜 상기 본원에 기재한 바와 같은 범위에 도달하기에 충분한 어떠한 양으로 정의된다. 사용된 정확한 양은 사용된 제제 및 대상체의 체중에 특히 의존할 것이다.
살세균제의 내용에서 본 발명의 구현예에서 용어 "유효량"은 세포내적으로 및/또는 세포내 구획내에서 살세균 효과를 갖기에 충분한 어떠한 양으로 정의된다. 사용된 정확한 양은 사용된 제제 및 대상체의 체중에 특히 의존할 것이다.
세포내 pH 및 또는 세포내 구획의 pH는 당해 분야의 숙련가에게 공지된 어떠한 수단으로도 평가할 수 있다. 본 발명의 구현예에서, 살세균 제제는 당해 분야의 숙련가에게 공지된 어떠한 살세균 제제일 수 있으며 바람직하게는 숙주 세포 및/또는 숙주 세포의 세포내 구획에 도입할 수 있는 살세균제이다. 바람직한 살세균제는 키메릭 살세균제이다. 바람직한 살세균제는 박테리오신 또는 이의 작용성 부분, 세균 라이신 또는 오토라이신 또는 이의 작용성 부분, 박테리오파아지 라이신 또는 이의 작용성 부분, 바이러스 라이신 또는 이의 부분, 항미생물성 펩타이드 및 항생제로 이루어진 그룹으로부터 선택된다. 추가로 바람직한 살세균제는 박테리오파아지 기원한 용해 구조 단백질(예를 들면, 테일 라이신 및 비리온 관련 용해 단백질) 또는 이러한 용해 구조 단백질 또는 박테리오파아지 라이신으로부터의 분리된 용해 도메인이다. 본 발명의 살세균제는 단독으로 사용될 수 있거나 본 발명의 2개 또는 3개 이상의 살세균제와 함께 사용될 수 있다. 바람직한 살세균제는 숙주 세포로 도입될 수 있는 것이며 보다 바람직하게는 세포내 구획에 표적화되고 여기서 감염 세균이 존재한다. 살세균제는 세포에 수동적으로 도입될 수 있거나 비히클이 사용될 수 있다. 비히클은 숙주 세포내로 전달할 수 있는 당해 분야의 숙련가에게 공지된 어떠한 비히클일 수 있다.
항생제는 당해 분야의 숙련가에게 공지된 어떠한 항생제일 수 있다. 바람직한 항생제는 페니실린 유도체, 세팔로스포린, 모노박탐, 카르바페넴, 반코마이신, 다프토마이신, 플루오로퀴놀론, 메트로니다졸, 니트로푸란토인, 공-트리목사졸, 텔리트로마이신, 아미노글리코시드성 항생제와 같은 베타-락탐 항생제로 이루어진 그룹으로부터 선택되며; 보다 바람직한 항생제는 플루플록사실린이다.
박테리오신은 당해 분야의 숙련가에게 공지된 어떠한 박테리오신, 바람직하게는 어떠한 I 내지 IV 부류의 박테리오신일 수 있다.
본원의 제I 부류 박테리오신은 작은 펩타이드 억제제이며 니신 및 다른 란티바이오틱을 포함한다.
본원의 제II 부류 박테리오신은 작은(<10 kDa) 열-안정성 단백질이다. 당해 부류는 5개의 소부류로 세분된다. 제IIa 부류 박테리오신(페디오신-유사 박테리오신)은 가장 큰 소그룹이고 당해 그룹에 걸쳐 N-말단 컨센수스(consensus) 서열 -Tyr-Gly-Asn-Gly-Val-Xaa-Cys을 함유한다. C-말단은 종-특이적인 활성에 관여하여, 표적 세포벽을 투과함으로써 세포-누출을 유발한다. 제IIb 부류 박테리오신(2개-펩타이드 박테리오신)은 활성을 위해 2개의 상이한 펩타이드를 필요로 한다. 하나의 이러한 예는 락토코신 G이며, 이는 2가 이온이 아닌, Na 및 K와 같은 1가 이온에 대해 세포 막을 투과시킨다. 이들 박테리오신 중 거의 모두는 GxxxG 모티프를 갖는다. 당해 모티프는 또한 전이막 단백질에서 발견되며, 여기서 이들은 나선-나선 상호작용에 관여한다. 박테리오신의 GxxxG 모티프는 세균 세포의 막내에서 모티프와 상호작용하여 이를 수행함으로써 세균을 사멸시킬 수 있다. 제IIc 부류는 사이클릭 펩타이드를 포함하며, 이는 공유결합으로 연결된 N-말단 및 C-말단 영역을 지닌다. 엔테로신 AS-48이 이러한 그룹의 표현형이다. 제IId 부류는 단일-펩타이드 박테리오신을 포함하며, 이는 해독후 변형되지 않고 페디오신-유사 신호를 나타내지 않는다. 이러한 그룹의 가장 우수한 예는 매우 안정한 아우레오신 A53이다. 당해 박테리오신은 고 산성 환경(HCl 6N) 하에서 안정하며, 프로테아제 및 열내성에 영향받지 않는다. 가장 최근에 제안된 소부류는 제IIe 부류이며, 이는 3개 또는 4개의 비-페디오신 유사 펩타이드로 구성된 박테리오신을 포함한다. 가장 우수한 예는 잠재적인 생물공학 적용을 지닌 엘. 모노사이토게네스(L. monocytogenes)에 대해 고도로 활성인, 4개의 펩타이드 박테리오신아우레오신인 아우레오신 A70이다.
제III 부류 박테리오신은 크고, 열-불안정성(>10 kDa)인 단백질 박테리오신이다. 당해 부류는 2개의 소부류로 세분된다: 제IIIa 소부류 또는 박테리오신 및 제IIIb 소부류. 제IIIa 소부류는 세포-벽 분해에 의해 세균 세포를 사멸시키는 펩타이드를 포함하므로, 세포 용해를 유발한다. 가장 잘 연구된 박테리오라이신은 수개의 스타필로코쿠스 아종(Staphylococcus spp.) 세포 벽, 주로 에스. 아우레우스(S. aureus)를 가수분해하는 27kDa 펩타이드인, 라이소스타핀이다. 대조적으로 제IIIb 소부류는 세포 용해를 유발하지 않고 막 전위를 파괴하여, ATP 유출을 유발함으로써 표적 세포를 사멸시키는 펩타이드를 포함한다.
제IV 부류 박테리오신은 지질 또는 탄수화물 잔기를 함유하는 복합 박테리오신으로 정의된다. 확인 실험 데이타는 2개의 독립된 그룹에 의해 서브란신 및 글리코신 F(GccF)의 특성화로 단지 최근에 확립되었다.
바람직한 박테리오신은 악시도신, 악타가르딘, 아그로신, 알베이신, 아우레오신, 아우레오신 A53, 아우레오신 A70, 카르노신, 카르노사이클린 시르쿨라린 A, 콜리신, 쿠르바티신, 디베르신, 두라마이신, 엔테로신, 엔테롤라이신, 에피데르민/갈리데르민, 에르위니오신, 가쎄리신 A, 글리시네신, 할로신, 할로두라신, 락토신 S, 락토코신, 락티신, 류코친, 라이소스타핀 마세도신, 메르사시딘, 메센테리신, 마이크로비스포리신, 마이크로신 S, 무타신, 니신, 파에니박실린, 플라노스포리신, 페디오신, 펜토신, 플란타리신, 피오신, 류테리신 6, 사카신, 살리바리신, 수브틸린, 설폴로비신, 투리신 17, 트리폴리톡신, 바리아신, 와르네리신 및 와르네린으로 이루어진 그룹으로부터 선택된다.
박테리오신은 슈도모나스 아에루기노사(Pseudomonas aeruginosa), 바람직하게는 피오신 SA189로부터의 피오신을 포함하나, 이에 한정되지 않는 세균 자체(24)로부터 기원할 수 있다(25).
항미생물성 펩타이드는 당해 분야의 숙련가에게 공지된 어떠한 항미생물성 펩타이드일 수 있다. 때때로 당해 분야에서, 항미생물성 펩타이드는 상기에서 본원에 나열된 바와 같이 박테리오신으로 고려된다. 바람직한 항미생물성 펩타이드는 양이온성 또는 다가양이온성 펩타이드, 양쪽성 펩타이드, 스시 펩타이드(sushi peptide), 데펜신 및 소수성 펩타이드로 이루어진 그룹으로부터 선택된다.
세균 오토라이신은 당해 분야의 숙련가에게 공지된 어떠한 세균 오토라이신일 수 있다. 바람직한 세균 오토라이신은 LytM이다.
박테리오파아지 라이신은 당해 분야의 숙련가게에 공지된 어떠한 박테리오파아지 라이신일 수 있다. 본원에서, 용어 박테리오파아지 라이신, 박테리오파아지 엔도라이신 및 엔도라이신은 상호교환적으로 사용된다. 바람직한 엔도라이신은 제WO2012/150858호, 제WO2013/169104호, 제WO2011/023702호, 및 제WO2012146738호에 정의된 그룹으로부터 선택되며, 이는 본원에서 이들의 전체 내용과 함께 참고로 포함된다.
살세균제는 상기 본원에 기술된 어떠한 제제일 수 있거나 이의 작용성 단편일 수 있다. 여기서, 용어 작용성 단편은 용어 작용성 도메인과 상호교환적으로 사용된다. 작용성 단편은 본원에서 동일한 조건에서 검정하는 경우 작용성 단편이 기원하는 모 단편과 비교하여 적어도 20, 30, 40, 50, 60, 70, 80, 90, 95, 또는 99, 또는 적어도 99.9%의 살세균 활성을 갖는 단편으로 정의된다. 살세균제의 바람직한 작용성 단편은 제WO2012/150858호 및 제WO2013/169104호에 기술되어 있다.
살세균제는 살세균제 및 살세균제의 작용성 단편의 융합체일 수 있거나 상이하거나, 유사하거나 동일한 살세균제 또는 살세균제의 작용성 단편의 융합체일 수 있다.
본 발명자들은, 본 발명에 따른 치료 방법의 효능이 살세균제를 숙주 세포내로 표적화하는 경우 현저히 향상된다는 놀라운 발견을 달성하였다. 이러한 표적화는 상기 본원에서 이미 기재한 바와 같이, 당해 분야의 숙련가에 공지된 어떠한 수단으로도 달성할 수 있다. 바람직한 수단은 살세균제에 작동적으로 연결된 단백질 형질도입(transduction) 도메인이며; 추가로 본원에서 본 발명에 따른 단백질 형질도입 도메인으로 언급된다. 용어 "단백질 형질도입 도메인"은 본원에서 용어 "세포 투과성 단백질(CPP)" 및 용어 "막 전좌 서열"과 사용교환적으로 사용된다. 작동적으로 연결된은 본원에서 살세균제가 세포내로 표적화된 살세균제와 단백질 형질도입 도메인의 이러한 연합으로 정의된다. 바람직한 작동 연결은 살세균제에 대한 단백질 형질도입 도메인의 공유결합성 결합의 수단에 의한 융합체이다.
본 발명의 구현예에서, 살세균제는 바람직하게는 세포 벽 용해 효소의 작용성 효소 도메인을 포함한다. 세포 벽 용해 효소는 본원에서 이것이 효과적인 세균의 세포 벽에 작용하는 어떠한 살세균제로서 정의된다.
바람직하게는 본 발명의 구현예에서, 살세균제는 세포 벽 용해 효소로부터의 작용성 도메인을 포함하며 본 발명에 따른 단백질 형질도입 도메인을 추가로 포함한다. 바람직하게는 본 발명의 구현예에서, 살세균제는 항미생물성 펩타이드를 추가로 포함할 수 있다. 이러한 융합 살세균제는 바람직하게는 양이온성 또는 다가양이온성 펩타이드, 양쪽성 펩타이드, 스시 펩타이드, 데펜신 및 소수성 펩타이드로 이루어진 그룹으로부터 선택된 항미생물성 펩타이드에 융합된 세포벽 용해 효소, 보다 바람직하게는 본원에 이의 전문이 참고로 포함된 제US8,383,102호에 기재된 융합 단백질이다. 항미생물성 펩타이드에 융합된 이러한 살세균제는 본 발명에 따른 단백질 형질도입 도메인을 추가로 포함할 수 있다.
본 발명의 구현예에서, 본원에서 상기 기술된 바와 같이 세포 벽 용해 효소의 작용성 효소 도메인을 포함하고/하거나 단백질 형질도입 도메인을 포함하고/하거나 항미생물성 펩타이드를 포함하는 살세균제는 추가로 바람직하게는 세포 벽 결합 도메인을 추가로 포함할 수 있다. 상기 세포벽 결합 도메인은 바람직하게는 치료될 세균 감염을 유발하는 세균의 펩티도글리칸 세포 벽에 결합하는 세포 벽 결합 도메인이다.
본 발명의 구현예에서, 단백질 형질도입 도메인은 당해 분야의 숙련가에게 공지된 이러한 도메인 중 어느 것일 수 있다. 바람직하게는, 본 발명에 따른 살세균제에서, 단백질 형질도입 도메인은 서열번호 12 내지 25로 이루어진 그룹으로부터 선택되며, 여기서 상기 변이체는 작용성 단백질 형질도입 도메인이고 서열번호 12 내지 25로 이루어진 그룹으로부터 선택된 서열 각각과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다.
바람직하게는, 본 발명에 따른 살세균제에서, 세포 벽 용해 효소로부터의 작용성 효소 도메인은 서열번호 1 내지 7, 또는 이의 변이체로 이루어진 그룹으로부터 선택되며, 여기서 상기 변이체는 세포 벽 용해 효소로부터의 작용성 효소 도메인이고 서열번호 1 내지 7로 이루어진 그룹으로부터 선택된 서열 각각과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다.
바람직하게는, 본 발명에 따른 살세균제에서, 세포 벽 결합 도메인은 서열 번호; 8 내지 11, 또는 이의 변이체로 이루어진 그룹으로부터 선택되고, 여기서 상기 변이체는 서열번호 8 내지 11로 이루어진 그룹으로부터 선택된 서열 각각과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다.
바람직하게는, 본 발명에 따른 살세균제에서, 항미생물성 펩타이드는 서열번호 70 내지 90, 또는 이의 변이체로 이루어진 그룹으로부터 선택되며, 여기서 상기 변이체는 작용성 세포 벽 결합 도메인이고 서열번호 70 내지 90으로 이루어진 그룹으로부터 선택된 서열 각각과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다.
본 발명에 따른 바람직한 살세균제는 서열번호 27, 28, 29, 30 내지 47로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 지닌 살세균제이거나 서열번호 50 내지 67로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 지닌 폴리뉴클레오타이드 서열에 의해 암호화된 살세균제이다.
본 발명에 따른 바람직한 살세균제는 서열번호 91 내지 108로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 지닌 서열을 지닌 벡터로부터의 발현 생성물이다.
본 발명에 따른 추가로 바람직한 살세균제는 서열번호 12 내지 25, 또는 이의 변이체, 서열번호 1 내지 7, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 세포 벽 용해 효소로부터의 작용성 효소 도메인 및, 임의로 서열 번호; 8 내지 11, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 세포 벽 결합 도메인 및/또는 서열번호 70 내지 90으로 이루어진 그룹으로부터 선택된 항미생물성 펩타이드로 이루어진 그룹으로부터 선택된 단백질 형질도입 도메인을 포함하는 살세균제로 이루어진 그룹으로부터 선택된 것이며; 여기서 변이체는 각각의 원래의 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다.
치료 방법에 더하여, 당해 국면은 이를 필요로 하는 대상체에서 세균 감염의 치료를 위한 의약의 제조를 위한 당해 국면의 구현예에 관한 것이다. 치료 방법에 더하여, 당해 국면은 이를 필요로 하는 대상체에서 세균 감염의 치료시 사용하기 위한 당해 국면의 구현예에 관한 것이다.
제2 국면에서, 본 발명은 서열번호 1 내지 7, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 작용성 효소 도메인(여기서, 상기 변이체는 세포 벽 용해 효소로부터의 작용성 효소 도메인이고 각각 서열번호 1 내지 7로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 지닌다); 및 서열번호 12 내지 25, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 단백질 형질도입 도메인(여기서, 상기 변이체는 각각 서열번호 12 내지 25로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 또는 99% 또는 100% 서열 동일성을 지닌다)을 포함하고; 임의로 서열번호 8 내지 11, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 세포 벽 결합 도메인(여기서, 상기 변이체는 작용성 세포 벽 결합 도메인이고 서열번호 8 내지 11로 이루어진 그룹으로부터 선택된 서열 각각과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다); 및/또는 서열 번호; 70 내지 90, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 항미생물성 펩타이드(여기서, 상기 변이체는 작용성 세포 벽 결합 도메인이고 서열번호 70 내지 90으로 이루어진 그룹으로부터 선택된 서열 각각과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다)를 포함하는 키메릭 살세균 폴리펩타이드를 제공한다.
본 발명은 또한 서열번호 27, 28, 29, 30 내지 47로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 또는 99% 또는 100% 서열 동일성을 지닌 키메릭 살세균 폴리펩타이드, 또는 서열번호 50 내지 67로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 지닌 폴리뉴클레오타이드 서열에 의해 암호화된 키메릭 살세균제를 제공한다.
본 발명은 또한 서열번호 12 내지 25, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 단백질 형질도입 도메인, 서열번호 1 내지 7, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 세포 벽 용해 효소로부터의 작용성 효소 도메인 및, 임의로, 서열번호 8 내지 11, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 세포 벽 결합 도메인, 및/또는 서열번호 70 내지 90, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 항미생물성 펩타이드를 포함하는 살세균제로 이루어진 그룹으로부터 선택된 키메릭 살세균 폴리펩타이드를 제공하며; 여기서 상기 변이체는 각각의 원래의 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성을 갖는다.
본 발명은 또한 본 발명의 당해 국면에 따른 키메릭 살세균 폴리펩타이드를 암호화하는 폴리뉴클레오타이드를 제공한다.
본 발명은 또한 본 발명에 따른 키메릭 살세균 폴리펩타이드를 암호화하는 폴리뉴클레오타이드를 포함하는 폴리뉴클레오타이드 작제물을 제공한다.
본 발명은 또한 본 발명에 따른 키메릭 살세균 폴리펩타이드의 발현 및 생산용 벡터를 제공한다. 본 발명에 따른 벡터는 바람직하게는 본 발명에 따른 키메릭 살세균 폴리펩타이드를 암호화하는 폴리뉴클레오타이드를 포함한다. 본 발명에 따른 바람직한 벡터는 서열번호 91 내지 108로 이루어진 그룹으로부터 선택된 서열과 적어도 40%, 45%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 또는 100% 서열 동일성의 서열을 지닌 벡터이다.
본 발명은 또한 본 발명에 다른 폴리뉴클레오타이드 작제물 또는 벡터를 포함하는, 본 발명에 따른 키메릭 살세균 폴리펩타이드의 생산용 숙주 세포를 제공한다. 숙주 세포는 원핵 및 진핵 숙주 세포와 같이 본 발명에 따른 키메릭 살세균 폴리펩타이드의 생산에 적합한 어떠한 숙주 세포일 수 있다.
본 발명은 또한 본 발명에 따른 폴리뉴클레오타이드 작제물 또는 벡터를 포함하는, 본 발명에 따른 숙주 세포를 본 발명에 따른 키메릭 살세균 폴리펩타이드의 생산을 수행하는 조건 하에서 배양하고, 임의로, 생산된 키메릭 살세균 폴리펩타이드를 분리 및/또는 정제함을 포함하여, 본 발명에 따른 키메릭 살세균 폴리펩타이드를 생산하는 방법을 제공한다.
어떠한 적합한 투여 경로도 사용하여 본 발명에 따른 알칼리화제 및 본 발명에 따른 살세균제를 투여할 수 있으며, 이는 다음을 포함하나, 이에 한정되지 않는다: 경구, 에어로졸 또는 폐로 전달하기 위한 다른 장치, 비강 스프레이, 정맥내, 근육내, 복강내, 수막내, 질, 직장, 국소, 요추 천공, 척추강내, 및 뇌 및/또는 뇌척수막에 직접적인 적용. 본 발명에 따른 알칼리화제 및 본 발명에 따른 살세균제는 이를 필요로 하는 대상체 또는 상기 대상체의 세포, 조직 또는 기관에 1일에 적어도 1회, 1주당 1회, 1개월당 1회, 6개월마다 1회, 1년마다 1회 또는 요법이 치료에 적합한 경우에 투여될 수 있다.
당해 서류 및 이의 청구범위에서, 동사 "포함하는" 및 이의 접합부는 당해 단어를 수반하는 항목이 포함됨을 의미하는 이의 비-제한적인 의미로 사용되지만 구체적으로 언급되지 않는 상기 항목을 제외하지 않는다. 또한, 부정 관사("a" 또는 "an")에 의한 성분에 대한 참고는 내용이 성분들 중 하나 및 단지 하나가 존재함을 명확하게 요구하지 않는 한, 하나 이상의 성분이 존재할 가능성을 배제하지 않는다. 부정 관사("a" 또는 "an")는 따라서 일반적으로 "적어도 하나"를 의미한다. 수치(예를 들면, 약 10)와 관련하여 사용된 경우 용어 "약" 또는 "대략"은 바람직하게는 당해 값이 주어진 값(10의) 이상 또는 상기 값의 0.1% 미만일 수 있음을 의미한다.
본원에서 인용된 모든 특허 및 문헌 참고는 이의 전문이 본원에 참고로 포함된다.
본원에서, 특수한 서열과의 서열 동일성은 바람직하게는 상기 특수한 폴리펩타이드 또는 폴리뉴클레오타이드 서열의 전체 길이에 걸친 서열 동일성을 의미한다. 본원에 제공된 바와 같은 서열 정보는 잘못 확인된 염기의 포함을 포함하는 것으로 협의적으로 구성되지 않아야 한다. 숙련가는 이러한 잘못 정의된 염기를 확인할 수 있으며 이러한 오류를 정정하는 방법을 알고 있다.
2개의 아미노산 서열 사이의 "유사성"은 하나의 펩타이드 또는 폴리펩타이드의 아미노산 서열 및 이의 보존된 아미노산 치환체를 제2의 펩타이드 또는 폴리펩타이드의 서열과 비교함으로써 측정된다. 바람직하게는, 동일성 또는 유사성은 본원에 정의된 바와 같은 전체 서열 번호에 걸쳐 계산된다. "동일성" 및 "유사성"은 Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heine, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied Math., 48:1073 (1988)을 포함하나, 이에 한정되지 않는 공지된 방법에 의해 용이하게 계산될 수 있다.
동일성을 측정하기 위한 바람직한 방법은 시험한 서열 사이의 최대 일치를 제공하도록 설계된다. 동일성 및 유사성을 측정하기 위한 방법은 공공으로 이용가능한 컴퓨터 프로그램에서 암호화되어 있다. 2개의 서열 사이의 동일성 및 유사성을 측정하기에 바람직한 컴퓨터 프로그램 방법은 예를 들면, GCG 프로그램 패키지(Devereux, J., et al., Nucleic Acids Research 12 (1): 387 (1984)), BestFit, BLASTP, BLASTN, 및 FASTA(Altschul, S. F. et al., J. Mol. Biol. 215:403-410 (1990))를 포함한다. BLAST X 프로그램은 NCBI 및 다른 공급원(BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990))으로부터 공공 이용가능하다. 잘 공지된 스미쓰 워터만 알고리즘(Smith Waterman algorithm) 또한 동일성을 측정하기 위해 사용될 수 있다.
폴리펩타이드 서열 비교를 위한 바람직한 매개변수는 다음을 포함한다: 알고리즘: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970); 비교 매트릭스: Hentikoff and Hentikoff, Proc. Natl. Acad. Sci. USA. 89:10915-10919 (1992)로부터의 BLOSSUM62; 갭 패널티(Gap Penalty): 12; 및 갭 길이 패널티(Gap Length Penalty): 4. 이들 매개변수를 사용한 유용한 프로그램은 위스콘신주 매디슨에 소재하는 Genetics Computer Group으로부터의 "Ogap" 프로그램으로서 공공 이용가능하다. 상술한 매개변수는 아미노산 비교용 디폴트 매개변수(default parameter; 말단 갭에 대한 패널티는 없다)이다.
핵산 비교를 위한 바람직한 매개변수는 다음을 포함한다: 알고리즘: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970); 비교 매트릭스: 일치 = +10, 불일치 = 0; 갭 패널티: 50; 갭 길이 패널티: 3. 위스콘신주 매디슨에 소재하는 Genetics Computer Group으로부터의 갭 프로그램으로 이용가능하다. 상기 제공된 것은 핵산 비교용의 디폴트 매개변수이다.
임의로, 아미노산 유사성 정도를 측정하는데 있어서, 숙련가는 또한 숙련가에게 명백하게 될 것으로서, 소위 "보존적" 아미노산 치환을 고려할 수 있다. 보존적 아미노산 치환은 유사한 측쇄를 갖는 잔기의 상호교환가능성을 말한다. 예를 들면, 지방족 측쇄를 갖는 아미노산의 그룹은 글리신, 알라닌, 발린, 루이신, 및 이소루이신이고; 지방족-하이드록실 측쇄를 갖는 아미노산의 그룹은 세린 및 트레오닌이며; 아미노-함유 측쇄를 갖는 아미노산의 그룹은 아스파라긴 및 글루탐산이고; 방향족 측쇄를 갖는 아미노산의 그룹은 페닐알라닌, 타이로신, 및 트립토판이며; 염기성 측쇄를 갖는 아미노산의 그룹은 라이신, 아르기닌, 및 히스티딘이고; 황-함유 측쇄를 갖는 아미노산의 그룹은 시스테인 및 메티오닌이다. 바람직한 보존적 아미노산 치환 그룹은: 발린-루이신-이소루이신, 페닐알라닌-타이로신, 라이신-아르기닌, 알라닌-발린, 및 아스파라긴-글루타민이다. 본원에 개시된 아미노산 서열의 치환 변이체는 개시된 서열내 적어도 하나의 잔기가 제거되고 상이한 잔기가 이의 위치에 삽입된 것들이다. 바람직하게는, 아미노산 변화는 보존적이다. 천연적으로 존재하는 아미노산 각각에 대한 바람직한 보존적 치환은 다음과 같다: Ala에서 ser으로; Arg에서 lys으로; Asn에서 gln 또는 his으로; Asp에서 glu로; Cys에서 ser 또는 ala으로; Gln에서 asn으로; Glu에서 asp으로; Gly에서 pro으로; His에서 asn 또는 gln으로; Ile에서 leu 또는 val으로; Leu에서 ile 또는 val으로; Lys에서 arg으로; gln에서 glu으로; Met에서 leu 또는 ile으로; Phe에서 met, leu 또는 tyr으로; Ser에서 thr으로; Thr에서 ser으로; Trp에서 tyr으로; Tyr에서 trp 또는 phe으로; 및, Val에서 ile 또는 leu으로.
폴리뉴클레오타이드는 뉴클레오타이드 서열에 의해 나타낸다. 폴리펩타이드는 아미노산 서열로 나타낸다.
도 1.
낮은 pH에 의한 에스. 아우레우스 SCV의 유도 및 pH 증가를 통한 세균 재성장.
MRSA 에스. 아우레우스 균주 6850(a), JE2(b) 및 코완(Cowan)(c)를 나타낸 바와 같은 상이한 pH에서 완충된 배지 속에 접종하였다. 생존성 세균의 콜로니 표현형을 측정하고 SCV의 퍼센트를 시간에 따라 플롯팅(plotting)하였다. 3회 수행된 3개의 독립된 실험을 평균 ± SEM으로 나타낸다.
낮은 pH-유도된 MRSA 에스. 아우레우스 균주 6850(d), JE2(e) 및 코완(f) 소집단을 나타낸 바와 같은 다양하게 완충된 pH 배지 속에 재-접종하고 성장을 시간에 따라 수행하였다. 3회로 수행된 3개의 독립된 실험을 평균 ± SEM으로 나타낸다.
도 2
파고라이소좀내에서 에스. 아우레우스의 세포내 지속성
A549 세포를 에스. 아우레우스 코완으로 감염시키고 세포외 세균을 플루클록사실린을 첨가하여 사멸시켰다. 생존하는 세포내 지속성 세균의 수(a) 및 표현형(b)을 나타낸 시점에서 측정하였다. 데이타를 3회 수행된 2개의 실험, 평균 ± SEM으로부터 수집(pooling)하였다.
도 3
파고라이소좀 알칼리화를 통한 에스. 아우레우스 소집단의 감소.
에스. 아우레우스 코완-감염된 A549 세포를 플루클록사신 단독(대조군) 또는 라이소좀향성 알칼리화제(클로로퀸(a), 바필로마이신 A1(b) 및 염화암모늄(c))가 보충된 플루클록사신으로 처리하였다. 살아있는 세포내 지속성 세균의 콜로니 표현형을 측정하고 나타낸 시점에서 열거하였다. 데이타를 3회 수행된 3개의 독립된 실험, 평균 ± SEM으로부터 수집하였다. 이원 변량분석(two-way ANOVA)은 인자들 시간 및 치료가 유의적임을 발견하였다(p-값 < 0.01).
도 4
생체내 감염 모델에서 클로로퀸에 의한 에스. 아우레우스 소집단의 감소.
마우스를 에스. 아우레우스 코완으로 복강내 감염시켰다. 감염시킨지 6시간 및 2일 후 마우스를 1mg의 플루클록사신 및 0.2mg 클로로퀸(+ CQ)으로 치료하였다. 플루클록사신 만으로 치료된 마우스를 대조군(-CQ)으로 제공하였다. ♣, 희생(a). 표적 조직(b), 말초 혈액 및 복강액(c)으로부터 회수된 세균의 콜로니 표현형을 측정하고 열거하였다. 각각의 점은 1마리의 마우스를 나타낸다. 수평 바아는 평균 ± SEM, n은 그룹당 11마리의 마우스를 나타낸다. PL, 복강액. 이원 변량분석은 인자 치료가 유의적임을 발견하였다(p-값 < 0.01).
도 5
가공된 엔도라이신의 혼합물에 의한 골육종 세포내에서 에스. 아우레우스의 세포내 표적화. (A) 에스. 아우레우스 뉴만(MOI 0.1)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 치료하였다. (B) 에스. 아우레우스 뉴만(MOI 0.1)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다. (C) 에스. 아우레우스 코완(MOI 0.01)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 치료하였다. (D) 에스. 아우레우스 코완(MOI 0.01)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다.
도 6
가공된 엔도라이신의 혼합물에 의한 골육종 세포(MOI 1.0)에서 에스. 아우레우스의 세포내 표적화. (A) 에스. 아우레우스 뉴만으로 24시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 치료하였다. (B) 에스. 아우레우스 뉴만으로 24시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다. (C) 에스. 아우레우스 코완으로 72시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 치료하였다. (D) 에스. 아우레우스 코완으로 72시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다.
도 7
세포벽 용해 효소로부터의 작용성 효소 도메인을 포함하고 분자의 N-말단 측면의 단백질 형질도입 도메인을 추가로 포함하는 본 발명에 따른 살세균제의 활성: (A) R9-CHAP-CBD, (B) R9-M23-CBD, (C) TAT-Ami-CBD, (D) TAT-CHAP-CBD, 및 (E) TAT-M23-CBD.
낮은 pH에 의한 에스. 아우레우스 SCV의 유도 및 pH 증가를 통한 세균 재성장.
MRSA 에스. 아우레우스 균주 6850(a), JE2(b) 및 코완(Cowan)(c)를 나타낸 바와 같은 상이한 pH에서 완충된 배지 속에 접종하였다. 생존성 세균의 콜로니 표현형을 측정하고 SCV의 퍼센트를 시간에 따라 플롯팅(plotting)하였다. 3회 수행된 3개의 독립된 실험을 평균 ± SEM으로 나타낸다.
낮은 pH-유도된 MRSA 에스. 아우레우스 균주 6850(d), JE2(e) 및 코완(f) 소집단을 나타낸 바와 같은 다양하게 완충된 pH 배지 속에 재-접종하고 성장을 시간에 따라 수행하였다. 3회로 수행된 3개의 독립된 실험을 평균 ± SEM으로 나타낸다.
도 2
파고라이소좀내에서 에스. 아우레우스의 세포내 지속성
A549 세포를 에스. 아우레우스 코완으로 감염시키고 세포외 세균을 플루클록사실린을 첨가하여 사멸시켰다. 생존하는 세포내 지속성 세균의 수(a) 및 표현형(b)을 나타낸 시점에서 측정하였다. 데이타를 3회 수행된 2개의 실험, 평균 ± SEM으로부터 수집(pooling)하였다.
도 3
파고라이소좀 알칼리화를 통한 에스. 아우레우스 소집단의 감소.
에스. 아우레우스 코완-감염된 A549 세포를 플루클록사신 단독(대조군) 또는 라이소좀향성 알칼리화제(클로로퀸(a), 바필로마이신 A1(b) 및 염화암모늄(c))가 보충된 플루클록사신으로 처리하였다. 살아있는 세포내 지속성 세균의 콜로니 표현형을 측정하고 나타낸 시점에서 열거하였다. 데이타를 3회 수행된 3개의 독립된 실험, 평균 ± SEM으로부터 수집하였다. 이원 변량분석(two-way ANOVA)은 인자들 시간 및 치료가 유의적임을 발견하였다(p-값 < 0.01).
도 4
생체내 감염 모델에서 클로로퀸에 의한 에스. 아우레우스 소집단의 감소.
마우스를 에스. 아우레우스 코완으로 복강내 감염시켰다. 감염시킨지 6시간 및 2일 후 마우스를 1mg의 플루클록사신 및 0.2mg 클로로퀸(+ CQ)으로 치료하였다. 플루클록사신 만으로 치료된 마우스를 대조군(-CQ)으로 제공하였다. ♣, 희생(a). 표적 조직(b), 말초 혈액 및 복강액(c)으로부터 회수된 세균의 콜로니 표현형을 측정하고 열거하였다. 각각의 점은 1마리의 마우스를 나타낸다. 수평 바아는 평균 ± SEM, n은 그룹당 11마리의 마우스를 나타낸다. PL, 복강액. 이원 변량분석은 인자 치료가 유의적임을 발견하였다(p-값 < 0.01).
도 5
가공된 엔도라이신의 혼합물에 의한 골육종 세포내에서 에스. 아우레우스의 세포내 표적화. (A) 에스. 아우레우스 뉴만(MOI 0.1)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 치료하였다. (B) 에스. 아우레우스 뉴만(MOI 0.1)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다. (C) 에스. 아우레우스 코완(MOI 0.01)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 치료하였다. (D) 에스. 아우레우스 코완(MOI 0.01)으로 3시간 동안 감염시킨 세포를 엔도라이신 혼합물로 1시간 및 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다.
도 6
가공된 엔도라이신의 혼합물에 의한 골육종 세포(MOI 1.0)에서 에스. 아우레우스의 세포내 표적화. (A) 에스. 아우레우스 뉴만으로 24시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 치료하였다. (B) 에스. 아우레우스 뉴만으로 24시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다. (C) 에스. 아우레우스 코완으로 72시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 치료하였다. (D) 에스. 아우레우스 코완으로 72시간 동안 감염시킨 세포를 엔도라이신 혼합물로 4시간 동안 20μM의 클로로퀸의 존재하에 치료하였다.
도 7
세포벽 용해 효소로부터의 작용성 효소 도메인을 포함하고 분자의 N-말단 측면의 단백질 형질도입 도메인을 추가로 포함하는 본 발명에 따른 살세균제의 활성: (A) R9-CHAP-CBD, (B) R9-M23-CBD, (C) TAT-Ami-CBD, (D) TAT-CHAP-CBD, 및 (E) TAT-M23-CBD.
실시예
본 발명은 본 발명의 영역을 제한하는 것으로 해석되지 않아야 하는 다음의 실시예에 의해 추가로 설명된다.
달리 기술하지 않는 한, 본 발명의 실시는 분자 생물학, 바이러스학, 미생물학 또는 생화학의 표준 통상의 방법을 사용할 것이다. 이러한 기술은 Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual (2nd edition), Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press; in Sambrook and Russell (2001) Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press, NY; in Volumes 1 and 2 of Ausubel et al. (1994) Current Protocols in Molecular Biology, Current Protocols, USA; and in Volumes I and II of Brown (1998) Molecular Biology LabFax, Second Edition, Academic Press (UK); Oligonucleotide Synthesis (N. Gait editor); Nucleic Acid Hybridization (Hames and Higgins, eds.)에 기술되어 있다.
실시예
1.
낮은 pH에 의한
스타필로코쿠스
아우레우스
소집단의 유도,
파고라이소좀
알칼리화에 의한 자각 및
살세균제와
조합된
파고라이소좀
알칼리화에 의한 효과적인 치료
1.1
에스
.
아우레우스
소
콜로니
변이체(SCV)의
pH-의존성 유도
잘-정의된 MSSA 균주 6850 및 코완 및 MRSA 균주 JE2를 라이소좀, 종기 및 혈액과 같은 생리학적 부위에서 발견된 pH를 모방하는, 4.0, 5.5, 6.5 및 7.4 pH 배지에서 성장시켰다. 에스. 아우레우스의 접종 직후 pH와는 독립적으로 거대 콜로니 표현형을 나타내었다. SCV의 빈도는 시간에 따라 pH 4.0 성장 배지에서 유의적으로 증가하였으며 5일 후 39%(JE2 및 6850) 및 28%(코완)에 이르렀다. 대조적으로 pH 7.4 성장 배지는 시험한 모든 균주에서 SCV를 2% 이하로 지속시켰다(도 1a 내지 도 c). 중간 퍼센트의 SCV는 pH 5.5 및 6.5에서 발견되었다. 따라서, 본 발명자들은 낮은 pH와 SCV 형성 사이에서 명확한 상관관계를 나타내었다. 당해 방법은 다양한 에스. 아우레우스 균주에서 불안정한, 비-유전적으로 변형된 SCV의 용이하고 조절된 형성을 허용하였다.
1.2 낮은 pH에 의한 복제하지 않는
에스
.
아우레우스의
유도
에스. 아우레우스 코완의 pH-의존성 성장은 세균 세포 벽을 형광성 세포 벽 결합 도메인(CBD)으로 표지함으로써 시간에 따라 수행하였다. 염색 직후, 세균 세포 벽을 완전히 표지하였다. pH 4.0 성장 배지에서 3일 후에, 대부분의 세균은 여전히 형광성 세포벽을 나타내었으며 세균 복제의 부재와 일치하였다. 대조적으로, pH 7.4에서 성장한 세균은 강렬하게 증식하였으며, 이는 고도로 단편화되고 89 감소된 형광성 세포 벽 표지에 의해 입증되었다. 낮은 pH 조건하에서 수득된, 작은 콜로니로부터 기원하는 세균의 주사 전자 현미경(SEM)은 정상의 세포 분열을 나타내는 거대한 콜로니 세균과는 대조적으로, 손상된 세포 분열을 나타내어 막대형 에스. 아우레우스를 생성하였다.
1.3 낮은 pH-적응된
에스
.
아우레우스
소집단의 성장 재개
본 발명자들의 발견은, SCV 및 복제하지 않는 소집단 둘 다가 낮은 pH에 의해 유도됨을 나타내었다. 임상에서, 이들 지속되는 세균의 존재는, 세균이 고 독성 및 신속히 성장하는 형태 14로 전환함을 암시하는 감염의 증가된 재발과 상관관계가 있다. 따라서, 본 발명자들은 낮은 pH-유도된 SCV 및/또는 복제하지 않는 소집단이 중성 pH에서 정상 성장을 회복할 수 있는지를 시험관내에서 시험하였다. 복제하지 않는 에스. 아우레우스 소집단을 유도하고 3일 동안 pH 4.0에서 유지시킨 후 pH 4.0, 5.5, 6.5 또는 7.4 성장 배지로 이전시켰다. pH 7.4 및 6.5에서 세균은 대략 12시간 후 성장을 재개(도 1d-f)한 반면 낮은 pH(< 6.5)에서 유지시킨 세균은 비증식 상태로 남았다(도 1d-f). 이들 데이타는, 지속성 표현형이 pH의 중화시 재성장 능력에 의해 뒷받침되는 바와 같이 낮은 pH에 대해 가역성 적응됨을 나타낸다.
1.4
에스
.
아우레우스
SCV의
세포내
유도
본 발명자들은 내부화된 에스. 아우레우스가 SCV 표현형을 나타내는지를 시험하였다. MSSA 균주 코완은 고도로 침입성이지만 세포독성이 아니며 이는 이들 균주를 폐 내피 세포주 A549에서 수일에 걸쳐 세포내적으로 유지되도록 하였다. 세포외 세균을, 고 투여량의 플루클록사실린을 감염된 숙주 세포에 가하여 사멸시켰다. 플루클록사신을 전형적으로 사용하여 환자에서 에스. 아우레우스 심장내막염을 치료하였다. 세포외 세균의 부재는 배양된 상층액의 멸균성으로 확인하였다. 숙주 세포를 용해하여 세포내 세균을 방출시키고 콜로니 수, 및 또한 콜로니 표현형을 다양한 시점에서 측정하였다. 감염 5시간 후, 0.2%의 모든 생존하는 세포내 세균은 SCV 표현형을 가졌다(도 2a-b). 생존하는 세포내 지속성 세균의 수는 감염의 과정 동안 감소한 반면 SCV의 빈도는 증가하여 7일 후 5.6%에 도달하였다.
1.5 지속하는
에스
.
아우레우스의
파고라이소좀
국재화
본 발명자들의 데이타는 산도가 SCV 형성을 선호함을 나타내었으며, 이는 산성 파고라이소좀 환경이 동일한 효과를 가질 수 있음을 나타낸다. A549 세포를 에스. 아우레우스 코완으로 감염시켰다. 세포내 세균을 LAMP-2 항체 양성 소포 내에 위치시키고, 형광 현미경으로 가시화하였다. LAMP-2(CD 107b)는 파고라이소좀내에서 고도로 발현되었으며, 세포내 지속성 에스. 아우레우스가 파고라이소좀 내에 주로 잔류함을 나타낸다.
1.6
파고라이소좀
알칼리화를 통한
에스
.
아우레우스
SCV의
감소
낮은 pH는 SCV 및/또는 복제하지 않는 에스. 아우레우스를 유도하였으며 배지 pH 중화는 세균 재성장을 생성하였으므로, 본 발명자들은 감염된 숙주 세포를 라이소좀향성 알칼리화제로 처리하였다. 클로로퀸, 바필로마이신 A1 또는 염화암모늄 모두는 파고라이소좀 pH를 중화시켰다. 라이소좀향성 알칼리화제로 처리한 숙주 세포는 감염 7일 후 유의적으로 보다 낮은 퍼센트의 SCV를 나타내었다(도 3). 대조군과 처리한 세포 사이의 총 콜로니 수에 있어서의 차이는 관찰되지 않았다. 라이소좀향성 알칼리화제는 사용된 농도에서 세균 성장을 억제하지 않았다. 숙주 세포 생존능에 있어서의 유의적이지 않은 차이는 알칼리화제로 처리한 후 관찰되었다.
1.7 세포 및 마우스에서
SCV
퍼센트의 감소를 생성하는
클로로퀸
처리에 의한 파고라이소좀내 지속성 에스. 아우레우스의 성장 재개
클로로퀸을 사용한 숙주 세포의 처리를 통한 에스. 아우레우스 소집단의 성장 재개를 평가하였다. 형광 현미경은 에스. 아우레우스가 감염 3일째에 대조군 및 클로로퀸-처리된 숙주 세포 둘 다에서 파고라이조좀내에 국재화함을 나타내었다. 본 발명자들은 클로로퀸 처리를 하지 않은 감염된 숙주 세포내에서 분열하는 세균을 관찰하지 못하였다. 그러나, 클로로퀸은 투과 전자 현미경(TEM)에 의해 평가된 것으로서 세균 세포 분열을 촉진시켰다.
에스. 아우레우스 코완으로 감염된 마우스를 플루클록사실린 단독(대조군), 또는 클로로퀸과 함께 치료하였다(도 5a). 클로로퀸 치료는 다양한 기관(도 5b) 및 구획(도 5c)에서 마우스내 SCV의 빈도를 유의적으로 감소시켰다. 절대 세균 수는 클로로퀸 치료와는 독립적으로, 비교가능하였다.
1.8 논의
당해 연구는, 종기 및 라이소좀내에서 발견되는 바와 같은, 낮은 pH가 에스. 아우레우스 소집단 SCV 및 복제하지 않는 소집단을 유도하였음을 나타내었다. 배양물 배지 속에서 또는 파고라이소좀 내에서 알칼리화제를 사용하여 pH를 상승시키는 것은 에스. 아우레우스를 정상 성장으로 전환시켰다. SCV 형성은 항생제 압력에 의해 개시됨을 나타내었다. 또한, 저온에 대한 연장된 노출, 매우 산성이거나 알칼리성인 환경, 또는 삼투압 스트레스는 에스. 아우레우스 및 코아굴라제-음성 스타필로코쿠스에서 SCV 및/또는 소집단 형성을 개시할 수 있다. 본 발명자들의 새로운 발견과 함께, 이들 관찰은, 다수의 자극이 에스. 아우레우스 소집단 형성을 초래하는 방법을 나타낸다. 숙주 세포내에서의 국재화는 불량한 세포 침투를 지닌 세포외적으로 활성인 베타-락탐과 같은 일반적으로 사용된 항생제로부터 에스. 아우레우스를 차폐한다. 또한, 낮은 파고라이소좀 내 pH는 클린다마이신 및 플우로로퀴놀론과 같은 세포내 활성을 지닌 항생제가 거의 활성을 지니지 않도록 한다. 본 발명자들은, 라이소좀향성 알칼리화제를 플루클록사실린과 같은 일반적으로 처방되는 항생제에 첨가하는 것은 시험관내 및 또한 생체내에서 에스. 아우레우스 SCV의 빈도를 감소시켰음을 발견하였다. 따라서, 본 발명자들은 에스. 아우레우스 소집단 형성의 숙주 의존성 성분을 피하는 단순한 전략을 확인하였다. 임상 세팅에서, 골수염 및 장치-관련된 감염에서 SCV의 존재는 항생제의 투여에도 불구하고, 증가된 재발률과 관련되어 왔다. 세균은 SCV 및/또는 소집단 형성에 의해 항생제 스트레스에 적응한다. 본 발명자들은 본 발명에 이르러 에스. 아우레우스 SCV 및 복제하지 않는 소집단이 고 독성 및 신속하게 성장하는 형태로의 전환 능력을 보유하였음을 확인하였다. 신속한 성장으로의 전환능(표현형 스위칭(phenotype switching))은 재발 감염을 생성한다. 또한, 이는 SCV의 확인을 어렵게 한다. 또한 임상에서 악화하는 SCV 문제는, 이들이 아주 작아서 이들의 신속히 성장하는 대응부에 의해 용이하게 과성장하는 콜로니를 검출하기 힘들게 하므로, 임상 미생물학 실험실에서 SCV의 과소평가이다. 본 발명자들은 일반적으로 처방된 항생제에 대한 알칼리화제의 첨가가 SCV의 빈도를 감소시킬 것이므로 미래의 재발률을 감소시킬 것으로 추정한다. 지속성 세균은 에스. 아우레우스에 대해 유일하지 않지만 또한 다양한 다른 사람 병원체, 예를 들면, 살모넬라 아종, 슈모도나스 아우레기노사(Pseudomonas aeruginosa), 에스케리키아 콜라이(Escherichia coli) 및 마이코박테리움 투베르쿨로시스(Mycobacterium tuberculosis)내에서 발생하는 것으로 기술되어 왔다. 낮은 pH 외에도, 세균 소집단은 독소-항독소 시스템을 포함하는 메카니즘으로 인하여 발생할 수 있다. 따라서, 산화 스트레스로 인한 DNA 손상에 대한 반응시 SOS 반응(ppGpp)의 활성화는 감소된 ATP 수준을 생성한다. 이는 감소된 성장을 생성하는 대사의 셧다운(shutdown)으로 이끈다. 다양한 독소-항독소 모듈이 살모넬라에서 산성화 및/또는 영양소 고갈에 의해 활성화되어, 소집단의 형성을 유발한다. 본 발명자들의 발견에 따라서, 살모넬라 소집단 형성은 바필로마이신 A1 44의 첨가에 의해 가역성이었던 살모넬라-함유 액포(vacuole)의 산성 및 영양학적으로 불량한 환경에 의해 개시된 대식구에서 보고되어 왔다. 바필로마이신 A1과는 대조적으로, 클로로퀸은 말라리아 및 또한 일부 류마티스 질환을 치료하기 위해 환자에서 정규적으로 사용된다. 따라서, 클로로퀸을 사용한 파고라이소좀 pH 중화는 세포내 지속성 스타필로코쿠스 저장기(reservoir)에 대한 신규 치료학적 근절 전략을 제공할 수 있다.
실시예
2.
단백질 형질도입 도메인을 포함하는
살세균제와
조합된
파고라이소좀
알칼리화에 의한
스타필로코쿠스
아우레우스
소집단의 효과적인 치료
서열번호 30 내지 47로 이루어진 그룹으로부터 선택된 서열을 사용한 감염된 숙주 세포내로의 효율적인 전달을 위한 단백질 형질도입 도메인과 함께 본 발명에 따른 살세균제를 시험관내 및 생체내에서 세포내 에스. 아우레우스 감염의 치료를 위한 파고라이소좀 pH 중화와 함께 사용한다. 당해 치료는 사용된 본 발명에 따른 특이적인 살세균제에 따라 효능에 있어서 일부 다양성으로, 세포내 에스. 아우레우스 감염의 효과적인 치료를 생성한다.
실시예
3.
스타필로코쿠스
아우레우스를
사용한
세포내
감염의 효과적인 치료
서열번호 27 내지 47로 이루어진 그룹으로부터 선택된 서열을 지닌 감염된 숙주 세포내로의 효율적인 전달을 위한 단백질 형질도입 도메인과 본 발명에 따른 살세균제를 파고라이소좀 pH 중화와 함께 또는 이를 사용하지 않고 세포내 에스. 아우레우스 감염의 치료에 사용하였다.
세포내
에스
.
아우레우스
사멸 검정 방법.
에스. 아우레우스를 LB 브로쓰 속에서 37℃에서 220rpm에서 진탕하면서 밤새 성장시켰다. 밤새 배양물을 신선한 LB(1:10) 속에 희석시키고 추가로 2시간 동안 성장시켰다. 이후에, 세균을 원심분리하고, 펠렛(pellet)을 PBS로 세척하고 배양물을 PBS 속에서 OD600을 0.4(약 2x108 CFU/mL)로 조절하면서 재-현탁시켰다. 세균 세포를 감염(SONOPULS HD 2070) 전에 1분 동안 40%의 동력에서 1초 펄스(pulse)로 초음파처리하였다. MG-63 골육종 세포를 12-웰 디쉬(well dish) 속에서 5x105개 세포/웰의 양으로 10% 태아 송아지 혈청(FBS)이 들어있는 1mL의 EMEM 배양 배지 속에서 감염 전 24시간 동안 성장시켰다. 이후에, 세포를 에스. 아우레우스 뉴만 및 코완으로 다음의 조건에서 감염시켰다: (A) 0.1의 MOI로 3시간 동안 에스. 아우레우스 뉴만, (B) 1.0의 MOI로 24시간 동안 에스. 아우레우스 뉴만, 및 (C) 1.0의 MOI로 72시간 동안 에스. 아우레우스 코완. 플레이트를 1200rpm에서 5분 동안 원심분리하고 37℃에서 CO2의 플러쉬(flush)와 함께 항온처리하였다. 침입 후, 진핵 세포를 PBS로 3회 세척하고 나머지 세포외 에스. 아우레우스를 제거하고 플록사실린(1 mg/mL)에 2시간 동안 노출시켜 어떠한 남겨진 비-내부화된 세균도 사멸시켰다. 각각의 실험(A, B, 및 C)을 위해, 샘플 중 1부를 클로로퀸 치료(20μM)에 노출시켜 라이소좀 pH를 증가시키고 엔도라이신으로 추가로 처리시 이의 효과를 평가하였다. 항생체-치료된 세포로부터의 상층액을 플레이팅(plating)하여 플록사실린 치료 효능에 대해 점검하였다. 이후에, 진핵 세포를 다시 PBS(3x)로 세척하고 사멸한 세균을 제거하고 엔도라이신 치료에 적용시켰다. 적용된 엔도라이신 제제의 조성물은 표 2에 요약한다. 진핵 세포를 1mL의 1μM 엔도라이신 제제(1mg/mL 플록사실린 및 +/- 20μM 클로로퀸이 보충된 EMEM 속에 희석됨)로 (A) 1시간 및 4시간, (B 및 C) 4시간 동안 치료하였다. 대조군은 EMEM 중 1mg/mL 플록사실린, +/- 20μM 클로로퀸 1mL로 치료하였다. 이후에 배양물을 PBS로 3회 세척하고 현미경 하에서 실험하여 조골 세포 용해가 있었는지를 측정하였다. 다음에, 이들을 트립신처리(트립신-EDTA 0.25%, Gibco®)하고 800μL의 0.1% 트리톤 X-100으로 용해하였다. 세포 용해물을 LB에서 일련의 희석 플레이팅에 적용시키고 37℃에서 밤새 항온처리하였다.
엔도라이신 혼합물의 성분 | 비 | 각각의 성분의 농도 |
CHAP-CBD + M23-CBD | 1:1 | 500nM:500nM |
CHAP-CBD-TAT + M23-CBD-TAT | 1:1 | 500nM:500nM |
CHAP-CBD-R9 + M23-CBD-R9 | 1:1 | 500nM:500nM |
CHAP-CBD-페네트라틴 + M23-CBD-페네트라틴 | 1:1 | 500nM:500nM |
CHAP-TAT + M23-TAT + Ami-TAT | 1:1:1 | 333nM:333nM:333nM |
CHAP-R9 + M23-R9 + Ami-R9 | 1:1:1 | 333nM:333nM:333nM |
CHAP-페네트라틴 + M23-페네트라틴 + Ami-페네트라틴 | 1:1:1 | 333nM:333nM:333nM |
결과
모든 단백질 작제물의 성공적인 발현 및 정제를 달성하였다. 상응하는 분자량 및 농도를 지닌 모든 발현된 엔도라이신의 요약은 표 3에 나타낸다. 비교적 고 농도가 대부분의 엔도라이신 작제물에 대해 수득되었으며, 이는 정확한 단백질 발현 및 정제 전략이 사용되었음을 암시한다.
서열번호 | 단백질 | 몰 분자량 (KDa) | 농도(mg/ml) | 농도(μM) |
27 | M23-CBD | 29.973 | 3.82 | 127.4 |
28 | CHAP-CBD | 33.088 | 2.36 | 71.33 |
29 | Ami-CBD | 35.385 | 3.30 | 93.26 |
44 | M23-CBD-TAT | 31.916 | 1.13 | 34.30 |
38 | CHAP-CBD-TAT | 35.031 | 1.85 | 52.81 |
32 | Ami-CBD-TAT | 37.328 | 0.40 | 10.71 |
43 | M23-CBD-R9 | 31.620 | 0.45 | 14.23 |
37 | CHAP-CBD-R9 | 34.736 | 1.05 | 30.20 |
31 | Ami-CBD-R9 | 37.033 | 0.30 | 8.10 |
42 | M23-CBD-페네트라틴 | 32.444 | 0.56 | 17.26 |
36 | CHAP-CBD-페네트라틴 | 35.559 | 2.03 | 57.08 |
30 | Ami-CBD-페네트라틴 | 37.856 | 0.41 | 10.82 |
47 | M23-TAT | 17.485 | 1.26 | 72.00 |
41 | CHAP-TAT | 20.600 | 1.44 | 69.90 |
35 | Ami-TAT | 22.897 | 0.83 | 36.25 |
46 | M23-R9 | 17.189 | 2.51 | 146.00 |
40 | CHAP-R9 | 20.305 | 0.83 | 40.88 |
34 | Ami-R9 | 22.601 | 0.76 | 33.63 |
45 | M23-페네트라틴 | 18.012 | 0.80 | 44.41 |
39 | CHAP-페네트라틴 | 21.128 | 0.73 | 34.55 |
33 | Ami-페네트라틴 | 23.424 | 0.96 | 41.00 |
모든 발현된 엔도라이신은 플레이트 검정 및 타임-사멸 검정(time-killing assay)에서 에스. 아우레우스 코완을 사멸시키는데 효과적이었다.
세포내
에스
.
아우레우스
사멸 검정
표 2에 나열된 바와 같은 엔도라이신 작제물의 혼합물을 세포 조직 배양물 속에서 이들의 세균 사멸 효능에 대해 추가로 시험하였다.
본 발명자들의 모델 MG-63에서 골육종 세포를 사용하여 골수염의 상태를 모사하였다. 세포를 우선 특정 시간 동안 병원체 에스. 아우레우스 뉴만 또는 코완으로 감염시킨 후 클로로퀸의 존재 또는 부재하에서 엔도라이신 혼합물로 치료하였다. 본 발명자들은, 클로로퀸의 적용에 의한 세포내 pH의 증가가 엔도라이신 활성에 대한 보다 양호한 조건을 생성할 수 있다고 예측하였다.
골육종 세포를 에스. 아우레우스 뉴만 및 코완에 3시간 동안에 이어 2시간 플록사실린 치료에 노출시켜 어떠한 내재화되지 않은 세균도 불활성화시켰다. 이후에, 조직 세포를 1시간 및 4시간 동안 1μM의 엔도라이신 혼합물과 클로로퀸의 존재 및 부재하에서 치료하였다. 결과는 도 5에 나타낸다. 도 5a는 엔도라이신 혼합물에 의한 에스. 아우레우스 뉴만의 치료 결과를 독점적으로 나타내며 도 5b는 클로로퀸과의 조합 치료의 결과를 나타낸다. 명확하게, 4시간 치료는 실험적 셋팅 둘 다에서 보다 효과적이었다. 더욱이, 클로로퀸의 존재하에서 엔도라이신 치료는 세균 수에 있어서 보다 큰 감소를 생성하였다. 흥미롭게도, CPP가 없는 엔도라이신의 혼합물은 매우 우수한 사멸 효능을 나타내었으며, 이는 예측되지 않았던 것이었다. 그러나, CBD와 관련된 다량의 양의 전하는 효소를 음으로 하전된 세포 막으로 이끌며 이의 세포내 전좌를 유도한다. CPP가 없는 엔도라이신 전좌에 대한 이러한 추정의 메카니즘은 CPP의 전좌의 핵심이다. 모든 엔도라이신 혼합물은, 이들의 활성이 CBD의 존재 및 CPP 태그의 유형에 따라 실질적으로 상이하다고 해도, 당해 검정에서 유사한 사멸 특성을 나타내었다. 이는 상이한 작제물이 CPP, EAD 및 CBD 존재에 의존하여, 상이한 침투 특성을 가지는 경향이 있다. 예를 들면, EAD-CBD 혼합물은 매우 고 활성을 가지지만 어떠한 CPP-함유 변이체와 같이 효율적으로 세포 막을 통해 수송되지 않을 수 있다. 반면, EAD-CPP 작제물은 고 활성을 가지지 않지만 보다 작고 형질도입 도메인을 함유하며, 이는 이들의 세포내 수송을 보다 효율적이 되도록 하고 CBD 결합에 의해 매개된 세포 표면에서 효소의 고정화의 결여로 인하여 이들의 분산성을 보다 우수하도록 한다. 그 결과, 상이한 특성을 지닌 효소들은 궁극적으로 유사한 결과를 제공할 수 있다.
도 5c 및 도 5d는 상이한 병원체 균주: 에스.아우레우스 코완으로 실시된 동일한 실혐의 결과를 나타낸다. 이 경우에, 상기 치료는 효과적인 것으로 보이지 않았다. 샘플 둘 다의 경우, 클로로퀸의 부재(도 5c) 및 클로로퀸의 존재(도 5d)시, 생존 수에 있어서의 감소는 무시할 정도이다. 클로로퀸의 존재하에서 CHAP-CBD-R9 + M23-CBD-R9의 혼합물을 사용한 치료만이 병원체의 유의적인 근절을 생성하였다. 에스. 아우레우스 코완이 라이소좀 내에 세포내적으로 존재함은 알려져 있다. 에스. 아우레우스 뉴만의 세포내 국제화는 상이한 경향이 매우 크며 - 이는 세포질에만 존재할 수 있으므로, 2개 실험의 결과는 상이하다. 더욱이, CPP-태그된 엔도라이신 작제물의 세포내 운명은 당해 시점에서 알려져 있지 않으므로, R9 태그와 함께 융합된 엔도라이신 작제물이 라이소좀내에 축적되는 유일한 것이므로, 이들은 에스. 아우레우스 코완에 대해 우수한 사멸 특성을 나타내었음을 유일하게 추정할 수 있다.
실제 감염을 모사하기 위하여, 골육종 조직 세포를 치료 전 에스. 아우레우스로 24시간 및 72시간 동안 감염시켰다. 이러한 장기간 감염 시간은 세균이 이들의 최종 도착지에 세포내적으로 정착하도록 하였다. 다시, 세포를 엔도라이신 작제물의 혼합물과 클로로퀸의 존재 및 부재하에서 치료하였다. 도 6a 및 도 6b는 클로로퀸의 존재 및 부재하에서 각각 에스. 아우레우스 뉴만으로 24시간 동안 감염된 골육종 세포의 치료의 결과를 나타낸다. 도 6c 및 도 6d는 에스. 아우레우스 코완으로 72시간 동안 감염된 치료된 세포의 결과에 상응한다. 둘 다의 경우에서 엔도라이신 혼합물을 사용한 치료는 특히 클로로퀸을 첨가할 때 효과적인 것으로 여겨진다. 가장 우수한 결과는 에스. 아우레우스 뉴만과 EAD-CBD-R9 혼합물을 클로로퀸의 존재하에 치료하는 경우 수득되었으며, 여기서 병원체의 약 60%가 근절되었다(도 6b). 동일하게 우수한 결과가 EAD-CBD, EAD-CBD-TAT, EAD-CBD-R9 및 EAD-CBD-페네트라틴의 혼합물로 치료한 에스. 아우레우스 코완의 경우 달성되었으며, 여기서 병원체의 약 50%가 사멸되었다. 일반적으로, EAD-CBD 혼합물을 포함하나, 형질도입 도메인과 융합되지 않은, CBD-함유 엔도라이신 작제물이 보다 우수한 사멸 효능을 나타내었다.
모두 함께 클로로퀸과 함께 가공된 엔도라이신에 의한 세포내 에스. 아우레우스의 치료는 클로로퀸의 부재하에서의 것보다 더 효과적이었음이 입증되었다. 이러한 알칼리화제의 적용은 용해 효소의 활성을 세포내적으로 향상시키는 경향이 있으며 엔도라이신과 클로로퀸의 조합 치료는 통상의 항생제 치료요법에 대한 우수한 대안일 수 있었다.
결론적으로, 당해 연구는, 세포내 및 세포외 에스. 아우레우스의 치료시 CPP-융합된 엔도라이신의 효능을 나타내었다. 이러한 전략 방법을 사용하여 예를 들면, 골수염, 심내막염, 균혈증 또는 패혈증, 및 젖소에서 소 유방염과 같은 상태를 지닌 환자를 치료할 수 있었다.
실시예
4.
세포 벽 용해 효소로부터의 작용성 효소 도메인을 포함하고 분자의 N-말단 측면에 단백질 형질도입 도메인을 추가로 포함하는 본 발명에 따른 수개의 살세균제를 제조하고 본원의 어딘가에 기술된 바와 같은 방법에 따라 분석하였다(R9-CHAP-CBD, R9-M23-CBD, TAT-Ami-CBD, TAT-CHAP-CBD, 및 TAT-M23-CBD). 모든 데이타를 에스. 아우레우스 SA113 기질 세포에서 및 100nM의 단백질 농도(1μM이 사용된 Ami-CBD 작제물은 제외함)에서 제WO2013/169104호에 앞서 기술된 바에 따라 혼탁도 감소 검정에서 수집하였다. 도 7은 작제물의 수행능을 나타낸다. 도 7로부터 관찰될 수 있는 바와 같이, 일반적으로, N-말단 CPP 태그된 단백질은 활성이지만, 에스. 아우레우스 세포에서 분석한 경우 태그되지 않거나 C-말단 태그된 변이체에 비해 덜 수행한다. 세포내 사멸 효능을 분석하기 위하여, 작제물을 실시예 3에서와 같은 세포내 에스. 아우레우스 사멸 검정에서 시험할 수 있었다.
참고 목록
SEQUENCE LISTING
<110> Micreos Human Health B.V.
<120> Combination of bactericidal agent with a lysosomotropic
alkalinising agent for the treatment of a bacterial infection
<130> IPA171002-NL
<150> EP15158880.3
<151> 2015-03-12
<160> 108
<170> PatentIn version 3.5
<210> 1
<211> 180
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 1
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp
180
<210> 2
<211> 141
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 2
Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys Lys
1 5 10 15
Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met His
20 25 30
Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala Ile
35 40 45
Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly Gly
50 55 60
Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp Tyr
65 70 75 80
Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys Ala
85 90 95
Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala Pro
100 105 110
His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr Ala
115 120 125
Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly
130 135 140
<210> 3
<211> 161
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 3
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys
<210> 4
<211> 145
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 4
Met Lys Thr Leu Lys Gln Ala Glu Ser Tyr Ile Lys Ser Lys Val Asn
1 5 10 15
Thr Gly Thr Asp Phe Asp Gly Leu Tyr Gly Tyr Gln Cys Met Asp Leu
20 25 30
Ala Val Asp Tyr Ile Tyr His Val Thr Asp Gly Lys Ile Arg Met Trp
35 40 45
Gly Asn Ala Lys Asp Ala Ile Asn Asn Ser Phe Gly Gly Thr Ala Thr
50 55 60
Val Tyr Lys Asn Tyr Pro Ala Phe Arg Pro Lys Tyr Gly Asp Val Val
65 70 75 80
Val Trp Thr Thr Gly Asn Phe Ala Thr Tyr Gly His Ile Ala Ile Val
85 90 95
Thr Asn Pro Asp Pro Tyr Gly Asp Leu Gln Tyr Val Thr Val Leu Glu
100 105 110
Gln Asn Trp Asn Gly Asn Gly Ile Tyr Lys Thr Glu Leu Ala Thr Ile
115 120 125
Arg Thr His Asp Tyr Thr Gly Ile Thr His Phe Ile Arg Pro Asn Phe
130 135 140
Ala
145
<210> 5
<211> 151
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragement
<400> 5
Met Gly Leu Pro Ser Pro Lys Lys Arg Lys Pro Thr Ala Ser Glu Val
1 5 10 15
Ala Ala Trp Ala Lys Arg Met Ile Gly Arg Arg Val Asp Val Asp Gly
20 25 30
Tyr His Gly Ala Gln Cys Trp Asp Leu Pro Asn Tyr Ile Phe Asn Arg
35 40 45
Tyr Trp His Phe Lys Thr Thr Gly Asn Ala Ile Ala Met Ala Trp Tyr
50 55 60
Arg Tyr Pro Lys Gly Phe Lys Phe Tyr Arg Asn Thr Arg Asn Phe Val
65 70 75 80
Pro Lys Pro Gly Asp Met Ala Val Trp Gly Lys Gly Ser Phe Asn Asn
85 90 95
Gly Val Gly His Thr Ala Val Val Ile Gly Pro Ser Thr Lys Ser Tyr
100 105 110
Phe Thr Ser Val Asp Gln Asn Trp Ile Gly Ala Asn Ser Tyr Thr Gly
115 120 125
Ser Pro Gly Ala Lys Ile Lys His Ser Tyr Asn Gly Ile Ser Gly Phe
130 135 140
Val Arg Pro Pro Tyr His Ala
145 150
<210> 6
<211> 147
<212> PRT
<213> Artificial
<220>
<223> Polyppetide fragment
<400> 6
Met Ala Leu Pro Lys Thr Gly Lys Pro Thr Ala Lys Gln Val Val Asp
1 5 10 15
Trp Ala Ile Asn Leu Ile Gly Ser Gly Val Asp Val Asp Gly Tyr Tyr
20 25 30
Gly Arg Gln Cys Trp Asp Leu Pro Asn Tyr Ile Phe Asn Arg Tyr Trp
35 40 45
Asn Phe Lys Thr Pro Gly Asn Ala Arg Asp Met Ala Trp Tyr Arg Tyr
50 55 60
Pro Glu Gly Phe Lys Val Phe Arg Asn Thr Ser Asp Phe Val Pro Lys
65 70 75 80
Pro Gly Asp Ile Ala Val Trp Thr Gly Gly Asn Tyr Asn Trp Asn Thr
85 90 95
Trp Gly His Thr Gly Ile Val Val Gly Pro Ser Thr Lys Ser Tyr Phe
100 105 110
Tyr Ser Val Asp Gln Asn Trp Asn Asn Ser Asn Ser Tyr Val Gly Ser
115 120 125
Pro Ala Ala Lys Ile Lys His Ser Tyr Phe Gly Val Thr His Phe Val
130 135 140
Arg Pro Ala
145
<210> 7
<211> 165
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 7
Met Ala Lys Thr Gln Ala Glu Ile Asn Lys Arg Leu Asp Ala Tyr Ala
1 5 10 15
Lys Gly Thr Val Asp Ser Pro Tyr Arg Ile Lys Lys Ala Thr Ser Tyr
20 25 30
Asp Pro Ser Phe Gly Val Met Glu Ala Gly Ala Ile Asp Ala Asp Gly
35 40 45
Tyr Tyr His Ala Gln Cys Gln Asp Leu Ile Thr Asp Tyr Val Leu Trp
50 55 60
Leu Thr Asp Asn Lys Val Arg Thr Trp Gly Asn Ala Lys Asp Gln Ile
65 70 75 80
Lys Gln Ser Tyr Gly Thr Gly Phe Lys Ile His Glu Asn Lys Pro Ser
85 90 95
Thr Val Pro Lys Lys Gly Trp Ile Ala Val Phe Thr Ser Gly Ser Tyr
100 105 110
Gln Gln Trp Gly His Ile Gly Ile Val Tyr Asp Gly Gly Asn Thr Ser
115 120 125
Thr Phe Thr Ile Leu Glu Gln Asn Trp Asn Gly Tyr Ala Asn Lys Lys
130 135 140
Pro Thr Lys Arg Val Asp Asn Tyr Tyr Gly Leu Thr His Phe Ile Glu
145 150 155 160
Ile Pro Val Lys Ala
165
<210> 8
<211> 128
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 8
Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser Asp
1 5 10 15
Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys Ala
20 25 30
Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu His
35 40 45
Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr Lys
50 55 60
Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly Gln
65 70 75 80
Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp Val
85 90 95
Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg Thr
100 105 110
Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile Lys
115 120 125
<210> 9
<211> 78
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 9
Ala Pro Lys Ser Lys Pro Ser Lys Ile Lys Thr Thr Trp Asn Trp Gly
1 5 10 15
Gly Lys Phe Thr Ala Asn Ser Thr Ile Lys Val Arg Lys Ser Pro Gly
20 25 30
Leu Lys Gly Ile Val Val Glu Ser Gly Ser Trp Leu Tyr Lys Gly Asn
35 40 45
Tyr Val Pro Phe Asp Gln Val Ile Lys Lys Asp Gly Tyr Trp Trp Ile
50 55 60
Arg Phe Lys Tyr Val Gln Pro Gly Ser Ser Asn Lys His Phe
65 70 75
<210> 10
<211> 92
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 10
Trp Lys Thr Asn Lys Tyr Gly Thr Leu Tyr Lys Ser Glu Ser Ala Ser
1 5 10 15
Phe Thr Pro Asn Thr Asp Ile Ile Thr Arg Thr Thr Gly Pro Phe Arg
20 25 30
Ser Met Pro Gln Ser Gly Val Leu Lys Ala Gly Gln Thr Ile His Tyr
35 40 45
Asp Glu Val Met Lys Gln Asp Gly His Val Trp Val Gly Tyr Thr Gly
50 55 60
Asn Ser Gly Gln Arg Ile Tyr Leu Pro Val Arg Thr Trp Asn Lys Ser
65 70 75 80
Thr Asn Thr Leu Gly Val Leu Trp Gly Thr Ile Lys
85 90
<210> 11
<211> 92
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 11
Tyr Lys Thr Asn Lys Tyr Gly Thr Leu Tyr Lys Ser Glu Ser Ala Ser
1 5 10 15
Phe Thr Ala Asn Thr Asp Ile Ile Thr Arg Leu Thr Gly Pro Phe Arg
20 25 30
Ser Met Pro Gln Ser Gly Val Leu Arg Lys Gly Leu Thr Ile Lys Tyr
35 40 45
Asp Glu Val Met Lys Gln Asp Gly His Val Trp Val Gly Tyr Asn Thr
50 55 60
Asn Ser Gly Lys Arg Val Tyr Leu Pro Val Arg Thr Trp Asn Glu Ser
65 70 75 80
Thr Gly Glu Leu Gly Pro Leu Trp Gly Thr Ile Lys
85 90
<210> 12
<211> 30
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 12
Trp Glu Ala Lys Leu Ala Lys Ala Leu Ala Lys Ala Leu Ala Lys His
1 5 10 15
Leu Ala Lys Ala Leu Ala Lys Ala Leu Lys Ala Cys Glu Ala
20 25 30
<210> 13
<211> 22
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 13
Met Val Thr Val Leu Phe Arg Arg Leu Arg Ile Arg Arg Ala Ser Gly
1 5 10 15
Pro Pro Arg Val Arg Val
20
<210> 14
<211> 18
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 14
Lys Leu Ala Leu Lys Leu Ala Leu Lys Ala Leu Lys Ala Ala Leu Lys
1 5 10 15
Leu Ala
<210> 15
<211> 27
<212> PRT
<213> Artificial
<220>
<223> GALFLGFLGAAGSTMGAWSQPKKKRKV
<400> 15
Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly
1 5 10 15
Ala Trp Ser Gln Pro Lys Lys Lys Arg Lys Val
20 25
<210> 16
<211> 16
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 16
Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Trp Lys Lys
1 5 10 15
<210> 17
<211> 21
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 17
Lys Glu Thr Trp Trp Glu Thr Trp Trp Thr Glu Trp Ser Gln Pro Lys
1 5 10 15
Lys Lys Arg Lys Val
20
<210> 18
<211> 12
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 18
Arg Arg Gln Arg Arg Thr Ser Lys Leu Met Lys Arg
1 5 10
<210> 19
<211> 18
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 19
Leu Leu Ile Ile Leu Arg Arg Arg Ile Arg Lys Gln Ala His Ala His
1 5 10 15
Ser Lys
<210> 20
<211> 9
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 20
Arg Arg Trp Trp Arg Arg Trp Arg Arg
1 5
<210> 21
<211> 16
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment comprises 6 to 15 R residues
<400> 21
Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg Arg
1 5 10 15
<210> 22
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 22
Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln
1 5 10
<210> 23
<211> 9
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 23
Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5
<210> 24
<211> 27
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 24
Gly Trp Thr Leu Asn Ser Ala Gly Tyr Leu Leu Gly Lys Ile Asn Leu
1 5 10 15
Lys Ala Leu Ala Ala Leu Ala Lys Lys Ile Leu
20 25
<210> 25
<211> 21
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 25
Ala Gly Tyr Leu Leu Gly Lys Ile Asn Leu Lys Ala Leu Ala Ala Leu
1 5 10 15
Ala Lys Lys Ile Leu
20
<210> 26
<400> 26
000
<210> 27
<211> 269
<212> PRT
<213> Artificial Sequence
<220>
<223> Polypeptide fragment
<400> 27
Met Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys
1 5 10 15
Lys Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met
20 25 30
His Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala
35 40 45
Ile Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly
50 55 60
Gly Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp
65 70 75 80
Tyr Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys
85 90 95
Ala Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala
100 105 110
Pro His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr
115 120 125
Ala Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly Gly Lys
130 135 140
Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser Asp Val Lys Gln
145 150 155 160
Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys Ala Thr Asp Trp
165 170 175
Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu His Ala Ser Phe
180 185 190
Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr Lys Gly Pro Trp
195 200 205
Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly Gln Thr Ile Lys
210 215 220
Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp Val Ser Trp Glu
225 230 235 240
Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg Thr Trp Asp Ala
245 250 255
Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile Lys
260 265
<210> 28
<211> 289
<212> PRT
<213> Artificial Sequence
<220>
<223> Polypeptide fragment
<400> 28
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser
165 170 175
Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys
180 185 190
Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu
195 200 205
His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr
210 215 220
Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly
225 230 235 240
Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp
245 250 255
Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg
260 265 270
Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile
275 280 285
Lys
<210> 29
<211> 308
<212> PRT
<213> Artificial Sequence
<220>
<223> Polypeptide fragment
<400> 29
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile
180 185 190
Lys Gln Ser Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln
195 200 205
Ile Val Lys Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr
210 215 220
Lys Ala Glu His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile
225 230 235 240
Thr Arg Tyr Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu
245 250 255
Gln Lys Gly Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly
260 265 270
His Val Trp Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met
275 280 285
Pro Val Arg Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp
290 295 300
Gly Glu Ile Lys
305
<210> 30
<211> 326
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 30
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile
180 185 190
Lys Gln Ser Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln
195 200 205
Ile Val Lys Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr
210 215 220
Lys Ala Glu His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile
225 230 235 240
Thr Arg Tyr Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu
245 250 255
Gln Lys Gly Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly
260 265 270
His Val Trp Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met
275 280 285
Pro Val Arg Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp
290 295 300
Gly Glu Ile Lys Glu Leu Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg
305 310 315 320
Arg Met Lys Trp Lys Lys
325
<210> 31
<211> 319
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 31
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile
180 185 190
Lys Gln Ser Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln
195 200 205
Ile Val Lys Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr
210 215 220
Lys Ala Glu His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile
225 230 235 240
Thr Arg Tyr Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu
245 250 255
Gln Lys Gly Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly
260 265 270
His Val Trp Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met
275 280 285
Pro Val Arg Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp
290 295 300
Gly Glu Ile Lys Glu Leu Arg Arg Arg Arg Arg Arg Arg Arg Arg
305 310 315
<210> 32
<211> 323
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 32
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile
180 185 190
Lys Gln Ser Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln
195 200 205
Ile Val Lys Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr
210 215 220
Lys Ala Glu His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile
225 230 235 240
Thr Arg Tyr Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu
245 250 255
Gln Lys Gly Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly
260 265 270
His Val Trp Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met
275 280 285
Pro Val Arg Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp
290 295 300
Gly Glu Ile Lys Glu Leu Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
305 310 315 320
Pro Pro Gln
<210> 33
<211> 199
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 33
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp Gly Glu Leu Arg Gln Ile Lys Ile Trp Phe Gln Asn
180 185 190
Arg Arg Met Lys Trp Lys Lys
195
<210> 34
<211> 192
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 34
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp Gly Glu Leu Arg Arg Arg Arg Arg Arg Arg Arg Arg
180 185 190
<210> 35
<211> 196
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 35
Met Leu Lys His Ile Tyr Ser Asn His Ile Lys Gly Asn Lys Ile Thr
1 5 10 15
Ala Pro Lys Pro Ser Ile Gln Gly Val Val Ile His Asn Asp Tyr Gly
20 25 30
Ser Met Thr Pro Ser Gln Tyr Leu Pro Trp Leu Tyr Ala Arg Glu Asn
35 40 45
Asn Gly Thr His Val Asn Gly Trp Ala Ser Val Tyr Ala Asn Arg Asn
50 55 60
Glu Val Leu Trp Tyr His Pro Thr Asp Tyr Val Glu Trp His Cys Gly
65 70 75 80
Asn Gln Trp Ala Asn Ala Asn Leu Ile Gly Phe Glu Val Cys Glu Ser
85 90 95
Tyr Pro Gly Arg Ile Ser Asp Lys Leu Phe Leu Glu Asn Glu Glu Ala
100 105 110
Thr Leu Lys Val Ala Ala Asp Val Met Lys Ser Tyr Gly Leu Pro Val
115 120 125
Asn Arg Asn Thr Val Arg Leu His Asn Glu Phe Phe Gly Thr Ser Cys
130 135 140
Pro His Arg Ser Trp Asp Leu His Val Gly Lys Gly Glu Pro Tyr Thr
145 150 155 160
Thr Thr Asn Ile Asn Lys Met Lys Asp Tyr Phe Ile Lys Arg Ile Lys
165 170 175
His Tyr Tyr Asp Gly Glu Leu Gly Arg Lys Lys Arg Arg Gln Arg Arg
180 185 190
Arg Pro Pro Gln
195
<210> 36
<211> 307
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 36
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser
165 170 175
Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys
180 185 190
Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu
195 200 205
His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr
210 215 220
Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly
225 230 235 240
Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp
245 250 255
Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg
260 265 270
Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile
275 280 285
Lys Glu Leu Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys
290 295 300
Trp Lys Lys
305
<210> 37
<211> 300
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 37
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser
165 170 175
Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys
180 185 190
Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu
195 200 205
His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr
210 215 220
Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly
225 230 235 240
Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp
245 250 255
Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg
260 265 270
Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile
275 280 285
Lys Glu Leu Arg Arg Arg Arg Arg Arg Arg Arg Arg
290 295 300
<210> 38
<211> 304
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 38
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys Gly Gly Lys Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser
165 170 175
Asp Val Lys Gln Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys
180 185 190
Ala Thr Asp Trp Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu
195 200 205
His Ala Ser Phe Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr
210 215 220
Lys Gly Pro Trp Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly
225 230 235 240
Gln Thr Ile Lys Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp
245 250 255
Val Ser Trp Glu Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg
260 265 270
Thr Trp Asp Ala Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile
275 280 285
Lys Glu Leu Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln
290 295 300
<210> 39
<211> 179
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 39
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys Glu Leu Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys
165 170 175
Trp Lys Lys
<210> 40
<211> 172
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 40
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys Glu Leu Arg Arg Arg Arg Arg Arg Arg Arg Arg
165 170
<210> 41
<211> 176
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 41
Met Ser Ile Ile Met Glu Val Ala Thr Met Gln Ala Lys Leu Thr Lys
1 5 10 15
Asn Glu Phe Ile Glu Trp Leu Lys Thr Ser Glu Gly Lys Gln Phe Asn
20 25 30
Val Asp Leu Trp Tyr Gly Phe Gln Cys Phe Asp Tyr Ala Asn Ala Gly
35 40 45
Trp Lys Val Leu Phe Gly Leu Leu Leu Lys Gly Leu Gly Ala Lys Asp
50 55 60
Ile Pro Phe Ala Asn Asn Phe Asp Gly Leu Ala Thr Val Tyr Gln Asn
65 70 75 80
Thr Pro Asp Phe Leu Ala Gln Pro Gly Asp Met Val Val Phe Gly Ser
85 90 95
Asn Tyr Gly Ala Gly Tyr Gly His Val Ala Trp Val Ile Glu Ala Thr
100 105 110
Leu Asp Tyr Ile Ile Val Tyr Glu Gln Asn Trp Leu Gly Gly Gly Trp
115 120 125
Thr Asp Gly Ile Glu Gln Pro Gly Trp Gly Trp Glu Lys Val Thr Arg
130 135 140
Arg Gln His Ala Tyr Asp Phe Pro Met Trp Phe Ile Arg Pro Asn Phe
145 150 155 160
Lys Glu Leu Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln
165 170 175
<210> 42
<211> 287
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 42
Met Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys
1 5 10 15
Lys Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met
20 25 30
His Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala
35 40 45
Ile Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly
50 55 60
Gly Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp
65 70 75 80
Tyr Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys
85 90 95
Ala Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala
100 105 110
Pro His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr
115 120 125
Ala Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly Gly Lys
130 135 140
Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser Asp Val Lys Gln
145 150 155 160
Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys Ala Thr Asp Trp
165 170 175
Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu His Ala Ser Phe
180 185 190
Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr Lys Gly Pro Trp
195 200 205
Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly Gln Thr Ile Lys
210 215 220
Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp Val Ser Trp Glu
225 230 235 240
Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg Thr Trp Asp Ala
245 250 255
Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile Lys Glu Leu Arg
260 265 270
Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Trp Lys Lys
275 280 285
<210> 43
<211> 280
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 43
Met Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys
1 5 10 15
Lys Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met
20 25 30
His Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala
35 40 45
Ile Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly
50 55 60
Gly Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp
65 70 75 80
Tyr Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys
85 90 95
Ala Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala
100 105 110
Pro His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr
115 120 125
Ala Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly Gly Lys
130 135 140
Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser Asp Val Lys Gln
145 150 155 160
Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys Ala Thr Asp Trp
165 170 175
Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu His Ala Ser Phe
180 185 190
Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr Lys Gly Pro Trp
195 200 205
Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly Gln Thr Ile Lys
210 215 220
Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp Val Ser Trp Glu
225 230 235 240
Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg Thr Trp Asp Ala
245 250 255
Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile Lys Glu Leu Arg
260 265 270
Arg Arg Arg Arg Arg Arg Arg Arg
275 280
<210> 44
<211> 284
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 44
Met Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys
1 5 10 15
Lys Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met
20 25 30
His Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala
35 40 45
Ile Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly
50 55 60
Gly Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp
65 70 75 80
Tyr Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys
85 90 95
Ala Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala
100 105 110
Pro His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr
115 120 125
Ala Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly Gly Lys
130 135 140
Leu Glu Val Ser Lys Ala Ala Thr Ile Lys Gln Ser Asp Val Lys Gln
145 150 155 160
Glu Val Lys Lys Gln Glu Ala Lys Gln Ile Val Lys Ala Thr Asp Trp
165 170 175
Lys Gln Asn Lys Asp Gly Ile Trp Tyr Lys Ala Glu His Ala Ser Phe
180 185 190
Thr Val Thr Ala Pro Glu Gly Ile Ile Thr Arg Tyr Lys Gly Pro Trp
195 200 205
Thr Gly His Pro Gln Ala Gly Val Leu Gln Lys Gly Gln Thr Ile Lys
210 215 220
Tyr Asp Glu Val Gln Lys Phe Asp Gly His Val Trp Val Ser Trp Glu
225 230 235 240
Thr Phe Glu Gly Glu Thr Val Tyr Met Pro Val Arg Thr Trp Asp Ala
245 250 255
Lys Thr Gly Lys Val Gly Lys Leu Trp Gly Glu Ile Lys Glu Leu Gly
260 265 270
Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln
275 280
<210> 45
<211> 160
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 45
Met Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys
1 5 10 15
Lys Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met
20 25 30
His Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala
35 40 45
Ile Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly
50 55 60
Gly Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp
65 70 75 80
Tyr Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys
85 90 95
Ala Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala
100 105 110
Pro His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr
115 120 125
Ala Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly Glu Leu
130 135 140
Arg Gln Ile Lys Ile Trp Phe Gln Asn Arg Arg Met Lys Trp Lys Lys
145 150 155 160
<210> 46
<211> 153
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 46
Met Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys
1 5 10 15
Lys Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met
20 25 30
His Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala
35 40 45
Ile Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly
50 55 60
Gly Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp
65 70 75 80
Tyr Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys
85 90 95
Ala Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala
100 105 110
Pro His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr
115 120 125
Ala Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly Glu Leu
130 135 140
Arg Arg Arg Arg Arg Arg Arg Arg Arg
145 150
<210> 47
<211> 157
<212> PRT
<213> Artificial
<220>
<223> Polypeptide construct
<400> 47
Met Ala Ala Thr His Glu His Ser Ala Gln Trp Leu Asn Asn Tyr Lys
1 5 10 15
Lys Gly Tyr Gly Tyr Gly Pro Tyr Pro Leu Gly Ile Asn Gly Gly Met
20 25 30
His Tyr Gly Val Asp Phe Phe Met Asn Ile Gly Thr Pro Val Lys Ala
35 40 45
Ile Ser Ser Gly Lys Ile Val Glu Ala Gly Trp Ser Asn Tyr Gly Gly
50 55 60
Gly Asn Gln Ile Gly Leu Ile Glu Asn Asp Gly Val His Arg Gln Trp
65 70 75 80
Tyr Met His Leu Ser Lys Tyr Asn Val Lys Val Gly Asp Tyr Val Lys
85 90 95
Ala Gly Gln Ile Ile Gly Trp Ser Gly Ser Thr Gly Tyr Ser Thr Ala
100 105 110
Pro His Leu His Phe Gln Arg Met Val Asn Ser Phe Ser Asn Ser Thr
115 120 125
Ala Gln Asp Pro Met Pro Phe Leu Lys Ser Ala Gly Tyr Gly Glu Leu
130 135 140
Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln
145 150 155
<210> 48
<400> 48
000
<210> 49
<400> 49
000
<210> 50
<211> 981
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 50
atgctgaaac atatttactc caaccacatt aaaggtaaca aaatcacagc ccctaaaccg 60
tcaattcagg gcgtggtgat ccacaacgat tatggctcaa tgaccccttc acagtacctg 120
ccttggctgt acgctcgcga aaacaacggt acacatgtga atggctgggc ctcagtgtat 180
gccaatcgca acgaggtgct gtggtatcat cctacagact acgtggaatg gcactgcggc 240
aaccaatggg ccaacgccaa cctgatcggc tttgaagttt gcgaatcata tcctggtcgc 300
atctcagaca aactgtttct ggaaaacgag gaagccacac tgaaagtagc tgccgacgtg 360
atgaaatcgt atggcctgcc tgtgaatcgc aacacagtgc gcctgcacaa cgaatttttc 420
ggtacatcat gccctcatcg ttcatgggac ctgcacgtgg gcaaaggcga gccttatacc 480
acaacaaata tcaataaaat gaaagattat ttcattaaac ggattaaaca ctactatgac 540
ggtggcaaac tggaagttag caaagcagcc accattaaac agagtgatgt taaacaagaa 600
gtgaaaaaac aagaggccaa acaaattgtg aaagccaccg attggaaaca gaacaaagat 660
ggcatctggt ataaagcaga acatgccagc tttaccgtga ccgcaccgga aggcattatt 720
acccgttata aaggtccgtg gaccggtcat ccgcaggcag gcgtgctgca gaaaggtcag 780
accatcaaat atgatgaagt gcagaaattt gatggccatg tttgggttag ctgggaaacc 840
tttgaaggtg aaaccgttta tatgccggtt cgtacctggg atgcaaaaac cggtaaagtt 900
ggtaaactgt ggggtgagat taaagagctc cgccagatca aaatttggtt tcagaatcgt 960
cgcatgaaat ggaaaaaata a 981
<210> 51
<211> 960
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 51
atgctgaaac atatttactc caaccacatt aaaggtaaca aaatcacagc ccctaaaccg 60
tcaattcagg gcgtggtgat ccacaacgat tatggctcaa tgaccccttc acagtacctg 120
ccttggctgt acgctcgcga aaacaacggt acacatgtga atggctgggc ctcagtgtat 180
gccaatcgca acgaggtgct gtggtatcat cctacagact acgtggaatg gcactgcggc 240
aaccaatggg ccaacgccaa cctgatcggc tttgaagttt gcgaatcata tcctggtcgc 300
atctcagaca aactgtttct ggaaaacgag gaagccacac tgaaagtagc tgccgacgtg 360
atgaaatcgt atggcctgcc tgtgaatcgc aacacagtgc gcctgcacaa cgaatttttc 420
ggtacatcat gccctcatcg ttcatgggac ctgcacgtgg gcaaaggcga gccttatacc 480
acaacaaata tcaataaaat gaaagattat ttcattaaac ggattaaaca ctactatgac 540
ggtggcaaac tggaagttag caaagcagcc accattaaac agagtgatgt taaacaagaa 600
gtgaaaaaac aagaggccaa acaaattgtg aaagccaccg attggaaaca gaacaaagat 660
ggcatctggt ataaagcaga acatgccagc tttaccgtga ccgcaccgga aggcattatt 720
acccgttata aaggtccgtg gaccggtcat ccgcaggcag gcgtgctgca gaaaggtcag 780
accatcaaat atgatgaagt gcagaaattt gatggccatg tttgggttag ctgggaaacc 840
tttgaaggtg aaaccgttta tatgccggtt cgtacctggg atgcaaaaac cggtaaagtt 900
ggtaaactgt ggggtgagat taaagagctc cgtcgtcgtc gccgtcggcg tcgtcgttaa 960
<210> 52
<211> 972
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 52
atgctgaaac atatttactc caaccacatt aaaggtaaca aaatcacagc ccctaaaccg 60
tcaattcagg gcgtggtgat ccacaacgat tatggctcaa tgaccccttc acagtacctg 120
ccttggctgt acgctcgcga aaacaacggt acacatgtga atggctgggc ctcagtgtat 180
gccaatcgca acgaggtgct gtggtatcat cctacagact acgtggaatg gcactgcggc 240
aaccaatggg ccaacgccaa cctgatcggc tttgaagttt gcgaatcata tcctggtcgc 300
atctcagaca aactgtttct ggaaaacgag gaagccacac tgaaagtagc tgccgacgtg 360
atgaaatcgt atggcctgcc tgtgaatcgc aacacagtgc gcctgcacaa cgaatttttc 420
ggtacatcat gccctcatcg ttcatgggac ctgcacgtgg gcaaaggcga gccttatacc 480
acaacaaata tcaataaaat gaaagattat ttcattaaac ggattaaaca ctactatgac 540
ggtggcaaac tggaagttag caaagcagcc accattaaac agagtgatgt taaacaagaa 600
gtgaaaaaac aagaggccaa acaaattgtg aaagccaccg attggaaaca gaacaaagat 660
ggcatctggt ataaagcaga acatgccagc tttaccgtga ccgcaccgga aggcattatt 720
acccgttata aaggtccgtg gaccggtcat ccgcaggcag gcgtgctgca gaaaggtcag 780
accatcaaat atgatgaagt gcagaaattt gatggccatg tttgggttag ctgggaaacc 840
tttgaaggtg aaaccgttta tatgccggtt cgtacctggg atgcaaaaac cggtaaagtt 900
ggtaaactgt ggggtgagat taaagagctc ggtcgtaaaa aacgtcgtca gcgtcgtcgt 960
ccgcctcagt aa 972
<210> 53
<211> 600
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 53
atgctgaaac atatttactc caaccacatt aaaggtaaca aaatcacagc ccctaaaccg 60
tcaattcagg gcgtggtgat ccacaacgat tatggctcaa tgaccccttc acagtacctg 120
ccttggctgt acgctcgcga aaacaacggt acacatgtga atggctgggc ctcagtgtat 180
gccaatcgca acgaggtgct gtggtatcat cctacagact acgtggaatg gcactgcggc 240
aaccaatggg ccaacgccaa cctgatcggc tttgaagttt gcgaatcata tcctggtcgc 300
atctcagaca aactgtttct ggaaaacgag gaagccacac tgaaagtagc tgccgacgtg 360
atgaaatcgt atggcctgcc tgtgaatcgc aacacagtgc gcctgcacaa cgaatttttc 420
ggtacatcat gccctcatcg ttcatgggac ctgcacgtgg gcaaaggcga gccttatacc 480
acaacaaata tcaataaaat gaaagattat ttcattaaac ggattaaaca ctactatgac 540
ggtgagctcc gccagatcaa aatttggttt cagaatcgtc gcatgaaatg gaaaaaataa 600
<210> 54
<211> 579
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 54
atgctgaaac atatttactc caaccacatt aaaggtaaca aaatcacagc ccctaaaccg 60
tcaattcagg gcgtggtgat ccacaacgat tatggctcaa tgaccccttc acagtacctg 120
ccttggctgt acgctcgcga aaacaacggt acacatgtga atggctgggc ctcagtgtat 180
gccaatcgca acgaggtgct gtggtatcat cctacagact acgtggaatg gcactgcggc 240
aaccaatggg ccaacgccaa cctgatcggc tttgaagttt gcgaatcata tcctggtcgc 300
atctcagaca aactgtttct ggaaaacgag gaagccacac tgaaagtagc tgccgacgtg 360
atgaaatcgt atggcctgcc tgtgaatcgc aacacagtgc gcctgcacaa cgaatttttc 420
ggtacatcat gccctcatcg ttcatgggac ctgcacgtgg gcaaaggcga gccttatacc 480
acaacaaata tcaataaaat gaaagattat ttcattaaac ggattaaaca ctactatgac 540
ggtgagctcc gtcgtcgtcg ccgtcggcgt cgtcgttaa 579
<210> 55
<211> 591
<212> DNA
<213> Artificial
<220>
<223> Polynculeotide
<400> 55
atgctgaaac atatttactc caaccacatt aaaggtaaca aaatcacagc ccctaaaccg 60
tcaattcagg gcgtggtgat ccacaacgat tatggctcaa tgaccccttc acagtacctg 120
ccttggctgt acgctcgcga aaacaacggt acacatgtga atggctgggc ctcagtgtat 180
gccaatcgca acgaggtgct gtggtatcat cctacagact acgtggaatg gcactgcggc 240
aaccaatggg ccaacgccaa cctgatcggc tttgaagttt gcgaatcata tcctggtcgc 300
atctcagaca aactgtttct ggaaaacgag gaagccacac tgaaagtagc tgccgacgtg 360
atgaaatcgt atggcctgcc tgtgaatcgc aacacagtgc gcctgcacaa cgaatttttc 420
ggtacatcat gccctcatcg ttcatgggac ctgcacgtgg gcaaaggcga gccttatacc 480
acaacaaata tcaataaaat gaaagattat ttcattaaac ggattaaaca ctactatgac 540
ggtgagctcg gtcgtaaaaa acgtcgtcag cgtcgtcgtc cgcctcagta a 591
<210> 56
<211> 924
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 56
atgtccatta tcatggaagt ggccacaatg caagccaaac tgacaaaaaa tgagttcatt 60
gagtggctga aaacgtccga gggtaaacag ttcaacgtgg acctgtggta cggttttcag 120
tgtttcgact acgccaacgc tggctggaaa gtgctgttcg gcctgctgct gaaaggcctg 180
ggagccaaag acatcccttt tgcaaacaat ttcgatggcc tggccacagt ttatcaaaac 240
acccctgact ttctggccca accaggcgac atggtggtgt ttggttctaa ttatggcgca 300
ggctatggcc acgtagcctg ggtgatcgaa gccacactgg actacattat tgtttatgag 360
caaaactggc tgggaggcgg atggacagac ggcatcgaac agcctggctg gggctgggag 420
aaagtgacac gccgtcaaca tgcctatgac ttccctatgt ggttcatccg tcctaatttc 480
aaaggtggta aactggaagt tagcaaagca gcaaccatta aacagtccga tgttaaacaa 540
gaagtgaaaa aacaagaggc caaacaaatt gtgaaagcca ccgattggaa acagaacaaa 600
gatggcattt ggtataaagc agaacatgcc agctttaccg ttaccgcacc ggaaggcatt 660
attacccgtt ataaaggtcc gtggaccggt catccgcagg caggcgtact gcagaaaggt 720
cagaccatta aatacgatga agtgcagaaa tttgatggcc atgtttgggt tagctgggaa 780
acctttgaag gtgaaaccgt ttatatgccg gttcgtacct gggatgcaaa aaccggtaaa 840
gtgggcaaac tgtggggtga aatcaaagag ctccgccaga tcaaaatttg gtttcagaat 900
cgtcgcatga aatggaaaaa ataa 924
<210> 57
<211> 903
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 57
atgtccatta tcatggaagt ggccacaatg caagccaaac tgacaaaaaa tgagttcatt 60
gagtggctga aaacgtccga gggtaaacag ttcaacgtgg acctgtggta cggttttcag 120
tgtttcgact acgccaacgc tggctggaaa gtgctgttcg gcctgctgct gaaaggcctg 180
ggagccaaag acatcccttt tgcaaacaat ttcgatggcc tggccacagt ttatcaaaac 240
acccctgact ttctggccca accaggcgac atggtggtgt ttggttctaa ttatggcgca 300
ggctatggcc acgtagcctg ggtgatcgaa gccacactgg actacattat tgtttatgag 360
caaaactggc tgggaggcgg atggacagac ggcatcgaac agcctggctg gggctgggag 420
aaagtgacac gccgtcaaca tgcctatgac ttccctatgt ggttcatccg tcctaatttc 480
aaaggtggta aactggaagt tagcaaagca gcaaccatta aacagtccga tgttaaacaa 540
gaagtgaaaa aacaagaggc caaacaaatt gtgaaagcca ccgattggaa acagaacaaa 600
gatggcattt ggtataaagc agaacatgcc agctttaccg ttaccgcacc ggaaggcatt 660
attacccgtt ataaaggtcc gtggaccggt catccgcagg caggcgtact gcagaaaggt 720
cagaccatta aatacgatga agtgcagaaa tttgatggcc atgtttgggt tagctgggaa 780
acctttgaag gtgaaaccgt ttatatgccg gttcgtacct gggatgcaaa aaccggtaaa 840
gtgggcaaac tgtggggtga aatcaaagag ctccgtcgtc gtcgccgtcg gcgtcgtcgt 900
taa 903
<210> 58
<211> 915
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 58
atgtccatta tcatggaagt ggccacaatg caagccaaac tgacaaaaaa tgagttcatt 60
gagtggctga aaacgtccga gggtaaacag ttcaacgtgg acctgtggta cggttttcag 120
tgtttcgact acgccaacgc tggctggaaa gtgctgttcg gcctgctgct gaaaggcctg 180
ggagccaaag acatcccttt tgcaaacaat ttcgatggcc tggccacagt ttatcaaaac 240
acccctgact ttctggccca accaggcgac atggtggtgt ttggttctaa ttatggcgca 300
ggctatggcc acgtagcctg ggtgatcgaa gccacactgg actacattat tgtttatgag 360
caaaactggc tgggaggcgg atggacagac ggcatcgaac agcctggctg gggctgggag 420
aaagtgacac gccgtcaaca tgcctatgac ttccctatgt ggttcatccg tcctaatttc 480
aaaggtggta aactggaagt tagcaaagca gcaaccatta aacagtccga tgttaaacaa 540
gaagtgaaaa aacaagaggc caaacaaatt gtgaaagcca ccgattggaa acagaacaaa 600
gatggcattt ggtataaagc agaacatgcc agctttaccg ttaccgcacc ggaaggcatt 660
attacccgtt ataaaggtcc gtggaccggt catccgcagg caggcgtact gcagaaaggt 720
cagaccatta aatacgatga agtgcagaaa tttgatggcc atgtttgggt tagctgggaa 780
acctttgaag gtgaaaccgt ttatatgccg gttcgtacct gggatgcaaa aaccggtaaa 840
gtgggcaaac tgtggggtga aatcaaagag ctcggtcgta aaaaacgtcg tcagcgtcgt 900
cgtccgcctc agtaa 915
<210> 59
<211> 540
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 59
atgtccatta tcatggaagt ggccacaatg caagccaaac tgacaaaaaa tgagttcatt 60
gagtggctga aaacgtccga gggtaaacag ttcaacgtgg acctgtggta cggttttcag 120
tgtttcgact acgccaacgc tggctggaaa gtgctgttcg gcctgctgct gaaaggcctg 180
ggagccaaag acatcccttt tgcaaacaat ttcgatggcc tggccacagt ttatcaaaac 240
acccctgact ttctggccca accaggcgac atggtggtgt ttggttctaa ttatggcgca 300
ggctatggcc acgtagcctg ggtgatcgaa gccacactgg actacattat tgtttatgag 360
caaaactggc tgggaggcgg atggacagac ggcatcgaac agcctggctg gggctgggag 420
aaagtgacac gccgtcaaca tgcctatgac ttccctatgt ggttcatccg tcctaatttc 480
aaagagctcc gccagatcaa aatttggttt cagaatcgtc gcatgaaatg gaaaaaataa 540
<210> 60
<211> 519
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 60
atgtccatta tcatggaagt ggccacaatg caagccaaac tgacaaaaaa tgagttcatt 60
gagtggctga aaacgtccga gggtaaacag ttcaacgtgg acctgtggta cggttttcag 120
tgtttcgact acgccaacgc tggctggaaa gtgctgttcg gcctgctgct gaaaggcctg 180
ggagccaaag acatcccttt tgcaaacaat ttcgatggcc tggccacagt ttatcaaaac 240
acccctgact ttctggccca accaggcgac atggtggtgt ttggttctaa ttatggcgca 300
ggctatggcc acgtagcctg ggtgatcgaa gccacactgg actacattat tgtttatgag 360
caaaactggc tgggaggcgg atggacagac ggcatcgaac agcctggctg gggctgggag 420
aaagtgacac gccgtcaaca tgcctatgac ttccctatgt ggttcatccg tcctaatttc 480
aaagagctcc gtcgtcgtcg ccgtcggcgt cgtcgttaa 519
<210> 61
<211> 531
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 61
atgtccatta tcatggaagt ggccacaatg caagccaaac tgacaaaaaa tgagttcatt 60
gagtggctga aaacgtccga gggtaaacag ttcaacgtgg acctgtggta cggttttcag 120
tgtttcgact acgccaacgc tggctggaaa gtgctgttcg gcctgctgct gaaaggcctg 180
ggagccaaag acatcccttt tgcaaacaat ttcgatggcc tggccacagt ttatcaaaac 240
acccctgact ttctggccca accaggcgac atggtggtgt ttggttctaa ttatggcgca 300
ggctatggcc acgtagcctg ggtgatcgaa gccacactgg actacattat tgtttatgag 360
caaaactggc tgggaggcgg atggacagac ggcatcgaac agcctggctg gggctgggag 420
aaagtgacac gccgtcaaca tgcctatgac ttccctatgt ggttcatccg tcctaatttc 480
aaagagctcg gtcgtaaaaa acgtcgtcag cgtcgtcgtc cgcctcagta a 531
<210> 62
<211> 864
<212> DNA
<213> Artificial
<220>
<223> Polynculeotide
<400> 62
atggcagcca cacatgaaca ctctgcccaa tggctgaaca actacaaaaa aggctacggt 60
tatggccctt accctctggg cattaacggt ggcatgcact acggcgttga cttttttatg 120
aacatcggca cccctgtgaa agccattagc tcaggcaaaa tcgtggaagc cggttggtca 180
aactatggcg gtggcaacca gatcggtctg atcgagaacg atggtgtgca ccgccaatgg 240
tacatgcacc tgtccaaata caacgttaaa gttggtgact acgtgaaagc aggccagatt 300
atcggctggt caggttcaac cggttattca acagcccctc atctgcactt ccaacgcatg 360
gtgaatagtt ttagtaattc taccgctcaa gatccgatgc cattcctgaa atctgccggt 420
tatggtggca aactggaagt tagcaaagca gcaaccatta aacagtccga tgttaaacaa 480
gaagtgaaaa aacaagaggc caaacaaatt gtgaaagcga ccgattggaa acagaacaaa 540
gatggcattt ggtataaagc agaacatgcc agctttaccg tgaccgcacc ggaaggcatt 600
attacccgtt ataaaggtcc gtggaccggt catccgcagg caggcgtgct gcagaaaggt 660
cagaccatca aatatgatga ggtgcagaaa tttgatggcc atgtttgggt tagctgggaa 720
acctttgaag gtgaaaccgt ttatatgccg gttcgtacct gggatgcaaa aaccggtaaa 780
gtgggtaaac tgtggggtga aatcaaagag ctccgccaga tcaaaatttg gtttcagaat 840
cgtcgcatga aatggaaaaa ataa 864
<210> 63
<211> 843
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 63
atggcagcca cacatgaaca ctctgcccaa tggctgaaca actacaaaaa aggctacggt 60
tatggccctt accctctggg cattaacggt ggcatgcact acggcgttga cttttttatg 120
aacatcggca cccctgtgaa agccattagc tcaggcaaaa tcgtggaagc cggttggtca 180
aactatggcg gtggcaacca gatcggtctg atcgagaacg atggtgtgca ccgccaatgg 240
tacatgcacc tgtccaaata caacgttaaa gttggtgact acgtgaaagc aggccagatt 300
atcggctggt caggttcaac cggttattca acagcccctc atctgcactt ccaacgcatg 360
gtgaatagtt ttagtaattc taccgctcaa gatccgatgc cattcctgaa atctgccggt 420
tatggtggca aactggaagt tagcaaagca gcaaccatta aacagtccga tgttaaacaa 480
gaagtgaaaa aacaagaggc caaacaaatt gtgaaagcga ccgattggaa acagaacaaa 540
gatggcattt ggtataaagc agaacatgcc agctttaccg tgaccgcacc ggaaggcatt 600
attacccgtt ataaaggtcc gtggaccggt catccgcagg caggcgtgct gcagaaaggt 660
cagaccatca aatatgatga ggtgcagaaa tttgatggcc atgtttgggt tagctgggaa 720
acctttgaag gtgaaaccgt ttatatgccg gttcgtacct gggatgcaaa aaccggtaaa 780
gtgggtaaac tgtggggtga aatcaaagag ctccgtcgtc gtcgccgtcg gcgtcgtcgt 840
taa 843
<210> 64
<211> 855
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 64
atggcagcca cacatgaaca ctctgcccaa tggctgaaca actacaaaaa aggctacggt 60
tatggccctt accctctggg cattaacggt ggcatgcact acggcgttga cttttttatg 120
aacatcggca cccctgtgaa agccattagc tcaggcaaaa tcgtggaagc cggttggtca 180
aactatggcg gtggcaacca gatcggtctg atcgagaacg atggtgtgca ccgccaatgg 240
tacatgcacc tgtccaaata caacgttaaa gttggtgact acgtgaaagc aggccagatt 300
atcggctggt caggttcaac cggttattca acagcccctc atctgcactt ccaacgcatg 360
gtgaatagtt ttagtaattc taccgctcaa gatccgatgc cattcctgaa atctgccggt 420
tatggtggca aactggaagt tagcaaagca gcaaccatta aacagtccga tgttaaacaa 480
gaagtgaaaa aacaagaggc caaacaaatt gtgaaagcga ccgattggaa acagaacaaa 540
gatggcattt ggtataaagc agaacatgcc agctttaccg tgaccgcacc ggaaggcatt 600
attacccgtt ataaaggtcc gtggaccggt catccgcagg caggcgtgct gcagaaaggt 660
cagaccatca aatatgatga ggtgcagaaa tttgatggcc atgtttgggt tagctgggaa 720
acctttgaag gtgaaaccgt ttatatgccg gttcgtacct gggatgcaaa aaccggtaaa 780
gtgggtaaac tgtggggtga aatcaaagag ctcggtcgta aaaaacgtcg tcagcgtcgt 840
cgtccgcctc agtaa 855
<210> 65
<211> 483
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 65
atggcagcca cacatgaaca ctctgcccaa tggctgaaca actacaaaaa aggctacggt 60
tatggccctt accctctggg cattaacggt ggcatgcact acggcgttga cttttttatg 120
aacatcggca cccctgtgaa agccattagc tcaggcaaaa tcgtggaagc cggttggtca 180
aactatggcg gtggcaacca gatcggtctg atcgagaacg atggtgtgca ccgccaatgg 240
tacatgcacc tgtccaaata caacgttaaa gttggtgact acgtgaaagc aggccagatt 300
atcggctggt caggttcaac cggttattca acagcccctc atctgcactt ccaacgcatg 360
gtgaatagtt ttagtaattc taccgctcaa gatccgatgc cattcctgaa atctgccggt 420
tatggtgagc tccgccagat caaaatttgg tttcagaatc gtcgcatgaa atggaaaaaa 480
taa 483
<210> 66
<211> 462
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 66
atggcagcca cacatgaaca ctctgcccaa tggctgaaca actacaaaaa aggctacggt 60
tatggccctt accctctggg cattaacggt ggcatgcact acggcgttga cttttttatg 120
aacatcggca cccctgtgaa agccattagc tcaggcaaaa tcgtggaagc cggttggtca 180
aactatggcg gtggcaacca gatcggtctg atcgagaacg atggtgtgca ccgccaatgg 240
tacatgcacc tgtccaaata caacgttaaa gttggtgact acgtgaaagc aggccagatt 300
atcggctggt caggttcaac cggttattca acagcccctc atctgcactt ccaacgcatg 360
gtgaatagtt ttagtaattc taccgctcaa gatccgatgc cattcctgaa atctgccggt 420
tatggtgagc tccgtcgtcg tcgccgtcgg cgtcgtcgtt aa 462
<210> 67
<211> 474
<212> DNA
<213> Artificial
<220>
<223> Polynucleotide
<400> 67
atggcagcca cacatgaaca ctctgcccaa tggctgaaca actacaaaaa aggctacggt 60
tatggccctt accctctggg cattaacggt ggcatgcact acggcgttga cttttttatg 120
aacatcggca cccctgtgaa agccattagc tcaggcaaaa tcgtggaagc cggttggtca 180
aactatggcg gtggcaacca gatcggtctg atcgagaacg atggtgtgca ccgccaatgg 240
tacatgcacc tgtccaaata caacgttaaa gttggtgact acgtgaaagc aggccagatt 300
atcggctggt caggttcaac cggttattca acagcccctc atctgcactt ccaacgcatg 360
gtgaatagtt ttagtaattc taccgctcaa gatccgatgc cattcctgaa atctgccggt 420
tatggtgagc tcggtcgtaa aaaacgtcgt cagcgtcgtc gtccgcctca gtaa 474
<210> 68
<400> 68
000
<210> 69
<400> 69
000
<210> 70
<211> 37
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 70
Leu Leu Gly Asp Phe Phe Arg Lys Ser Lys Glu Lys Ile Gly Lys Glu
1 5 10 15
Phe Lys Arg Ile Val Gln Arg Ile Lys Asp Phe Leu Arg Asn Leu Val
20 25 30
Pro Arg Thr Glu Ser
35
<210> 71
<211> 29
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 71
Arg Gly Leu Arg Arg Leu Gly Arg Lys Ile Ala His Gly Val Lys Lys
1 5 10 15
Tyr Gly Pro Thr Val Leu Arg Ile Ile Arg Ile Ala Gly
20 25
<210> 72
<211> 13
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 72
Ile Leu Pro Trp Lys Trp Pro Trp Trp Pro Trp Arg Arg
1 5 10
<210> 73
<211> 18
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 73
Arg Gly Gly Arg Leu Cys Tyr Cys Arg Arg Arg Phe Cys Val Cys Val
1 5 10 15
Gly Arg
<210> 74
<211> 31
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 74
Ser Trp Leu Ser Lys Thr Ala Lys Lys Leu Glu Asn Ser Ala Lys Lys
1 5 10 15
Arg Ile Ser Glu Gly Ile Ala Ile Ala Ile Gln Gly Gly Pro Arg
20 25 30
<210> 75
<211> 23
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 75
Gly Ile Gly Lys Phe Leu His Ser Ala Lys Lys Phe Gly Lys Ala Phe
1 5 10 15
Val Gly Glu Ile Met Asn Ser
20
<210> 76
<211> 25
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 76
Gly Trp Gly Ser Phe Phe Lys Lys Ala Ala His Val Gly Lys His Val
1 5 10 15
Gly Lys Ala Ala Leu Thr His Tyr Leu
20 25
<210> 77
<211> 36
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 77
Gly Gly Leu Lys Lys Leu Gly Lys Lys Leu Glu Gly Ala Gly Lys Arg
1 5 10 15
Val Phe Asn Ala Ala Glu Lys Ala Leu Pro Val Val Ala Gly Ala Lys
20 25 30
Ala Leu Arg Lys
35
<210> 78
<211> 40
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 78
Gly Trp Leu Lys Lys Ile Gly Lys Lys Ile Glu Arg Val Gly Gln His
1 5 10 15
Thr Arg Asp Ala Thr Ile Gln Gly Leu Gly Ile Pro Gln Gln Ala Ala
20 25 30
Asn Val Ala Ala Thr Ala Arg Gly
35 40
<210> 79
<211> 21
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 79
Thr Arg Ser Ser Arg Ala Gly Leu Gln Phe Pro Val Gly Arg Val His
1 5 10 15
Arg Leu Leu Arg Lys
20
<210> 80
<211> 39
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 80
Gly Trp Leu Lys Lys Ile Gly Lys Lys Ile Glu Arg Val Gly Gln His
1 5 10 15
Thr Arg Asp Ala Thr Ile Gln Gly Leu Gly Ile Ala Gln Gln Ala Ala
20 25 30
Asn Val Ala Ala Thr Ala Arg
35
<210> 81
<211> 24
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 81
Gly Ile Lys Asp Trp Ile Lys Gly Ala Ala Lys Lys Leu Ile Lys Thr
1 5 10 15
Val Ala Ser His Ile Ala Asn Gln
20
<210> 82
<211> 17
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragement
<400> 82
Ala Asn Arg Pro Val Tyr Ile Pro Pro Pro Arg Pro Pro His Pro Arg
1 5 10 15
Leu
<210> 83
<211> 22
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 83
Gly Leu Leu Ser Lys Val Leu Gly Val Gly Lys Lys Val Leu Cys Gly
1 5 10 15
Val Ser Gly Leu Val Cys
20
<210> 84
<211> 24
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 84
Gly Leu Asn Thr Leu Lys Lys Val Phe Gln Gly Leu His Glu Ala Ile
1 5 10 15
Lys Leu Ile Asn Asn His Val Gln
20
<210> 85
<211> 19
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 85
Lys Gly Arg Gly Lys Gln Gly Gly Lys Val Arg Ala Lys Ala Lys Thr
1 5 10 15
Arg Ser Ser
<210> 86
<211> 25
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 86
Ile Trp Leu Thr Ala Leu Lys Phe Leu Gly Lys His Ala Ala Lys Lys
1 5 10 15
Leu Ala Lys Gln Gln Leu Ser Lys Leu
20 25
<210> 87
<211> 18
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 87
Phe Leu Gly Gly Leu Ile Val Pro Ala Met Ile Cys Ala Val Thr Lys
1 5 10 15
Lys Cys
<210> 88
<211> 26
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 88
Gly Ile Gly Ala Val Leu Lys Val Leu Thr Thr Gly Leu Pro Ala Leu
1 5 10 15
Ile Ser Trp Ile Lys Arg Lys Arg Gln Gln
20 25
<210> 89
<211> 54
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 89
Lys Thr Tyr Tyr Gly Thr Asn Gly Val His Cys Thr Lys Asn Ser Leu
1 5 10 15
Trp Gly Lys Val Arg Leu Lys Asn Met Lys Tyr Asp Gln Asn Thr Thr
20 25 30
Tyr Met Gly Arg Leu Gln Asp Ile Leu Leu Gly Trp Ala Thr Gly Ala
35 40 45
Phe Gly Lys Thr Phe His
50
<210> 90
<211> 39
<212> PRT
<213> Artificial
<220>
<223> Polypeptide fragment
<400> 90
Ala Gly Arg Gly Lys Gln Gly Gly Lys Val Arg Ala Lys Ala Lys Thr
1 5 10 15
Arg Ser Ser Arg Ala Gly Leu Gln Phe Pro Val Gly Arg Val His Arg
20 25 30
Leu Leu Arg Lys Gly Asn Tyr
35
<210> 91
<211> 6628
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 91
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gctgaaacat 120
atttactcca accacattaa aggtaacaaa atcacagccc ctaaaccgtc aattcagggc 180
gtggtgatcc acaacgatta tggctcaatg accccttcac agtacctgcc ttggctgtac 240
gctcgcgaaa acaacggtac acatgtgaat ggctgggcct cagtgtatgc caatcgcaac 300
gaggtgctgt ggtatcatcc tacagactac gtggaatggc actgcggcaa ccaatgggcc 360
aacgccaacc tgatcggctt tgaagtttgc gaatcatatc ctggtcgcat ctcagacaaa 420
ctgtttctgg aaaacgagga agccacactg aaagtagctg ccgacgtgat gaaatcgtat 480
ggcctgcctg tgaatcgcaa cacagtgcgc ctgcacaacg aatttttcgg tacatcatgc 540
cctcatcgtt catgggacct gcacgtgggc aaaggcgagc cttataccac aacaaatatc 600
aataaaatga aagattattt cattaaacgg attaaacact actatgacgg tggcaaactg 660
gaagttagca aagcagccac cattaaacag agtgatgtta aacaagaagt gaaaaaacaa 720
gaggccaaac aaattgtgaa agccaccgat tggaaacaga acaaagatgg catctggtat 780
aaagcagaac atgccagctt taccgtgacc gcaccggaag gcattattac ccgttataaa 840
ggtccgtgga ccggtcatcc gcaggcaggc gtgctgcaga aaggtcagac catcaaatat 900
gatgaagtgc agaaatttga tggccatgtt tgggttagct gggaaacctt tgaaggtgaa 960
accgtttata tgccggttcg tacctgggat gcaaaaaccg gtaaagttgg taaactgtgg 1020
ggtgagatta aagagctccg ccagatcaaa atttggtttc agaatcgtcg catgaaatgg 1080
aaaaaataag gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac 1140
cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 1200
gctgaaagga ggaactatat ccggatatcc cgcaagaggc ccggcagtac cggcataacc 1260
aagcctatgc ctacagcatc cagggtgacg gtgccgagga tgacgatgag cgcattgtta 1320
gatttcatac acggtgcctg actgcgttag caatttaact gtgataaact accgcattaa 1380
agctagctta tcgatgataa gctgtcaaac atgagaatta attcttgaag acgaaagggc 1440
ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca 1500
ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat 1560
tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa 1620
aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt 1680
tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag 1740
ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt 1800
tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg 1860
gtattatccc gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag 1920
aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta 1980
agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg 2040
acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta 2100
actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac 2160
accacgatgc ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt 2220
actctagctt cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca 2280
cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag 2340
cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta 2400
gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag 2460
ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt 2520
tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat 2580
aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 2640
gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 2700
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt 2760
tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag 2820
ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta 2880
atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca 2940
agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 3000
cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 3060
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 3120
acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 3180
gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 3240
ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt 3300
gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt 3360
gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 3420
gaagcggaag agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac 3480
cgcaatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac 3540
tccgctatcg ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga 3600
cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 3660
cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga ggcagctgcg 3720
gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg tctgcctgtt catccgcgtc 3780
cagctcgttg agtttctcca gaagcgttaa tgtctggctt ctgataaagc gggccatgtt 3840
aagggcggtt ttttcctgtt tggtcactga tgcctccgtg taagggggat ttctgttcat 3900
gggggtaatg ataccgatga aacgagagag gatgctcacg atacgggtta ctgatgatga 3960
acatgcccgg ttactggaac gttgtgaggg taaacaactg gcggtatgga tgcggcggga 4020
ccagagaaaa atcactcagg gtcaatgcca gcgcttcgtt aatacagatg taggtgttcc 4080
acagggtagc cagcagcatc ctgcgatgca gatccggaac ataatggtgc agggcgctga 4140
cttccgcgtt tccagacttt acgaaacacg gaaaccgaag accattcatg ttgttgctca 4200
ggtcgcagac gttttgcagc agcagtcgct tcacgttcgc tcgcgtatcg gtgattcatt 4260
ctgctaacca gtaaggcaac cccgccagcc tagccgggtc ctcaacgaca ggagcacgat 4320
catgcgcacc cgtggccagg acccaacgct gcccgagatg cgccgcgtgc ggctgctgga 4380
gatggcggac gcgatggata tgttctgcca agggttggtt tgcgcattca cagttctccg 4440
caagaattga ttggctccaa ttcttggagt ggtgaatccg ttagcgaggt gccgccggct 4500
tccattcagg tcgaggtggc ccggctccat gcaccgcgac gcaacgcggg gaggcagaca 4560
aggtataggg cggcgcctac aatccatgcc aacccgttcc atgtgctcgc cgaggcggca 4620
taaatcgccg tgacgatcag cggtccaatg atcgaagtta ggctggtaag agccgcgagc 4680
gatccttgaa gctgtccctg atggtcgtca tctacctgcc tggacagcat ggcctgcaac 4740
gcgggcatcc cgatgccgcc ggaagcgaga agaatcataa tggggaaggc catccagcct 4800
cgcgtcgcga acgccagcaa gacgtagccc agcgcgtcgg ccgccatgcc ggcgataatg 4860
gcctgcttct cgccgaaacg tttggtggcg ggaccagtga cgaaggcttg agcgagggcg 4920
tgcaagattc cgaataccgc aagcgacagg ccgatcatcg tcgcgctcca gcgaaagcgg 4980
tcctcgccga aaatgaccca gagcgctgcc ggcacctgtc ctacgagttg catgataaag 5040
aagacagtca taagtgcggc gacgatagtc atgccccgcg cccaccggaa ggagctgact 5100
gggttgaagg ctctcaaggg catcggtcga gatcccggtg cctaatgagt gagctaactt 5160
acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg 5220
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ccagggtggt 5280
ttttcttttc accagtgaga cgggcaacag ctgattgccc ttcaccgcct ggccctgaga 5340
gagttgcagc aagcggtcca cgctggtttg ccccagcagg cgaaaatcct gtttgatggt 5400
ggttaacggc gggatataac atgagctgtc ttcggtatcg tcgtatccca ctaccgagat 5460
atccgcacca acgcgcagcc cggactcggt aatggcgcgc attgcgccca gcgccatctg 5520
atcgttggca accagcatcg cagtgggaac gatgccctca ttcagcattt gcatggtttg 5580
ttgaaaaccg gacatggcac tccagtcgcc ttcccgttcc gctatcggct gaatttgatt 5640
gcgagtgaga tatttatgcc agccagccag acgcagacgc gccgagacag aacttaatgg 5700
gcccgctaac agcgcgattt gctggtgacc caatgcgacc agatgctcca cgcccagtcg 5760
cgtaccgtct tcatgggaga aaataatact gttgatgggt gtctggtcag agacatcaag 5820
aaataacgcc ggaacattag tgcaggcagc ttccacagca atggcatcct ggtcatccag 5880
cggatagtta atgatcagcc cactgacgcg ttgcgcgaga agattgtgca ccgccgcttt 5940
acaggcttcg acgccgcttc gttctaccat cgacaccacc acgctggcac ccagttgatc 6000
ggcgcgagat ttaatcgccg cgacaatttg cgacggcgcg tgcagggcca gactggaggt 6060
ggcaacgcca atcagcaacg actgtttgcc cgccagttgt tgtgccacgc ggttgggaat 6120
gtaattcagc tccgccatcg ccgcttccac tttttcccgc gttttcgcag aaacgtggct 6180
ggcctggttc accacgcggg aaacggtctg ataagagaca ccggcatact ctgcgacatc 6240
gtataacgtt actggtttca cattcaccac cctgaattga ctctcttccg ggcgctatca 6300
tgccataccg cgaaaggttt tgcgccattc gatggtgtcc gggatctcga cgctctccct 6360
tatgcgactc ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg 6420
ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg 6480
ccaccatacc cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc 6540
catcggtgat gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg 6600
ccacgatgcg tccggcgtag aggatcga 6628
<210> 92
<211> 6607
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 92
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gctgaaacat 120
atttactcca accacattaa aggtaacaaa atcacagccc ctaaaccgtc aattcagggc 180
gtggtgatcc acaacgatta tggctcaatg accccttcac agtacctgcc ttggctgtac 240
gctcgcgaaa acaacggtac acatgtgaat ggctgggcct cagtgtatgc caatcgcaac 300
gaggtgctgt ggtatcatcc tacagactac gtggaatggc actgcggcaa ccaatgggcc 360
aacgccaacc tgatcggctt tgaagtttgc gaatcatatc ctggtcgcat ctcagacaaa 420
ctgtttctgg aaaacgagga agccacactg aaagtagctg ccgacgtgat gaaatcgtat 480
ggcctgcctg tgaatcgcaa cacagtgcgc ctgcacaacg aatttttcgg tacatcatgc 540
cctcatcgtt catgggacct gcacgtgggc aaaggcgagc cttataccac aacaaatatc 600
aataaaatga aagattattt cattaaacgg attaaacact actatgacgg tggcaaactg 660
gaagttagca aagcagccac cattaaacag agtgatgtta aacaagaagt gaaaaaacaa 720
gaggccaaac aaattgtgaa agccaccgat tggaaacaga acaaagatgg catctggtat 780
aaagcagaac atgccagctt taccgtgacc gcaccggaag gcattattac ccgttataaa 840
ggtccgtgga ccggtcatcc gcaggcaggc gtgctgcaga aaggtcagac catcaaatat 900
gatgaagtgc agaaatttga tggccatgtt tgggttagct gggaaacctt tgaaggtgaa 960
accgtttata tgccggttcg tacctgggat gcaaaaaccg gtaaagttgg taaactgtgg 1020
ggtgagatta aagagctccg tcgtcgtcgc cgtcggcgtc gtcgttaagg atccggctgc 1080
taacaaagcc cgaaaggaag ctgagttggc tgctgccacc gctgagcaat aactagcata 1140
accccttggg gcctctaaac gggtcttgag gggttttttg ctgaaaggag gaactatatc 1200
cggatatccc gcaagaggcc cggcagtacc ggcataacca agcctatgcc tacagcatcc 1260
agggtgacgg tgccgaggat gacgatgagc gcattgttag atttcataca cggtgcctga 1320
ctgcgttagc aatttaactg tgataaacta ccgcattaaa gctagcttat cgatgataag 1380
ctgtcaaaca tgagaattaa ttcttgaaga cgaaagggcc tcgtgatacg cctattttta 1440
taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat 1500
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 1560
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 1620
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 1680
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 1740
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 1800
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc 1860
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 1920
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 1980
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 2040
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 2100
ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg 2160
gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 2220
ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 2280
gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt 2340
gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 2400
caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 2460
cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 2520
ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 2580
taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 2640
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 2700
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 2760
agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 2820
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 2880
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 2940
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 3000
tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 3060
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 3120
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 3180
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 3240
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg 3300
ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc 3360
cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg 3420
cggtattttc tccttacgca tctgtgcggt atttcacacc gcaatggtgc actctcagta 3480
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 3540
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 3600
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 3660
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 3720
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 3780
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 3840
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 3900
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 3960
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 4020
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 4080
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 4140
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 4200
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 4260
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggccagga 4320
cccaacgctg cccgagatgc gccgcgtgcg gctgctggag atggcggacg cgatggatat 4380
gttctgccaa gggttggttt gcgcattcac agttctccgc aagaattgat tggctccaat 4440
tcttggagtg gtgaatccgt tagcgaggtg ccgccggctt ccattcaggt cgaggtggcc 4500
cggctccatg caccgcgacg caacgcgggg aggcagacaa ggtatagggc ggcgcctaca 4560
atccatgcca acccgttcca tgtgctcgcc gaggcggcat aaatcgccgt gacgatcagc 4620
ggtccaatga tcgaagttag gctggtaaga gccgcgagcg atccttgaag ctgtccctga 4680
tggtcgtcat ctacctgcct ggacagcatg gcctgcaacg cgggcatccc gatgccgccg 4740
gaagcgagaa gaatcataat ggggaaggcc atccagcctc gcgtcgcgaa cgccagcaag 4800
acgtagccca gcgcgtcggc cgccatgccg gcgataatgg cctgcttctc gccgaaacgt 4860
ttggtggcgg gaccagtgac gaaggcttga gcgagggcgt gcaagattcc gaataccgca 4920
agcgacaggc cgatcatcgt cgcgctccag cgaaagcggt cctcgccgaa aatgacccag 4980
agcgctgccg gcacctgtcc tacgagttgc atgataaaga agacagtcat aagtgcggcg 5040
acgatagtca tgccccgcgc ccaccggaag gagctgactg ggttgaaggc tctcaagggc 5100
atcggtcgag atcccggtgc ctaatgagtg agctaactta cattaattgc gttgcgctca 5160
ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc 5220
gcggggagag gcggtttgcg tattgggcgc cagggtggtt tttcttttca ccagtgagac 5280
gggcaacagc tgattgccct tcaccgcctg gccctgagag agttgcagca agcggtccac 5340
gctggtttgc cccagcaggc gaaaatcctg tttgatggtg gttaacggcg ggatataaca 5400
tgagctgtct tcggtatcgt cgtatcccac taccgagata tccgcaccaa cgcgcagccc 5460
ggactcggta atggcgcgca ttgcgcccag cgccatctga tcgttggcaa ccagcatcgc 5520
agtgggaacg atgccctcat tcagcatttg catggtttgt tgaaaaccgg acatggcact 5580
ccagtcgcct tcccgttccg ctatcggctg aatttgattg cgagtgagat atttatgcca 5640
gccagccaga cgcagacgcg ccgagacaga acttaatggg cccgctaaca gcgcgatttg 5700
ctggtgaccc aatgcgacca gatgctccac gcccagtcgc gtaccgtctt catgggagaa 5760
aataatactg ttgatgggtg tctggtcaga gacatcaaga aataacgccg gaacattagt 5820
gcaggcagct tccacagcaa tggcatcctg gtcatccagc ggatagttaa tgatcagccc 5880
actgacgcgt tgcgcgagaa gattgtgcac cgccgcttta caggcttcga cgccgcttcg 5940
ttctaccatc gacaccacca cgctggcacc cagttgatcg gcgcgagatt taatcgccgc 6000
gacaatttgc gacggcgcgt gcagggccag actggaggtg gcaacgccaa tcagcaacga 6060
ctgtttgccc gccagttgtt gtgccacgcg gttgggaatg taattcagct ccgccatcgc 6120
cgcttccact ttttcccgcg ttttcgcaga aacgtggctg gcctggttca ccacgcggga 6180
aacggtctga taagagacac cggcatactc tgcgacatcg tataacgtta ctggtttcac 6240
attcaccacc ctgaattgac tctcttccgg gcgctatcat gccataccgc gaaaggtttt 6300
gcgccattcg atggtgtccg ggatctcgac gctctccctt atgcgactcc tgcattagga 6360
agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca 6420
aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac 6480
aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat 6540
aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga 6600
ggatcga 6607
<210> 93
<211> 6619
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 93
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gctgaaacat 120
atttactcca accacattaa aggtaacaaa atcacagccc ctaaaccgtc aattcagggc 180
gtggtgatcc acaacgatta tggctcaatg accccttcac agtacctgcc ttggctgtac 240
gctcgcgaaa acaacggtac acatgtgaat ggctgggcct cagtgtatgc caatcgcaac 300
gaggtgctgt ggtatcatcc tacagactac gtggaatggc actgcggcaa ccaatgggcc 360
aacgccaacc tgatcggctt tgaagtttgc gaatcatatc ctggtcgcat ctcagacaaa 420
ctgtttctgg aaaacgagga agccacactg aaagtagctg ccgacgtgat gaaatcgtat 480
ggcctgcctg tgaatcgcaa cacagtgcgc ctgcacaacg aatttttcgg tacatcatgc 540
cctcatcgtt catgggacct gcacgtgggc aaaggcgagc cttataccac aacaaatatc 600
aataaaatga aagattattt cattaaacgg attaaacact actatgacgg tggcaaactg 660
gaagttagca aagcagccac cattaaacag agtgatgtta aacaagaagt gaaaaaacaa 720
gaggccaaac aaattgtgaa agccaccgat tggaaacaga acaaagatgg catctggtat 780
aaagcagaac atgccagctt taccgtgacc gcaccggaag gcattattac ccgttataaa 840
ggtccgtgga ccggtcatcc gcaggcaggc gtgctgcaga aaggtcagac catcaaatat 900
gatgaagtgc agaaatttga tggccatgtt tgggttagct gggaaacctt tgaaggtgaa 960
accgtttata tgccggttcg tacctgggat gcaaaaaccg gtaaagttgg taaactgtgg 1020
ggtgagatta aagagctcgg tcgtaaaaaa cgtcgtcagc gtcgtcgtcc gcctcagtaa 1080
ggatccggct gctaacaaag cccgaaagga agctgagttg gctgctgcca ccgctgagca 1140
ataactagca taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg 1200
aggaactata tccggatatc ccgcaagagg cccggcagta ccggcataac caagcctatg 1260
cctacagcat ccagggtgac ggtgccgagg atgacgatga gcgcattgtt agatttcata 1320
cacggtgcct gactgcgtta gcaatttaac tgtgataaac taccgcatta aagctagctt 1380
atcgatgata agctgtcaaa catgagaatt aattcttgaa gacgaaaggg cctcgtgata 1440
cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact 1500
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 1560
tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt 1620
atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct 1680
gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca 1740
cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc 1800
gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc 1860
cgtgttgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg 1920
gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta 1980
tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc 2040
ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt 2100
gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg 2160
cctgcagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct 2220
tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc 2280
tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct 2340
cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac 2400
acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc 2460
tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat 2520
ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg 2580
accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc 2640
aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa 2700
ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag 2760
gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta 2820
ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta 2880
ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag 2940
ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg 3000
gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg 3060
cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag 3120
cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc 3180
cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa 3240
aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg 3300
ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct 3360
gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa 3420
gagcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcaatggt 3480
gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc 3540
gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg 3600
acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg 3660
catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg aggcagctgc ggtaaagctc 3720
atcagcgtgg tcgtgaagcg attcacagat gtctgcctgt tcatccgcgt ccagctcgtt 3780
gagtttctcc agaagcgtta atgtctggct tctgataaag cgggccatgt taagggcggt 3840
tttttcctgt ttggtcactg atgcctccgt gtaaggggga tttctgttca tgggggtaat 3900
gataccgatg aaacgagaga ggatgctcac gatacgggtt actgatgatg aacatgcccg 3960
gttactggaa cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa 4020
aatcactcag ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag 4080
ccagcagcat cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt 4140
ttccagactt tacgaaacac ggaaaccgaa gaccattcat gttgttgctc aggtcgcaga 4200
cgttttgcag cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat tctgctaacc 4260
agtaaggcaa ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac 4320
ccgtggccag gacccaacgc tgcccgagat gcgccgcgtg cggctgctgg agatggcgga 4380
cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg 4440
attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag 4500
gtcgaggtgg cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg 4560
gcggcgccta caatccatgc caacccgttc catgtgctcg ccgaggcggc ataaatcgcc 4620
gtgacgatca gcggtccaat gatcgaagtt aggctggtaa gagccgcgag cgatccttga 4680
agctgtccct gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc 4740
ccgatgccgc cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg 4800
aacgccagca agacgtagcc cagcgcgtcg gccgccatgc cggcgataat ggcctgcttc 4860
tcgccgaaac gtttggtggc gggaccagtg acgaaggctt gagcgagggc gtgcaagatt 4920
ccgaataccg caagcgacag gccgatcatc gtcgcgctcc agcgaaagcg gtcctcgccg 4980
aaaatgaccc agagcgctgc cggcacctgt cctacgagtt gcatgataaa gaagacagtc 5040
ataagtgcgg cgacgatagt catgccccgc gcccaccgga aggagctgac tgggttgaag 5100
gctctcaagg gcatcggtcg agatcccggt gcctaatgag tgagctaact tacattaatt 5160
gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga 5220
atcggccaac gcgcggggag aggcggtttg cgtattgggc gccagggtgg tttttctttt 5280
caccagtgag acgggcaaca gctgattgcc cttcaccgcc tggccctgag agagttgcag 5340
caagcggtcc acgctggttt gccccagcag gcgaaaatcc tgtttgatgg tggttaacgg 5400
cgggatataa catgagctgt cttcggtatc gtcgtatccc actaccgaga tatccgcacc 5460
aacgcgcagc ccggactcgg taatggcgcg cattgcgccc agcgccatct gatcgttggc 5520
aaccagcatc gcagtgggaa cgatgccctc attcagcatt tgcatggttt gttgaaaacc 5580
ggacatggca ctccagtcgc cttcccgttc cgctatcggc tgaatttgat tgcgagtgag 5640
atatttatgc cagccagcca gacgcagacg cgccgagaca gaacttaatg ggcccgctaa 5700
cagcgcgatt tgctggtgac ccaatgcgac cagatgctcc acgcccagtc gcgtaccgtc 5760
ttcatgggag aaaataatac tgttgatggg tgtctggtca gagacatcaa gaaataacgc 5820
cggaacatta gtgcaggcag cttccacagc aatggcatcc tggtcatcca gcggatagtt 5880
aatgatcagc ccactgacgc gttgcgcgag aagattgtgc accgccgctt tacaggcttc 5940
gacgccgctt cgttctacca tcgacaccac cacgctggca cccagttgat cggcgcgaga 6000
tttaatcgcc gcgacaattt gcgacggcgc gtgcagggcc agactggagg tggcaacgcc 6060
aatcagcaac gactgtttgc ccgccagttg ttgtgccacg cggttgggaa tgtaattcag 6120
ctccgccatc gccgcttcca ctttttcccg cgttttcgca gaaacgtggc tggcctggtt 6180
caccacgcgg gaaacggtct gataagagac accggcatac tctgcgacat cgtataacgt 6240
tactggtttc acattcacca ccctgaattg actctcttcc gggcgctatc atgccatacc 6300
gcgaaaggtt ttgcgccatt cgatggtgtc cgggatctcg acgctctccc ttatgcgact 6360
cctgcattag gaagcagccc agtagtaggt tgaggccgtt gagcaccgcc gccgcaagga 6420
atggtgcatg caaggagatg gcgcccaaca gtcccccggc cacggggcct gccaccatac 6480
ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga 6540
tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgccg gccacgatgc 6600
gtccggcgta gaggatcga 6619
<210> 94
<211> 6247
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 94
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gctgaaacat 120
atttactcca accacattaa aggtaacaaa atcacagccc ctaaaccgtc aattcagggc 180
gtggtgatcc acaacgatta tggctcaatg accccttcac agtacctgcc ttggctgtac 240
gctcgcgaaa acaacggtac acatgtgaat ggctgggcct cagtgtatgc caatcgcaac 300
gaggtgctgt ggtatcatcc tacagactac gtggaatggc actgcggcaa ccaatgggcc 360
aacgccaacc tgatcggctt tgaagtttgc gaatcatatc ctggtcgcat ctcagacaaa 420
ctgtttctgg aaaacgagga agccacactg aaagtagctg ccgacgtgat gaaatcgtat 480
ggcctgcctg tgaatcgcaa cacagtgcgc ctgcacaacg aatttttcgg tacatcatgc 540
cctcatcgtt catgggacct gcacgtgggc aaaggcgagc cttataccac aacaaatatc 600
aataaaatga aagattattt cattaaacgg attaaacact actatgacgg tgagctccgc 660
cagatcaaaa tttggtttca gaatcgtcgc atgaaatgga aaaaataagg atccggctgc 720
taacaaagcc cgaaaggaag ctgagttggc tgctgccacc gctgagcaat aactagcata 780
accccttggg gcctctaaac gggtcttgag gggttttttg ctgaaaggag gaactatatc 840
cggatatccc gcaagaggcc cggcagtacc ggcataacca agcctatgcc tacagcatcc 900
agggtgacgg tgccgaggat gacgatgagc gcattgttag atttcataca cggtgcctga 960
ctgcgttagc aatttaactg tgataaacta ccgcattaaa gctagcttat cgatgataag 1020
ctgtcaaaca tgagaattaa ttcttgaaga cgaaagggcc tcgtgatacg cctattttta 1080
taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat 1140
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 1200
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 1260
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 1320
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 1380
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 1440
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc 1500
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 1560
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 1620
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 1680
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 1740
ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg 1800
gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 1860
ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 1920
gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt 1980
gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 2040
caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 2100
cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 2160
ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 2220
taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 2280
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 2340
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 2400
agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 2460
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 2520
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 2580
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 2640
tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 2700
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 2760
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 2820
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 2880
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg 2940
ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc 3000
cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg 3060
cggtattttc tccttacgca tctgtgcggt atttcacacc gcaatggtgc actctcagta 3120
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 3180
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 3240
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 3300
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 3360
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 3420
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 3480
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 3540
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 3600
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 3660
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 3720
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3780
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3840
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3900
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggccagga 3960
cccaacgctg cccgagatgc gccgcgtgcg gctgctggag atggcggacg cgatggatat 4020
gttctgccaa gggttggttt gcgcattcac agttctccgc aagaattgat tggctccaat 4080
tcttggagtg gtgaatccgt tagcgaggtg ccgccggctt ccattcaggt cgaggtggcc 4140
cggctccatg caccgcgacg caacgcgggg aggcagacaa ggtatagggc ggcgcctaca 4200
atccatgcca acccgttcca tgtgctcgcc gaggcggcat aaatcgccgt gacgatcagc 4260
ggtccaatga tcgaagttag gctggtaaga gccgcgagcg atccttgaag ctgtccctga 4320
tggtcgtcat ctacctgcct ggacagcatg gcctgcaacg cgggcatccc gatgccgccg 4380
gaagcgagaa gaatcataat ggggaaggcc atccagcctc gcgtcgcgaa cgccagcaag 4440
acgtagccca gcgcgtcggc cgccatgccg gcgataatgg cctgcttctc gccgaaacgt 4500
ttggtggcgg gaccagtgac gaaggcttga gcgagggcgt gcaagattcc gaataccgca 4560
agcgacaggc cgatcatcgt cgcgctccag cgaaagcggt cctcgccgaa aatgacccag 4620
agcgctgccg gcacctgtcc tacgagttgc atgataaaga agacagtcat aagtgcggcg 4680
acgatagtca tgccccgcgc ccaccggaag gagctgactg ggttgaaggc tctcaagggc 4740
atcggtcgag atcccggtgc ctaatgagtg agctaactta cattaattgc gttgcgctca 4800
ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc 4860
gcggggagag gcggtttgcg tattgggcgc cagggtggtt tttcttttca ccagtgagac 4920
gggcaacagc tgattgccct tcaccgcctg gccctgagag agttgcagca agcggtccac 4980
gctggtttgc cccagcaggc gaaaatcctg tttgatggtg gttaacggcg ggatataaca 5040
tgagctgtct tcggtatcgt cgtatcccac taccgagata tccgcaccaa cgcgcagccc 5100
ggactcggta atggcgcgca ttgcgcccag cgccatctga tcgttggcaa ccagcatcgc 5160
agtgggaacg atgccctcat tcagcatttg catggtttgt tgaaaaccgg acatggcact 5220
ccagtcgcct tcccgttccg ctatcggctg aatttgattg cgagtgagat atttatgcca 5280
gccagccaga cgcagacgcg ccgagacaga acttaatggg cccgctaaca gcgcgatttg 5340
ctggtgaccc aatgcgacca gatgctccac gcccagtcgc gtaccgtctt catgggagaa 5400
aataatactg ttgatgggtg tctggtcaga gacatcaaga aataacgccg gaacattagt 5460
gcaggcagct tccacagcaa tggcatcctg gtcatccagc ggatagttaa tgatcagccc 5520
actgacgcgt tgcgcgagaa gattgtgcac cgccgcttta caggcttcga cgccgcttcg 5580
ttctaccatc gacaccacca cgctggcacc cagttgatcg gcgcgagatt taatcgccgc 5640
gacaatttgc gacggcgcgt gcagggccag actggaggtg gcaacgccaa tcagcaacga 5700
ctgtttgccc gccagttgtt gtgccacgcg gttgggaatg taattcagct ccgccatcgc 5760
cgcttccact ttttcccgcg ttttcgcaga aacgtggctg gcctggttca ccacgcggga 5820
aacggtctga taagagacac cggcatactc tgcgacatcg tataacgtta ctggtttcac 5880
attcaccacc ctgaattgac tctcttccgg gcgctatcat gccataccgc gaaaggtttt 5940
gcgccattcg atggtgtccg ggatctcgac gctctccctt atgcgactcc tgcattagga 6000
agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca 6060
aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac 6120
aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat 6180
aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga 6240
ggatcga 6247
<210> 95
<211> 6226
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 95
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gctgaaacat 120
atttactcca accacattaa aggtaacaaa atcacagccc ctaaaccgtc aattcagggc 180
gtggtgatcc acaacgatta tggctcaatg accccttcac agtacctgcc ttggctgtac 240
gctcgcgaaa acaacggtac acatgtgaat ggctgggcct cagtgtatgc caatcgcaac 300
gaggtgctgt ggtatcatcc tacagactac gtggaatggc actgcggcaa ccaatgggcc 360
aacgccaacc tgatcggctt tgaagtttgc gaatcatatc ctggtcgcat ctcagacaaa 420
ctgtttctgg aaaacgagga agccacactg aaagtagctg ccgacgtgat gaaatcgtat 480
ggcctgcctg tgaatcgcaa cacagtgcgc ctgcacaacg aatttttcgg tacatcatgc 540
cctcatcgtt catgggacct gcacgtgggc aaaggcgagc cttataccac aacaaatatc 600
aataaaatga aagattattt cattaaacgg attaaacact actatgacgg tgagctccgt 660
cgtcgtcgcc gtcggcgtcg tcgttaagga tccggctgct aacaaagccc gaaaggaagc 720
tgagttggct gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg 780
ggtcttgagg ggttttttgc tgaaaggagg aactatatcc ggatatcccg caagaggccc 840
ggcagtaccg gcataaccaa gcctatgcct acagcatcca gggtgacggt gccgaggatg 900
acgatgagcg cattgttaga tttcatacac ggtgcctgac tgcgttagca atttaactgt 960
gataaactac cgcattaaag ctagcttatc gatgataagc tgtcaaacat gagaattaat 1020
tcttgaagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata 1080
atggtttctt agacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt 1140
ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 1200
cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 1260
cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta 1320
aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc 1380
ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa 1440
gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc 1500
cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 1560
acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 1620
gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac 1680
aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata 1740
ccaaacgacg agcgtgacac cacgatgcct gcagcaatgg caacaacgtt gcgcaaacta 1800
ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 1860
gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat 1920
aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt 1980
aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 2040
aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 2100
gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 2160
gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac 2220
tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 2280
gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 2340
caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 2400
actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 2460
acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 2520
cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 2580
gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 2640
cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 2700
gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg 2760
tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 2820
tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 2880
gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 2940
aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 3000
agcgagtcag tgagcgagga agcggaagag cgcctgatgc ggtattttct ccttacgcat 3060
ctgtgcggta tttcacaccg caatggtgca ctctcagtac aatctgctct gatgccgcat 3120
agttaagcca gtatacactc cgctatcgct acgtgactgg gtcatggctg cgccccgaca 3180
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 3240
acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 3300
acgcgcgagg cagctgcggt aaagctcatc agcgtggtcg tgaagcgatt cacagatgtc 3360
tgcctgttca tccgcgtcca gctcgttgag tttctccaga agcgttaatg tctggcttct 3420
gataaagcgg gccatgttaa gggcggtttt ttcctgtttg gtcactgatg cctccgtgta 3480
agggggattt ctgttcatgg gggtaatgat accgatgaaa cgagagagga tgctcacgat 3540
acgggttact gatgatgaac atgcccggtt actggaacgt tgtgagggta aacaactggc 3600
ggtatggatg cggcgggacc agagaaaaat cactcagggt caatgccagc gcttcgttaa 3660
tacagatgta ggtgttccac agggtagcca gcagcatcct gcgatgcaga tccggaacat 3720
aatggtgcag ggcgctgact tccgcgtttc cagactttac gaaacacgga aaccgaagac 3780
cattcatgtt gttgctcagg tcgcagacgt tttgcagcag cagtcgcttc acgttcgctc 3840
gcgtatcggt gattcattct gctaaccagt aaggcaaccc cgccagccta gccgggtcct 3900
caacgacagg agcacgatca tgcgcacccg tggccaggac ccaacgctgc ccgagatgcg 3960
ccgcgtgcgg ctgctggaga tggcggacgc gatggatatg ttctgccaag ggttggtttg 4020
cgcattcaca gttctccgca agaattgatt ggctccaatt cttggagtgg tgaatccgtt 4080
agcgaggtgc cgccggcttc cattcaggtc gaggtggccc ggctccatgc accgcgacgc 4140
aacgcgggga ggcagacaag gtatagggcg gcgcctacaa tccatgccaa cccgttccat 4200
gtgctcgccg aggcggcata aatcgccgtg acgatcagcg gtccaatgat cgaagttagg 4260
ctggtaagag ccgcgagcga tccttgaagc tgtccctgat ggtcgtcatc tacctgcctg 4320
gacagcatgg cctgcaacgc gggcatcccg atgccgccgg aagcgagaag aatcataatg 4380
gggaaggcca tccagcctcg cgtcgcgaac gccagcaaga cgtagcccag cgcgtcggcc 4440
gccatgccgg cgataatggc ctgcttctcg ccgaaacgtt tggtggcggg accagtgacg 4500
aaggcttgag cgagggcgtg caagattccg aataccgcaa gcgacaggcc gatcatcgtc 4560
gcgctccagc gaaagcggtc ctcgccgaaa atgacccaga gcgctgccgg cacctgtcct 4620
acgagttgca tgataaagaa gacagtcata agtgcggcga cgatagtcat gccccgcgcc 4680
caccggaagg agctgactgg gttgaaggct ctcaagggca tcggtcgaga tcccggtgcc 4740
taatgagtga gctaacttac attaattgcg ttgcgctcac tgcccgcttt ccagtcggga 4800
aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt 4860
attgggcgcc agggtggttt ttcttttcac cagtgagacg ggcaacagct gattgccctt 4920
caccgcctgg ccctgagaga gttgcagcaa gcggtccacg ctggtttgcc ccagcaggcg 4980
aaaatcctgt ttgatggtgg ttaacggcgg gatataacat gagctgtctt cggtatcgtc 5040
gtatcccact accgagatat ccgcaccaac gcgcagcccg gactcggtaa tggcgcgcat 5100
tgcgcccagc gccatctgat cgttggcaac cagcatcgca gtgggaacga tgccctcatt 5160
cagcatttgc atggtttgtt gaaaaccgga catggcactc cagtcgcctt cccgttccgc 5220
tatcggctga atttgattgc gagtgagata tttatgccag ccagccagac gcagacgcgc 5280
cgagacagaa cttaatgggc ccgctaacag cgcgatttgc tggtgaccca atgcgaccag 5340
atgctccacg cccagtcgcg taccgtcttc atgggagaaa ataatactgt tgatgggtgt 5400
ctggtcagag acatcaagaa ataacgccgg aacattagtg caggcagctt ccacagcaat 5460
ggcatcctgg tcatccagcg gatagttaat gatcagccca ctgacgcgtt gcgcgagaag 5520
attgtgcacc gccgctttac aggcttcgac gccgcttcgt tctaccatcg acaccaccac 5580
gctggcaccc agttgatcgg cgcgagattt aatcgccgcg acaatttgcg acggcgcgtg 5640
cagggccaga ctggaggtgg caacgccaat cagcaacgac tgtttgcccg ccagttgttg 5700
tgccacgcgg ttgggaatgt aattcagctc cgccatcgcc gcttccactt tttcccgcgt 5760
tttcgcagaa acgtggctgg cctggttcac cacgcgggaa acggtctgat aagagacacc 5820
ggcatactct gcgacatcgt ataacgttac tggtttcaca ttcaccaccc tgaattgact 5880
ctcttccggg cgctatcatg ccataccgcg aaaggttttg cgccattcga tggtgtccgg 5940
gatctcgacg ctctccctta tgcgactcct gcattaggaa gcagcccagt agtaggttga 6000
ggccgttgag caccgccgcc gcaaggaatg gtgcatgcaa ggagatggcg cccaacagtc 6060
ccccggccac ggggcctgcc accataccca cgccgaaaca agcgctcatg agcccgaagt 6120
ggcgagcccg atcttcccca tcggtgatgt cggcgatata ggcgccagca accgcacctg 6180
tggcgccggt gatgccggcc acgatgcgtc cggcgtagag gatcga 6226
<210> 96
<211> 6238
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 96
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gctgaaacat 120
atttactcca accacattaa aggtaacaaa atcacagccc ctaaaccgtc aattcagggc 180
gtggtgatcc acaacgatta tggctcaatg accccttcac agtacctgcc ttggctgtac 240
gctcgcgaaa acaacggtac acatgtgaat ggctgggcct cagtgtatgc caatcgcaac 300
gaggtgctgt ggtatcatcc tacagactac gtggaatggc actgcggcaa ccaatgggcc 360
aacgccaacc tgatcggctt tgaagtttgc gaatcatatc ctggtcgcat ctcagacaaa 420
ctgtttctgg aaaacgagga agccacactg aaagtagctg ccgacgtgat gaaatcgtat 480
ggcctgcctg tgaatcgcaa cacagtgcgc ctgcacaacg aatttttcgg tacatcatgc 540
cctcatcgtt catgggacct gcacgtgggc aaaggcgagc cttataccac aacaaatatc 600
aataaaatga aagattattt cattaaacgg attaaacact actatgacgg tgagctcggt 660
cgtaaaaaac gtcgtcagcg tcgtcgtccg cctcagtaag gatccggctg ctaacaaagc 720
ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg 780
ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggatatcc 840
cgcaagaggc ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg 900
gtgccgagga tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag 960
caatttaact gtgataaact accgcattaa agctagctta tcgatgataa gctgtcaaac 1020
atgagaatta attcttgaag acgaaagggc ctcgtgatac gcctattttt ataggttaat 1080
gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga 1140
acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa 1200
ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt 1260
gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg 1320
ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg 1380
gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg 1440
agcactttta aagttctgct atgtggcgcg gtattatccc gtgttgacgc cgggcaagag 1500
caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca 1560
gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg 1620
agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc 1680
gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg 1740
aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgcagcaat ggcaacaacg 1800
ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca attaatagac 1860
tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg 1920
tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg 1980
gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact 2040
atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa 2100
ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca tttttaattt 2160
aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag 2220
ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct 2280
ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt 2340
tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg 2400
cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct 2460
gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc 2520
gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg 2580
tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa 2640
ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg 2700
gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg 2760
ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga 2820
tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt 2880
ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct 2940
gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga 3000
acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcctgat gcggtatttt 3060
ctccttacgc atctgtgcgg tatttcacac cgcaatggtg cactctcagt acaatctgct 3120
ctgatgccgc atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc 3180
tgcgccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc 3240
atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc 3300
gtcatcaccg aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga 3360
ttcacagatg tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa 3420
tgtctggctt ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga 3480
tgcctccgtg taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag 3540
gatgctcacg atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg 3600
taaacaactg gcggtatgga tgcggcggga ccagagaaaa atcactcagg gtcaatgcca 3660
gcgcttcgtt aatacagatg taggtgttcc acagggtagc cagcagcatc ctgcgatgca 3720
gatccggaac ataatggtgc agggcgctga cttccgcgtt tccagacttt acgaaacacg 3780
gaaaccgaag accattcatg ttgttgctca ggtcgcagac gttttgcagc agcagtcgct 3840
tcacgttcgc tcgcgtatcg gtgattcatt ctgctaacca gtaaggcaac cccgccagcc 3900
tagccgggtc ctcaacgaca ggagcacgat catgcgcacc cgtggccagg acccaacgct 3960
gcccgagatg cgccgcgtgc ggctgctgga gatggcggac gcgatggata tgttctgcca 4020
agggttggtt tgcgcattca cagttctccg caagaattga ttggctccaa ttcttggagt 4080
ggtgaatccg ttagcgaggt gccgccggct tccattcagg tcgaggtggc ccggctccat 4140
gcaccgcgac gcaacgcggg gaggcagaca aggtataggg cggcgcctac aatccatgcc 4200
aacccgttcc atgtgctcgc cgaggcggca taaatcgccg tgacgatcag cggtccaatg 4260
atcgaagtta ggctggtaag agccgcgagc gatccttgaa gctgtccctg atggtcgtca 4320
tctacctgcc tggacagcat ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga 4380
agaatcataa tggggaaggc catccagcct cgcgtcgcga acgccagcaa gacgtagccc 4440
agcgcgtcgg ccgccatgcc ggcgataatg gcctgcttct cgccgaaacg tttggtggcg 4500
ggaccagtga cgaaggcttg agcgagggcg tgcaagattc cgaataccgc aagcgacagg 4560
ccgatcatcg tcgcgctcca gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc 4620
ggcacctgtc ctacgagttg catgataaag aagacagtca taagtgcggc gacgatagtc 4680
atgccccgcg cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga 4740
gatcccggtg cctaatgagt gagctaactt acattaattg cgttgcgctc actgcccgct 4800
ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga 4860
ggcggtttgc gtattgggcg ccagggtggt ttttcttttc accagtgaga cgggcaacag 4920
ctgattgccc ttcaccgcct ggccctgaga gagttgcagc aagcggtcca cgctggtttg 4980
ccccagcagg cgaaaatcct gtttgatggt ggttaacggc gggatataac atgagctgtc 5040
ttcggtatcg tcgtatccca ctaccgagat atccgcacca acgcgcagcc cggactcggt 5100
aatggcgcgc attgcgccca gcgccatctg atcgttggca accagcatcg cagtgggaac 5160
gatgccctca ttcagcattt gcatggtttg ttgaaaaccg gacatggcac tccagtcgcc 5220
ttcccgttcc gctatcggct gaatttgatt gcgagtgaga tatttatgcc agccagccag 5280
acgcagacgc gccgagacag aacttaatgg gcccgctaac agcgcgattt gctggtgacc 5340
caatgcgacc agatgctcca cgcccagtcg cgtaccgtct tcatgggaga aaataatact 5400
gttgatgggt gtctggtcag agacatcaag aaataacgcc ggaacattag tgcaggcagc 5460
ttccacagca atggcatcct ggtcatccag cggatagtta atgatcagcc cactgacgcg 5520
ttgcgcgaga agattgtgca ccgccgcttt acaggcttcg acgccgcttc gttctaccat 5580
cgacaccacc acgctggcac ccagttgatc ggcgcgagat ttaatcgccg cgacaatttg 5640
cgacggcgcg tgcagggcca gactggaggt ggcaacgcca atcagcaacg actgtttgcc 5700
cgccagttgt tgtgccacgc ggttgggaat gtaattcagc tccgccatcg ccgcttccac 5760
tttttcccgc gttttcgcag aaacgtggct ggcctggttc accacgcggg aaacggtctg 5820
ataagagaca ccggcatact ctgcgacatc gtataacgtt actggtttca cattcaccac 5880
cctgaattga ctctcttccg ggcgctatca tgccataccg cgaaaggttt tgcgccattc 5940
gatggtgtcc gggatctcga cgctctccct tatgcgactc ctgcattagg aagcagccca 6000
gtagtaggtt gaggccgttg agcaccgccg ccgcaaggaa tggtgcatgc aaggagatgg 6060
cgcccaacag tcccccggcc acggggcctg ccaccatacc cacgccgaaa caagcgctca 6120
tgagcccgaa gtggcgagcc cgatcttccc catcggtgat gtcggcgata taggcgccag 6180
caaccgcacc tgtggcgccg gtgatgccgg ccacgatgcg tccggcgtag aggatcga 6238
<210> 97
<211> 6571
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 97
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gtccattatc 120
atggaagtgg ccacaatgca agccaaactg acaaaaaatg agttcattga gtggctgaaa 180
acgtccgagg gtaaacagtt caacgtggac ctgtggtacg gttttcagtg tttcgactac 240
gccaacgctg gctggaaagt gctgttcggc ctgctgctga aaggcctggg agccaaagac 300
atcccttttg caaacaattt cgatggcctg gccacagttt atcaaaacac ccctgacttt 360
ctggcccaac caggcgacat ggtggtgttt ggttctaatt atggcgcagg ctatggccac 420
gtagcctggg tgatcgaagc cacactggac tacattattg tttatgagca aaactggctg 480
ggaggcggat ggacagacgg catcgaacag cctggctggg gctgggagaa agtgacacgc 540
cgtcaacatg cctatgactt ccctatgtgg ttcatccgtc ctaatttcaa aggtggtaaa 600
ctggaagtta gcaaagcagc aaccattaaa cagtccgatg ttaaacaaga agtgaaaaaa 660
caagaggcca aacaaattgt gaaagccacc gattggaaac agaacaaaga tggcatttgg 720
tataaagcag aacatgccag ctttaccgtt accgcaccgg aaggcattat tacccgttat 780
aaaggtccgt ggaccggtca tccgcaggca ggcgtactgc agaaaggtca gaccattaaa 840
tacgatgaag tgcagaaatt tgatggccat gtttgggtta gctgggaaac ctttgaaggt 900
gaaaccgttt atatgccggt tcgtacctgg gatgcaaaaa ccggtaaagt gggcaaactg 960
tggggtgaaa tcaaagagct ccgccagatc aaaatttggt ttcagaatcg tcgcatgaaa 1020
tggaaaaaat aaggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc 1080
caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt 1140
tttgctgaaa ggaggaacta tatccggata tcccgcaaga ggcccggcag taccggcata 1200
accaagccta tgcctacagc atccagggtg acggtgccga ggatgacgat gagcgcattg 1260
ttagatttca tacacggtgc ctgactgcgt tagcaattta actgtgataa actaccgcat 1320
taaagctagc ttatcgatga taagctgtca aacatgagaa ttaattcttg aagacgaaag 1380
ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg 1440
tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata 1500
cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga 1560
aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca 1620
ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat 1680
cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag 1740
agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc 1800
gcggtattat cccgtgttga cgccgggcaa gagcaactcg gtcgccgcat acactattct 1860
cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca 1920
gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt 1980
ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat 2040
gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt 2100
gacaccacga tgcctgcagc aatggcaaca acgttgcgca aactattaac tggcgaacta 2160
cttactctag cttcccggca acaattaata gactggatgg aggcggataa agttgcagga 2220
ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt 2280
gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc 2340
gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct 2400
gagataggtg cctcactgat taagcattgg taactgtcag accaagttta ctcatatata 2460
ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt 2520
gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc 2580
gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg 2640
caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact 2700
ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg 2760
tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg 2820
ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac 2880
tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca 2940
cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga 3000
gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc 3060
ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct 3120
gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg 3180
agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct 3240
tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc 3300
tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc 3360
gaggaagcgg aagagcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca 3420
caccgcaatg gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagtata 3480
cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc caacacccgc 3540
tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 3600
ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgaggcagct 3660
gcggtaaagc tcatcagcgt ggtcgtgaag cgattcacag atgtctgcct gttcatccgc 3720
gtccagctcg ttgagtttct ccagaagcgt taatgtctgg cttctgataa agcgggccat 3780
gttaagggcg gttttttcct gtttggtcac tgatgcctcc gtgtaagggg gatttctgtt 3840
catgggggta atgataccga tgaaacgaga gaggatgctc acgatacggg ttactgatga 3900
tgaacatgcc cggttactgg aacgttgtga gggtaaacaa ctggcggtat ggatgcggcg 3960
ggaccagaga aaaatcactc agggtcaatg ccagcgcttc gttaatacag atgtaggtgt 4020
tccacagggt agccagcagc atcctgcgat gcagatccgg aacataatgg tgcagggcgc 4080
tgacttccgc gtttccagac tttacgaaac acggaaaccg aagaccattc atgttgttgc 4140
tcaggtcgca gacgttttgc agcagcagtc gcttcacgtt cgctcgcgta tcggtgattc 4200
attctgctaa ccagtaaggc aaccccgcca gcctagccgg gtcctcaacg acaggagcac 4260
gatcatgcgc acccgtggcc aggacccaac gctgcccgag atgcgccgcg tgcggctgct 4320
ggagatggcg gacgcgatgg atatgttctg ccaagggttg gtttgcgcat tcacagttct 4380
ccgcaagaat tgattggctc caattcttgg agtggtgaat ccgttagcga ggtgccgccg 4440
gcttccattc aggtcgaggt ggcccggctc catgcaccgc gacgcaacgc ggggaggcag 4500
acaaggtata gggcggcgcc tacaatccat gccaacccgt tccatgtgct cgccgaggcg 4560
gcataaatcg ccgtgacgat cagcggtcca atgatcgaag ttaggctggt aagagccgcg 4620
agcgatcctt gaagctgtcc ctgatggtcg tcatctacct gcctggacag catggcctgc 4680
aacgcgggca tcccgatgcc gccggaagcg agaagaatca taatggggaa ggccatccag 4740
cctcgcgtcg cgaacgccag caagacgtag cccagcgcgt cggccgccat gccggcgata 4800
atggcctgct tctcgccgaa acgtttggtg gcgggaccag tgacgaaggc ttgagcgagg 4860
gcgtgcaaga ttccgaatac cgcaagcgac aggccgatca tcgtcgcgct ccagcgaaag 4920
cggtcctcgc cgaaaatgac ccagagcgct gccggcacct gtcctacgag ttgcatgata 4980
aagaagacag tcataagtgc ggcgacgata gtcatgcccc gcgcccaccg gaaggagctg 5040
actgggttga aggctctcaa gggcatcggt cgagatcccg gtgcctaatg agtgagctaa 5100
cttacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 5160
ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgccagggt 5220
ggtttttctt ttcaccagtg agacgggcaa cagctgattg cccttcaccg cctggccctg 5280
agagagttgc agcaagcggt ccacgctggt ttgccccagc aggcgaaaat cctgtttgat 5340
ggtggttaac ggcgggatat aacatgagct gtcttcggta tcgtcgtatc ccactaccga 5400
gatatccgca ccaacgcgca gcccggactc ggtaatggcg cgcattgcgc ccagcgccat 5460
ctgatcgttg gcaaccagca tcgcagtggg aacgatgccc tcattcagca tttgcatggt 5520
ttgttgaaaa ccggacatgg cactccagtc gccttcccgt tccgctatcg gctgaatttg 5580
attgcgagtg agatatttat gccagccagc cagacgcaga cgcgccgaga cagaacttaa 5640
tgggcccgct aacagcgcga tttgctggtg acccaatgcg accagatgct ccacgcccag 5700
tcgcgtaccg tcttcatggg agaaaataat actgttgatg ggtgtctggt cagagacatc 5760
aagaaataac gccggaacat tagtgcaggc agcttccaca gcaatggcat cctggtcatc 5820
cagcggatag ttaatgatca gcccactgac gcgttgcgcg agaagattgt gcaccgccgc 5880
tttacaggct tcgacgccgc ttcgttctac catcgacacc accacgctgg cacccagttg 5940
atcggcgcga gatttaatcg ccgcgacaat ttgcgacggc gcgtgcaggg ccagactgga 6000
ggtggcaacg ccaatcagca acgactgttt gcccgccagt tgttgtgcca cgcggttggg 6060
aatgtaattc agctccgcca tcgccgcttc cactttttcc cgcgttttcg cagaaacgtg 6120
gctggcctgg ttcaccacgc gggaaacggt ctgataagag acaccggcat actctgcgac 6180
atcgtataac gttactggtt tcacattcac caccctgaat tgactctctt ccgggcgcta 6240
tcatgccata ccgcgaaagg ttttgcgcca ttcgatggtg tccgggatct cgacgctctc 6300
ccttatgcga ctcctgcatt aggaagcagc ccagtagtag gttgaggccg ttgagcaccg 6360
ccgccgcaag gaatggtgca tgcaaggaga tggcgcccaa cagtcccccg gccacggggc 6420
ctgccaccat acccacgccg aaacaagcgc tcatgagccc gaagtggcga gcccgatctt 6480
ccccatcggt gatgtcggcg atataggcgc cagcaaccgc acctgtggcg ccggtgatgc 6540
cggccacgat gcgtccggcg tagaggatcg a 6571
<210> 98
<211> 6550
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 98
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gtccattatc 120
atggaagtgg ccacaatgca agccaaactg acaaaaaatg agttcattga gtggctgaaa 180
acgtccgagg gtaaacagtt caacgtggac ctgtggtacg gttttcagtg tttcgactac 240
gccaacgctg gctggaaagt gctgttcggc ctgctgctga aaggcctggg agccaaagac 300
atcccttttg caaacaattt cgatggcctg gccacagttt atcaaaacac ccctgacttt 360
ctggcccaac caggcgacat ggtggtgttt ggttctaatt atggcgcagg ctatggccac 420
gtagcctggg tgatcgaagc cacactggac tacattattg tttatgagca aaactggctg 480
ggaggcggat ggacagacgg catcgaacag cctggctggg gctgggagaa agtgacacgc 540
cgtcaacatg cctatgactt ccctatgtgg ttcatccgtc ctaatttcaa aggtggtaaa 600
ctggaagtta gcaaagcagc aaccattaaa cagtccgatg ttaaacaaga agtgaaaaaa 660
caagaggcca aacaaattgt gaaagccacc gattggaaac agaacaaaga tggcatttgg 720
tataaagcag aacatgccag ctttaccgtt accgcaccgg aaggcattat tacccgttat 780
aaaggtccgt ggaccggtca tccgcaggca ggcgtactgc agaaaggtca gaccattaaa 840
tacgatgaag tgcagaaatt tgatggccat gtttgggtta gctgggaaac ctttgaaggt 900
gaaaccgttt atatgccggt tcgtacctgg gatgcaaaaa ccggtaaagt gggcaaactg 960
tggggtgaaa tcaaagagct ccgtcgtcgt cgccgtcggc gtcgtcgtta aggatccggc 1020
tgctaacaaa gcccgaaagg aagctgagtt ggctgctgcc accgctgagc aataactagc 1080
ataacccctt ggggcctcta aacgggtctt gaggggtttt ttgctgaaag gaggaactat 1140
atccggatat cccgcaagag gcccggcagt accggcataa ccaagcctat gcctacagca 1200
tccagggtga cggtgccgag gatgacgatg agcgcattgt tagatttcat acacggtgcc 1260
tgactgcgtt agcaatttaa ctgtgataaa ctaccgcatt aaagctagct tatcgatgat 1320
aagctgtcaa acatgagaat taattcttga agacgaaagg gcctcgtgat acgcctattt 1380
ttataggtta atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga 1440
aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc 1500
atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt 1560
caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct 1620
cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt 1680
tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt 1740
tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtgttgac 1800
gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac 1860
tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct 1920
gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg 1980
aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg 2040
gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgcagca 2100
atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa 2160
caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt 2220
ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc 2280
attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg 2340
agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt 2400
aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt 2460
catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc 2520
ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct 2580
tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta 2640
ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc 2700
ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac 2760
ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct 2820
gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat 2880
aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg 2940
acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa 3000
gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg 3060
gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga 3120
cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc 3180
aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct 3240
gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct 3300
cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgcctg 3360
atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcaatgg tgcactctca 3420
gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga 3480
ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg 3540
tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca 3600
gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg 3660
gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc 3720
cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg 3780
tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat 3840
gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga 3900
acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca 3960
gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca 4020
tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact 4080
ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca 4140
gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca 4200
accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca 4260
ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga 4320
tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc 4380
aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg 4440
gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct 4500
acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc 4560
agcggtccaa tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc 4620
tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg 4680
ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgc gaacgccagc 4740
aagacgtagc ccagcgcgtc ggccgccatg ccggcgataa tggcctgctt ctcgccgaaa 4800
cgtttggtgg cgggaccagt gacgaaggct tgagcgaggg cgtgcaagat tccgaatacc 4860
gcaagcgaca ggccgatcat cgtcgcgctc cagcgaaagc ggtcctcgcc gaaaatgacc 4920
cagagcgctg ccggcacctg tcctacgagt tgcatgataa agaagacagt cataagtgcg 4980
gcgacgatag tcatgccccg cgcccaccgg aaggagctga ctgggttgaa ggctctcaag 5040
ggcatcggtc gagatcccgg tgcctaatga gtgagctaac ttacattaat tgcgttgcgc 5100
tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa 5160
cgcgcgggga gaggcggttt gcgtattggg cgccagggtg gtttttcttt tcaccagtga 5220
gacgggcaac agctgattgc ccttcaccgc ctggccctga gagagttgca gcaagcggtc 5280
cacgctggtt tgccccagca ggcgaaaatc ctgtttgatg gtggttaacg gcgggatata 5340
acatgagctg tcttcggtat cgtcgtatcc cactaccgag atatccgcac caacgcgcag 5400
cccggactcg gtaatggcgc gcattgcgcc cagcgccatc tgatcgttgg caaccagcat 5460
cgcagtggga acgatgccct cattcagcat ttgcatggtt tgttgaaaac cggacatggc 5520
actccagtcg ccttcccgtt ccgctatcgg ctgaatttga ttgcgagtga gatatttatg 5580
ccagccagcc agacgcagac gcgccgagac agaacttaat gggcccgcta acagcgcgat 5640
ttgctggtga cccaatgcga ccagatgctc cacgcccagt cgcgtaccgt cttcatggga 5700
gaaaataata ctgttgatgg gtgtctggtc agagacatca agaaataacg ccggaacatt 5760
agtgcaggca gcttccacag caatggcatc ctggtcatcc agcggatagt taatgatcag 5820
cccactgacg cgttgcgcga gaagattgtg caccgccgct ttacaggctt cgacgccgct 5880
tcgttctacc atcgacacca ccacgctggc acccagttga tcggcgcgag atttaatcgc 5940
cgcgacaatt tgcgacggcg cgtgcagggc cagactggag gtggcaacgc caatcagcaa 6000
cgactgtttg cccgccagtt gttgtgccac gcggttggga atgtaattca gctccgccat 6060
cgccgcttcc actttttccc gcgttttcgc agaaacgtgg ctggcctggt tcaccacgcg 6120
ggaaacggtc tgataagaga caccggcata ctctgcgaca tcgtataacg ttactggttt 6180
cacattcacc accctgaatt gactctcttc cgggcgctat catgccatac cgcgaaaggt 6240
tttgcgccat tcgatggtgt ccgggatctc gacgctctcc cttatgcgac tcctgcatta 6300
ggaagcagcc cagtagtagg ttgaggccgt tgagcaccgc cgccgcaagg aatggtgcat 6360
gcaaggagat ggcgcccaac agtcccccgg ccacggggcc tgccaccata cccacgccga 6420
aacaagcgct catgagcccg aagtggcgag cccgatcttc cccatcggtg atgtcggcga 6480
tataggcgcc agcaaccgca cctgtggcgc cggtgatgcc ggccacgatg cgtccggcgt 6540
agaggatcga 6550
<210> 99
<211> 6562
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 99
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gtccattatc 120
atggaagtgg ccacaatgca agccaaactg acaaaaaatg agttcattga gtggctgaaa 180
acgtccgagg gtaaacagtt caacgtggac ctgtggtacg gttttcagtg tttcgactac 240
gccaacgctg gctggaaagt gctgttcggc ctgctgctga aaggcctggg agccaaagac 300
atcccttttg caaacaattt cgatggcctg gccacagttt atcaaaacac ccctgacttt 360
ctggcccaac caggcgacat ggtggtgttt ggttctaatt atggcgcagg ctatggccac 420
gtagcctggg tgatcgaagc cacactggac tacattattg tttatgagca aaactggctg 480
ggaggcggat ggacagacgg catcgaacag cctggctggg gctgggagaa agtgacacgc 540
cgtcaacatg cctatgactt ccctatgtgg ttcatccgtc ctaatttcaa aggtggtaaa 600
ctggaagtta gcaaagcagc aaccattaaa cagtccgatg ttaaacaaga agtgaaaaaa 660
caagaggcca aacaaattgt gaaagccacc gattggaaac agaacaaaga tggcatttgg 720
tataaagcag aacatgccag ctttaccgtt accgcaccgg aaggcattat tacccgttat 780
aaaggtccgt ggaccggtca tccgcaggca ggcgtactgc agaaaggtca gaccattaaa 840
tacgatgaag tgcagaaatt tgatggccat gtttgggtta gctgggaaac ctttgaaggt 900
gaaaccgttt atatgccggt tcgtacctgg gatgcaaaaa ccggtaaagt gggcaaactg 960
tggggtgaaa tcaaagagct cggtcgtaaa aaacgtcgtc agcgtcgtcg tccgcctcag 1020
taaggatccg gctgctaaca aagcccgaaa ggaagctgag ttggctgctg ccaccgctga 1080
gcaataacta gcataacccc ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa 1140
aggaggaact atatccggat atcccgcaag aggcccggca gtaccggcat aaccaagcct 1200
atgcctacag catccagggt gacggtgccg aggatgacga tgagcgcatt gttagatttc 1260
atacacggtg cctgactgcg ttagcaattt aactgtgata aactaccgca ttaaagctag 1320
cttatcgatg ataagctgtc aaacatgaga attaattctt gaagacgaaa gggcctcgtg 1380
atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc 1440
acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat 1500
atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag 1560
agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt 1620
cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt 1680
gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga gagttttcgc 1740
cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta 1800
tcccgtgttg acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac 1860
ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa 1920
ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg 1980
atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 2040
cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg 2100
atgcctgcag caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta 2160
gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg accacttctg 2220
cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg 2280
tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc 2340
tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt 2400
gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt 2460
gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc 2520
atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag 2580
atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa 2640
aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg 2700
aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag 2760
ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg 2820
ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga 2880
tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc 2940
ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc 3000
acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga 3060
gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt 3120
cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg 3180
aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac 3240
atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga 3300
gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg 3360
gaagagcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcaat 3420
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat acactccgct 3480
atcgctacgt gactgggtca tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc 3540
ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag 3600
ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgaggcagc tgcggtaaag 3660
ctcatcagcg tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg cgtccagctc 3720
gttgagtttc tccagaagcg ttaatgtctg gcttctgata aagcgggcca tgttaagggc 3780
ggttttttcc tgtttggtca ctgatgcctc cgtgtaaggg ggatttctgt tcatgggggt 3840
aatgataccg atgaaacgag agaggatgct cacgatacgg gttactgatg atgaacatgc 3900
ccggttactg gaacgttgtg agggtaaaca actggcggta tggatgcggc gggaccagag 3960
aaaaatcact cagggtcaat gccagcgctt cgttaataca gatgtaggtg ttccacaggg 4020
tagccagcag catcctgcga tgcagatccg gaacataatg gtgcagggcg ctgacttccg 4080
cgtttccaga ctttacgaaa cacggaaacc gaagaccatt catgttgttg ctcaggtcgc 4140
agacgttttg cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt cattctgcta 4200
accagtaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca cgatcatgcg 4260
cacccgtggc caggacccaa cgctgcccga gatgcgccgc gtgcggctgc tggagatggc 4320
ggacgcgatg gatatgttct gccaagggtt ggtttgcgca ttcacagttc tccgcaagaa 4380
ttgattggct ccaattcttg gagtggtgaa tccgttagcg aggtgccgcc ggcttccatt 4440
caggtcgagg tggcccggct ccatgcaccg cgacgcaacg cggggaggca gacaaggtat 4500
agggcggcgc ctacaatcca tgccaacccg ttccatgtgc tcgccgaggc ggcataaatc 4560
gccgtgacga tcagcggtcc aatgatcgaa gttaggctgg taagagccgc gagcgatcct 4620
tgaagctgtc cctgatggtc gtcatctacc tgcctggaca gcatggcctg caacgcgggc 4680
atcccgatgc cgccggaagc gagaagaatc ataatgggga aggccatcca gcctcgcgtc 4740
gcgaacgcca gcaagacgta gcccagcgcg tcggccgcca tgccggcgat aatggcctgc 4800
ttctcgccga aacgtttggt ggcgggacca gtgacgaagg cttgagcgag ggcgtgcaag 4860
attccgaata ccgcaagcga caggccgatc atcgtcgcgc tccagcgaaa gcggtcctcg 4920
ccgaaaatga cccagagcgc tgccggcacc tgtcctacga gttgcatgat aaagaagaca 4980
gtcataagtg cggcgacgat agtcatgccc cgcgcccacc ggaaggagct gactgggttg 5040
aaggctctca agggcatcgg tcgagatccc ggtgcctaat gagtgagcta acttacatta 5100
attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 5160
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgccaggg tggtttttct 5220
tttcaccagt gagacgggca acagctgatt gcccttcacc gcctggccct gagagagttg 5280
cagcaagcgg tccacgctgg tttgccccag caggcgaaaa tcctgtttga tggtggttaa 5340
cggcgggata taacatgagc tgtcttcggt atcgtcgtat cccactaccg agatatccgc 5400
accaacgcgc agcccggact cggtaatggc gcgcattgcg cccagcgcca tctgatcgtt 5460
ggcaaccagc atcgcagtgg gaacgatgcc ctcattcagc atttgcatgg tttgttgaaa 5520
accggacatg gcactccagt cgccttcccg ttccgctatc ggctgaattt gattgcgagt 5580
gagatattta tgccagccag ccagacgcag acgcgccgag acagaactta atgggcccgc 5640
taacagcgcg atttgctggt gacccaatgc gaccagatgc tccacgccca gtcgcgtacc 5700
gtcttcatgg gagaaaataa tactgttgat gggtgtctgg tcagagacat caagaaataa 5760
cgccggaaca ttagtgcagg cagcttccac agcaatggca tcctggtcat ccagcggata 5820
gttaatgatc agcccactga cgcgttgcgc gagaagattg tgcaccgccg ctttacaggc 5880
ttcgacgccg cttcgttcta ccatcgacac caccacgctg gcacccagtt gatcggcgcg 5940
agatttaatc gccgcgacaa tttgcgacgg cgcgtgcagg gccagactgg aggtggcaac 6000
gccaatcagc aacgactgtt tgcccgccag ttgttgtgcc acgcggttgg gaatgtaatt 6060
cagctccgcc atcgccgctt ccactttttc ccgcgttttc gcagaaacgt ggctggcctg 6120
gttcaccacg cgggaaacgg tctgataaga gacaccggca tactctgcga catcgtataa 6180
cgttactggt ttcacattca ccaccctgaa ttgactctct tccgggcgct atcatgccat 6240
accgcgaaag gttttgcgcc attcgatggt gtccgggatc tcgacgctct cccttatgcg 6300
actcctgcat taggaagcag cccagtagta ggttgaggcc gttgagcacc gccgccgcaa 6360
ggaatggtgc atgcaaggag atggcgccca acagtccccc ggccacgggg cctgccacca 6420
tacccacgcc gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg 6480
tgatgtcggc gatataggcg ccagcaaccg cacctgtggc gccggtgatg ccggccacga 6540
tgcgtccggc gtagaggatc ga 6562
<210> 100
<211> 6187
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 100
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gtccattatc 120
atggaagtgg ccacaatgca agccaaactg acaaaaaatg agttcattga gtggctgaaa 180
acgtccgagg gtaaacagtt caacgtggac ctgtggtacg gttttcagtg tttcgactac 240
gccaacgctg gctggaaagt gctgttcggc ctgctgctga aaggcctggg agccaaagac 300
atcccttttg caaacaattt cgatggcctg gccacagttt atcaaaacac ccctgacttt 360
ctggcccaac caggcgacat ggtggtgttt ggttctaatt atggcgcagg ctatggccac 420
gtagcctggg tgatcgaagc cacactggac tacattattg tttatgagca aaactggctg 480
ggaggcggat ggacagacgg catcgaacag cctggctggg gctgggagaa agtgacacgc 540
cgtcaacatg cctatgactt ccctatgtgg ttcatccgtc ctaatttcaa agagctccgc 600
cagatcaaaa tttggtttca gaatcgtcgc atgaaatgga aaaaataagg atccggctgc 660
taacaaagcc cgaaaggaag ctgagttggc tgctgccacc gctgagcaat aactagcata 720
accccttggg gcctctaaac gggtcttgag gggttttttg ctgaaaggag gaactatatc 780
cggatatccc gcaagaggcc cggcagtacc ggcataacca agcctatgcc tacagcatcc 840
agggtgacgg tgccgaggat gacgatgagc gcattgttag atttcataca cggtgcctga 900
ctgcgttagc aatttaactg tgataaacta ccgcattaaa gctagcttat cgatgataag 960
ctgtcaaaca tgagaattaa ttcttgaaga cgaaagggcc tcgtgatacg cctattttta 1020
taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat 1080
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 1140
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 1200
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 1260
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 1320
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 1380
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc 1440
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 1500
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 1560
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 1620
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 1680
ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg 1740
gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 1800
ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 1860
gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt 1920
gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 1980
caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 2040
cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 2100
ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 2160
taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 2220
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 2280
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 2340
agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 2400
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 2460
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 2520
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 2580
tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 2640
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 2700
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 2760
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 2820
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg 2880
ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc 2940
cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg 3000
cggtattttc tccttacgca tctgtgcggt atttcacacc gcaatggtgc actctcagta 3060
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 3120
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 3180
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 3240
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 3300
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 3360
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 3420
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 3480
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 3540
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 3600
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 3660
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3720
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3780
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3840
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggccagga 3900
cccaacgctg cccgagatgc gccgcgtgcg gctgctggag atggcggacg cgatggatat 3960
gttctgccaa gggttggttt gcgcattcac agttctccgc aagaattgat tggctccaat 4020
tcttggagtg gtgaatccgt tagcgaggtg ccgccggctt ccattcaggt cgaggtggcc 4080
cggctccatg caccgcgacg caacgcgggg aggcagacaa ggtatagggc ggcgcctaca 4140
atccatgcca acccgttcca tgtgctcgcc gaggcggcat aaatcgccgt gacgatcagc 4200
ggtccaatga tcgaagttag gctggtaaga gccgcgagcg atccttgaag ctgtccctga 4260
tggtcgtcat ctacctgcct ggacagcatg gcctgcaacg cgggcatccc gatgccgccg 4320
gaagcgagaa gaatcataat ggggaaggcc atccagcctc gcgtcgcgaa cgccagcaag 4380
acgtagccca gcgcgtcggc cgccatgccg gcgataatgg cctgcttctc gccgaaacgt 4440
ttggtggcgg gaccagtgac gaaggcttga gcgagggcgt gcaagattcc gaataccgca 4500
agcgacaggc cgatcatcgt cgcgctccag cgaaagcggt cctcgccgaa aatgacccag 4560
agcgctgccg gcacctgtcc tacgagttgc atgataaaga agacagtcat aagtgcggcg 4620
acgatagtca tgccccgcgc ccaccggaag gagctgactg ggttgaaggc tctcaagggc 4680
atcggtcgag atcccggtgc ctaatgagtg agctaactta cattaattgc gttgcgctca 4740
ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc 4800
gcggggagag gcggtttgcg tattgggcgc cagggtggtt tttcttttca ccagtgagac 4860
gggcaacagc tgattgccct tcaccgcctg gccctgagag agttgcagca agcggtccac 4920
gctggtttgc cccagcaggc gaaaatcctg tttgatggtg gttaacggcg ggatataaca 4980
tgagctgtct tcggtatcgt cgtatcccac taccgagata tccgcaccaa cgcgcagccc 5040
ggactcggta atggcgcgca ttgcgcccag cgccatctga tcgttggcaa ccagcatcgc 5100
agtgggaacg atgccctcat tcagcatttg catggtttgt tgaaaaccgg acatggcact 5160
ccagtcgcct tcccgttccg ctatcggctg aatttgattg cgagtgagat atttatgcca 5220
gccagccaga cgcagacgcg ccgagacaga acttaatggg cccgctaaca gcgcgatttg 5280
ctggtgaccc aatgcgacca gatgctccac gcccagtcgc gtaccgtctt catgggagaa 5340
aataatactg ttgatgggtg tctggtcaga gacatcaaga aataacgccg gaacattagt 5400
gcaggcagct tccacagcaa tggcatcctg gtcatccagc ggatagttaa tgatcagccc 5460
actgacgcgt tgcgcgagaa gattgtgcac cgccgcttta caggcttcga cgccgcttcg 5520
ttctaccatc gacaccacca cgctggcacc cagttgatcg gcgcgagatt taatcgccgc 5580
gacaatttgc gacggcgcgt gcagggccag actggaggtg gcaacgccaa tcagcaacga 5640
ctgtttgccc gccagttgtt gtgccacgcg gttgggaatg taattcagct ccgccatcgc 5700
cgcttccact ttttcccgcg ttttcgcaga aacgtggctg gcctggttca ccacgcggga 5760
aacggtctga taagagacac cggcatactc tgcgacatcg tataacgtta ctggtttcac 5820
attcaccacc ctgaattgac tctcttccgg gcgctatcat gccataccgc gaaaggtttt 5880
gcgccattcg atggtgtccg ggatctcgac gctctccctt atgcgactcc tgcattagga 5940
agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca 6000
aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac 6060
aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat 6120
aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga 6180
ggatcga 6187
<210> 101
<211> 6166
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 101
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gtccattatc 120
atggaagtgg ccacaatgca agccaaactg acaaaaaatg agttcattga gtggctgaaa 180
acgtccgagg gtaaacagtt caacgtggac ctgtggtacg gttttcagtg tttcgactac 240
gccaacgctg gctggaaagt gctgttcggc ctgctgctga aaggcctggg agccaaagac 300
atcccttttg caaacaattt cgatggcctg gccacagttt atcaaaacac ccctgacttt 360
ctggcccaac caggcgacat ggtggtgttt ggttctaatt atggcgcagg ctatggccac 420
gtagcctggg tgatcgaagc cacactggac tacattattg tttatgagca aaactggctg 480
ggaggcggat ggacagacgg catcgaacag cctggctggg gctgggagaa agtgacacgc 540
cgtcaacatg cctatgactt ccctatgtgg ttcatccgtc ctaatttcaa agagctccgt 600
cgtcgtcgcc gtcggcgtcg tcgttaagga tccggctgct aacaaagccc gaaaggaagc 660
tgagttggct gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg 720
ggtcttgagg ggttttttgc tgaaaggagg aactatatcc ggatatcccg caagaggccc 780
ggcagtaccg gcataaccaa gcctatgcct acagcatcca gggtgacggt gccgaggatg 840
acgatgagcg cattgttaga tttcatacac ggtgcctgac tgcgttagca atttaactgt 900
gataaactac cgcattaaag ctagcttatc gatgataagc tgtcaaacat gagaattaat 960
tcttgaagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata 1020
atggtttctt agacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt 1080
ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg 1140
cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt 1200
cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta 1260
aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc 1320
ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa 1380
gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc 1440
cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt 1500
acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact 1560
gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac 1620
aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata 1680
ccaaacgacg agcgtgacac cacgatgcct gcagcaatgg caacaacgtt gcgcaaacta 1740
ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg 1800
gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat 1860
aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt 1920
aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga 1980
aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa 2040
gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag 2100
gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac 2160
tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 2220
gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 2280
caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 2340
actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 2400
acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt 2460
cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg 2520
gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 2580
cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 2640
gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg 2700
tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc 2760
tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 2820
gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 2880
aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc 2940
agcgagtcag tgagcgagga agcggaagag cgcctgatgc ggtattttct ccttacgcat 3000
ctgtgcggta tttcacaccg caatggtgca ctctcagtac aatctgctct gatgccgcat 3060
agttaagcca gtatacactc cgctatcgct acgtgactgg gtcatggctg cgccccgaca 3120
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 3180
acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 3240
acgcgcgagg cagctgcggt aaagctcatc agcgtggtcg tgaagcgatt cacagatgtc 3300
tgcctgttca tccgcgtcca gctcgttgag tttctccaga agcgttaatg tctggcttct 3360
gataaagcgg gccatgttaa gggcggtttt ttcctgtttg gtcactgatg cctccgtgta 3420
agggggattt ctgttcatgg gggtaatgat accgatgaaa cgagagagga tgctcacgat 3480
acgggttact gatgatgaac atgcccggtt actggaacgt tgtgagggta aacaactggc 3540
ggtatggatg cggcgggacc agagaaaaat cactcagggt caatgccagc gcttcgttaa 3600
tacagatgta ggtgttccac agggtagcca gcagcatcct gcgatgcaga tccggaacat 3660
aatggtgcag ggcgctgact tccgcgtttc cagactttac gaaacacgga aaccgaagac 3720
cattcatgtt gttgctcagg tcgcagacgt tttgcagcag cagtcgcttc acgttcgctc 3780
gcgtatcggt gattcattct gctaaccagt aaggcaaccc cgccagccta gccgggtcct 3840
caacgacagg agcacgatca tgcgcacccg tggccaggac ccaacgctgc ccgagatgcg 3900
ccgcgtgcgg ctgctggaga tggcggacgc gatggatatg ttctgccaag ggttggtttg 3960
cgcattcaca gttctccgca agaattgatt ggctccaatt cttggagtgg tgaatccgtt 4020
agcgaggtgc cgccggcttc cattcaggtc gaggtggccc ggctccatgc accgcgacgc 4080
aacgcgggga ggcagacaag gtatagggcg gcgcctacaa tccatgccaa cccgttccat 4140
gtgctcgccg aggcggcata aatcgccgtg acgatcagcg gtccaatgat cgaagttagg 4200
ctggtaagag ccgcgagcga tccttgaagc tgtccctgat ggtcgtcatc tacctgcctg 4260
gacagcatgg cctgcaacgc gggcatcccg atgccgccgg aagcgagaag aatcataatg 4320
gggaaggcca tccagcctcg cgtcgcgaac gccagcaaga cgtagcccag cgcgtcggcc 4380
gccatgccgg cgataatggc ctgcttctcg ccgaaacgtt tggtggcggg accagtgacg 4440
aaggcttgag cgagggcgtg caagattccg aataccgcaa gcgacaggcc gatcatcgtc 4500
gcgctccagc gaaagcggtc ctcgccgaaa atgacccaga gcgctgccgg cacctgtcct 4560
acgagttgca tgataaagaa gacagtcata agtgcggcga cgatagtcat gccccgcgcc 4620
caccggaagg agctgactgg gttgaaggct ctcaagggca tcggtcgaga tcccggtgcc 4680
taatgagtga gctaacttac attaattgcg ttgcgctcac tgcccgcttt ccagtcggga 4740
aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt 4800
attgggcgcc agggtggttt ttcttttcac cagtgagacg ggcaacagct gattgccctt 4860
caccgcctgg ccctgagaga gttgcagcaa gcggtccacg ctggtttgcc ccagcaggcg 4920
aaaatcctgt ttgatggtgg ttaacggcgg gatataacat gagctgtctt cggtatcgtc 4980
gtatcccact accgagatat ccgcaccaac gcgcagcccg gactcggtaa tggcgcgcat 5040
tgcgcccagc gccatctgat cgttggcaac cagcatcgca gtgggaacga tgccctcatt 5100
cagcatttgc atggtttgtt gaaaaccgga catggcactc cagtcgcctt cccgttccgc 5160
tatcggctga atttgattgc gagtgagata tttatgccag ccagccagac gcagacgcgc 5220
cgagacagaa cttaatgggc ccgctaacag cgcgatttgc tggtgaccca atgcgaccag 5280
atgctccacg cccagtcgcg taccgtcttc atgggagaaa ataatactgt tgatgggtgt 5340
ctggtcagag acatcaagaa ataacgccgg aacattagtg caggcagctt ccacagcaat 5400
ggcatcctgg tcatccagcg gatagttaat gatcagccca ctgacgcgtt gcgcgagaag 5460
attgtgcacc gccgctttac aggcttcgac gccgcttcgt tctaccatcg acaccaccac 5520
gctggcaccc agttgatcgg cgcgagattt aatcgccgcg acaatttgcg acggcgcgtg 5580
cagggccaga ctggaggtgg caacgccaat cagcaacgac tgtttgcccg ccagttgttg 5640
tgccacgcgg ttgggaatgt aattcagctc cgccatcgcc gcttccactt tttcccgcgt 5700
tttcgcagaa acgtggctgg cctggttcac cacgcgggaa acggtctgat aagagacacc 5760
ggcatactct gcgacatcgt ataacgttac tggtttcaca ttcaccaccc tgaattgact 5820
ctcttccggg cgctatcatg ccataccgcg aaaggttttg cgccattcga tggtgtccgg 5880
gatctcgacg ctctccctta tgcgactcct gcattaggaa gcagcccagt agtaggttga 5940
ggccgttgag caccgccgcc gcaaggaatg gtgcatgcaa ggagatggcg cccaacagtc 6000
ccccggccac ggggcctgcc accataccca cgccgaaaca agcgctcatg agcccgaagt 6060
ggcgagcccg atcttcccca tcggtgatgt cggcgatata ggcgccagca accgcacctg 6120
tggcgccggt gatgccggcc acgatgcgtc cggcgtagag gatcga 6166
<210> 102
<211> 6178
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 102
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat gtccattatc 120
atggaagtgg ccacaatgca agccaaactg acaaaaaatg agttcattga gtggctgaaa 180
acgtccgagg gtaaacagtt caacgtggac ctgtggtacg gttttcagtg tttcgactac 240
gccaacgctg gctggaaagt gctgttcggc ctgctgctga aaggcctggg agccaaagac 300
atcccttttg caaacaattt cgatggcctg gccacagttt atcaaaacac ccctgacttt 360
ctggcccaac caggcgacat ggtggtgttt ggttctaatt atggcgcagg ctatggccac 420
gtagcctggg tgatcgaagc cacactggac tacattattg tttatgagca aaactggctg 480
ggaggcggat ggacagacgg catcgaacag cctggctggg gctgggagaa agtgacacgc 540
cgtcaacatg cctatgactt ccctatgtgg ttcatccgtc ctaatttcaa agagctcggt 600
cgtaaaaaac gtcgtcagcg tcgtcgtccg cctcagtaag gatccggctg ctaacaaagc 660
ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg 720
ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggatatcc 780
cgcaagaggc ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg 840
gtgccgagga tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag 900
caatttaact gtgataaact accgcattaa agctagctta tcgatgataa gctgtcaaac 960
atgagaatta attcttgaag acgaaagggc ctcgtgatac gcctattttt ataggttaat 1020
gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga 1080
acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa 1140
ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt 1200
gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg 1260
ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg 1320
gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg 1380
agcactttta aagttctgct atgtggcgcg gtattatccc gtgttgacgc cgggcaagag 1440
caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca 1500
gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg 1560
agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc 1620
gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg 1680
aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgcagcaat ggcaacaacg 1740
ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca attaatagac 1800
tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg 1860
tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg 1920
gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact 1980
atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa 2040
ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca tttttaattt 2100
aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag 2160
ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct 2220
ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt 2280
tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg 2340
cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct 2400
gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc 2460
gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg 2520
tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa 2580
ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg 2640
gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg 2700
ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga 2760
tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt 2820
ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct 2880
gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga 2940
acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcctgat gcggtatttt 3000
ctccttacgc atctgtgcgg tatttcacac cgcaatggtg cactctcagt acaatctgct 3060
ctgatgccgc atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc 3120
tgcgccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc 3180
atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc 3240
gtcatcaccg aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga 3300
ttcacagatg tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa 3360
tgtctggctt ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga 3420
tgcctccgtg taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag 3480
gatgctcacg atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg 3540
taaacaactg gcggtatgga tgcggcggga ccagagaaaa atcactcagg gtcaatgcca 3600
gcgcttcgtt aatacagatg taggtgttcc acagggtagc cagcagcatc ctgcgatgca 3660
gatccggaac ataatggtgc agggcgctga cttccgcgtt tccagacttt acgaaacacg 3720
gaaaccgaag accattcatg ttgttgctca ggtcgcagac gttttgcagc agcagtcgct 3780
tcacgttcgc tcgcgtatcg gtgattcatt ctgctaacca gtaaggcaac cccgccagcc 3840
tagccgggtc ctcaacgaca ggagcacgat catgcgcacc cgtggccagg acccaacgct 3900
gcccgagatg cgccgcgtgc ggctgctgga gatggcggac gcgatggata tgttctgcca 3960
agggttggtt tgcgcattca cagttctccg caagaattga ttggctccaa ttcttggagt 4020
ggtgaatccg ttagcgaggt gccgccggct tccattcagg tcgaggtggc ccggctccat 4080
gcaccgcgac gcaacgcggg gaggcagaca aggtataggg cggcgcctac aatccatgcc 4140
aacccgttcc atgtgctcgc cgaggcggca taaatcgccg tgacgatcag cggtccaatg 4200
atcgaagtta ggctggtaag agccgcgagc gatccttgaa gctgtccctg atggtcgtca 4260
tctacctgcc tggacagcat ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga 4320
agaatcataa tggggaaggc catccagcct cgcgtcgcga acgccagcaa gacgtagccc 4380
agcgcgtcgg ccgccatgcc ggcgataatg gcctgcttct cgccgaaacg tttggtggcg 4440
ggaccagtga cgaaggcttg agcgagggcg tgcaagattc cgaataccgc aagcgacagg 4500
ccgatcatcg tcgcgctcca gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc 4560
ggcacctgtc ctacgagttg catgataaag aagacagtca taagtgcggc gacgatagtc 4620
atgccccgcg cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga 4680
gatcccggtg cctaatgagt gagctaactt acattaattg cgttgcgctc actgcccgct 4740
ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga 4800
ggcggtttgc gtattgggcg ccagggtggt ttttcttttc accagtgaga cgggcaacag 4860
ctgattgccc ttcaccgcct ggccctgaga gagttgcagc aagcggtcca cgctggtttg 4920
ccccagcagg cgaaaatcct gtttgatggt ggttaacggc gggatataac atgagctgtc 4980
ttcggtatcg tcgtatccca ctaccgagat atccgcacca acgcgcagcc cggactcggt 5040
aatggcgcgc attgcgccca gcgccatctg atcgttggca accagcatcg cagtgggaac 5100
gatgccctca ttcagcattt gcatggtttg ttgaaaaccg gacatggcac tccagtcgcc 5160
ttcccgttcc gctatcggct gaatttgatt gcgagtgaga tatttatgcc agccagccag 5220
acgcagacgc gccgagacag aacttaatgg gcccgctaac agcgcgattt gctggtgacc 5280
caatgcgacc agatgctcca cgcccagtcg cgtaccgtct tcatgggaga aaataatact 5340
gttgatgggt gtctggtcag agacatcaag aaataacgcc ggaacattag tgcaggcagc 5400
ttccacagca atggcatcct ggtcatccag cggatagtta atgatcagcc cactgacgcg 5460
ttgcgcgaga agattgtgca ccgccgcttt acaggcttcg acgccgcttc gttctaccat 5520
cgacaccacc acgctggcac ccagttgatc ggcgcgagat ttaatcgccg cgacaatttg 5580
cgacggcgcg tgcagggcca gactggaggt ggcaacgcca atcagcaacg actgtttgcc 5640
cgccagttgt tgtgccacgc ggttgggaat gtaattcagc tccgccatcg ccgcttccac 5700
tttttcccgc gttttcgcag aaacgtggct ggcctggttc accacgcggg aaacggtctg 5760
ataagagaca ccggcatact ctgcgacatc gtataacgtt actggtttca cattcaccac 5820
cctgaattga ctctcttccg ggcgctatca tgccataccg cgaaaggttt tgcgccattc 5880
gatggtgtcc gggatctcga cgctctccct tatgcgactc ctgcattagg aagcagccca 5940
gtagtaggtt gaggccgttg agcaccgccg ccgcaaggaa tggtgcatgc aaggagatgg 6000
cgcccaacag tcccccggcc acggggcctg ccaccatacc cacgccgaaa caagcgctca 6060
tgagcccgaa gtggcgagcc cgatcttccc catcggtgat gtcggcgata taggcgccag 6120
caaccgcacc tgtggcgccg gtgatgccgg ccacgatgcg tccggcgtag aggatcga 6178
<210> 103
<211> 6511
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 103
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat ggcagccaca 120
catgaacact ctgcccaatg gctgaacaac tacaaaaaag gctacggtta tggcccttac 180
cctctgggca ttaacggtgg catgcactac ggcgttgact tttttatgaa catcggcacc 240
cctgtgaaag ccattagctc aggcaaaatc gtggaagccg gttggtcaaa ctatggcggt 300
ggcaaccaga tcggtctgat cgagaacgat ggtgtgcacc gccaatggta catgcacctg 360
tccaaataca acgttaaagt tggtgactac gtgaaagcag gccagattat cggctggtca 420
ggttcaaccg gttattcaac agcccctcat ctgcacttcc aacgcatggt gaatagtttt 480
agtaattcta ccgctcaaga tccgatgcca ttcctgaaat ctgccggtta tggtggcaaa 540
ctggaagtta gcaaagcagc aaccattaaa cagtccgatg ttaaacaaga agtgaaaaaa 600
caagaggcca aacaaattgt gaaagcgacc gattggaaac agaacaaaga tggcatttgg 660
tataaagcag aacatgccag ctttaccgtg accgcaccgg aaggcattat tacccgttat 720
aaaggtccgt ggaccggtca tccgcaggca ggcgtgctgc agaaaggtca gaccatcaaa 780
tatgatgagg tgcagaaatt tgatggccat gtttgggtta gctgggaaac ctttgaaggt 840
gaaaccgttt atatgccggt tcgtacctgg gatgcaaaaa ccggtaaagt gggtaaactg 900
tggggtgaaa tcaaagagct ccgccagatc aaaatttggt ttcagaatcg tcgcatgaaa 960
tggaaaaaat aaggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc 1020
caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt 1080
tttgctgaaa ggaggaacta tatccggata tcccgcaaga ggcccggcag taccggcata 1140
accaagccta tgcctacagc atccagggtg acggtgccga ggatgacgat gagcgcattg 1200
ttagatttca tacacggtgc ctgactgcgt tagcaattta actgtgataa actaccgcat 1260
taaagctagc ttatcgatga taagctgtca aacatgagaa ttaattcttg aagacgaaag 1320
ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg 1380
tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata 1440
cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga 1500
aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca 1560
ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat 1620
cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag 1680
agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc 1740
gcggtattat cccgtgttga cgccgggcaa gagcaactcg gtcgccgcat acactattct 1800
cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca 1860
gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt 1920
ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat 1980
gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt 2040
gacaccacga tgcctgcagc aatggcaaca acgttgcgca aactattaac tggcgaacta 2100
cttactctag cttcccggca acaattaata gactggatgg aggcggataa agttgcagga 2160
ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt 2220
gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc 2280
gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct 2340
gagataggtg cctcactgat taagcattgg taactgtcag accaagttta ctcatatata 2400
ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt 2460
gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc 2520
gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg 2580
caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact 2640
ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg 2700
tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg 2760
ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac 2820
tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca 2880
cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga 2940
gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc 3000
ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct 3060
gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg 3120
agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct 3180
tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc 3240
tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc 3300
gaggaagcgg aagagcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca 3360
caccgcaatg gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagtata 3420
cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc caacacccgc 3480
tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 3540
ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgaggcagct 3600
gcggtaaagc tcatcagcgt ggtcgtgaag cgattcacag atgtctgcct gttcatccgc 3660
gtccagctcg ttgagtttct ccagaagcgt taatgtctgg cttctgataa agcgggccat 3720
gttaagggcg gttttttcct gtttggtcac tgatgcctcc gtgtaagggg gatttctgtt 3780
catgggggta atgataccga tgaaacgaga gaggatgctc acgatacggg ttactgatga 3840
tgaacatgcc cggttactgg aacgttgtga gggtaaacaa ctggcggtat ggatgcggcg 3900
ggaccagaga aaaatcactc agggtcaatg ccagcgcttc gttaatacag atgtaggtgt 3960
tccacagggt agccagcagc atcctgcgat gcagatccgg aacataatgg tgcagggcgc 4020
tgacttccgc gtttccagac tttacgaaac acggaaaccg aagaccattc atgttgttgc 4080
tcaggtcgca gacgttttgc agcagcagtc gcttcacgtt cgctcgcgta tcggtgattc 4140
attctgctaa ccagtaaggc aaccccgcca gcctagccgg gtcctcaacg acaggagcac 4200
gatcatgcgc acccgtggcc aggacccaac gctgcccgag atgcgccgcg tgcggctgct 4260
ggagatggcg gacgcgatgg atatgttctg ccaagggttg gtttgcgcat tcacagttct 4320
ccgcaagaat tgattggctc caattcttgg agtggtgaat ccgttagcga ggtgccgccg 4380
gcttccattc aggtcgaggt ggcccggctc catgcaccgc gacgcaacgc ggggaggcag 4440
acaaggtata gggcggcgcc tacaatccat gccaacccgt tccatgtgct cgccgaggcg 4500
gcataaatcg ccgtgacgat cagcggtcca atgatcgaag ttaggctggt aagagccgcg 4560
agcgatcctt gaagctgtcc ctgatggtcg tcatctacct gcctggacag catggcctgc 4620
aacgcgggca tcccgatgcc gccggaagcg agaagaatca taatggggaa ggccatccag 4680
cctcgcgtcg cgaacgccag caagacgtag cccagcgcgt cggccgccat gccggcgata 4740
atggcctgct tctcgccgaa acgtttggtg gcgggaccag tgacgaaggc ttgagcgagg 4800
gcgtgcaaga ttccgaatac cgcaagcgac aggccgatca tcgtcgcgct ccagcgaaag 4860
cggtcctcgc cgaaaatgac ccagagcgct gccggcacct gtcctacgag ttgcatgata 4920
aagaagacag tcataagtgc ggcgacgata gtcatgcccc gcgcccaccg gaaggagctg 4980
actgggttga aggctctcaa gggcatcggt cgagatcccg gtgcctaatg agtgagctaa 5040
cttacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 5100
ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgccagggt 5160
ggtttttctt ttcaccagtg agacgggcaa cagctgattg cccttcaccg cctggccctg 5220
agagagttgc agcaagcggt ccacgctggt ttgccccagc aggcgaaaat cctgtttgat 5280
ggtggttaac ggcgggatat aacatgagct gtcttcggta tcgtcgtatc ccactaccga 5340
gatatccgca ccaacgcgca gcccggactc ggtaatggcg cgcattgcgc ccagcgccat 5400
ctgatcgttg gcaaccagca tcgcagtggg aacgatgccc tcattcagca tttgcatggt 5460
ttgttgaaaa ccggacatgg cactccagtc gccttcccgt tccgctatcg gctgaatttg 5520
attgcgagtg agatatttat gccagccagc cagacgcaga cgcgccgaga cagaacttaa 5580
tgggcccgct aacagcgcga tttgctggtg acccaatgcg accagatgct ccacgcccag 5640
tcgcgtaccg tcttcatggg agaaaataat actgttgatg ggtgtctggt cagagacatc 5700
aagaaataac gccggaacat tagtgcaggc agcttccaca gcaatggcat cctggtcatc 5760
cagcggatag ttaatgatca gcccactgac gcgttgcgcg agaagattgt gcaccgccgc 5820
tttacaggct tcgacgccgc ttcgttctac catcgacacc accacgctgg cacccagttg 5880
atcggcgcga gatttaatcg ccgcgacaat ttgcgacggc gcgtgcaggg ccagactgga 5940
ggtggcaacg ccaatcagca acgactgttt gcccgccagt tgttgtgcca cgcggttggg 6000
aatgtaattc agctccgcca tcgccgcttc cactttttcc cgcgttttcg cagaaacgtg 6060
gctggcctgg ttcaccacgc gggaaacggt ctgataagag acaccggcat actctgcgac 6120
atcgtataac gttactggtt tcacattcac caccctgaat tgactctctt ccgggcgcta 6180
tcatgccata ccgcgaaagg ttttgcgcca ttcgatggtg tccgggatct cgacgctctc 6240
ccttatgcga ctcctgcatt aggaagcagc ccagtagtag gttgaggccg ttgagcaccg 6300
ccgccgcaag gaatggtgca tgcaaggaga tggcgcccaa cagtcccccg gccacggggc 6360
ctgccaccat acccacgccg aaacaagcgc tcatgagccc gaagtggcga gcccgatctt 6420
ccccatcggt gatgtcggcg atataggcgc cagcaaccgc acctgtggcg ccggtgatgc 6480
cggccacgat gcgtccggcg tagaggatcg a 6511
<210> 104
<211> 6490
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 104
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat ggcagccaca 120
catgaacact ctgcccaatg gctgaacaac tacaaaaaag gctacggtta tggcccttac 180
cctctgggca ttaacggtgg catgcactac ggcgttgact tttttatgaa catcggcacc 240
cctgtgaaag ccattagctc aggcaaaatc gtggaagccg gttggtcaaa ctatggcggt 300
ggcaaccaga tcggtctgat cgagaacgat ggtgtgcacc gccaatggta catgcacctg 360
tccaaataca acgttaaagt tggtgactac gtgaaagcag gccagattat cggctggtca 420
ggttcaaccg gttattcaac agcccctcat ctgcacttcc aacgcatggt gaatagtttt 480
agtaattcta ccgctcaaga tccgatgcca ttcctgaaat ctgccggtta tggtggcaaa 540
ctggaagtta gcaaagcagc aaccattaaa cagtccgatg ttaaacaaga agtgaaaaaa 600
caagaggcca aacaaattgt gaaagcgacc gattggaaac agaacaaaga tggcatttgg 660
tataaagcag aacatgccag ctttaccgtg accgcaccgg aaggcattat tacccgttat 720
aaaggtccgt ggaccggtca tccgcaggca ggcgtgctgc agaaaggtca gaccatcaaa 780
tatgatgagg tgcagaaatt tgatggccat gtttgggtta gctgggaaac ctttgaaggt 840
gaaaccgttt atatgccggt tcgtacctgg gatgcaaaaa ccggtaaagt gggtaaactg 900
tggggtgaaa tcaaagagct ccgtcgtcgt cgccgtcggc gtcgtcgtta aggatccggc 960
tgctaacaaa gcccgaaagg aagctgagtt ggctgctgcc accgctgagc aataactagc 1020
ataacccctt ggggcctcta aacgggtctt gaggggtttt ttgctgaaag gaggaactat 1080
atccggatat cccgcaagag gcccggcagt accggcataa ccaagcctat gcctacagca 1140
tccagggtga cggtgccgag gatgacgatg agcgcattgt tagatttcat acacggtgcc 1200
tgactgcgtt agcaatttaa ctgtgataaa ctaccgcatt aaagctagct tatcgatgat 1260
aagctgtcaa acatgagaat taattcttga agacgaaagg gcctcgtgat acgcctattt 1320
ttataggtta atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga 1380
aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc 1440
atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt 1500
caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct 1560
cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt 1620
tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt 1680
tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtgttgac 1740
gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac 1800
tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct 1860
gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg 1920
aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg 1980
gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgcagca 2040
atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa 2100
caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt 2160
ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc 2220
attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg 2280
agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt 2340
aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt 2400
catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc 2460
ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct 2520
tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta 2580
ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc 2640
ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac 2700
ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct 2760
gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat 2820
aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg 2880
acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa 2940
gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg 3000
gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga 3060
cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc 3120
aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct 3180
gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct 3240
cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgcctg 3300
atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcaatgg tgcactctca 3360
gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga 3420
ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg 3480
tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca 3540
gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg 3600
gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc 3660
cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg 3720
tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat 3780
gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga 3840
acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca 3900
gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca 3960
tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact 4020
ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca 4080
gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca 4140
accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca 4200
ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga 4260
tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc 4320
aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg 4380
gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct 4440
acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc 4500
agcggtccaa tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc 4560
tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg 4620
ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgc gaacgccagc 4680
aagacgtagc ccagcgcgtc ggccgccatg ccggcgataa tggcctgctt ctcgccgaaa 4740
cgtttggtgg cgggaccagt gacgaaggct tgagcgaggg cgtgcaagat tccgaatacc 4800
gcaagcgaca ggccgatcat cgtcgcgctc cagcgaaagc ggtcctcgcc gaaaatgacc 4860
cagagcgctg ccggcacctg tcctacgagt tgcatgataa agaagacagt cataagtgcg 4920
gcgacgatag tcatgccccg cgcccaccgg aaggagctga ctgggttgaa ggctctcaag 4980
ggcatcggtc gagatcccgg tgcctaatga gtgagctaac ttacattaat tgcgttgcgc 5040
tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa 5100
cgcgcgggga gaggcggttt gcgtattggg cgccagggtg gtttttcttt tcaccagtga 5160
gacgggcaac agctgattgc ccttcaccgc ctggccctga gagagttgca gcaagcggtc 5220
cacgctggtt tgccccagca ggcgaaaatc ctgtttgatg gtggttaacg gcgggatata 5280
acatgagctg tcttcggtat cgtcgtatcc cactaccgag atatccgcac caacgcgcag 5340
cccggactcg gtaatggcgc gcattgcgcc cagcgccatc tgatcgttgg caaccagcat 5400
cgcagtggga acgatgccct cattcagcat ttgcatggtt tgttgaaaac cggacatggc 5460
actccagtcg ccttcccgtt ccgctatcgg ctgaatttga ttgcgagtga gatatttatg 5520
ccagccagcc agacgcagac gcgccgagac agaacttaat gggcccgcta acagcgcgat 5580
ttgctggtga cccaatgcga ccagatgctc cacgcccagt cgcgtaccgt cttcatggga 5640
gaaaataata ctgttgatgg gtgtctggtc agagacatca agaaataacg ccggaacatt 5700
agtgcaggca gcttccacag caatggcatc ctggtcatcc agcggatagt taatgatcag 5760
cccactgacg cgttgcgcga gaagattgtg caccgccgct ttacaggctt cgacgccgct 5820
tcgttctacc atcgacacca ccacgctggc acccagttga tcggcgcgag atttaatcgc 5880
cgcgacaatt tgcgacggcg cgtgcagggc cagactggag gtggcaacgc caatcagcaa 5940
cgactgtttg cccgccagtt gttgtgccac gcggttggga atgtaattca gctccgccat 6000
cgccgcttcc actttttccc gcgttttcgc agaaacgtgg ctggcctggt tcaccacgcg 6060
ggaaacggtc tgataagaga caccggcata ctctgcgaca tcgtataacg ttactggttt 6120
cacattcacc accctgaatt gactctcttc cgggcgctat catgccatac cgcgaaaggt 6180
tttgcgccat tcgatggtgt ccgggatctc gacgctctcc cttatgcgac tcctgcatta 6240
ggaagcagcc cagtagtagg ttgaggccgt tgagcaccgc cgccgcaagg aatggtgcat 6300
gcaaggagat ggcgcccaac agtcccccgg ccacggggcc tgccaccata cccacgccga 6360
aacaagcgct catgagcccg aagtggcgag cccgatcttc cccatcggtg atgtcggcga 6420
tataggcgcc agcaaccgca cctgtggcgc cggtgatgcc ggccacgatg cgtccggcgt 6480
agaggatcga 6490
<210> 105
<211> 6502
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 105
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat ggcagccaca 120
catgaacact ctgcccaatg gctgaacaac tacaaaaaag gctacggtta tggcccttac 180
cctctgggca ttaacggtgg catgcactac ggcgttgact tttttatgaa catcggcacc 240
cctgtgaaag ccattagctc aggcaaaatc gtggaagccg gttggtcaaa ctatggcggt 300
ggcaaccaga tcggtctgat cgagaacgat ggtgtgcacc gccaatggta catgcacctg 360
tccaaataca acgttaaagt tggtgactac gtgaaagcag gccagattat cggctggtca 420
ggttcaaccg gttattcaac agcccctcat ctgcacttcc aacgcatggt gaatagtttt 480
agtaattcta ccgctcaaga tccgatgcca ttcctgaaat ctgccggtta tggtggcaaa 540
ctggaagtta gcaaagcagc aaccattaaa cagtccgatg ttaaacaaga agtgaaaaaa 600
caagaggcca aacaaattgt gaaagcgacc gattggaaac agaacaaaga tggcatttgg 660
tataaagcag aacatgccag ctttaccgtg accgcaccgg aaggcattat tacccgttat 720
aaaggtccgt ggaccggtca tccgcaggca ggcgtgctgc agaaaggtca gaccatcaaa 780
tatgatgagg tgcagaaatt tgatggccat gtttgggtta gctgggaaac ctttgaaggt 840
gaaaccgttt atatgccggt tcgtacctgg gatgcaaaaa ccggtaaagt gggtaaactg 900
tggggtgaaa tcaaagagct cggtcgtaaa aaacgtcgtc agcgtcgtcg tccgcctcag 960
taaggatccg gctgctaaca aagcccgaaa ggaagctgag ttggctgctg ccaccgctga 1020
gcaataacta gcataacccc ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa 1080
aggaggaact atatccggat atcccgcaag aggcccggca gtaccggcat aaccaagcct 1140
atgcctacag catccagggt gacggtgccg aggatgacga tgagcgcatt gttagatttc 1200
atacacggtg cctgactgcg ttagcaattt aactgtgata aactaccgca ttaaagctag 1260
cttatcgatg ataagctgtc aaacatgaga attaattctt gaagacgaaa gggcctcgtg 1320
atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc 1380
acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat 1440
atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag 1500
agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt 1560
cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt 1620
gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga gagttttcgc 1680
cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta 1740
tcccgtgttg acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac 1800
ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa 1860
ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg 1920
atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 1980
cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg 2040
atgcctgcag caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta 2100
gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg accacttctg 2160
cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg 2220
tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc 2280
tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt 2340
gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt 2400
gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc 2460
atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag 2520
atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa 2580
aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg 2640
aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag 2700
ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg 2760
ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga 2820
tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc 2880
ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc 2940
acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga 3000
gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt 3060
cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg 3120
aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac 3180
atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga 3240
gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg 3300
gaagagcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcaat 3360
ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat acactccgct 3420
atcgctacgt gactgggtca tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc 3480
ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag 3540
ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgaggcagc tgcggtaaag 3600
ctcatcagcg tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg cgtccagctc 3660
gttgagtttc tccagaagcg ttaatgtctg gcttctgata aagcgggcca tgttaagggc 3720
ggttttttcc tgtttggtca ctgatgcctc cgtgtaaggg ggatttctgt tcatgggggt 3780
aatgataccg atgaaacgag agaggatgct cacgatacgg gttactgatg atgaacatgc 3840
ccggttactg gaacgttgtg agggtaaaca actggcggta tggatgcggc gggaccagag 3900
aaaaatcact cagggtcaat gccagcgctt cgttaataca gatgtaggtg ttccacaggg 3960
tagccagcag catcctgcga tgcagatccg gaacataatg gtgcagggcg ctgacttccg 4020
cgtttccaga ctttacgaaa cacggaaacc gaagaccatt catgttgttg ctcaggtcgc 4080
agacgttttg cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt cattctgcta 4140
accagtaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca cgatcatgcg 4200
cacccgtggc caggacccaa cgctgcccga gatgcgccgc gtgcggctgc tggagatggc 4260
ggacgcgatg gatatgttct gccaagggtt ggtttgcgca ttcacagttc tccgcaagaa 4320
ttgattggct ccaattcttg gagtggtgaa tccgttagcg aggtgccgcc ggcttccatt 4380
caggtcgagg tggcccggct ccatgcaccg cgacgcaacg cggggaggca gacaaggtat 4440
agggcggcgc ctacaatcca tgccaacccg ttccatgtgc tcgccgaggc ggcataaatc 4500
gccgtgacga tcagcggtcc aatgatcgaa gttaggctgg taagagccgc gagcgatcct 4560
tgaagctgtc cctgatggtc gtcatctacc tgcctggaca gcatggcctg caacgcgggc 4620
atcccgatgc cgccggaagc gagaagaatc ataatgggga aggccatcca gcctcgcgtc 4680
gcgaacgcca gcaagacgta gcccagcgcg tcggccgcca tgccggcgat aatggcctgc 4740
ttctcgccga aacgtttggt ggcgggacca gtgacgaagg cttgagcgag ggcgtgcaag 4800
attccgaata ccgcaagcga caggccgatc atcgtcgcgc tccagcgaaa gcggtcctcg 4860
ccgaaaatga cccagagcgc tgccggcacc tgtcctacga gttgcatgat aaagaagaca 4920
gtcataagtg cggcgacgat agtcatgccc cgcgcccacc ggaaggagct gactgggttg 4980
aaggctctca agggcatcgg tcgagatccc ggtgcctaat gagtgagcta acttacatta 5040
attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 5100
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgccaggg tggtttttct 5160
tttcaccagt gagacgggca acagctgatt gcccttcacc gcctggccct gagagagttg 5220
cagcaagcgg tccacgctgg tttgccccag caggcgaaaa tcctgtttga tggtggttaa 5280
cggcgggata taacatgagc tgtcttcggt atcgtcgtat cccactaccg agatatccgc 5340
accaacgcgc agcccggact cggtaatggc gcgcattgcg cccagcgcca tctgatcgtt 5400
ggcaaccagc atcgcagtgg gaacgatgcc ctcattcagc atttgcatgg tttgttgaaa 5460
accggacatg gcactccagt cgccttcccg ttccgctatc ggctgaattt gattgcgagt 5520
gagatattta tgccagccag ccagacgcag acgcgccgag acagaactta atgggcccgc 5580
taacagcgcg atttgctggt gacccaatgc gaccagatgc tccacgccca gtcgcgtacc 5640
gtcttcatgg gagaaaataa tactgttgat gggtgtctgg tcagagacat caagaaataa 5700
cgccggaaca ttagtgcagg cagcttccac agcaatggca tcctggtcat ccagcggata 5760
gttaatgatc agcccactga cgcgttgcgc gagaagattg tgcaccgccg ctttacaggc 5820
ttcgacgccg cttcgttcta ccatcgacac caccacgctg gcacccagtt gatcggcgcg 5880
agatttaatc gccgcgacaa tttgcgacgg cgcgtgcagg gccagactgg aggtggcaac 5940
gccaatcagc aacgactgtt tgcccgccag ttgttgtgcc acgcggttgg gaatgtaatt 6000
cagctccgcc atcgccgctt ccactttttc ccgcgttttc gcagaaacgt ggctggcctg 6060
gttcaccacg cgggaaacgg tctgataaga gacaccggca tactctgcga catcgtataa 6120
cgttactggt ttcacattca ccaccctgaa ttgactctct tccgggcgct atcatgccat 6180
accgcgaaag gttttgcgcc attcgatggt gtccgggatc tcgacgctct cccttatgcg 6240
actcctgcat taggaagcag cccagtagta ggttgaggcc gttgagcacc gccgccgcaa 6300
ggaatggtgc atgcaaggag atggcgccca acagtccccc ggccacgggg cctgccacca 6360
tacccacgcc gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg 6420
tgatgtcggc gatataggcg ccagcaaccg cacctgtggc gccggtgatg ccggccacga 6480
tgcgtccggc gtagaggatc ga 6502
<210> 106
<211> 6130
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 106
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat ggcagccaca 120
catgaacact ctgcccaatg gctgaacaac tacaaaaaag gctacggtta tggcccttac 180
cctctgggca ttaacggtgg catgcactac ggcgttgact tttttatgaa catcggcacc 240
cctgtgaaag ccattagctc aggcaaaatc gtggaagccg gttggtcaaa ctatggcggt 300
ggcaaccaga tcggtctgat cgagaacgat ggtgtgcacc gccaatggta catgcacctg 360
tccaaataca acgttaaagt tggtgactac gtgaaagcag gccagattat cggctggtca 420
ggttcaaccg gttattcaac agcccctcat ctgcacttcc aacgcatggt gaatagtttt 480
agtaattcta ccgctcaaga tccgatgcca ttcctgaaat ctgccggtta tggtgagctc 540
cgccagatca aaatttggtt tcagaatcgt cgcatgaaat ggaaaaaata aggatccggc 600
tgctaacaaa gcccgaaagg aagctgagtt ggctgctgcc accgctgagc aataactagc 660
ataacccctt ggggcctcta aacgggtctt gaggggtttt ttgctgaaag gaggaactat 720
atccggatat cccgcaagag gcccggcagt accggcataa ccaagcctat gcctacagca 780
tccagggtga cggtgccgag gatgacgatg agcgcattgt tagatttcat acacggtgcc 840
tgactgcgtt agcaatttaa ctgtgataaa ctaccgcatt aaagctagct tatcgatgat 900
aagctgtcaa acatgagaat taattcttga agacgaaagg gcctcgtgat acgcctattt 960
ttataggtta atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga 1020
aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc 1080
atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt 1140
caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct 1200
cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt 1260
tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt 1320
tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtgttgac 1380
gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac 1440
tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct 1500
gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg 1560
aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg 1620
gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgcagca 1680
atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa 1740
caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt 1800
ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc 1860
attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg 1920
agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt 1980
aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt 2040
catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc 2100
ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct 2160
tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta 2220
ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc 2280
ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac 2340
ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct 2400
gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat 2460
aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg 2520
acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa 2580
gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg 2640
gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga 2700
cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc 2760
aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct 2820
gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct 2880
cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgcctg 2940
atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcaatgg tgcactctca 3000
gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga 3060
ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg 3120
tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca 3180
gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg 3240
gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc 3300
cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg 3360
tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat 3420
gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga 3480
acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca 3540
gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca 3600
tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact 3660
ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca 3720
gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca 3780
accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca 3840
ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga 3900
tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc 3960
aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg 4020
gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct 4080
acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc 4140
agcggtccaa tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc 4200
tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg 4260
ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgc gaacgccagc 4320
aagacgtagc ccagcgcgtc ggccgccatg ccggcgataa tggcctgctt ctcgccgaaa 4380
cgtttggtgg cgggaccagt gacgaaggct tgagcgaggg cgtgcaagat tccgaatacc 4440
gcaagcgaca ggccgatcat cgtcgcgctc cagcgaaagc ggtcctcgcc gaaaatgacc 4500
cagagcgctg ccggcacctg tcctacgagt tgcatgataa agaagacagt cataagtgcg 4560
gcgacgatag tcatgccccg cgcccaccgg aaggagctga ctgggttgaa ggctctcaag 4620
ggcatcggtc gagatcccgg tgcctaatga gtgagctaac ttacattaat tgcgttgcgc 4680
tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa 4740
cgcgcgggga gaggcggttt gcgtattggg cgccagggtg gtttttcttt tcaccagtga 4800
gacgggcaac agctgattgc ccttcaccgc ctggccctga gagagttgca gcaagcggtc 4860
cacgctggtt tgccccagca ggcgaaaatc ctgtttgatg gtggttaacg gcgggatata 4920
acatgagctg tcttcggtat cgtcgtatcc cactaccgag atatccgcac caacgcgcag 4980
cccggactcg gtaatggcgc gcattgcgcc cagcgccatc tgatcgttgg caaccagcat 5040
cgcagtggga acgatgccct cattcagcat ttgcatggtt tgttgaaaac cggacatggc 5100
actccagtcg ccttcccgtt ccgctatcgg ctgaatttga ttgcgagtga gatatttatg 5160
ccagccagcc agacgcagac gcgccgagac agaacttaat gggcccgcta acagcgcgat 5220
ttgctggtga cccaatgcga ccagatgctc cacgcccagt cgcgtaccgt cttcatggga 5280
gaaaataata ctgttgatgg gtgtctggtc agagacatca agaaataacg ccggaacatt 5340
agtgcaggca gcttccacag caatggcatc ctggtcatcc agcggatagt taatgatcag 5400
cccactgacg cgttgcgcga gaagattgtg caccgccgct ttacaggctt cgacgccgct 5460
tcgttctacc atcgacacca ccacgctggc acccagttga tcggcgcgag atttaatcgc 5520
cgcgacaatt tgcgacggcg cgtgcagggc cagactggag gtggcaacgc caatcagcaa 5580
cgactgtttg cccgccagtt gttgtgccac gcggttggga atgtaattca gctccgccat 5640
cgccgcttcc actttttccc gcgttttcgc agaaacgtgg ctggcctggt tcaccacgcg 5700
ggaaacggtc tgataagaga caccggcata ctctgcgaca tcgtataacg ttactggttt 5760
cacattcacc accctgaatt gactctcttc cgggcgctat catgccatac cgcgaaaggt 5820
tttgcgccat tcgatggtgt ccgggatctc gacgctctcc cttatgcgac tcctgcatta 5880
ggaagcagcc cagtagtagg ttgaggccgt tgagcaccgc cgccgcaagg aatggtgcat 5940
gcaaggagat ggcgcccaac agtcccccgg ccacggggcc tgccaccata cccacgccga 6000
aacaagcgct catgagcccg aagtggcgag cccgatcttc cccatcggtg atgtcggcga 6060
tataggcgcc agcaaccgca cctgtggcgc cggtgatgcc ggccacgatg cgtccggcgt 6120
agaggatcga 6130
<210> 107
<211> 6114
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 107
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat ggcagccaca 120
catgaacact ctgcccaatg gctgaacaac tacaaaaaag gctacggtta tggcccttac 180
cctctgggca ttaacggtgg catgcactac ggcgttgact tttttatgaa catcggcacc 240
cctgtgaaag ccattagctc aggcaaaatc gtggaagccg gttggtcaaa ctatggcggt 300
ggcaaccaga tcggtctgat cgagaacgat ggtgtgcacc gccaatggta catgcacctg 360
tccaaataca acgttaaagt tggtgactac gtgaaagcag gccagattat cggctggtca 420
ggttcaaccg gttattcaac agcccctcat ctgcacttcc aacgcatggt gaatagtttt 480
agtaattcta ccgctcaaga tccgatgcca ttcctgaaat ctgccggtta tggtgagctg 540
agctccgtcg tcgtcgccgt cggcgtcgtc gttaaggatc cggctgctaa caaagcccga 600
aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc 660
tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg atatcccgca 720
agaggcccgg cagtaccggc ataaccaagc ctatgcctac agcatccagg gtgacggtgc 780
cgaggatgac gatgagcgca ttgttagatt tcatacacgg tgcctgactg cgttagcaat 840
ttaactgtga taaactaccg cattaaagct agcttatcga tgataagctg tcaaacatga 900
gaattaattc ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca 960
tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc 1020
ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 1080
gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 1140
cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 1200
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 1260
tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 1320
cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac 1380
tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 1440
agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 1500
ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 1560
ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 1620
aagccatacc aaacgacgag cgtgacacca cgatgcctgc agcaatggca acaacgttgc 1680
gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 1740
tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 1800
ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc 1860
cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 1920
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 1980
cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 2040
ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 2100
cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 2160
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 2220
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 2280
taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 2340
caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 2400
agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 2460
gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 2520
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 2580
ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 2640
acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 2700
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 2760
ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 2820
ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 2880
ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg tattttctcc 2940
ttacgcatct gtgcggtatt tcacaccgca atggtgcact ctcagtacaa tctgctctga 3000
tgccgcatag ttaagccagt atacactccg ctatcgctac gtgactgggt catggctgcg 3060
ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 3120
gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 3180
tcaccgaaac gcgcgaggca gctgcggtaa agctcatcag cgtggtcgtg aagcgattca 3240
cagatgtctg cctgttcatc cgcgtccagc tcgttgagtt tctccagaag cgttaatgtc 3300
tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttggt cactgatgcc 3360
tccgtgtaag ggggatttct gttcatgggg gtaatgatac cgatgaaacg agagaggatg 3420
ctcacgatac gggttactga tgatgaacat gcccggttac tggaacgttg tgagggtaaa 3480
caactggcgg tatggatgcg gcgggaccag agaaaaatca ctcagggtca atgccagcgc 3540
ttcgttaata cagatgtagg tgttccacag ggtagccagc agcatcctgc gatgcagatc 3600
cggaacataa tggtgcaggg cgctgacttc cgcgtttcca gactttacga aacacggaaa 3660
ccgaagacca ttcatgttgt tgctcaggtc gcagacgttt tgcagcagca gtcgcttcac 3720
gttcgctcgc gtatcggtga ttcattctgc taaccagtaa ggcaaccccg ccagcctagc 3780
cgggtcctca acgacaggag cacgatcatg cgcacccgtg gccaggaccc aacgctgccc 3840
gagatgcgcc gcgtgcggct gctggagatg gcggacgcga tggatatgtt ctgccaaggg 3900
ttggtttgcg cattcacagt tctccgcaag aattgattgg ctccaattct tggagtggtg 3960
aatccgttag cgaggtgccg ccggcttcca ttcaggtcga ggtggcccgg ctccatgcac 4020
cgcgacgcaa cgcggggagg cagacaaggt atagggcggc gcctacaatc catgccaacc 4080
cgttccatgt gctcgccgag gcggcataaa tcgccgtgac gatcagcggt ccaatgatcg 4140
aagttaggct ggtaagagcc gcgagcgatc cttgaagctg tccctgatgg tcgtcatcta 4200
cctgcctgga cagcatggcc tgcaacgcgg gcatcccgat gccgccggaa gcgagaagaa 4260
tcataatggg gaaggccatc cagcctcgcg tcgcgaacgc cagcaagacg tagcccagcg 4320
cgtcggccgc catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac 4380
cagtgacgaa ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga 4440
tcatcgtcgc gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca 4500
cctgtcctac gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc 4560
cccgcgccca ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc 4620
ccggtgccta atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc 4680
agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 4740
gtttgcgtat tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga 4800
ttgcccttca ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc 4860
agcaggcgaa aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg 4920
gtatcgtcgt atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg 4980
gcgcgcattg cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg 5040
ccctcattca gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc 5100
cgttccgcta tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc 5160
agacgcgccg agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat 5220
gcgaccagat gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg 5280
atgggtgtct ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc 5340
acagcaatgg catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc 5400
gcgagaagat tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac 5460
accaccacgc tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac 5520
ggcgcgtgca gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc 5580
agttgttgtg ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt 5640
tcccgcgttt tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa 5700
gagacaccgg catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg 5760
aattgactct cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg 5820
gtgtccggga tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag 5880
taggttgagg ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc 5940
caacagtccc ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag 6000
cccgaagtgg cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac 6060
cgcacctgtg gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcga 6114
<210> 108
<211> 6126
<212> DNA
<213> Artificial
<220>
<223> Vector construct
<400> 108
gatctcgatc ccgcgaaatt aatacgactc actatagggg aattgtgagc ggataacaat 60
tcccctctag aaataatttt gtttaaactt taagaaggag atatacatat ggcagccaca 120
catgaacact ctgcccaatg gctgaacaac tacaaaaaag gctacggtta tggcccttac 180
cctctgggca ttaacggtgg catgcactac ggcgttgact tttttatgaa catcggcacc 240
cctgtgaaag ccattagctc aggcaaaatc gtggaagccg gttggtcaaa ctatggcggt 300
ggcaaccaga tcggtctgat cgagaacgat ggtgtgcacc gccaatggta catgcacctg 360
tccaaataca acgttaaagt tggtgactac gtgaaagcag gccagattat cggctggtca 420
ggttcaaccg gttattcaac agcccctcat ctgcacttcc aacgcatggt gaatagtttt 480
agtaattcta ccgctcaaga tccgatgcca ttcctgaaat ctgccggtta tggtgagctg 540
agctcggtcg taaaaaacgt cgtcagcgtc gtcgtccgcc tcagtaagga tccggctgct 600
aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata actagcataa 660
ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg aactatatcc 720
ggatatcccg caagaggccc ggcagtaccg gcataaccaa gcctatgcct acagcatcca 780
gggtgacggt gccgaggatg acgatgagcg cattgttaga tttcatacac ggtgcctgac 840
tgcgttagca atttaactgt gataaactac cgcattaaag ctagcttatc gatgataagc 900
tgtcaaacat gagaattaat tcttgaagac gaaagggcct cgtgatacgc ctatttttat 960
aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt cggggaaatg 1020
tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 1080
gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 1140
atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 1200
cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 1260
tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 1320
caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt gttgacgccg 1380
ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 1440
cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 1500
taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 1560
agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 1620
cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gcagcaatgg 1680
caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 1740
taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 1800
ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 1860
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 1920
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 1980
attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 2040
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 2100
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 2160
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 2220
cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 2280
gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 2340
agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 2400
ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 2460
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 2520
acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 2580
gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 2640
ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 2700
agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 2760
cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 2820
tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 2880
gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcctgatgc 2940
ggtattttct ccttacgcat ctgtgcggta tttcacaccg caatggtgca ctctcagtac 3000
aatctgctct gatgccgcat agttaagcca gtatacactc cgctatcgct acgtgactgg 3060
gtcatggctg cgccccgaca cccgccaaca cccgctgacg cgccctgacg ggcttgtctg 3120
ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat gtgtcagagg 3180
ttttcaccgt catcaccgaa acgcgcgagg cagctgcggt aaagctcatc agcgtggtcg 3240
tgaagcgatt cacagatgtc tgcctgttca tccgcgtcca gctcgttgag tttctccaga 3300
agcgttaatg tctggcttct gataaagcgg gccatgttaa gggcggtttt ttcctgtttg 3360
gtcactgatg cctccgtgta agggggattt ctgttcatgg gggtaatgat accgatgaaa 3420
cgagagagga tgctcacgat acgggttact gatgatgaac atgcccggtt actggaacgt 3480
tgtgagggta aacaactggc ggtatggatg cggcgggacc agagaaaaat cactcagggt 3540
caatgccagc gcttcgttaa tacagatgta ggtgttccac agggtagcca gcagcatcct 3600
gcgatgcaga tccggaacat aatggtgcag ggcgctgact tccgcgtttc cagactttac 3660
gaaacacgga aaccgaagac cattcatgtt gttgctcagg tcgcagacgt tttgcagcag 3720
cagtcgcttc acgttcgctc gcgtatcggt gattcattct gctaaccagt aaggcaaccc 3780
cgccagccta gccgggtcct caacgacagg agcacgatca tgcgcacccg tggccaggac 3840
ccaacgctgc ccgagatgcg ccgcgtgcgg ctgctggaga tggcggacgc gatggatatg 3900
ttctgccaag ggttggtttg cgcattcaca gttctccgca agaattgatt ggctccaatt 3960
cttggagtgg tgaatccgtt agcgaggtgc cgccggcttc cattcaggtc gaggtggccc 4020
ggctccatgc accgcgacgc aacgcgggga ggcagacaag gtatagggcg gcgcctacaa 4080
tccatgccaa cccgttccat gtgctcgccg aggcggcata aatcgccgtg acgatcagcg 4140
gtccaatgat cgaagttagg ctggtaagag ccgcgagcga tccttgaagc tgtccctgat 4200
ggtcgtcatc tacctgcctg gacagcatgg cctgcaacgc gggcatcccg atgccgccgg 4260
aagcgagaag aatcataatg gggaaggcca tccagcctcg cgtcgcgaac gccagcaaga 4320
cgtagcccag cgcgtcggcc gccatgccgg cgataatggc ctgcttctcg ccgaaacgtt 4380
tggtggcggg accagtgacg aaggcttgag cgagggcgtg caagattccg aataccgcaa 4440
gcgacaggcc gatcatcgtc gcgctccagc gaaagcggtc ctcgccgaaa atgacccaga 4500
gcgctgccgg cacctgtcct acgagttgca tgataaagaa gacagtcata agtgcggcga 4560
cgatagtcat gccccgcgcc caccggaagg agctgactgg gttgaaggct ctcaagggca 4620
tcggtcgaga tcccggtgcc taatgagtga gctaacttac attaattgcg ttgcgctcac 4680
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 4740
cggggagagg cggtttgcgt attgggcgcc agggtggttt ttcttttcac cagtgagacg 4800
ggcaacagct gattgccctt caccgcctgg ccctgagaga gttgcagcaa gcggtccacg 4860
ctggtttgcc ccagcaggcg aaaatcctgt ttgatggtgg ttaacggcgg gatataacat 4920
gagctgtctt cggtatcgtc gtatcccact accgagatat ccgcaccaac gcgcagcccg 4980
gactcggtaa tggcgcgcat tgcgcccagc gccatctgat cgttggcaac cagcatcgca 5040
gtgggaacga tgccctcatt cagcatttgc atggtttgtt gaaaaccgga catggcactc 5100
cagtcgcctt cccgttccgc tatcggctga atttgattgc gagtgagata tttatgccag 5160
ccagccagac gcagacgcgc cgagacagaa cttaatgggc ccgctaacag cgcgatttgc 5220
tggtgaccca atgcgaccag atgctccacg cccagtcgcg taccgtcttc atgggagaaa 5280
ataatactgt tgatgggtgt ctggtcagag acatcaagaa ataacgccgg aacattagtg 5340
caggcagctt ccacagcaat ggcatcctgg tcatccagcg gatagttaat gatcagccca 5400
ctgacgcgtt gcgcgagaag attgtgcacc gccgctttac aggcttcgac gccgcttcgt 5460
tctaccatcg acaccaccac gctggcaccc agttgatcgg cgcgagattt aatcgccgcg 5520
acaatttgcg acggcgcgtg cagggccaga ctggaggtgg caacgccaat cagcaacgac 5580
tgtttgcccg ccagttgttg tgccacgcgg ttgggaatgt aattcagctc cgccatcgcc 5640
gcttccactt tttcccgcgt tttcgcagaa acgtggctgg cctggttcac cacgcgggaa 5700
acggtctgat aagagacacc ggcatactct gcgacatcgt ataacgttac tggtttcaca 5760
ttcaccaccc tgaattgact ctcttccggg cgctatcatg ccataccgcg aaaggttttg 5820
cgccattcga tggtgtccgg gatctcgacg ctctccctta tgcgactcct gcattaggaa 5880
gcagcccagt agtaggttga ggccgttgag caccgccgcc gcaaggaatg gtgcatgcaa 5940
ggagatggcg cccaacagtc ccccggccac ggggcctgcc accataccca cgccgaaaca 6000
agcgctcatg agcccgaagt ggcgagcccg atcttcccca tcggtgatgt cggcgatata 6060
ggcgccagca accgcacctg tggcgccggt gatgccggcc acgatgcgtc cggcgtagag 6120
gatcga 6126
Claims (17)
- - 숙주 세포 및/또는 숙주 세포의 세포내 구획의 세포내 pH를 증가시키는 제제의 유효량을 투여하는 단계, 및
- 항세균제의 유효량을 투여하는 단계를 포함하여, 세균 감염의 치료를 필요로 하는 대상체에서 세균 감염을 치료하는 방법. - 제1항에 있어서, 상기 세균 감염이 세포내 및/또는 지속성 세균 감염이고/이거나 스타필로코쿠스(Staphylococcus) 감염, 바람직하게는 에스. 아우레우스(S. aureus) 감염인 방법.
- 제1항 또는 제2항에 있어서, 상기 숙주 세포가 치료를 필요로 하는 대상체내 진핵 숙주 세포이고/이거나 여기서 상기 세포내 구획이 파고라이소좀(phagolysosome)인 방법.
- 제1항 내지 제3항 중의 어느 한 항에 있어서, 상기 세포내 pH 및/또는 세포내 구획의 pH를 증가시키는 제제가 바람직하게는 클로로퀸, 바필로마이신 A1, 염화암모늄의 그룹으로부터 선택된 알칼리화제, 바람직하게는 라이소좀향성 알칼리화제(lysosomotropic alkalizing agent)인 방법.
- 제1항 내지 제4항 중의 어느 한 항에 있어서, 상기 살세균제가 숙주 세포 및/또는 숙주 세포의 세포내 구획에 들어갈 수 있는 살세균제인 방법.
- 제1항 내지 제5항 중의 어느 한 항에 있어서, 상기 살세균제가 박테리오신 또는 이의 작용성 부분, 세균 오토라이신 또는 이의 작용성 부분, 박테리오파아지 라이신 또는 이의 작용성 부분, 항미생물성 펩타이드 및 항생제로 이루어진 그룹으로부터 선택되는 방법.
- 제1항 내지 제6항 중의 어느 한 항에 있어서, 상기 항생제가 베타-락탐 항생제, 예를 들면, 페니실린 유도체, 세팔로스포린, 모노박탐, 카르바페넴, 반코마이신, 답토마이신, 플루오로퀴놀론, 메트로니다졸, 니트로푸란토인, 코-트리목사졸, 텔리트로마이신, 아미노글리코시드성 항생제로 이루어진 그룹으로부터 선택되고, 보다 바람직하게는 플루클록사실린인 방법.
- 제1항 내지 제7항 중의 어느 한 항에 있어서, 상기 살세균제가 세포 벽 용해 효소로부터의 작용성 효소 도메인을 포함하고 바람직하게는 단백질 형질도입 도메인을 추가로 포함하는 방법.
- 제8항에 있어서, 상기 살세균제가 항미생물성 펩타이드를 추가로 포함하는 방법.
- 제8항 또는 제9항에 있어서, 상기 살세균제가 세포 벽 결합 도메인을 추가로 포함하는 방법.
- 제8항 내지 제10항 중의 어느 한 항에 있어서, 상기 단백질 형질도입 도메인이 서열번호 12 내지 25, 또는 이의 변이체로 이루어진 그룹으로부터 선택되는 방법.
- 제8항 내지 제11항 중의 어느 한 항에 있어서, 세포 벽 용해 효소로부터의 상기 작용성 효소 도메인이 서열번호 1 내지 7, 또는 이의 변이체로 이루어진 그룹으로부터 선택되는 방법.
- 제8항 내지 제12항 중의 어느 한 항에 있어서, 상기 세포 벽 결합 도메인이 서열번호 8 내지 11, 또는 이의 변이체로 이루어진 그룹으로부터 선택되는 방법.
- 제8항 내지 제13항 중의 어느 한 항에 있어서, 항미생물성 펩타이드가 서열번호 70 내지 90, 또는 이의 변이체로 이루어진 그룹으로부터 선택되는 방법.
- 서열번호 1 내지 7, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 세포 벽 용해 효소로부터의 작용성 효소 도메인 및 서열번호 12 내지 25, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 단백질 형질도입 도메인을 포함하고, 임의로 서열번호 8 내지 11, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 세포 벽 결합 도메인, 및/또는 서열번호 70 내지 90, 또는 이의 변이체로 이루어진 그룹으로부터 선택된 항미생물성 펩타이드를 포함하는, 키메릭 살세균 폴리펩타이드.
- 제15항에 있어서, 상기 살세균제가 서열번호 27 내지 47로 이루어진 그룹으로부터 선택된 서열과 적어도 40% 서열 동일성을 갖거나, 서열번호 50 내지 67로 이루어진 그룹으로부터 선택된 서열과 적어도 40% 서열 동일성을 갖는 폴리뉴클레오타이드 서열에 의해 암호화된 살세균제인 키메릭 살세균 폴리펩타이드.
- 제15항에 따른 키메릭 살세균 폴리펩타이드를 암호화하는 폴리뉴클레오타이드.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP15158880.3 | 2015-03-12 | ||
EP15158880 | 2015-03-12 | ||
PCT/EP2016/055076 WO2016142445A2 (en) | 2015-03-12 | 2016-03-10 | A method of treatment of a bacterial infection |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20170132201A true KR20170132201A (ko) | 2017-12-01 |
KR102683284B1 KR102683284B1 (ko) | 2024-07-10 |
Family
ID=52648929
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020177028879A KR102683284B1 (ko) | 2015-03-12 | 2016-03-10 | 세균 감염의 치료를 위한 살세균제와 라이소좀향성 알칼리화제와의 조합물 |
Country Status (11)
Country | Link |
---|---|
US (1) | US11690899B2 (ko) |
EP (1) | EP3268023B1 (ko) |
JP (1) | JP2018509415A (ko) |
KR (1) | KR102683284B1 (ko) |
CN (1) | CN107580503B (ko) |
AU (1) | AU2016231141B2 (ko) |
CA (1) | CA2979873A1 (ko) |
HK (1) | HK1249033A1 (ko) |
IL (1) | IL254457B (ko) |
SG (1) | SG11201707345VA (ko) |
WO (1) | WO2016142445A2 (ko) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11427812B2 (en) * | 2016-11-18 | 2022-08-30 | Lysando Ag | Antimicrobial agents against Staphylococcus aureus |
EP3692149B1 (en) | 2017-10-06 | 2023-01-04 | Micreos Human Health B.V. | Treatment of a condition associated with infection with an oncogenic bacterium |
EP3688011A4 (en) * | 2017-10-25 | 2021-11-24 | The Administrators Of The Tulane Educational Fund | PEPTIDE COMPOSITIONS AND METHOD FOR THEIR USE |
US11149068B2 (en) | 2018-01-05 | 2021-10-19 | The Administrator Of The Tulane Educational Fund | Pore-forming peptides and uses thereof |
JP7131803B2 (ja) * | 2018-05-15 | 2022-09-06 | 学校法人大阪医科薬科大学 | Gb3蓄積起因性疾患の予防又は治療剤 |
CN109350618B (zh) * | 2018-12-10 | 2021-09-28 | 山东农业大学 | 自噬调节剂在消除牛乳腺上皮细胞内金黄色葡萄球菌中的应用 |
GB201906653D0 (en) * | 2019-05-10 | 2019-06-26 | Cc Biotech Ltd | Polypeptides for treatment of bacterial infections |
IL296529A (en) | 2020-03-19 | 2022-11-01 | Micreos Human Health Bv | Preferably a stabilized protein |
WO2024020533A1 (en) * | 2022-07-22 | 2024-01-25 | Endolytix Technology, Inc. | Immunogenic compositions for treating or preventing actinomycetia infections |
WO2024133850A1 (en) | 2022-12-23 | 2024-06-27 | Micreos Pharmaceuticals Ag | A chimeric endolysin polypeptide |
CN118308337A (zh) * | 2023-01-09 | 2024-07-09 | 华中农业大学 | 一种细菌裂解酶LLysSA9.10及其应用 |
WO2024200687A1 (en) * | 2023-03-29 | 2024-10-03 | Universiteit Gent | Chimeric endolysins with activity against streptococci and staphylococci |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2001259205A1 (en) * | 2000-04-28 | 2001-11-12 | New Horizons Diagnostic Corporation | The use of bacterial phage associated lysing enzymes for treating various illnesses |
US20040023897A1 (en) * | 2001-11-13 | 2004-02-05 | Caplan Michael J. | Methods for preventing or treating disease mediated by toxin-secreting bacteria |
KR100759988B1 (ko) * | 2006-08-04 | 2007-09-19 | 주식회사 인트론바이오테크놀로지 | 황색포도상구균에 특이적인 항균 단백질 |
US20100028334A1 (en) * | 2006-12-15 | 2010-02-04 | Trustees Of Boston University | Compositions and methods to potentiate colistin activity |
US8383102B2 (en) * | 2009-05-21 | 2013-02-26 | The United States Of America As Represented By The Secretary Of Agriculture | Fusion of peptidoglycan hydrolase enzymes to a protein transduction domain allows eradication of both extracellular and intracellular gram positive pathogens |
EP2470648B1 (en) | 2009-08-24 | 2019-04-10 | Katholieke Universiteit Leuven, K.U. Leuven R&D | New endolysin obpgplys |
EP2338916A1 (en) * | 2009-12-23 | 2011-06-29 | Hyglos Invest GmbH | Chimeric polypeptides and their use in bacterial decoloniation |
EP2702070B1 (en) | 2011-04-27 | 2015-09-30 | Lysando AG | New antimicrobial agents |
US9382298B2 (en) * | 2011-05-04 | 2016-07-05 | Micreos Human Health B.V. | Polypeptide |
BR112014027629A2 (pt) * | 2012-05-07 | 2017-06-27 | Micreos Human Health Bv | misturas de polipeptídeo com atividade antibacteriana |
DK2849782T3 (da) * | 2012-05-09 | 2020-06-22 | Contrafect Corp | Bacteriophag lysin og antibiotiske sammensætninger imod gram-positive bakterier |
US9603909B2 (en) * | 2012-05-29 | 2017-03-28 | Intron Biotechnology, Inc. | Composition capable of improving stability of bacteriophage lysin proteins |
US9206411B2 (en) * | 2012-06-13 | 2015-12-08 | The United States Of America, As Represented By The Secretary Of Agriculture | Staphylococcal Phage2638A endolysin amidase domain is lytic for Staphylococcus aureus |
EP2679677A1 (en) * | 2012-06-29 | 2014-01-01 | Lysando Aktiengesellschaft | Composition for use in Mycobacteria therapy |
US20160097044A1 (en) * | 2014-10-01 | 2016-04-07 | The United States Of America, As Represented By The Secretary Of Agriculture | Antimicrobial Enzyme Fusions Reduce Resistance and Kill Intracellular Staphylococcus aureus |
-
2016
- 2016-03-10 US US15/557,296 patent/US11690899B2/en active Active
- 2016-03-10 CA CA2979873A patent/CA2979873A1/en active Pending
- 2016-03-10 WO PCT/EP2016/055076 patent/WO2016142445A2/en active Application Filing
- 2016-03-10 JP JP2017547980A patent/JP2018509415A/ja active Pending
- 2016-03-10 AU AU2016231141A patent/AU2016231141B2/en active Active
- 2016-03-10 SG SG11201707345VA patent/SG11201707345VA/en unknown
- 2016-03-10 EP EP16709389.7A patent/EP3268023B1/en active Active
- 2016-03-10 KR KR1020177028879A patent/KR102683284B1/ko active IP Right Grant
- 2016-03-10 CN CN201680026953.2A patent/CN107580503B/zh active Active
-
2017
- 2017-09-12 IL IL254457A patent/IL254457B/en active IP Right Grant
-
2018
- 2018-07-06 HK HK18108776.3A patent/HK1249033A1/zh unknown
Non-Patent Citations (4)
Title |
---|
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 45(11), pp.2977-2986(2001) * |
JOURNAL OF ANTIMICROBIAL CHEMOTHERAPY, 57(5), pp.883-890(2006) * |
JOURNAL OF INFLAMMATION RESEARCH, vol.8, pp.29-47(2015) * |
NATURE REVI EWS. MICROBIOLOGY, 12(3), pp.152-153(2014) * |
Also Published As
Publication number | Publication date |
---|---|
CN107580503A (zh) | 2018-01-12 |
IL254457B (en) | 2020-03-31 |
EP3268023B1 (en) | 2021-08-18 |
WO2016142445A3 (en) | 2016-11-10 |
IL254457A0 (en) | 2017-11-30 |
CN107580503B (zh) | 2021-07-09 |
US11690899B2 (en) | 2023-07-04 |
HK1249033A1 (zh) | 2018-10-26 |
SG11201707345VA (en) | 2017-10-30 |
WO2016142445A2 (en) | 2016-09-15 |
EP3268023A2 (en) | 2018-01-17 |
CA2979873A1 (en) | 2016-09-15 |
BR112017019413A2 (pt) | 2018-05-02 |
AU2016231141A1 (en) | 2017-09-28 |
JP2018509415A (ja) | 2018-04-05 |
AU2016231141B2 (en) | 2022-03-03 |
KR102683284B1 (ko) | 2024-07-10 |
US20180271952A1 (en) | 2018-09-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102683284B1 (ko) | 세균 감염의 치료를 위한 살세균제와 라이소좀향성 알칼리화제와의 조합물 | |
KR102451510B1 (ko) | Pd-1 호밍 엔도뉴클레아제 변이체, 조성물 및 사용 방법 | |
DK2768848T3 (en) | METHODS AND PROCEDURES FOR EXPRESSION AND SECRETARY OF PEPTIDES AND PROTEINS | |
KR102604096B1 (ko) | 윌슨병을 치료하기 위한 유전자 치료 | |
KR20170108946A (ko) | Fc 수용체-유사 5를 표적화하는 키메라 항원 수용체 및 그의 용도 | |
KR102654180B1 (ko) | 프롤린 및 알라닌 잔기가 풍부한 반복적인 아미노산 서열을 암호화하고 낮은 반복적인 뉴클레오티드 서열을 갖는 핵산 | |
KR20210149060A (ko) | Tn7-유사 트랜스포존을 사용한 rna-유도된 dna 통합 | |
CN108753824A (zh) | 用于治疗视网膜营养不良的病毒载体 | |
CN112912112A (zh) | 肝特异性核酸调节元件以及其方法及用途 | |
KR20210005146A (ko) | 유전자 편집된 t 세포에서의 인간 foxp3의 발현 | |
CN114990157B (zh) | 用于构建lmna基因突变的扩张型心肌病模型猪核移植供体细胞的基因编辑系统及其应用 | |
KR20230062873A (ko) | 염기 편집 효소 | |
CN116083398B (zh) | 分离的Cas13蛋白及其应用 | |
KR102409420B1 (ko) | 형질전환 생물체 선별용 마커 조성물, 형질전환 생물체 및 형질전환 방법 | |
KR20220142502A (ko) | 근육 특이적 핵산 조절 요소 및 이의 방법 및 용도 | |
CN112063669A (zh) | 酶法反应组合物、增加酶法反应中三磷酸腺苷(atp)量的方法及其应用 | |
US20030059870A1 (en) | Recombinant bacterial strains for the production of natural nucleosides and modified analogues thereof | |
CN107988259B (zh) | SmartBac杆状病毒表达系统及其应用 | |
RU2779747C2 (ru) | Химерные антигенные рецепторы, нацеленные на подобный fc-рецептору белок 5, и их применение | |
KR102712198B1 (ko) | 재조합 발생이 최소화된 유전자치료 벡터, 상기 벡터를 포함하는 재조합 레트로바이러스 및 상기 재조합 레트로바이러스를 포함하는 암의 예방 또는 치료용 약학 조성물 | |
CN116323924B (zh) | 基因疗法载体、包含载体的重组逆转录病毒和用于预防或治疗癌症的药物组合物 | |
RU2781083C2 (ru) | Варианты, композиции и методы применения хоминг-эндонуклеазы pd-1 | |
TW201030146A (en) | High-expression promoter and method for producing gene product using the same | |
KR20230123115A (ko) | Ace-2 변이체 및 이의 용도 | |
KR20240021866A (ko) | 효소적 핵산 합성을 위한 조성물 및 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AMND | Amendment | ||
E902 | Notification of reason for refusal | ||
AMND | Amendment | ||
E601 | Decision to refuse application | ||
X091 | Application refused [patent] | ||
AMND | Amendment | ||
X701 | Decision to grant (after re-examination) |