AU700080B2 - Streptococcal heat shock proteins members of the HSP70 family - Google Patents
- Publication number
- AU700080B2 (application AU56828/96A)
- Authority
- AU
- Australia
- Prior art keywords
- ala
- asp
- gly
- lys
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
- 101710178376 Heat shock 70 kDa protein Proteins 0.000 title claims description 48
- 101710163595 Chaperone protein DnaK Proteins 0.000 title claims description 47
- 101710152018 Heat shock cognate 70 kDa protein Proteins 0.000 title claims description 47
- 102000002812 Heat-Shock Proteins Human genes 0.000 title description 23
- 108010004889 Heat-Shock Proteins Proteins 0.000 title description 23
- 108010027814 HSP72 Heat-Shock Proteins Proteins 0.000 claims description 227
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 196
- 102100040352 Heat shock 70 kDa protein 1A Human genes 0.000 claims description 179
- 108090000623 proteins and genes Proteins 0.000 claims description 171
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 162
- 229920001184 polypeptide Polymers 0.000 claims description 131
- 102000004169 proteins and genes Human genes 0.000 claims description 131
- 239000012634 fragment Substances 0.000 claims description 126
- 238000000034 method Methods 0.000 claims description 92
- 150000001413 amino acids Chemical class 0.000 claims description 90
- 108020004414 DNA Proteins 0.000 claims description 66
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 64
- 239000013612 plasmid Substances 0.000 claims description 59
- 210000004027 cell Anatomy 0.000 claims description 51
- 239000002773 nucleotide Substances 0.000 claims description 46
- 125000003729 nucleotide group Chemical group 0.000 claims description 46
- 241000894006 Bacteria Species 0.000 claims description 43
- 241000193996 Streptococcus pyogenes Species 0.000 claims description 41
- 230000014509 gene expression Effects 0.000 claims description 40
- 241000193985 Streptococcus agalactiae Species 0.000 claims description 37
- 239000000203 mixture Substances 0.000 claims description 35
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 34
- 208000015181 infectious disease Diseases 0.000 claims description 29
- 229960005486 vaccine Drugs 0.000 claims description 23
- 238000001514 detection method Methods 0.000 claims description 22
- 241000194017 Streptococcus Species 0.000 claims description 21
- 241000193998 Streptococcus pneumoniae Species 0.000 claims description 21
- 239000012472 biological sample Substances 0.000 claims description 21
- 102000037865 fusion proteins Human genes 0.000 claims description 21
- 108020001507 fusion proteins Proteins 0.000 claims description 21
- 229940031000 streptococcus pneumoniae Drugs 0.000 claims description 21
- 239000013598 vector Substances 0.000 claims description 21
- 210000004408 hybridoma Anatomy 0.000 claims description 17
- 239000003298 DNA probe Substances 0.000 claims description 16
- 230000004927 fusion Effects 0.000 claims description 16
- 239000000523 sample Substances 0.000 claims description 16
- 238000002360 preparation method Methods 0.000 claims description 14
- 238000003752 polymerase chain reaction Methods 0.000 claims description 13
- 108020003215 DNA Probes Proteins 0.000 claims description 12
- 239000013604 expression vector Substances 0.000 claims description 10
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims description 10
- 238000006243 chemical reaction Methods 0.000 claims description 9
- 239000008194 pharmaceutical composition Substances 0.000 claims description 9
- 238000004519 manufacturing process Methods 0.000 claims description 8
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 claims description 6
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 5
- 241001529936 Murinae Species 0.000 claims description 5
- 230000001939 inductive effect Effects 0.000 claims description 5
- 241001478240 Coccus Species 0.000 claims description 4
- 206010035664 Pneumonia Diseases 0.000 claims description 4
- 239000002253 acid Substances 0.000 claims description 4
- 210000002966 serum Anatomy 0.000 claims description 4
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 claims description 3
- 150000007513 acids Chemical class 0.000 claims description 3
- 230000000295 complement effect Effects 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 230000008105 immune reaction Effects 0.000 claims description 3
- 102000053602 DNA Human genes 0.000 claims description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 claims description 2
- 201000000050 myeloid neoplasm Diseases 0.000 claims description 2
- 210000000628 antibody-producing cell Anatomy 0.000 claims 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims 2
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 156
- 235000018102 proteins Nutrition 0.000 description 128
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 87
- 235000001014 amino acid Nutrition 0.000 description 79
- 108091007433 antigens Proteins 0.000 description 74
- 102000036639 antigens Human genes 0.000 description 74
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 68
- 239000000427 antigen Substances 0.000 description 67
- 241000699670 Mus sp. Species 0.000 description 61
- 241000588724 Escherichia coli Species 0.000 description 48
- 230000003053 immunization Effects 0.000 description 37
- 238000002649 immunization Methods 0.000 description 36
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 29
- 230000009257 reactivity Effects 0.000 description 25
- 108700026244 Open Reading Frames Proteins 0.000 description 24
- 230000001580 bacterial effect Effects 0.000 description 22
- 238000003119 immunoblot Methods 0.000 description 22
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 22
- 239000002671 adjuvant Substances 0.000 description 21
- 239000000284 extract Substances 0.000 description 21
- 108010050848 glycylleucine Proteins 0.000 description 21
- 201000010099 disease Diseases 0.000 description 20
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 20
- 229930182817 methionine Natural products 0.000 description 19
- 238000001262 western blot Methods 0.000 description 19
- 239000007924 injection Substances 0.000 description 18
- 238000002347 injection Methods 0.000 description 18
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 17
- 238000004458 analytical method Methods 0.000 description 15
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 15
- 230000028993 immune response Effects 0.000 description 15
- 230000002163 immunogen Effects 0.000 description 15
- 230000035939 shock Effects 0.000 description 15
- 239000013592 cell lysate Substances 0.000 description 14
- 239000000047 product Substances 0.000 description 14
- 241001465754 Metazoa Species 0.000 description 13
- 241000283973 Oryctolagus cuniculus Species 0.000 description 13
- 210000004899 c-terminal region Anatomy 0.000 description 13
- 238000003786 synthesis reaction Methods 0.000 description 13
- 239000012528 membrane Substances 0.000 description 12
- 238000006467 substitution reaction Methods 0.000 description 12
- 241000282414 Homo sapiens Species 0.000 description 11
- 241000699666 Mus <mouse, genus> Species 0.000 description 11
- HWMGTNOVUDIKRE-UWVGGRQHSA-N Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 HWMGTNOVUDIKRE-UWVGGRQHSA-N 0.000 description 11
- 241000193990 Streptococcus sp. 'group B' Species 0.000 description 11
- 108010005233 alanylglutamic acid Proteins 0.000 description 11
- 125000000539 amino acid group Chemical group 0.000 description 11
- 230000000890 antigenic effect Effects 0.000 description 11
- 230000015572 biosynthetic process Effects 0.000 description 11
- 210000004369 blood Anatomy 0.000 description 11
- 239000008280 blood Substances 0.000 description 11
- 238000005119 centrifugation Methods 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 108010049041 glutamylalanine Proteins 0.000 description 11
- 230000001681 protective effect Effects 0.000 description 11
- 238000002965 ELISA Methods 0.000 description 10
- 101150031823 HSP70 gene Proteins 0.000 description 10
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 10
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 10
- 230000005875 antibody response Effects 0.000 description 10
- 230000004044 response Effects 0.000 description 10
- 241000894007 species Species 0.000 description 10
- 108010073969 valyllysine Proteins 0.000 description 10
- 241000282693 Cercopithecidae Species 0.000 description 9
- 108091026890 Coding region Proteins 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- 108010042283 HSP40 Heat-Shock Proteins Proteins 0.000 description 9
- 238000002105 Southern blotting Methods 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 108010037850 glycylvaline Proteins 0.000 description 9
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 9
- 239000006166 lysate Substances 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- 238000011282 treatment Methods 0.000 description 9
- HONKEGXLWUDTCF-YFKPBYRVSA-N (2s)-2-amino-2-methyl-4-phosphonobutanoic acid Chemical compound OC(=O)[C@](N)(C)CCP(O)(O)=O HONKEGXLWUDTCF-YFKPBYRVSA-N 0.000 description 8
- 101100125027 Dictyostelium discoideum mhsp70 gene Proteins 0.000 description 8
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 8
- 102000004447 HSP40 Heat-Shock Proteins Human genes 0.000 description 8
- 101000616438 Homo sapiens Microtubule-associated protein 4 Proteins 0.000 description 8
- 102100023174 Methionine aminopeptidase 2 Human genes 0.000 description 8
- 102100021794 Microtubule-associated protein 4 Human genes 0.000 description 8
- 108020004511 Recombinant DNA Proteins 0.000 description 8
- 108091081024 Start codon Proteins 0.000 description 8
- 210000003719 b-lymphocyte Anatomy 0.000 description 8
- 239000013611 chromosomal DNA Substances 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 238000012217 deletion Methods 0.000 description 8
- 230000037430 deletion Effects 0.000 description 8
- 101150052825 dnaK gene Proteins 0.000 description 8
- 239000000499 gel Substances 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- 238000003018 immunoassay Methods 0.000 description 8
- 230000001965 increasing effect Effects 0.000 description 8
- 238000002372 labelling Methods 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 150000007523 nucleic acids Chemical class 0.000 description 8
- 238000012163 sequencing technique Methods 0.000 description 8
- 208000035143 Bacterial infection Diseases 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 7
- 108090000790 Enzymes Proteins 0.000 description 7
- 101000979001 Homo sapiens Methionine aminopeptidase 2 Proteins 0.000 description 7
- 101000969087 Homo sapiens Microtubule-associated protein 2 Proteins 0.000 description 7
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 7
- 241000880493 Leptailurus serval Species 0.000 description 7
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 7
- 239000011543 agarose gel Substances 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 208000022362 bacterial infectious disease Diseases 0.000 description 7
- 229940088598 enzyme Drugs 0.000 description 7
- 108010081551 glycylphenylalanine Proteins 0.000 description 7
- 230000005847 immunogenicity Effects 0.000 description 7
- 230000006698 induction Effects 0.000 description 7
- 108091008146 restriction endonucleases Proteins 0.000 description 7
- 239000006228 supernatant Substances 0.000 description 7
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 6
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 6
- 102000006303 Chaperonin 60 Human genes 0.000 description 6
- 108010058432 Chaperonin 60 Proteins 0.000 description 6
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 6
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 6
- 108060003951 Immunoglobulin Proteins 0.000 description 6
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 6
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 6
- 108091034117 Oligonucleotide Proteins 0.000 description 6
- 241001505901 Streptococcus sp. 'group A' Species 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 238000011161 development Methods 0.000 description 6
- 238000010790 dilution Methods 0.000 description 6
- 239000012895 dilution Substances 0.000 description 6
- 238000001962 electrophoresis Methods 0.000 description 6
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 6
- 230000001900 immune effect Effects 0.000 description 6
- 102000018358 immunoglobulin Human genes 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 239000002953 phosphate buffered saline Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 239000012723 sample buffer Substances 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- DVUFTQLHHHJEMK-IMJSIDKUSA-N Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O DVUFTQLHHHJEMK-IMJSIDKUSA-N 0.000 description 5
- HSPSXROIMXIJQW-BQBZGAKWSA-N Asp-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 HSPSXROIMXIJQW-BQBZGAKWSA-N 0.000 description 5
- 238000001712 DNA sequencing Methods 0.000 description 5
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 5
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 5
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 5
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 5
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 5
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 5
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 5
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 5
- TWVKGYNQQAUNRN-ACZMJKKPSA-N Ile-Ser Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O TWVKGYNQQAUNRN-ACZMJKKPSA-N 0.000 description 5
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 5
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 5
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 239000000020 Nitrocellulose Substances 0.000 description 5
- 238000011579 SCID mouse model Methods 0.000 description 5
- 101710137500 T7 RNA polymerase Proteins 0.000 description 5
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 5
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 5
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 5
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 5
- 230000000692 anti-sense effect Effects 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 238000009835 boiling Methods 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 239000007330 chocolate agar Substances 0.000 description 5
- 230000001086 cytosolic effect Effects 0.000 description 5
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 108010005942 methionylglycine Proteins 0.000 description 5
- 229920001220 nitrocellulos Polymers 0.000 description 5
- 244000052769 pathogen Species 0.000 description 5
- 239000008188 pellet Substances 0.000 description 5
- 229920002401 polyacrylamide Polymers 0.000 description 5
- 230000002265 prevention Effects 0.000 description 5
- 238000001243 protein synthesis Methods 0.000 description 5
- 238000003156 radioimmunoprecipitation Methods 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- 238000000527 sonication Methods 0.000 description 5
- 108010005652 splenotritin Proteins 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- RVLOMLVNNBWRSR-KNIFDHDWSA-N (2s)-2-aminopropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound C[C@H](N)C(O)=O.NCCCC[C@H](N)C(O)=O RVLOMLVNNBWRSR-KNIFDHDWSA-N 0.000 description 4
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 4
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 4
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 4
- QYLJIYOGHRGUIH-CIUDSAMLSA-N Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N QYLJIYOGHRGUIH-CIUDSAMLSA-N 0.000 description 4
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 4
- 241000193830 Bacillus <bacterium> Species 0.000 description 4
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- 238000011537 Coomassie blue staining Methods 0.000 description 4
- 241000192125 Firmicutes Species 0.000 description 4
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 4
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 4
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 4
- 101000969594 Homo sapiens Modulator of apoptosis 1 Proteins 0.000 description 4
- UWBDLNOCIDGPQE-GUBZILKMSA-N Ile-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN UWBDLNOCIDGPQE-GUBZILKMSA-N 0.000 description 4
- DEFJQIDDEAULHB-IMJSIDKUSA-N L-alanyl-L-alanine Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(O)=O DEFJQIDDEAULHB-IMJSIDKUSA-N 0.000 description 4
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 4
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 4
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 4
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 4
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 4
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 4
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 4
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 4
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 4
- 102000016943 Muramidase Human genes 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 4
- 101100131116 Oryza sativa subsp. japonica MPK3 gene Proteins 0.000 description 4
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 4
- 101100456045 Schizosaccharomyces pombe (strain 972 / ATCC 24843) map3 gene Proteins 0.000 description 4
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 4
- 241000680505 Streptococcus pneumoniae WU2 Species 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 4
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 4
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 4
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 238000007792 addition Methods 0.000 description 4
- 108010056243 alanylalanine Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- ZVDPYSVOZFINEE-BQBZGAKWSA-N alpha-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O ZVDPYSVOZFINEE-BQBZGAKWSA-N 0.000 description 4
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 4
- 238000010171 animal model Methods 0.000 description 4
- 239000003242 anti bacterial agent Substances 0.000 description 4
- 229940088710 antibiotic agent Drugs 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 229940098773 bovine serum albumin Drugs 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000000855 fermentation Methods 0.000 description 4
- 230000004151 fermentation Effects 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 4
- 210000004201 immune sera Anatomy 0.000 description 4
- 229940042743 immune sera Drugs 0.000 description 4
- 210000003000 inclusion body Anatomy 0.000 description 4
- 239000012139 lysis buffer Substances 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 235000010335 lysozyme Nutrition 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 239000012071 phase Substances 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 4
- 230000031070 response to heat Effects 0.000 description 4
- 230000004083 survival effect Effects 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 3
- ZSOICJZJSRWNHX-ACZMJKKPSA-N Ala-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@H](C)[NH3+] ZSOICJZJSRWNHX-ACZMJKKPSA-N 0.000 description 3
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 3
- GHBSKQGCIYSCNS-NAKRPEOUSA-N Ala-Leu-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GHBSKQGCIYSCNS-NAKRPEOUSA-N 0.000 description 3
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 3
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 3
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 3
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 3
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 3
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 3
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 3
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 3
- ALKWEXBKAHPJAQ-NAKRPEOUSA-N Asn-Leu-Asp-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ALKWEXBKAHPJAQ-NAKRPEOUSA-N 0.000 description 3
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 3
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 3
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 3
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 3
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 3
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 3
- NTQDELBZOMWXRS-IWGUZYHVSA-N Asp-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O NTQDELBZOMWXRS-IWGUZYHVSA-N 0.000 description 3
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 3
- 241000606153 Chlamydia trachomatis Species 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000194032 Enterococcus faecalis Species 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 3
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 3
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 3
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 3
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 3
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 3
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 3
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 3
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 3
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 3
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 3
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 3
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 3
- 206010021531 Impetigo Diseases 0.000 description 3
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 3
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 3
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 3
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 3
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 3
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 3
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 3
- FMIIKPHLJKUXGE-GUBZILKMSA-N Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN FMIIKPHLJKUXGE-GUBZILKMSA-N 0.000 description 3
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 3
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 3
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 3
- 241000282567 Macaca fascicularis Species 0.000 description 3
- 201000009906 Meningitis Diseases 0.000 description 3
- CRVSHEPROQHVQT-AVGNSLFASA-N Met-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N CRVSHEPROQHVQT-AVGNSLFASA-N 0.000 description 3
- 102100021440 Modulator of apoptosis 1 Human genes 0.000 description 3
- 108010065395 Neuropep-1 Proteins 0.000 description 3
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 3
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 3
- GLEOIKLQBZNKJZ-WDSKDSINSA-N Pro-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GLEOIKLQBZNKJZ-WDSKDSINSA-N 0.000 description 3
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 3
- GVUVRRPYYDHHGK-VQVTYTSYSA-N Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GVUVRRPYYDHHGK-VQVTYTSYSA-N 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- SBMNPABNWKXNBJ-BQBZGAKWSA-N Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO SBMNPABNWKXNBJ-BQBZGAKWSA-N 0.000 description 3
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 3
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 3
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 3
- 241000702208 Shigella phage SfX Species 0.000 description 3
- 244000057717 Streptococcus lactis Species 0.000 description 3
- 235000014897 Streptococcus lactis Nutrition 0.000 description 3
- 241000194023 Streptococcus sanguinis Species 0.000 description 3
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 3
- IOWJRKAVLALBQB-IWGUZYHVSA-N Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O IOWJRKAVLALBQB-IWGUZYHVSA-N 0.000 description 3
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 3
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 3
- YKRQRPFODDJQTC-CSMHCCOUSA-N Thr-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN YKRQRPFODDJQTC-CSMHCCOUSA-N 0.000 description 3
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 3
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 3
- HSRXSKHRSXRCFC-WDSKDSINSA-N Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(O)=O HSRXSKHRSXRCFC-WDSKDSINSA-N 0.000 description 3
- OBTCMSPFOITUIJ-FSPLSTOPSA-N Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O OBTCMSPFOITUIJ-FSPLSTOPSA-N 0.000 description 3
- XCTHZFGSVQBHBW-IUCAKERBSA-N Val-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C XCTHZFGSVQBHBW-IUCAKERBSA-N 0.000 description 3
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 3
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 3
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 238000013019 agitation Methods 0.000 description 3
- 108010070783 alanyltyrosine Proteins 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 239000011324 bead Substances 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 210000002421 cell wall Anatomy 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- 238000002405 diagnostic procedure Methods 0.000 description 3
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 3
- 238000002635 electroconvulsive therapy Methods 0.000 description 3
- 229940032049 enterococcus faecalis Drugs 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- 150000004676 glycans Chemical class 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 3
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 3
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 231100000518 lethal Toxicity 0.000 description 3
- 230000001665 lethal effect Effects 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 230000000813 microbial effect Effects 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 230000001717 pathogenic effect Effects 0.000 description 3
- 230000007918 pathogenicity Effects 0.000 description 3
- 238000010647 peptide synthesis reaction Methods 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 229920001282 polysaccharide Polymers 0.000 description 3
- 239000005017 polysaccharide Substances 0.000 description 3
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 3
- 230000003938 response to stress Effects 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 239000007790 solid phase Substances 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 231100000419 toxicity Toxicity 0.000 description 3
- 230000001988 toxicity Effects 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 108010036320 valylleucine Proteins 0.000 description 3
- 239000007762 w/o emulsion Substances 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 108010052418 (N-(2-((4-((2-((4-(9-acridinylamino)phenyl)amino)-2-oxoethyl)amino)-4-oxobutyl)amino)-1-(1H-imidazol-4-ylmethyl)-1-oxoethyl)-6-(((-2-aminoethyl)amino)methyl)-2-pyridinecarboxamidato) iron(1+) Proteins 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 2
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 2
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- ALZVPLKYDKJKQU-XVKPBYJWSA-N Ala-Tyr Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ALZVPLKYDKJKQU-XVKPBYJWSA-N 0.000 description 2
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 2
- LIWMQSWFLXEGMA-WDSKDSINSA-N Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)N LIWMQSWFLXEGMA-WDSKDSINSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- 101000748781 Anthoceros angustus Uncharacterized 3.0 kDa protein in psbT-psbN intergenic region Proteins 0.000 description 2
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 2
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 2
- SIFXMYAHXJGAFC-WDSKDSINSA-N Arg-Asp Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O SIFXMYAHXJGAFC-WDSKDSINSA-N 0.000 description 2
- OSASDIVHOSJVII-WDSKDSINSA-N Arg-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N OSASDIVHOSJVII-WDSKDSINSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 2
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 2
- ZRNWJUAQKFUUKV-SRVKXCTJSA-N Arg-Met-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZRNWJUAQKFUUKV-SRVKXCTJSA-N 0.000 description 2
- PQBHGSGQZSOLIR-RYUDHWBXSA-N Arg-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PQBHGSGQZSOLIR-RYUDHWBXSA-N 0.000 description 2
- SJUXYGVRSGTPMC-IMJSIDKUSA-N Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O SJUXYGVRSGTPMC-IMJSIDKUSA-N 0.000 description 2
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 2
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- QJMCHPGWFZZRID-BQBZGAKWSA-N Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O QJMCHPGWFZZRID-BQBZGAKWSA-N 0.000 description 2
- GADKFYNESXNRLC-WDSKDSINSA-N Asn-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GADKFYNESXNRLC-WDSKDSINSA-N 0.000 description 2
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- VBKIFHUVGLOJKT-FKZODXBYSA-N Asn-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)N)O VBKIFHUVGLOJKT-FKZODXBYSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- PSZNHSNIGMJYOZ-WDSKDSINSA-N Asp-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PSZNHSNIGMJYOZ-WDSKDSINSA-N 0.000 description 2
- VGRHZPNRCLAHQA-IMJSIDKUSA-N Asp-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O VGRHZPNRCLAHQA-IMJSIDKUSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 2
- DYDKXJWQCIVTMR-WDSKDSINSA-N Asp-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O DYDKXJWQCIVTMR-WDSKDSINSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- CPMKYMGGYUFOHS-FSPLSTOPSA-N Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O CPMKYMGGYUFOHS-FSPLSTOPSA-N 0.000 description 2
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 241000589969 Borreliella burgdorferi Species 0.000 description 2
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 101000792449 Cyanophora paradoxa Uncharacterized 3.4 kDa protein in atpE-petA intergenic region Proteins 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 239000003155 DNA primer Substances 0.000 description 2
- 230000007023 DNA restriction-modification system Effects 0.000 description 2
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 2
- 241000194033 Enterococcus Species 0.000 description 2
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 2
- 101000686777 Escherichia phage T7 T7 RNA polymerase Proteins 0.000 description 2
- 241000701959 Escherichia virus Lambda Species 0.000 description 2
- 241001678517 Ginaia Species 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 2
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 description 2
- XMBSYZWANAQXEV-QWRGUYRKSA-N Glu-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-QWRGUYRKSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- KGVHCTWYMPWEGN-FSPLSTOPSA-N Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CN KGVHCTWYMPWEGN-FSPLSTOPSA-N 0.000 description 2
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 2
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 2
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- WKXVAXOSIPTXEC-HAFWLYHUSA-N Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O WKXVAXOSIPTXEC-HAFWLYHUSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- KTGFOCFYOZQVRJ-ZKWXMUAHSA-N Ile-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O KTGFOCFYOZQVRJ-ZKWXMUAHSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- JWBXCSQZLLIOCI-GUBZILKMSA-N Ile-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C JWBXCSQZLLIOCI-GUBZILKMSA-N 0.000 description 2
- BBIXOODYWPFNDT-CIUDSAMLSA-N Ile-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O BBIXOODYWPFNDT-CIUDSAMLSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- HSQGMTRYSIHDAC-BQBZGAKWSA-N Leu-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(O)=O HSQGMTRYSIHDAC-BQBZGAKWSA-N 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 2
- JPNRPAJITHRXRH-BQBZGAKWSA-N Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O JPNRPAJITHRXRH-BQBZGAKWSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 2
- QCZYYEFXOBKCNQ-STQMWFEESA-N Lys-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCZYYEFXOBKCNQ-STQMWFEESA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 2
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 2
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- 101000626970 Marchantia polymorpha Uncharacterized 3.3 kDa protein in psbT-psbN intergenic region Proteins 0.000 description 2
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 2
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 2
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 2
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 2
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 description 2
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 2
- 241000186362 Mycobacterium leprae Species 0.000 description 2
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- BXNGIHFNNNSEOS-UWVGGRQHSA-N Phe-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 BXNGIHFNNNSEOS-UWVGGRQHSA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 2
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 2
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 2
- 208000035109 Pneumococcal Infections Diseases 0.000 description 2
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- JQOHKCDMINQZRV-WDSKDSINSA-N Pro-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 JQOHKCDMINQZRV-WDSKDSINSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 2
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 2
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- 206010040047 Sepsis Diseases 0.000 description 2
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 description 2
- VBKBDLMWICBSCY-IMJSIDKUSA-N Ser-Asp Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O VBKBDLMWICBSCY-IMJSIDKUSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 2
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 2
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 241000256251 Spodoptera frugiperda Species 0.000 description 2
- 241000191967 Staphylococcus aureus Species 0.000 description 2
- 241000194019 Streptococcus mutans Species 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 206010043376 Tetanus Diseases 0.000 description 2
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 2
- 101000764204 Trieres chinensis Uncharacterized 3.3 kDa protein in rpl11-trnW intergenic region Proteins 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 2
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 2
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- -1 amino, carboxyl Chemical group 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000034994 death Effects 0.000 description 2
- 231100000517 death Toxicity 0.000 description 2
- 229940009976 deoxycholate Drugs 0.000 description 2
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000000032 diagnostic agent Substances 0.000 description 2
- 229940039227 diagnostic agent Drugs 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 239000012153 distilled water Substances 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 230000000521 hyperimmunizing effect Effects 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 238000001114 immunoprecipitation Methods 0.000 description 2
- 238000009169 immunotherapy Methods 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010071185 leucyl-alanine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000029226 lipidation Effects 0.000 description 2
- 150000002632 lipids Chemical group 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 230000007774 long-term Effects 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- KXKVLQRXCPHEJC-UHFFFAOYSA-N methyl acetate Chemical compound COC(C)=O KXKVLQRXCPHEJC-UHFFFAOYSA-N 0.000 description 2
- 238000005497 microtitration Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- YHHSONZFOIEMCP-UHFFFAOYSA-O phosphocholine Chemical compound C[N+](C)(C)CCOP(O)(O)=O YHHSONZFOIEMCP-UHFFFAOYSA-O 0.000 description 2
- 229940124733 pneumococcal vaccine Drugs 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 239000013615 primer Substances 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 229940070741 purified protein derivative of tuberculin Drugs 0.000 description 2
- 230000002285 radioactive effect Effects 0.000 description 2
- 238000010814 radioimmunoprecipitation assay Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 206010040872 skin infection Diseases 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 230000035882 stress Effects 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 230000002865 vaccinogenic effect Effects 0.000 description 2
- 230000001018 virulence Effects 0.000 description 2
- 239000000304 virulence factor Substances 0.000 description 2
- 230000007923 virulence factor Effects 0.000 description 2
- 239000012130 whole-cell lysate Substances 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β-Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- KZFMOINJHMONLW-FOCLMDBBSA-N (2e)-4,7-dichloro-2-(4,7-dichloro-3-oxo-1-benzothiophen-2-ylidene)-1-benzothiophen-3-one Chemical compound S\1C(C(=CC=C2Cl)Cl)=C2C(=O)C/1=C1/C(=O)C(C(Cl)=CC=C2Cl)=C2S1 KZFMOINJHMONLW-FOCLMDBBSA-N 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- MHKBMNACOMRIAW-UHFFFAOYSA-N 2,3-dinitrophenol Chemical group OC1=CC=CC([N+]([O-])=O)=C1[N+]([O-])=O MHKBMNACOMRIAW-UHFFFAOYSA-N 0.000 description 1
- 239000001763 2-hydroxyethyl(trimethyl)azanium Substances 0.000 description 1
- JRBJSXQPQWSCCF-UHFFFAOYSA-N 3,3'-Dimethoxybenzidine Chemical compound C1=C(N)C(OC)=CC(C=2C=C(OC)C(N)=CC=2)=C1 JRBJSXQPQWSCCF-UHFFFAOYSA-N 0.000 description 1
- BRMWTNUJHUMWMS-UHFFFAOYSA-N 3-Methylhistidine Natural products CN1C=NC(CC(N)C(O)=O)=C1 BRMWTNUJHUMWMS-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- 101710092702 47 kDa protein Proteins 0.000 description 1
- HLXHCNWEVQNNKA-UHFFFAOYSA-N 5-methoxy-2,3-dihydro-1h-inden-2-amine Chemical group COC1=CC=C2CC(N)CC2=C1 HLXHCNWEVQNNKA-UHFFFAOYSA-N 0.000 description 1
- 101710191936 70 kDa protein Proteins 0.000 description 1
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 241001502050 Acis Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- SITWEMZOJNKJCH-WDSKDSINSA-N Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SITWEMZOJNKJCH-WDSKDSINSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- HCZXHQADHZIEJD-CIUDSAMLSA-N Ala-Leu-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HCZXHQADHZIEJD-CIUDSAMLSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- UJJUHXAJSRHWFZ-DCAQKATOSA-N Ala-Leu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O UJJUHXAJSRHWFZ-DCAQKATOSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- FSHURBQASBLAPO-WDSKDSINSA-N Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)N FSHURBQASBLAPO-WDSKDSINSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 206010060937 Amniotic cavity infection Diseases 0.000 description 1
- 108010039627 Aprotinin Proteins 0.000 description 1
- WVRUNFYJIHNFKD-WDSKDSINSA-N Arg-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N WVRUNFYJIHNFKD-WDSKDSINSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- JSLGXODUIAFWCF-WDSKDSINSA-N Arg-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O JSLGXODUIAFWCF-WDSKDSINSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- WYBVBIHNJWOLCJ-IUCAKERBSA-N Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N WYBVBIHNJWOLCJ-IUCAKERBSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- ROWCTNFEMKOIFQ-YUMQZZPRSA-N Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N ROWCTNFEMKOIFQ-YUMQZZPRSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- LQJAALCCPOTJGB-YUMQZZPRSA-N Arg-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O LQJAALCCPOTJGB-YUMQZZPRSA-N 0.000 description 1
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- XNSKSTRGQIPTSE-ACZMJKKPSA-N Arg-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XNSKSTRGQIPTSE-ACZMJKKPSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- XTWSWDJMIKUJDQ-RYUDHWBXSA-N Arg-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XTWSWDJMIKUJDQ-RYUDHWBXSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 1
- RGGVDKVXLBOLNS-JQWIXIFHSA-N Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)N)C(O)=O)=CNC2=C1 RGGVDKVXLBOLNS-JQWIXIFHSA-N 0.000 description 1
- FYRVDDJMNISIKJ-UWVGGRQHSA-N Asn-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FYRVDDJMNISIKJ-UWVGGRQHSA-N 0.000 description 1
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- CNKAZIGBGQIHLL-GUBZILKMSA-N Asp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N CNKAZIGBGQIHLL-GUBZILKMSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- BSWHERGFUNMWGS-UHFFFAOYSA-N Asp-Ile Chemical compound CCC(C)C(C(O)=O)NC(=O)C(N)CC(O)=O BSWHERGFUNMWGS-UHFFFAOYSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 1
- LIQNMKIBMPEOOP-IHRRRGAJSA-N Asp-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)O)N LIQNMKIBMPEOOP-IHRRRGAJSA-N 0.000 description 1
- UKGGPJNBONZZCM-WDSKDSINSA-N Asp-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241000969130 Atthis Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 208000031729 Bacteremia Diseases 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 201000004569 Blindness Diseases 0.000 description 1
- 241000701822 Bovine papillomavirus Species 0.000 description 1
- 241000589568 Brucella ovis Species 0.000 description 1
- 101100129088 Caenorhabditis elegans lys-2 gene Proteins 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 101100533230 Caenorhabditis elegans ser-2 gene Proteins 0.000 description 1
- 101100507655 Canis lupus familiaris HSPA1 gene Proteins 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 102100035023 Carboxypeptidase B2 Human genes 0.000 description 1
- 108090000201 Carboxypeptidase B2 Proteins 0.000 description 1
- 206010007882 Cellulitis Diseases 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 201000005019 Chlamydia pneumonia Diseases 0.000 description 1
- 241000282552 Chlorocebus aethiops Species 0.000 description 1
- 108010049048 Cholera Toxin Proteins 0.000 description 1
- 102000009016 Cholera Toxin Human genes 0.000 description 1
- 235000019743 Choline chloride Nutrition 0.000 description 1
- 206010008748 Chorea Diseases 0.000 description 1
- 208000008158 Chorioamnionitis Diseases 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- HAYVTMHUNMMXCV-IMJSIDKUSA-N Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CS HAYVTMHUNMMXCV-IMJSIDKUSA-N 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- QNNYDGBKNFDYOD-UBHSHLNASA-N Cys-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N QNNYDGBKNFDYOD-UBHSHLNASA-N 0.000 description 1
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 101100016370 Danio rerio hsp90a.1 gene Proteins 0.000 description 1
- 101100481408 Danio rerio tie2 gene Proteins 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 238000009007 Diagnostic Kit Methods 0.000 description 1
- 101100285708 Dictyostelium discoideum hspD gene Proteins 0.000 description 1
- 239000006145 Eagle's minimal essential medium Substances 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 208000004145 Endometritis Diseases 0.000 description 1
- 241000701867 Enterobacteria phage T7 Species 0.000 description 1
- 201000000297 Erysipelas Diseases 0.000 description 1
- 101100340330 Escherichia coli (strain K12) idlP gene Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241000193789 Gemella Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- 206010018364 Glomerulonephritis Diseases 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- TUTIHHSZKFBMHM-WHFBIAKZSA-N Glu-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O TUTIHHSZKFBMHM-WHFBIAKZSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 1
- VFZIDQZAEBORGY-GLLZPBPUSA-N Glu-Gln-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VFZIDQZAEBORGY-GLLZPBPUSA-N 0.000 description 1
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- SXGAGTVDWKQYCX-BQBZGAKWSA-N Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SXGAGTVDWKQYCX-BQBZGAKWSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 1
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- RUDRIZRGOLQSMX-IUCAKERBSA-N Gly-Met-Met Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O RUDRIZRGOLQSMX-IUCAKERBSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 101000636168 Grapevine leafroll-associated virus 3 (isolate United States/NY1) Movement protein p5 Proteins 0.000 description 1
- 108010027992 HSP70 Heat-Shock Proteins Proteins 0.000 description 1
- 102000018932 HSP70 Heat-Shock Proteins Human genes 0.000 description 1
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 1
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 1
- MDCTVRUPVLZSPG-BQBZGAKWSA-N His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 MDCTVRUPVLZSPG-BQBZGAKWSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- IDXZDKMBEXLFMB-HGNGGELXSA-N His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 IDXZDKMBEXLFMB-HGNGGELXSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 1
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 1
- XMAUFHMAAVTODF-STQMWFEESA-N His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XMAUFHMAAVTODF-STQMWFEESA-N 0.000 description 1
- BFOGZWSSGMLYKV-DCAQKATOSA-N His-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N BFOGZWSSGMLYKV-DCAQKATOSA-N 0.000 description 1
- LCWXJXMHJVIJFK-UHFFFAOYSA-N Hydroxylysine Natural products NCC(O)CC(N)CC(O)=O LCWXJXMHJVIJFK-UHFFFAOYSA-N 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- RCFDOSNHHZGBOY-ACZMJKKPSA-N Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(O)=O RCFDOSNHHZGBOY-ACZMJKKPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- HZYHBDVRCBDJJV-HAFWLYHUSA-N Ile-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O HZYHBDVRCBDJJV-HAFWLYHUSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- BCVIOZZGJNOEQS-XKNYDFJKSA-N Ile-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)[C@@H](C)CC BCVIOZZGJNOEQS-XKNYDFJKSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 108700005091 Immunoglobulin Genes Proteins 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 206010022971 Iron Deficiencies Diseases 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 101000839464 Leishmania braziliensis Heat shock 70 kDa protein Proteins 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- ZTUWCZQOKOJGEX-DCAQKATOSA-N Leu-Ala-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O ZTUWCZQOKOJGEX-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- LCPYQJIKPJDLLB-UWVGGRQHSA-N Leu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C LCPYQJIKPJDLLB-UWVGGRQHSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- LHSGPCFBGJHPCY-STQMWFEESA-N Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-STQMWFEESA-N 0.000 description 1
- MDSUKZSLOATHMH-IUCAKERBSA-N Leu-Val Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C([O-])=O MDSUKZSLOATHMH-IUCAKERBSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- 241000186779 Listeria monocytogenes Species 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- XBZOQGHZGQLEQO-IUCAKERBSA-N Lys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN XBZOQGHZGQLEQO-IUCAKERBSA-N 0.000 description 1
- SPNKGZFASINBMR-IHRRRGAJSA-N Lys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N SPNKGZFASINBMR-IHRRRGAJSA-N 0.000 description 1
- ZZHPLPSLBVBWOA-WDSOQIARSA-N Lys-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N ZZHPLPSLBVBWOA-WDSOQIARSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- PSVAVKGDUAKZKU-BZSNNMDCSA-N Lys-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N)O PSVAVKGDUAKZKU-BZSNNMDCSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- JHKXZYLNVJRAAJ-WDSKDSINSA-N Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(O)=O JHKXZYLNVJRAAJ-WDSKDSINSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- JMEWFDUAFKVAAT-WDSKDSINSA-N Met-Asn Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O JMEWFDUAFKVAAT-WDSKDSINSA-N 0.000 description 1
- QTZXSYBVOSXBEJ-WDSKDSINSA-N Met-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O QTZXSYBVOSXBEJ-WDSKDSINSA-N 0.000 description 1
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- NHDMNXBBSGVYGP-PYJNHQTQSA-N Met-His-Ile Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)CC1=CN=CN1 NHDMNXBBSGVYGP-PYJNHQTQSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- ZYTPOUNUXRBYGW-YUMQZZPRSA-N Met-Met Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CCSC ZYTPOUNUXRBYGW-YUMQZZPRSA-N 0.000 description 1
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- 102100028379 Methionine aminopeptidase 1 Human genes 0.000 description 1
- 101710161855 Methionine aminopeptidase 1 Proteins 0.000 description 1
- 108090000192 Methionyl aminopeptidases Proteins 0.000 description 1
- 101100109158 Mus musculus Asprv1 gene Proteins 0.000 description 1
- 101100481410 Mus musculus Tek gene Proteins 0.000 description 1
- JDHILDINMRGULE-LURJTMIESA-N N(pros)-methyl-L-histidine Chemical compound CN1C=NC=C1C[C@H](N)C(O)=O JDHILDINMRGULE-LURJTMIESA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- MDSUKZSLOATHMH-UHFFFAOYSA-N N-L-leucyl-L-valine Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(O)=O MDSUKZSLOATHMH-UHFFFAOYSA-N 0.000 description 1
- BACYUWVYYTXETD-UHFFFAOYSA-N N-Lauroylsarcosine Chemical compound CCCCCCCCCCCC(=O)N(C)CC(O)=O BACYUWVYYTXETD-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 206010060860 Neurological symptom Diseases 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 206010033078 Otitis media Diseases 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 101150012394 PHO5 gene Proteins 0.000 description 1
- 208000030852 Parasitic disease Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 201000007100 Pharyngitis Diseases 0.000 description 1
- MIDZLCFIAINOQN-WPRPVWTQSA-N Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 MIDZLCFIAINOQN-WPRPVWTQSA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- JXWLMUIXUXLIJR-QWRGUYRKSA-N Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JXWLMUIXUXLIJR-QWRGUYRKSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- QRUOLOPKCOEZKU-HJWJTTGWSA-N Phe-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N QRUOLOPKCOEZKU-HJWJTTGWSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- IEHDJWSAXBGJIP-RYUDHWBXSA-N Phe-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 IEHDJWSAXBGJIP-RYUDHWBXSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 101100271190 Plasmodium falciparum (isolate 3D7) ATAT gene Proteins 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- CKXMGSJPDQXBPG-JYJNAYRXSA-N Pro-Cys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O CKXMGSJPDQXBPG-JYJNAYRXSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 108091006629 SLC13A2 Proteins 0.000 description 1
- 206010039587 Scarlet Fever Diseases 0.000 description 1
- 101100071627 Schizosaccharomyces pombe (strain 972 / ATCC 24843) swo1 gene Proteins 0.000 description 1
- 101100457843 Schizosaccharomyces pombe (strain 972 / ATCC 24843) tit1 gene Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 1
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- PBUXMVYWOSKHMF-WDSKDSINSA-N Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO PBUXMVYWOSKHMF-WDSKDSINSA-N 0.000 description 1
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 206010062255 Soft tissue infection Diseases 0.000 description 1
- 206010061372 Streptococcal infection Diseases 0.000 description 1
- 208000017757 Streptococcal toxic-shock syndrome Diseases 0.000 description 1
- 101000804193 Streptococcus agalactiae Chaperone protein DnaK Proteins 0.000 description 1
- 241000194024 Streptococcus salivarius Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 230000005867 T cell response Effects 0.000 description 1
- 239000008049 TAE buffer Substances 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- UQTNIFUCMBFWEJ-IWGUZYHVSA-N Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O UQTNIFUCMBFWEJ-IWGUZYHVSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- LUMXICQAOKVQOB-YWIQKCBGSA-N Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O LUMXICQAOKVQOB-YWIQKCBGSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 1
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- SJPDTIQHLBQPFO-VLCNGCBASA-N Thr-Tyr-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SJPDTIQHLBQPFO-VLCNGCBASA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 231100000650 Toxic shock syndrome Toxicity 0.000 description 1
- 206010044251 Toxic shock syndrome streptococcal Diseases 0.000 description 1
- 101000980463 Treponema pallidum (strain Nichols) Chaperonin GroEL Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- 101710162629 Trypsin inhibitor Proteins 0.000 description 1
- 229940122618 Trypsin inhibitor Drugs 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- JXNRXNCCROJZFB-RYUDHWBXSA-N Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JXNRXNCCROJZFB-RYUDHWBXSA-N 0.000 description 1
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- PDSLRCZINIDLMU-QWRGUYRKSA-N Tyr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PDSLRCZINIDLMU-QWRGUYRKSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- VNYDHJARLHNEGA-RYUDHWBXSA-N Tyr-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 1
- BMPPMAOOKQJYIP-WMZOPIPTSA-N Tyr-Trp Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C([O-])=O)C1=CC=C(O)C=C1 BMPPMAOOKQJYIP-WMZOPIPTSA-N 0.000 description 1
- OYOQKMOWUDVWCR-RYUDHWBXSA-N Tyr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OYOQKMOWUDVWCR-RYUDHWBXSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- IBIDRSSEHFLGSD-YUMQZZPRSA-N Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-YUMQZZPRSA-N 0.000 description 1
- WITCOKQIPFWQQD-FSPLSTOPSA-N Val-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O WITCOKQIPFWQQD-FSPLSTOPSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- PNVLWFYAPWAQMU-CIUDSAMLSA-N Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C PNVLWFYAPWAQMU-CIUDSAMLSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- KRNYOVHEKOBTEF-YUMQZZPRSA-N Val-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O KRNYOVHEKOBTEF-YUMQZZPRSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- UZQJVUCHXGYFLQ-AYDHOLPZSA-N [(2s,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-4-[(2r,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-3,5-dihydroxy-6-(hydroxymethyl)-4-[(2s,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxy-3,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-3,5-dihydroxy-6-(hy Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O)O[C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O)O[C@H]1CC[C@]2(C)[C@H]3CC=C4[C@@]([C@@]3(CC[C@H]2[C@@]1(C=O)C)C)(C)CC(O)[C@]1(CCC(CC14)(C)C)C(=O)O[C@H]1[C@@H]([C@@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O[C@H]4[C@@H]([C@@H](O[C@H]5[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O5)O)[C@H](O)[C@@H](CO)O4)O)[C@H](O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UZQJVUCHXGYFLQ-AYDHOLPZSA-N 0.000 description 1
- HGEVZDLYZYVYHD-UHFFFAOYSA-N acetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound CC(O)=O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O HGEVZDLYZYVYHD-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000003463 adsorbent Substances 0.000 description 1
- 230000037006 agalactosis Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000007818 agglutination assay Methods 0.000 description 1
- 238000011256 aggressive treatment Methods 0.000 description 1
- 108010017893 alanyl-alanyl-alanine Proteins 0.000 description 1
- 108010084094 alanyl-alanyl-alanyl-alanine Proteins 0.000 description 1
- 108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 150000003862 amino acid derivatives Chemical class 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical class N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 208000019812 amnionitis Diseases 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000001727 anti-capsular Effects 0.000 description 1
- 230000000845 anti-microbial effect Effects 0.000 description 1
- 229960004405 aprotinin Drugs 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- XKNKHVGWJDPIRJ-UHFFFAOYSA-N arsanilic acid Chemical compound NC1=CC=C([As](O)(O)=O)C=C1 XKNKHVGWJDPIRJ-UHFFFAOYSA-N 0.000 description 1
- 229950002705 arsanilic acid Drugs 0.000 description 1
- 206010003246 arthritis Diseases 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 239000012911 assay medium Substances 0.000 description 1
- 230000001363 autoimmune Effects 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 239000000440 bentonite Substances 0.000 description 1
- 229910000278 bentonite Inorganic materials 0.000 description 1
- SVPXDRXYRYOSEX-UHFFFAOYSA-N bentoquatam Chemical compound O.O=[Si]=O.O=[Al]O[Al]=O SVPXDRXYRYOSEX-UHFFFAOYSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 239000013522 chelant Substances 0.000 description 1
- 238000010382 chemical cross-linking Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 229940038705 chlamydia trachomatis Drugs 0.000 description 1
- 108010031071 cholera toxoid Proteins 0.000 description 1
- 229960003178 choline chloride Drugs 0.000 description 1
- SGMZJAMFUVOLNK-UHFFFAOYSA-M choline chloride Chemical compound [Cl-].C[N+](C)(C)CCO SGMZJAMFUVOLNK-UHFFFAOYSA-M 0.000 description 1
- 208000012601 choreatic disease Diseases 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- 101150096252 ctc-2 gene Proteins 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 108700041286 delta Proteins 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000003391 densitometric scan Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 208000027751 diffuse rash Diseases 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- FRTGEIHSCHXMTI-UHFFFAOYSA-N dimethyl octanediimidate Chemical compound COC(=N)CCCCCCC(=N)OC FRTGEIHSCHXMTI-UHFFFAOYSA-N 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 125000002228 disulfide group Chemical group 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- MOTZDAYCYVMXPC-UHFFFAOYSA-N dodecyl hydrogen sulfate Chemical compound CCCCCCCCCCCCOS(O)(=O)=O MOTZDAYCYVMXPC-UHFFFAOYSA-N 0.000 description 1
- 229940043264 dodecyl sulfate Drugs 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 206010014665 endocarditis Diseases 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 238000001215 fluorescent labelling Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000005251 gamma ray Effects 0.000 description 1
- 238000005227 gel permeation chromatography Methods 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 101150053330 grpE gene Proteins 0.000 description 1
- 229960004198 guanidine Drugs 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- 230000003862 health status Effects 0.000 description 1
- 230000010370 hearing loss Effects 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 210000003709 heart valve Anatomy 0.000 description 1
- 230000008642 heat stress Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 230000008348 humoral response Effects 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 150000002431 hydrogen Chemical class 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- QJHBJHUKURJDLG-UHFFFAOYSA-N hydroxy-L-lysine Natural products NCCCCC(NO)C(O)=O QJHBJHUKURJDLG-UHFFFAOYSA-N 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 239000012133 immunoprecipitate Substances 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 230000001024 immunotherapeutic effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 239000007928 intraperitoneal injection Substances 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 239000012948 isocyanate Substances 0.000 description 1
- 238000001738 isopycnic centrifugation Methods 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 201000003723 learning disability Diseases 0.000 description 1
- 229940115932 legionella pneumophila Drugs 0.000 description 1
- DVCSNHXRZUVYAM-BQBZGAKWSA-N leu-asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010091798 leucylleucine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 239000008297 liquid dosage form Substances 0.000 description 1
- 239000006193 liquid solution Substances 0.000 description 1
- 239000006194 liquid suspension Substances 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 101150023497 mcrA gene Proteins 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 229920012128 methyl methacrylate acrylonitrile butadiene styrene Polymers 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 208000029744 multiple organ dysfunction syndrome Diseases 0.000 description 1
- 125000001446 muramyl group Chemical group N[C@@H](C=O)[C@@H](O[C@@H](C(=O)*)C)[C@H](O)[C@H](O)CO 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 238000011328 necessary treatment Methods 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000013492 plasmid preparation Methods 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229960001973 pneumococcal vaccines Drugs 0.000 description 1
- 208000030773 pneumonia caused by chlamydia Diseases 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 238000002731 protein assay Methods 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 230000036647 reaction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 201000003068 rheumatic fever Diseases 0.000 description 1
- 208000004124 rheumatic heart disease Diseases 0.000 description 1
- 108700004121 sarkosyl Proteins 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000008299 semisolid dosage form Substances 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000004402 sodium ethyl p-hydroxybenzoate Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007909 solid dosage form Substances 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 208000002254 stillbirth Diseases 0.000 description 1
- 231100000537 stillbirth Toxicity 0.000 description 1
- 210000001768 subcellular fraction Anatomy 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000012134 supernatant fraction Substances 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 239000012085 test solution Substances 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 230000008646 thermal stress Effects 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- PIEPQKCYPFFYMG-UHFFFAOYSA-N tris acetate Chemical compound CC(O)=O.OCC(N)(CO)CO PIEPQKCYPFFYMG-UHFFFAOYSA-N 0.000 description 1
- 239000003656 tris buffered saline Substances 0.000 description 1
- 239000002753 trypsin inhibitor Substances 0.000 description 1
- 239000006150 trypticase soy agar Substances 0.000 description 1
- 201000008827 tuberculosis Diseases 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 208000019206 urinary tract infection Diseases 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010021889 valylvaline Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000004393 visual impairment Effects 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/12—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria
- C07K16/1267—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria
- C07K16/1275—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from bacteria from Gram-positive bacteria from Streptococcus (G)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/02—Bacterial antigens
- A61K39/09—Lactobacillales, e.g. aerococcus, enterococcus, lactobacillus, lactococcus, streptococcus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/315—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/315—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci
- C07K14/3156—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci from Streptococcus pneumoniae (Pneumococcus)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/30—Immunoglobulins specific features characterized by aspects of specificity or valency
- C07K2317/34—Identification of a linear epitope shorter than 20 amino acid residues or of a conformational epitope defined by amino acid residues
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Biomedical Technology (AREA)
- Pharmacology & Pharmacy (AREA)
- Biotechnology (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- Veterinary Medicine (AREA)
- Immunology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Pulmonology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Communicable Diseases (AREA)
- Plant Pathology (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Distillation Of Fermentation Liquor, Processing Of Alcohols, Vinegar And Beer (AREA)
Description
WO 96/40928 PCT/CA96/00322
STREPTOCOCCAL HEAT SHOCK PROTEINS MEMBERS OF THE HSP70 FAMILY
TECHNICAL FIELD OF THE INVENTION
This invention relates to novel heat shock proteins of Streptococcus pneumoniae, Streptococcus pyogenes and Streptococcus agalactiae and immunologically related polypeptides, which provide the basis for new immunotherapeutic, prophylactic and diagnostic agents useful in the treatment, prevention and diagnosis of disease. More particularly, this invention relates to heat shock proteins of S. pneumoniae, S. pyogenes and S. agalactiae, members of the HSP70 family which have an apparent molecular mass of 70-72 kilodaltons, to the corresponding nucleotide and derived amino acid sequences, to recombinant DNA methods for the production of HSP70/HSP72 and immunologically related polypeptides, to antibodies that bind to these HSP's, and to methods and compositions for the diagnosis, prevention and treatment of diseases caused by S. pneumoniae and related bacteria, such as Streptococcus pyogenes and Streptococcus agalactiae.
BACKGROUND OF THE INVENTION
S. pneumoniae is an important agent of disease in humans, especially among infants, the elderly and immunocompromised persons. It is a bacterium frequently isolated from patients with invasive diseases such as bacteraemia/septicaemia, pneumonia, and meningitis with high morbidity and mortality throughout the world.
Although the advent of antimicrobial drugs has reduced the overall mortality from pneumococcal diseases, the presence of resistant pneumococcal organisms has become a major problem in the world today. Effective pneumococcal vaccines could have a major impact on the morbidity and mortality associated with S. pneunoniae disease. Such I I ~LLI L -re~s s WO 96/40928 PCT/CA96/00322 vaccines would also potentially be useful to prevent otitis media in infants and young children.
It is clear that a number of pneumococcal factors are potentially important in the pathogenesis of disease [Boulnois, J. Gen. Microbiol., 138, pp. 249-259 (1992); C.J. Lee et al., Crit. Rev. Microbiol., 18, pp. 89-114 (1991)]. The capsule of the pneumococcus, despite its lack of toxicity, is considered to be the sine qua non of pneumococcal virulence. More than pneumococcal capsular serotypes are identified on the basis of antigenic differences. Antibodies are the mechanism of protection and the importance of anticapsular antibodies in host defenses against S. pneumoniae is well established [Austrian, Am. J. Med., 67, pp. 547-549 (1979)]. Nevertheless, the currently available pneumococcal vaccine, comprising 23 capsular polysaccharides that most frequently caused disease, has significant shortcomings such as the poor immunogenicity of capsular polysaccharides, the diversity of the serotypes and the differences in the distribution of serotypes over time, geographic areas and age groups. In particular, the failure of existing vaccines to protect young children against most serotypes has spurred evaluation of other S. pneumoniae components. Increasing evidence indicates that certain pneumococcal proteins may play an active role both in terms of protection and pathogenicity [Paton, Ann. Rev. Microbiol., 47, pp. 89-115 (1993)]. So far, however, only a few S. pneumoniae proteins have been studied. This might result from the lack of protein-specific antibodies, which makes it difficult to study the role of protein antigens in protection and pathogenicity. It is believed that the pneumococcal protein antigens are not very immunogenic and that most antibody responses are to the phosphocholine and the capsular polysaccharides [McDaniel et al., J. Exp. Med., 160, pp. 386-397 (1984); R.M. Krause, Adv. Immunol., 12, pp. 1-56 (1970); D.G. Braun et al., J. Exp. Med., 129, pp. 809-830 (1969)]. In a study using X-linked immunodeficient mice, which respond poorly to carbohydrate antigens and to phosphocholine, but make relatively normal responses to protein antigens, the frequency for obtaining monoclonal antibodies reactive with pneumococcal protein antigens was less than 10%, thus suggesting that S. pneumoniae proteins are poor immunogens [McDaniel et al., supra].
Streptococcus agalactiae, also called Group B Streptococcus (GBS), is the most common cause of sepsis (blood infection) and meningitis in newborns. GBS is also a frequent cause of newborn pneumonia. Approximately 8,000 babies in the United States get GBS disease each year; 5%-15% of these babies die. Babies that survive, particularly those who have meningitis, may have long-term problems, such as hearing or vision loss or learning disabilities. In pregnant women, GBS can cause urinary tract infections, womb infections (amnionitis, endometritis), and stillbirth. Among women who are not pregnant and men, the most common diseases caused by GBS are blood infections, skin or soft tissue infections, and pneumonia. Approximately 20% of men and nonpregnant women with GBS disease die of the disease. GBS infections in both newborns and adults are usually treated with antibiotics (e.g., penicillin or ampicillin) given intravenously. Most GBS disease in newborns can be prevented by giving certain pregnant women antibiotics intravenously during labor. Vaccines to prevent GBS disease are being developed. In the future, it is expected that women who are vaccinated will make antibodies that cross the placenta and protect the baby during birth and early infancy.
Since the 1980s, Streptococcus pyogenes, also called Group A Streptococcus (GAS), has been reemerging as a cause of severe diseases, which may be due to an increase in virulence of the organism. GAS causes pharyngitis, commonly called "strep throat", and skin infections (impetigo, erysipelas/cellulitis). "Strep throat" and impetigo can lead to glomerulonephritis (kidney damage).
Approximately 3% of "strep throat" infections result in rheumatic fever (migrating arthritis) whose complications include chorea (neurological symptoms) and, in 50% of the cases, rheumatic heart disease (heart valve damage) with endocarditis as a possible long term consequence. It is important to treat impetigo and "strep throat" with antibiotics to prevent the development of complications.
Infection with toxin-producing strains can result in scarlet fever (diffuse rash and fever) or in the extremely severe streptococcal toxic shock syndromes (TSS; GAS have been termed 'flesh eating bacteria') which are characterized by the rapid development of shock and multiple organ system failure. TSS have a 30 to fatality rate in spite of aggressive treatment involving removal of the focus of bacterial infection and antibiotic therapy. The incidence of TSS is 10 to cases per 100,000. No vaccine against GAS is presently available.
Heat shock or stress proteins ("HSPs") are among the most highly conserved and abundant proteins found in nature [Neidhardt et al., Ann. Rev. Genet., 18, pp. 295-329 (1984); S. Lindquist, Ann. Rev. Biochem., pp. 1151-1191 (1986)]. They are produced by all cells in response to various physiological and nonphysiological stimuli. The heat shock response, in which a sudden increase in temperature induces the synthesis of HSPs, is the best studied of the stress responses. Other environmental conditions such as low pH, iron deficiency and hydrogen peroxide can also induce HSPs. The HSPs have been defined by their size, and members of hsp90, and hsp60 families are among the major HSPs found in all prokaryotes and eukaryotes. These proteins fulfill a variety of chaperone functions by aiding protein folding and assembly and assisting translocation across membranes [Georgopoulos and W.J. Welch, Ann. Rev. Cell. Biol., 9, pp. 601-634 (1993); D. Ang et al., J. Biol. Chem., 266, pp. 24233-24236 (1991)]. As molecular chaperones and possibly via other mechanisms, HSPs are likely involved in protecting cells from the deleterious effects of stress.
The fact that several virulence factors are regulated by environmental conditions suggests a role for HSPs in microbial pathogenicity [Mekalanos, J. Bacteriol., 174, pp. 1-7 (1992); P.J. Murray and R.A. Young, J. Bacteriol., 174, pp. 4193-4196 (1992)]. In that respect, recent studies on Salmonella species suggest that the stress response might be critically linked to the ability of intracellular pathogens to initiate and sustain an infection [Buchmeier and F. Heffron, Science, 248, pp. 730-732 (1990); K.Z. Abshire and F.C. Neidhardt, J. Bacteriol., 175, pp. 3734-3743 (1993); B.B. Finlay et al., Science, 243, pp. 940-943 (1989)]. Others have demonstrated that listeriolysin, an essential virulence factor in L. monocytogenes, is induced under heat shock conditions [Sokolovic and W. Goebel, Infect. Immun., 57, pp. 295-298 (1989)].
Evidence is now accumulating that HSPs are major antigens of many pathogens. Members of the hsp60 family, also called GroEL-related proteins for their similarity to the E. coli GroEL protein, are major antigens of a variety of bacterial pathogens including Mycobacterium leprae and Mycobacterium tuberculosis [Young et al., Proc. Natl. Acad. Sci. USA, 85, pp. 4267-4270 (1988)], Legionella pneumophila [Plikaytis et al., J. Clin. Microbiol., pp. 2080-2084 (1987)], Borrelia burgdorferi [Luft et al., J. Immunol., 146, pp. 2776-2782 (1991)], and Chlamydia trachomatis [Wagar et al., J. Infect. Dis., 162, pp. 922-927 (1990)]. This antigen is a homologue of the ubiquitous "common antigen", and is believed to be present in every bacterium [Thole et al., Microb. Pathogen., 4, pp. 71-83 (1988)]. Antibodies to the members of the hsp70 family, or DnaK-related proteins, have also been described for several bacterial and parasitic infections [Young et al., supra; Luft et al., supra; D.M. Engman et al., J. Immunol., 144, pp. 3987-3991 (1990); N.M. Rothstein et al., Molec. Biochem. Parasitol., 33, pp. 229-235 (1989); V. Nussenzweig and R.S. Nussenzweig, Adv. Immunol., 45, pp. 283-334 (1989)]. HSPs can elicit strong B- and T-cell responses and it was shown that of the CD4+ T-lymphocytes from mice inoculated with M. tuberculosis were reactive to the hsp60 protein alone [Kaufman et al., Eur. J. Immunol., 17, pp. 351-357 (1987)]. Similarly, 7 out of a collection of 24 monoclonal antibodies to M. leprae proteins recognized determinants on hsp60 [Engers et al., Infect. Immun., 48, pp. 603-605 (1985)]. It seems that the immune response to stress proteins might play an important role in protection against infection. Consistent with that is the demonstration that antibodies and T cells reactive with microbial HSPs can exhibit neutralizing and protective activities [Noll et al., Infect. Immun., 62, pp. 2784-2791 (1994); and S.L. Danilition et al., Infect. Immun., 58, pp. 189-196 (1990)]. The immunological properties of stress proteins make them attractive as vaccine components and several HSPs are presently being considered for preventing microbial infection and treating cancer. So far, however, studies have focused on intracellular pathogens such as Mycobacteria, Salmonella, Chlamydia and several parasites. Information concerning the heat shock protein antigens in extracellular gram-positive bacteria is far less documented. In S. pneumoniae, S. pyogenes and S. agalactiae, neither the heat shock proteins nor their gene structures have been identified.
DISCLOSURE OF THE INVENTION
The present invention addresses the problems referred to above by providing novel heat shock proteins from S. pneumoniae, S. pyogenes and S. agalactiae, and immunologically related polypeptides. Also provided are DNA sequences that code for the foregoing polypeptides, vectors containing the polypeptides, unicellular hosts transformed with those vectors, and a process for making substantially pure, recombinant polypeptides. Also provided are antibodies specific to the foregoing polypeptides. The polypeptides, DNA sequences and antibodies of this invention provide the basis for novel methods and pharmaceutical compositions for the detection, prevention and treatment of disease. Particularly, this invention provides a novel vaccine based on fragments of these polypeptides that are specific to streptococcal strains.
The novel heat shock protein is the approximately 72 kDa heat shock protein of Streptococcus pneumoniae ("HSP72") (SEQ ID NO:5), the approximately kDa heat shock protein of Streptococcus pyogenes (SEQ ID NO:20) and the approximately 70 kDa heat shock protein of Streptococcus agalactiae ("HSP70") (SEQ ID NO:22), including analogues, homologues, and derivatives thereof, and fragments of the foregoing polypeptides containing at least one immunogenic epitope. Preferred fragments of HSP70/72 include the C-terminal portion of the HSP70/72 polypeptides. More particularly, it includes the C-terminal 169-residue fragment (residues 439-607, SEQ ID NO:5), the C-terminal 151-residue fragment (residues 457-607, SEQ ID NO:5), and smaller fragments consisting of peptide epitopes within the C-169 region. Particularly preferred fragments within the C-169 region of HSP72 include the peptide sequences GFDAERDAAQAALDD (residues 527-541 of SEQ ID NO:5) and AEGAQATGNAGDDVV (residues 586-600 of SEQ ID NO:5), which are exclusive to HSP72 of Streptococcus pneumoniae. Even more preferred are fragments that elicit an immune reaction against S. pneumoniae, S. pyogenes and S. agalactiae but do not provoke an auto-immune reaction in a human host. Such fragments may be selected from the following peptides: CS870, CS873, CS874, CS875, CS876, CS877, CS878, CS879, CS880, CS882, MAP1, MAP2, MAP3 and MAP4 (see TABLE 5, supra).
Preferred antibodies of this invention are the F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 monoclonal antibodies which are specific to HSP72.
More preferred antibodies are the F2-Pn3.2 and F2-Pn3.4 monoclonal antibodies that are specific to both HSP70 and HSP72. Even more preferred are the F1-Pn3.1 antibodies that are specific for Streptococcus pneumoniae.
The preferred polypeptides and antibodies of this invention provide the basis for novel methods and pharmaceutical compositions for the detection, prevention and treatment of pneumococcal diseases.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 depicts a fluorogram, which shows the effect of heat shock on S. pneumoniae protein synthesis. The cell extracts in panel A are S. pneumoniae type 6 strain 64. The cell extracts in panel B are S. pneumoniae type 4 strain 53. The cell extracts in the odd numbered lanes were incubated at 37°C. The cell extracts in the even numbered lanes were incubated at 45°C for 5 minutes. The cell extracts were then labeled with [35S]methionine for 10 minutes (lanes 1, 2 and 7, 8), 30 minutes (lanes 3, 4 and 9, 10), or 60 minutes (lanes 5, 6 and 11, 12). Molecular mass markers in kilodaltons are shown to the left. The positions of HSP80, HSP72 and HSP62 are shown by arrows at the right-hand side of each panel.
FIG. 2 is a graphical depiction of a comparison of the electrophoretic profiles of [35S]methionine-labeled proteins in S. pneumoniae in the presence or absence of exposure to heat shock. Densitometric tracings were determined by measuring the relative optical density (Y axis) vs. the mobility of labeled protein bands (X axis). The densitometric scans of the SDS-PAGE of FIG. 1, lanes 1 and 2, are shown.
FIG. 3 depicts a fluorogram, which shows the S. pneumoniae protein antigens immunoprecipitated by sera from mice immunized with detergent-soluble S. pneumoniae protein extract. [35S]methionine-labeled proteins from S. pneumoniae grown at 37°C and incubated at 37°C (lanes 3, 5, 7 and 9) or heat-shocked at 45°C (lanes 4, 6, 8 and 10) were immunoprecipitated with sera from mouse 1 (lanes 3 to 6) or mouse 2 (lanes 7 to 10) and then analyzed by SDS-PAGE and fluorography. The sera were tested after the first (lanes 3, 4 and 7, 8) and after the second (lanes 5, 6 and 9, 10) immunization. Cell lysates from [35S]methionine-labeled non heat-shocked and heat-shocked S. pneumoniae are shown in lanes 1 and 2, respectively. The position of HSPs is indicated by the arrows at the left of the fluorogram.
FIG. 4 depicts a fluorogram, which shows the S. pneumoniae protein antigens immunoprecipitated by sera from mice immunized with heat-killed S. pneumoniae bacteria. [35S]methionine-labeled proteins from S. pneumoniae grown at 37°C and incubated at 37°C (lanes 3, 5 and 7) or heat-shocked at 45°C (lanes 4, 6 and 8) were immunoprecipitated with sera from mouse 1 (lanes 3, 4), mouse 2 (lanes 5, 6) or mouse 3 (lanes 7, 8) and then analyzed by SDS-PAGE and fluorography. Sera were tested after the second immunization only. Cell lysates from [35S]methionine-labeled non heat-shocked and heat-shocked S. pneumoniae are shown in lanes 1 and 2, respectively. The position of HSPs is indicated by the arrows at the left of the fluorogram.
FIG. 5 depicts a photograph, which shows the S. pneumoniae antigens detected by Western blot analysis. Whole cell extracts were probed with sera from 15 mice (lanes 1-15) immunized with heat-killed S. pneumoniae bacteria. Lane 16 shows the HSP72 protein detected by MAb F1-Pn3.1. In panel A, the sera were tested after the second immunization. In panel B, the reactivity of 4 out of 15 sera tested after the first immunization is shown. The positions of 53.5 kDa- and 47 kDa-protein bands are indicated by the bars at the left. The position of HSP72 is shown by the arrows at the right of each panel.
FIG. 6 depicts a fluorogram showing the specificity of MAb F1-Pn3.1 for HSP72. [35S]methionine-labeled proteins of S. pneumoniae in the absence (lanes 1, 3 and 5) or presence (lanes 2, 4 and 6) of exposure to heat shock were immunoprecipitated with IgG2a-control MAb (lanes 3, 4) or F1-Pn3.1 (lanes 5, 6) and then analyzed by SDS-PAGE and fluorography. Cell lysates from [35S]methionine-labeled non heat-shocked and heat-shocked S. pneumoniae are shown in lanes 1 and 2, respectively. The position of HSPs (all three) is shown by the arrows at the left of the fluorogram.
FIG. 7, panel A, depicts an immunoblot, which shows the reaction of heat-shocked and non heat-shocked [35S]methionine-labelled S. pneumoniae cell extracts with MAb F1-Pn3.1. Lane 1 contains heat-shocked cell lysates (45°C). Lane 2 contains non heat-shocked cell lysates (37°C). Panel B depicts a fluorogram of the immunoblot shown in panel A.
FIG. 8 depicts a Western blot, which shows subcellular localization of S. pneumoniae HSP72. Samples containing 15 µg protein of membrane fraction (lane 1) and cytoplasmic fraction (lane 2) of S. pneumoniae were electrophoresed on SDS-PAGE, transferred to nitrocellulose and probed with MAb F1-Pn3.1.
FIG. 9 is a photograph of an immunoblot showing the reactivity of recombinant fusion proteins containing the C-169 region of S. pneumoniae HSP72 with MAb F1-Pn3.1. Lane 1 contains whole cell extracts from S. pneumoniae strain 64 probed with HSP72-specific MAb F1-Pn3.1. Lanes 2 and 3 contain phage lysates from E. coli infected with λJBD17 cultured in the presence or absence of IPTG and probed with HSP72-specific MAb F1-Pn3.1. Lanes 4 and 5 contain phage lysates from E. coli infected with λJBD7 cultured in the presence or absence of IPTG and probed with HSP72-specific MAb F1-Pn3.1. Molecular mass markers are shown to the left. The positions of the 74 kDa- and 160 kDa-reactive proteins are shown on the left and on the right, respectively.
FIG. 10 is a schematic representation of the restriction map of the HSP72(DnaK) and Fuc loci and inserts of recombinant clones. The relationships between DNA fragments are shown with respect to each other. FIGS. 10A and 10C illustrate the restriction map of the HSP72(DnaK) and Fuc loci, respectively. FIG. 10B illustrates the inserts of the various phages and plasmids described in Example 3. H(HindIII); E(EcoRI); V(EcoRV); P(PstI); and X(XhoI) indicate positions of restriction endonuclease sites. DNA fragments on the HSP72/DnaK locus, the Fuc locus and fragments used as probes in the Southern blot analyses are indicated.
FIG. 11 depicts the SDS-PAGE and Western blot analyses of the recombinant 74 kDa protein. Whole cell extracts from E. coli transformed with plasmids pJBD179 (lane 1), pJBDf51 (lanes 2 and 3) and pJBDf62 (lanes 4 and 5) and cultured in presence or absence of IPTG were subjected to 10% polyacrylamide gel electrophoresis. The proteins were then visualized by Coomassie Blue staining or Western blotting using HSP-specific MAb F1-Pn3.1. Molecular mass markers in kilodaltons are shown to the left. The arrow at the left-hand side of each panel marks the 74 kDa protein marker.
FIG. 12 depicts the detection of native and recombinant HSP72 antigens by Western blot analysis. Whole cell lysates from E. coli transformed with plasmids pJBDk51 (lanes 1 and 3) and pJBD291 (lane 2) and cell lysates from S. pneumoniae strain 64 (lane 4) were subjected to 10% polyacrylamide gel electrophoresis and were electrotransferred to nitrocellulose. The immunoblot was probed with HSP72-specific MAb F1-Pn3.1.
FIGS. 13A-13D depict a comparison of the predicted amino acid sequence of the S. pneumoniae HSP72 open reading frame (HSP72 SPNEU) with those previously reported for the following HSP70/DnaK proteins: ECOLI, Escherichia coli; BORBU, Borrelia burgdorferi; BRUOV, Brucella ovis; CHLPN, Chlamydia pneumoniae; BACME, Bacillus megaterium; BACSU, Bacillus subtilis; STAAU, Staphylococcus aureus; LACLA, Lactococcus lactis; and MYCTU, Mycobacterium tuberculosis. Only mismatched amino acids are indicated. Identical and conserved amino acids are boxed and shadowed, respectively.
FIG. 14 depicts a photograph of an SDS-PAGE, which shows the recombinant S. pneumoniae HSP72 purified by affinity chromatography. Supernatant fractions from E. coli (pJBDk51) lysates (lane 2) and 20 µg of immunoaffinity-purified HSP72rec (lane 3) were subjected to polyacrylamide gel electrophoresis. The proteins were then visualized by Coomassie Blue staining. Lane 1 shows the migration of molecular mass markers (106 kDa, 80 kDa, 49.5 kDa, 32.5 kDa, 27.5 kDa and 18.5 kDa).
FIG. 15 depicts a photograph of SDS-PAGE, which shows the recombinant S. pneumoniae C-169 fragment purified by solubilization of inclusion bodies. Various amounts of purified C-169 protein (lane 1, 5 µg; lane 2, µg; and lane 3, 1 µg) and whole cell lysates from E. coli transformed with plasmids pDELTAl (lane 4) and pJBDAl (lane 5) were subjected to 10% polyacrylamide gel electrophoresis. The proteins were then visualized by Coomassie Blue staining.
FIG. 16 is a graphical depiction of the survival curve of Balb/c mice protected from S. pneumoniae infection by immunization with HSP72rec. Data are presented as the per cent survival over a period of 14 days for a total of 10 mice per experimental group.
FIG. 17 is a graphical depiction of the survival curve of Balb/c mice protected from S. pneumoniae infection by immunization with C-169rec. Data are presented as the per cent survival over a period of 14 days for a total of 10 mice per experimental group.
FIG. 18 is a map of plasmid pURV3 containing C-151rec, the coding region for the 151 amino acids at the carboxyl end of the HSP72 of S. pneumoniae; AmpiR, ampicillin-resistance coding region; ColE1 ori, origin of replication; cI857, bacteriophage λ cI857 temperature-sensitive repressor gene; λ PL, bacteriophage λ transcription promoter; T1, T1 transcription terminator. The direction of transcription is indicated by the arrows. BglII and BamHI are the restriction sites used to insert the coding region for the C-151rec of the HSP72 of S. pneumoniae.
FIG. 19 illustrates the distribution of anti-S. pneumoniae titers in sera from Balb/c mice immunized with HSP72rec. Sera were collected after the first, second and third injection with 1 µg or 5 µg of HSP72rec and evaluated individually for anti-S. pneumoniae antibody by ELISA. Titers were defined as the highest dilution at which the A410 values were 0.1 above the background values. Plain lines indicate the median reciprocal of antibody titers for each group of mice while the dashed line indicates the median value for preimmune sera.
FIG. 20 illustrates the distribution of anti-S. pneumoniae titers in sera from Balb/c mice immunized with C-169rec. Sera were collected after the first, second and third injection with 1 µg or 5 µg of C-169rec and evaluated individually for anti-S. pneumoniae antibody by ELISA. Titers were defined as the highest dilution at which the A410 values were 0.1 above the background values. Plain lines indicate the median reciprocal of antibody titers for each group of mice while the dashed line indicates the median value for preimmune sera.
FIG. 21 illustrates the distribution of anti-S. pneumoniae titers in sera from Balb/c mice immunized with C-151rec. Sera were collected after the first, second and third injection with 0.5 µg of C-151rec and evaluated individually for anti-S. pneumoniae antibody by ELISA. Titers were defined as the highest dilution at which the A410 values were 0.1 above the background values. Plain lines indicate the median reciprocal of antibody titers for each group of mice while the dashed line indicates the median value for preimmune sera.
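The ELISA titer definition used for FIGS. 19 to 21 can be illustrated with a minimal sketch. It assumes that "0.1 above the background values" means an A410 reading of at least background plus 0.1, and that dilutions are recorded as reciprocals (100 for 1:100). The function name, the dilution series and the readings are hypothetical and are not data from this specification.

```python
from statistics import median


def endpoint_titer(readings, background, margin=0.1):
    """Return the reciprocal of the highest dilution that is still positive.

    readings: dict mapping reciprocal dilution (e.g. 100 for 1:100) to A410.
    A dilution counts as positive when its A410 is at least background + margin
    (assumed reading of "0.1 above the background values").
    Returns None when no dilution is positive.
    """
    cutoff = background + margin
    positive = [dilution for dilution, a410 in readings.items() if a410 >= cutoff]
    return max(positive) if positive else None


# Hypothetical two-fold dilution series for one serum (illustrative values only).
serum = {100: 1.25, 200: 0.9, 400: 0.5, 800: 0.25, 1600: 0.125}
titer = endpoint_titer(serum, background=0.0625)
print(titer)

# The figures plot the median reciprocal titer per group of mice.
group_titers = [400, 800, 1600]  # hypothetical group
print(median(group_titers))
```

Storing reciprocal dilutions as integers keeps the "highest dilution" comparison a plain maximum and matches the "median reciprocal of antibody titers" plotted in the figures.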
FIG. 22 illustrates the antibody response of cynomolgus monkeys immunized with recombinant HSP72 antigens. Groups of two monkeys were immunized with either HSP72rec or C-169rec protein at day 1, day 22 and day 77. Sera were collected regularly during the course of the immunization and evaluated individually for pneumococcal HSP72 specific antibody by Western blot analysis. Titers were defined as the highest dilution at which the HSP72 band was visualized.
FIG. 23 illustrates the binding of hyperimmune sera to peptides in a solid-phase ELISA. Rabbit, mouse and monkey sera from animals immunized with either HSP72rec or C-169rec protein were tested for their reactivity to peptides. Optical density values were obtained with sera tested at a dilution of 1:100 except for the values corresponding to the reactivity of rabbit sera to peptide MAP2 and murine sera to peptides MAP2 and MAP4 which were obtained with sera diluted 1:1000.
FIG. 24 depicts the consensus sequence established from the DNA sequences of the hsp70/dnak open reading frames of Streptococcus pneumoniae (spn-orf), Streptococcus pyogenes (sga-orf) and Streptococcus agalactiae (sgb-orf) and indicates the substitutions and insertions of nucleotides specific to each species.
FIG. 25 depicts the consensus sequence established from the protein sequences of the Hsp70 of Streptococcus pneumoniae (spn-prot), Streptococcus pyogenes (sga-prot) and Streptococcus agalactiae (sgb-prot) and indicates the substitutions and insertions of amino acids specific to each species.
FIG. 26 depicts a fluorogram, which shows the effect of heat shock on S. agalactiae protein synthesis and the S. agalactiae protein antigen immunoprecipitated by MAb F2-Pn3.4. Cell lysates from [35S]methionine-labeled proteins from S. agalactiae grown at 37°C and incubated at 37°C (odd numbered lanes) or heat-shocked at 43°C (even numbered lanes) were analysed by SDS-PAGE and fluorography. Lanes 3 and 4 show the immunoprecipitates obtained using MAb F2-Pn3.4.
DETAILED DESCRIPTION OF THE INVENTION
According to one aspect of the invention, we provide novel heat shock proteins of S. pneumoniae, S. pyogenes and S. agalactiae, and analogues, homologues, derivatives and fragments thereof, containing at least one immunogenic epitope. As used herein, a "heat shock protein" is a naturally occurring protein that exhibits preferential transcription during heat stress conditions.
The heat shock protein according to the invention may be of natural origin, or may be obtained through the application of recombinant DNA techniques, or conventional chemical synthesis techniques.
As used herein, "immunogenic" means having the ability to elicit an immune response. The novel heat shock proteins of this invention are characterized by their ability to elicit a protective immune response against Streptococcal infections, more particularly against lethal S. pneumoniae, S. pyogenes and S. agalactiae.
The invention particularly provides a Streptococcus pneumoniae heat shock protein of approximately 72 kDa ("HSP72"), having the deduced amino acid sequence of SEQ ID NO:5, and analogues, homologues, derivatives and fragments thereof, containing at least one immunogenic epitope.
As used herein, "analogues" of HSP72 are those S. pneumoniae proteins wherein one or more amino acid residues in the HSP72 amino acid sequence (SEQ ID NO:5) is replaced by another amino acid residue, providing that the overall functionality and immunogenic properties of the analogue protein are preserved. Such analogues may be naturally occurring, or may be produced synthetically or by recombinant DNA technology, for example, by mutagenesis of the HSP72 sequence. Analogues of HSP72 will possess at least one antigen capable of eliciting antibodies that react with HSP72, e.g. Streptococcus pyogenes and Streptococcus agalactiae.
As used herein, "homologues" of HSP72 are proteins from Streptococcal species other than pneumoniae, pyogenes or agalactiae, or genera other than Streptococcus wherein one or more amino acid residues in the HSP72 amino acid sequence (SEQ ID NO:5) is replaced by another amino acid residue, providing that the overall functionality and immunogenic properties of the homologue protein are preserved. Such homologues may be naturally occurring, or may be produced synthetically or by recombinant DNA technology. Homologues of HSP72 will possess at least one antigen capable of eliciting antibodies that react with HSP72, e.g. Enterococcus faecalis.
As used herein, a "derivative" is a polypeptide in which one or more physical, chemical, or biological properties has been altered. Such alterations include, but are not limited to: amino acid substitutions, modifications, additions or deletions; alterations in the pattern of lipidation, glycosylation or phosphorylation; reactions of free amino, carboxyl, or hydroxyl side groups of the amino acid residues present in the polypeptide with other organic and non-organic molecules; and other alterations, any of which may result in changes in primary, secondary or tertiary structure.
The "fragments" of this invention will have at least one immunogenic epitope. An "immunogenic epitope" is an epitope that is instrumental in eliciting an immune response. The preferred fragments of this invention will elicit an immune response sufficient to prevent or lessen the severity of infection, e.g., S. pneumoniae infection.
Preferred fragments of HSP72 include the C-terminal region of the polypeptides. More preferred fragments include the C-terminal 169-residue fragment (SEQ ID NO:5, residues 439-607), the C-terminal 151-residue fragment ("C-151") (SEQ ID NO:5, residues 457-607) and smaller fragments consisting of peptide epitopes within the C-169 region.
Particularly preferred fragments within the C-169 region of HSP72 include the peptide sequences GFDAERDAAQAALDD (residues 527-541 of SEQ ID NO:5) and AEGAQATGNAGDDVV (residues 586-600 of SEQ ID NO:5), which are exclusive to HSP72 of Streptococcus pneumoniae, or corresponding degenerate fragments from S. pyogenes or S. agalactiae (see FIG. 25). Even more preferred are fragments that elicit a specific immune reaction against Streptococcal strains. Such fragments may be selected from the following peptides: CS870, CS873, CS874, CS875, CS876, CS877, CS878, CS879, CS880, CS882, MAP1, MAP2, MAP3 and MAP4 (see TABLE 5, supra), or homologues thereof.
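The residue coordinates quoted above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive on SEQ ID NO:5; the dictionary and function names are illustrative only and form no part of the specification.

```python
# Residue spans as quoted in the text (assumed 1-based, inclusive, on SEQ ID NO:5).
FRAGMENTS = {
    "C-169": (439, 607),
    "C-151": (457, 607),
}
PEPTIDES = {
    "GFDAERDAAQAALDD": (527, 541),
    "AEGAQATGNAGDDVV": (586, 600),
}


def span_length(start, end):
    """Number of residues in an inclusive span."""
    return end - start + 1


def contains(outer, inner):
    """True when the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, (start, end) in FRAGMENTS.items():
    # Each fragment name should match its own residue count.
    print(name, span_length(start, end))

for sequence, span in PEPTIDES.items():
    # Each peptide should have as many letters as its span has residues,
    # and should fall inside the C-169 region.
    print(sequence, len(sequence) == span_length(*span), contains(FRAGMENTS["C-169"], span))
```

Under these assumptions the two fragment names agree with their residue counts, and both 15-residue peptides fall inside the C-169 region (and also inside C-151).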
In a further aspect of the invention, we provide polypeptides that are immunologically related to HSP70/72. As used herein, "immunologically related" polypeptides are characterized by one or more of the following properties: they are immunologically reactive with antibodies generated by infection of a mammalian host with Streptococcus pneumoniae cells, which antibodies are immunologically reactive with HSP72 (SEQ ID NO:5) and (SEQ ID NO:20 and SEQ ID NO:22); they are capable of eliciting antibodies that are immunologically reactive with HSP72 (SEQ ID NO:5) and (SEQ ID NO:20 and SEQ ID NO:22); they are immunologically reactive with antibodies elicited by immunization of a mammal with HSP72 (SEQ ID NO:5). By definition, analogues, homologues and derivatives of HSP70/72 are immunologically related polypeptides. Moreover, all immunologically related polypeptides contain at least one HSP70/72 antigen. Accordingly, "HSP70/72 antigens" may be found in HSP70/72 itself, or in immunologically related polypeptides.
In a further aspect of the invention, we provide polypeptides that are immunologically related to HSP72. As used herein, "immunologically related" polypeptides are characterized by one or more of the following properties: they are immunologically reactive with antibodies generated by infection of a mammalian host with Streptococcus pneumoniae cells, which antibodies are immunologically reactive with HSP72 (SEQ ID NO:5); they are capable of eliciting antibodies that are immunologically reactive with HSP72 (SEQ ID NO:5); they are immunologically reactive with antibodies elicited by immunization of a mammal with HSP72 (SEQ ID NO:5). By definition, analogues, homologues and derivatives of HSP72 are immunologically related polypeptides. Moreover, all immunologically related polypeptides contain at least one HSP72 antigen. Accordingly, "HSP72 antigens" may be found in HSP72 itself, or in immunologically related polypeptides.
As used herein, "related bacteria" are bacteria that possess antigens capable of eliciting antibodies that react with HSP72. Examples of related bacteria include Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus mutans, Streptococcus sanguis, Streptococcus agalactiae and Enterococcus faecalis.
It will be understood that by following the examples of this invention, one of skill in the art may determine without undue experimentation whether a particular analogue, homologue, derivative, immunologically related polypeptide, or fragment would be useful in the diagnosis, prevention or treatment of disease. Useful polypeptides and fragments will elicit antibodies that are immunoreactive with HSP72 (Example 4).
Preferably, useful polypeptides and fragments will demonstrate the ability to elicit a protective immune response against lethal bacterial infection (Example Also included are polymeric forms of the polypeptides of this invention. These polymeric forms include, for example, one or more polypeptides that have been crosslinked with crosslinkers such as avidin/biotin, glutaraldehyde or dimethylsuberimidate. Such polymeric forms also include polypeptides containing two or more tandem or inverted contiguous protein sequences, produced from multicistronic mRNAs generated by recombinant DNA technology.
This invention provides substantially pure HSP72 and immunologically related polypeptides. The term "substantially pure" means that the polypeptides according to the invention, and the DNA sequences encoding them, are substantially free from other proteins of bacterial origin. Substantially pure protein preparations may be obtained by a variety of conventional processes, for example the procedures described in Examples 3 and In another aspect, this invention provides, for the first time, a DNA sequence coding for a heat shock protein of S. pneumoniae, specifically, HSP72 (SEQ ID NO:4, nucleotides 682-2502).
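The coordinate range quoted for the HSP72 coding sequence can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the text does not state the convention). The function name is illustrative only.

```python
# Coordinates quoted in the text: SEQ ID NO:4, nucleotides 682-2502.
# Assumption: positions are 1-based and inclusive.
CDS_START = 682
CDS_END = 2502


def inclusive_length(start: int, end: int) -> int:
    """Number of positions in a 1-based inclusive range."""
    return end - start + 1


cds_length_nt = inclusive_length(CDS_START, CDS_END)
# A coding sequence should be a whole number of codons (3 nt each).
codons, remainder = divmod(cds_length_nt, 3)

print(f"length = {cds_length_nt} nt, codons = {codons}, remainder = {remainder}")
```

Under that assumption the range spans 1821 nucleotides, i.e. 607 codons with no remainder, which agrees with the residue numbering used for SEQ ID NO:5 elsewhere in this description (the C-terminal fragments end at residue 607).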
The DNA sequences of this invention also include DNA sequences coding for polypeptide analogues and homologues of HSP72, DNA sequences coding for immunologically related polypeptides, DNA sequences that are degenerate to any of the foregoing DNA sequences, and fragments of any of the foregoing DNA sequences. It will be readily appreciated that a person of ordinary skill in the art will be able to determine the DNA sequence of any of the polypeptides of this invention, once the polypeptide has been identified and isolated, using conventional DNA sequencing techniques.
Oligonucleotide primers and other nucleic acid probes derived from the genes encoding the polypeptides of this invention may also be used to isolate and clone other related proteins from S. pneumoniae and related bacteria which may contain regions of DNA that are homologous to the DNA sequences of this invention. In addition, the DNA sequences of this invention may be used in PCR reactions to detect the presence of S. pneumoniae or related bacteria in a biological sample.
The polypeptides of this invention may be prepared by a variety of processes, for example by protein fractionation from appropriate cell extracts, using conventional separation techniques such as ion exchange and gel chromatography and electrophoresis, or by the use of recombinant DNA techniques. The use of recombinant DNA techniques is particularly suitable for preparing substantially pure polypeptides according to the invention.
Thus, according to a further aspect of the invention, we provide a process for the production of HSP72, immunologically related polypeptides, and fragments thereof, comprising the steps of culturing a unicellular host organism transformed with a vector containing a DNA sequence coding for said polypeptide or fragment and one or more expression control sequences operatively linked to the DNA sequence, and recovering a substantially pure polypeptide or fragment.
As is well known in the art, in order to obtain high expression levels of a transfected gene in a host, the gene must be operatively linked to transcriptional and translational expression control sequences that are functional in the chosen expression host. Preferably, the expression control sequences, and the gene of interest, will be contained in an expression vector that further comprises a bacterial selection marker and origin of replication. If the expression host is a eukaryotic cell, the expression vector should further comprise an expression marker useful in the eukaryotic expression host.
The DNA sequences encoding the polypeptides of this invention may or may not encode a signal sequence.
If the expression host is eukaryotic, it generally is preferred that a signal sequence be encoded so that the mature protein is secreted from the eukaryotic host.
An amino terminal methionine may or may not be present on the expressed polypeptides of this invention.
If the terminal methionine is not cleaved by the expression host, it may, if desired, be chemically removed by standard techniques.
A wide variety of expression host/vector combinations may be employed in expressing the DNA sequences of this invention. Useful expression vectors for eukaryotic hosts include, for example, vectors comprising expression control sequences from SV40, bovine papilloma virus, adenovirus, adeno-associated virus, cytomegalovirus, and retroviruses. Useful expression vectors for bacterial hosts include bacterial plasmids, such as those from E. coli, including pBluescript, pGEX2T, pUC vectors, col E1, pCR1, pBR322, pMB9 and their derivatives, wider host range plasmids, such as RP4, phage DNAs, the numerous derivatives of phage lambda, e.g., λgt10 and λgt11, NM989, and other DNA phages, such as M13 and filamentous single stranded DNA phages. Useful expression vectors for yeast cells include the 2μ plasmid and derivatives thereof. Useful vectors for insect cells include pVL 941.
In addition, any of a wide variety of expression control sequences may be used in these vectors to express the DNA sequences of this invention. Useful expression control sequences include the expression control sequences associated with structural genes of the foregoing expression vectors. Examples of useful expression control sequences include the early and late promoters of SV40 or adenovirus, the lac system, the trp system, the TAC or TRC system, the T3 and T7 promoters, the major operator and promoter regions of phage lambda, the control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast alpha-mating system and other constitutive and inducible promoter sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. The T7 RNA polymerase promoter (Φ10) is particularly useful in the expression of HSP72 in E. coli (Example 3).
Host cells transformed with the foregoing vectors form a further aspect of this invention. A wide variety of unicellular host cells are useful in expressing the DNA sequences of this invention. These hosts may include well known eukaryotic and prokaryotic hosts, such as strains of E. coli, Pseudomonas, Bacillus, Streptomyces, fungi, yeast, insect cells such as Spodoptera frugiperda (SF9), animal cells such as CHO and mouse cells, African green monkey cells such as COS 1, COS 7, BSC 1, BSC 40, and BMT 10, human cells, and plant cells in tissue culture. Preferred host organisms include bacteria such as E. coli and B. subtilis, and mammalian cells in tissue culture.
It should of course be understood that not all vectors and expression control sequences will function equally well to express the DNA sequences of this invention. Neither will all hosts function equally well with the same expression system. However, one of skill in the art may make a selection among these vectors, expression control sequences and hosts without undue experimentation and without departing from the scope of this invention. For example, in selecting a vector, the host must be considered because the vector must replicate in it. The vector's copy number, the ability to control that copy number, and the expression of any other proteins encoded by the vector, such as antibiotic markers, should also be considered. In selecting an expression control sequence, a variety of factors should also be considered.
These include, for example, the relative strength of the sequence, its controllability, and its compatibility with the DNA sequences of this invention, particularly as regards potential secondary structures. Unicellular hosts should be selected by consideration of their compatibility with the chosen vector, the toxicity of the product coded for by the DNA sequences of this invention, their secretion characteristics, their ability to fold the protein correctly, their fermentation or culture requirements, and the ease of purification from them of the products coded for by the DNA sequences of this invention. Within these parameters, one of skill in the art may select various vector/expression control sequence/host combinations that will express the DNA sequences of this invention on fermentation or in large scale animal culture.
The polypeptides encoded by the DNA sequences of this invention may be isolated from the fermentation or cell culture and purified using any of a variety of conventional methods including: liquid chromatography such as normal or reversed phase, using HPLC, FPLC and the like; affinity chromatography (such as with inorganic ligands or monoclonal antibodies); size exclusion chromatography; immobilized metal chelate chromatography; gel electrophoresis; and the like. One of skill in the art may select the most appropriate isolation and purification techniques without departing from the scope of this invention.
In addition, the polypeptides of this invention may be generated by any of several chemical techniques.
For example, they may be prepared using the solid-phase synthetic technique originally described by R. B. Merrifield, "Solid Phase Peptide Synthesis. I. The Synthesis Of A Tetrapeptide", J. Am. Chem. Soc., 85, pp. 2149-54 (1963), or they may be prepared by synthesis in solution. A summary of peptide synthesis techniques may be found in E. Gross & H. J. Meienhofer, 4 The Peptides: Analysis, Synthesis, Biology; Modern Techniques Of Peptide And Amino Acid Analysis, John Wiley & Sons, (1981) and M. Bodanszky, Principles Of Peptide Synthesis, Springer-Verlag (1984).
The preferred compositions and methods of this invention comprise polypeptides having enhanced immunogenicity. Such polypeptides may result when the native forms of the polypeptides or fragments thereof are modified or subjected to treatments to enhance their immunogenic character in the intended recipient.
Preferred polypeptides are fragments that are specific to Streptococcal species, such as fragments selected from the C-terminal portion of the native polypeptides. Numerous techniques are available and well known to those of skill in the art which may be used, without undue experimentation, to substantially increase the immunogenicity of the polypeptides herein disclosed. For example, the polypeptides may be modified by coupling to dinitrophenol groups or arsanilic acid, or by denaturation with heat and/or SDS. Particularly if the polypeptides are small polypeptides synthesized chemically, it may be desirable to couple them to an immunogenic carrier. The coupling, of course, must not interfere with the ability of either the polypeptide or the carrier to function appropriately. For a review of some general considerations in coupling strategies, see Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, ed. E. Harlow and D. Lane (1988). Useful immunogenic carriers are well known in the art. Examples of such carriers are keyhole limpet hemocyanin (KLH); albumins such as bovine serum albumin (BSA) and ovalbumin; PPD (purified protein derivative of tuberculin); red blood cells; tetanus toxoid; cholera toxoid; agarose beads; activated carbon; or bentonite.
Modification of the amino acid sequence of the polypeptides disclosed herein in order to alter the lipidation state is also a method which may be used to increase their immunogenicity and biochemical properties.
For example, the polypeptides or fragments thereof may be expressed with or without the signal sequences that direct addition of lipid moieties.
In accordance with this invention, derivatives of the polypeptides may be prepared by a variety of methods, including by in vitro manipulation of the DNA encoding the native polypeptides and subsequent expression of the modified DNA, by chemical synthesis of derivatized DNA sequences, or by chemical or biological manipulation of expressed amino acid sequences.
For example, derivatives may be produced by substitution of one or more amino acids with a different natural amino acid, an amino acid derivative or non-native amino acid, conservative substitution being preferred; e.g., 3-methylhistidine may be substituted for histidine, 4-hydroxyproline may be substituted for proline, 5-hydroxylysine may be substituted for lysine, and the like.
Causing amino acid substitutions which are less conservative may also result in desired derivatives, e.g., by causing changes in charge, conformation and other biological properties. Such substitutions would include, for example, substitution of a hydrophilic residue for a hydrophobic residue, substitution of a cysteine or proline for another residue, substitution of a residue having a small side chain for a residue having a bulky side chain or substitution of a residue having a net positive charge for a residue having a net negative charge. When the result of a given substitution cannot be predicted with certainty, the derivatives may be readily assayed according to the methods disclosed herein to determine the presence or absence of the desired characteristics.
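The distinction drawn above between conservative and less conservative substitutions can be illustrated with a small sketch. The residue grouping below is a common textbook simplification chosen for illustration; it is an assumption and is not taken from this specification, and real substitution effects must be assayed as the text describes. All names are hypothetical.

```python
# Simplified physicochemical classes for the 20 standard amino acids
# (one-letter codes). This grouping is an illustrative assumption.
RESIDUE_CLASS = {
    **dict.fromkeys("AVLIMFW", "hydrophobic"),
    **dict.fromkeys("STNQY", "polar"),
    **dict.fromkeys("KRH", "positive"),
    **dict.fromkeys("DE", "negative"),
    # Cysteine, glycine and proline often have special structural roles.
    **dict.fromkeys("CGP", "special"),
}


def classify_substitution(original: str, replacement: str) -> str:
    """Label a single-residue substitution under the simplified grouping."""
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return "identical"
    a, b = RESIDUE_CLASS[original], RESIDUE_CLASS[replacement]
    if a == b and a != "special":
        return "conservative"
    return "non-conservative"


for pair in [("D", "E"), ("K", "E"), ("L", "P"), ("I", "V")]:
    print(pair, classify_substitution(*pair))
```

Under this grouping, a charge reversal (K to E) or the introduction of proline (L to P) is flagged as non-conservative, matching the kinds of substitutions the paragraph above singles out.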
The polypeptides may also be prepared with the objective of increasing stability or rendering the molecules more amenable to purification and preparation.
One such technique is to express the polypeptides as fusion proteins comprising other S. pneumoniae or non-S. pneumoniae sequences. It is preferred that the fusion proteins comprising the polypeptides of this invention be produced at the DNA level, by constructing a nucleic acid molecule encoding the fusion, transforming host cells with the molecule, inducing the cells to express the fusion protein, and recovering the fusion protein from the cell culture. Alternatively, the fusion proteins may be produced after gene expression according to known methods.
An example of a fusion protein according to this invention is the FucI/HSP72 (C-169) protein of Example 3, infra.
The polypeptides of this invention may also be part of larger multimeric molecules which may be produced recombinantly or may be synthesized chemically. Such multimers may also include the polypeptides fused or coupled to moieties other than amino acids, including lipids and carbohydrates.
The polypeptides of this invention are particularly well-suited for the generation of antibodies and for the development of a protective response against disease. Accordingly, in another aspect of this invention, we provide antibodies, or fragments thereof, that are immunologically reactive with HSP72. The antibodies of this invention are either elicited by immunization with HSP72 or an immunologically related polypeptide, or are identified by their reactivity with HSP72 or an immunologically related polypeptide. It should be understood that the antibodies of this invention are not intended to include those antibodies which are normally elicited in an animal upon infection with naturally occurring S. pneumoniae and which have not been removed from or altered within the animal in which they were elicited.
The antibodies of this invention may be intact immunoglobulin molecules or fragments thereof that contain an intact antigen binding site, including those fragments known in the art as Fab, Fab' and F(ab')2. The antibodies may also be genetically engineered or synthetically produced. The antibody or fragment may be of animal origin, specifically of mammalian origin, and more specifically of murine, rat, monkey or human origin.
It may be a natural antibody or fragment, or if desired, a recombinant antibody or fragment. The antibody or antibody fragments may be of polyclonal, or preferably, of monoclonal origin. They may be specific for a number of epitopes but are preferably specific for one.
Specifically preferred are the monoclonal antibodies F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 of Example 2, infra. One of skill in the art may use the polypeptides of this invention to produce other monoclonal antibodies which could be screened for their ability to confer protection against S. pneumoniae, S. pyogenes, S. agalactiae or other Streptococcal related bacterial infection when used to immunize naive animals. Once a given monoclonal antibody is found to confer protection, the particular epitope that is recognized by that antibody may then be identified. Methods to produce polyclonal and monoclonal antibodies are well known to those of skill in the art. For a review of such methods, see Antibodies, A Laboratory Manual, supra, and D.E. Yelton, et al., Ann. Rev. of Biochem., 50, pp. 657-80 (1981). Determination of immunoreactivity with a polypeptide of this invention may be made by any of several methods well known in the art, including by immunoblot assay and ELISA.
An antibody of this invention may also be a hybrid molecule formed from immunoglobulin sequences from different species (e.g., mouse and human) or from portions of immunoglobulin light and heavy chain sequences from the same species. It may be a molecule that has multiple binding specificities, such as a bifunctional antibody prepared by any one of a number of techniques known to those of skill in the art including: the production of hybrid hybridomas; disulfide exchange; chemical crosslinking; addition of peptide linkers between two monoclonal antibodies; the introduction of two sets of immunoglobulin heavy and light chains into a particular cell line; and so forth. The antibodies of this invention may also be human monoclonal antibodies, for example those produced by immortalized human cells, by SCID-hu mice or other non-human animals capable of producing "human" antibodies, or by the expression of cloned human immunoglobulin genes.
In sum, one of skill in the art, provided with the teachings of this invention, has available a variety of methods which may be used to alter the biological properties of the antibodies of this invention including methods which would increase or decrease the stability or half-life, immunogenicity, toxicity, affinity or yield of a given antibody molecule, or to alter it in any other way that may render it more suitable for a particular application.
The polypeptides, DNA sequences and antibodies of this invention are useful in prophylactic, therapeutic and diagnostic compositions for preventing, treating and diagnosing disease.
Standard immunological techniques may be employed with the polypeptides and antibodies of this invention in order to use them as immunogens and as vaccines. In particular, any suitable host may be injected with a pharmaceutically effective amount of polypeptide to generate monoclonal or polyvalent antibodies or to induce the development of a protective immunological response against disease. Preferably, the polypeptide is selected from the group consisting of HSP72 (SEQ ID NO:5), HSP70 (SEQ ID NO:20 and SEQ ID NO:22) or fragments thereof.
As used herein, a "pharmaceutically effective amount" of a polypeptide or of an antibody is the amount that, when administered to a patient, elicits an immune response that is effective to prevent or lessen the severity of Streptococcal or related bacterial infections.
The administration of the polypeptides or antibodies of this invention may be accomplished by any of the methods described in Example 10, infra, or by a variety of other standard procedures. For a detailed discussion of such techniques, see Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, ed. E. Harlow and D. Lane (1988). Preferably, if a polypeptide is used, it will be administered with a pharmaceutically acceptable adjuvant, such as complete or incomplete Freund's adjuvant, RIBI (muramyl dipeptides) or ISCOM (immunostimulating complexes). Preferably, the composition will include a water-in-oil emulsion or aluminum hydroxide as adjuvant and will be administered intramuscularly. The vaccine composition may be administered to the patient at one time or over a series of treatments. The most effective mode of administration and dosage regimen will depend upon the level of immunogenicity, the particular composition and/or adjuvant used for treatment, the severity and course of the expected infection, previous therapy, the patient's health status and response to immunization, and the judgment of the treating physician. For example, in an immunocompetent patient, the more highly immunogenic the polypeptide, the lower the dosage and necessary number of immunizations. Similarly, the dosage and necessary treatment time will be lowered if the polypeptide is administered with an adjuvant.
Generally, the dosage will consist of an initial injection, most probably with adjuvant, of about 0.01 to mg, and preferably 0.1 to 1.0 mg, HSP72 antigen per patient, followed most probably by one or more booster injections. Preferably, boosters will be administered at about 1 and 6 months after the initial injection.
Any of the polypeptides of this invention may be used in the form of a pharmaceutically acceptable salt.
Suitable acids and bases which are capable of forming salts with the polypeptides of the present invention are well known to those of skill in the art, and include inorganic and organic acids and bases.
To screen the polypeptides and antibodies of this invention for their ability to confer protection against diseases caused by S. pneumoniae or related bacteria, or their ability to lessen the severity of such infection, one of skill in the art will recognize that a number of animal models may be used. Any animal that is susceptible to infection with S. pneumoniae or related bacteria may be useful. The Balb/c mice of Example infra, are the preferred animal model for active immunoprotection screening, and the severe-combined immunodeficient mice of Example 5 are the preferred animal model for passive screening. Thus, by administering a particular polypeptide or antibody to these animal models, one of skill in the art may determine without undue experimentation whether that polypeptide or antibody would be useful in the methods and compositions claimed herein.
According to another embodiment of this invention, we describe a method which comprises the steps of treating a patient with a vaccine comprising a pharmaceutically effective amount of any of the polypeptides of this invention in a manner sufficient to prevent or lessen the severity, for some period of time, of Streptococcal or related bacterial infection. Again, the preferred polypeptide for use in such methods is HSP70/HSP72, or fragments thereof.
The polypeptides, DNA sequences and antibodies of this invention may also form the basis for diagnostic methods and kits for the detection of pathogenic organisms. Several diagnostic methods are possible. For example, this invention provides a method for the detection of Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus agalactiae or related bacteria in a biological sample comprising the steps of: isolating the biological sample from a patient; incubating an antibody of this invention, or fragment thereof, with the biological sample to form a mixture; and detecting specifically bound antibody or fragment in the mixture which indicates the presence of Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus agalactiae or related bacteria. Preferred antibodies for use in this method include monoclonal antibodies F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4.
Alternatively, this invention provides a method for the detection of antibodies specific to Streptococcus pneumoniae or related bacteria in a biological sample comprising: isolating the biological sample from a patient; incubating a polypeptide of this invention or fragment thereof, with the biological sample to form a mixture; and detecting specifically bound polypeptide in the mixture which indicates the presence of antibodies specific to Streptococcus pneumoniae or related bacteria.
HSP72 (SEQ ID NO:5), the C-169 fragment thereof (residues 439-607 of SEQ ID NO:5), the C-151 fragment thereof (residues 457-607 of SEQ ID NO:5) and peptide fragments GFDAERDAAQAALDD (residues 527-541 of SEQ ID NO:5) and AEGAQATGNAGDDVV (residues 586-600 of SEQ ID NO:5) are the preferred polypeptide and fragments in the above method for the detection of antibodies.
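The residue ranges quoted for the preferred fragments can be cross-checked against the fragment names and peptide lengths. The sketch below assumes 1-based inclusive residue numbering and uses the two 15-residue peptide sequences as listed for the C-169 region of HSP72; the helper names are illustrative only.

```python
# Residue ranges as quoted in the text (assumed 1-based, inclusive).
FRAGMENTS = {
    "C-169": (439, 607),
    "C-151": (457, 607),
    "GFDAERDAAQAALDD": (527, 541),
    "AEGAQATGNAGDDVV": (586, 600),
}


def span_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


def expected_length(name: str) -> int:
    """'C-169' implies 169 residues; a peptide implies its own length."""
    if name.startswith("C-"):
        return int(name.split("-")[1])
    return len(name)


def check_fragments(fragments: dict) -> dict:
    """Map each fragment to (span length, expected length, consistent?)."""
    report = {}
    for name, (start, end) in fragments.items():
        length = span_length(start, end)
        report[name] = (length, expected_length(name), length == expected_length(name))
    return report


for name, result in check_fragments(FRAGMENTS).items():
    print(name, result)
```

All four ranges are internally consistent under this convention, and both peptides fall inside the C-169 span (residues 439-607).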
One of skill in the art will recognize that these diagnostic tests may take several forms, including an enzyme-linked immunosorbent assay (ELISA), a radioimmunoassay or a latex agglutination assay.
The diagnostic agents may be included in a kit which may also comprise instructions for use and other appropriate reagents, preferably a means for detecting when the polypeptide or antibody is bound. For example, the polypeptide or antibody may be labeled with a detection means that allows for the detection of the polypeptide when it is bound to an antibody, or for the detection of the antibody when it is bound to S. pneumoniae or related bacteria. The detection means may be a fluorescent labeling agent such as fluorescein isocyanate (FIC), fluorescein isothiocyanate (FITC), and the like, an enzyme, such as horseradish peroxidase (HRP), glucose oxidase or the like, a radioactive element such as 125I or 51Cr that produces gamma ray emissions, or a radioactive element that emits positrons which produce gamma rays upon encounters with electrons present in the test solution, such as 11C, 15O, or 13N. Binding may also be detected by other methods, for example via avidin-biotin complexes. The linking of the detection means is well known in the art. For instance, monoclonal antibody molecules produced by a hybridoma may be metabolically labeled by incorporation of radioisotope-containing amino acids in the culture medium, or polypeptides may be conjugated or coupled to a detection means through activated functional groups.
The DNA sequences of this invention may be used to design DNA probes for use in detecting the presence of Streptococcus pneumoniae or related bacteria in a biological sample. The probe-based detection method of this invention comprises the steps of: isolating the biological sample from a patient; incubating a DNA probe having a DNA sequence of this invention with the biological sample to form a mixture; and detecting specifically bound DNA probe in the mixture which indicates the presence of Streptococcus pneumoniae or related bacteria.
The DNA probes of this invention may also be used for detecting circulating nucleic acids in a sample, for example using a polymerase chain reaction, as a method of diagnosing Streptococcus pneumoniae or related bacterial infections. The probes may be synthesized using conventional techniques and may be immobilized on a solid phase, or may be labeled with a detectable label. A preferred DNA probe for this application is an oligomer having a sequence complementary to at least about 6 contiguous nucleotides of HSP72 (SEQ ID NO:4, nucleotides 682-2502).
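The complementarity criterion for a probe (a sequence complementary to at least about 6 contiguous nucleotides of the target) can be sketched computationally. In the example below, the target string is a short hypothetical sequence invented for illustration and is not taken from SEQ ID NO:4; the function names and the exact-match criterion are likewise assumptions, and real probe design would also consider hybridization conditions.

```python
# Watson-Crick complement table for DNA (upper case only).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_complementary(probe: str, target: str, min_len: int = 6) -> bool:
    """True if the probe can pair with at least `min_len` contiguous
    nucleotides of the target strand (exact matches only)."""
    rc = reverse_complement(probe)
    target = target.upper()
    # Slide a window of min_len over the probe's reverse complement and
    # look for it in the target; an empty range yields False.
    return any(
        rc[i:i + min_len] in target
        for i in range(len(rc) - min_len + 1)
    )


# Hypothetical target fragment, for illustration only.
toy_target = "ATGGCTAAAGACCTGATT"
probe = reverse_complement("GCTAAAGAC")  # designed against the toy target

print(probe)
print(is_complementary(probe, toy_target))
print(is_complementary("TTTTTTTT", toy_target))
```

A probe shorter than `min_len` can never satisfy the criterion in this sketch, which is why the window loop simply produces no candidates in that case.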
The polypeptides of this invention may also be used to purify antibodies directed against epitopes present on the protein, for example, using immunoaffinity purification of antibodies on an antigen column.
The antibodies or antibody fragments of this invention may be used to prepare substantially pure proteins according to the invention, for example, using immunoaffinity purification of the proteins on an antibody column.
EXAMPLES
In order that this invention may be better understood, the following examples are set forth. These examples are for purposes of illustration only, and are not to be construed as limiting the scope of the invention in any manner.
Example 1 describes the identification of HSP72, an immunoreactive heat shock protein according to the invention. Example 2 describes the isolation of monoclonal antibodies against epitopes of HSP72. Example 3 describes the preparation of recombinant HSP72 and fragments of HSP72 according to the invention. Example 4 describes the antigenic specificity and immunoreactivity of monoclonal antibodies directed against HSP72, and the identification of immunologically related proteins according to the invention. Example 5 describes processes for obtaining substantially pure HSP72, and the use of HSP72 or antibodies against it to protect against experimental S. pneumoniae infection. Example 6 describes the preparation of recombinant C-151 fragment of HSP72 according to the invention. Example 7 describes the humoral immune response following the immunization with recombinant HSP72 or fragments of HSP72 according to the invention. Example 8 describes the localization of linear B-cell epitopes on the HSP72. Example 9 describes the genes and HSP70 proteins from S. agalactiae and S. pyogenes. Example 10 describes the use of HSP72 antigen in a human vaccine.
EXAMPLE 1
Identification of Immunoreactive S. pneumoniae Heat Shock Proteins
A. Procedures
Unless otherwise noted, the following procedures were used throughout the Examples herein.
1. Bacteria
S. pneumoniae strains were provided by the Laboratoire de la Sante Publique du Quebec, Sainte-Anne de Bellevue. S. pneumoniae strains included type 4 strain 53 and type 6 strain 64. If not specified, S. pneumoniae type 6 strain 64 was used. Bacterial strains were grown overnight at 37°C in 5% CO2 on chocolate agar plates.
2. Antigen Preparations
Various S. pneumoniae antigens were prepared for immunization and immunoassays. Heat-killed whole cell antigens were obtained by incubating bacterial suspensions in a water bath prewarmed at 56°C for 20 minutes.
Detergent-soluble proteins were extracted from S. pneumoniae as follows. Heat-killed bacteria were suspended in 10 mM Hepes buffer (4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid) (Boehringer Mannheim GmbH, Germany) at pH 7.4 and sonicated at 20,000 Hz, four times for 30 seconds. Intact cells and large debris were removed by centrifugation at 1,700 g for 20 minutes. The supernatant was collected and centrifuged at 100,000 g for 60 minutes. The pellet was resuspended in 1 ml of Hepes buffer, and 1 ml of 2% N-lauroyl sarcosine (Sigma Chemical Co., St. Louis, Mo.) was added. The mixture was incubated for 30 minutes at room temperature and the detergent-soluble fraction was harvested by centrifugation at 100,000 g for 60 minutes.
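The working detergent concentration in the extraction step follows from the volumes given (1 ml of Hepes buffer plus 1 ml of 2% N-lauroyl sarcosine). A minimal sketch of the dilution arithmetic, assuming that volumes are additive and that the pellet itself contributes negligible volume (both assumptions, not statements from the procedure):

```python
def final_percent(stock_percent: float, stock_ml: float, diluent_ml: float) -> float:
    """Concentration after mixing a stock solution with diluent,
    assuming additive volumes (C1 * V1 = C2 * V2)."""
    return stock_percent * stock_ml / (stock_ml + diluent_ml)


# 1 ml of 2% N-lauroyl sarcosine added to a pellet resuspended in 1 ml of buffer.
sarcosine_final = final_percent(stock_percent=2.0, stock_ml=1.0, diluent_ml=1.0)
print(f"final N-lauroyl sarcosine concentration: {sarcosine_final}%")
```

Under these assumptions the extraction is carried out in approximately 1% N-lauroyl sarcosine.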
3. Heat Shock Treatment
S. pneumoniae bacteria (type 4, strain 53 and type 6, strain 64) were resuspended in Eagle's Minimal Essential Medium lacking methionine (ICN Biomedicals Inc., Costa Mesa, CA) and supplemented with 1% BIO-X® (Quelab Laboratories, Montreal, Canada) for 15 minutes at 37°C and then divided into fractions of equal volume. The samples were incubated at either 37°C or 45°C for 5 minutes and then labeled with 100 μCi/ml [35S]methionine (ICN) for or 60 minutes at 37°C. The bacteria were harvested and cell extracts were prepared using Tris-HCl lysis buffer as described above, or SDS-PAGE sample buffer.
4. Immunization of Mice
Female Balb/c mice (Charles River Laboratories, St-Constant, Quebec, Canada) were immunized with S. pneumoniae antigens. Immune sera to S. pneumoniae type 6 strain 64 were obtained from mice immunized, at two-week intervals, by subcutaneous injections of 10^7 heat-killed bacteria or 20 µg of detergent-soluble pneumococcal proteins adsorbed to aluminum hydroxide adjuvant (Alhydrogel®; Cedarlane Laboratories Ltd., Hornby, Ontario, Canada). Blood samples were collected prior to immunization and at seven days following the first and second immunizations.
5. SDS-PAGE and Immunoassays
Cell extracts were prepared for SDS-PAGE, Western blot analysis and radioimmunoprecipitation assay by incubating bacterial suspensions in Tris-HCl lysis buffer (50 mM Tris, 150 mM NaCl, 0.1% Na dodecyl sulfate, Na deoxycholate, 2% Triton® X-100, 100 µg/ml phenylmethylsulfonylfluoride, and 2 µg/ml aprotinin) at pH 8.0 for 30 minutes on ice. Lysed cells were cleared by centrifugation and the supernatants were aliquoted and kept frozen at -70°C.
SDS-PAGE was performed on a 10% polyacrylamide gel according to the method of Laemmli [Nature, 227, pp. 680-685 (1970)], using the Mini Protean® system (Bio-Rad Laboratories Ltd., Mississauga, Canada). Samples were denatured by boiling for 5 minutes in sample buffer containing 2% 2-mercaptoethanol. Proteins were visualized by staining the polyacrylamide gel with PhastGel Blue® (Pharmacia Biotech Inc., Baie d'Urfe, Canada). The radiolabeled products were visualized by fluorography.
Fluorograms were scanned using a laser densitometer.
Immunoblot procedures were performed according to the method of Towbin et al. [Proc. Natl. Acad. Sci.
USA, 76, pp. 4350-4354 (1979)]. The detection of antigens reactive with antibodies was performed by an indirect antibody immunoassay using peroxidase-labeled anti-mouse immunoglobulins and the o-dianisidine color substrate.
Radioimmunoprecipitation assays were performed as described by J.A. Wiley et al. [J. Virol., 66, pp. 5744-5751 (1992)]. Briefly, sera or hybridoma culture supernatants were added to radiolabeled samples containing equal amounts of [35S]methionine. The mixtures were allowed to incubate for 90 minutes at 4°C with constant agitation. The immune complexes were then precipitated with bovine serum albumin-treated protein A Sepharose (Pharmacia) for 1 hour at 4°C. The beads were pelleted and washed three times in Tris buffered saline at pH and the antigen complexes were then dissociated by boiling in sample buffer. The antigens were analyzed by electrophoresis on SDS-PAGE. The gels were fixed, enhanced for fluorography using Amplify® (Amersham Canada Limited, Oakville, Ontario, Canada), dried, and then exposed to X-ray film.
B. Characterization of the Heat Shock Response in S. pneumoniae
We studied the heat shock response of S. pneumoniae by examining the pattern of protein synthesis before and after a shift from 37°C to 45°C. FIG. 1 shows the results when S. pneumoniae type 6 strain 64 (panel A) and type 4 strain 53 (panel B) were grown at 37°C, incubated at 37°C (lanes 1, 3, 5, 7 and 9) or at 45°C (lanes 2, 4, 6, 8 and 10) for 5 minutes, and then labeled with [35S]methionine for 10 minutes (lanes 1, 2 and 7, 8), 30 minutes (lanes 3, 4 and 9, 10), or 60 minutes (lanes 5, 6).
The fluorogram derived from SDS-PAGE indicated that the synthesis of at least three proteins was increased by increasing the temperature (FIG. 1). The most prominent induced protein was about 72 kDa (HSP72), whereas the other two were approximately 80 kDa (HSP80) and 62 kDa (HSP62). Increased protein synthesis was already apparent after 10 minutes of labeling (FIG. 1, lanes 1, 2 and 7, 8) and became more pronounced when the labeling period was prolonged to 30 minutes (FIG. 1, lanes 3, 4 and 9, 10) and 60 minutes (FIG. 1, lanes 5, 6).
The effect of elevated temperature on the protein synthesis profile of two different S. pneumoniae strains was similar, with HSPs of similar molecular mass being synthesized (compare Panel A (type 6 strain 64) to Panel B (type 4 strain 53) in FIG. 1).
Analysis of the densitometric tracings from scanning the protein synthesis profiles allowed the estimation of the relative amounts of proteins. For example, with respect to heat-shocked S. pneumoniae type 6 strain 64, after 10 minutes of labeling, HSP80 and HSP62 made up 2.9% and 6.8% of the labeled proteins, respectively, compared to less than 0.1% at 37°C (FIG. 2). Labeled proteins having an apparent molecular mass of 72 kDa were detected under both the 37°C and 45°C conditions (FIG. 2). Radioimmunoprecipitation analysis revealed, however, that HSP72 was undetectable at 37°C (supra; and FIGS. 3, 4 and 6), thus indicating that peak 9 from FIG. 2 corresponds to protein component(s) comigrating with HSP72. Assuming no variation in the labeling of this material, these results would suggest that HSP72 represents 8.7% of the total labeled cell protein after heat shock treatment. A comparison of the densitometric tracings revealed that cellular proteins corresponding to peaks 4, 10, 13, 17, 19, and 21 were synthesized at almost the same rate irrespective of heat shock treatment (FIG. 2). However, the synthesis of several proteins (peaks 1, 2, 3, 15, 20, 22, 24, and 26) declined considerably in response to heat shock (FIG. 2).
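The relative amounts quoted above are peak areas expressed as a percentage of the total integrated signal in a lane. A minimal Python sketch of that calculation follows. The function name, the peak labels and the area values are hypothetical placeholders chosen for illustration; they are not measurements taken from FIG. 2.

```python
def relative_amounts(peak_areas):
    """Return each peak's share of the total lane signal, in percent."""
    total = sum(peak_areas.values())
    if total <= 0:
        raise ValueError("total peak area must be positive")
    return {name: 100.0 * area / total for name, area in peak_areas.items()}


# Hypothetical integrated peak areas in arbitrary densitometer units.
# "other" lumps together every remaining peak in the lane.
example_lane = {"HSP80": 29.0, "HSP72": 87.0, "HSP62": 68.0, "other": 816.0}

shares = relative_amounts(example_lane)
for name, pct in shares.items():
    print(f"{name}: {pct:.1f}% of labeled protein")
```

Because a comigrating component can contribute to a peak (as noted for peak 9), such percentages are only as reliable as the assumption that each peak corresponds to a single protein.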
C. Immune Responses to S. pneumoniae HSPs
In order to assess the antibody response to pneumococcal HSPs, mouse sera were first assayed by radioimmunoprecipitation. The repertoire of labeled proteins recognized by sera from mice immunized with S. pneumoniae antigen preparations is shown in FIGS. 3 and 4. FIG. 3 relates to detergent-soluble protein preparations. FIG. 4 relates to heat-killed bacterial preparations. Although many bands were detected by most antisera, HSP72 was a major precipitation product. The specificity of antibodies for HSP72 was demonstrated by the detection of proteins among heat-shocked products only (FIG. 3, lanes 4, 6, 8 and 10; FIG. 4, lanes 4, 6 and 8).
Interestingly, all immunized mice consistently recognized HSP72. The antibodies reactive with HSP72 were not specific to the strain used for immunization, since strong reactivities were observed with heterologous S. pneumoniae HSP72. It should be noted that in addition to HSP72, one serum precipitated a comigrating product labeled at both 37°C and 45°C (FIG. 4, lane ). This 72 kDa product probably corresponds to the component from peak 9 in FIG. 2 and was not detected in immunoblots. HSP62 is another immune target, which was precipitated by some but not all immune sera (FIG. 3, lane 6 and FIG. 4, lanes 4 and ). None of the sera tested reacted with HSP80. No proteins were precipitated when preimmune sera taken from the mice used in this study were tested for the presence of antibodies reactive with the labeled products.
As depicted in FIGS. 3 and 5, antibodies to HSP72 could be detected after one immunization with either detergent-soluble proteins or whole cell extracts of S. pneumoniae. In addition, a marked increase in the antibody response to HSP72 was observed after a second immunization (FIG. 3, compare lanes 4 and 6, and lanes 8 and 10). The immunoblot patterns of 15 mice immunized with heat-killed S. pneumoniae bacteria were remarkably consistent with the results of the previously described radioimmunoprecipitation. Although the antibody response to a variety of proteins varied, HSP72 was a major immunoreactive antigen with 8 positive sera after the first immunization (FIG. 5). Antibodies to HSP72 were detected in 13 out of 15 immune sera tested after the second immunization. Two other prominent antigens having apparent molecular masses of 53.5 and 47 kDa were detected in 5 and 7 sera, respectively (FIG. 5). The 72 kDa-reactive band was confirmed as the pneumococcal HSP72 by using recombinant HSP72 antigens (Example 3, infra) in an immunoblot assay. Preimmune sera failed to detect any pneumococcal proteins.
EXAMPLE 2 Isolation of Monoclonal Antibodies Against Epitopes of HSP72
A. Procedures
1. Immunization of Mice and Fusion
Female Balb/c mice (Charles River Laboratories) were immunized with S. pneumoniae antigens. One set of mice (fusion experiment 1) was immunized by peritoneal injection with 10^7 formalin-killed whole cell antigen from strain MTL suspended in Freund's complete adjuvant, and was boosted at two-week intervals with the same antigen and then with a sonicate from heat-killed bacteria in Freund's incomplete adjuvant. A second group of mice (fusion experiment 2) was immunized three times at three-week intervals with 75 µg of detergent-soluble pneumococcal antigens extracted from strain 64 (type 6) in µg of Quil A adjuvant (Cedarlane Laboratories Ltd., Hornby, Ontario, Canada). Three days before fusion, all mice were injected intraperitoneally with the respective antigen suspended in PBS alone. Hybridomas were produced by fusion of spleen cells with nonsecreting SP2/0 myeloma cells as previously described by J. Hamel et al. [J. Med. Microbiol., 23, pp. 163-170 (1987)]. Specific hybridomas were cloned by sequential limiting dilutions, expanded and frozen in liquid nitrogen. The class, subclass, and light-chain type of MAbs were determined by ELISA as described by D. Martin et al. [Eur. J. Immunol., 18, pp. 601-606 (1988)] using reagents obtained from Southern Biotechnology Associates Inc. (Birmingham, AL).
2. Subcellular Fractionation
Pneumococci were separated into subcellular fractions according to the technique described by Pearce et al. [Mol. Microbiol., 9, pp. 1037-1050 (1993)]. Briefly, S. pneumoniae strain 64 (type 6) was grown in Todd Hewitt broth supplemented with 0.5% yeast extract for 6 hours at 37°C and isolated by centrifugation. Cell pellets were resuspended in 25 mM Tris-HCl pH 8.0, 1 mM EDTA, 1 mM phenylmethylsulphonylfluoride (PMSF) and sonicated for 4 minutes with 15 second bursts. Cellular debris was removed by centrifugation. The bacterial membranes and cytoplasmic contents were separated by centrifugation at 98,000 g for 4 hours. The cytoplasmic (supernatant) and the membrane (pellet) fractions were adjusted to 1 mg protein per ml and subjected to SDS-PAGE and immunoblot analyses.
B. Identification and Characterization of MAbs to the HSP72 of S. pneumoniae
Culture supernatants of hybridomas were initially screened by dot enzyme immunoassay using whole cells from S. pneumoniae strain 65 (type 4) according to the procedures described in D. Martin et al. (supra).
Positive hybridomas were then retested by immunoblotting in order to identify the hybridomas secreting MAbs reactive with HSP72. Of 26 hybridomas with anti-S. pneumoniae reactivity in immunoblot, four were found to recognize epitopes present on a protein band with an apparent molecular mass of 72 kDa. The four hybridomas were designated F1-Pn3.1 (from fusion experiment 1) and F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 (from fusion experiment 2). Isotype analysis revealed that hybridoma F1-Pn3.1 secreted IgG2aκ immunoglobulins, whereas hybridomas F2-Pn3.2, F2-Pn3.3, and F2-Pn3.4 all secreted IgG1κ. The specificity of the MAbs for HSP72 was clearly demonstrated by the lack of radioimmunoprecipitation activity against [35S]methionine-labeled S. pneumoniae proteins obtained from cultures incubated at 37°C and by the immunoprecipitation of a 72 kDa protein from heat shock-derived lysates incubated at 45°C. FIG. 6 (lanes 5 and 6) demonstrates the results obtained for MAb F1-Pn3.1. The same results were obtained with MAbs F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4.
[35S]methionine-labeled lysates from non-heat-shocked and heat-shocked S. pneumoniae cells were electrophoresed on SDS-PAGE gels and then subjected to Western blot analysis with the MAbs. The resulting immunoblots revealed the presence of HSP72 antigen in both samples. FIG. 7, panel A, shows the results obtained for MAb F1-Pn3.1. The same results were obtained with MAbs F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4. Accordingly, the heat shock stress did not significantly increase the reactivity of anti-HSP72 monoclonal antibodies. The fluorograph of the immunoblots, however, clearly showed that the heat shock response had occurred (FIG. 7, panel B). These experiments revealed that the rate of synthesis of S. pneumoniae HSP72 increases in response to heat shock, but that the absolute amounts of HSP72 do not increase after heat shock.
C. Cellular Localization of HSP72
In order to investigate the cellular location of HSP72, S. pneumoniae cell lysates were fractionated by differential centrifugation, resulting in a soluble fraction and a particulate fraction enriched in membrane proteins (supra). Samples containing 15 µg protein of the membrane fraction (lane 1) and the cytoplasmic fraction (lane 2) of S. pneumoniae were electrophoresed on SDS-PAGE, transferred to nitrocellulose and probed with MAb F1-Pn3.1. In the resulting Western blots, HSP72 was found in both fractions, with the majority of the protein associated with the cytoplasmic fraction (FIG. 8).
EXAMPLE 3 Molecular Cloning, Sequencing and Expression of Genes Coding for HSP72 Antigens
A. Procedures
1. Strains and Plasmids
Strains and plasmids used in this study are listed in Table 1.
TABLE 1: BACTERIAL STRAINS, PHAGES AND PLASMIDS
Strain, Phage or Plasmid | Relevant Characteristics | Reference or Source
E. coli Strains
JM109 | Δ(lac-proAB) [F' traD proAB] | BRL
Y1090 | rk- mk- lon supF [pMC9] | Amersham
BL21(DE3) | lacUV5-T7 RNA polymerase | Studier et al. (infra)
Phages
λgt11 | cI857 S100 cloning vector | Amersham
λJBD7 | LacZ-HSP72 fusion; 2.3 kb EcoRI fragment in λgt11 | This study
λJBD17 | FucI-HSP72 chimeric; 2.4 kb EcoRI and 2.3 kb EcoRI fragments in λgt11 | This study
Plasmids
pWSK29 | Ampr; low copy number cloning vector | Wang et al. (infra)
 | same as pWSK29 but opposite multi cloning site | Wang et al. (infra)
pJBD171 | same as λJBD17 but in pWSK29 | This study
pJBD177 | 2.8 kb XhoI-EcoRI fragment in ; no recombinant HSP72 protein expressed | This study
pJBD179 | FucI-HSP72 fusion; 2.4 kb EcoRI and 0.8 kb EcoRI-EcoRV fragments in pWSK29 | This study
pT7-5 | Ampr; T7 promoter φ10 | Tabor et al. (infra)
pT7-6 | same as pT7-5 but opposite multi cloning site | Tabor et al. (infra)
pJBDf51 | same as pJBD179 but in pT7-5 | This study
pJBDf62 | same as pJBD179 but in pT7-6 | This study
pDELTA1 | Ampr; Tn1000 | BRL
pJBDA1 | same as pJBD179 but in pDELTA1 | This study
pJBD291 | HSP72; 3.2 kb HindIII fragment in pWSK29 | This study
pJBDk51 | same as pJBD291 but in pT7-5 | This study
pJBDA4 | same as pJBD291 but in pDELTA1 | This study
E. coli strains were grown in L broth or on L agar at 37°C. When necessary, ampicillin was added to the media at a concentration of 50 µg/ml. Plasmids were isolated by using the Magic/Wizard® Mini-Preps kit (Promega, Fisher Scientific, Ottawa, Canada).
2. General Recombinant DNA Techniques
Restriction endonucleases, T4 DNA ligase, and DNA molecular weight standards were purchased from Boehringer Mannheim Canada, Laval, Quebec or Pharmacia Biotech, Uppsala, Sweden. DNA restriction endonuclease digestion and ligation were performed as described by J. Sambrook et al. [Molecular cloning. A laboratory manual. Cold Spring Harbor Laboratory Press, N.Y.
(1989)]. Agarose gel electrophoresis of DNA fragments was performed following the procedure of J. Sambrook et al.
(supra) using the TAE buffer (0.04 M Tris-acetate; 0.002 M EDTA) from Boehringer Mannheim. DNA fragments were purified from agarose gel by using the Prep-A-Gene® DNA purification kit (Bio-Rad Laboratories Ltd., Mississauga, Ontario). Transformation was carried out by electroporation with the Gene Pulser® (Bio-Rad) following the protocol provided by the manufacturer.
3. Construction and Screening of Genomic Library
A genomic S. pneumoniae DNA library was generated in the bacteriophage expression vector λgt11 (λgt11 cloning system, Amersham) according to the procedure provided by the manufacturer. Chromosomal DNA of S. pneumoniae type 6 strain 64 was prepared by following the procedure of J.C. Paton et al. [Infect. Immun., 54, pp. 50-55 (1986)]. The S. pneumoniae chromosomal DNA was partially digested with EcoRI, and the 4- to 7-kb fragments were fractionated and purified from agarose gel. The fragments were ligated into λgt11 arms, packaged, and the resulting phage mixtures used to infect E. coli Y1090. Immunoscreening of plaques expressing recombinant HSP72 antigens was performed using HSP72-specific monoclonal antibody F1-Pn3.1, supra. Plaque clones expressing peptides recognized by MAb F1-Pn3.1 were isolated and purified. Liquid lysates were prepared and DNA was purified from a Promega LambdaSorb phage adsorbent according to the manufacturer's directions followed by conventional DNA purification procedures.
4. Southern Blot Analysis
The nonradioactive DIG DNA Labelling and Detection kit, obtained from Boehringer Mannheim, was used to perform Southern blot analysis in this example. The DNA fragments selected for use as probes (infra) were purified by agarose gel electrophoresis and then labelled with digoxigenin (DIG)-11-dUTP. Pneumococcal chromosomal DNA was digested with HindIII and the digests were separated by electrophoresis on a 0.8% agarose gel and transferred onto positively charged nylon membranes (Boehringer Mannheim) as described by J. Sambrook et al. (supra). The membrane was then probed with the DIG-labelled DNA probes according to the protocol of the manufacturer.
5. DNA Sequencing and Sequence Analysis
The DNA fragments sequenced in this example were first cloned into plasmid pDELTA 1 (GIBCO BRL Life Technologies, Burlington, Ontario). A series of nested deletions was generated from both strands by in vivo deletion mediated by Tn1000 transposon transposition (Deletion Factory System, GIBCO BRL) following the procedures provided by the supplier. These deletions were sized by agarose gel electrophoresis and appropriate deletion derivatives were selected for sequencing by the dideoxynucleotide chain terminating method of F. Sanger et al. [Proc. Natl. Acad. Sci. USA, 74, pp. 5463-5467 (1977)]. To sequence the gaps between deletion templates, oligonucleotides were synthesized on an oligonucleotide synthesizer 392 (ABI, Applied Biosystems Inc., Foster City, CA). The sequencing reaction was carried out by PCR (DNA Thermal Cycler 480®, Perkin Elmer) using the Taq DyeDeoxy Terminator Cycle Sequencing kit (ABI), and DNA electrophoresis was performed on an automated DNA sequencer 373A (ABI).
6. Expression of Cloned Gene in E. coli T7 RNA pol/promoter system
High level expression of the cloned gene in this example was achieved by employing the bacteriophage T7 RNA polymerase/promoter system in E. coli. The DNA fragment specifying the recombinant protein was ligated into plasmids pT7-5 or pT7-6 [S. Tabor and C.C. Richardson, Proc. Natl. Acad. Sci. USA, 82, pp. 1074-1078 (1985)], in the orientation that placed the gene to be expressed under the control of the phage T7 RNA polymerase specific promoter φ10. The resulting plasmid was transformed into E. coli strain BL21(DE3) [F.W. Studier and B.A. Moffatt, J. Mol. Biol., 189, pp. 113-130 (1986)], which carries the T7 RNA polymerase structural gene on its chromosome under the control of the inducible promoter. Upon IPTG induction, the T7 RNA polymerase induced in the BL21(DE3) transformants specifically transcribed the gene under the control of T7 promoter φ10. The overexpressed recombinant proteins were visualized by either Western blotting or Coomassie Blue staining.
7. N-terminal Amino Acid Sequence Analysis of HSP72
Pneumococcal HSP72 was purified by immunoprecipitation using MAb F1-Pn3.1 (supra) and, as antigen, samples of cell wall extracts of S. pneumoniae strain 64 prepared as described by L.S. Daniels et al. [Microb. Pathogen., 1, pp. 519-531 (1986)]. The immune precipitates were resolved by SDS-PAGE and then transferred to polyvinylidene difluoride (PVDF) membrane by the method of P. Matsudaira [J. Biol. Chem., 262, pp. 10035-10038 (1987)]. The PVDF membrane was stained with Coomassie Blue, and the HSP72 band was excised and then analyzed in an automated protein sequencer (ABI), according to standard procedures.
B. Construction of Plasmids Containing S. pneumoniae HSP72 Gene Fragments Corresponding to C-169
The λgt11 S. pneumoniae genomic DNA library was screened with the HSP72-specific MAb F1-Pn3.1. Seventeen (17) immunoreactive clones were isolated and purified from a total of 1500 phages tested. To confirm the specificity of the proteins expressed by the recombinant phages, Western blot analysis of the recombinant phage lysates was performed. Two groups of clones were identified among the 17 positive clones recognized by MAb F1-Pn3.1, and their representatives were designated λJBD7 and λJBD17 for further characterization. As shown in FIG. 9, whole cell extracts from S. pneumoniae strain 64 (lane 1) and phage lysates from E. coli infected with λJBD17 (lanes 2 and 3) or λJBD7 (lanes 4 and 5) cultured in the presence or absence of IPTG were subjected to 10% polyacrylamide gel electrophoresis and were electrotransferred to nitrocellulose. The immunoblot was probed with HSP72-specific MAb F1-Pn3.1. Clone λJBD17 had two EcoRI-EcoRI insert fragments of 2.4 kb and 2.3 kb (FIG. 10), and expressed a chimeric recombinant protein having an apparent molecular mass of 74 kDa on SDS-PAGE gel (FIG. 9, lanes 2 and 3). Clone λJBD7 was found to contain a 2.3 kb EcoRI insert fragment and produced an apparent fusion protein consisting of LacZ and the 74 kDa chimeric protein expressed from clone λJBD17. The fusion protein had an apparent molecular mass of 160 kDa as estimated by SDS-PAGE (FIG. 9, lane ). The expression of the chimeric recombinant protein encoded by phage λJBD17 was independent of IPTG induction (FIG. 9, lanes 2 and 3), while the expression of the recombinant fusion protein encoded by phage λJBD7 was dependent on induction of the lac promoter (FIG. 9, lanes 4 and 5).
In an attempt to subclone the HSP72 gene, the pneumococcal DNA insert from clone λJBD17 was extracted, purified and ligated into a low copy plasmid pWSK29 [R.F.
Wang and S.R. Kushner, Gene, 100, pp. 195-199 (1991)] to generate plasmid pJBD171. The insert from pJBD171 was characterized by restriction mapping (FIG. 10B), and a series of subcloning and immunoblotting experiments was carried out to define the boundaries of the gene coding for the antigen reactive with MAb F1-Pn3.1. The region responsible for expression of the 74 kDa chimeric protein was found to localize on the 3.2 kb EcoRI-EcoRV fragment, which consists of the intact 2.4 kb EcoRI-EcoRI fragment and the 0.8 kb EcoRI-EcoRV portion of the 2.3 kb EcoRI-EcoRI fragment. The plasmid carrying the 3.2 kb EcoRI-EcoRV insert was designated pJBD179.
C. Expression and DNA Sequence Analysis of a Chimeric Gene Coding for C-169
To determine the transcriptional direction of the gene coding for the 74 kDa chimeric protein on the 3.2 kb EcoRI-EcoRV fragment, and to increase the yield of the 74 kDa chimeric protein for immunological study, we decided to express the 74 kDa chimeric protein in the E. coli T7 RNA polymerase and T7 promoter system. The 3.2 kb EcoRI-EcoRV fragment, derived from pJBD179, was ligated into plasmids pT7-5 and pT7-6, in which the multi-cloning sites are placed in opposite orientations with respect to the T7 RNA polymerase specific T7 promoter φ10. The ligation mixture was used to transform E. coli JM109, and positive transformants reactive with MAb F1-Pn3.1 were identified by the colony lifting method described by J. Sambrook et al. [supra].
The resulting recombinant plasmids, derived from pT7-5 and pT7-6, were designated pJBDf51 and pJBDf62, respectively.
The presence of the intact 3.2 kb EcoRI-EcoRV insert in these recombinant plasmids and its orientation were determined by restriction mapping. To achieve overexpression of the 74 kDa chimeric protein, pJBDf51 and pJBDf62 were transformed, separately, into E. coli BL21(DE3). The transformants were induced with IPTG (1 mM) for 3 hours at 37°C. The cells were harvested, washed, resuspended in 1% SDS and boiled for 10 minutes. The lysates were then used for SDS-PAGE and immunoblot analysis. As expected, both transformants produced the 74 kDa chimeric protein, readily detected by Western blotting with MAb F1-Pn3.1 (FIG. 11). However, under the IPTG induction condition, only transformants BL21(DE3)(pJBDf51) overexpressed the 74 kDa chimeric protein (FIG. 11A and B, lane 2), indicating that the transcriptional direction of the gene on the 3.2 kb EcoRI-EcoRV fragment is from the EcoRI end towards the EcoRV end (FIG. 10).
The 3.2 kb EcoRI-EcoRV fragment was cloned into plasmid pDELTA 1 to yield plasmid pJBDA1. A series of overlapping deletions was generated and used as DNA sequencing templates. The DNA sequence of the entire 3.2 kb EcoRI-EcoRV insert is SEQ ID NO:1. Two open reading frames ("ORFs") were found and their orientation is indicated in FIG. 10B ("ORF27" and "FucI-HSP72"). In front of these two ORFs, putative ribosome-binding sites were identified (SEQ ID NO:1, nucleotides 18-21 and 760-763). No obvious -10 and -35 promoter sequences were detected. ORF27 spans nucleotides 30-755 (SEQ ID NO:1) and encodes a protein of 242 amino acids with a calculated molecular weight of 27,066 daltons. The deduced amino acid sequence of this protein is SEQ ID NO:2. We designated this gene orf27, and compared it to other known sequences. No homologous gene or protein was found. The large ORF (nucleotides 771-2912, SEQ ID NO:1) specifies a protein of 714 amino acids with a predicted molecular mass of 79,238 daltons.
The deduced amino acid sequence of this protein is SEQ ID NO:3. This ORF was compared with other known sequences to determine its relationship to other amino acid sequences. This analysis revealed a high degree of similarity of the encoded protein to the sequence of E. coli fucose isomerase (FucI) and to several HSP70 gene family members, also known as DnaK genes.
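The ORF coordinates quoted for SEQ ID NO:1 can be cross-checked against the stated protein lengths with simple arithmetic. The Python sketch below does this; the function name and data layout are illustrative only. It assumes that the quoted coordinates are 1-based, inclusive, and exclude the stop codon, an assumption that is suggested by the numbers themselves rather than stated in the text.

```python
def orf_length_aa(start, end, includes_stop=False):
    """Residue count of an ORF given 1-based inclusive nucleotide coordinates."""
    n_nt = end - start + 1
    if n_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = n_nt // 3
    return codons - 1 if includes_stop else codons


# (start, end, reported residues, reported mass in daltons), all from the text.
ORFS = {
    "ORF27": (30, 755, 242, 27066),
    "FucI-HSP72": (771, 2912, 714, 79238),
}

for name, (start, end, reported_aa, mass_da) in ORFS.items():
    aa = orf_length_aa(start, end)
    mean_residue_mass = mass_da / aa
    print(f"{name}: {aa} aa (reported {reported_aa}), "
          f"mean residue mass {mean_residue_mass:.1f} Da")
```

A mean residue mass near 110 Da is typical of proteins, so both reported masses are at least plausible for the reported lengths.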
Alignment of SEQ ID NO:3 with the sequences of the E. coli FucI and HSP70 (DnaK) proteins indicated that the N-terminal portion of the 74 kDa chimeric protein, corresponding to amino acids 1 to 545 (SEQ ID NO:3), is highly homologous to E. coli FucI, while the C-terminal portion, corresponding to amino acids 546-714 (SEQ ID NO:3), is similar to HSP70 (DnaK) proteins. It is noteworthy that there is an EcoRI restriction site lying at the junction of these two portions of the gene coding for the 74 kDa protein (SEQ ID NO:1, between nucleotides 2404 and 2405).
Other restriction sites exist between nucleotides 971 and 972 (PstI), nucleotides 1916 and 1917 (PstI), nucleotides 1978 and 1979 (XhoI), and nucleotides 3164 and 3165 (EcoRV). From these data we concluded that the 74 kDa protein was a chimeric protein encoded by two pieces of S. pneumoniae chromosomal DNA, a 2.4 kb EcoRI-EcoRI fragment derived from the FucI homologous gene and a 2.3 kb EcoRI-EcoRI fragment derived from the HSP72 gene.
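The coordinates given above can be related to one another by mapping residue numbers of SEQ ID NO:3 onto nucleotide positions of SEQ ID NO:1. The following Python sketch is a hedged illustration: the helper names are invented here, and it assumes that the large ORF starts at nucleotide 771 and that each restriction site is described by the nucleotide immediately 5' of the cut, as the "between nucleotides n and n+1" wording suggests.

```python
ORF_START = 771      # first nucleotide of the large ORF in SEQ ID NO:1 (from the text)
ORF_LENGTH_AA = 714  # residues in SEQ ID NO:3 (from the text)


def codon_span(residue, orf_start=ORF_START):
    """Return (first, last) nucleotide of the codon for a 1-based residue number."""
    if residue < 1:
        raise ValueError("residue numbering is 1-based")
    first = orf_start + 3 * (residue - 1)
    return first, first + 2


def cut_is_within(cut_after, span):
    """True if a cut between cut_after and cut_after + 1 falls inside the span."""
    first, last = span
    return first <= cut_after < last


last_fuci_codon = codon_span(545)     # last FucI-like residue
first_hsp72_codon = codon_span(546)   # first HSP70-like residue
c169_length = ORF_LENGTH_AA - 546 + 1
ecori_in_junction = cut_is_within(2404, last_fuci_codon)

# Fragment lengths between two cuts, each given as the nucleotide 5' of the cut.
psti_psti_probe_bp = 1916 - 971
ecori_ecorv_probe_bp = 3164 - 2404

print(last_fuci_codon, first_hsp72_codon, c169_length, ecori_in_junction)
print(psti_psti_probe_bp, ecori_ecorv_probe_bp)
```

Under these assumptions the EcoRI cut falls in the codon for residue 545, the HSP70-like portion is 169 residues long (hence "C-169"), and the two computed fragment lengths are consistent with the roughly 1 kb PstI-PstI and 0.8 kb EcoRI-EcoRV fragments named in the text.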
D. Southern Blot Analysis
Southern blotting was performed in order to confirm that the 74 kDa protein is a chimeric protein and to attempt to clone the entire pneumococcal HSP72 gene.
Chromosomal S. pneumoniae DNA was digested with HindIII to completion, separated on a 0.8% agarose gel, and transferred onto two positively charged nylon membranes (Boehringer Mannheim). The membranes were then probed with either the 0.8 kb EcoRI-EcoRV probe, derived from the 2.3 kb EcoRI-EcoRI fragment, or the 1 kb PstI-PstI probe, obtained from the 2.4 kb EcoRI-EcoRI fragment. Both probes had been previously labelled with digoxigenin-dUTP.
These two probes hybridized to two distinct HindIII fragments of different sizes (FIGS. 10B and 10C). The 0.8 kb EcoRI-EcoRV probe recognized the 3.2 kb HindIII fragment and the 1 kb PstI-PstI probe reacted with the 4 kb HindIII fragment. This result further indicated that the gene responsible for the expression of the 74 kDa chimeric protein was generated by the in-frame fusion of two EcoRI fragments, one originating from the fragment containing the 5' portion of the S. pneumoniae FucI homologue, the other derived from the segment carrying the C-169 fragment of the pneumococcal HSP72 gene. The fact that the 0.8 kb EcoRI-EcoRV probe hybridized to a single 3.2 kb fragment suggested that there is only a single HSP72 gene copy in S. pneumoniae.
E. Production of Recombinant HSP72
A partial pneumococcal genomic library was generated by ligation of the pool of HindIII digests of chromosomal DNA, with sizes ranging from 2.8 to 3.7 kb, into plasmid pWSK29/HindIII. The ligation mixture was used to transform E. coli strain JM109 and the transformants were screened by hybridization with the 0.8 kb EcoRI-EcoRV probe. One representative plasmid from four positive hybridizing clones was named pJBD291.
Restriction analysis of the insert and Western blot of the cell lysate of transformants were employed to verify that the plasmid pJBD291 indeed carries the 3.2 kb HindIII fragment containing the HSP72 gene expressing the recombinant HSP72 protein (FIG. 10B). The HSP72 protein expressed by the transformants (pJBD291) migrated on the SDS-PAGE gel at the same position as the native HSP72 protein (FIG. 12). To sequence the entire HSP72 gene and to overexpress the full-length HSP72 protein, the 3.2 kb HindIII fragment was isolated from plasmid pJBD291, and subcloned into plasmids pDELTA 1 and pT7-5 to generate pJBDA4 and pJBDk51, respectively.
The entire 3.2 kb HindIII DNA fragment carried on the plasmid pJBDA4 and the 2.3 kb EcoRI-EcoRI DNA fragment contained on the plasmid pJBD177 were sequenced.
Altogether, the nucleotide sequence comprised 4320 base pairs and revealed two ORFs (SEQ ID NO:4). The first ORF, starting at nucleotide 682 and ending at nucleotide 2502 (SEQ ID NO:4), was identified as the pneumococcal HSP72 gene, and the second ORF, spanning from nucleotide 3265 to nucleotide 4320 (SEQ ID NO:4), was located 764 base pairs downstream from the HSP72 structural gene and was identified as the 5' portion of the pneumococcal DnaJ gene. The putative ribosome binding site ("AGGA") was located 9 base pairs upstream from the start codon of the HSP72 structural gene, while the typical ribosome binding site ("AGGA") was found 66 base pairs upstream from the start codon of the DnaJ structural gene. No typical regulatory region was identified in front of these two genes. Restriction sites are located between nucleotides 1 and 2 (HindIII), nucleotides 1318 and 1319 (EcoRI), nucleotides 1994 and 1995 (EcoRI), nucleotides 3343 and 3344 (HindIII), and nucleotides 4315 and 4316 (EcoRI). The gene organization of HSP72 (DnaK) and DnaJ in S. pneumoniae is similar to that of E. coli [Saito, H. and Uchida, H., Mol. Gen. Genet. 164, 1-8 (1978)] as well as several other Gram positive bacteria [Wetzstein, M. et al., J. Bacteriol. 174, 3300-3310 (1992)]. However, the intergenic region of S. pneumoniae is significantly larger and no ORF for the grpE gene was found upstream of the HSP72 (DnaK) structural gene.
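The restriction sites listed for SEQ ID NO:4 imply specific fragment sizes, which can be compared with the 3.2 kb HindIII and 2.3 kb EcoRI-EcoRI fragments named earlier. The Python sketch below computes them; the function name and data layout are illustrative, and each site is assumed to be described by the nucleotide immediately 5' of the cut.

```python
# (enzyme, nucleotide immediately 5' of the cut) in SEQ ID NO:4, from the text.
SITES = [
    ("HindIII", 1),
    ("EcoRI", 1318),
    ("EcoRI", 1994),
    ("HindIII", 3343),
    ("EcoRI", 4315),
]


def fragments_for(enzyme, sites):
    """Return (first_nt, last_nt, length_bp) for fragments between adjacent cuts."""
    cuts = sorted(pos for name, pos in sites if name == enzyme)
    return [(a + 1, b, b - a) for a, b in zip(cuts, cuts[1:])]


for enzyme in ("HindIII", "EcoRI"):
    for first, last, length in fragments_for(enzyme, SITES):
        print(f"{enzyme}: nt {first}-{last}, {length} bp")
```

Only internal fragments bounded by two listed sites are reported; pieces that run to the ends of the 4320 bp sequence are ignored because their outer boundaries are not restriction sites given in the text.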
The predicted HSP72 protein has 607 amino acids and a calculated molecular mass of 64,755 daltons, as compared to the 72 kDa molecular mass estimated by SDS-PAGE. The predicted HSP72 protein is acidic with an isoelectric point (pI) of 4.35. Automated Edman degradation of the purified native HSP72 protein extracted from S. pneumoniae strain 64 revealed SKIIGIDLGTTN-AVAVLE as the 19 amino acid N-terminal sequence of the protein. The amino-terminal methionine was not detected, presumably due to in situ processing, which is known to occur in many proteins. No amino acid residue was identified at position 13. The 19 amino acid N-terminal sequence obtained from the native HSP72 protein is in full agreement with the 19 amino acid N-terminal sequence deduced from the nucleotide sequence of the recombinant S. pneumoniae HSP72 gene (SEQ ID NO:5), thus confirming the cloning. This N-terminal sequence showed complete identity with the DnaK protein from Lactococcus lactis and 68.4% identity with the DnaK protein from Escherichia coli. Similarly, the alignment of the predicted amino acid sequence of HSP72 (SEQ ID NO:5) with those from other bacterial HSP70 (DnaK) proteins also revealed high homology (FIGS. 13A-13D). For example, HSP72 showed 54% identity with the E. coli DnaK protein. The highest identity value was obtained from comparison with the Gram positive bacterium Lactococcus lactis, showing identity with HSP72. Like other HSP70 proteins of Gram positive bacteria, HSP72 lacks a stretch of 24 amino acids near the amino terminus when compared with DnaK proteins from Gram negative bacteria (FIGS. 13A-13D).
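Identity values such as those quoted above are the fraction of aligned positions carrying the same residue. The Python sketch below shows one simple way to compute this for two pre-aligned sequences of equal length while skipping undetermined positions. The function name is illustrative, and the comparison sequences are hypothetical placeholders, not the SEQ ID NO:5, L. lactis or E. coli sequences; only the 19-residue Edman read is taken from the text.

```python
def percent_identity(seq_a, seq_b, unknown="-"):
    """Percent identical residues over positions determined in both sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    compared = matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == unknown or b == unknown:
            continue  # skip positions where no residue was identified
        compared += 1
        matches += (a == b)
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


edman_read = "SKIIGIDLGTTN-AVAVLE"        # from the text; position 13 undetermined
toy_identical = "SKIIGIDLGTTNXAVAVLE"     # hypothetical; X is a placeholder residue
toy_one_change = "GKIIGIDLGTTNXAVAVLE"    # hypothetical single substitution

print(percent_identity(edman_read, toy_identical))
print(round(percent_identity(edman_read, toy_one_change), 1))
print(round(100 * 13 / 19, 1))
```

The last line shows that, if the quoted 68.4% was computed over all 19 positions, it corresponds to 13 identical residues; whether the original comparison treated position 13 this way is not stated.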
Although HSP72 shares homology with HSP70 (DnaK) proteins from other organisms, it does possess some unique features. Sequence divergence of the HSP70 (DnaK) proteins is largely localized to two regions (residues 244 to 330 and 510 to 607, SEQ ID NO:5). More specifically, the peptide sequences GFDAERDAAQAALDD (residues 527 to 541, SEQ ID NO:5) and AEGAQATGNAGDDVV (residues 586 to 600, SEQ ID NO:5) are exclusive to HSP72. The fact that the C-terminal portion of HSP72 is highly variable suggests that this portion carries antigenic determinants specific to S. pneumoniae. Consistent with this hypothesis, monoclonal antibodies directed against the C-169 fragment of HSP72 (infra) were not reactive with E. coli and S. aureus, which are known to express DnaK proteins similar to HSP72.
The truncated DnaJ protein of S. pneumoniae (SEQ ID NO:6) has 352 amino acids, which show a high degree of similarity with the corresponding portions of the L. lactis DnaJ protein (72% identity) and the E. coli DnaJ protein (51% identity). The predicted truncated DnaJ protein has a high glycine content. Four Gly-, Cys-rich repeats, each with the Cys-X-X-Cys-X-Gly-X-Gly motif characteristic of DnaJ proteins [P. Silver and J.C. Way, Cell, 74, pp. 5-6 (1993)], were identified between amino acids 148 and 212 of the S. pneumoniae DnaJ protein (SEQ ID NO:6). Three repeated GGFGG sequences (residues 75-79, 81-85, and 90-94) were found near the N-terminus.
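The Cys-X-X-Cys-X-Gly-X-Gly motif described above can be located in a protein sequence with a short pattern search. The following is a minimal sketch using a regular expression in which X is taken to be any single residue; the helper name and the toy sequence are hypothetical, and the sequence shown is not SEQ ID NO:6.

```python
import re

# Cys-X-X-Cys-X-Gly-X-Gly, with X = any single one-letter residue code.
CXXCXGXG = re.compile(r"C[A-Z]{2}C[A-Z]G[A-Z]G")


def find_cxxcxgxg(sequence: str):
    """Return (start, end, match) tuples using 1-based, inclusive positions.

    Matches are reported left to right and do not overlap.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in CXXCXGXG.finditer(sequence.upper())
    ]


if __name__ == "__main__":
    toy = "MSCPHCNGTGAKAVCDTCHGRGEE"  # hypothetical sequence with two repeats
    for start, end, hit in find_cxxcxgxg(toy):
        print(start, end, hit)
```

Applied to a real DnaJ sequence, the reported positions could then be compared with the residue range stated in the text.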
F. Reactivity of MAbs Against Recombinant Antigens
The four HSP72-specific MAbs (F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4, supra) were tested for their reactivity against proteins expressed by E. coli infected or transformed with recombinant phages and plasmids containing HSP72 sequences. The four individual MAbs reacted with the lacZ-HSP72 fusion protein expressed by the clone λJBD7, thus localizing the epitopes recognized by these MAbs to the C-terminal 169 residues. Surprisingly, the proteins encoded by the pneumococcal inserts in λJBD17 and pJBDΔ1 were recognized by only 3 of the 4 MAbs. These results suggest that although the C-169 fragments synthesized in E. coli infected with λJBD7 and λJBD17 have the same primary structure, they have distinct conformations. The lack of reactivity of MAb F2-Pn3.2 with some recombinant proteins raised the possibility that this particular MAb recognizes a more complex epitope. Although complex, F2-Pn3.2 epitopes are still recognizable on Western immunoblots. The complete HSP72rec protein expressed by E. coli containing the recombinant plasmid pJBDΔ4 was reactive with all four MAbs.
EXAMPLE 4 Antigenic Specificity and Reactivity of HSP72-Specific Monoclonal Antibodies
The reactivity of MAbs F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 to a collection of bacterial strains, including 20 S. pneumoniae strains representing 16 capsular serotypes (types 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 14, 15, 19, 20, and 22) and the 17 non-pneumococcal bacterial strains listed in Table 2, was tested using a dot enzyme immunoassay as described by D. Martin et al. [supra] and by immunoblotting. For the dot enzyme immunoassay, the bacteria were grown overnight on chocolate agar plates and then suspended in PBS, pH 7.4. A volume of 5 µl of a suspension containing approximately 10^9 CFU/ml was applied to a nitrocellulose paper, blocked with PBS containing 3% bovine serum albumin, and then incubated sequentially with MAbs and peroxidase-labeled secondary antibody. Whole cell extracts were prepared for Western blot analysis by boiling bacterial suspensions in sample buffer for minutes.
TABLE 2: LIST OF NON-PNEUMOCOCCAL ISOLATES TESTED BY DOT ENZYME IMMUNOASSAY
Strain designation   Genus species              Group or type
C-2                  Streptococcus pyogenes     group A
C-3                  Streptococcus agalactiae   group B
C-7                  Enterococcus faecalis      group D
C-9                  Streptococcus bovis        group D
C-14                 Streptococcus mutans
C-19                 Streptococcus salivarius
C-21                 Streptococcus sanguis      I
C-22                 Streptococcus sanguis      I
C-23                 Streptococcus sanguis      I
C-24                 Streptococcus sanguis      II
C-27                 Streptococcus sanguis      II
C-33                 Streptococcus sanguis      II
C-36                 Streptococcus sanguis      II
                     Gemella morbillorum
                     Staphylococcus aureus
                     Bacillus
                     Escherichia coli
When tested by dot enzyme immunoassay, each MAb reacted with each of the S. pneumoniae strains and none of the non-pneumococcal isolates. These results were unexpected since comparison studies revealed that HSP72 is very similar to other known bacterial HSP70 (DnaK) proteins, for example those from E. coli and S. aureus.
Immunoblots were then performed to further investigate the immunoreactivities of our MAbs. As shown in Table 3, each MAb exhibited some reactivity. Although the percent identity of the E. coli amino acid sequence and the HSP72 amino acid sequence (SEQ ID NO:5) is 54%, the four HSP72-specific MAbs did not recognize the E. coli (DnaK) protein. Similarly, the HSP72-specific MAbs did not react with the C. trachomatis HSP70 (DnaK) protein, which has 56% amino acid identity with the amino acid sequence of HSP72. High amino acid sequence homology is observed between HSP72 and the HSP70 (DnaK) proteins from Gram positive bacterial species. However, again, none of the HSP72-specific MAbs reacted with the S. aureus or Bacillus Gram positive species, which exhibit 74% and 76% amino acid sequence homology, respectively, with HSP72.
From these data it is clear that although HSP70 (DnaK) proteins may be structurally related to HSP72, they are immunologically distinct. The non-pneumococcal isolates that reacted with at least one MAb are S. pyogenes, Enterococcus faecalis, S. mutans and S. sanguis, which all belong to the Streptococcus genus or the Streptococcus-related Enterococcus genus. So far, neither the protein nor the gene structure has been identified in these Streptococcus or Enterococcus species. Altogether, these observations indicate that hypervariable amino acid sequences or residues within HSP70 (DnaK) proteins are involved in antigenicity. Interestingly, immunoblotting analysis revealed that there was no significant variation in the molecular mass of the HSP70 (DnaK) proteins among both S. pneumoniae isolates and immunoreactive non-pneumococcal isolates.
TABLE 3: REACTIVITY OF MABS WITH NON-PNEUMOCOCCAL ISOLATES IN WESTERN IMMUNOBLOTTING
Column headings: Bacterial strain (designation; genus/species; group or type); MAbs F1-Pn3.1, F2-Pn3.2, F2-Pn3.3, F2-Pn3.4
Strain designations: C-2, C-3, C-7, C-9, C-14, C-19, C-21, C-23, C-24, C-27, C-33, C-36, C-RP
Genus/species: Streptococcus pyogenes, Streptococcus agalactiae, Enterococcus faecalis, Streptococcus bovis, Streptococcus mutans, Streptococcus salivarius, Streptococcus sanguis (seven strains), Gemella morbillorum, Staphylococcus aureus, Bacillus, Escherichia coli, Chlamydia trachomatis b
Group or type: group A, group B, group D, group D
a indicates a weak signal compared to the reactivity observed with S. pneumoniae antigens.
b C. trachomatis purified elementary bodies were tested.
EXAMPLE 5 Purification of HSP72 And Its Use As An Immunogen to Protect Against Lethal S. Pneumoniae Infection
A. Procedures
1. Preparation of Purified Recombinant HSP72 Protein and Recombinant C-169
High level exclusive expression of the HSP72 gene was achieved by employing the bacteriophage T7 RNA polymerase/T7 promoter system in E. coli. The 3.2 kb HindIII fragment was cloned in both orientations in front of the T7 promoter Φ10 in the plasmid pT7-5. The resulting plasmid pJBDk51 was then transformed into E. coli strain BL21 (DE3). Overexpression of the recombinant HSP72 protein (HSP72rec) was induced by culturing in broth supplemented with antibiotics for a 3-hour period after the addition of IPTG to a final concentration of 1 mM. E. coli cells expressing high levels of HSP72rec were concentrated by centrifugation and lysed by mild sonication in 50 mM Tris-Cl (pH ), 1 mM EDTA and 100 mM NaCl lysis buffer containing 0.2 mg/ml lysozyme. The cell lysates were centrifuged at 12,000 g for minutes and the supernatants were collected. HSP72rec was purified by immunoaffinity using monoclonal antibody F1-Pn3.1 immobilized on Sepharose 4B beads (Pharmacia). The purity of eluates was assessed on SDS-PAGE.
The recombinant C-169 protein (C-169rec) was expressed in the form of insoluble inclusion bodies in E. coli strain JM109 transformed with the plasmid pJBDΔ1. Protein inclusion bodies were recovered from pelleted bacterial cells disrupted by sonication as described above. The pellets were washed in lysis buffer containing 1 mg/ml of deoxycholate to remove contaminating materials, and the protein inclusion bodies were then solubilized in 6 M urea. The protein solution was centrifuged at 100,000 g and the cleared supernatant was collected and dialysed against phosphate-buffered saline. After purification, the protein content was determined by the Bio-Rad protein assay (Bio-Rad Laboratories, Mississauga, Ontario, Canada).
2. Active Immunoprotection Studies
Two groups of 10 female Balb/c mice (Charles River Laboratories) were immunized subcutaneously three times at two-week intervals with 0.1 ml of purified HSP72rec or C-169rec antigens adsorbed to Alhydrogel adjuvant. Two antigen doses, approximately 1 and 5 µg, were tested. A third group of 10 control mice was immunized identically via the same route with Alhydrogel adjuvant alone. Blood samples were collected from the orbital sinus prior to each immunization and five to seven days following the third injection. The mice were then challenged with approximately 10^6 CFU of the type 3 S. pneumoniae strain WU2. Samples of the S. pneumoniae challenge inoculum were plated on chocolate agar plates to determine the CFU and to verify the challenge dose. Deaths were recorded at 6-hour intervals for the first 3-4 days post-infection and then at 24-hour intervals for a period of 14 days. On day 14 or 15, the surviving mice were sacrificed and blood samples were tested for the presence of S. pneumoniae organisms. Antibody responses to the recombinant HSP72 antigens are described in Example 7.
3. Passive Immunoprotection Studies
One NZW rabbit (Charles River Laboratories) was immunized subcutaneously at multiple sites with approximately 50 µg of the purified C-169rec protein adsorbed to Alhydrogel adjuvant. The rabbit was boosted three times at two-week intervals with the same antigen, and blood samples were collected 7 and 14 days following the last immunization. The serum samples were pooled and antibodies were purified by precipitation using saturated ammonium sulfate.
Severe combined immunodeficient (SCID) mice were injected intraperitoneally with 0.25 ml of the purified rabbit antibodies 1 hour before intravenous challenge with 5000 or 880 CFU of the type 3 S. pneumoniae strain WU2. Control SCID mice received sterile buffer or antibodies purified from nonimmune rabbit sera. Samples of the S. pneumoniae challenge inoculum were plated on chocolate agar plates to determine the CFU and to verify the challenge dose. The SCID mice were chosen because of their high susceptibility to S. pneumoniae infection. Blood samples (20 µl each) obtained 24 hours post-challenge were plated on chocolate agar and tested for the presence of S. pneumoniae organisms. The level of detection was 50 CFU/ml. Deaths were recorded at 24-hour intervals for a period of 5 days.
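The stated detection level of 50 CFU/ml follows from the plated volume: a single colony recovered from 20 µl of blood corresponds to 50 colony-forming units per ml. The short sketch below makes this arithmetic explicit. It assumes that the blood was plated undiluted and that one colony is the smallest countable result; neither assumption is stated in the text, and the function names are illustrative.

```python
def cfu_per_ml(colonies: int, plated_volume_ul: float, dilution_factor: float = 1.0) -> float:
    """Convert a colony count into CFU per ml of the original sample.

    plated_volume_ul: volume spread on the plate, in microlitres.
    dilution_factor: e.g. 10.0 for a 1:10 dilution (1.0 = undiluted, assumed here).
    """
    if plated_volume_ul <= 0:
        raise ValueError("plated volume must be positive")
    return colonies * dilution_factor * 1000.0 / plated_volume_ul


def detection_limit(plated_volume_ul: float, min_colonies: int = 1) -> float:
    """Lowest concentration that yields at least `min_colonies` on the plate."""
    return cfu_per_ml(min_colonies, plated_volume_ul)


if __name__ == "__main__":
    print(detection_limit(20.0))  # one colony in 20 microlitres
```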
B. Results
The availability of cloned S. pneumoniae DNA inserts encoding the complete or partial (C-169) HSP72 protein and the expression of recombinant proteins in E. coli made it possible to obtain purified proteins useful for investigating the vaccinogenic potential of the HSP72 protein. Both HSP72rec and C-169rec proteins were obtained in a relatively pure state with no contaminants detected on Coomassie Blue-stained SDS polyacrylamide gels (FIGS. 14 and 15, respectively).
To evaluate the vaccinogenic potential of HSP72, we first examined the ability of HSP72rec to elicit a protective immune response. Groups of 10 mice were immunized with full-length HSP72rec (1 µg or 5 µg dose) and challenged with 4.2 million CFU of S. pneumoniae type 3 strain WU2. Eighty percent of the mice dosed with 1 µg HSP72rec survived the challenge, as did 50% of the mice dosed with 5 µg HSP72rec. None of the naive mice immunized with Alhydrogel adjuvant alone without antigen survived the challenge (FIG. 16). No S. pneumoniae organisms were detected in any of the blood samples collected on day 14 or 15 from mice surviving infection. The observation that HSP72rec elicited protection against type 3 strain WU2 pneumococci indicated that HSP72 derived from DNA extracted from a type 6 strain contains epitopes capable of eliciting protection against a heterologous strain having a different capsular type.
We further examined the immune response to the HSP72 protein by using recombinant protein fragments expressed from E. coli transformed with a chimeric fucI-HSP72 gene. Mice immunized with purified C-169rec were protected from fatal pneumococcal challenge, thus demonstrating that some, if not all, epitopes eliciting protection are present in the C-terminal region of the HSP72 molecule comprising the last 169 residues. Groups of 10 mice were immunized with C-169rec (1 µg or 5 µg doses) and challenged with 6 million CFU of S. pneumoniae type 3 strain WU2. Sixty percent of the mice dosed with 1 µg C-169rec survived the challenge, as did 70% of the mice dosed with 5 µg C-169rec (FIG. 17). In contrast, all of the naive mice were dead by 2 days post-challenge. Therefore, the C-terminal portion of S. pneumoniae HSP72, which includes the region of maximum divergence among DnaK proteins, is a target for the protective immune response.
As illustrated in Table 4 below, two independent experiments demonstrated that SCID mice passively transferred with rabbit anti-C-169rec antibodies were protected from fatal infection with S. pneumoniae WU2. In contrast, none of the 15 control mice survived. The control mice received antibodies from nonimmune rabbit sera or received sterile buffer alone. In addition, all mice from the control groups had a positive S. pneumoniae hemoculture 24 hours post-challenge, while S. pneumoniae organisms were detected in only 2 out of a total of 10 immunized SCID mice.
TABLE 4: PASSIVE IMMUNIZATION STUDIES SHOWING PROTECTION OF SCID MICE FROM EXPERIMENTAL S. PNEUMONIAE INFECTION BY ANTI-C-169rec RABBIT ANTIBODIES
Experiment   Injection            No. of Mice Surviving Challenge after 5 days   No. of Mice Testing Positive for the Presence of S. pneumoniae
1            sterile buffer       0/5
1            anti-C-169rec        4/5
1            control antibodies   0/5
2            sterile buffer       0/5
2            anti-C-169rec        5/5
In experiments 1 and 2 (Table 4), mice were challenged with 5000 and 880 CFU of type 3 S. pneumoniae strain WU2, respectively. Results in Table 4 are expressed as the number of mice surviving challenge, or testing positive for the presence of S. pneumoniae, compared to the total number of mice in each group.
Demonstration of the anti-HSP72 specificity of the antibody elicited by immunization with recombinant HSP72 or C-169 proteins came from Western Blot analyses using S. pneumoniae cell lysates as antigens. A single band corresponding to HSP72 was detected by all rabbit and mouse antisera tested. These serologic results suggested that the protection following the immunization with recombinant proteins was due to the production of antibodies reactive with S. pneumoniae HSP72.
EXAMPLE 6 Heat-Inducible Expression System for High Level Production of the C-151 Terminal Portion of the HSP72 Protein
A. Construction of Plasmid pURV3 Containing the C-151 Terminal Coding Region of the HSP72 of S. pneumoniae
The DNA region coding for 151 amino acids at the carboxyl end of the HSP72 of S. pneumoniae was inserted downstream of the promoter λ PL into the translation vector p629 [H. J. George et al., Bio/Technology 5, pp. 600-603 (1987)]. This vector contains a cassette of the bacteriophage λ cI857 temperature-sensitive repressor gene from which the functional PR promoter has been deleted. The inactivation of the cI857 repressor by a temperature increase from the range of 30-37°C to 37-42°C results in the induction of the gene under the control of λ PL. The induction of gene expression in E. coli cells by a temperature shift is advantageous for large scale fermentation since it can easily be achieved with modern fermenters. However, it should be understood that while E. coli was the microorganism of choice in the experiments herein described, other host organisms, such as yeast, are intended to be included within the scope of this invention.
A fragment of 477 nucleotides, including the region of 457 bases from position 2050 to position 2506 in the HSP72 gene of S. pneumoniae (see SEQ ID NO:4), was amplified by the polymerase chain reaction (PCR) from the S. pneumoniae type 6 strain 64 genomic DNA using the oligonucleotide primers OCRR26 and OCRR27. Chromosomal DNA was prepared from a 90 ml culture of exponentially growing cells of S. pneumoniae in heart infusion broth using the method of Jayarao et al. [J. Clin. Microbiol., 29, pp. 2774-2778 (1991)]. DNA amplification reactions were made using a DNA Thermal Cycler, Perkin Elmer, San Jose, CA. In OCRR26, an ATG start codon is present in frame just upstream of the coding region for the amino-terminus region of the C-151. The primers OCRR26 and OCRR27 contain, respectively, a BglII (AGATCT) and a BamHI (GGATCC) recognition site in order to facilitate the cloning of the PCR product into the dephosphorylated restriction sites BglII and BamHI of p629. The PCR product was purified from agarose gels by the method of phenol freeze [S. A. Benson, Biotechniques 2, pp. 67-68 (1984)] and digested with the restriction enzymes BglII and BamHI. The BglII-BamHI fragment of 471 base pairs was then ligated into the dephosphorylated BglII and BamHI recognition sites of p629. A partial map of the resulting plasmid pURV3 is shown in FIG. 18. This plasmid was transformed by the method of Simanis [Hanahan, D. In D. M. Glover (ed.), DNA Cloning, pp. 109-135, (1985)] into the E. coli strain XL1-Blue MRF' [Δ(mcrA)183 Δ(mcrCB-hsdSMR-mrr)173 endA1 supE44 thi-1 recA1 gyrA96 relA1 lac (F' proAB lacIqZΔM15 Tn10 (Tetr))], which was obtained from Stratagene, La Jolla, CA. The transformants grown at 37°C were screened by colony immunoblot [J. Sambrook et al. (supra)] using the MAb F1-Pn3.1 reactive with C-169rec. Plasmid DNA was purified from a selected transformant and the DNA insert was sequenced by PCR using the Taq Dye Deoxy Terminator Cycle Sequencing kit of Applied Biosystems Inc. (ABI); DNA electrophoresis was performed on an automated DNA sequencer 373A (ABI). The nucleotide sequence of the insert perfectly matched the nucleotide sequence of the C-151 coding region of the HSP72 gene. (See SEQ ID NO:25 and the corresponding amino acid sequence at SEQ ID NO:26.) The plasmid was transformed into the prototrophic E. coli strain W3110 (ATCC 27325) for the production of C-151rec.
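The relationship between the 477-nucleotide PCR product and the 471-base-pair BglII-BamHI fragment can be illustrated with a simple in silico digest. The sketch below is an illustration only: it assumes that the BglII and BamHI sites sit flush at the two ends of the amplicon, it uses a made-up filler sequence rather than the actual HSP72 sequence, and it reports top-strand fragment lengths. BglII cuts A^GATCT and BamHI cuts G^GATCC.

```python
# Recognition site and top-strand cut offset (bases from the start of the site).
SITES = {
    "BglII": ("AGATCT", 1),  # A^GATCT
    "BamHI": ("GGATCC", 1),  # G^GATCC
}


def cut_positions(seq: str, site: str, offset: int):
    """Top-strand cut coordinates for every occurrence of `site` in `seq`."""
    positions = []
    i = seq.find(site)
    while i != -1:
        positions.append(i + offset)
        i = seq.find(site, i + 1)
    return positions


def digest(seq: str, enzymes):
    """Return the top-strand fragments of a linear molecule after digestion."""
    cuts = sorted(p for e in enzymes for p in cut_positions(seq, *SITES[e]))
    bounds = [0] + cuts + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Hypothetical 477-nt amplicon: BglII site + ATG + filler + BamHI site.
    amplicon = "AGATCT" + "ATG" + "GCA" * 154 + "GGATCC"
    print(len(amplicon), [len(f) for f in digest(amplicon, ["BglII", "BamHI"])])
```

Under these assumptions the digest leaves a 471-nucleotide central fragment flanked by a 1-nucleotide and a 5-nucleotide end piece, which is consistent with the sizes quoted in the text. Real primers often carry extra 5' bases ahead of the restriction site, in which case the end pieces would be longer.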
B. Expression of C-151rec and Antigen Preparation
The recombinant C-151rec was synthesized with a methionine residue at its amino end in E. coli strain W3110 harboring the plasmid pURV3. E. coli cells were grown at 30°C in LB broth containing 100 µg of ampicillin per ml until the A600 reached a value of 0.6. The cells were then cultivated at 40°C for 18 hours to induce the production of C-151rec protein. A semi-purified C-151rec protein was prepared using the following procedures. The bacterial cells were harvested by centrifugation and the resulting pellet was washed and resuspended in phosphate-buffered saline. Lysozyme was added and the cells were incubated for 15 min on ice before disruption by pulse sonication. The cell lysates were cleared by centrifugation and the supernatants were collected and subjected to separation using Amicon ultrafiltration equipment (stirred cells series 8000, Amicon Canada Ltd., Oakville, Ontario). The ultrafiltrate not retained by a YM30 membrane was recovered, analysed by SDS-PAGE and stained with Coomassie blue R-250. Protein concentrations were estimated by comparing the staining intensity of the C-151rec protein with those obtained with defined concentrations of soybean trypsin inhibitor.
C. Reactivity of MAbs Against C-151rec
A panel of 10 monoclonal antibodies selected for their reactivity with the S. pneumoniae HSP72 protein were tested for their reactivity to C-151rec by Western blot analysis using YM30 ultrafiltrates prepared as described above. The MAbs included a series of six monoclonal antibodies raised to the HSP72rec protein (F3-Pn3.5 to F3-Pn3.10) and monoclonal antibodies F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4. The three MAbs F1-Pn3.1, F2-Pn3.3 and F2-Pn3.4 that were reactive with C-169rec also recognized the C-151rec fragment. All other MAbs were only reactive with HSP72rec, thus indicating that they may be directed against epitopes present in the amino terminal region of the HSP72 protein.
EXAMPLE 7 Antibody Response of Balb/c Mice and Macaca fascicularis (Cynomolgus) Monkeys to Recombinant HSP72 Antigens
A. Procedures
1. Immunization of Animals
Groups of 10 female Balb/c mice were immunized subcutaneously with either HSP72rec or C-169rec as described in Example 5. In order to assess the antibody response to C-151rec, a group of 6 mice were immunized three times at two-week intervals with 0.5 µg of C-151rec adsorbed to Alhydrogel adjuvant by intraperitoneal injection. Sera from blood samples collected prior to each immunization and four to seven days after the third immunization were tested for antibody reactive with S. pneumoniae by ELISA using plates coated with S. pneumoniae cell wall extracts.
Female cynomolgus monkeys were immunized intramuscularly on days 1, 22 and 77 with 0.5 ml containing 150 µg of purified HSP72rec or C-169rec antigens adsorbed to Alhydrogel adjuvant. Blood samples were collected regularly before and after each immunization and the sera were tested for antibody reactive with S. pneumoniae HSP72 antigen by Western blot analysis.
The specificity of the raised antibodies for S. pneumoniae HSP72 was confirmed by Western blot analyses against S. pneumoniae cell extracts and purified recombinant antigens.
B. Results
The results previously described in Example 5 clearly demonstrate the protective nature of the antibody response elicited following immunization with recombinant HSP72 antigens. Here we monitored the appearance of the serum antibody response in mice (FIG. 20 and 21) and in monkeys (FIG. 22) during the immunization schedule. Both species responded strongly to the full-length and truncated recombinant HSP72 proteins used as immunogens, with average titers of 1:64000 after the third injection. Detailed analysis of individual sera revealed that each animal responded to the immunization by developing antibodies reactive with S. pneumoniae HSP72.
In mice immunized with C-169rec, the two doses tested, i.e. 1 and 5 µg, were similarly efficient and induced similar antibody titers (FIG. 20). A strong boost response was observed after the second injection with C-169rec, with no enhancement in the antibody titers after a third injection. In contrast, we observed that the immune response to HSP72rec was dose-dependent. Increases in the specific antibody titers were observed after a second and a third injection with either HSP72rec or C-151rec (FIG. 19 and 21).
Study of the immune response of monkeys clearly indicated that the immunogenicity of recombinant HSP72 antigens is not restricted to rodents such as rabbit and mouse. The humoral response following the second injection with either antigen is characterized by a strong increase in HSP72-specific antibody titers that can persist for several weeks without any detectable decrease in their antibody titers (FIG. 22). In addition, specific serum antibodies were detectable in the sera of each monkey after a single injection of recombinant antigens.
EXAMPLE 8 B-Cell Epitope Mapping of HSP72 Stress Protein
In Example 3, it was shown that significant variability in the primary sequence of the HSP70 proteins was mainly localized to two regions corresponding to amino acid residues 244 to 330 and 510 to 607 of the S. pneumoniae HSP72 protein. These variable regions may contain B-cell epitopes responsible for the antigenic heterogeneity reported in Example 4. To investigate this possibility, the reactivity of polyclonal and monoclonal antibodies to S. pneumoniae HSP72 was tested against fourteen peptides selected to cover most of these regions.
A. Procedures
Fourteen peptides of 14 to 30 amino acid residues were synthesized. The peptide sequences and their locations in the protein are summarized in Table 5. Peptides CS870, CS873, CS874, CS875, CS876, CS877, CS878, CS879, CS880 and CS882 were synthesized by Biochem Immunosystem Inc. (Montreal, Canada) using an automated peptide synthesizer. Peptides MAP1, MAP2, MAP3 and MAP4 were synthesized onto a branching lysine core as Multiple Antigenic Peptides (MAP) by the Service de Sequence de Peptides de l'Est du Quebec, Centre de recherche du CHUL (Sainte-Foy, Canada). Peptides were purified by reverse-phase high-pressure liquid chromatography. Peptides were solubilized in distilled water, except for peptides CS874 and CS876, which were solubilized in a small volume of either 6 M guanidine-HCl or dimethyl sulfoxide and then adjusted to 1 mg/ml with distilled water.
Peptide ELISAs were performed by coating synthetic peptides onto Immulon 4 microtitration plates (Dynatech Laboratories, Inc., Chantilly, VA) at a concentration of µg/ml according to the procedures described in J. Hamel et al. [supra]. To confirm the reactivity of MAbs with peptides, the ability of fluid-phase peptides to inhibit MAb binding to solid-phase HSP72 was determined. For the inhibition assay, microtitration plates were coated with S. pneumoniae cell wall extracts. Hybridoma culture supernatants containing the HSP72-specific MAbs were incubated overnight at 4°C with several concentrations of peptide. Peptide-treated and control supernatants were then tested by ELISA as described above.
Immune sera were from animals immunized three times with recombinant HSP72 antigens. One rabbit was immunized with 37.5 µg of purified HSP72rec according to the immunization protocol described in Example 5. Pooled murine sera were from three Balb/c mice immunized with HSP72rec from Example 5, and monkey pooled sera were from groups of two animals immunized with either HSP72rec or C-169rec.
TABLE 5: SEQUENCES AND LOCATIONS OF SYNTHETIC PEPTIDES CORRESPONDING TO S. PNEUMONIAE HSP72 AMINO ACID RESIDUES
Peptide   Location   Sequence                         Sequence ID No.
CS876     247-261    TSTQISLPFITAGEA                  7
CS877     257-271    TAGEAGPLHLEMTLT                  8
CS878     268-281    MTLTPAKFDDLTRD                   9
CS879     276-290    DDLTRDLVERTIVPV                  10
CS880     286-299    TKVPVRQALSDAGL                   11
CS882     315-333    RIPAVVEAVKAETGKEPNK              23
CS873     457-471    KAKDLGTQKEQTIVI                  12
CS874     467-481    QTIVIQSNSGLTDEE                  24
CS875     477-491    LTDEEIDRMMKDAEA                  13
MAP1      487-510    KDAEANAESDKKRKEEVDL,rfVV*.       14
CS870     507-521    NEVDQAIFATEKTIK                  15
MAP2      517-544    EKTIKETEGKGFDAERDAAQAALDDLKK     16
MAP3      544-573    KAQEDNNLDDMKAKLEALNEKAQGLAVKLY   17
MAP4      583-607    QEGAEGAQATGNAGDDVVDGEFTE         18
B. Identification and Localization of Linear B-Cell Epitopes
The results presented in FIG. 23 revealed that most of the immunological reactivity was observed with the peptides localized within amino acid residues 457 and 607, corresponding to the C-151 fragment of HSP72. Rabbit, mouse and monkey serum antibodies from animals immunized with either recombinant HSP72rec or C-169rec were reactive with both peptide MAP2 and peptide MAP4. Interestingly, the sequences of peptides MAP2 and MAP4 span the hypervariable carboxyl-terminal region containing the sequences GFDAERDAAQAALDD (residues 527 to 541) and AEGAQATGNAGDDVV (residues 586 to 600), defined as exclusive to S. pneumoniae HSP72 based on the comparison of protein sequences available in the data banks. Our data thus revealed that both peptide sequences contain linear B-cell epitopes. In addition, the peptide MAP4 alone was also recognized by the MAb F1-Pn3.1. This reactivity was confirmed by fluid-phase inhibition assays in which µg/ml of MAP4 caused complete inhibition of F1-Pn3.1 binding to HSP72. Polyclonal antisera from animals immunized with the complete HSP72 recombinant protein also recognized B-cell epitopes localized on peptides CS875, MAP1 and MAP3. Altogether, these data indicate that the hypervariable C-151 terminal fragment of HSP72 stimulates B-cell responses and possibly constitutes the immunodominant portion of the HSP72 protein. The lack of reactivity of MAbs F2-Pn3.3 and F2-Pn3.4 with the synthetic peptides suggests that they react with conformational determinants present on the C-terminal region of HSP72. The existence of protective epitopes in the C-151 region was strongly suggested in Example 5, where mice immunized with purified C-169rec were protected from fatal infection with a virulent strain of S. pneumoniae, thus suggesting that the carboxyl-terminal fragments C-169 or C-151 of S. pneumoniae HSP72, or even smaller fragments thereof, may prove very useful for the development of a future vaccine.
The variable region comprised within amino acid residues 244 to 330 also constitutes an antigenic domain. Linear epitopes located on overlapping peptides CS877 (amino acids 257 to 271) and CS878 (amino acids 268 to 281), peptide CS880 (amino acids 286 to 299) and peptide CS882 (amino acids 315 to 333) were identified by hyperimmune sera.
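The peptide locations listed in Table 5 can be checked for internal consistency (peptide lengths, overlaps between neighbouring peptides, and membership in the C-151 region spanning residues 457 to 607). The sketch below is a minimal illustration: the residue ranges are transcribed from Table 5 as read here (the CS879 range is taken as 276-290), and the helper names are assumptions made for this example.

```python
# Residue ranges (1-based, inclusive) transcribed from Table 5.
PEPTIDES = {
    "CS876": (247, 261), "CS877": (257, 271), "CS878": (268, 281),
    "CS879": (276, 290), "CS880": (286, 299), "CS882": (315, 333),
    "CS873": (457, 471), "CS874": (467, 481), "CS875": (477, 491),
    "MAP1": (487, 510), "CS870": (507, 521), "MAP2": (517, 544),
    "MAP3": (544, 573), "MAP4": (583, 607),
}
C151_REGION = (457, 607)  # C-terminal 151 residues of the 607-residue protein


def length(rng):
    """Number of residues in an inclusive range."""
    return rng[1] - rng[0] + 1


def overlap(a, b):
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def within(rng, region):
    """True if the range lies entirely inside the region."""
    return region[0] <= rng[0] and rng[1] <= region[1]


if __name__ == "__main__":
    names = list(PEPTIDES)
    for first, second in zip(names, names[1:]):
        print(first, second, overlap(PEPTIDES[first], PEPTIDES[second]))
    print(sorted(n for n, r in PEPTIDES.items() if within(r, C151_REGION)))
```

With these ranges the peptide lengths run from 14 to 30 residues, in line with the description in the Procedures, and CS877 and CS878 share four residues (268 to 271), which is why they are described as overlapping.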
EXAMPLE 9 HSP70 (DnaK) from Streptococcus pyogenes and Streptococcus agalactiae: Molecular Cloning and DNA Sequencing of the hsp70 Genes; Nucleotide and Protein Sequence Analyses; Antigenic Relatedness to S. pneumoniae; Increased Streptococcus agalactiae HSP70 Synthesis in Response to Heat
A. Procedures
1. Bacterial Strains and Plasmid Vector
The strains of S. pyogenes (Group A Streptococcus) and S. agalactiae (Group B Streptococcus) used in this study were provided by the Laboratoire de la Santé Publique du Québec (LSPQ), Sainte-Anne de Bellevue, Québec, Canada. S. agalactiae type II strain V8 corresponds to the ATCC strain 12973. S. pyogenes strain Bruno corresponds to the ATCC strain 19615. The E. coli strain XL1-Blue MRF' was obtained from Stratagene.
Streptococcal strains were grown at 37°C in a CO2 incubator. The streptococci were streaked on tryptic soy agar plates containing 5% sheep blood (Les Laboratoires Quelab, Montreal, Canada); liquid cultures were made in heart infusion broth (Difco Laboratories, Detroit, MI) without agitation. The E. coli strain was grown at 37°C in L-broth with agitation at 250 rpm or on L-agar.
The general cloning phagemid pBluescript KS(-) was purchased from Stratagene.
2. Recombinant DNA Techniques
Restriction enzymes, T4 DNA ligase, and calf intestinal phosphatase were used as recommended by the suppliers (Pharmacia [Canada] Inc., Baie d'Urfe, Canada; and New England Biolabs Ltd., Mississauga, Canada). Preparation of plasmids by equilibrium centrifugation in CsCl-ethidium bromide gradients, agarose gel electrophoresis of DNA fragments, Southern hybridization, and colony DNA hybridization were performed as described by J. Sambrook et al. [supra]. Chromosomal DNA of the streptococcal bacteria was prepared using the procedure of B. M. Jayarao et al. [J. Clin. Microbiol., 29, pp. 2774-2778 (1991)] adapted for bacterial cultures of 90 ml. Rapid plasmid preparations were made according to D. Ish-Horowicz et al. [Nucl. Acids Res. 9, pp. 2989-2998 (1981)]. Plasmids used for DNA sequencing were purified using plasmid kits from Qiagen Inc. (Chatsworth, CA). DNA fragments were purified from agarose gels by the method of phenol freeze [S. A. Benson, Biotechniques 2, pp. 67-68 (1984)]. DNA probes were labeled with α-32P-dCTP or digoxigenin (DIG)-11-dUTP using the random primer labeling kits of Boehringer Mannheim (Laval, Canada). Plasmid transformations were carried out by the method of Simanis [Hanahan, D. In D. M. Glover (ed.), DNA Cloning, pp. 109-135, (1985)]. The sequencing of genomic DNA inserts in plasmids was done using synthetic oligonucleotides. The sequencing reactions were carried out by the polymerase chain reaction (PCR) using the Taq Dye Deoxy Terminator Cycle Sequencing kit (ABI), and DNA electrophoresis was performed on an automated DNA sequencer 373A (ABI). The assembly of the DNA sequence was performed using the program Sequencher 3.0 from the Gene Codes Corporation (Ann Arbor, MI). Analysis of the DNA sequences and their predicted polypeptides was performed with the program Gene Works version 2.45 from Intelligenetics, Inc. (Mountain View, CA). DNA amplification reactions were made using a DNA Thermal Cycler 480, Perkin Elmer. Oligonucleotides were synthesized using an oligonucleotide synthesizer model 394 (ABI).
3. Molecular Cloning of the Genes of S. agalactiae and S. pyogenes
Chromosomal DNA from S. agalactiae and S. pyogenes was digested to completion with various restriction enzymes with palindromic hexanucleotide recognition sequences. The digests were analysed by Southern hybridization using a labeled PCR-amplified DNA probe corresponding to a 782-base-pair region starting at base 332 downstream from the ATG initiation codon of the HSP72 gene of S. pneumoniae (see SEQ ID NO: ). This DNA region was selected because it is relatively well conserved among the hsp70 genes of Gram-positive bacteria that have been characterized. The PCR amplification was done on the genomic DNA of S. pneumoniae using the oligonucleotides OCRR2 (5'-AAGCTGTTATCACAGTTCCGG) and OCRR3 (5'-GATACCAAGTGACAATGGCG). Hybridizing genomic restriction fragments of sufficient size to code for a kDa polypeptide kb) were partially purified by extraction of genomic fragments of corresponding size from agarose gel. Verification of the presence of the gene among the purified genomic restriction fragments was done by Southern hybridization using the labeled 782-bp S. pneumoniae DNA probe.
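As a rough illustration of how the two oligonucleotides quoted above might be characterized, the sketch below computes length, G+C count, reverse complement and an approximate melting temperature. The melting temperature uses the Wallace rule of thumb (2 °C per A/T, 4 °C per G/C), which is an assumption introduced here for illustration and is not a method described in the specification; the function names are likewise hypothetical.

```python
# Primer sequences exactly as quoted in the text (5' to 3').
PRIMERS = {
    "OCRR2": "AAGCTGTTATCACAGTTCCGG",
    "OCRR3": "GATACCAAGTGACAATGGCG",
}

# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gc_count(seq: str) -> int:
    """Count G and C bases."""
    return sum(base in "GC" for base in seq.upper())


def wallace_tm(seq: str) -> int:
    """Approximate Tm in degrees C by the Wallace rule (short oligos only)."""
    gc = gc_count(seq)
    at = len(seq) - gc
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), "nt,", gc_count(seq), "G+C, approx Tm", wallace_tm(seq),
              "C, reverse complement", reverse_complement(seq))
```

The Wallace rule is only a first approximation; salt-adjusted or nearest-neighbour estimates would give different values.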
The purified genomic DNA restriction fragments were cloned into dephosphorylated compatible restriction sites of pBluescript and transformed into the E. coli strain XL1-Blue MRF'. The colonies were screened by DNA hybridization using the labeled 782-bp S. pneumoniae DNA probe. Extracted plasmids were digested with various restriction enzymes to evaluate the size of the inserts and to verify the presence of the hsp70 gene by Southern hybridization using the labeled 782-bp S. pneumoniae DNA probe. Plasmid pURV5 contains a 4.2-kb HindIII insert of the genomic DNA of S. agalactiae. Plasmid pURV4 contains a 3.5-kb HindIII fragment of the genomic DNA of S. pyogenes.
4. Heat Shock and Protein Labeling
The stress response of S. agalactiae to a heat shock was assayed by pulse-labeling with [35S]methionine as described in Example 1. S. agalactiae bacteria grown overnight in SMAM (Methionine Assay Medium supplemented with 1 mg/l methionine, 1% Isovitalex and 1 mg/l choline chloride) were pelleted by centrifugation and then resuspended in the methionine-free SMAM medium. The bacteria were incubated at 37°C for 1 h and then divided into two fractions of equal volume. The samples were incubated at either 37 or 43°C for 10 minutes and then labeled with 100 µCi/ml [35S]methionine for minutes at 37°C. The bacteria were extensively washed with PBS and cell extracts were prepared by treatment with mutanolysin and lysozyme as described for the DNA isolation (B. M. Jayarao et al., supra) followed by sonication.
5. Immunological Characterization
A series of six monoclonal antibodies raised to the HSP72rec protein (F3-Pn3.5 to F3-Pn3.10) and the monoclonal antibodies F1-Pn3.1, F2-Pn3.2, F2-Pn3.3 and F2-Pn3.4 were tested for their reactivity to HSP70 antigens from S. pyogenes and S. agalactiae by Western blot analysis. Cell lysates from S. pyogenes and S. agalactiae were obtained by treatment with mutanolysin and lysozyme (B. M. Jayarao et al., supra), sonication and boiling in SDS-PAGE sample buffer. Cell lysates from E. coli transformed with either pURV4 or pURV6 producing truncated S. pyogenes HSP70 antigens were tested after boiling in SDS-PAGE sample buffer.
B. DNA Sequence Analysis of the hsp70/dnaK Genes of Streptococcus pyogenes, Streptococcus agalactiae and Streptococcus pneumoniae
A region of 2438 bases in the 4.2-kb HindIII insert of plasmid pURV5 was sequenced. This sequence contains an open reading frame (ORF) of 1830 nucleotides coding for a polypeptide of 609 amino acids with a molecular weight of 64907 (see SEQ ID NO: ). The ORF has an ATG start codon beginning at position 248 and a TAA stop codon ending at position 2077. The ATG start codon is preceded by the sequence GAGG, starting at position 237, which is complementary to 16S rRNA and serves as a ribosome binding site in E. coli [G. D. Stormo et al., Nucleic Acids Res. 10, pp. 2971-2996 (1982)]. The ORF and the polypeptide of the HSP70 of S. agalactiae are, respectively, 85% and 95% identical to the ORF and polypeptide of the HSP72 of S. pneumoniae.
Preliminary sequence comparisons with the HSP72 of S. pneumoniae showed that the 3.5-kb HindIII insert in plasmid pURV4 lacks the 3'-end coding region of the hsp70 gene of S. pyogenes. An attempt to clone a 3-kb SalI genomic fragment containing the entire coding region of hsp70 of S. pyogenes yielded plasmid pURV6 containing a 3.1-kb insert lacking the 5'-end coding region of the gene. The assembly of the hsp70 gene regions present in plasmids pURV4 and pURV6 gave a 2183-nucleotide region containing an ORF of 1824 bases coding for a polypeptide of 608 amino acids with a molecular weight of 64847 (see SEQ ID NO: ). The ATG start codon begins at position 204 and the TAA stop codon extends to position 2030. Similarly to the hsp70 gene of S. agalactiae, the ATG start codon is preceded by a putative ribosome binding site sequence GAGG starting at position 193 [G. D. Stormo, supra]. The ORF and the deduced polypeptide of the hsp70 of S. pyogenes are, respectively, 85% and 94% identical to the ORF and polypeptide of the HSP72 of S. pneumoniae. The ORF of plasmid pURV4 lacks 125 base pairs coding for 41 amino acids at the carboxyl end of the HSP70 of S. pyogenes; the ORF thus codes for the 567 amino acids of the amino end of that HSP70 (N-567rec). The ORF of plasmid pURV6 lacks 114 base pairs coding for 38 amino acids at the amino end of the HSP70 of S. pyogenes; the ORF thus codes for the 570 amino acids of the carboxyl end of that HSP70 (C-570rec). The global comparison of the DNA open reading frames (FIG. 24) and amino acid sequences (FIG. 25) of the HSP70/DnaK of S. pyogenes, S. agalactiae, and S. pneumoniae gave percentages of identity of 82% and 93%, respectively.
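The ORF coordinates and polypeptide lengths quoted in this section can be cross-checked with simple arithmetic. The sketch below is illustrative only; the function names are hypothetical, and it assumes 1-based inclusive coordinates running from the first base of the ATG to the last base of the TAA stop codon, as the wording of the text suggests.

```python
def orf_length(start: int, stop_end: int) -> int:
    """Nucleotides from the first base of ATG to the last base of the stop codon."""
    return stop_end - start + 1


def residues_encoded(start: int, stop_end: int) -> int:
    """Amino acids encoded, excluding the stop codon."""
    nt = orf_length(start, stop_end)
    if nt % 3 != 0:
        raise ValueError("coordinates do not describe a whole number of codons")
    return nt // 3 - 1


# Coordinates quoted in the text.
S_AGALACTIAE = (248, 2077)
S_PYOGENES = (204, 2030)

# Truncated S. pyogenes products: residues missing from each end of the 608-residue protein.
FULL_LENGTH_PYOGENES = 608
N_TERMINAL_FRAGMENT = FULL_LENGTH_PYOGENES - 41   # pURV4 product
C_TERMINAL_FRAGMENT = FULL_LENGTH_PYOGENES - 38   # pURV6 product

if __name__ == "__main__":
    print("S. agalactiae:", orf_length(*S_AGALACTIAE), "nt,", residues_encoded(*S_AGALACTIAE), "aa")
    print("S. pyogenes:", orf_length(*S_PYOGENES), "nt,", residues_encoded(*S_PYOGENES), "aa")
    print("N fragment:", N_TERMINAL_FRAGMENT, "aa; C fragment:", C_TERMINAL_FRAGMENT, "aa")
```

Under this assumption the S. agalactiae coordinates span 1830 nucleotides including the stop codon, whereas the S. pyogenes coordinates span 1827 nucleotides; the quoted figure of 1824 bases appears to count only the 608 coding codons. Both readings agree with the stated polypeptide lengths.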
C. Increased Synthesis of HSP70 by S. agalactiae in Response to Heat
One-dimensional SDS-polyacrylamide gel electrophoretic analysis of cell extracts of heat-shocked and control S. agalactiae pulse-labeled with [35S]methionine revealed that the synthesis of a 70 kDa protein was significantly increased after a thermal stress (FIG. 26, lanes 1 and ). Radioimmunoprecipitation analysis revealed that the heat-inducible protein was easily detected at 43°C using monoclonal antibody F2-Pn3.4, thus indicating that the protein belongs to the heat shock protein 70 (hsp70/DnaK) family (FIG. 26, lanes 3 and 4).
D. Antigenic Relatedness of HSP70 Proteins in S. pneumoniae, S. pyogenes and S. agalactiae
In this study, a panel of MAbs was used to investigate the antigenic relatedness of S. pyogenes, S. agalactiae and S. pneumoniae HSP70 proteins. Eight of ten MAbs reacted with all three Streptococcus species, thus indicating that some B-cell epitopes are widely distributed among S. pneumoniae, S. pyogenes and S. agalactiae. The MAb F1-Pn3.1, which is directed against an epitope located between amino acid residues 584 and 607 of HSP72 from S. pneumoniae, did not react with antigens from either S. pyogenes or S. agalactiae. Comparison of this region among the three Streptococcus species revealed differences in 5 to 8 amino acids located between amino acids 589 and 596. The MAb F2-Pn3.3, which was also directed against epitopes present in the C-151 region, was reactive with S. agalactiae but not with S. pyogenes. These data clearly indicate that HSP70 proteins from Streptococcus species are structurally and immunologically related. There are, however, immunological distinctions.
Analysis of the reactivity of MAbs F3-Pn3.5, F3-Pn3.6, F3-Pn3.7 and F3-Pn3.10 with truncated recombinant S. pyogenes HSP70 antigens allowed the identification of an antigenic region near the amino-terminal end on the S. pneumoniae HSP72. These MAbs reacted with constructs expressing the N-terminal 567 amino acid residues but failed to react with constructs expressing the C-570 fragment. These data localized the epitopes recognized by the MAbs F3-Pn3.5, F3-Pn3.6, F3-Pn3.7 and F3-Pn3.10 to between residues 1 and 38 of the HSP72 protein.
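The localization argument above amounts to a set difference over residue positions: an epitope seen on the N-terminal 567-residue construct but not on the C-terminal 570-residue construct must involve residues present only in the former. The following sketch works under stated assumptions: it uses the 608-residue S. pyogenes numbering for both fragments and treats reactivity as all-or-none; the function and variable names are hypothetical.

```python
FULL_LENGTH = 608  # residues in the S. pyogenes HSP70, as stated in the text

# Residue positions (1-based) covered by each truncated construct.
N567 = set(range(1, 567 + 1))                               # residues 1..567
C570 = set(range(FULL_LENGTH - 570 + 1, FULL_LENGTH + 1))   # residues 39..608


def candidate_region(reactive, nonreactive):
    """Residues present in every reactive construct and absent from all non-reactive ones."""
    shared = set.intersection(*reactive)
    excluded = set().union(*nonreactive)
    return shared - excluded


if __name__ == "__main__":
    region = candidate_region([N567], [C570])
    print("candidate epitope residues:", min(region), "to", max(region))
```

The calculation identifies residues unique to the reactive construct; it does not exclude a conformational epitope that merely depends on those residues.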
EXAMPLE 10 Use of HSP70/HSP72 As A Human Vaccine
To formulate a vaccine for human use, appropriate HSP72 antigens may be selected from the polypeptides described herein. For example, one of skill in the art could design a vaccine around the HSP70/HSP72 polypeptide or fragments thereof containing an immunogenic epitope. The use of molecular biology techniques is particularly well-suited for the preparation of substantially pure recombinant antigens.
The vaccine composition may take a variety of forms. These include, for example, solid, semi-solid and liquid dosage forms, such as powders, liquid solutions or suspensions, and liposomes. Based on our belief that the HSP70/HSP72 antigens of this invention may elicit a protective immune response when administered to a human, the compositions of this invention will be similar to those used for immunizing humans with other proteins and polypeptides, e.g. tetanus and diphtheria. Therefore, the compositions of this invention will preferably comprise a pharmaceutically acceptable adjuvant such as incomplete Freund's adjuvant, aluminum hydroxide, a muramyl peptide, a water-in-oil emulsion, a liposome, an ISCOM or CTB, or a non-toxic B subunit from cholera toxin. Most preferably, the compositions will include a water-in-oil emulsion or aluminum hydroxide as adjuvant.
The composition would be administered to the patient by any of a number of pharmaceutically acceptable routes, including intramuscular, intradermal, subcutaneous or topical. Preferably, the vaccine will be administered intramuscularly.
Generally, the dosage will consist of an initial injection, most probably with adjuvant, of about 0.01 to 10 mg, and preferably 0.1 to 1.0 mg HSP72 antigen per patient, followed most probably by one or more booster injections. Preferably, boosters will be administered at about 1 and 6 months after the initial injection.
An important consideration relating to pneumococcal vaccine development is the question of mucosal immunity. The ideal mucosal vaccine will be safely taken orally or intranasally as one or a few doses and would elicit protective antibodies on the appropriate surfaces along with systemic immunity. The mucosal vaccine composition may include adjuvants, inert particulate carriers or recombinant live vectors.
The anti-HSP72 antibodies of this invention are useful for passive immunotherapy and immunoprophylaxis of humans infected with S. pneumoniae, S. pyogenes, S. agalactiae or related bacteria. The dosage forms and regimens for such passive immunization would be similar to those of other passive immunotherapies.
An antibody according to this invention is exemplified by a hybridoma producing MAb F1-Pn3.1 deposited in the American Type Culture Collection in Rockville, Maryland, USA on July 21, 1995, and identified as Murine Hybridoma Cell Line, F1-Pn3.1. This deposit was assigned accession number HB 11960.
While we have described herein a number of embodiments of this invention, it is apparent that our basic embodiments may be altered to provide other embodiments that utilize the compositions and processes of this invention. Therefore, it will be appreciated that the scope of this invention includes all alternative embodiments and variations that are defined in the foregoing specification and by the claims appended hereto; and the invention is not to be limited by the specific embodiments which have been presented herein by way of example.
SEQUENCE LISTING
GENERAL INFORMATION:
APPLICANT: Hamel, Josee; Brodeur, Bernard R; Martin, Denis; Rioux, Clement
(ii) TITLE OF INVENTION: STREPTOCOCCAL HEAT SHOCK PROTEINS MEMBERS OF THE HSP70 FAMILY
(iii) NUMBER OF SEQUENCES: 26
(iv) CORRESPONDENCE ADDRESS:
ADDRESSEE: Goudreau Gage Dubuc Martineau Walker
STREET: 800 Place Victoria, Suite 3400, Stock Exchange Tower
CITY: Montreal
STATE: Quebec
COUNTRY: CANADA
ZIP: H4Z 1E9
COMPUTER READABLE FORM:
MEDIUM TYPE: Floppy disk
COMPUTER: IBM PC compatible
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release Version #1.25
(vi) CURRENT APPLICATION DATA:
APPLICATION NUMBER:
FILING DATE:
CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
APPLICATION NUMBER: US 08/472,534
FILING DATE: 07-JUN-1995
(vii) PRIOR APPLICATION DATA:
APPLICATION NUMBER: US (PROVIS) 60/001,805
FILING DATE: 04-AUG-1995
(viii) ATTORNEY/AGENT INFORMATION:
NAME: Leclerc/Dubuc/Prince, Alain/Jean/Gaetan
REFERENCE/DOCKET NUMBER: BIOVAC2-PCT
(ix) TELECOMMUNICATION INFORMATION:
TELEPHONE: (514) 397-7400
TELEFAX: (514) 397-4382
INFORMATION FOR SEQ ID NO:1:
SEQUENCE CHARACTERISTICS:
LENGTH: 3167 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Streptococcus pneumoniae
(ix) FEATURE:
NAME/KEY: CDS
LOCATION: 30..755
(ix) FEATURE:
NAME/KEY: CDS
LOCATION: 771..2912
OTHER INFORMATION: /product= "FucI/-SP72 (C-169)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GAACTTCATT TTTAGMAGG AGTGAGTTT ATG Met
CGT
Arg
GTT
Val
ATT
Ile
AAG
Lys
GAT
CAG
Gin
GCT
Ala
GCA
Ala
GTG
Val
TGT
Asp Tyr Cys
GAA
Glu
ACA
Thr 105
GAG
Glu
CCG
Pro
CAT
His
GAT
Asp
AAG
Lys 185
GAG
Glu Glu ATT TGT GAT Ile Cys Asp AAC GAT GGG Asn Asp Gly 30 ACA CCT ACT Thr Pro Thr 45 AAG TTA AAT Lys Leu Asn CCT TCT AGT Pro Ser Ser GAT GTT CGT Asp Val Arg GCT CTT GCA Ala Leu Ala 110 ATT GTG GTT Ile Val Val 125 ATG GAA GTG Met Giu Val 140 ATG CTA TTA Met Leu Leu ACA GCA TAC Thr Ala Tyr TC CAC GGA Phe His Gly 190 ATT GCT CGT Ile Ala Arg 205 MG GTT ACA Lys Val Thr 220 OTT TGT CAT Val C~s His MT GTA TC'2 Ajm Vai Ser GGT ATC AGC Gly Ile Ser CTT AAA GGA Leu Lys Gly 65 GAA ATT AAA Glu Ile Lys 80 TCA OTT OTT Ser Val Val CAC ATT CCT His lie Pro GGG GCA ATT Gly Ala Ile CCA GAA GCA Pro Giu Ala 145 GM AAT CAT Glu Asn His 160 TAC COT ATG Tyr Arg Met 175 AGA ATG TTA Arg Met Leu CCG ACT TA Pro Thr Leu GGT CGT CAC Gly Arg His TCT CM OAT Ser Gin Asp AAG ATG TGG Lys Met Trp OTT CGA TTA Val Arg Le,.
35 AMA AGT TTT Lys Ser Phe 50 GAG ATT TTA Glu Ile Leu ATO CAC ATT Met His Ile CAC GCG CAT His Ala His 100 TTA GAr" ACT Leu Asp Thr 115 CCT ATT ACC Pro lie Thr 130 ATT ACA CCT Ile Thr Pro GGA GCT CTG Gly Ala Leu GAA ACT TTA Glu Thr Leu 180 CTT TCT ACA Leu Ser Thr 195 AA CGT CTA Glu Arg Leu 210 CCA GOC TAC Pro Gly Tyr GMA AM Glu Lys CAA CTT Gin Leu GAT GAG Asp Glu ATT ACA Ile Thr GAA GCA Glu Ala CGG TGC Arg Cys CCA CCG Pro Pro TAT TCA Tyr Ser CCA TTT Pro Phe TAT CTG Tyr Leu 150 ACT GTC Thr Val 165 GM TTA Glu Leu AAG GGC Lys Gly ITC TCA Phe Ser :GT AAA Arg Lys 230
TTA
Leu
GGT
Gly
OAT
Asp
CCA
Pro
GAA
Glu
TAC
Tyr
ATT
Ile
CTA
Leu
GGA
Gly 135
CCC
Pro
GGA
Gly
GTC
Val
ATT
Ile
ATG
Met 215
TAT
Lyr
ATT
Ile
TGG
Trp
ACC
Thr
GAA
Glu
GGT
Gly
GAA
Glu
GCA
Ala
ATT
Ile 120
OTA
Val
OAT
Asp
AGC
Ser
GCA
Ala
GAG
Glu 200
CGA
Arg
MAT
Asn 53 101 149 197 245 293 341 389 437 485 533 581 629 677 725 225 I ~~_LIL~ WO 96/40928 GGC GAT C Gly Asp C PCTICA96/00322 AAA GAA ACA AAA AAA TAAGAGGAAA GTAT ATG ATC 776 ;GT AGT ATA 'ly Ser Ile Lys Giu Thr Lys Lys 240 Met Ile CAA CAT CCA CGT ATT GGG ATT CGT CCG ACT ATT GAT GGT CGT CGT CAA Gin His Pro Arg Ile Gly Ile Arg Pro Thr Ile Asp Arg Arg Gin GGT GTA Gly Vai GTG GCA Val Ala GTG GAA Val Giu GCA GCT Ala Ala ACA GTT Thr Val CCA GAT Pro Asp 100 GGA GCT I Gly Ala 115 ATT CCA Ile Pro ACA GCT Thr Ala GCA GTT Ala Vai GGT AGT C Gly Ser N 180 TTC CAA C Phe Gin C 195 TTC ACG C Phe Thr P CGT GCG C Arg Ala L AAC CGT C
CGC
Arg
GAT
Asp
TGT
Cys
TCC
Ser
ACA
Thr
ATT
Ile 3TC Val
GCC
Ala
ATT
Ile
:TT
Leu
TT
lal
.AA
lu
:GC
~rg
TC
reu
'AA
GAA TCA CTT Glu Ser Leu *TG ATT TCA Leu Ile Ser 40 GTG ATT TCT Val Ile Ser 55 CAT GAG TTG His Glu Leu CCA TGC TGG Pro Cys Trp CCT CAT GCT Pro His Ala TAT CTT GCA I Tyr Leu Ala 120 TIT GGG ATT Phe Gly Ile 135 CCA GAA GAT Pro Glu Asp 150 GCA ACT GGC Ala Thr Gly I TCG ATG GGG 2 Ser Met Gly I 1 TAC TTA GGA I Tyr Leu Gly t 200 CGT ATG GAC C Arg Met Asp E 215 AAA TGG GTG A Lys Trp Val I 230 GAC CTT GTT I
GA
G1 2'
AG(
Sei CC2 Prc
TT
Phe
TG
Cys
ATT
Ile 105
GCT
Ala
TAT
ITyr
GTC
Jal ErG LeU
.TT
[le .85
'TG
et
GT
Lrg
AA
,ys 7AA
GTA
u Val 5
ACA
r Thr k TCT Ser
SAAA
Lys
TAT
Tyr 90
TGG
Trp G OTA Val
GGT
Gly
AAA(
Lys
ATG
Met 170
GGTC
Gly CGA 2 Arg I GGT I Gly I GAA P Glu 2
AGC
Ser 2 250
CAA
Gin
TTG
Leu
ACC
Thr
AAA
Lys 75
GGT
Gly
GGA
Gly
CTA
Leu
AGAA
Arg
GAA
lu L55
AGA
;GT
gly
AT
\sn
TT
:1e
AC
Is !35
ACA
Thr
AAA
Lys
ATT
Ile 60
TCA
Ser
AGT
Ser
ITT
Phe
GCT
Ala
GAT
Asp 140
AAA
Lys
GAC
Asp
TCT
Ser
GAA
Glu
TAC
Tyr 220
GTA
Val ATG AAC Met Asn TAT CCA Tyr Pro 45 *GGT CGT Gly Arg A.AT GTT Asn Val GAA ACT Glu Thr AAT GGG Asn Gly 110 TCA CAT Ser His 125 GTT CAG Val Gin CTT TTA Leu Leu ACT GCT Thr Ala ATT GTA Ile Val 190 TCG GTA Ser Val 205 GAC CCT Asp Pro C AAA GAA C Lys Glu C ATG GCq Met Ale GAT GC Asp Gly GTT CCA Val Pro TOC GCA Cys Ala ATG GAT Met Asp ACA GAA Thr Glu ACT CAA Thr Gin GAA GCT Glu Ala CGT TAT Arg Tyr 160 TAC CTA Tyr Leu 175 !AT CCA Asn Pro GAT ATG Asp Met 3AA GAG 1lu Glu ;GA TTC 'ly Phe 240 AAA AGT Lys Ser GAA CCT Olu Pro GAG GCT Glu Ala ACA ATT Thr Ile ATG TCT Met Ser CGC CCA Arg Pro AAA GGG Lys Gly 130 AAT GAT Asn Asp 145 GCG CGG Ala Arg TCA ATG Ser Met GAT TTC Asp Phe ACG GAG Thr Glu 210 TTC GAA Phe Glu 225 GAC CAT Asp His 872 920 968 1016 1064 1112 1160 1208 1256 1304 1352 1400 1448 1496 1544 ,T GAA GAA AAA GAT AGA CAA TG Asn Arg Glu 245 Asp Leu Val Leu rg Glu Glu Lys Asp Arg Gin Trp 255 -L I _1 I WO 96/40928 GAA TT GTI Glu Phe Val 260 AAC CCA AGA Asn Pro Arg 275 ATT AAG ATG TTC ATG ATT Ile Lys Met Phe Met Ile 265 CTT GCT GAA CTT GGT TTT GGA CGT Gly Arg GAG GAA ATG GTT GGT Met Val Gly GTT GGT CAC CT/CA96/00322 1592 1640 Leu Ala Glu 280 Leu Giy Phe Glu Glu Glu Ala Val Gly His
CAT
His
TTT
Phe
TGG
Trp
CTA
Leu
CAA
Gin 355
CGT
Arg
CAT
His ACT I Thr
AGT
Ser CGC C Arg 435 GGG C Gly I GGT C Gly I GAT C Asp ACT Thr T TAT C Tyr A 515 GCT TTA GTA GCT GGT TTC CAA Ala Leu Val Gly Phe Gin
CCA
Prc
AAT
Asn
AAT
Asn 340
ATC
Ile
GTA
Val
CTA
Leu
CGA
Arg
GAA
Glu 420
,AA
lu
AT
sp
:CA
?ro
;TT
al 'rGG 'rp ;00
,AC
AAT
Asn
GGT
Gly 325
GGT
Gly
TTT
Phe
ACA
Thr
ATC
Ile
GAT
Asp 405
GTA
Val
TAC
Tyr
ATG
Met
GTG
Vai I
CAC
His 1 485 TTT C Phe I GTC I
GGC
Gl 31C
ATT
Ile
GTG
Val
GCT
Ala
GGA
Gly
AAC
Asn 390
GGC
Giy
CAG
G1n
ITC
Phe
CCA
Pro
CTA
Leu 170
:AT
is
;CT
la
\TG
GAC TTT ATG GMA Asp Phe Met Glu
CGA
Arg
TCT
Ser
GAT
Asp
TAT
Tyr 375
TCT
Ser
AAA
Lys
GCT
Ala I
CGT
Arg
GTA
Val 455
CAA
Gin ACT Thr I CCA C Pro P
AAT
AAA
Lys
ATG
Met
GTG
Val 360
ACT
Thr
GGA
Gly
CCT
Pro
ATG
Met
GGA
Gly 440 kCA Thr kTT Ile
ETA
~eu
GT
~rg
CCA
Pro
CTC
Leu 345
CGT
Arg
TTA
Leu
TCT
Ser
GTT
Val
CTT
Leu 425
GGA
Gly
ATG
Met
GCA
Ala C GAT 2 Asp 2 4
TTG
Leu J 505
TTT
Phe 330
TTT
Phe
ACT
Thr
GAG
Glu
TGT
Cys
ATG
Met 410
GAA
Glu
GGA
Gly 3TA Ial
"AA
3lu
AT
Lsn 190
~CA
.hr GGT CAA Gly Gin 300 ACT TTC Thr Phe 315 GTA TTT Val Phe AAT TAT Asn Tyr TAT TGG Tyr Trp GGT CGT Gly Arg 380 ACA TTO Thr Leu 395 AAA CCA Lys Pro AAT ACA Asn Thr 2 TTC TCA I Phe Ser 4 CGT CTC 2 Arg Leu 2 460 GGT TAC P Gly Tyr rj 475 CGT ACA C Arg Thr A GGA MA C Gly Lys C
CG
Are
CT
Let
GCC
Al
CTI
Leu
AGT
Ser 365
GCT
Ala
GAT
Asp
ITC
Phe
GAC
%sp
~CT
1hr 145
AT
~sn
~CA
.hr
AT
sp
GT
ly 9 Gin
AAT
u Asn 3 ACA i Thr
'ITA
Leu 350
CCA
Pro
GCA
Ala
GGT
Gly
TGG
Trp
TTC
Phe I 430
CGT
Arg I CTT Leu I CTT C Leu C CCA C Pro G 4 GCT I Ala P 510 Trr
ACT
Thr
GAG
Glu 335
ACA
Thr
GAG
Glu
GCT
Ala
ACA
Thr
GAG
lu 41S
CCA
?ro
ETC
Phe
ETA
.eu
'AA
;lu
;GA
;ly
TC
he Thr
CAG
Gin 320
AAT
Asn
AAT
Asn
GCT
Ala
GGA
Gly
GGT
Gly 400
TIG
Leu
CCA
Pro
TTG
Leu
AAA
Lys
CTT
Leu 480
TGG
Trp
MG
Lys r CAG TGG ACA GAC CAT *Asp His 305 TT GAC Phe Asp GAT TCA Asp Ser ACT CCA Thr Pro GTT GMA Val Glu 370 TTC TTA Phe Leu 385 CAA GCT Gin Ala GAT GAA Asp Glu GCA AAC Ala Asn ACG MAG Thr Lys 450 GGG GTT Gly Val 465 CCT GAA Pro Glu CCA ACT Pro Thr TCT GTC Ser Val 1688 1736 1784 1832 1880 1928 1976 2024 2072 2120 2168 2216 2264 2312 2360 ~sp Val Met Asn A 5 LAT TGG GGA GCT AAT CAC GGA GCC ATA ACA TAT ~sn Trp Gly Ala Asn His Giy Ala Ile Thr Tyr 20 525 530 I s, WO 96/40928 GGA CAC k.TT GGA GCA GAC FI'G ATT ACC ITG GCT TCT ATG TI'G AGA AT
P(
e Gly His Ile Gly Ala Asp 535 Leu Ile Thr Ala Ser Met Leu Arg Il 545 CCT CA.A Pro Gin GTT AAG Vai Lys CAA TCG Gin Ser 580 GAT GCA Asp Aia 595 GAC CTT Asp Leu ATC AAG Ile Lys CAA GCT Gin Aia GAC GAC Asp Asp 660 GTT GCT Leu Ala 675 GAA GGA Glu Gly GTA GAG Val Asp
ATC
Ile
GCC
Ala 565
AAC
Asn
GAA
Giu
CGT
Arg
GAA
GiU
GCC
Ala 645
ATG
Met
GTT
Val
GCA
Al a
GGA
Gly GAA GTA Giu Val 550 AAA GAC Lys Asp TCA GGT Ser Gly GCA AAC Ala Asn AAT GAA Asn Giu 615 ACT GAA Thr Giu 630 CTT GAT Leu Asp AAA GCA Lys Ala AAA GTG Lys Leu GAA GGC Giu Gly 695 GAG TT Giu Phe
ACA
Thr
CTT
Leu
TTG
Leu
GGT
Al a 600
GTG
Val
GGT
Gly
GAG
Asp
AAA
Lys
TAG
Tyr 680
GGA
Ala
AG
TTT GAG Phe Asp GGA AGT Gly Thr 570 ACT GAG Thr Asp 585 GAA TGG Glu Ser GAG CAA Asp Gin AAA GGG Lys Gly GTT AAG Leu Lys 650 GTT GAA Leu Glu 665 GAA CAA Giu Gin GAkA GGA Gin Ala GAA AAG ATG GAG AAG AAG GOT ATG GTG TGT Ile Asp Lys Asn Gly Ilie Val Ser 555 560 CAA JAAA GAA CAA ACT ATT GTG ATG Gin Lys Giu Gin Thr Ile Val Ile 575 GAA GAA ATG GAG CG ATG ATG AAA Glu Giu Ile Asp Arg Met Met Lys 590 GAT AAG AAA GGT AAA GAA GAA GTA Asp Lys Lys Arg Lys Giu Giu Val 605 610 GGA ATG TI'T GCG ACT GAA AAG ACA Ala Ile Phe Ala Thr Giu Lys Thr 620 625 TI'G GAG GGA GAA CGT GAG GGT GGG Phe Asp Aia Glu Arg Asp Ala Ala 635 640 AAA GGT CAA GAA GAG MGC AAG rTG Lys Ala Gin Glu Asp Asn Asn Leu 655 GGA TTIG AAG GAA AAA GGT CAA GGA Ala Leu Asn Glu Lys Ala Gin Gly 670 GGG GGA GGA GGG CAA GAA GGT CAA Aia Ala Ala Ala Gin Gin Ala Gin 685 690 AGA GGA MGC GGA GG GAT GAG GTG Thr Gly Asn Ala Gly Asp Asp Vai 700 705 TAAGATGAGT GTATTGGATG AAGAGTATGT T/CA96/00322 2408 2456 2504 2552 2600 2648 2696 2744 2792 2840 2888 2942 Thr Giu Lys AAAAAATACA CGAAM-GTTT ATAATGATTT TTIGTAATGAA GGTGATAACT ATAGAACA.
AAAAGATTIT A'ITGATAATA TI'GCAATAGA ATAITTAGCT AGATATAGAG AAAITATA~ AGGTGAGCAT GATAGITGTG TCAAAAATGA TGAAGCGGTA AGGAATT 1G TTAGCTCAC ATTG'ITGTCT GCATI'TGTAT CGGCGATGGT ATGAGGTATG A' ATC INFORMATION FOR SEQ ID NO:2: Wi SEQUENCE CHARACTERISTICS: LENGTH: 242 amino acids TYPE: amino acid TOPOLOGY: iinear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: Met Ser Gin Asp Glu Lys Leu Ile Arg Giu Gin Ile Gys Asp Val Gys 1 5 10 86
['T
3002 3062 3122 3167 WO 96/40928 WO 9640928PCTICA96/00322 His Ser Met Trp Arg Leu Gin Leu Asp Glu Gly Trp Asp Thr Ala Ala Leu Ala Asp Gly Pro Thr Asn Val Gly Ile 40 Ser Gly Ser Phe Ile Thr Pro Glu Lys Leu Val Ile Leu Giu Gly Asp Ty'r Lys Met His Ile Val Pro Ile Ala 145 His Met Leu Leu His 225 Lys Arg Pro Tyr Pro Tyr Thr 165 Glu Cys Tyr Giu Giu Lys Pro Asp Ala Ile Met 140 Met Thr Phe Leu Asn Leu Lys Ser Val Leu Val 125 Glu Leu Al a His Giu Ser His Gly Pro Glu Tyr 175 Arg I Thr Lys Gly Ile GlU Giu Gin Glu Ile Ala 200 205 Arg Pro Thr Ser Met 215 Lys Tyr 230 Arg Glu Asn Gly Lys Val Thr 220 Ser Ile Lys Giy Arg Giu Thr 240 INFORMATION FOR SEQ ID NO:3: r,(iW SEQUENCE CHARACTERISTICS: LENGTH: 714 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: Met Ile Gin His Pro Arg Ile Gly Ile Arg Pro Thr Ile Asp Gly Arg 1 5 10 Arg Gin Gly Val Arg Giu Ser Leu Giu Val Gin Thr Met Asn Met Ala 25 Lys Ser Val Ala Asp Leu Ile Ser Ser Thr Leu Lys Tyr Pro Asp Gly 40 Giu Pro Val Giu Cys Val Ile Ser Pro Ser Thr Ile Gly Arg Val Pro 55 WO 96/40928 Glu Ala PCT/CA96/00322 Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val CYs Ala 70 75 Thr Ile Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr 90 Met Asp Met Ser Pro Arg Pro Gly 115 Pro His Ala Ile 105 Tyr Leu Ala Ala Trp Gly Phe Asn Gly Thr Glu 110 Val Leu Ala Ser His Thr Gin Lys Gly Ile Pro Ala Phe 4 4 6 6 130 Asn Asp 145 Ala Arg Ser Met Asp Phe Thr Glu 210 Phe Gu 225 Asp His Gin Trp 00 Val Gly Gly His r5 290 Asp His 305 0 Phe Asp Asp Ser I Thr Pro C Vai Glu 2 0 370 Phe Leu 1 385 5 Gin Ala Thr Ala Gly Phe 195 Phe Arg Asn Glu Asn 275 His Phe rrp Leu 31n 355 rg Iis 7hr Ala Val Ser 180 Gin Thr Ala Arg Phe 260 Pro Ala Pro Asn Asn 340 Ile Val Leu Arg Ile Leu 165 Val Glu Arg Leu Glu 245 Val Arg Leu Asn Gly 325 Gly Phe 2 Thr C Ile 3 Asp C 405 Pro 150 Ala Ser Tyr Arg Lys 230 Asp Ile Leu Val ly 310 Tie 2 Jai dla I Uy I 3 ~sn S 90 ;iy L Gl) 13E Glu Thr Met Leu Met 215 Trp Leu Lys Ala a 295 %sp krg ;er 
Isp ,yr '75 er ,ys 120 r Ile Ty: Asp Va.
Gly Let Gly liE 18E Gly Met 200 Asp Arg Val Lys Val Leu Met Phe 265 Glu Leu 280 Gly Phe Phe Met Lys Pro Met Leu 345 Val Arg 360 Thr Leu Gly Ser Pro Val r Gly Arg Asp 140 1 Lys Giu Lys 155 i Met Arg Asp 170 Gly Gly Ser Arg Asn Glu Gly Ile Ty r 220 Glu Asn Val 235 Ser Arg Glu 250 Met Ile Gly Gly Phe Glu Gin Gly Gin 300 Glu Thr Phe 315 Phe Val Phe 330 Phe Asn Tyr Thr Tyr Trp Glu Gly Arg 380 Cys Thr Leu 2 395 Met Lys Pro 410 Glu Asn Thr I 125 Va1 Leu Thr Ile Ser 205 Asp Lys Glu Arg Glu 285 Arg Leu Ala Leu I Ser I 365 k1a I ksp C Phe T1 tsp P 4 Gir Le Ala Val 190 Vai Pro Glu Lys Asp 270 Glu GIn A.sn Thr eu 350 ?ro 1 a ily 'rp he 1 Giu Arg Tyr 175 Asn Asp Glu Gly Asp 255 Leu Ala Trp Thr Glu 2 335 Thr I Glu I Ala C Thr G 4 Glu L 415 Pro P Ala Tyr 160 Leu Pro Met Glu Phe 240 Arg Met Vral Ihr 31n 320 ksn s n l a 1 y ;1y 00 eu ro Asp Glu Ser Glu 420 Vai Gin Ala Met Leu 425 II g C ~A WO 96/40928 PCT/CA96/00322 Ala Asn Arg Glu Tyr Phe Arg 435 Gly Gly Gly Phe 440 Ser Thr Arg Phe Leu 445 Thr Lys 450 Gly Val 465 Pro Glu Pro Thr Ser Val Thr Tyr 530 Arg Ile 545 Val Ser Val Ile Met Lys Glu Val 610 Lys Thr 625 Ala Ala Asn Leu Gin Gly I Ala Gin C 690 Asp Val 705 Gly Asp Thr Tyr 515 Gly Pro Val Gin Asp 595 Asp Ile Gln Asp Leu 675 ;lu Val Pro Val Trp 500 Asp His Gin Lys Ser 580 Ala Leu Lys Ala Asp 660 Ala Gly Asp Val His 485 Phe Val Ile Ile Ala 565 Asn Glu Arg Glu Ala 645 Met I Val I Ala C Gly C 7 Gly Asp Met Pro Val Thr Met Val Arg Leu Asn Leu Leu Lys Leu 470 His Ala Met Gly Glu 550 Lys Ser Ala Asn Thr 530 Leu Lys Lys ;lu 455 SGin Thr Pro Asn Ala 535 Val Asp Gly Asn Glu 615 Glu Asp Ala Leu Gly 695 Ile Ala Leu Asp Arg Leu 505 Asn Trp 520 Asp Leu Thr Phe Leu Gly Leu Thr 585 Ala Glu 600 Val Asp Gly Lys Asp Leu Lys Leu 665 Tyr Glu 680 Ala Gin I Glu Asn 490 Thr Gly lie Asp Thr 570 Asp Ser Gln Gly Lys 650 3lu G1n Ala 460 Gly Tyr 475 Arg Thr Gly Lys Ala Asn Thr Leu 540 Ile Asp 555 Gin Lys Glu Glu Asp Lys Ala Ile 620 Phe Asp 635 Lys Ala Ala Leu Ala Ala Thr Gly 700 Thr Asp Gly His 525 Ala Lys 
Glu Ile Lys 605 Phe Ala Gin Asn Ala 685 Asn Leu Pro Ala 510 Gly Ser Asn Gin Asp 590 Arg Ala Glu Glu Glu 670 Ala Ala Glu Gly 495 Phe Ala Met Gly Thr 575 Arg Lys Thr Arg Asp 655 Lys Gin Gly Leu 480 Trp Lys Ile Leu Ile 560 Ile Met Glu Glu Asp 640 Asn Ala Gln Asp ;lu Phe Thr Glu Lys INFORMATION FOR SEQ ID NO:4: SEQUENCE CHARACTERISTICS: LENGTH: 4320 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Streptococcus pneumoniae 89 II -1 WO 96/40928 WO 9640928PCT/CA96/00322 (ix) FEATURE: NAM~E/KEY: CDS LOCATION: 682. .2502 OTHER INFORMATION: /product= ""Heat-shock protein 72"" (ix) FEATURE: NAME/KEY: CDS LOCATI ON: 3 2 65. .4 32 0 OTHER INFORMATION: /product= "NH2-terminai portion of DNA J" (ix) FEATURE: NAME/KEY: mat..peptide LOCATION: 682. .2502 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: AAGCTIGATT CACGCTTTGA AAGAAGAAGG TGACCATMAC TACCATATGG CCATCCAAAC TACCATCGCC CAAGTCTTTC AAAAAGGCTA AATGGTAGTG GTGTATAACT AAGATACAAA GATTGACGAA GTGTCGATG AACACAAGAA GTGTTCGA'IT CGGCAATTCT GACGGTAGCT ATGGCGTITTG TCTAGC'I CC 'ITACTMACTC CGTGTCGCAA TTI'ACATAAT AGAAAACTG TAAAATATGT TTGGCT1TGT AATAGTGAGC TGCGCTAFT TGCGCAAATT TTGAGACCTI' AAGTCAAGCT CTGACGGCGT CGCCACTTAA AATTGAAGAA ATCGCAGCAG ATGGCGAATT
TCTCCCAGCA
CAAACTCCAT
GCCCGTAAAA
AATCTATCTT
AMAGCAACTC
GTCGTCGAA.A
TCCGAAACGA
GAAGCGAACC
AGGCTCAAAG
GAAGAGTATC
GACGATGAAC ACCCAGTAGA GACCGCATCC TACGCCCAGC AGCTCGCAGT AAAAATAGGA 'ITTTACTCAG AGC FIAGGGC GTCAGAAAAC GGCAGTCGCT TAAA-ATCGAT TTCGACTCTI' CAATAAACTA TGAAGAAAGA AAAGACGATA CTCTTCGCTG 'ITTAGTCAAA GAGATTGACA AAAAAGAAAA ATAGAAAATT AACTAACAAG GAGMMAACA C ATG TCT AAA ATT ATC GGT ATT GAC TTA GGT Met Ser Lys Ile Ile Gly Ile Asp Leu Gly ACA ACA MAC TCA GCA GTT GCA Thr Thr Asn Ser Ala Val Ala ATC GCA AAC CCA GAA GGA AAC Ile Ala Asn Pro Giu Gly Asn GTT CTT GAA GGA Val Leu Giu Gly 20 CGC ACA ACT CCA Arg '1h r Thr Pro 35 GGT GAT GCT GCA Gly Asp Ala Ala 50 ACT GAA AGC AAA ATC Thr Giu Ser Lys Ile TCT GTA GTC TCA TI'C Ser Val Val Ser Phe AAA AAC GGA Lys Asn Gly ACA AAC CCA Thr Asn Pro GAA AAA GTT Giu Lys Val GAA ATC ATC GTT Gu Ile Ile Val GAT ACA GTT ATC Asp Thr Val Ile 65 TCT .3CA MAT GGA Ser Ala Asn Gly 80 AMA CGT Lys Arg TCT ATC AMA TCT MAG ATG Ser Ile Lys Ser Lys Met CMA GCA GTT Gin Aia Val GGA ACT TCT Gly Thr Ser GMA ATC TCA Giu Ile Ser AMA GMA TAC Lys Giu Tyr ACT CCA CMA Thr Pro Gin 85 GCT ATG ATC CTT CMA TAC 'PIG AMA GGC TAC GCT GMA GAC TAC CTT GGT Ala Met Ile Leu Gin Tyr Leu Lys Gly Tyr Ala Giu Asp Tyr Leu Gly 100 105 WO 96/40928 GAG AAA GTA ACC Glu Lys Val Thr 110 GCT CAA CGT CAA Ala Gin Arg Gin 125 GTA GAA CGT ATT Val Glu Arg Ile 140 TTG GAC AAG ACT PCT/CA96/00322 C 1047 GCT Ala
ACA
Thr
AAC
Asn
AAA
Lys 160 ACA GTT CCG GCT TAC Thr Val Pro Ala Tyr 115 GCT GGT AAA ATT GCT Ala Gly Lys Ile Ala 135 ACT GCA GCA GCT CTT Thr Ala Ala Ala Leu 150 AAA ATC TTG GTA MTT Leu 155 Asp Lys Thr Asp Glu Glu Lys Ile Leu Val Phe Asp Leu Gly GGT GGT Gly Gly GAC GTA Asp Val GAC CAA Asp Gin GGT ATC Gly Ile 220 GCG GCT Ala Ala 235 ATC AGC Ile Ser GAA ATG Glu Met GTT GAA Val Glu TTG AGC Leu Ser 300 CGT ATC Arg Ile 315 CCA AAC Pro Asn ATC CAA Ile Gin
ACI
Thi
TTG
Leu A2 Lys 205
GAC
Asp
GMA
Glu
TTG
Leu
ACT
Thr
CGT
Arg 285
TTG
Eeu
:CT
Pro Lys 3GT ,ly ITC GAC Phe Asp 175 TCA ACT Ser Thr 190 ATC ATT Ile Ile TTG TCT Leu Ser AAA GCG Lys Ala CCA TTT Pro Phe 255 TTA ACT Leu Thr 270 ACA AAA Thr Lys TCA GAA Ser Glu GCC GTT Ala Val TCA GTA Ser Val 2 335 GGT GTG 2 Gly Vai 350 GTC TCT ATC Val Ser IlE
GCA
Ala
GAC
Asp
ACT
Thr
AAG
Lys 240
ATC
Ile
CGT
Arg 3TT Val
ATC
Ile
GTT
Val 320 kAC ksn kTT Ile GGG GAC Gly Asp CAC TI'G His Leu 210 GAC AAG Asp Lys 225 AAA GAC Lys Asp ACT GCA Thr Ala GCG AAA Ala Lys CCA GTT Pro Val 290 GAC GAA Asp Glu 305 GAA GCT Glu Ala CCT GAT Pro Asp ACT GGT Thr Gly CTT GAA ITG Leu Giu Leu 180 AAC AAA CTT p Asn Lys Leu 195 GTA GCA GAA Val Ala Glu ATG GCA ATG Met Ala Met CTT TCT GGT Leu Ser Gly 245 GGT GAG GCT Gly Glu Ala 260 TTT GAT GAT Phe Asp Asp 275 CGT CAA GCC Arg Gin Ala GTT ATC CTT Val Ile Leu GTT AAA GCT Val Lys Ala 325 GAA GTA GTT Glu Val Val j40 GAT GTC AAG Asp Val Lys 355
GGT
Gly
GGT
Gly
TITC
Phe
CAA
Gin 230
GTA
Val
GGA
Gly
TTG
Leu
CTT
Leu
GTT
Val 310
GAA
Glu
GCT
Ala
GAT
Asp
GA
Asp
GG
Gly
AAC
Lys 215
CGT
Arg
ACT
Thr
CCT
Pro
ACT
Thr
TCA
Ser 295
GGT
Gly
ACT
Thr
ATG
Met
GTT
Val C GGT GTC Gly Val 185 r GAC GAC Asp Asp 200 AAA GAA Lys Glu TTG AA Leu Lys TCA ACA Ser Thr CTT CAC Leu His 265 CGT GAC Arg Asp 280 GAT GCA Asp Ala GGT TCA Gly Ser GGT AAA Gly Lys GGT GCG Gly Ala 345 GTC CTT Val Leu I 360
TC
Phe
TTT
Phe
AAC
Asn
GAT
Asp
CAM
Gln 250
ITG
Leu
CTT
Leu
GGT
Gly
ACT
Thr
GAA
Glu 330
GCT
Ala
CTT
Leu 1095 1143 1191 1239 '287 1335 1383 1431 1479 1527 1575 1623 1671 1719 1767 1815 GAT GTA ACG CCA TTG TCA CTT GOT ATC GAA ACA ATG GOT GGA GTA ITT Asp Val Thr 365 Pro Leu Ser Leu Gly 370 Ile Glu TJhr Met Gly 375 Gly Val Phe ~111~ -4 1 1_ WO 96/40928 ACA AMA CTT ATC GAT CGC Thr Lys Leu Ile Asp Arg 380 AAC ACT ACA ATC Asn Thr Thr Ile 385 CCA ACA TCT AAA TCA CAA Pro Thr 390 Ser Lys Ser G.
GTC
Val 39
CT]
Leu
TTC
Phe
GAA
Glu
AAA
Lys
TCA
Ser 475
GCA
Ala
AAT
Asn
ACT
Thr
CTT
Leu
AAA
Lys 555
A
Lys
GAA
Glu
GAG
Giu TITC TC, *Phe Se: CAA GG'.
IGin G1, CAA TC *Gin Lei *GTA AC; Val Thi 44E GAC CT] Asp LeL 460 GGT TTG- Gly Leu AAC GCT Asn Ala GAA GTG Giu Val GAA GGT Giu Gly 525 GAT GAC Asp Asp 540 GCA AAA Ala Lys CTC TAC Leu Tyr GGC GCA Gly Ala TTT ACG Phe Thr kACA GC; r- Thr AlE r' GMA CGC Giu Arg 415 ACT GA7 Thffr Asp 430 TTT GAC Phe Asp GGA ACT Gly Thr ACT GAC Thr Asp GAA TCC Glu Ser 495 GAC CAA Asp Gin 510 AAA GGC Lys Gly CTT AAG Leu Lys CTT GAA Leu Glu GAP. CAA Giu Gin 575 CAA GCA Gin Ala 590 GAA AAG Giu Lys GCA GAC .AAC CAA CCA GCC GTT GAT ATC CAC GTT Ile Pro Ala Pro Arg Gly ATC GAC Ile Asp CAA A Gin Lys 465 GAA GAA Giu Giu 480 GAT MAG Asp Lys GCA ATC Ala Ile TT GAC Phe Asp MAA GCT Lys Ala 545 GCA 'ITG Ala Leu 560 GCC GCA Ala Ala ACA GGA
AAG
Lys 450
GAA
Glu
ATC
Ile
AAA
Lys Phe
GCA
Ala 530 Gln
AAC
Asn
GCA
Ala
AAC
AAC GGT Asn Gly CAA ACT Gln Thr GAC CGC Asp Arg CGT AAA Arg Lys 500 GCG ACT Ala Thr 515 GAA CGT Glu Arg GAA GAC Glu Asp GAA AAA Glu Lys GCG CAA Ala Gln 580 GCA GGC
ATC
Ile
ATT
Ile
ATG
Met 485
GAA
Glu
GAA
Glu
GAC
Asp
AAC
Asn
GCT
Ala 565
CAA
Gln
GAT
GTG
Val
GTC
Val 470
ATG
Met
GAA
Glu
AAG
Lys
GCT
Ala
AAC
Asn 550 Sin
GCT
Ala
GAC
Asp
AC]
AT]
Ile
TCT
Ser 455
ATC
Ile
AAA
Lys
GTA
Val
ACA
Thr
GCC
Ala 535
TTG
Leu
GGA
Gly
CAA
Gln
GTC
Ile His Val 410 CTT GGA CGC Leu Gly Arg 425 CCT CMA ATC Pro Gin Ile 440 GTT MAG GCC Val Lys Ala CMA TCG MAC Gin Ser Asn GAT GCA GMA Asp Ala Giu 490 GAC CTT CGT Asp Leu Arg 505 ATC MAG GMA Ile Lys Giu 520 CMA GCT GCC Gin Ala Ala GAC GAC ATG Asp Asp Met CTT GCT GTT Leu Ala Val 570 GMA GGA GCA Glu Gly Ala 585 GTA GAC GGA Val Asp Gly CT/CA 96/00322 1863 1911 1959 2007 2055 2103 2151 "199 2247 2295 2343 2391 2439 2487 2542 Thr Gly Asn Ala Gly Asp Asp Val TMAGATGAGT GTATTGGATG MAGAGTATCT AAAAXATACA 605 CGAAMAGTrI' ATM.ATGA'ITT TI'GTMTCMA GCTGATMACT ATAGMACATC AAMAGAT TTT ATTGATMATA TTCCM-TAGA ATATTTAGCT AGATATAGAG MAATTATAFI AGCTGAGCAT GATAGTTGTG TCAAAPATGA TGAAGCGGTA AGGMATrTTG, TACCTCAGT ATTGTTGTCT GCATTTGTAT CGGCGATGGT ATCAGCTATIG ATATCATTAC '\?\ATACAMAC ATATAMFT GTMATACCGT TCATMATTGG TATGATITGG ACAGTAGTI'G TAFI'TCTI'AT GATCMATTGG 2602 2662 2722 2782 2842
w 0 96/40928 PCT AATTATATAG GCAAATACTA AGAAGAGACA AAA.ATATATA AATATTTCTG TACTTATAGG ATAT'ITAA.AA TCCAA.ATAAA GTTAATTTAC TTATTTGCAG AGGTTGCAAC CCAGCCTCTG TIT1'TCGATA MAAAGGGACG G.AATCTCATT TGT'ITGGGTT TTGTCTCATC AATAGAAAGG A.ACAAAGAGT GITCGTAACT GAACACGGGT TTCAGAATI'T CTTACTAMAT ATAAAAGAAA GGAA'FTGAAC CCGACCTAA.A TGGTGGTTCG AITCAGAACA TCAATAGAAA GGAATAAGGG TGITCGTAAC TGAACACGGG CTACGGACTG TGCCAAAAAG ATAGTTTTT CTAGGA-GTA AGCGTCCGTC GTCAAMACTC CTAGATGGCT GTGTCCGTTT GACGCCCTTT GTATCTTGMA TT? ATO AAC AAT ACT GAA TT1' TA~T GAT CGT CTG GGG GTA TCC AAA AAC Met Asn Asn Thr Glu Phe Tyr Asp Arg Leu Gly Val Ser Lys Asn 2. 5 10 GCT TCG GCA GAC GAA ATC AJA AAG GCT TAT CGT MAG CTT TCC AAA AAA /CA96/00322 2902 2 962 3022 1082 3142 3202 3262 3309 3357 3405 3453 3501 3549 Ala Ser Ala Asp Giu Ile Lys Lys TAT CAC CCA GAT ATC Tyr His Pro Asp Ile A.AC AAG GAG Asn Lys Glu Ala Tyr Arg Lys Leu CCT GGT GCT GAG GAC Pro Gly Ala Giu Asp 40 TJG AGT GAC GAC CA.
Leu Ser Asp Asp Gin Ser Lys Lys AAG TAC AAG Lys Tyr Lys AAA CGT GCT Lys Arg Ala TTT GGT GGk.
Phe Gly Gly GA-k GTT CA.A GAA GCC TAT Giu Val Gin Giu Ala Tyr GAG ACT Giu Thr GCC TAT Ala Tyr GCT GOT Ala Gly s0 GAC CAG TAT GGT Asp Gin TEyr Gly GCA GGC GCC Ala Giy Ala AAT GGG GCA Asn Gly Ala AAT GGT GGT Asn Gly Gly GGT ITC GGC Gly Phe Gly GGC TTC GGT GGT Gly Phe Gly Gly GAG GAT ATT TTC Glu Asp Ile Phe AGT TTC ITC GGC Ser Phe Phe Gly GGC GGT TCT TCG Gly Gly Ser Ser CGC MAT Arg Asn 110 CCA AAC GCT Pro Asn Al a ACC TTT GAA Thr Phe Glu 130 CGT GAA GCT Arg Giu Ala CAA GGA GAT Gin Gly Asp GAT CTC CAG Asp Leu Gin 120 TAT CGT Tyr Arg GCT ATC TTC Ala Ile Phe
ACT
Thr
AAT
Asn GGC TGT CGT Gly Cys Arg GAG AAG GAA GTT Giu Lys Giu Val 140 GGA TCT CST GCT Gly Ser Gly Ala 155 CAT GGC GCT GGT His Gly Ala Giy 170 GTC AAT TI'G Val Asn Leu 125 AAG TAT CAT Lys Tyr His AAG CCA GGG Lys Pro Gly GTC ATT MAC Val Ile Asn 175 145 ACA AGT Thr Ser 160 3597 3645 3693 3741 3789 3837 3885 3933 CCA GTC ACT Pro Val Thr CGC TGT Arg Cys GTC GAT ACG CAG ACT Val Asp Thr Gin Thr 180 CCT CTT GGT Pro Leu Gly ATG ATG Met Met 185 GMA ATC Glu Ile 200 CGT CGC CMA GTA ACC TGT Arg Arg Gin Val Thr Cys 190 AAA TAT CCA TGT ACA ACC Lys Tyr Pro Cys Thr Thr 205 GAT GTC TGT CAC GGT CGA GGA AAA Asp Val Cys His Gly Arg Gly Lys 195 TGT CAT GGA ACA GGT CAT GAG AMA CYS His Gly Thr Gly {-is Glu Lys 210 215 CMA GCT CAT AGC GTA CAT GTG MA Gin Ala His Ser Val His Val Lys 220 WO 96/40928 ATC CCT GCT Ile Pro Ala 225 GGT GAA GCA Gly Giu Ala 240 GTT TCT GTG Val Ser Val.
TrTC TAC AAT Phe Ty r Asn GTA GAT ATT Val Asp Ilie 290 GGA ACT CAG Gly Thr Gin 305 AGC CTT CGT Ser Leu Arg 320 GTA ACA CCG Val Thr Pro
TTC
Phe PCTICA96/00322 kA 3981 *GTG GAA ACA Vai Giu Thr 230 TTT MAC GGT Phe Asn Giy 245 GCT AGT GAC Ala Ser Asp 260 AAC CTC MAC Asn Leu Asn ACT GTT CAC Thr Val His GGT MAG MAA Giy Lys Lys 310 GGT GCA GTT Gly Aia Val 325 GGC 'FIG MAC Gly Leu Asn 340 GGT CMA CMA Gly Gin Gin GGA CCT TAT Giy Pro Tyr MAG 'FIT GMA Lys Phe Giu 265 TT7 GTC CMA Phe Val Gin 280 GGT GAT GTT Giy Asp Vai 295 TTC CGC CTA Phe Arg Leu GGT GA C CMA Giy Asp Gin GAC CGC CMA Asp Arg Gin 345 ATT CGC CTC Ile Arg Leu 235 GGT GAC TTG Gly Asp Leu 250 CGT GMA GGA Arg Giu Giy GCG GCT CTT Aia Ala Leu GM 'FIG GTT Giu Leu Vai 300 CGT AGT MAG Arg Ser Lys 315 TAC GTT ACT Tyr Vai Thr 330 AAA GTA GCC Lys Val Aia GCT GGT Ala Giy TAT GTA Tyr Val ACG ACT Thr Thr 270 GGT GAT Giy Asp 285 ATT CCA Ile Pro GGG GCA Giy Aia GTT MAT Val Asn 'FIG A Leu Lys 350 4029 4077 4125 4173 4221 4269 4317 4320 INFORMATION FO)R SEQ ID NO: SEQUENCE CHARACTERISTICS: LENGTH: 607 amino acids fB) TYPE: amino acid TOPOLOGY: iinear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID Met Ser Lys Ile Ilie Gly Ilie Asp Leu Gly 1 5 Aia Val Leu Giu Giy Thr Giu Ser Lys Ile Asn Arg Thr Thr Pro Ser Val Vai Ser Phe Val Gly Asp Ala Ala Ly-, Arg Gin Ala Vai 50 55 Ile Ser Ilie Lys Ser Lys Met Gly Thr Ser 70 Gly Lys Giu Tyr Thr Pro Gin Giu Ilie Ser Leu Lys Gly Tyr Ala Glu Asp Tyr Leu Gly 100 105 NO0: Thr Thr Ile Ala Lys Asn flir Asn Glu Lys Ala Met Glu Lys WO 96/40928 'ral Ile Thr Val Pro Ala Tyr Phe Asn Asp Ala Gin 115 120 PCT/CA96/00322 Arg Gin Ala Thr 125 Lys Asp Ala Gly Lys 130 Glu Pro Thr Ala AlaI Ile Ala Gly Leu Glu Val Glu 1 145 Glu Ser Gly ,0 His Asp 225 Lys Thr Ala Pro Asp 305 Glu Pro Thr Leu Asn 385 Asp Met I Pro 1 Asp I 4 Lys G 465 Glu Ile Asp Leu 210 Lys Asp Ala Lys Val 290 Glu Ala Asp Gly Gly 370 Thr \sn Ala Ula Lys 150 Lys Leu Asn 195 Val Met Leu Gly Phe 275 Arg Val Val Glu Asp 355 Ile Thr Gin Ala Ala 435 Asn Ile Glu 130 Lys Ala Ala Ser Glu 260 Asp Gin Ile Lys Val 340 Val Glu Ile Pro Asp 420 Pro Gly Leu 165 Leu Leu Glu Met Gly 245 Ala Asp 
Ala Leu Ala 325 Val Lys Thr Pro Ala 405 Asn I Arg G Ile V Al 15 Va.
GlI G12 Gly Phe Glr 230 Val Gly Leu Leu Val 310 Glu Ala Asp let Thr 90 Val ys ;ly 'al 135 a Leu 0 1 Phe y Asp SGly Lys 215 1 Arg Thr Pro Thr Ser Gly Thr Met Val Gly 375 Ser I Asp Thr I Ile F 4 Ser V 455 Al Asp Gly Asp 200 Lys Leu Ser Leu Arg 280 Asp Gly Gly Gly Val 360 Gly .ys :le Leu Pro 40 'al a Tyr SLeu Val 185 SAsp Glu Lys Thr His 265 Asp Ala Ser Lys Ala 345 Leu Val Ser His 4 Gly 1 425 Gin I Lys Gl Gl 17 Ph Ph Asi Asi Glr 250 Leu Leu Gly Thr Glu 330 Ala Leu Phe 3ln Val 110 Arg :le ila y Leu 155 y Gly 0 a Asp a Asp I Gly Ala 235 Ile SGlu Val SLeu Arg 315 Pro Ile Asp Thr Val 395 Leu Phe Glu Lys 1 4 Ser G 475 140 Asp Gly Val Gin Ile 220 Ala Ser Met Glu Ser 300 Ile Asn 1n Val Lys I 380 Phe S 31n C Gln L Val 4 Asp L 160 ;ly L Ar Lys Thr Leu Lys 205 Asp Glu Leu Thr Arg 285 Leu Pro Lys Gly Thr 365 Leu Ser Gly Leu 'hr eu eu g Ile Val Asn SThr Asp Lys 160 Phe Asp Val 175 SSer Thr Ala 190 Ile Ile Asp Leu Ser Thr Lys Ala Lys 240 Pro Phe lie 255 Leu Thr Arg 270 Thr Lys Val Ser Glu Ile Ala Val Val 320 Ser Val Asn 335 Gly Val Ile 350 Pro Leu Ser Ile Asp Arg Thr Ala Ala 400 Glu Arg Pro 415 Thr Asp Ile 430 Phe Asp Ile Gly Thr Gin Thr Asp Glu 480 lu Gin Thr Ile Val 470 Ile Gin Ser Asn
I I_ WO 96/40928 Glu Ile Asp Arg Met Met Lys Asp Ala Glu Ala 485 490 Lys Lys Arg Lys Glu Glu Val Asp Leu Arg Asn 500 505 Ile Phe Ala Thr Glu Lys Thr Ile Lys Glu Thr 515 520 Asp Ala Glu Arg Asp Ala Ala Gin Ala Ala Leu 530 535 Ala Gin Glu Asp Asn Asn Leu Asp Asp Met Lys 545 550 555 Leu Asn Glu Lys Ala Gin Gly Leu Ala Val Lys 565 570 Ala Ala Ala Gin Gin Ala Gin Glu Gly Ala Glu 580 585 Gly Asn Ala Gly Asp Asp Val Val Asp Gly Glu 595 600 INFORMATION FOR SEQ ID NO:6: SEQUENCE CHARACTERISTICS: LENGTH: 352 amino acids TYPE: amino acid TOPOLOGY: linear PCT/CA96/00322 Ser Asp 495 Gin Ala Gly Phe Lys Lys Glu Ala 560 Gin Ala 575 Ala Thr Lys iii) MOLECULE TYPE: protein (xi) SEQUENCE Met Asn Asn Thr Glu 1 Ser Ala Asp Glu Ile His Pro Asp Ile Asn Val Gin Glu Ala Tyr Tyr Asp Gin Tyr Gly Gly Gly Phe Gly Gly Asp Ile Phe Ser Ser 100 Asn Ala Pro Arg Gin 115 Phe Glu Glu Ala Ile 130 Glu Ala Gly Cys Arg 145 Ser Pro Val Thr Cys 165 DESCRIPTION: SEQ ID NO:6: Phe Tyr Asp Arg Leu Gly Val Ser Lys Asn Ala Tyr Arg Gly Ala Ser Asp Ala Asn Ala Gly Gly Gly 105 Leu Gin Glu Lys Gly Ser His Gly 170 I ~1 I WO 96/40928 Asp Thr PCT/CA96/00322 Gin Thr Pro Leu Gly Met Met Arg Arg Gin Val Thr Cys Asp 180 185 190 Val Cys His Gly 195 His Gly Thr Gly 210 Pro Ala Gly Val 225 Glu Ala Gly Phe Ser Val Glu Ala 260 Tyr Asn Leu Asn 275 Asp Ile Pro Thr 290 Thr Gin Thr Gly 305 Leu Arg Gly Gly Arg Gly Lys Glu Ile 200 His Glu Lys Gin Ala 215 Glu Thr Gly Gin Gin 230 Asn Gly Gly Pro Tyr 245 Ser Asp Lys Phe Glu 265 Leu Asn Phe lal Gin 280 Val His Gly Asp Val 295 Lys Lys Phe Arg Leu 310 Ala Val Gly Asp Gin 325 Lys Tyr Pro His Ile Gly 250 Arg Ala Glu Arg Tyr' 330 Thr Thr Cys Val Lys Ile Gly Gln Gly 240 Val Val Val 255 Thr Ile Phe 270 Asp Thr Val Pro Glu Gly Ala Pro Ser 320 Asn Val Val 335 Lys Glu Phe 350 Thr Pro Thr Gly Leu 340 Asn Asp Arg Lys Val Ala Leu INFORMATION FOR SEQ ID NO:7: SEQUENCE CHARACTERISTICS: LENGTH: 15 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID 
NO:7:
Thr Ser Thr Gln Ile Ser Leu Pro Phe Ile Thr Ala Gly Glu Ala
1               5                   10

INFORMATION FOR SEQ ID NO:8:
SEQUENCE CHARACTERISTICS:
  LENGTH: 15 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Thr Ala Gly Glu Ala Gly Pro Leu His Leu Glu Met Thr Leu Thr
1               5                   10

INFORMATION FOR SEQ ID NO:9:
SEQUENCE CHARACTERISTICS:
  LENGTH: 14 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
Met Thr Leu Thr Arg Ala Lys Phe Asp Asp Leu Thr Arg Asp
1               5

INFORMATION FOR SEQ ID NO:10:
SEQUENCE CHARACTERISTICS:
  LENGTH: 15 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
Asp Asp Leu Thr Arg Asp Leu Val Glu Arg Thr Lys Val Pro Val
1               5                   10

INFORMATION FOR SEQ ID NO:11:
SEQUENCE CHARACTERISTICS:
  LENGTH: 14 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
Thr Lys Val Pro Val Arg Gln Ala Leu Ser Asp Ala Gly Leu
1               5

INFORMATION FOR SEQ ID NO:12:
SEQUENCE CHARACTERISTICS:
  LENGTH: 15 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:
Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile
1               5                   10

INFORMATION FOR SEQ ID NO:13:
SEQUENCE CHARACTERISTICS:
  LENGTH: 14 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
Leu Thr Asp Glu Ile Asp Arg Met Met Lys Asp Ala Glu Ala
1               5

INFORMATION FOR SEQ ID NO:14:
SEQUENCE CHARACTERISTICS:
  LENGTH: 24 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
Lys Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu
1               5                   10
Val Asp Leu Arg Asn Glu Val Asp

INFORMATION FOR SEQ ID NO:15:
SEQUENCE CHARACTERISTICS:
  LENGTH: 15 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr Ile Lys
1               5                   10

INFORMATION FOR SEQ ID NO:16:
SEQUENCE CHARACTERISTICS:
  LENGTH: 28 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
Glu Lys Thr Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg
1               5                   10
Asp Ala Ala Gln Ala Ala Leu Asp Asp Leu Lys Lys

INFORMATION FOR SEQ ID NO:17:
SEQUENCE CHARACTERISTICS:
  LENGTH: 30 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
Lys Ala Gln Glu Asp Asn Asn Leu Asp Asp Met Lys Ala Lys Leu Glu
1               5                   10
Ala Leu Asn Glu Lys Ala Gln Gly Leu Ala Val Lys Leu Tyr
                                25

INFORMATION FOR SEQ ID NO:18:
SEQUENCE CHARACTERISTICS:
  LENGTH: 25 amino acids
  TYPE: amino acid
  TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
Gln Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp
1               5                   10
Val Val Asp Gly Glu Phe Thr Glu Lys
            20

INFORMATION FOR SEQ ID NO:19:
SEQUENCE CHARACTERISTICS:
  LENGTH: 2183 base pairs
  TYPE: nucleic acid
  STRANDEDNESS: double
  TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
  ORGANISM: Streptococcus pyogenes
(ix) FEATURE:
  NAME/KEY: CDS
  LOCATION: 204..2030
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:
CAGCGATGGT AGTTGTTTAT AACTAAGGTA AATGAGTTTT CGTTTTTGTC CGTAATGACA
GTAAACTAGA TAGCAAGTTA GAAGCTATTT CGCTTGCTGA TTAAACTATA GTGATTGCTT
AGAATTGGAA GTAAAATAAT TCGAGTGCTT ACTAAGATAA ATTGAAATAA AAAGTAATAA
AGTATAAAAT AAGAGGTATT AAC ATG TCT AAA ATT ATT GGT ATT GAC TTA
                          Met Ser Lys Ile Ile Gly Ile Asp Leu
                          1
GGT ACA ACA AAC TCA GCA GTA GCA GTT CTT GAA GGG ACT GAA TCA AAA
Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser Lys
                    15                  20
ATC ATT GCT AAC CCA GAA GGC AAT CGT ACA ACT CCT TCA GTA GTA TCA
Ile Ile Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val Ser
                                    35
TTC AAA AAT GGT GAA ATT ATC GTG GGT
GAT GCT GCA AAA CGC CAA GCA Phe Lys Asn Gly Glu Ile Ile Val Gly Asp Ala Ala Lys Arg Gin Ala 50 WO 96/40928 GTG ACA AAC CCA GAA ACA GTA ATC TCT ATT AAA TCT ALA ATO GGA ACT Val Thr Asn Pro Giu Thr Val Ile Ser Ile Lys Ser Lys Met Gly Thr 65 TCT GAA AAA Ser Giu Lys TCA GCA ATO Ser Ala Met OGA GAA AAA GLy Giu Lys OAT OCA CAA Asp Ala Gin GAA GTA GAA Glu Val Glu 140 GGT ATG GAC Gly Met Asp 155 GOT GOT GGT Gly Oly Gly 170 TTC GAC GTT Phe Asp Val TTT GAC CAA Phe Asp Gin AAT GOT ATT Asn Oly Ile 220 OAT OCT OCT Asp Ala Ala 235 CAA ATT TCA Gin Ile Ser 250 TTA GAG ATG Leu Giu Met CTT GTT GAA Leu Val Glu OGA TG TCA Gly Leu Ser 300 OTT TCT OCA Vai Ser Ala ATT CTT CAA Ile Leu Gin 95 GTA OAA AAA Val Glu Lys 110 CGT CAA OCA Arg Gin Ala 125 CGT ATC OTT Arg Ile Val AAO ACT GAC Lys Thr Asp ACA TTT GAC Thr Phe Asp 175 CTT OCA ACA Leu Ala Thr 190 AAA ATT ATT Lys Ile Ile 205 GAC TA TCA Asp Leu Ser GAA AAA GCT Glu Lys Ala ITA CCG TTC Leu Pro Phe 255 AGC TTA TCT Ser Leu Ser 270 0GT ACO AAA Arg Thr Lys 285 ITO TCA GAA 2 Leu Ser Glu
AA
As 8[
TAC
Tyl GC1 Ala
ACI
Thr
AAI
Asn
AAG
Lys 160
GTA
Val
GCA
Ala
GAT
Asp
CAA
Gln
AAA
Lys 240
ATC
Ile
CGT
Arg
ACT
T'hr kTT Ile T GOT I Gly
CTT
Leu
GTT
Val
AAA
Lys
GAA
Glu 145
GAT
Asp
TCA
Ser
GGT
Gly
TTC
Phe OAT 2 Asp I 225
AA.C
Lys 2 ACT C Thr OCT Ala I CCA C Pro V 2 OAT C Asp G 305
AAA
Lys
AAA
Lys
ATT
Ile
GAC
Asp 130
CCA
Pro
GAM
Glu iTC Ile
GAT
Asp
TTA
Leu 210 G4 Lys
GAT
Asp
CT
la LkA ,ys
;TT
tal 90
'AA
;lu GAA TAT ACT CC9 Glu Tyr Thr Prc GGT TAT Oly Tyr 100 ACT OTT Thr Val 115 OCT GGT Ala Gly ACA OCA Thr Ala AAA ATC Lys Ile CTT OAA Leu Olu 180 AAC AA Asn Lys 195 GTG GCT Val Ala ATG GCA Met Ala CTT TCA Leu Ser I GGT TCT Oly Ser 260 TTT GAC Phe Asp 275 OCT GAA Ala Glu CCA GCI Pro Ala AAA ATT Lys Ile OCT OCA Ala Ala 150 TTA OTT Leu Val 165 TTA GGT Ieu Gly CTT GGT Leu Oly GAA TTT Glu Phe CTT CAA Leu Gin 230 GOT GTG Gly Val 245 GCT GGT Ala Oly GAT CTC Asp Leu CM-A GM.A ATT Gin Giu Ile GAC TAT CTT Asp Tyr Leu 105 TAT 'TC AAC Tyr Phe Asn 120 OCA GOT CTT Ala Gly Leu 135 CTT OCT TAT Leu Ala Tyr TTT GAC CTT Phe Asp Leu OAT GOT GTC Asp Gly Val 185 GGT GAC GAC Oly Asp Asp 200 AAG AAA 3MA Lys Lys Glu 215 COC TTG AAA Arg Leu Lys ACA CM.A ACA Thr Gin Thr CCT CTT CAC Pro Leu His 265 ACT COT GAC Thr Arg Asp 280 TCA GAT OCA CT/CA96/00322 422 470 518 566 614 662 710 758 806 854 902 950 998 1046 1094 CGT CAA GCT CTT Arg Gin Ala Leu Ser Asp Ala GTT ATC Val Ile 295 CTT OTT GOT GOA TCA Leu Val Oly Oly Ser 310 1142 1190 ACT CGT Thr Arg 315 ATC CCA OCA OTT GTC GAA GCT OTA AAA GCT GMA ACT GOT AAA Ile Pro Ala Val Val Olu Ala Val Lys Ala Glu Thr Gly Lys I L I= WO 96/40928 PCT/CA96/00322 GAA CCA AAT AAA TCT GTA AAC CCT GAT GAA GTG GTT GCT ATG GGT GCT 1238 Glu Pro Asn Lys Ser Val Asn Pro Asp Glu Val Val Ala Met Gly Ala 330 335 340 345 GCT ATC CAA GGT GGG GTT ATC ACT GGG GAT GTG AAA GAC GTT GTC CTT 1286 Ala Ile Gin Gly Gly Val Ile Thr Gly Asp Val Lys Asp Val Val Leu 350 355 360 CTT GAC GTA ACA CCA TTG TCA CTT GGT ATT GAA ACA ATG GGT GGT GTC 1334 Leu Asp Val Thr Pro Leu Ser Leu Gly Ile Glu Thr Met Gly Gly Val 365 370 375 TTC ACT AAA TTG ATC GAC CGC AAT ACA ACT ATC CCA ACA TCT AAA TCA 1382 Phe Thr Lys Leu Ile Asp Arg Asn Thr Thr Ile Pro Thr Ser Lys Ser 380 385 390 CAA GTC TTIC TCA ACA GCA GCA GAC AAC CAA CCA GCC GTT GAT ATC CAT 1430 Gin Val Phe Ser Thr Ala Ala Asp Asn Gin Pro Ala Val Asp Ile His 395 400 405 GTT CTT CAA GGT GAA CGC CCA ATG GCA GCA GAT AAC AAG ACT CTT GGT 1478 Val Leu Gin Gly Glu 
Arg Pro Met Ala Ala Asp Asn Lys Thr Leu Gly 410 415 420 425
CGC TTC CAA TTG A.  GAT ATC CCA GCT GCA CCT CGT GGA ATC CCA CAA  1526
Arg Phe Gln Leu Thr Asp Ile Pro Ala Ala Pro Arg Gly Ile Pro Gln
                430                 435                 440
ATT GAA GTA ACA TTT GAT ATC GAT AAA AAC GGT ATT GTT TCT GTA AAA  1574
Ile Glu Val Thr Phe Asp Ile Asp Lys Asn Gly Ile Val Ser Val Lys
            445                 450                 455
GCT AAA GAC CTT GGT ACG CAA AAG GAA CAA CAC ATC GTT ATC AAA TCA  1622
Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln His Ile Val Ile Lys Ser
        460                 465                 470
AAC GAC GGA CTT TCT GAA GAA GAA ATT GAT CGC ATG ATG AAA GAC GCT  1670
Asn Asp Gly Leu Ser Glu Glu Glu Ile Asp Arg Met Met Lys Asp Ala
    475                 480                 485
GAA GCT AAT GCC GAA GCC GAT GCG AAA CGT AAA GAA GAA GTT GAC CTT  1718
Glu Ala Asn Ala Glu Ala Asp Ala Lys Arg Lys Glu Glu Val Asp Leu
490                 495                 500                 505
AAA AAC GAA GTT GAC CAA GCT ATC TTT GCT ACT GAA AAA ACA ATC AAA  1766
Lys Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr Ile Lys
                510                 515                 520
GAA ACT GAA GGT AAA GGC TTT GAC ACA GAA CGC GAT GCA GCG CAA TCA  1814
Glu Thr Glu Gly Lys Gly Phe Asp Thr Glu Arg Asp Ala Ala Gln Ser
            525                 530                 535
GCT CTT GAC GAG TTA AAA GCT GCG CAA GAA TCT GGC AAC CTT GAC GAC  1862
Ala Leu Asp Glu Leu Lys Ala Ala Gln Glu Ser Gly Asn Leu Asp Asp
        540                 545                 550
ATG AAA GCT AAA CTT GAA GCA TTA AAT GAA AAA GCG CAA GCT TTG GCT  1910
Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Ala Leu Ala
    555                 560                 565
GTT AAA ATG TAC GAG CAA GCT GCA GCA GCT CAA CAA GCA GCA CAA GGT  1958
Val Lys Met Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Ala Gln Gly
570                 575                 580                 585
GCA GAA GGT GCA CAA GCT AAT GAT TCA GCA AAT AAT GAT GAT GTT GTA  2006
Ala Glu Gly Ala Gln Ala Asn Asp Ser Ala Asn Asn Asp Asp Val Val
                590                 595                 600
GAT GGC GAA TTT ACA GAA AAG TAATGATTTA GTTATCTAGT AACATTAATA  2057
Asp Gly Glu Phe Thr Glu Lys
            605
TCCGAATTCA GAGGTTGTAC CAAACCTCTG TTTTTGGCTA AATAAAATGT AAAAA-4CTG  2117
ACGTCAAAAT ATTTTAAZGAA AGGAATACAA GTTCGATTAT TCGAACACAG GCTAAAGCGT  2177
GTAAAG  2183

INFORMATION FOR SEQ ID NO:20:
SEQUENCE CHARACTERISTICS:
LENGTH: 608
amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID Met Ser Lys Ile Ile Gly Ile Asp Leu 1 Al a Asn Val Ile Gly Leu Val Lys Gi u 145 Asp Ser Gly Phe Asp 225 Lys Gly Pro Ala Ser Thr 85 Al a Pro Lys Al a Leu 165 Leu Leu Gi u Leu Gly 245 Gly Thr Ile Ile Phe Lys Val Thr Ser Glu Ser Ala Gly Glu Asp Ala Giu Val Gly Met 155 Gly Gly 170 Phe Asp Phe Asp Asn Gly Asp Ala 235 Gin Ile 250 Thr Asn Ser Ala Val Gi u Ile Thr Ala
GI
Ala Val Asp Asp 175 Thr Ile Ser Al a Phe 255 WO 96/40928 Thr Ala Gly Ser 260 Ala Lys Phe Asp 275 Pro Val Arg Gin 290 Asp Glu Val Ilie 305 Glu Ala Val Lys Pro Asp Glu Val 340 Thr Gly Asp Val 355 Leu Gly Ile Glu 370 Asn Thr Thar Ilie 385 Asp Asn Gin Pro Met Ala Ala Asp 420 Pro Ala Ala Pro 435 Asp Lys Asn Gly 450 Lys Glu Gin His 465 Glu Ile Asp Arg Ala Lys Arg Lys 500 Ile Phe Ala Thr 515 Asp Thr Glu Arg 530 Ala Gin Glu Ser 545 Leu Asn Glu Lys Ala Ala Ala Gin 580 Asp Ser Ala Asn 595 Al a Asp Al a Leu Al a 325 Val Lys Thr Pro a 405 A~n Arg Ile Ilie Met 485 Glu Giu Asp Gly Al a 565 Gin Asn Gly Leu Leu Val 310 Giu Ala Asp Met Thr 390 Val Lys Gly Val Val 470 Met Giu Lys Al a Asn 550 Gin Al a Asp Pro Thr Ser 295 Gly Thr Met Val Gly 375 Ser Asp Thr Ile Ser 455 Ilie Lys Val Thr Al a 535 Leu Al a Al a Asp Leu Arg 280 Asp Gly Gly Gly Val 360 C-'Ily Lys Ile Leu Pro 440 Val Lys Asp Asp Ilie 520 Gin Asp Leu Gin Val 600 His 265 Asp Ala Ser Lys Ala 345 Leu Val Ser His Gly 425 Gin Lys Ser Al a Leu 505 Lys Ser Asp Al a Gly 585 Val Leu *Leu Gly Thr Glu 330 Ala Leu Phe Gin Val 410 Arg Ile Al a As n Giu 490 Lys Gi U Ala Met Val 570 hl a Asp Ser Arg 285 Leu Pro Lys Gly Thr 365 Leu Ser Gly Leu Thr 445 Leu Leu Al a Val Gly 525 Giu Lys Tyr Al' a Phe 605 Leu Ser 270 Thr Lys Ser Giu Ala Val Ser Val 335 Gly Val 350 Pro Leu Ile Asp Thr Ala Giu Arg 415 Thr Asp 430 Phe Asp Gly Thr Ser Giu Giu Aia 495 Asp Gin 510 Lys Gly Leu Lys Leu Giu Giu Gin 575 Gin Ala 590 Thr Glu PCT/CA96/00322 Ar g Th-r .Ie Val1 320 Asn Ile Ser Arg Al a 400 Pro Ile Ile Gin Giu 480 Asp Al a Phe Al a Al a 560 Al a Asn Lys WO 96/40928 INFORMATION FOR SEQ ID NO:21: SEQUENCE CHARACTERISTICS: LENGTH: 2438 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Streptococcus agalactiae (ix) FEATURE: NAME/KEY: CDS LOCATION: 248..2077 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: CTTTCAAAAG GGATATAAAT TGCACGAGCG 
1'CTGCTAAGA CCAGC TAACTAAGGT AAATGAGTTT TCGTTTTTGT CCGTAATGAC AGTAA AGAAGCTATT CAGCTTGCTG ATTAAACTAT AGTGATTGCT TAGAA TTCGAGTGCT TACTAAGATA AATTGAAATA AAAAGTAATA AAGTA TATTAAC ATG TCT AAA ATT ATT GGT ATT GAC TTA GGT A PCT/CA96/00322 :GATGG TAGTTGTCTA ACTAG ATAGCAAGTT TTGGA AGTAAAATAA ,TTATA AAATAAGAGG CA ACA AAC TCA Met Ser Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser
GCA
Ala 15
GAA
Glu
ATT
Ile
ACT
Thr
GCA
Ala
CAA
Gln 95
AAA
Lys
GCA
Ala
GTT
Val
CGT
Arg
GGT
Gly
TCT
Ser
AAA
Lys
AAA
Lys
ATT
Ile
GAC
Asp 130 CTT GAA Leu Glu 20 ACA ACT Thr Thr GAT GCT Asp Ala ATC AAA Ile Lys GAA TAT Glu Tyr GGT TAT Gly Tyr 100 ACT GTT Thr Val 115 GCT GGT Ala Gly GGG ACT Gly Thr CCT TCA Pro Ser GCA AAA Ala Lys TCA AAG Ser Lys ACT CCT Thr Pro GCT GAA Ala Glu CCA GCT Pro Ala AAA ATT Lys Ile
GAA
Glu
GTA
Val
CGT
Arg 55
ATG
Met
CAA
Gln
GAC
Asp
TAC
Tyr
GCA
Ala 135 TCA AAA Ser Lys 25 GTA TCA Val Ser 40 CAA GCG Gin Ala GGA ACT Gly Thr GAA ATT Glu Ile TAT CTT Tyr Leu 105 ITC AAC Phe Asn 120 GGT CTT Gly Leu ATT GCT Ile Ala AAA AAT Lys Asn ACA AAT Thr Asn GAA AAA Glu Lys GCA ATG Ala Met GAA AAA Glu Lys GCA CAA Ala Gin GTA GAA Val Glu 140 AAC CCA Asn Pro GGT GAA Gly Glu CCA GAT Pro Asp GTT TCT Val Ser ATT CTT Ile Leu GTA GAA Val Glu 110 CGT CAG Arg 125 CGT ATC Arg Ile 120 160 240 289 337 385 433 481 529 577 625 673 WO 96/40928 GTT AAC GAA Val Asn Glu PCT/CA96/00322 CCA ACA GCA GCC GCA CTT GCT TAT GGT ATG GAC AAG ACT 721 Pro Thr Ala Ala Ala Leu Ala Tyr Gly Met Asp Lys Thr 145 150 155
GAC
Asp
GAC
Asp 175
ACA
Thr
ATT
Ile
TCT
Ser
GCT
Ala TTC Phe 255
TCA
Ser
AAA
Lys
GAA
Glu
GTT
Val
GTT
Val 335
GTT
Val
TTG
Leu
GAC
Asp
GCA
Ala
AAG
Lys 160
GTA
Val
GCA
Ala
GAT
Asp
CAA
Gln
AAA
Lys 240
ATC
Ile
CGT
Arg
ACT
Thr
ATT
Ile
GTT
Val 320
AAC
Asn
ATC
Ile
TCA
Ser
CGC
Arg
GCA
Ala 400 GAT GAA AAA Asp Glu Lys TCA ATC CTT Ser Ile Leu GGT GAT AAC Gly Asp Asn 195 TTC TTG GTA Phe Leu Val 210 GAC AAA ATG Asp Lys Met 225 AAA GAC CTT Lys Asp Leu ACT GCT GGT Thr Ala Gly GCT AAA TTT Ala Lys Phe 275 CCA GTT CGT Pro Val Arg 290 GAT GAA GTT Asp Glu Val 305 GAA GCT GTA Glu Ala Val CCT GAT GAA Pro Asp Glu ACT GGG GAT Thr Gly Asp 355 CTT GGT ATT Leu Gly Ile 370 AAC ACA ACT Asn Thr Thr 385 GAC AAC CAA Asp Asn Gin
ATC
Ile
GAA
Glu 180
AAA
Lys
GAA
Glu
GCT
Ala
TCA
Ser
TCT
Ser 260
GAC
Asp
CA
Gln
ATC
Ile
AAA
Lys
GTG
Val 340
GTG
Val
GA
Glu
ATC
Ile
CCA
Pro TTA GTT Leu Val 165 TTA GGT Leu Gly CTT GGT Leu Gly GAA TTC Glu Phe CTT CAA Leu Gin 230 GGT GTA Gly Val 245 GCT GGT Ala Gly GAT CTC Asp Leu GCT CTT Ala Leu CTC GTT Leu Val 310 GCT GAA Ala Glu 325 GTT GCC Val Ala AAA GAC Lys Asp ACA ATG Thr Met CCA ACA Pro Thr 390 GCC GTT Ala Val 405 TTT GAC Phe Asp GAT GGT Asp Gly GGT GAC Gly Asp 200 AAG AAA Lys Lys 215 CGC TTG Arg Leu ACT CAA Thr Gin CCT CTT Pro Leu ACT CGT Thr Arg 280 TCA GAT Ser Asp 295 GGT GGA Gly Gly ACT GGT Thr Gly ATG GGT Met Gly GTT GTA Val Val 360 GGT GGT G1y Gly 375 TCT AAA Ser Lys GAT ATC Asp Ile
CTT
Leu
GTC
Val 185
GAC
Asp
GAA
Glu
AAA
Lys
ACT
Thr
CAC
His 265
GAC
Asp
GCA
Ala
TCA
Ser
AAA
Lys
GCT
Ala 345
CTT
Leu
GTC
Val
TCA
Ser
CAT
His GGT GGT GGT Gly Gly Gly 170 TTC GAC GTT Phe Asp Val 'ITT GAC CAG Phe Asp Gin AAT GGT ATT Asn Gly Ile 220 GAT GCT GCT Asp Ala Ala 235 CAA ATT TCA Gin Ile Ser 250 TTG GAG ATG Leu Giu Met CTT GTT GAA Leu Val Glu GGC TTG TCA Gly Leu Ser 300 ACA CGT ATC Thr Arg Ile 315 GAA CCA AAT Glu Pro Asn 330 GCT ATC CAA Ala Ile Gin CTT GAC GTA Leu Asp Val TTC ACT AAA Phe Thr Lys 380 CAA GTC TTC Gin Val Phe 395 GTT CTT CAA Val Leu Gin 410
ACA
Thr
CTT
Leu
AAA
Lys 205
GAT
Asp
GAA
Glu
ITA
Leu
AGC
Ser
CGT
Arg 285
TTG
Leu
CCA
Pro AAA Lys
GGT
Gly
ACA
Thr 365 Leu
TCA
Ser
GGT
Gly 769 817 865 913 961 1109 1057 1105 1153 1201 1249 1297 1345 1393 1441 1489 s WO 96/40928 CGC CCA ATG GCA GCA Arg Pro Met Ala Ala 415 GAT ATC CCA GCT GCA Asp Ile Pro Ala Ala 435 GAT ATC GAT AAA AAT Asp Ile Asp Lys Asn 450 ACT CAA AAA GAA CAA Thr Gin Lys Giu Gin 465 GAT GAA GAA ATT GAT Asp Giu Glu Ile Asp 480 GCA GAT GCA AA CGT Ala Asp Ala Lys Arg 495 CAA GCC ATC TTT GCA Gln Ala Ile Phe Ala 515 GGT TTT GAT ACA GAA Gly Phe Asp Thr Glu 530 AAA AAA GCT CAA GAA Lys Lys Ala Gin Glu 545 GAA GCT CTT AAC GAA Glu Aia Leu Asn Glu 560 CAA GCG GCT GCA GCA Gin Ala Ala Ala Ala 575 TCA GCT GAT TCA TCA PCT/CA96/00322 :T 1537 GAT AAC AAA ACA Asp Asn Lys Thr 420 CCT CGT GGA ATC Pro Arg Gly Ile GGT ATT Gly Ile CAC ATT His Ile AAA ATG Lys Met 485 AAA GAA Lys Glu 500 ACA GAA Thr Glu CGC GAT Arg Asp TCA GGT Ser Gly AAA GCA Lys Ala 565 CAA CAA Glm Gin 580 AGC AAG
GTA
Val
GTT
Val 470
ATG
Met
GAA
Glu
AAA
Lys
GCA
Ala
AAC
Asn 550
CAA
Gln
GCA
Ala
GGT
TCT
Ser 455
ATC
Ile
AAA
Lys
GTT
Val
ACT
Thr
GCG
Ala 535
CTT
Leu
GCT
Ala
GCT
Ala
GAT
CTC GGT Leu Gly 425 CCA CAA Pro Gin 440 GTT AAA Val Lys CAA TCT Gin Ser GAT GCT Asp Ala GAT CTT Asp Leu 505 ATT AAA Ile Lys 520 CAA TCA Gin Ser GAC GAC Asp Asp CTT GCA Leu Ala CAA GGG Gin Gly 585 GAT GTT CGC TTC Arg Phe ATT GAAk Ile Glu GCT AAA Ala Lys AAT TCA Asn Ser 475 GAA GCA Glu Ala 490 AAA AAT Lys Asn GAA ACT Glu Thr GCA CTT Ala Leu ATG AAA Met Lys 555 GTT AAA Val Lys 570 GCT GAA Ala Glu GTA GAT
CAA
Gln
GTA
Val
GAT
Asp 460
GGA
Gly
AAT
Asn
GAA
Glu
GAA
Glu
GAT
Asp 540
GCT
Ala
CTT
Leu
GGT
Gly
GGC
TTG
Leu
ACA
Thr 445
CTC
Leu
TTA
Leu
GCT
Ala
GTT
Val
GGC
Gly 525
GAG
Glu
AAA
Lys
TAC
Tyr
GCA
Ala
GAA
1585 1633 1681 1729 1777 1825 1873 1921 1969 2017 2065 2114 2174 2234 2294 2354 2414 2438 Ser Ala Asp Ser Ser Ser Lys Gly Asp Asp Val Val Asp Gly Giu Phe 595 600 605 ACT GAG AAA TAATTATAA TATTGITCAG ATTCATTTGA ATATAAGCAT Thr Giu Lys 610 GAAAACTATA CTAGCATAGT AAAGTICTTC GTGATAGGGA GTTTCAGATT ACATAAGCTA ATTTCGCTAT CACTAAATAA GGCGGGGCGC CTCGCTCCGT CTGTTTTATI AAGTGTCATA CTGTAACTGG GCAAGAATAA TITGTTAATCT CTTCAAGTGT AGATTAGAT AATGAACAAT ACAGAATTTT ATGATCGTCT CTCAGGACGA AATAAAAAAA GCTT INFORMATION FOR SEQ ID NO:22: SEQUENCE CHARACTERISTICS: LENGTH: 609 amino acids TYPE: amino acid WGCTCAATA ATCTAGATAA AAACATATTA ATAATAAATA TATATGTTAA CTATTTAGAG AGTATATGAA CAAAATATAA TGGCGTTTCA AAAGATGCTT WO 96/40928 PCT/CA96/00322 TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: Met Ser Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Ala Val 1 5 10 Ala Val Leu Glu Gly Thr Glu Ser Lys Ile Ile Ala Asn Pro Glu Gly ?5 Asn Arg Thr Thr Pro Ser Val Val Ser Phe Lys Asn Gly Glu Ile Ile 40 Val Gly Asp Ala Ala Lys Arg Gin Ala Val Thr Asn Pro Asp Thr Val 55 Ile Ser Ile Lys Ser Lys Met Gly Thr Ser Glu Lys Val Ser Ala Asn 65 70 75 Gly Lys Glu Tyr Thr Pro Gin Glu Ile Ser Ala Met Ile Leu Gin Tyr 90 Leu Lys Gly Tyr Ala Glu Asp Tyr Leu Gly Glu Lys Val Glu Lys Ala 100 105 110 Val Ile Thr Val Pro Ala Tyr Phe Asn Asp Ala Gin Arg Gin Ala Thr 115 120 125 Lys Asp Ala Gly Lys Ile Ala Gly Leu Glu Val Glu Arg Ile Val Asn 130 135 140 Glu Pro Thr Ala Ala Ala Leu Ala Tyr Gly Met Asp Lys Thr Asp Lys 145 150 155 160 Asp Glu Lys Ile Leu Val Phe Asp Leu G,1y Gly Gly Thr Phe Asp Val 165 170 175 Ser Ile Leu Glu Leu Gly As- Gly Val Phe Asp Val Leu Ala Thr Ala 180 185 190 Gly Asp Asn Lys Leu Gly Gly Asp Asp Phe Asp Gin Lys Ile Ile Asp 195 200 205 Phe Leu Val Glu Glu Phe Lys Lys Glu Asn Gly Ile Asp Leu Ser Gin 210 215 220 Asp Lys Met Ala Leu Gin Arg Leu Lys Asp Ala Ala Glu Lys Ala Lys 225 230 235 24L Lys Asp Leu Ser Gly Val Thr Gin Thr Gin Ile Ser Leu Pro Phe Ile 245 250 255 Thr Ala Gly Ser 
Ala Gly Pro Leu His Leu Glu Met Ser Leu Ser Arg 260 265 270 Ala Lys Phe Asp Asp Leu Thr Arg Asp Leu Val Glu Arg Thr Lys Thr 275 280 285 Pro Val Arg Gln Ala Leu Ser Asp Ala Gly Leu Ser Leu Ser Glu Ile 290 295 300 Asp Glu Val Ile Leu Val Gly Gly Ser Thr Arg Ile Pro Ala Val Val 305 310 315 320 Glu Ala Val Lys Ala Glu Thr Gly Lys Glu Pro Asn Lys Ser Val Asn 325 330 335 Pro Asp Glu Val Val Ala Met Gly Ala Ala Ile Gln Gly Gly Val Ile 340 345 350 WO 96/40928 Thr Gly Asp 355 PCT/CA96/00322 Val Lys Asp Val Val Leu Leu Asp Val Thr Pro Leu Ser 360 Leu Asn 365 Asp Met Pro Asp Lys 465 Glu Ala Ile Asp Ala 545 Leu Ala Asp Lys Ile Thr Gln Ala Ala 435 Asn Gln A ~p Arg Ala 515 Glu Glu Glu Ala Ser 595 Met Thr 390 Val Lys Gly Val Val 470 Met Gi; Lys Ala Asn 550 Gln Ala Gi" Gly Lys Ile Leu Pro 440 Val Gln Asp Asp 11 e 520 Gln Asp~ Leu Gln Asp 600 Phe Gln Val 410 Arg Ile Ala Asn Glu 490 Lys Glu Ala Met V~q 3 570 Ala Val 365 Leu Ile Asp Arg Ser Thr Ala Ala 400 Gly Glu Arg Pro 415 Leu Thr Asp Ile 430 Thr Phe Asp Ile 445 Leu Gly Thr Gln Leu Thr Asp Glu 480 Ala Glu Ala Asp 495 Val Asp Gln Ala 510 Gly Lys Gly Phe 525 Glu Leu Lys Lys Lys Leu Glu Ala 560 Tyr Glu Gln Ala 575 Gln Ser Ala 590 Glu Phe Thr Glu 605

(2) INFORMATION FOR SEQ ID NO:23:
SEQUENCE CHARACTERISTICS:
LENGTH: 19 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
Arg Ile Pro Ala Val Val Glu Ala Val Lys Ala Glu Thr Gly Lys Glu
1 5 10
Pro Asn Lys

INFORMATION FOR SEQ ID NO:24:
SEQUENCE CHARACTERISTICS:
LENGTH: 15 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
Gln Thr Ile Val Ile Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu
1 5 10

INFORMATION FOR SEQ ID
SEQUENCE CHARACTERISTICS:
LENGTH: 460 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Streptococcus pneumoniae
(ix) FEATURE:
NAME/KEY: CDS
LOCATION: 1..456
OTHER INFORMATION: /product= "C-terminal 151-residue fragment (C-151) of HSP72"
(xi) SEQUENCE DESCRIPTION: SEQ ID
ATG AAG GCC AAA GAC CTT GGA ACT CAA AAA GAA CAA ACT ATT GTC ATC 48
Met Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile
1 5 10
CAA TCG AAC TCA GGT TTG ACT GAC GAA GAA ATC GAC CGC ATG ATG AAA 96
Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu Ile Asp Arg Met Met Lys
20 25
GAT GCA GAA GCA AAC GCT GAA TCC GAT AAG AAA CGT AAA GAA GAA GTA 144
Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu Val
40
GAC CTT CGT AAT GAA GTG GAC CAA GCA ATC TTT GCG ACT GAA AAG ACA 192
Asp Leu Arg Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr
55
ATC AAG GAA ACT GAA GGT AAA GGC TTC GAC GCA GAA CGT GAC GCT GCC 240
Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg Asp Ala Ala
70 75
CAA GCT GCC CTT GAT GAC CTT AAG AAA GCT CAA GAA GAC AAC AAC TTG 288
Gln Ala Ala Leu Asp Asp Leu Lys Lys Ala Gln Glu Asp Asn Asn Leu
90
GAC GAC ATG AAA GCA AAA CTT GAA GCA TTG AAC GAA AAA GCT CAA GGA 336
Asp Asp Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Gly
100 105 110
CTT GCT GTT AAA CTC TAC GAA CAA GCC GCA GCA GCG CAA CAA GCT CAA 384
Leu Ala Val Lys Leu Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Gln
115 120 125
GAA GGA GCA GAA GGC GCA CAA GCA ACA GGA AAC GCA GGC GAT GAC GTC 432
Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp Val
130 135 140
GTA GAC GGA GAG TTT ACG GAA AAG TAAG 460
Val Asp Gly Glu Phe Thr Glu Lys
145 150

INFORMATION FOR SEQ ID NO:26:
SEQUENCE CHARACTERISTICS:
LENGTH: 152 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
Met Lys Ala Lys Asp Leu Gly Thr Gln Lys Glu Gln Thr Ile Val Ile
1 5 10
Gln Ser Asn Ser Gly Leu Thr Asp Glu Glu Ile Asp Arg Met Met Lys
25
Asp Ala Glu Ala Asn Ala Glu Ser Asp Lys Lys Arg Lys Glu Glu Val
40
Asp Leu Arg Asn Glu Val Asp Gln Ala Ile Phe Ala Thr Glu Lys Thr
50 55
Ile Lys Glu Thr Glu Gly Lys Gly Phe Asp Ala Glu Arg Asp Ala Ala
70 75
Gln Ala Ala Leu Asp Asp Leu Lys Lys Ala Gln Glu Asp Asn Asn Leu
90
Asp Asp Met Lys Ala Lys Leu Glu Ala Leu Asn Glu Lys Ala Gln Gly
100 105 110
Leu Ala Val Lys Leu Tyr Glu Gln Ala Ala Ala Ala Gln Gln Ala Gln
115 120 125
Glu Gly Ala Glu Gly Ala Gln Ala Thr Gly Asn Ala Gly Asp Asp Val
130 135 140
Val Asp Gly Glu Phe Thr Glu Lys
145 150
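The 460 base pair listing above pairs each codon row with its translation, so the rows can be cross-checked mechanically. The following is a minimal Python sketch of such a check, assuming the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER`, and `translate` are illustrative and do not come from the patent. It translates the first and last codon rows of that listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the listing uses the standard code, as its printed translation suggests).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter codes, matching the notation of the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        residues.append(THREE_LETTER[aa])
    return residues


# First and last codon rows as printed in the listing.
first_row = "ATG AAG GCC AAA GAC CTT GGA ACT CAA AAA GAA CAA ACT ATT GTC ATC"
last_row = "GTA GAC GGA GAG TTT ACG GAA AAG TAAG"

print(" ".join(translate(first_row)))
print(" ".join(translate(last_row)))
```

Both rows should reproduce the three-letter translation printed beneath them in the listing; the trailing `TAA` in the last row is read as a stop codon, which is consistent with the CDS location of 1..456.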
Claims (18)
- 2. The polypeptide of claim 1, wherein the fragments of paragraph are selected from the group consisting of amino acids 439-607 of SEQ ID NO:5 (C-169), amino acids 457-607 of SEQ ID NO:5 (C-151), amino acids 527-541 of SEQ ID NO:5, and amino acids 586-600 of SEQ ID
- 3. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:5, or analogues or derivatives thereof.
- 4. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:20, or analogues or derivatives thereof.
- 5. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:22, or analogues or derivatives thereof.
- 6. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:26, or analogues or derivatives thereof.
- 7. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:7, or analogues or derivatives thereof.
- 8. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:8, or analogues or derivatives thereof.
- 9. A polypeptide according to claim having the amino acid sequence of SEQ ID NO:9, or analogues or derivatives thereof.
- 10. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:10, or analogues or derivatives thereof.
- 11. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:11, or analogues or derivatives thereof.
- 12. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:12, or analogues or derivatives thereof.
- 13. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:13, or analogues or derivatives thereof.
- 14. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:14, or analogues or derivatives thereof.
- 15. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:15, or analogues or derivatives thereof.
- 16. A polypeptide according to claim having the amino acid sequence of SEQ ID NO:16, or analogues or derivatives thereof.
- 17. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:17, or analogues or derivatives thereof.
- 18. A polypeptide according to claim having the amino acid sequence of SEQ ID NO:18, or analogues or derivatives thereof.
- 19. A polypeptide according to claim 1 having the amino acid sequence of SEQ ID NO:23, or analogues or derivatives thereof.
- 20. A polypeptide according to claim having the amino acid sequence of SEQ ID NO:24, or analogues or derivatives thereof.
- 21. The polypeptide of any one of claims 1 to wherein said polypeptide elicits an immune reaction that is specific to Streptococcal strains.
- 22. A polypeptide according to claim 1 selected from the group consisting of: the HSP72 polypeptide having the amino acid sequence of SEQ ID NO:5; and fragments of the foregoing polypeptide, either alone or in combination with other polypeptides to form a fusion protein.
- 23. The polypeptide of claim 22, wherein the fragments of paragraph are selected from the group consisting of amino acids 439-607 of SEQ ID NO:5 (C-169), amino acids 527-541 of SEQ ID NO:5, and amino acids 586-600 of SEQ ID NO:
- 24. The polypeptide of claim 22, wherein the fusion protein of paragraph is the Fucose Isomerase-HSP72 (C-169) protein having the amino acid sequence of SEQ ID NO:].
- 25. A DNA sequence selected from the group consisting of: the HSP72 DNA sequence of SEQ ID NO:4; the HSP70 (DnaK) DNA sequence of SEQ ID NO:19; the HSP70 (DnaK) DNA sequence of SEQ ID NO:21; DNA sequences that are degenerate to any of the foregoing DNA sequences; and fragments of any of the foregoing DNA sequences, either alone or in combination with other DNA sequences to form a fusion DNA sequence.
- 26. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:4 from nucleotide 682 to nucleotide 2502.
- 27. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:4 from nucleotide 1996 to nucleotide 2502.
- 28. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:4 from nucleotide 2050 to nucleotide 2502.
- 29. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:4 from nucleotide 2260 to nucleotide 2304.
- 30. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:4 from nucleotide 2437 to nucleotide 2481.
- 31. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:19 from nucleotide 204 to nucleotide 2027.
- 32. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:21 from nucleotide 248 to nucleotide 2074.
- 33. A DNA sequence according to claim 25 comprising the formula of SEQ ID NO:25 from nucleotide 4 to nucleotide 456.
- 34. A DNA sequence coding for a polypeptide according to any one of claims 1-20.
- 35. A DNA sequence according to claim 25 selected from the group consisting of: the HSP72 DNA sequence of SEQ ID NO:4; DNA sequences that are degenerate to the foregoing DNA sequence; and fragments of any of the foregoing DNA sequences, either alone or in combination with other DNA sequences to form a fusion DNA sequence.
- 36. The DNA sequence of claim 35, wherein the fragments of paragraph are selected from the group consisting of nucleotide 1996-2502 (amino acids 439-607) of SEQ ID NO:4 (C-169); nucleotide 2260-2304 (amino acids 527-541) of SEQ ID NO:4; and nucleotide 2437-2481 (amino acids 586-600) of SEQ ID NO:4.
- 37. The DNA sequence of claim 35, wherein the fusion DNA sequence of paragraph is the Fucose Isomerase-HSP72 (C-169) DNA sequence of SEQ ID NO:1 (nucleotides 771-2912).
- 38. An expression vector including at least one DNA sequence according to claim 35 operably linked to a promoter.
- 39. A recombinant DNA molecule comprising a DNA sequence according to any one of claims 25 to 34, and one or more expression control sequences operably linked to the DNA sequence.
- 40. The recombinant DNA molecule of claim 39, wherein said expression control sequence is an inducible expression vector.
- 41. The recombinant molecule of claim 40, wherein said expression vector comprises the λ PL promoter.
- 42. A recombinant molecule according to claim 39 comprising a plasmid selected from the group consisting of: pURV3, pURV4, pURV5, pURV6, PJBD-191, pJ2DA4, PJBD--k~l, FCBD1.71, pJBD177, pJBD179, PJBDA1, PJBDf51, and pj~f2
- 43. A unicellular host transformed with an expression vector of claim 38.
- 44. A unicellular host transformed with a recombinant DNA molecule of claim 39.
- 45. A unicellular host according to claim 44, wherein said host is selected from the group consisting of: E. coli strains XL1 Blue MRF', W3110, JM109, Y1090 and BL21(DE3).
- 46. A method for producing a polypeptide or fragment thereof comprising the steps of culturing the unicellular host of any one of claims 43-45 and isolating said polypeptide or fragment.
- 47. A polypeptide in substantially pure form as obtained by the method of claim 46.
- 48. An antibody or fragment thereof that specifically binds to a polypeptide of any one of claims 1-20.
- 49. An antibody or fragment thereof that specifically binds to the epitope recognized by monoclonal antibody F1-Pn3.1.
- 50. The antibody or fragment of claim 48, which is a monoclonal antibody.
- 51. The monoclonal antibody or fragment of claim which is of murine origin.
- 52. The monoclonal antibody or fragment of claim 51, which is of IgG type.
- 53. The monoclonal antibody F1-Pn3.1.
- 54. A method for isolating the antibody of claim 48 comprising: introducing a preparation of the polypeptide of any one of claims 1-20 into a mammal; and isolating serum from the mammal containing said antibody.
- 55. A method for isolating the monoclonal antibody of claim 50 comprising: introducing a preparation of the polypeptide of any one of claims 1-20 to antibody producing cells of a mammal; fusing the antibody producing cells with myeloma cells to form hybridoma cells; and isolating said monoclonal antibody from the hybridoma cells.
- 56. A pharmaceutical composition comprising a polypeptide of any one of claims 1-20.
- 57. The pharmaceutical composition of claim 56, which is a vaccine.
- 58. The pharmaceutical composition of claim further comprising one or more pharmaceutically acceptable excipients.
- 59. A pharmaceutical composition comprising one or more antibodies or fragments thereof according to claim 4.
- 60. The pharmaceutical composition of claim 59, which is a vaccine.
- 61. The pharmaceutical composition of claim further comprising a pharmaceutically acceptable excipient.
- 62. The pharmaceutical composition of claim 60 or 61, wherein the antibody is F1-Pn3.1.
- 63. A method for preventing infection of a patient by Streptococcus pneumoniae or related bacteria comprising the administration of a pharmaceutically effective amount of the vaccine of claim 57, 60 or 61.
- 64. A method for preventing infection of a patient by Streptococcus pneumoniae, Streptococcus pyogenes or Streptococcus agalactiae comprising the administration of a pharmaceutically effective amount of the vaccine of claim 57, or 61.
- 65. A method for treating a patient infected with or suspected of being infected with Streptococcus pneumoniae or related bacteria comprising the administration of a pharmaceutically effective amount of the vaccine of claim 60 or 61.
- 66. A method for the detection of Streptococcus pneumoniae or related bacteria in a biological sample comprising: incubating the antibody or fragment of claim 48 with the biological sample to form a mixture; and detecting specifically bound antibody or fragment in the mixture which indicates the presence of Streptococcus pneumoniae or related bacteria.
- 67. The method of claim 66, wherein the antibody is F1-Pn3.1.
- 68. A method for the detection of antibodies specific to Streptococcus pneumoniae or related bacteria in a biological sample comprising: incubating a polypeptide of claim 2 or with the biological sample to form a mixture; and detecting specifically bound polypeptide in the mixture, which indicates the presence of antibodies specific to Streptococcus pneumoniae or related bacteria.
- 69. A method for the detection of Streptococcus pneumoniae or related bacteria in a biological sample comprising: incubating a DNA probe having the DNA sequence of claim 35 with the biological sample to form a mixture; and detecting specifically bound DNA probe in the mixture which indicates the presence of Streptococcus pneumoniae and related bacteria.
- 70. The method of claim 69, wherein the DNA probe is an oligomer having a sequence complementary to at least about 6 contiguous nucleotides of a DNA sequence of claim
- 71. The method of claim 70, which further comprises: providing a set of oligomers which are primers for a polymerase chain reaction method and which flank the target region; and amplifying the target region via the polymerase chain reaction method.
- 72. A method for the detection of Streptococcus pneumoniae, Streptococcus pyogenes or Streptococcus agalactiae in a biological sample comprising: incubating the antibody or fragment of claim 48 with the biological sample to form a mixture; and detecting specifically bound antibody or fragment in the mixture which indicates the presence of Streptococcus pneumoniae, Streptococcus pyogenes or Streptococcus agalactiae.
- 73. A method for the detection of antibodies specific to Streptococcus pneumoniae, Streptococcus pyogenes or Streptococcus agalactiae in a biological sample comprising: incubating a polypeptide of claim 1 or 21 with the biological sample to form a mixture; and detecting specifically bound polypeptide in the mixture, which indicates the presence of antibodies specific to Streptococcus pneumoniae, Streptococcus pyogenes or Streptococcus agalactiae.
- 74. A method for the detection of Streptococcus pneumoniae, Streptococcus pyogenes or Streptococcus agalactiae in a biological sample comprising: incubating a DNA probe having the DNA sequence of claim 25 or 34 with the biological sample to form a mixture; and detecting specifically bound DNA probe in the mixture which indicates the presence of Streptococcus pneumoniae, Streptococcus pyogenes or Streptococcus agalactiae.
- 75. The method of claim 74, wherein the DNA probe is an oligomer having a sequence complementary to at least about 6 contiguous nucleotides of a DNA sequence of claim 25 or 34.
- 76. The method of claim 75, which further comprises: providing a set of oligomers which are primers for a polymerase chain reaction method and which flank the target region; and amplifying the target region via the polymerase chain reaction method.

Dated this 30th day of June 1998.
BIOCHEM VACCINES INC
By their Patent Attorneys GRIFFITH HACK
Fellows Institute of Patent Attorneys of Australia
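The claims pair nucleotide ranges of SEQ ID NO:4 with amino acid ranges of the encoded polypeptide (for example nucleotides 1996 to 2502 with the C-169 fragment). A short Python sketch of that coordinate arithmetic follows. It assumes the reading frame begins at nucleotide 682, which is inferred here from the 682 to 2502 range recited in the claims rather than stated as a coding-sequence start in this section; the function name `nt_to_aa` and the constant `CDS_START` are illustrative.

```python
# Assumed start of the HSP72 reading frame in SEQ ID NO:4 (inferred from the
# 682-2502 range in the claims; not stated as a CDS start in this section).
CDS_START = 682


def nt_to_aa(nt_start, nt_end, cds_start=CDS_START):
    """Map an inclusive, codon-aligned nucleotide range to an amino acid range."""
    offset = nt_start - cds_start          # distance from the first codon
    length = nt_end - nt_start + 1         # number of nucleotides in the range
    if offset < 0 or offset % 3 != 0 or length % 3 != 0:
        raise ValueError("range is not codon-aligned with the reading frame")
    first = offset // 3 + 1                # residues are numbered from 1
    last = first + length // 3 - 1
    return first, last


# Nucleotide ranges recited in the claims for SEQ ID NO:4.
for start, end in [(682, 2502), (1996, 2502), (2050, 2502), (2260, 2304), (2437, 2481)]:
    first, last = nt_to_aa(start, end)
    print(f"nucleotides {start}-{end} -> amino acids {first}-{last}")
```

Under that assumption the ranges map to residues 1-607, 439-607, 457-607, 527-541 and 586-600, which agrees with the fragment boundaries recited for C-169, C-151 and the two 15-residue peptides.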
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/472534 | 1995-06-07 | ||
US08/472,534 US5919620A (en) | 1995-06-07 | 1995-06-07 | Heat shock protein HSP72 of Streptococcus pneumoniae |
US180595P | 1995-08-04 | 1995-08-04 | |
US60/001805 | 1995-08-04 | ||
PCT/CA1996/000322 WO1996040928A1 (en) | 1995-06-07 | 1996-05-17 | Streptococcal heat shock proteins members of the hsp70 family |
Publications (2)
Publication Number | Publication Date |
---|---|
AU5682896A AU5682896A (en) | 1996-12-30 |
AU700080B2 (en) | 1998-12-17 |
Family
ID=26669494
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU56828/96A Ceased AU700080B2 (en) | 1995-06-07 | 1996-05-17 | Streptococcal heat shock proteins members of the HSP70 family |
Country Status (18)
Country | Link |
---|---|
EP (1) | EP0832238A1 (en) |
JP (1) | JPH11507214A (en) |
KR (1) | KR19990022742A (en) |
CN (1) | CN1192241A (en) |
AP (1) | AP9701163A0 (en) |
AR (1) | AR003124A1 (en) |
AU (1) | AU700080B2 (en) |
BR (1) | BR9609399A (en) |
CA (1) | CA2224015A1 (en) |
CZ (1) | CZ394297A3 (en) |
EA (1) | EA199800046A1 (en) |
HU (1) | HUP0600442A3 (en) |
IL (1) | IL118329A0 (en) |
NO (1) | NO975752L (en) |
PL (1) | PL323781A1 (en) |
SK (1) | SK168497A3 (en) |
TR (1) | TR199701537T1 (en) |
WO (1) | WO1996040928A1 (en) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6245335B1 (en) | 1996-05-01 | 2001-06-12 | The Rockefeller University | Choline binding proteins for anti-pneumococcal vaccines |
JP2000511411A (en) * | 1996-05-01 | 2000-09-05 | ザ ロックフェラー ユニヴァーシティ | Choline binding protein for anti-pneumococcal vaccine |
WO1999035270A1 (en) * | 1997-12-31 | 1999-07-15 | Stressgen Biotechnologies Corporation | Streptococcal heat shock proteins of the hsp60 family |
US6497880B1 (en) | 1998-12-08 | 2002-12-24 | Stressgen Biotechnologies Corporation | Heat shock genes and proteins from Neisseria meningitidis, Candida glabrata and Aspergillus fumigatus |
TR200200633T2 (en) * | 1998-12-23 | 2002-06-21 | Shire Biochem Inc. | New streptococcus antigens |
US7128918B1 (en) | 1998-12-23 | 2006-10-31 | Id Biomedical Corporation | Streptococcus antigens |
HU228499B1 (en) | 1999-03-19 | 2013-03-28 | Smithkline Beecham Biolog | Streptococcus vaccine |
US7015309B1 (en) | 1999-06-23 | 2006-03-21 | The Wistar Institute Of Anatomy And Biology | Pyrrhocoricin-derived peptides, and methods of use thereof |
GB9918319D0 (en) | 1999-08-03 | 1999-10-06 | Smithkline Beecham Biolog | Vaccine composition |
GB9919734D0 (en) * | 1999-08-19 | 1999-10-20 | Colaco Camilo | Vaccines from infectious agents |
AU2005204321B2 (en) * | 1999-08-19 | 2008-07-10 | Immunobiology Limited | Vaccines from Infectious Agents |
AU2795801A (en) * | 2000-01-21 | 2001-07-31 | Creighton University | Biocidal molecules, macromolecular targets and methods of production and use |
US6866855B2 (en) | 2000-06-12 | 2005-03-15 | University Of Saskatchewan | Immunization of dairy cattle with GapC protein against Streptococcus infection |
US6833134B2 (en) | 2000-06-12 | 2004-12-21 | University Of Saskacthewan | Immunization of dairy cattle with GapC protein against Streptococcus infection |
EP1294771B1 (en) | 2000-06-12 | 2008-10-29 | University Of Saskatchewan | Chimeric GapC protein from Streptococcus and its use in vaccination and diagnosis |
GB0021757D0 (en) * | 2000-09-04 | 2000-10-18 | Colaco Camilo | Vaccine against microbial pathogens |
GB0022742D0 (en) | 2000-09-15 | 2000-11-01 | Smithkline Beecham Biolog | Vaccine |
WO2004015099A2 (en) | 2002-08-02 | 2004-02-19 | Glaxosmithkline Biologicals Sa | Vaccine composition comprising lipooligosaccharide with reduced phase variability |
DK2395073T3 (en) | 2002-11-01 | 2017-10-23 | Glaxosmithkline Biologicals Sa | Process for drying. |
WO2004078907A2 (en) * | 2003-03-04 | 2004-09-16 | Intercell Ag | Streptococcus pyogenes antigens |
EP2333114A1 (en) * | 2003-04-15 | 2011-06-15 | Intercell AG | S. pneumoniae antigens |
WO2005032584A2 (en) | 2003-10-02 | 2005-04-14 | Glaxosmithkline Biologicals S.A. | Pertussis antigens and use thereof in vaccination |
GB0505996D0 (en) | 2005-03-23 | 2005-04-27 | Glaxosmithkline Biolog Sa | Fermentation process |
TWI457133B (en) | 2005-12-13 | 2014-10-21 | Glaxosmithkline Biolog Sa | Novel composition |
ZA200805602B (en) | 2006-01-17 | 2009-12-30 | Arne Forsgren | A novel surface exposed haemophilus influenzae protein (protein E; pE) |
JP2013521770A (en) | 2010-03-10 | 2013-06-13 | グラクソスミスクライン バイオロジカルズ ソシエテ アノニム | Vaccine composition |
GB201015132D0 (en) | 2010-09-10 | 2010-10-27 | Univ Bristol | Vaccine composition |
CN103146734B (en) * | 2013-03-12 | 2014-10-01 | 中国人民解放军军事医学科学院军事兽医研究所 | Anti-burn and scald infection multiple organ failure Pseudomonas aeruginosa toxin vaccine |
CN104001164A (en) * | 2014-04-23 | 2014-08-27 | 杭州师范大学 | Aeromonas hydrophila heat shock protein subunit vaccine and preparation method thereof |
CN114805567B (en) * | 2022-06-27 | 2022-09-16 | 和元生物技术(上海)股份有限公司 | Monoclonal antibody, method and application of marker protein HSPA1A for recognizing exosomes |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1992014488A1 (en) * | 1991-02-15 | 1992-09-03 | Uab Research Foundation | Structural gene of pneumococcal protein |
IT1262896B (en) * | 1992-03-06 | 1996-07-22 | CONJUGATE COMPOUNDS FORMED FROM HEAT SHOCK PROTEIN (HSP) AND OLIGO-POLY-SACCHARIDES, THEIR USE FOR THE PRODUCTION OF VACCINES. |
-
1996
- 1996-05-17 TR TR97/01537T patent/TR199701537T1/en unknown
- 1996-05-17 JP JP9500026A patent/JPH11507214A/en active Pending
- 1996-05-17 CZ CZ973942A patent/CZ394297A3/en unknown
- 1996-05-17 HU HU0600442A patent/HUP0600442A3/en unknown
- 1996-05-17 PL PL96323781A patent/PL323781A1/en unknown
- 1996-05-17 WO PCT/CA1996/000322 patent/WO1996040928A1/en not_active Application Discontinuation
- 1996-05-17 AU AU56828/96A patent/AU700080B2/en not_active Ceased
- 1996-05-17 EP EP96914821A patent/EP0832238A1/en not_active Withdrawn
- 1996-05-17 EA EA199800046A patent/EA199800046A1/en unknown
- 1996-05-17 AP APAP/P/1997/001163A patent/AP9701163A0/en unknown
- 1996-05-17 CA CA002224015A patent/CA2224015A1/en not_active Abandoned
- 1996-05-17 SK SK1684-97A patent/SK168497A3/en unknown
- 1996-05-17 BR BR9609399-4A patent/BR9609399A/en not_active Application Discontinuation
- 1996-05-17 KR KR1019970709184A patent/KR19990022742A/en not_active Application Discontinuation
- 1996-05-17 CN CN96195891A patent/CN1192241A/en active Pending
- 1996-05-20 AR ARP960102631A patent/AR003124A1/en unknown
- 1996-05-20 IL IL11832996A patent/IL118329A0/en unknown
-
1997
- 1997-12-05 NO NO975752A patent/NO975752L/en unknown
Also Published As
Publication number | Publication date |
---|---|
JPH11507214A (en) | 1999-06-29 |
CA2224015A1 (en) | 1996-12-19 |
NO975752L (en) | 1998-02-06 |
AP9701163A0 (en) | 1998-01-31 |
NO975752D0 (en) | 1997-12-05 |
PL323781A1 (en) | 1998-04-27 |
WO1996040928A1 (en) | 1996-12-19 |
CZ394297A3 (en) | 1998-04-15 |
AU5682896A (en) | 1996-12-30 |
IL118329A0 (en) | 1996-09-12 |
BR9609399A (en) | 2001-08-28 |
SK168497A3 (en) | 1998-07-08 |
HUP0600442A2 (en) | 2006-08-28 |
CN1192241A (en) | 1998-09-02 |
EP0832238A1 (en) | 1998-04-01 |
KR19990022742A (en) | 1999-03-25 |
AR003124A1 (en) | 1998-07-08 |
HUP0600442A3 (en) | 2007-03-28 |
EA199800046A1 (en) | 1998-06-25 |
TR199701537T1 (en) | 1998-03-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU700080B2 (en) | Streptococcal heat shock proteins members of the HSP70 family | |
US6344552B1 (en) | Compositions and methods comprising DNA sequences encoding B. burgdorferi polypeptides | |
AU689075B2 (en) | Membrane-associated immunogens of mycobacteria | |
AU648251B2 (en) | Vaccines for nontypable haemophilus influenzae | |
US7501132B2 (en) | Multiple antigenic peptides immunogenic against Streptococcus pneumonia | |
JP2002533123A (en) | Novel streptococcal antigen | |
WO2003054007A2 (en) | Streptococcus antigens | |
JPH08502417A (en) | Haemophilus outer membrane protein | |
AU2001270381A1 (en) | Streptococcus antigens | |
WO2001098334A2 (en) | Streptococcus antigens | |
NZ303118A (en) | Proteinase K resistant surface protein of Neisseria meningitidis and its use in treating Neisseria infections with a Neisseria-derived monoclonal antibody. | |
KR20010034518A (en) | Group B streptococcus antigens | |
CA2116261A1 (en) | Epitopic regions of pneumococcal surface protein a | |
US5807685A (en) | OspE, OspF, and S1 polypeptides in Borrelia burgdorferi | |
US7074415B2 (en) | Streptococcus antigens | |
AU758764B2 (en) | Epitope peptides immunogenic against (streptococcus pneumoniae) | |
Huygen et al. | Influence of genes from the major histocompatibility complex on the antibody repertoire against culture filtrate antigens in mice infected with live Mycobacterium bovis BCG | |
US5919620A (en) | Heat shock protein HSP72 of Streptococcus pneumoniae | |
CA2416224C (en) | Multiple antigenic peptides immunogenic against streptococcus pneumoniae | |
AU2001271935A1 (en) | Multiple antigenic peptides immunogenic against streptococcus pneumoniae | |
MXPA97009557A (en) | Members of streptococal thermal shock proteins of the hs family | |
KR100216390B1 (en) | Haemophilus outer membrane protein | |
HUYGEN | Mice Infected with Live Mycobacterium bovis BCG | |
AU2007207883A1 (en) | Streptococcus antigens |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |