KR101170203B1 - 신규한 스트렙토코커스 항원 - Google Patents
신규한 스트렙토코커스 항원 Download PDFInfo
- Publication number
- KR101170203B1 KR101170203B1 KR1020087008264A KR20087008264A KR101170203B1 KR 101170203 B1 KR101170203 B1 KR 101170203B1 KR 1020087008264 A KR1020087008264 A KR 1020087008264A KR 20087008264 A KR20087008264 A KR 20087008264A KR 101170203 B1 KR101170203 B1 KR 101170203B1
- Authority
- KR
- South Korea
- Prior art keywords
- glu
- lys
- ser
- leu
- ala
- Prior art date
Links
- 241000194017 Streptococcus Species 0.000 title claims abstract description 41
- 108091007433 antigens Proteins 0.000 title abstract description 32
- 102000036639 antigens Human genes 0.000 title abstract description 32
- 239000000427 antigen Substances 0.000 title abstract description 28
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 82
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 82
- 239000002157 polynucleotide Substances 0.000 claims abstract description 82
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 224
- 229920001184 polypeptide Polymers 0.000 claims description 218
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 218
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 65
- 238000000034 method Methods 0.000 claims description 26
- 239000013598 vector Substances 0.000 claims description 25
- 239000002773 nucleotide Substances 0.000 claims description 23
- 125000003729 nucleotide group Chemical group 0.000 claims description 23
- 230000014509 gene expression Effects 0.000 claims description 15
- 230000028993 immune response Effects 0.000 claims description 9
- 238000012258 culturing Methods 0.000 claims description 6
- 230000000295 complement effect Effects 0.000 claims description 5
- 241000193998 Streptococcus pneumoniae Species 0.000 claims description 4
- 230000001580 bacterial effect Effects 0.000 claims description 4
- 230000001939 inductive effect Effects 0.000 claims description 4
- 229940031000 streptococcus pneumoniae Drugs 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 230000004044 response Effects 0.000 claims description 3
- 235000015097 nutrients Nutrition 0.000 claims description 2
- 108090000623 proteins and genes Proteins 0.000 abstract description 150
- 102000004169 proteins and genes Human genes 0.000 abstract description 93
- 208000015181 infectious disease Diseases 0.000 abstract description 20
- 230000000890 antigenic effect Effects 0.000 abstract description 10
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 abstract description 7
- 201000010099 disease Diseases 0.000 abstract description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 6
- 230000002265 prevention Effects 0.000 abstract description 5
- 241001465754 Metazoa Species 0.000 abstract description 2
- 238000010188 recombinant method Methods 0.000 abstract description 2
- 229940124856 vaccine component Drugs 0.000 abstract description 2
- 208000035143 Bacterial infection Diseases 0.000 abstract 1
- 108010008038 Synthetic Vaccines Proteins 0.000 abstract 1
- 238000003556 assay Methods 0.000 abstract 1
- 208000022362 bacterial infectious disease Diseases 0.000 abstract 1
- 239000012634 fragment Substances 0.000 description 147
- 241000282326 Felis catus Species 0.000 description 94
- 108020004414 DNA Proteins 0.000 description 76
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 65
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 58
- 108010065920 Insulin Lispro Proteins 0.000 description 54
- 241000699670 Mus sp. Species 0.000 description 40
- 108010073969 valyllysine Proteins 0.000 description 40
- 239000003155 DNA primer Substances 0.000 description 39
- 108010005233 alanylglutamic acid Proteins 0.000 description 37
- 108010017391 lysylvaline Proteins 0.000 description 34
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 32
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 32
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 28
- 108010047857 aspartylglycine Proteins 0.000 description 27
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 25
- 108010038633 aspartylglutamate Proteins 0.000 description 25
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 25
- 108010057821 leucylproline Proteins 0.000 description 25
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 24
- 108010093581 aspartyl-proline Proteins 0.000 description 24
- 108010037850 glycylvaline Proteins 0.000 description 24
- 108010092114 histidylphenylalanine Proteins 0.000 description 24
- 108010064235 lysylglycine Proteins 0.000 description 24
- 108010031719 prolyl-serine Proteins 0.000 description 24
- 108010034529 leucyl-lysine Proteins 0.000 description 23
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 22
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 21
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 21
- 150000001413 amino acids Chemical class 0.000 description 21
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 20
- 108010077245 asparaginyl-proline Proteins 0.000 description 20
- 239000013612 plasmid Substances 0.000 description 20
- 229960005486 vaccine Drugs 0.000 description 20
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 19
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 19
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 19
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 19
- 108010047495 alanylglycine Proteins 0.000 description 19
- 108020001507 fusion proteins Proteins 0.000 description 19
- 102000037865 fusion proteins Human genes 0.000 description 19
- 108010050848 glycylleucine Proteins 0.000 description 19
- 108010009298 lysylglutamic acid Proteins 0.000 description 19
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 18
- 108091028043 Nucleic acid sequence Proteins 0.000 description 18
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 18
- 108010087924 alanylproline Proteins 0.000 description 18
- 108010003700 lysyl aspartic acid Proteins 0.000 description 18
- 108010038320 lysylphenylalanine Proteins 0.000 description 18
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 18
- 108010079317 prolyl-tyrosine Proteins 0.000 description 18
- 108010070643 prolylglutamic acid Proteins 0.000 description 18
- 108010061238 threonyl-glycine Proteins 0.000 description 18
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 17
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 17
- 108700026244 Open Reading Frames Proteins 0.000 description 17
- 108010078144 glutaminyl-glycine Proteins 0.000 description 17
- 108010003137 tyrosyltyrosine Proteins 0.000 description 17
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 16
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 16
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 16
- 239000002671 adjuvant Substances 0.000 description 16
- 108010049041 glutamylalanine Proteins 0.000 description 16
- 108010018006 histidylserine Proteins 0.000 description 16
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 15
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 15
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 15
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 15
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 15
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 15
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 15
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 15
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 15
- 108010064997 VPY tripeptide Proteins 0.000 description 15
- 108010013835 arginine glutamate Proteins 0.000 description 15
- 238000010367 cloning Methods 0.000 description 15
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 15
- 108010054155 lysyllysine Proteins 0.000 description 15
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 14
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 14
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 14
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 14
- 241000588724 Escherichia coli Species 0.000 description 14
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 14
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 14
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 14
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 14
- 108010010147 glycylglutamine Proteins 0.000 description 14
- 108010015792 glycyllysine Proteins 0.000 description 14
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 14
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 13
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 13
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 13
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 13
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 13
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 13
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 13
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 13
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 13
- 108010044940 alanylglutamine Proteins 0.000 description 13
- 210000004027 cell Anatomy 0.000 description 13
- 108010089804 glycyl-threonine Proteins 0.000 description 13
- 108010036413 histidylglycine Proteins 0.000 description 13
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 12
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 12
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 12
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 12
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 12
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 12
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 12
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 12
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 12
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 12
- 108010070783 alanyltyrosine Proteins 0.000 description 12
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 11
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 11
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 11
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 11
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 11
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 11
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 11
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 11
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 11
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 11
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 11
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 11
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 11
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 11
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 11
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 11
- 108010092854 aspartyllysine Proteins 0.000 description 11
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 11
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 11
- 108091008146 restriction endonucleases Proteins 0.000 description 11
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 10
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 10
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 10
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 10
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 10
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 10
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 10
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 10
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 10
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 10
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 10
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 10
- 241001644525 Nastus productus Species 0.000 description 10
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 10
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 10
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 10
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 10
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 10
- 239000012472 biological sample Substances 0.000 description 10
- 108010054813 diprotin B Proteins 0.000 description 10
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 10
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 10
- 230000003053 immunization Effects 0.000 description 10
- 238000002649 immunization Methods 0.000 description 10
- 239000002953 phosphate buffered saline Substances 0.000 description 10
- 108010048818 seryl-histidine Proteins 0.000 description 10
- 238000011282 treatment Methods 0.000 description 10
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 9
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 9
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 9
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 9
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 9
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 9
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 9
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 9
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 9
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 9
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 9
- 108091026890 Coding region Proteins 0.000 description 9
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 9
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 9
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 9
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 9
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 9
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 9
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 9
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 9
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 9
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 9
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 9
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 9
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 9
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 9
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 9
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 9
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 9
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 9
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 9
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 9
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 9
- LQZZPNDMYNZPFT-KKUMJFAQSA-N Pro-Gln-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LQZZPNDMYNZPFT-KKUMJFAQSA-N 0.000 description 9
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 9
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 9
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 9
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 9
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 9
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 9
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 9
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 9
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 9
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 9
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 9
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 9
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 9
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 9
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 9
- 108010008355 arginyl-glutamine Proteins 0.000 description 9
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 9
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 9
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 9
- 230000004927 fusion Effects 0.000 description 9
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 9
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 9
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 9
- 108010085325 histidylproline Proteins 0.000 description 9
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 108010026333 seryl-proline Proteins 0.000 description 9
- 230000004083 survival effect Effects 0.000 description 9
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 8
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 8
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 8
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 8
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 8
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 8
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 8
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 8
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 8
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 8
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 8
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 8
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 8
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 8
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 8
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 8
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 8
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 8
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 8
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 8
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 8
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 8
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 8
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 8
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 8
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 8
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 8
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 8
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 8
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 8
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 8
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 8
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 8
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 8
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 8
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 8
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 8
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 8
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 8
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 8
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 8
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 8
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 8
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 8
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 8
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 8
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 8
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 8
- 230000027455 binding Effects 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 108010087823 glycyltyrosine Proteins 0.000 description 8
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 8
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 8
- 108010077112 prolyl-proline Proteins 0.000 description 8
- 230000001681 protective effect Effects 0.000 description 8
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 8
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 7
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 7
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 7
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 7
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 7
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 7
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 7
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 7
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 7
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 7
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 7
- 238000002965 ELISA Methods 0.000 description 7
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 7
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 7
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 7
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 7
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 7
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 7
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 7
- HYLIOBDWPQNLKI-HVTMNAMFSA-N Ile-His-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HYLIOBDWPQNLKI-HVTMNAMFSA-N 0.000 description 7
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 7
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 7
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 7
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 7
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 7
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 7
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 7
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 7
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 7
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 7
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 7
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 7
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 7
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 7
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 7
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 7
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 7
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 7
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 7
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 7
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 7
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 7
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 7
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 7
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 7
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 7
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 7
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 7
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 7
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 7
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 7
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 7
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 7
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 7
- 108010078274 isoleucylvaline Proteins 0.000 description 7
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 7
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 7
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 6
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 6
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 6
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 6
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 6
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 6
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 6
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 6
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 6
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 6
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 6
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 6
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 6
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 6
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 6
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 6
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 6
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 6
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 6
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 6
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 6
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 6
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 6
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 6
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 6
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 6
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 6
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 6
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 6
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 6
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 6
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 6
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 6
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 6
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 6
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 6
- UWNUQPZUSRFIIN-JUKXBJQTSA-N His-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N UWNUQPZUSRFIIN-JUKXBJQTSA-N 0.000 description 6
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 6
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 6
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 6
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 6
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 6
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 6
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 6
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 6
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 6
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 6
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 6
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 6
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 6
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 6
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 6
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 6
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 6
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 6
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 6
- ZZVUXQCQPXSUFH-JBACZVJFSA-N Phe-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ZZVUXQCQPXSUFH-JBACZVJFSA-N 0.000 description 6
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 6
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 6
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 6
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 6
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 6
- 108010025216 RVF peptide Proteins 0.000 description 6
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 6
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 6
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 6
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 6
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 6
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 6
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 6
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 6
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 6
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 6
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 6
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 6
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 6
- 239000011543 agarose gel Substances 0.000 description 6
- 230000034994 death Effects 0.000 description 6
- 231100000517 death Toxicity 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 108010040030 histidinoalanine Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010004914 prolylarginine Proteins 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 6
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 5
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 5
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 5
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 5
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 5
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 5
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 5
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 5
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 5
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 5
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 5
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 5
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 5
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 5
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 5
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 5
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 5
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 5
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 5
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 5
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 5
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 5
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 5
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 5
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 5
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 5
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 5
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 5
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 5
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 5
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 5
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 5
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 5
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 5
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 5
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 5
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 5
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 5
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 5
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 5
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 5
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 5
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 5
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 5
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 5
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 5
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 5
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 5
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 5
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 5
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 5
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 5
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 5
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 5
- JLCYOCDGIUZMKQ-JBACZVJFSA-N Glu-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N JLCYOCDGIUZMKQ-JBACZVJFSA-N 0.000 description 5
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 5
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 5
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 5
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 5
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 5
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 5
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 5
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 5
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 5
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 5
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 5
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 5
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 5
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 5
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 5
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 5
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 5
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 5
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 5
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 5
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 5
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 5
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 5
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 5
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 5
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 5
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 5
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 5
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 5
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 5
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 5
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 5
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 5
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 5
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 5
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 5
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 5
- 241000699666 Mus <mouse, genus> Species 0.000 description 5
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 5
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 5
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 5
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 5
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 5
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 5
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 5
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 5
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 5
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 5
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 5
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 5
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 5
- 108010076504 Protein Sorting Signals Proteins 0.000 description 5
- 101100228799 Schizosaccharomyces pombe (strain 972 / ATCC 24843) gem6 gene Proteins 0.000 description 5
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 5
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 5
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 5
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 5
- NBUKGEFVZJMSIS-XIRDDKMYSA-N Ser-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CO)N NBUKGEFVZJMSIS-XIRDDKMYSA-N 0.000 description 5
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 5
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 5
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 5
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 5
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 5
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 5
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 5
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 5
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 5
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 5
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 5
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 5
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 5
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 5
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 5
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 5
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 5
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 5
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 5
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 5
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 5
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 5
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 5
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 5
- 108010070944 alanylhistidine Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 150000004676 glycans Chemical class 0.000 description 5
- 230000002163 immunogen Effects 0.000 description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 5
- 101150066555 lacZ gene Proteins 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- 108010024607 phenylalanylalanine Proteins 0.000 description 5
- 229920001282 polysaccharide Polymers 0.000 description 5
- 239000005017 polysaccharide Substances 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 231100000331 toxic Toxicity 0.000 description 5
- 230000002588 toxic effect Effects 0.000 description 5
- 108010036211 5-HT-moduline Proteins 0.000 description 4
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 4
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 4
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 4
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 4
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 4
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 4
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 4
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 4
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 4
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 4
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 4
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 4
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 4
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 4
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 4
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 4
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 4
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 4
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 4
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 4
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 4
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 4
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 4
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 4
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 4
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 4
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 4
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 4
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 4
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 4
- STHNZYKCJHWULY-AVGNSLFASA-N Arg-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O STHNZYKCJHWULY-AVGNSLFASA-N 0.000 description 4
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 4
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 4
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 4
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 4
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 4
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 4
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 4
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 4
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 4
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 4
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 4
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 4
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 4
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 4
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 4
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 4
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 4
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 4
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 4
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 4
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 4
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 4
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 4
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 4
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 4
- WYOSXGYAKZQPGF-SRVKXCTJSA-N Asp-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N WYOSXGYAKZQPGF-SRVKXCTJSA-N 0.000 description 4
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 4
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 4
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 4
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 4
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 4
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 4
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 4
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 4
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 4
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 4
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 4
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 4
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 4
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 4
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 4
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 4
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 4
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 4
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 4
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 4
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 4
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 4
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 4
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 4
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 4
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 4
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 4
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 4
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 4
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 4
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 4
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 4
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 4
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 4
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 4
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 4
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 4
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 4
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 4
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 4
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 4
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 4
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 4
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 4
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 4
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 4
- WSEITRHJRVDTRX-QTKMDUPCSA-N His-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N)O WSEITRHJRVDTRX-QTKMDUPCSA-N 0.000 description 4
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 4
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 4
- CSRRMQFXMBPSIL-SIXJUCDHSA-N His-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N CSRRMQFXMBPSIL-SIXJUCDHSA-N 0.000 description 4
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 4
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 4
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 4
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 4
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 4
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 4
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 4
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 4
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 4
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 4
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 4
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 4
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 4
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 4
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 4
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 4
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 4
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 4
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 4
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 4
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 4
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 4
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 4
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 4
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 4
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 4
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 4
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 4
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 4
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 4
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 4
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 4
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 4
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 4
- SBFPAAPFKZPDCZ-JYJNAYRXSA-N Met-Pro-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SBFPAAPFKZPDCZ-JYJNAYRXSA-N 0.000 description 4
- KZKVVWBOGDKHKE-QTKMDUPCSA-N Met-Thr-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 KZKVVWBOGDKHKE-QTKMDUPCSA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 4
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 4
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 4
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 4
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 4
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 4
- 208000035109 Pneumococcal Infections Diseases 0.000 description 4
- 206010035664 Pneumonia Diseases 0.000 description 4
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 4
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 4
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 4
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 4
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 4
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 4
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 4
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 4
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 4
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 4
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 4
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 4
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 4
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 4
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 4
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 4
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 4
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 4
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 4
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 4
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 4
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 4
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 4
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 4
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 4
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 4
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 4
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 4
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 4
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 4
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 4
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 4
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 4
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 4
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 4
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 4
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 4
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 4
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 4
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 4
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 4
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 4
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 4
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 4
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 4
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 4
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 4
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 4
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 4
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 4
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 4
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 4
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 4
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 4
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 4
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 4
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 4
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 4
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 4
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 4
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 4
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 4
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 4
- YIKDYZDNRCNFQB-KKUMJFAQSA-N Tyr-His-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O YIKDYZDNRCNFQB-KKUMJFAQSA-N 0.000 description 4
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 4
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 4
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 4
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 4
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 4
- QHONGSVIVOFKAC-ULQDDVLXSA-N Tyr-Pro-His Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QHONGSVIVOFKAC-ULQDDVLXSA-N 0.000 description 4
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 4
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 4
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 4
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 4
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 4
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 4
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 4
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 4
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 4
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 4
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 4
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 4
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 4
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 4
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 4
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 4
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 4
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 4
- 239000008280 blood Substances 0.000 description 4
- 239000000969 carrier Substances 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 238000002405 diagnostic procedure Methods 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 210000004408 hybridoma Anatomy 0.000 description 4
- 230000036039 immunity Effects 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 239000013600 plasmid vector Substances 0.000 description 4
- 229960001973 pneumococcal vaccines Drugs 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 210000002966 serum Anatomy 0.000 description 4
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 4
- 238000010254 subcutaneous injection Methods 0.000 description 4
- 239000007929 subcutaneous injection Substances 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 4
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 4
- 101150000874 11 gene Proteins 0.000 description 3
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 3
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 3
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 3
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 3
- OAIGZYFGCNNVIE-ZPFDUUQYSA-N Ala-Val-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O OAIGZYFGCNNVIE-ZPFDUUQYSA-N 0.000 description 3
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 3
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 3
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 3
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 3
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 3
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 3
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 3
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 3
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 3
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 3
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 3
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 3
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 3
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 3
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 3
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 3
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 3
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 3
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 3
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 3
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 3
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 3
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 3
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 3
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 3
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 3
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 3
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 3
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 3
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 3
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 3
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 3
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 3
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 3
- 102000004457 Granulocyte-Macrophage Colony-Stimulating Factor Human genes 0.000 description 3
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 3
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 3
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 3
- 102000002265 Human Growth Hormone Human genes 0.000 description 3
- 108010000521 Human Growth Hormone Proteins 0.000 description 3
- 239000000854 Human Growth Hormone Substances 0.000 description 3
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 3
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 3
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 3
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 3
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 3
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 3
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 3
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 3
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 3
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 3
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 3
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 3
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 3
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 3
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 3
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 3
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 3
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 3
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 3
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 3
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 3
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 3
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 3
- 201000009906 Meningitis Diseases 0.000 description 3
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 3
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 3
- 206010033078 Otitis media Diseases 0.000 description 3
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 3
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 3
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 3
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 3
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 3
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 3
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 3
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 3
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 3
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 3
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 3
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 3
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 3
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 3
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 3
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 3
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 3
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 3
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- 241000680505 Streptococcus pneumoniae WU2 Species 0.000 description 3
- 102000002933 Thioredoxin Human genes 0.000 description 3
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 3
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 3
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 3
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 3
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 3
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 3
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 3
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 3
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 3
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 3
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 3
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 3
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 3
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 3
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 3
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 108010036533 arginylvaline Proteins 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 229920001429 chelating resin Polymers 0.000 description 3
- 239000007330 chocolate agar Substances 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 244000052769 pathogen Species 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 239000013615 primer Substances 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 108060008226 thioredoxin Proteins 0.000 description 3
- 229940094937 thioredoxin Drugs 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 2
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 2
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- OCDJOVKIUJVUMO-SRVKXCTJSA-N Arg-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N OCDJOVKIUJVUMO-SRVKXCTJSA-N 0.000 description 2
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 2
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 2
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 2
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 2
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 2
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 2
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 2
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 2
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- IIQIOFVDFOLCHP-UHFFFAOYSA-N Asn-Pro-Ser-Ser Chemical compound NC(=O)CC(N)C(=O)N1CCCC1C(=O)NC(CO)C(=O)NC(CO)C(O)=O IIQIOFVDFOLCHP-UHFFFAOYSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 2
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 2
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 2
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 2
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 2
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 2
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 2
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 2
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 2
- 208000031729 Bacteremia Diseases 0.000 description 2
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 2
- 241000672609 Escherichia coli BL21 Species 0.000 description 2
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 2
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 2
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 2
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 2
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 2
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 2
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 2
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 2
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 2
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 2
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 2
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 2
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 2
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 2
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 2
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 2
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 2
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 2
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 2
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 2
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 2
- FHGVHXCQMJWQPK-SRVKXCTJSA-N His-Lys-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O FHGVHXCQMJWQPK-SRVKXCTJSA-N 0.000 description 2
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 2
- ZNTSGDNUITWTRA-WDSOQIARSA-N His-Trp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O ZNTSGDNUITWTRA-WDSOQIARSA-N 0.000 description 2
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 2
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 2
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 2
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 2
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 2
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 2
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 2
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 2
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- 108090001030 Lipoproteins Proteins 0.000 description 2
- 102000004895 Lipoproteins Human genes 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 2
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 2
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 2
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 2
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 2
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 2
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 2
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 2
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 2
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 2
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 2
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 2
- 241001072332 Monia Species 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 2
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 2
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 2
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 2
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 2
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 101100460200 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) NEW1 gene Proteins 0.000 description 2
- 101100167780 Schizosaccharomyces pombe (strain 972 / ATCC 24843) coa2 gene Proteins 0.000 description 2
- 101100172625 Schizosaccharomyces pombe (strain 972 / ATCC 24843) erh1 gene Proteins 0.000 description 2
- 101100460198 Schizosaccharomyces pombe (strain 972 / ATCC 24843) new14 gene Proteins 0.000 description 2
- 101100460204 Schizosaccharomyces pombe (strain 972 / ATCC 24843) new4 gene Proteins 0.000 description 2
- 101100273916 Schizosaccharomyces pombe (strain 972 / ATCC 24843) wip1 gene Proteins 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 241001505901 Streptococcus sp. 'group A' Species 0.000 description 2
- 241000193990 Streptococcus sp. 'group B' Species 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 2
- BDYBHQWMHYDRKJ-UNQGMJICSA-N Thr-Phe-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N)O BDYBHQWMHYDRKJ-UNQGMJICSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 2
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 2
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 2
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 2
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 2
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 2
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 2
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 2
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 2
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- CHWRZUGUMAMTFC-IHRRRGAJSA-N Val-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CNC=N1 CHWRZUGUMAMTFC-IHRRRGAJSA-N 0.000 description 2
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 230000009435 amidation Effects 0.000 description 2
- 238000007112 amidation reaction Methods 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 238000003018 immunoassay Methods 0.000 description 2
- 238000003119 immunoblot Methods 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 238000010255 intramuscular injection Methods 0.000 description 2
- 239000007927 intramuscular injection Substances 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 231100000518 lethal Toxicity 0.000 description 2
- 230000001665 lethal effect Effects 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 210000004897 n-terminal region Anatomy 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000011321 prophylaxis Methods 0.000 description 2
- 229940023143 protein vaccine Drugs 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- CWERGRDVMFNCDR-UHFFFAOYSA-N thioglycolic acid Chemical compound OC(=O)CS CWERGRDVMFNCDR-UHFFFAOYSA-N 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 238000002255 vaccination Methods 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- HUEXNHSMABCRTH-UHFFFAOYSA-N 1h-imidazole Chemical compound C1=CNC=N1.C1=CNC=N1 HUEXNHSMABCRTH-UHFFFAOYSA-N 0.000 description 1
- 101710117373 92 kDa protein Proteins 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- 239000005995 Aluminium silicate Substances 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- JRCASHGTXZYSPW-XIRDDKMYSA-N Asn-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC(=O)N)N JRCASHGTXZYSPW-XIRDDKMYSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000228245 Aspergillus niger Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 239000004971 Cross linker Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 101001065501 Escherichia phage MS2 Lysis protein Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 238000000729 Fisher's exact test Methods 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 208000029462 Immunodeficiency disease Diseases 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- -1 NEW11 Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 241000187654 Nocardia Species 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 101100322386 Schizosaccharomyces pombe (strain 972 / ATCC 24843) new8 gene Proteins 0.000 description 1
- 101100460205 Schizosaccharomyces pombe (strain 972 / ATCC 24843) new9 gene Proteins 0.000 description 1
- 101100255212 Schizosaccharomyces pombe (strain 972 / ATCC 24843) rsa3 gene Proteins 0.000 description 1
- 101100210181 Schizosaccharomyces pombe (strain 972 / ATCC 24843) vts1 gene Proteins 0.000 description 1
- 206010040047 Sepsis Diseases 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- 229940124858 Streptococcus pneumoniae vaccine Drugs 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000037006 agalactosis Effects 0.000 description 1
- 238000007818 agglutination assay Methods 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 235000012211 aluminium silicate Nutrition 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 230000002547 anomalous effect Effects 0.000 description 1
- 230000000845 anti-microbial effect Effects 0.000 description 1
- 230000003497 anti-pneumococcal effect Effects 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 238000009640 blood culture Methods 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 125000001314 canonical amino-acid group Chemical group 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010924 continuous production Methods 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000003113 dilution method Methods 0.000 description 1
- 125000000118 dimethyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000006806 disease prevention Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- 238000012872 hydroxylapatite chromatography Methods 0.000 description 1
- 150000002460 imidazoles Chemical class 0.000 description 1
- 230000007813 immunodeficiency Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- NLYAJNPCOHFWQQ-UHFFFAOYSA-N kaolin Chemical compound O.O.O=[Al]O[Si](=O)O[Si](=O)O[Al]=O NLYAJNPCOHFWQQ-UHFFFAOYSA-N 0.000 description 1
- 101150109249 lacI gene Proteins 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 238000001325 log-rank test Methods 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 150000003956 methylamines Chemical class 0.000 description 1
- 238000007479 molecular analysis Methods 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 230000001254 nonsecretory effect Effects 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 229940080469 phosphocellulose Drugs 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- ZZIZZTHXZRDOFM-XFULWGLBSA-N tamsulosin hydrochloride Chemical compound [H+].[Cl-].CCOC1=CC=CC=C1OCCN[C@H](C)CC1=CC=C(OC)C(S(N)(=O)=O)=C1 ZZIZZTHXZRDOFM-XFULWGLBSA-N 0.000 description 1
- 125000004149 thio group Chemical group *S* 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 101150057627 trxB gene Proteins 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940125575 vaccine candidate Drugs 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/315—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci
- C07K14/3156—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Streptococcus (G), e.g. Enterococci from Streptococcus pneumoniae (Pneumococcus)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P11/00—Drugs for disorders of the respiratory system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P27/00—Drugs for disorders of the senses
- A61P27/16—Otologicals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Abstract
Description
본 발명은 항원에 관한 것으로, 좀 더 자세하게는 치료 및/또는 예방을 위한 백신 성분으로 유용한 스트렙토코커스 뉴모니애(streptococcus pneumoniae) 병원균(pathogen)의 단백질 항원에 관한 것이다.
S. 뉴모니애는 사람 특히 유아, 노인 및 면역력이 떨어진 사람에게 주요한 발병 요인이다. 상기 병원균은 전세계적으로 높은 발병율 및 사망율을 갖는 균혈증 (bacteraemia) /패혈증(septicaemia), 폐렴(pneumonia), 뇌막염(meningitis) 등의 침입성 질병의 환자로 부터 자주 검출된다. 심지어 적절한 항생제 치료가 병행되어도, 폐렴 구균(peumococcal)의 감염은 여전히 높은 사망율을 나타낸다. 비록 항미생물성 약물의 출현이 전반적인 폐렴 구균성 질병으로 인한 사망율을 감소시켰다 하더라도 저항성을 갖는 폐렴 구균성 미생물의 존재는 오늘날 세계적으로 주요 문제가 되고 있다. 효과적인 폐렴 구균 백신은 S. 뉴모니애로 인한 발병율 및 사망율에 주요 효과를 가져야 한다. 또한, 상기 백신은 유아와 어린이의 중이염(otitis media)을 막는데에도 잠재적으로 유용하다.
폐렴 구균 백신을 개발하기 위한 노력은 일반적으로 폐렴 구균 외피 다당류 에 면역 반응을 유발하는 것으로 집약될 수 있다. 80개 이상의 폐렴 구균 외피 혈청형(serotype)이 항원적으로 차이점을 가진다는 사실은 알려져왔다. 현재 입수 가능한 폐렴 구균 백신은 가장 빈번하게 질병을 유발하는 23 외피 다당류를 포함하는데, 몇몇 외피 다당류의 낮은 면역 유발력, 및 혈청형의 다양성 및 시간에 따른 혈청형 분포의 차이, 위치 영역 및 연령 그룹과 관련된 중요한 단점들을 가지고 있다. 특히, 모든 혈청형에 대하여 어린이들을 보호하기 위한 기존의 백신 및 현재 개발중인 외피 융합 백신은 효과가 낮아 다른 S. 뉴모니애 성분의 조사를 측정하였다. 비록 외피 다당류의 면역 유발능이 향상될 수 있다하더라도, 혈청형 특이성은 여전히 다당류-원류 백신의 주요 한계점일 것이다. 항원적으로 공통된 면역 유발 폐렴 구균 단백질 항원 단독 또는 부가적인 성분과 결합된 형태로의 사용은 단백질-원류의 폐렴 구균 백신의 효과 상승 가능성을 제공한다.
1998년 5월 7일자에 "스트렙토코커스 뉴모니애 항원 및 백신" 의 명칭으로 공개된 PCT 공개 번호 제WO98/18930호는 항원성을 갖는다고 주장된 특정 폴리펩타이드에 대하여 기술하고 있다. 그러나, 상기 폴리펩타이드의 생물학적 활성은 보고되지 않았다.
따라서, 스트렙토코커스 감염의 예방 및/또는 치료를 위한 백신 성분으로써 사용될 수 있는 스트렙토코커스 항원에 대한 요구는 충족되지 않은 상태로 남아있는 실정이다.
본 발명은 스트렙토코커스 감염의 예방 및/또는 치료를 위한 백신 성분으로써 사용될 수 있는 스트렙토코커스 항원의 제공을 목적으로 한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70%의 동일성을 갖는 폴리펩타이드를 암호화하는 분리형 폴리뉴클레오타이드를 제공한다.
다른 관점에서, 본 발명은 발현 조절 영역에 조작적으로 결합된 본 발명의 폴리뉴클레오타이드를 포함하는 벡터, 상기 벡터로 형질전환된 숙주 세포 및 발현에 적합한 조건 하에서 상기 숙주 세포를 배양하는 것을 포함하는 폴리펩타이드의 제조 방법을 제공한다.
또 다른 측면에 있어서, 본 발명은 상기 폴리뉴클레오타이드에 의해 암호화되는 신규한 폴리펩타이드를 제공한다.
BVH-71, BVH-3 및 BVH-11 단백질이 동일한 기능을 공유할 수 있다. 또한, 본 발명자들의 결과는 BVH-71 단백질이 항-스트렙토코커스의 단백질 백신 성분으로 사용될 수 있다는 것을 시사한다. 좀 더 구체적으로, 상기 BVH-71 단백질은 항-GAS 또는 항-GBS 백신의 단백질 백신 성분으로 사용될 수 있다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 95% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 8, 10, 14, 16, 55 내지 75, 77 내지 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴 리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 8, 10, 16, 55, 56, 57, 58, 59, 64, 65, 66, 78 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 8, 10, 16, 55, 56, 57, 59, 64, 65, 66, 78 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 4, 14, 58, 60, 61, 62, 63, 67, 68, 69, 70, 71, 72, 73, 74, 75, 77, 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 4, 14, 60, 61, 62, 63, 67, 68, 69, 70, 71, 72, 73, 74, 75, 77, 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 10, 14, 16, 55 내지 75, 77 내지 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴 클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 10, 55 내지 75, 77, 78, 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 55 내지 75, 77, 78, 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 6, 8, 10 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 10, 14, 16 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 14, 16 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2 또는 이의 절편, 유사체 또는 유도 체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 4 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 10 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 14 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 16 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 58 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 60 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 62 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 64 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 67 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 68 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 69 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 72 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
*한가지 관점에서, 본 발명은 서열번호 74 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 77 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 제2의 폴리펩타이드와 적어도 70% 동일성을 갖는 폴리펩타이드를 코딩하는 분리형 폴리뉴클레오타이드를 제공한다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 6, 8, 10 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
*한가지 관점에서, 본 발명은 서열번호 2, 4, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 8, 10, 14, 16, 55 내지 75, 77 내지 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 10, 14, 16, 55 내지 75, 77 내지 79 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 2, 4, 10, 14, 16 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 2 또는 이들의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 4 또는 이들의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 10 또는 이들의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 14 또는 이들의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 16 또는 이들의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 10, 55 내지 75, 77, 78, 79 또는 이 들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 10, 62, 64, 67, 68, 74, 77 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 58 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 62 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 64 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 67 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 68 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 74 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
한가지 관점에서, 본 발명은 서열번호 77 또는 이의 절편, 유사체 또는 유도체 서열을 포함하는 아미노산 서열로 특정되는 폴리펩타이드에 관한 것이다.
좀 더 구체적으로, 본 발명은 또한 본 출원에 기술된 하나 또는 그 이상의 폴리펩타이드 또는 이들의 절편, 유사체 또는 유도체를 포함하는 키메라성 폴리펩타이드에 관한 것이다.
좀 더 구체적으로, 본 발명은 또한 본 출원의 도면에 정의된 하나 또는 그 이상의 폴리펩타이드 또는 이들의 절편, 유사체 또는 유도체를 포함하는 키메라성 폴리펩타이드에 관한 것이다.
좀 더 구체적으로, 본 발명은 또한 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택되고; 키메라성 폴리펩타이드를 형성하기 위해 연결된 상기 폴리펩타이드 또는 이들의 절편, 유사체 또는 유도체로부터 제공된 둘 또는 그 이상의 폴리펩타이드를 포함하는 키메라성 폴리펩타이드에 관한 것이다.
좀 더 구체적으로, 상기 키메라성 폴리펩타이드는 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편, 유사체 또는 유도체로부터 선택되고; 키메라성 폴리펩타이드를 형성하기 위해 연결된 상기 폴리펩타이드 또는 이들의 절편, 유사체 또는 유도체로부터 제공된 둘 또는 그 이상의 폴리펩타이드를 포함할 것이다.
좀 더 구체적으로, 상기 키메라성 폴리펩타이드는 서열번호 10, 58, 60, 62, 64, 67, 68, 74, 77 또는 이들의 절편, 유사체 또는 유도체로부터 선택되고; 키메라성 폴리펩타이드를 형성하기 위해 연결된 상기 폴리펩타이드 또는 이들의 절편, 유사체 또는 유도체로부터 제공된 둘 또는 그 이상의 폴리펩타이드를 포함할 것이다.
좀 더 구체적으로, 상기 키메라성 폴리펩타이드는 서열번호 10, 62, 64, 67, 68, 74, 77 또는 이들의 절편, 유사체 또는 유도체로부터 선택되고; 키메라성 폴리펩타이드를 형성하기 위해 연결된 폴리펩타이드 또는 이들의 절편, 유사체 또는 유도체로부터 제공된 둘 또는 그 이상의 폴리펩타이드를 포함할 것이다.
좀 더 구체적으로, 상기 키메라성 폴리펩타이드는 2 내지 5개의 폴리펩타이드를 포함할 것이다.
좀 더 구체적으로, 상기 키메라성 폴리펩타이드는 2 내지 4개의 폴리펩타이드를 포함할 것이다.
*좀 더 구체적으로, 상기 키메라성 폴리펩타이드는 2 내지 3개의 폴리펩타이드를 포함할 것이다.
좀 더 구체적으로, 상기 키메라성 폴리펩타이드는 2개의 폴리펩타이드를 포함할 것이다.
좀 더 구체적으로, 본 발명은 하기 구조식 1로 표시되는 키메라성 폴리펩타이드를 제공한다.
(구조식 1)
상기 구조식 1에 있어서, m은 0 또는 1, n은 0 또는 1,
*A는 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편 또는 유사체 또는 유도체;
B는 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편 또는 유사체 또는 유도체;
C는 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편 또는 유사체 또는 유도체; 및
D는 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편 또는 유사체 또는 유도체 서열로 부터 선택된다.
좀 더 구체적으로,
A는 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체;
B는 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체;
C는 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체; 및
D는 서열번호 10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체 서열로 부터 선택된다.
좀 더 구체적으로,
A는 서열번호 10, 58, 60, 62, 64, 67, 68, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체;
B는 서열번호 10, 58, 60, 62, 64, 67, 68, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체;
C는 서열번호 10, 58, 60, 62, 64, 67, 68, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체; 및
D는 서열번호 10, 58, 60, 62, 64, 67, 68, 74, 77 또는 이들의 절편 또는 유사체 또는 유도체 서열로 부터 선택된다.
일실시예에 있어서, 본 발명의 키메라성 폴리펩타이드는 하기 실시예에 존재하는 폴리펩타이드를 단독 또는 결합체로 포함한다.
좀 더 구체적으로, A는 서열번호 10, 58, 62, 64, 67, 68, 74, 77 또는 이들의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, A는 서열번호 10 또는 이의 절편, 유사체 또는 유도체이 다.
좀 더 구체적으로, A는 서열번호 58 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, A는 서열번호 62 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, A는 서열번호 64 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, A는 서열번호 67 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, A는 서열번호 68 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, A는 서열번호 74 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, A는 서열번호 77 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 10, 58, 62, 64, 67, 68, 74, 77 또는 이들의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 10 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 58 또는 이의 절편, 유사체 또는 유도체이 다.
좀 더 구체적으로, B는 서열번호 64 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 64 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 67 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 68 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 74 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, B는 서열번호 77 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 10, 58, 62, 64, 67, 68, 74, 77 또는 이들의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 10 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 58 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 62 또는 이의 절편, 유사체 또는 유도체이 다.
좀 더 구체적으로, C는 서열번호 64 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 67 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 68 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 74 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, C는 서열번호 77 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 10, 58, 62, 64, 67, 68, 74, 77 또는 이들의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 10 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 58 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 62 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 64 또는 이의 절편, 유사체 또는 유도체이 다.
좀 더 구체적으로, D는 서열번호 67 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 68 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 74 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, D는 서열번호 77 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, m은 0이다.
좀 더 구체적으로, n은 0이다.
좀 더 구체적으로, m 및 n은 0이다.
좀 더 구체적으로, m 및 n은 0이고, A는 서열번호 64 또는 이의 절편, 유사체 또는 유도체이고, B는 서열번호 62 또는 이의 절편, 유사체 또는 유도체이다.
좀 더 구체적으로, m 및 n은 0이고, A는 서열번호 62 또는 이의 절편, 유사체 또는 유도체이고, B는 서열번호 64 또는 이의 절편, 유사체 또는 유도체이다.
본 발명의 견지에서, 폴리펩타이드를 코딩하는 모든 뉴클레오타이드 및 키메라성 폴리펩타이드는 본 발명의 범주내에 있다.
좀 더 구체적으로, 본 발명에 부합하는 상기 폴리펩타이드 또는 키메라성 폴리펩타이드는 항원이다.
좀 더 구체적으로, 본 발명에 부합하는 상기 폴리펩타이드 또는 키메라성 폴리펩타이드는 개개인에 면역 반응을 도출시킬 수 있다.
좀 더 구체적으로, 본 발명은 또한, 상기에 정의된 본 발명의 상기 폴리펩타이드 또는 키메라성 폴리펩타이드에 결합 특이성을 갖는 항체를 생성할 수 있는 폴리펩타이드에 관한 것이다.
"결합 특이성을 가지는" 항체는 선택된 폴리펩타이드를 인식하고 결합하나 생물학적 샘플 내의 다른 물질들과는 연속적으로 인식하고 결합하지 않는, 자연적으로 선택된 펩타이드를 포함하는 항체이다. 특이적 결합은 선택된 폴리펩타이드가 항원으로 사용된 ELISA 검정을 수행하여 측정될 수 있다.
다르게 정의되지 않았다면, 본 명세서에 사용된 모든 기술적 과학적 용어는 본 발명 해당 분야의 당업자에 의해 일반적으로 이해되는 것과 동일한 의미를 갖는다. 본 명세서에 언급된 모든 공개 공보, 특허 출원, 특허 및 다른 참고 문헌들은 모두 참고문헌으로 포함된다. 상충되는 경우 본 명세서의 정의에 따른다. 부가적으로 성가 물질, 방법, 및 실시예들은 단지 기술하기 위한 목적일 뿐 본 발명의 내용을 제한하기 위한 것이 아니다.
본 명세서에 사용된 본 발명의 폴리펩타이드의 "절편", "유도체" 또는 "유사체"는 하나 또는 그 이상의 아미노산 잔기가 공통되거나 또는 공통되지 않은 아미노산 잔기(바람직하게는 공통된)로 치환되고, 정상이거나 변성된 폴리펩타이드를 포함한다. 일실시예에 있어서, 본 발명 폴리펩타이드의 유도체 및 유사체는 도면에 기술된 서열 또는 이들의 절편과 약 70%의 동일성을 가질 것이다. 즉, 상기 잔 기의 70%는 동일하다. 좀 더 구체적으로, 폴리펩타이드는 75% 이상의 유사성을 가질 것이다. 좀 더 구체적으로, 폴리펩타이드는 80% 이상의 유사성을 가질 것이다. 좀 더 구체적으로, 폴리펩타이드는 85% 이상의 유사성을 가질 것이다. 좀 더 구체적으로, 폴리펩타이드는 90% 이상의 유사성을 가질 것이다. 좀 더 구체적으로, 폴리펩타이드는 95% 이상의 유사성을 가질 것이다. 좀 더 구체적으로, 폴리펩타이드는 99% 이상의 유사성을 가질 것이다. 좀 더 구체적으로, 본 발명 폴리펩타이드의 유도체 및 유사체는 치환, 변성 또는 결실된 아미노산 잔기가 약 20개 이하, 바람직하게는 10개 미만일 것이다. 바람직한 치환은 공통된 잔기 즉, 소수성(hydrophobicity), 사이즈, 극성, 또는 기능기 등의 물리 화학적 특성을 공유하는 치환된 잔기로 당업 분야에서 알려진 것이다.
본 발명의 견지에 있어서, 본 발명의 풀리펩타이드는 폴리펩타이드 및 키메라성 폴리펩타이드를 모두 포함한다.
또한, 본 발명은 폴리펩타이드의 생물학적 또는 약학적 특성을 변화시키는 다른 화합물, 예를 들어 반수명(half-life)을 증가시키는 PEG(polyethylene glycol); 정제를 용이하게 하기 위한 리더 또는 보조 아미노산 서열; 프리프로(prepro-) 및 프로(pro-) 서열; 및 (다)당류가 결합된 폴리펩타이드를 포함한다.
또한, 아미노산 영역이 다형적인 것으로 판명될 경우, 좀 더 효과적으로 다른 스트렙토코커스 균주의 다른 항원결정기와 유사한 하나 또는 그 이상의 특정 아미노산을 다양화하는 것이 바람직하다.
게다가, 본 발명의 폴리펩타이드는 안전성을 제공하고, 지지체 또는 다른 물 질과의 연결 또는 결합을 위한 소수성을 증가시키기 위하여 말단 -NH2 아실화(acylation; 예를 들어 암모니아 또는 메틸 아민을 사용한 아세틸화, 또는 티오글리콜산 아미드화(thioglycolic acid amidation), 말단 카르보시(carbosy) 아미드화)에 의해 변성될 수 있다.
또한, 본 발명에서 숙고된 것은 상기 폴리펩타이드 절편, 유사체 및 유도체의 이종 및 동종 폴리펩타이드 다합체(multimer)이다. 상기 중합체는 예를들어 아비딘/바이오틴, 글루테르알데히드(gluteraldehyde) 또는 디메틸 수퍼이미데이트 (dimethyl superimidate) 등의 크로스-링커(cross-linker)와 크로스-링크된 하나 또는 그 이상의 폴리펩타이드를 포함한다. 상기 중합체는 또한, 둘 또는 그 이상의 직렬 또는 전회된 연속적인 서열을 포함하는 폴리펩타이드를 포함하며, 재조합 DNA 기술에 의하여 제조된 다전사적 mRNAs로부터 제조된다. 바람직하게는 본 발명 폴리펩타이드의 절편, 유사체 또는 유도체는 적어도 하나의 항원성 유발 영역 즉, 적어도 하나의 항원 결정기를 포함할 것이다.
항원성 유발중합체(즉, 합성 다합체)를 형성하기 위하여, 비샬로아세틸기(bishaloacetyl group), 니트로아릴할라이드(nitroarylhalides) 또는 그 유사체을 가지며, 상기 반응물이 티오기(thio group)에 특이적인 폴리펩타이드가 사용될 수 있다.
그러므로, 상기 다른 펩타이드의 두 머켑토 기(mercapto group) 사이의 결합은 단일 결합이거나 또는 적어도 둘, 일반적으로는 적어도 네 개, 그리고 16개를 넘지 않는 결합 그룹으로 구성되나, 일반적으로 탄소수 약 14개를 넘지는 않는다.
본 발명의 특정 실시예에서, 본 발명의 폴리펩타이드 절편, 유사체 및 유도체는 출발 잔기로서 메티오닌(Met)을 포함하지는 않는다. 바람직하게 폴리펩타이드는 리더 또는 보조 서열(신호 서열)을 포함하지는 않을 것이다. 본 발명 폴리펩타이드의 신호 부분은 공지의 분자 생물학적 기술에 따라 결정될 수 있다. 일반적으로, 중요한 상기 폴리펩타이드는 스트렙토코커스 배양으로부터 분리될 수 있으며, 성숙 단백질(mature protein)의 초기 잔기 및 성숙 폴리펩타이드의 서열을 결정하기 위하여 연속적으로 서열 분석될 수 있다.
본 발명의 다른 관점에 따르면, 본 발명은 약학적으로 적합한 운반 희석제 또는 애주번트(adjuvant)로 이루어진 첨가물에 본 발명의 하나 또는 그 이상의 스트렙토코커스 폴리펩타이드를 포함하는 백신 조성물을 제공한다. 적절한 애주번트는 오일 즉, 프루이드의 완전 또는 불완전 애주번트(Freund's complete or incomplete adjuvant); 염 즉, AlK(SO4)2, AlNa(SO4)2, AlNH4(SO4)2, 실리카(silica), 카올린(kaolin), 탄소 폴리뉴클레오타이드 즉, 폴리 IC 및 폴리 AU이 포함된다. 바람직한 애주번트는 퀼A(QuilA) 및 Al하이드로겔(Alhydrogel). 본 발명의 백신은 주사제, 급격한 주입, 상인두 흡수(nasopharyngeal absorption), 피부 흡수(dermoabsorption)에 의해 비경구 투여될 수 있으며 또는 구강 또는 경구 투여 될 수 있다. 또한, 약학적으로 적합한 운반체는 파상풍 톡소이드(tetanus toxoid)를 포함한다.
본 발명의 백신 조성은 본 발명의 참고 문헌에 포함된 P. R. Murray (P. R. Murray, Ed, in chief, E.J. Baron, M.A. Pfaller, F.C. Tenover 및 R.H. Yolken.; Manual of Clinical Microbiology, ASM Press, Washington, D.C. sixth edition, 1995, 1482p)에 의해 기술된 바와 같이 스트렙토코커스 감염 및/또는 스트렙토코커스 감염의 치료 또는 예방에 의해 매개되는 질병 및 증상에 사용된다. 일 실시예에서 본 발명의 백신 조성물은 뇌막염, 중이염, 균혈증 또는 폐렴의 치료 또는 예방을 위해 사용된다. 일 실시예에서 본 발명의 백신 조성물은 스트렙토코커스 감염의 치료 또는 예방 및/또는 스트렙토코커스 특히, 스타필로코커스 유레우스(S. aureus) 뿐 아니라, S. 뉴모니애, 그룹 A 스트렙토코커스(streptococcus; pyogenes), 그룹 B 스트렙토코커스(GBS 또는 agalactiae), 디스갈락티애(dysgalactiae), 우베리스(uberis), 노카르디아(nocardia) 감염에 의해 매개되는 질병 및 증상의 치료 또는 예방을 위한 목적으로 사용된다. 좀 더 구체적으로, 상기 스트렙토코커스는 S. 뉴모니애이다.
특정 실시예에 있어서, 백신은 유아, 노인 및 면역력이 결핍된 개체 등과 같이 스트렙토코커스 감염의 위험성이 있는 개체에게 투여될 수 있다.
본 발명에서 "개체"라는 용어는 포유류를 의미하며 좀 더 구체적으로 상기 포유류는 사람이다.
*백신 조성은 바람직하게는 약 0.001 내지 100 ㎍/㎏(항원/체중), 좀 더 바람직하게는 0.01 내지 10 ㎍/㎏(항원/체중), 가장 바람직하게는 0.1 내지 1 ㎍/㎏ (항원/체중)의 투여량으로 면역을 위해 투여시기 마다 약 1내지 6주의 간격을 두고 1 내지 3번 투여된다.
다른 측면에 의하면, 본 발명은 서열번호 2, 4, 6, 8, 10, 14, 16, 55 내지 75, 77 내지 79, 81, 83 또는 이들의 절편, 유사체 또는 유도체로부터 선택된 아미노산 서열에 의해 특정되는 폴리펩타이드를 코딩하는 폴리뉴클레오타이드를 제공한다.
본 발명의 일 실시예에서, 상기 폴리뉴클레오타이드는 ORF를 포함하고, 본 발명의 폴리펩타이드를 코딩하는 서열번호 1, 3, 5, 7, 9, 11, 12, 13, 15, 76, 80, 82로 표시되는 것이다. 도면에 기재된 상기 폴리뉴클레오타이드 서열이 본 발명의 폴리펩타이드를 여전히 코딩하는 코돈을 변성시킴으로써 변경될 수 있다는 사실은 고려되어야 한다. 또한, 본 발명은 서열들 사이에서 50%의 동일성을 갖는 본 명세서 상에서 전술된 폴리뉴클레오타이드 서열(또는 이들에 상보적인 서열)에 혼성화된 폴리뉴클레오타이드를 더욱 제공한다. 일 실시예에서 서열들 사이의 동일성은 적어도 70%이다. 일 실시예에서 서열들 사이의 동일성은 적어도 75%이다. 일 실시예에서 서열들 사이의 동일성은 적어도 80%이다. 일 실시예에서 서열들 사이의 동일성은 적어도 85%이다. 일 실시예에서 서열들 사이의 동일성은 적어도 90%이다. 좀 더 구체적으로 폴리뉴클레오타이드는 적어도 95%의 동일성을 갖는 엄격한 조건(stringent conditions)하에서 혼성화가 가능하다. 좀 더 구체적으로는 동일성은 97% 이상이다.
좀 더 구체적으로, 폴리뉴클레오타이드는 본 발명의 폴리펩타이드를 코딩하 는 서열번호 1, 3, 7, 9, 11, 12, 13, 15, 76, 80, 82로 표시된다.
좀 더 구체적으로, 폴리뉴클레오타이드는 본 발명의 폴리펩타이드를 코딩하고, ORF를 포함하는 서열번호 1, 3, 9, 11, 12, 13, 15, 76, 80, 82로 표시된다.
좀 더 구체적으로, 폴리뉴클레오타이드는 본 발명의 폴리펩타이드를 코딩하고, ORF를 포함하는 서열번호 1, 3, 9, 11, 12, 13, 15, 76로 표시된다.
좀 더 구체적으로, 폴리뉴클레오타이드는 본 발명의 폴리펩타이드를 코딩하고, ORF를 포함하는 서열번호 1, 3, 7, 9, 11, 12, 13, 15, 76로 표시된다.
좀 더 구체적으로, 폴리뉴클레오타이드는 본 발명의 폴리펩타이드를 코딩하고, ORF를 포함하는 서열번호 1, 7, 9, 11, 15, 76로 표시된다.
좀 더 구체적으로, 폴리뉴클레오타이드는 본 발명의 폴리펩타이드를 코딩하고, ORF를 포함하는 서열번호 1, 9, 11, 15, 76로 표시된다.
좀 더 구체적으로, 폴리뉴클레오타이드는 본 발명의 폴리펩타이드를 코딩하고, ORF를 포함하는 서열번호 1, 7, 9, 11로 표시된다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 1로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 7로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 9로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 11로 표시되며, 본 발명 의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 15로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 3, 12, 13, 76으로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 3으로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 12로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 13으로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
좀 더 구체적으로, 폴리뉴클레오타이드는 서열번호 76으로 표시되며, 본 발명의 폴리펩타이드를 코딩한다.
당업자에 의해 쉽게 인식되는 바와 같이, 상기 폴리뉴클레오타이드는 DNA 및 RNA 모두를 포함한다.
또한, 본 발명은 본 출원 명세서 상에 기술된 상기 폴리뉴클레오타이드에 상보적인 폴리뉴클레오타이드를 포함한다.
다른 관점에 있어서, 본 발명의 폴리펩타이드를 코딩하는 폴리뉴클레오타이드 또는 이의 절편, 유사체 또는 유도체는 DNA 면역 실험에 사용될 수 있다. 즉, 상기 폴리뉴클레오타이드 또는 이의 절편, 유사체 또는 유도체는 생체내에서 항원성 폴리펩타이드를 제조함으로써 단사(injection)상에 복제가 가능하고 발현가능한 벡터에 삽입될 수 있다. 예를 들어, 폴리뉴클레오타이드는 진핵세포에서 기능을 하는 CMV 프로모터의 조절하에 플라스미드 벡터로 삽입될 수 있다. 바람직하게 상기 벡터는 근육간 주사로 투여된다.
본 발명의 다른 관점에 따르면, 본 발명은 숙주세포에서 상기 폴리펩타이드를 코딩하는 폴리뉴클레오타이드를 발현하고 상기 발현된 폴리펩타이드 산물을 회수하는 재조합 기술에 의한 본 발명의 폴리펩타이드를 제조하는 공정을 제공한다. 또한, 상기 폴리펩타이드는 공지의 합성 화학 기술(synthetic chemical techniques) 즉, 전장 폴리펩타이드를 제조하기 위하여 접합된 올리고펩타이드의 용액상 또는 고체상 합성에 따라 제조될 수 있다.
폴리뉴클레오타이드 및 폴리펩타이드의 수득 및 정량 방법은 Sambrook 등에 의한 참고 문헌들을 따랐다(Sambrook 등, Molecular Cloning, A Laboratory Manual, 2nd ed, Cold Spring Harbor, N. Y., 1989; Current Protocols in Molecular Biology, Edited by Ausubel F.M. 등, John Wiley 및 Sons, Inc. 뉴욕; PCR Cloning Protocols, Molecular Cloning to Genetic Engineering, Edited by White B.A., Humana Press, Totowa, New Jersey, 1997, P.490; Protein Purification, Principles and Practices, Scopes R.K., Springer-Verlag, New York, 3rd Edition, 1993, 380 페이지; Current Protocols in Immunology, Edited by Coligan J. E. 등, John Wiley & Sons Inc., New York)
재조합체 생산을 위하여, 숙주 세포는 상기 폴리펩타이드를 코딩하는 벡터로 형질전환되었으며, 그 다음으로 프로모터를 활성화시키기에 적합하도록 변형된 영 양 배지상에서 배양하였고, 형질전환체를 선별하거나 또는 유전자를 증폭한다.
적합한 벡터는 선택된 숙주내에서 생존 및 복제가 가능한 것이며, 염색체성, 비-염색체성 및 합성 DNA 서열 예를 들어 박테리아 플라스미드, 파지(phage) DNA, 베큘로바이러스(baculovirus), 효모 플라스미드, 플라스미드와 파지 DNA의 결합에 의해 유도된 벡터를 포함한다. 상기 폴리펩타이드 서열은 제한효소를 이용하여 벡터의 적절한 자리에 삽입될 수 있는데, 즉, 프로모터, 리보솜 결합 부위(공통 영역 또는 Shine-Dalgarno 서열), 및 선택적으로 작동인자(operator; 조절 요소)를 포함하는 발현 조절 영역에 조작적으로 연결된다. 주어진 숙주 및 공지의 분자생물학적 원리에 따라 제조된 벡터에 적합한 발현 조절 영역의 개체적 성분을 선택할 수 있다(Sambrook 등, Molecular Cloning: A Laboratory Manual, 2nd ed, Cold Spring Harbor, N.Y., 1989; Current Protocols in Molecular Biology, Edited by Ausubel F.M. 등, John Wiley and Sons. Inc. New York incorporated herein by reference). 적합한 프로모터는 LTR 또는 SV40 프로모터, E. Coli lac, tac 또는 trp 프로모터 및 파지 람다(lambda) PL 프로모터를 포함하나, 이에 한정되지는 않는다. 바람직하게 벡터는 선별 마커 즉, 앰피실린(ampicilin) 저항 유전자 뿐 아니라 복제 기점(origin)을 포함할 것이다. 적합한 박테리아 벡터는 pET, pQE70, pQE60, pQE-9, pbs, pD10 파지 스크립트(phagescript), psiX174, pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A, ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 및 진핵세포용 벡터인 pBlueBacIII, pWLNEO, pSV2CAT, pOG44, pXT1, pSG, pSVK3, pBPV, pMSG 및 pSVL를 포함한다. 숙주 세포는 박테리아 즉, E. Coli, 바실러스 썹틸리스(Bacillus subtilis), 스트렙토마이세스(Streptomyces); 진균(fungal), 즉 아스파질러스 니거(Aspergillus niger), 아스파질러스 니둘린(Aspergillus nidulins); 효모 즉, 사카로마이세스(Saccharomyces) 또는 진핵세포 즉, CHO, COS 일 수 있다.
배양을 통한 폴리펩타이드의 발현시, 세포는 전형적으로 원심분리에 의해 수확된 다음 물리적 또는 화학적 수단(발현된 폴리펩타이드가 배지로 분비되지 않았을 경우)에 의해 분쇄되었고, 결과적인 조추출물(crude extract)은 중요한 상기 폴리펩타이드를 보유하고 있다. 배양 배지 또는 분쇄액(lysate)으로부터 상기 폴리펩타이드의 정제는 폴리펩타이드의 특성에 따라 공지의 기술 즉, 황산 암모늄 (ammonium sulfate) 또는 에탄올 침전, 산 추출, 음이온 또는 양이온 교환 크로마토그래피, 인산셀룰로오즈(phosphocellulose) 크로마토그래피, 소수성 상호작용 크로마토그래피(hydrophobic interaction chromatography), 수산화 인회석(hydroxylapatite) 크로마토그래피 및 렉틴(lectin) 크로마토그래피에 의해 얻어질 수 있다. 최종 정제는 HPLC를 사용하여 얻을 수 있다.
상기 폴리펩타이드는 리더 또는 분비 서열을 가지거나 또는 가지지 않은 상태로 발현된다. 전자의 경우 상기 리더 서열은 전사 후 공정에 의해 제거될 수 있으며(본 명세서에 참고문헌으로 기재된 미국특허 제4,431,739호, 미국특허 제4,425,437호, 및 미국특허 제4,338,397호 참조) 또는, 발현된 폴리펩타이드를 정제하기 위한 연속적인 공정을 통해 화학적으로 제거될 수 있다.
본 발명의 다른 관점에 따르면, 본 발명의 상기 스트렙토코커스 폴리펩타이드는 스트렙토코커스 특히 S. 뉴모니애 감염의 진단 테스트 시에 사용될 수 있다. 생물학적 시료에 있는 스트렙토코커스 미생물을 진단하기 위해서 여러가지 진단 방법이 가능한데, 예를들어 하기의 과정이 수행될 수 있다:
a) 환자로부터 생물학적 샘플을 얻는 단계;
b) 상기 생물학적 샘플과 본 발명의 스트렙토코커스 폴리펩타이드에 반응하는 항체 또는 이의 절편을 배양하여 혼합물을 만드는 단계; 및
c) 상기 혼합물 내에 특이적으로 결합된 항체 또는 결합 절편이 있는지를 확인하여 스트렙토코커스의 존재를 확인하는 단계.
또한, 상기 항체를 포함하고 있거나 포함하고 있다고 추정된 생물학적 시료에서 스트렙토코커스 항원에 특이적인 항체의 감지를 위한 방법은 하기와 같이 수행될 수 있다:
a) 환자로부터 생물학적 샘플을 얻는 단계;
b) 상기 생물학적 샘플과 하나 또는 그 이상의 본 발명의 스트렙토코커스 폴리펩타이드 또는 이의 절편을 배양하여 혼합물을 만드는 단계; 및
c) 상기 혼합물 내에 특이적으로 결합된 항원 또는 결합 절편이 있는지를 확인하여 스트렙토코커스에 특이적인 항체의 존재를 확인하는 단계.
당업자는 상기 진단 테스트가 ELISA(enzyme-linked immunosorbent assay), 방사면역분석시험(radioimmunoassay) 또는 유액 응집시험(latex agglutination assay) 등을 포함하는 면역적 방법을 다양한 형태로 수행될 수 있음을 인지할 것이 고, 필수적으로 상기 단백질에 특이적인 항체가 미생물 내에 존재하는 지를 결정한다.
또한, 본 발명의 폴리펩타이드를 코딩하는 DNA 서열은 해당 박테리아를 함유하고 있다고 추정되는 생물학적 시료에 스트렙토코커스의 존재 유무를 감지하기 위하여 사용되는 DNA 소식자를 디자인하는데 사용될 수 있다. 본 발명의 감지 방법은
a) 환자로부터 생물학적 샘플을 얻는 단계;
b) 상기 생물학적 샘플과 본 발명의 폴리펩타이드 또는 이의 절편을 코딩하는 DNA 서열을 갖는 하나 또는 그 이상의 DNA 소식자를 배양하여 혼합물을 형성하는 단계; 및
c) 스트렙토코커스 박테리아의 존재를 감지하기 위하여 상기 혼합물 내에 특이적으로 결합한 DNA 소식자를 감지하는 단계를 포함한다.
또한, 본 발명의 상기 DNA 소식자는 스트렙토코커스 감염을 진단하는 방법으로서 예를들어 PCR을 사용하여 순환하는 스트렙토코커스 즉, 샘플 내 S. 뉴모니애 핵산을 감지하는데 사용될 수 있다. 상기 소식자는 전통적인 방법으로 합성될 수 있으며, 고체상에서 고정되거나 또는 식별 가능한 라벨로 라벨링될 수 있다. 본 발명을 위한 바람직한 DNA 소식자는 본 발명의 스트렙토코커스 뉴모니애 폴리펩타이드의 적어도 6개 지속적인 뉴클레오타이드에 상보적인 서열을 갖는 올리고머이다.
환자의 스트렙토코커스를 감지하는 다른 진단 방법은
a) 감지 가능한 라벨로 상기 발명의 폴리펩타이드 또는 이의 절편과 반응하는 항체를 라벨링하는 단계;
b) 라벨링된 항체 또는 라벨링된 절편을 환자에 투여하는 단계; 및
c) 환자에서 특이적으로 결합된 라벨링된 항체 또는 라벨링된 절편을 감지하여 스트렙토코커스의 존재를 감지하는 단계를 포함한다.
본 발명의 다른 관점은, 본 발명의 스트렙토코커스 폴리펩타이드를 진단을 위한 특이적인 항체의 제조를 위한 면역원 및 특히, 스트렙토코커스 감염의 치료를 위해 사용하는 용도에 관한 것이다.
적합한 항체는 적절한 스크리닝 방법 예를 들어, 테스트 모델에 있어서 스트렙토코커스 감염에 대하여 수동적으로 방어하는 특정 항체의 능력을 측정함으로써 결정될 수 있다. 동물모델의 일실시예는 본 명세서의 실시예에 기술된 마우스 모델이다. 상기 항체는 전체 항체 또는 이의 항원-결합 절편이 될 수 있고, 어떠한 면역글로블린(immunoglobulin) 종류도 가능하다. 상기 항체 또는 절편은 동물 기원, 상세하게는 포유류 기원 및 좀 더 상세하게는 설치류(murine), 랫트 또는 사람 기원이다. 상기 항체 또는 절편은 자연 항체 또는 이의 절편 또는 원한다면 재조합 항체 또는 항체 절편이다. 상기의 재조합 항체 또는 항체 절편 용어는 분자 생물학적 기술을 이용하여 제조된 항체 또는 항체 절편을 의미한다. 상기 항체 또는 항체 절편은 폴리클로날 또는 바람직하게는 모노클로날이다. 상기 항체 또는 항체 절편은 스트렙토코커스 뉴모니애 폴리펩타이드와 관련되는 항원결정기의 수에 특이적이나, 바람직하게는 하나이다.
범주를 제한하지 않으면서, 또한 본 발명은 BVH-3, BVH-11, BVH-11-2, BVH-28 및 BVH-71로 지정된 새로운 항원에 관한 것이다. 또한, 본 발명은 BVH-3, BVH-11, BVH-11-2, BVH-28 및 BVH-71로 지정된 새로운 항원의 절편을 포함하는 절단된 폴리펩타이드에 관한 것이다. 또한, 본 발명은 BVH-3, BVH-11, BVH-11-2, BVH-28 및 BVH-71로 지정된 새로운 항원의 절편을 포함하는 키메라성 폴리펩타이드에 관한 것이다. 본 발명의 항원들 사이의 관계는 요약하여 하기 참고표에 기재하였다.
참고표
종류 | 뉴클레오타이드 서열번호 | 폴리펩타이드 서열번호 |
BVH-3 | ||
BVH-3 | 1, 11 | 2 |
BVH-3A | 7 | 8 |
BVH-3B | 9 | 10 |
BVH-3 SP63 | 15 | 16 |
BVH-3M | 55 | |
BVH-3AD | 56 | |
L-BVH-3AD | 57 | |
New12 | 76 | 58 |
BVH-3 | 59 | |
New1 | 64 | |
New2 | 65 | |
New3 | 66 | |
New15 | 78 | |
BVH-11 | ||
BVH-11 | 3, 12 | 4 |
BVH-11-2 | 13 | 14 |
BVH-11M | 60 | |
BVH-11A | 61 | |
BVH-11B(New13) | 62 | |
BVH-11C | 63 | |
New4 | 67 | |
New5 | 68 | |
New6 | 69 | |
New7 | 70 | |
New8 | 71 | |
New9 | 72 | |
BVH-11-2M | 73 | |
New10 | 74 | |
New11 | 75 | |
New12 | 76 | 58 |
New14 | 77 | |
New16 | 79 | |
BVH-28 | ||
BVH-28 | 5 | 6 |
BVH-71 | ||
GBS | 80 | 81 |
GAS | 82 | 83 |
실시예 1
본 실시예는 S. 뉴모니애 유전자의 클로닝에 관하여 기술한다.
서열번호 1로 표시되는 S. 뉴모니애 유전자 BVH-3의 코딩 영역 및 서열번호 5로 표시되는 S. 뉴모니애 유전자 BVH-28의 코딩 영역은 제한효소 위치 Bgl II (AGATCT) 및 Xba I (TCTAGA)의 추가를 위해 첨가된 염기를 함유한 올리고를 사용하여 혈청군 6 S. 뉴모니애 균주 SP64의 게노믹 DNA로부터 PCR(DNA Thermal Cycler GeneAmp PCR system 2400 Perkin Elmer, Scan Jose, CA)을 통해 확장되었다. PCR 산물은 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA)를 사용하여 아가로오즈 젤로부터 정제되었고, Bgl II-Xba I(Pharmacia Canada Inc, Baie d'Urfe Canada)로 절단되었고, 페놀: 클로로포름으로 추출되고 에탄올로 침전시켰다. 수퍼링커 벡터 pSL301(Invitrogen, San Diego, CA)은 Bgl II 및 Xba I으로 절단되었고, 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA)를 사용하여 아가로오즈 겔로부터 정제하였다. 상기 Bgl II-Xba I 게노믹 DNA 절편은 Bgl II-Xba I pSL301 벡터에 접합되었다. 상기 접합된 산물은 Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed). pp. 109-135)의 방법에 따라 E. Coli 균주 DH5a [f80 lacZ DM15 endA1 recA1 hsdR17 (rK- mK+) supE44 thi-11- gyrA96 relA1 D(lacZYA-argF) U169] (Gibco BRL, Gaithersburg, MD)로 형질전환되었다. BVH-3 또는 BVH-28 유전자를 포함하는 재조합 pSL301 플라스미드(rpSL301)는 QIAgen 키트(Chatsworth, CA)를 사용하여 정제되었고, DNA 삽입물은 뉴클레오타이드 서열 분석(Taq Dye Deoxy Terminator Cycle Sequencing kit, ABI, Foster City, CA)에 의해 확인되었다. 재조합 rpSL301(rpSL301)은 제한효소 Bgl II (AGATCT) 및 Xho I (CTCGAG)로 절단하였다. 절단된 DNA 절편 Bgl II-Xho I은 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA))를 사용하여 정제되었다. 티오레독 신(thioredoxin)-His 표지(tag) 서열을 함유하고 있는 pET-32c(+) 발현 벡터(Novagen, Madison, WI)는 Bam HI (GGATCC) 및 Xho I으로 절단하였고, 겔은 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA)를 사용하여 추출하였다. 상기 Bgl II-Xho I DNA 절편은 티오레독신-His 표지-BVH-3 또는 티오레독신-His 표지-BVH-28 융합 단백질의 코딩 서열을 제조하기 위하여 pET-32c(+) 벡터의 Bam HI-Xho I 부위에 접합시켰다. 상기 접합된 결과물은 Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed). pp. 109-135)의 방법에 따라 E. Coli 균주 DH5a [f80 lacZ DM15 endA1 recA1 hsdR17 (rK- mK+) supE44 thi-11- gyrA96 relA1 D(lacZYA-argF) U169] (Gibco BRL, Gaithersburg, MD)로 형질전환되었다. 재조합 pET-32c(+) 플라스미드는 QIAgen 키트(Chatsworth, CA)를 사용하여 정제되었고, 티오레독신-His 표지 및 DNA 삽입물의 융합 부분의 뉴클레오타이드 서열은 DNA 서열분석(Taq Dye Deoxy Terminator Cycle Sequencing kit, ABI, Foster City, CA)에 의해 확인되었다.
실시예 2
본 실시예는 CMV 플라스미드 pCMV-GH에서 S. 뉴모니애 단백질 유전자의 클로닝에 관한 것이다.
S. 뉴모니애 단백질의 DNA 코딩 영역은 플라스미드 벡터 pCMV-GH(Tang 등, Nature, 1992, 356: 152)에 있는 CMV(cytomegalavirus) 프로모터의 전사적 조절하에 있는 hGH(human growth hormone)의 하류(downstream)에 삽입되었다. 상기 CMV 프로모터는 E. Coli 세포에서 비기능적인 플라스미드이나, 진핵 세포에 플라스미드를 처리하였을 경우 활성을 갖는다. 또한, 상기 벡터는 엠피실린 저항 유전자를 포함한다.
서열번호 1로 표시되는 BVH-3 유전자의 코딩 영역 및 서열번호 5로 표시되는 BVH-28 유전자는 제한 효소 Bgl II (AGATCT) 및 Xba I (TCTAGA)를 사용하여 rpSL301로부터 얻었다(실시예 1 참조). 상기 절단된 결과물은 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA)를 사용하여 정제되었다. 융합 단백질을 제조하기 위하여 사람 성장 호르몬을 포함하는 상기 pCMV-GH 벡터(Laboratory of Dr. Stephen A. Johnston, Department of Biochemistry, The University of Texas, Dallas, Texas)는 Bgl II 및 Xba I으로 절단하였고, 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA)를 사용하여 아가로오즈 겔로부터 정제하였다. 상기 Bgl II-Xba I DNA 절편은 Bgl II-Xba I 으로 절단된 pCMV-GH 벡터에 접합시켜 CMV 프로모터에 의해 조절되는 hGH-BVH-3 또는 hGH-BVH-28 융합 단백질을 제조하였다. 상기 접합된 결과물은 Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed). pp. 109-135)의 방법에 따라 E. Coli 균주 DH5a [f80 lacZ DM15 endA1 recA1 hsdR17 (rK- mK+) supE44 thi-11- gyrA96 relA1 D(lacZYA-argF) U169] (Gibco BRL, Gaithersburg, MD)로 형질전환되었다. 상기 재조합 pCMV 플라스미드는 QIAgen 키트(Chatsworth, CA)를 사용하여 정제되었다.
서열번호 3으로 표시되는 BVH-11 유전자의 코딩 영역은 제한효소 위치 Bgl II (AGATCT) 및 Hind III (AAGCTT)의 추가에 의한 첨가된 염기를 함유한 올리고를 사용하여 혈청군 6 S. 뉴모니애 균주 SP64의 게노믹 DNA로부터 PCR(DNA Thermal Cycler GeneAmp PCR system 2400 Perkin Elmer, Scan Jose, CA)을 통해 확장되었다. PCR 산물은 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA)를 사용하여 아가로오즈 젤로부터 정제되었고, 제한 효소들(Pharmacia Canada Inc, Baie d'Urfe Canada)로 절단되었고, 페놀: 클로로포름으로 추출되고 에탄올로 침전시켰다. 상기 pCMV-GH 벡터(laboratory of Dr. Stephen A. Johnston, Department of Biochemistry, The University of Texas, Dallas, Texas)는 Bgl II 및 Hind III로 절단되고, 키트(QIAquick gel extraction kit; QIAgen(Chatsworth, CA)를 사용하여 아가로오즈 겔로부터 정제하였다. 상기 Bgl II-Hind III DNA 절편은 Bgl II-Hind III pCMV-GH 벡터에 접합되어 CMV 프로모터의 조절을 받는 hGH-BVH-11 융합 단백질을 제조하였다. 상기 접합된 산물은 Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed). pp. 109-135)의 방법에 따라 E. Coli 균주 DH5a [f80 lacZ DM15 endA1 recA1 hsdR17 (rK- mK+) supE44 thi-11- gyrA96 relA1 D(lacZYA-argF) U169] (Gibco BRL, Gaithersburg, MD)로 형질전환되었다.
상기 재조합 pCMV 플라스미드는 QIAgen 키트(Chatsworth, CA)를 사용하여 정제되었고, DNA 삽입물의 뉴클레오타이드 서열은 서열 분석으로 확인하였다.
실시예 3
본 실시예는 S. 뉴모니애 항원에 면역 반응을 일으키는 DNA의 용도에 관한 것이다.
암컷 BALB/c 마우스(Charles River, St-Constant, Quebec, Canada) 8마리 그룹은 과립백혈구-대식세포 콜로니-촉진 인자 (GM-CSF; granulocyte-macrophage colony-stimulating factor)-발현 플라스미드 pCMV-GH-GM-CSF(Laboratory of Dr. Stephen A. Johnston, Department of Biochemistry, The University of Texas, Dallas, Texas) 50 ㎍과 함께 BVH-3, BVH-11 또는 BVH-28 유전자를 코딩하는 재조합 pCMV-GH 100 ㎍을 2주 또는 3주의 간격으로 50 ㎕씩 세 번에 근육주사하여 면역시켰다. 대조군으로는 pCMV-GH-GM-CSF 50 ㎍과 함께 pCMV-GH 100 ㎍을 주사한 마우스 그룹을 사용하였다. 혈액 샘플은 각 면역 전 및 세 번째 주사 다음 7일 후에 안구로부터 얻었으며, 혈청 항체 반응은 코팅 항원으로 티오레독신-His 표지-S. 뉴모니애 융합 단백질을 사용한 ELISA로 결정되었다. BVH-3, BVH-11 또는 BVH-28 S. 뉴모니애 단백질을 코딩하는 재조합 플라스미드 pCMV-GH를 사용한 DNA 면역 반응은 각각의 재조합 단백질에 대해 반응하는 항체를 유발한다. 상기 상호작용을 나타내는 항체의 타이터(titer)는 흡수 수치가 배경 수치보다 0.1 이상이 되도록 높은 혈청 희석 방법으로 정의한 결과 4×103 이상이다.
실시예 4
본 실시예는 재조합 S. 뉴모니애 단백질의 제조 및 정제에 관한 것이다.
각각 서열번호 1, 서열번호 3 또는 서열번호 5로 표시되는 BVH-3, BVH-11 또는 BVH-28 유전자를 포함하는 상기 재조합 pET 플라스미드는 전기천공법(Gene Pulser II apparatus, BIO-RAD Labs, Mississauga, Canada)을 사용하여 E. Coli 균주 AD494 (DE3)(Dara- leu7697 DlacX74 DphoA PvuII phoR DmalF3 F'[lac+(lacIq) pro] trxB: :Kan)(Novagen, Madison, WI)으로 각각 형질전환 되었다. 상기 E. Coli 균주에 있어서, T7 RNA 폴리머라아제(1DE3 프로파지 상에 존재)에 의해 특이적으로 인식되는 상기 융합 단백질의 발현을 조절하는 T7 프로모터는 IPTG(isopropyl-β-d-thio-galactopyranoside)에 의해 증폭되는 lac 프로모터에 의해 조절된다. 상기 형질전환체 AD494(DE3)/rpET는 ㎖당 엠피실린(Sigma-Aldrich Canada Ltd., Oakville, Canada) 100 ㎍이 포함된 LB 배지(peptone 10 g/L, 효모 추출물 5 g/L, NaCl 10 g/L) 상에 250 rpm으로 흔들어 주면서 37℃에서 A600 값이 0.6이 될 때까지 배양하였다. 티오레독신-His 표지-BVH-3, 티오레독신-His 표지-BVH-11 또는 티오레독신-His 표지-BVH-28 융합 단백질의 생산을 증폭시키기 위하여, 상기 세포에 최종농도 1 mM로 IPTG를 첨가하고 추가적으로 2시간동안 배양하였다. 100 ㎖의 배지에서 증폭된 세포는 원심분리에 의해 분리되어, -70℃에서 냉동시켰다.
IPTG로 증폭된 AD494(DE3)/rpET의 가용성 세포질 부분로 부터 상기 융합 단백질의 정제는 His 결합 금속 킬레이트 레진 상에 이동되는 2가 양이온(Ni2 +)에 결합되는 His 표지 서열(6개의 연속되는 히스티딘 레진)의 특성을 윈리로 하는 친화 크로마토그래피에 의해 수행되었다. 간략하게는, IPTG에 의해 증폭된 100 ㎖의 배 지로부터 얻은 응집된 세포는 PBS(Phosphate buffered saline) : 500 mM NaCl pH 7.1 용액에 재현탁되었고, 초음파 분쇄한 다음 끼꺼기를 제거하기 위하여 20,000 × g에서 20분 동안 회전시켰다. 상기 상층액은 여과시켰으며(0.22 ㎛ 기공크기 막), 미리 충전되고 킬레이트화 되어 사용할 준비가 되어있는 HiTrap(R)(Pharmacia Biotech, Baie d'Urfe Canada) 1 ㎖ 상에 충진시켰다. 상기 티오레독신-His 표지-S. 뉴모니애 융합 단백질은 1M 이미다졸(imidazole)-500 mM NaCl-PBS(pH 7.1) 용액으로 용출되었다. 상기 샘플의 염 및 이미다졸은 PBS로 4℃에서 투석(dialysis)하여 제거하였다. E. Coli의 수용성 부분으로부터 얻어진 융합 단백질의 양은 MicroBCA(Pierce, Rockford, Illinois)로 측정되었다.
실시예 5
본 실시예는 면역에 의한 마우스의 치명적 폐렴 구균 감염에 따른 보호능에 대하여 기술한다.
암컷 BALB/c 마우스(Charles River) 8마리 그룹은 QuilA 애주번트(Cedarlane Laboratories Ltd, Hornby, Canada) 15 ㎍이 첨가된 친화 정제된 티오레독신-His 표지-BVH-3 융합 단백질 25 ㎍을 3주 간격으로 3번 피하주사하여 면역시켰다. 대조군으로는 PBS 상의 QuilA 애주번트 만을 투여한 군을 사용하였다. 혈액 샘플은 각 면역화 과정 1일, 22일, 43일 전 및 3차 주사후 7일(50일째) 후에 안구 동(orbital sinus)으로부터 채취하였다. 일주일 후에 상기 마우스는 타입 3 S. 뉴모니애 균주 WU2를 대략 106 CFU 투여하였다. 상기 S. 뉴모니애 투여 접종의 샘플은 CFU를 결정 하고, 투여량을 결정하기 위하여 초콜렛 아가 플레이트상에 도말하였다. 사망은 14일 동안 기록되었으며, 투여 후 14일째에 생존하는 마우스는 사망시켰고, 혈액 샘플내 S. 뉴모니애 미생물의 존재 여부를 테스트하였다. 상기 생존 데이터는 하기 표 1에 나타내었다.
투여전 혈청은 표준적인 면역검정법으로 S. 뉴모니애와 반응하는 항체의 존재 유무가 분석되었다. ELISA 및 면역블럿 분석은 E. Coli 내에 제조된 재조합 S. 뉴모니애 단백질과의 면역 반응이 재조합 및 자연의 폐렴 구균 단백질 모두와 반응하는 항체를 도출하였음을 나타내었다.
면역원 | 투여 14일 후의 생존 마우스 수: 사망 마우스 수 | 평균 사망일 |
BVH-3 | 8:0 | 14일 이상 |
없음 | 0:8 | 1 |
BVH-3 재조합 단백질로 면역된 마우스는 감염에 대해 살아남은 반면, 애주번트만 단독으로 투여한 대조군 마우스는 모두 사망하였다. 마우스 두 그룹 사이의 현저한 생존율 차이를 나타내었다(p〈0.0001, log rank test for nonparametric analysis of survival curves; P=0.0002, Fisher's exact test). 생존한 마우스로 부터 얻은 모든 혈구 배양(hemoculture) 결과 투여 14일 후에 음성으로 나타났다.
실시예 6
본 실시예는 다양한 S. 뉴모니애 균주로부터 BVH-3 및 BVH-11 유전자의 클로닝 및 상기 유전자들의 분자적 공통성에 관한 것이다.
BVH -3 또는 BVH -11의 다른 영역 DNA 소식자로 분리된 다양한 S. 뉴모니애 염색체 DNA의 분자적 분석은 하나의 BVH -3 유전자 카피 및 두 개의 BVH -11 카피가 존재함을 드러낸다. 상기 두 개의 BVH-11 유전자 카피는 동일하지 않고 및 상기 유전자는 임의로 BVH -11(서열번호 12; 뉴클레오타이드 114 내지 2630의 ORF) 및 BVH-11-2(서열번호 13; 뉴클레오타이드 114 내지 2630의 ORF)로 지정하였다
BVH-3 및 BVH-11 코딩 영역의 첫번째 아미노산은 또한 신호 서열로도 알려진 리더 서열의 특성을 가진다. 리포프로테인(lipoprotein) 변경/과정 부분의 상기 컨센서스(consensus) 신호 펩티다아제 절개 사이트 L-X-X-C는 상기 서열내에 존재한다. S. 뉴모니애 SP64의 성숙 BVH-3, BVH-11 및 BVH-11-2 단백질은 각각 1019, 821 및 819개의 아미노산을 갖는다. 성숙 BVH-3을 위한 S. 뉴모니애 유전자 코딩 영역인 BVH-3M(뉴클레오타이드 1837-4896; 서열번호 11), BVH-11M(뉴클레오타이드 102-2567; 서열번호 12) 및 BVH-11-2M(뉴클레오타이드 171-2630; 서열번호 13)은 6 또는 7 S. 뉴모니애 균주의 게노믹 DNA를 사용한 PCR로 확장하였다(DNA Thermal Cycler GeneAmp PCR system 2400 Perkin Elmer, San Jose, CA). 혈청형 6 S. 뉴모니애 SP64 및 혈청형 9 SP63 진단 분리체(the laboratoire de la sante publique du quebec, Sainte-Anne-de-Bellevue; 혈청형 4 균주 JNR.7/87(Andrew Camilli, Tufts University School of Medicine, Boston; type 2 균주 D39의 비피낭성(nonencapsulate) 유도체, Rx1 균주 및 유형 3 균주 A66 및 WU2(David E. Briles, University of Alabama, Birmingham) 및 유형 3 진단적 분리체 P4241(centre cenytr de recherche en infectiologie du centre hospitalier de l'universite Laval, Sainte-Foy)은 각각의 공여자로부터 얻었다. 추가적인 제한 효소 부분을 위한 염기 부분을 함유하고 있는 올리고뉴클레오타이드 프라이머 OCRR479-OCRR480의 세트; HAMJ160-OCRR488 및 HAMJ160-HAMJ186은 각각 BVH -3, BVH -11 및 BVH -11-2 유전자의 증폭을 위해 사용하였다. 단, 예외로 SP64 균주의 BVH-11 유전자는 HAMJ487 및 OCRR488로 이루어진 프라이머 세트를 이용하여 확장되었다. 사용한 프라이머 서열은 하기 표 2a, b 및 c에 표시하였다. PCR 결과물은 QIAgen(chatsworth, CA)의 키트(QIAquick gel extraction kit)를 사용하여 아가로오즈 겔로부터 정제하였고, BglII-XbaI 또는 BglII-HindIII(pharmacia Canada Inc, Baie d'Urfe, Canada)로 절단하였다. 절단물은 QIAgen(chatsworth, CA) 키트(QIAquick PCR purification kit)를 사용하여 세척하였다. 상기 PCR 결과물은 BglII-XbaI 또는 BglII-HindIII pSL301 벡터에 접합시켰다. 상기 접합된 결과물은 Simanis (Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed). pp. 109-135)의 방법에 따라 E. Coli 균주 DH5a [φ80 lacZ DM15 endA1 recA1 hsdR17 (rK- mK+) supE44 thi-1λ- gyrA96 relA1 Δ(lacZYA-argF) U169] (Gibco BRL, Gaithersburg, MD)로 형질전환되었다. BVH-3, BVH-11 또는 BVH11-2를 포함하는 재조합 pSL301 플라스미드(rpSL301)은 QIAgen 키트(Chatsworth, CA)를 사용하여 정제하였고, DNA 삽입부분은 서열분석하였다(Taq Dye Deoxy Terminator Cycle Sequencing kit, ABI, Foster City, CA). 도 11a, b, c 및 d 및 12는 각각 BVH-3 및 BVH-11의 추론된 아미노산 서열로부터 얻은 공통 서열(consensus sequence)에 관해 도시하고 있다. BVH-3 단백질 서열의 비교 결과 모든 균주의 서열에서 99 내지 100%의 동일성을 나타내었으나, 단 예외로 혈청군 9 SP63 균주의 BVH-3(서열번호 15 및 서열번호 16)는 S. 뉴모니애 SP64의 BVH-3' 단백질 서열의 잔기 244 내지 420에 상응하는 177개 아미노산 부분이 결실되어 있다. 부가적인 혈청군 9 균주의 서열 분석 결과 BVH-3 분자가 4 균주를 제외한 균주 3에 동일한 결실을 가지고 있는 것을 확인하였으며, 따라서, 3 균주는 S. 뉴모니애 혈청군 9 클론의 멤버이다.
13 BVH-11 뉴클레오타이드 서열은 7 S. 뉴모니애 균주와 비교하였으며, 상기 뉴클레오타이드 서열이 매우 유사함을 확인할 수 있었다. 미리 예측된 BVH-11 단백질 서열 다합체 배열의 컴퓨터 분석(MacVector, Clustal W 1.4) 결과 상기 서열이 834 아미노산 길이에 있어서, 75% 동일성 및 82%의 유사성을 나타내었다. 복합 배열(Pairwise alignment)은 80 내지 100% 동일성을 나타낸다(도 13 참조). 상기 서열은 전체적 부분에서 높은 유사성을 나타낸다. 상기 단백질의 주요 서열에 있어서의 다양성은 거의 단백질 C-말단 부분의 마지막 125 아미노산을 제한한다. 상기 영역은 하나의 도메인을 구성한다. 상기 도메인의 유사성 검사 결과 두 그룹의 서열로 이루어져 있음을 확인하였다. 도 13의 첫번째 9 서열은 제1 그룹에 속하고, 마지막 4 서열은 다른 그룹에 속한다. 상기 도메인 서열 13 단백질을 비교하였고(MacVector, Clustal W 1.4), 동일성 수치는 39% 이었다. 동일 그룹에 속하는 서열을 비교하였을 경우 92% 이상까지 증가된 동일 수치를 나타내었다.
*실시예 7
본 실시예는 BVH -3 및 BVH -11 유전자의 유사 부분에 관한 것이다.
BVH -3 및 BVH -11 유전자 유래의 DNA 소식자를 사용한 분석 결과 BVH -3 및 BVH-11이 연과되어 있음을 확인하였다. 닷 블럿 혼성화(dot blot hydridization) 결과, BVH -3 또는 BVH -11 유전자 서열을 포함하는 DNA 소식자는 양쪽에 모두 혼성화되었고, 따라서 BVH -3 및 BVH -11 유전자가 유사 서열을 공유하고 있음을 확인하였다. 서열의 비교 결과 ORFs 및 단백질이 각각 43% 및 33%가 동일함을 확인하였다. 유사성 검사 결과는 BVH-3에서 아미노산 1 내지 225및 BVH-11에서 1 내지 228에 해당하는 영역이 각각 DNA 및 단백질 수준에 있어서, 73% 및 75%로 동일함을 확인하였다. 반대로 BVH-3 아미노산 226 내지 1039 및 BVH-11 아미노산 229-840에 상응하는 3' 영역은 DNA 및 단백질 수준에서 각각 단지 34% 및 22%만이 동일한하였다. 따라서, BVH-3 및 BVH-11 유전자의 5' 말단은 높은 비율의 공통 서열을 함유하고 있는 반면 상기 유전자들의 나머지 부분은 매우 다르다. 전술한 결과는 BVH-3 및 BVH-11이 공통 영역에 존재하는 서열에 의해 매개되는 동일한 기능을 공유할 수 있으며, 반면 BVH-3 및 BVH-11의 특이적 기능은 다른 영역에 존재하는 서열에 의해 매개될 수 있다는 것을 추측하게 한다.
실시예 8
본 실시예는 PCR을 통한 BVH-3, BVH-11 및 BVH-11-2 유전자의 클로닝 및 절단된 BVH-3 및 BVH-11 물질의 발현에 관해 기술한다.
유전자 절편은 S. 뉴모니애 균주 SP64 유래의 서열번호 1 및 서열번호 11로 표시되는 BVH-3, 서열번호 3 및 12로 표시되는 BVH-11 또는 서열번호 13으로 표시되는 BVH-11-2를 포괄하는 절편을 증폭하기 위하여 제작된 한쌍의 올리고뉴클레오타이드를 사용한 PCR로 확장되었다. 상기 프라이머 각각은 5' 말단에 제한 엔도뉴클라아제(endonuclease) 부위를 가지고 있으며, 따라서 절단된 플라스미드 벡터로 상기 증폭된 산물의 프레임내 방향성 클로닝을 가능하게 한다(표 2a, b 및 c, 및 표 3 참조). PCR-증폭된 산물은 제한 엔도뉴클레아제로 절단되었고, 동일한 효소로 절단되거나 상보적인 점착 말단(cohesive end)를 생성하는 효소로 절단된 선형화된 플라스미드 pSL301(실시예 1 참조), pCMV-GH(실시예 2 참조) 또는 pET(Novagen, Madison, WI) 발현 벡터에 접합시킨다. 재조합 pSL301 및 재조합 pCMV-GH 플라스미드는 pET 발현 벡터에 프레임내 삽입 클로닝을 위하여 제한 효소로 절단하였다. 클론은 절단된 BVH-3 또는 BVH-11 물질의 발현을 위해 E. Coli BL21(λDE3) 또는 AD494(λDE3)로 삽입하기 전에 E. Coli DH5α에 안정화시켰다. 결과적인 플라스미드 구조물 각각은 뉴클레오타이드 서열분석으로 확인하였다. 상기 재조합 단백질은 N-말단에 티오레독신 및 His 표지 또는 C-말단에 His 표지를 가진 상태로 발현된다. 상기 발현된 재조합 단백질은 His-결합 금속 킬레이트화 레진(QIAgen, Chatsworth, CA)을 사용하여 초음파 분쇄된 IPTG-증폭 E. Coli 배양물의 원심분리로 얻어진 상층액 부분에서 정제되었다. 상기 유전자 결과물은 하기 표 3에 표시되었다. 신호 서열을 포함하는 N-말단 영역 유전자 결과물은 지질화된 단백질 또는 리포 단백질(L-단백질)로 지정되었다. 신호 서열이 결핍된 N-말단 영역에 상응하는 유전자 결과물은 신호 서열이 없는(w/o ss) 단백질로 명명되었다.
실시예 9
본 실시예는 모노클로날 항체(Mabs)의 분리 및 BVH-3, BVH-11 및 BVH-11-2 단백질 항원결정기를 특정짓는 모노클로날 항체의 용도에 관해 기술한다.
암컷 BALB/c 마우스(Charles River)는 QuilA 애주번트(Cedarlane Laboratories Ltd, Hornby, Canada) 15 ㎍을 첨가하고 S. 뉴모니애 균주 SP64로부터 BVH-3, BVH-11 또는 BVH-11-2 유전자 결과물과 함께 피하주사하여 면역시켰다. 마우스 제1군(융합 실험 1)은 친화 정제된 티오레독신-His 표지-BVH-3M 융합 단백질 25 ㎍을 1일 및 14일에 면역시켰다. 마우스 제2군(융합 실험 2)은 친화 정제된 티오레독신-His 표지-BVH-11M 융합 단백질 25 ㎍을 3주 간격으로 3번 면역시켰다. 제3군(융합 실험 3)은 친화 정제된 티오레독신-His 표지-BVH-11-2M 융합 단백질 25 ㎍을 1일 및 15일에 면역시켰다. 제4군(융합 실험 4)은 친화 정제된 티오레독신-His 표지-BVH-11B 융합 단백질 25 ㎍을 1일에 면역시키고 PBS에 녹인 재조합 BVH-11B를 16일 및 37일에 정맥주사하여 추가면역시켰다. 융합 3시간 또는 4시간 전에 PBS에 현탁된 각 항원 25 ㎍을 정맥주사하였다. 하이브리도마(hybridoma)는 J. Hamel 등에 의해 기술된 바와 같이 비분비 SP2/0 골수종(myeloma) 세포와 비장 세포를 융합하여 제조하였다(J. Hamel 등, J. Med. Microbiol., 23, pp163-170 (1987)). 하이브리도마의 배양 상층액은 J. Hamel 등에 의해 기술된 방법에 따라 정제된 재조합 단백질 또는 가열하여 비활성화시킨 S. 뉴모니애 세포의 상층액으로 코팅된 플레이트를 사용하여 ELISA를 수행함으로써 1차적으로 탐색하였다. 다양한 항원을 사용한 ELISA 활성 측정에 근거하여 선택된 양성 하이브리도마는 제한적인 희석법에 의해 클론화되었고, 불린다음 냉동 보관하였다.
상기 하이브리도마는 모노클로날 항체에 의해 인식된 항원 결정기를 특정하기 위하여 BVH-3 및 BVH-11 유전자 결과물에 대해 ELISA 또는 웨스턴 면역블러팅으로 시험되었다. BVH-3 및 BVH-11는 양쪽 단백질의 활성을 나타내는(표 4 참조) 6개 모노클로날 항체(H3-1-F9, H3-1-D4, H3-1-H12, H11-1-E7, H11-1-H10 및 H11-1.1-G11)와 같은 항원 결정기를 공유하였다. S. 뉴모니애 SP64 유래의 BVH-11 및 BVH-11-2 물질은 BVH-11 및 BVH-11-2, 재조합 단백질과 반응하는 모노클로날 항체(표 5참조)와 BVH-3 상에 존재하지 않는 항원결정기를 공유한다.
a상기 표에 기재된 모노클로날 항체는 재조합 BVH-3 분자와는 반응하지 않는다.
상기 표 4 및 표 5에 나타난 모노클로날 항체의 면역 반응성 연구로부터 얻어진 결과는 각각의 유전자 서열로부터 얻은 단백질 서열과 일치하였다. 실제로 BVH-3 및 BVH-11 분자와 상호작용하는 모노클로날 항체는 공통 영역에 상응하는 BVH-3C 단백질을 인식하며, BVH-11 및 BVH-11-2 특이적 모노클로날 항체는 상기 분자들의 가변 부분에 위치하는 헝워결정기와 반응한다. BVH-3 및 BVH-11, 및 BVH-11 및 BVH-11-2는 모노클로날 항체와 그들의 반응성에 의해 구별된다.
실시예 10
*본 실시예는 S. 뉴모니애로부터 얻은 BVH-3 및 BVH-11 유전자의 동시적 발현에 관해 기술한다.
BVH -3 및 BVH -11 유전자가 S. 뉴모니애에서 발현되는 지의 여부를 조사하기 위하여 표준적인 웨스턴 블럿 기술이 사용되었다. S. 뉴모니애 균주 SP64 및 SP63은 쵸콜릿 아가 플레이트상에서 5% CO2 37℃에서 밤새도록 배양하였고, 박테리아는 PBS에 현탁하였으며, 20분간 56℃에서 가열하여 비활성화시켰다. 항원을 준비하기 위하여 S. 뉴모니애의 현탁액은 100℃에서 5분간 SDS 및 2-머켑토에탄올(2-mercaptoethanol)이 담겨있는 샘플 버퍼로 처리하였다. 폐렴구균성 단백질 항원은 Laemmli의 방법(Laemmli, Nature, 227, pp. 680-685, 1970)에 따라 SDS-PAGE 전기영동으로 분석하였다. SDS-PAGE 수행 후, Towbin의 방법(Towbin, Proc. Natl. Acad. Sci. USA, 76, pp.4350-4354, 1979)에 따라, 상기 단백질을 겔에서 니트로셀룰로오즈 막으로 전기영동으로 전이시키고, 마우스 항혈청 또는 모노클로날 항체로 조사하였다. 항체에 반응하는 항원을 감지하기 위해서 접합된-항-마우스 면역글로블린 및 유색 물질을 사용하는 간접적인 효소-면역 검정을 수행하였다. 항혈청이 재조합 BVH-3를 증가시킬 시점은 S. 뉴모니애 SP64 항원에 대하여 측정되었는데, 분자량이 127 kDa 및 99 kDa인 선명한 두 개의 반응 밴드가 나타난다. 동일한 분자량을 갖는 밴드는 또한, 모노클로날 항체 H3-1-F9, H3-1-D4, H3-1-H12, H11-1-E7, H11-1-H10 및 H11-1.1-G11가 면역적인 소식자로서 개별적으로 사용될 때 감지된다. 반대로 BVH-3 분자에 특이적인 모노클로날 항체는 127 kDa 밴드만을 감지하고, BVH-11에 특이적인 모노클로날 항체는 99 kDa 밴드만을 감지하며, 따라서 127 및 99 kDa 밴드가 각각 BVH-3 및 BVH-11와 동일함을 확인할 수 있었다. 상기 연구는 BVH-3 및 BVH-11 단백질이 동시에 S. 뉴모니애에 존재한다는 증거를 제공한다. 게다가, 상기 결과는 BVH-3 및 BVH-11가 양쪽 단백질에 공통적인 항원 결정기 및 공통적인 단백질을 배제하는 항원 결정기 모두를 소유하고 있다는 이전의 관찰과도 부합된다. S. 뉴모니애 SP64에서, 성숙 BVH-3, BVH-11 및 BVH-11-2는 각각 1019, 821 및 819 아미노산의 단백질이며, 각각 112.5 kDa, 92.4 kDa 및 91.7 kDa의 예측된 분자량을 가지는 단백질이다. 비록, 서열로부터 유추된 분자량과 SDS-PAGE 상에서 측정된 분자량이 일치하지 않아도, BVH-3은 매우 높은 분자량을 가지므로 BVH-11과 구별될 수 있다. 게다가 S. 뉴모니애 균주 SP63 유래의 BVH-3 분자는 127 kDa인 SP64 균주의 BVH-3과 비교하여 SDS-PAGE 상에서 정확히 112 kDa의 분자량을 가진다. 상기 데이타는 S. 뉴모니애 균주 SP63 유래의 BVH-3에서 177 아미노산 부분이 결실된 것과 일치한다.
실시예 11
본 실시예는 재조합 BVH -3 또는 BVH -11 유전자 결과물 백신이 투여된 마우스를 실험적으로 감염시킨 후 나타나는 보호능에 관한 것이다.
7마리 또는 8마리로 이루어진 암컷 BALB/c 마우스(Charles River) 그룹은 친화 정제된 티오레독신-His 표지-BVH-3M 융합 단백질, 친화 정제된 티오레독신-His 표지-BVH-11M 융합 단백질, 또는 대조군으로 PBS에 녹인 QuilA 애주번트를 단독으로 사용하여 3주 간격으로 3번 피하주사하여 면역시켰다. 3번째 면역 후 12 내지 14일 후에, 상기 마우스는 S. 뉴모니애 WU2 균주를 정맥주사하거나 또는 p4241 균주를 비강 투여하였다. 상기 S. 뉴모니애 투여 접종의 샘플은 CFU를 결정하고 투여량을 검증하기 위하여 쵸콜릿 아가 플레이트에 도말하였다. 투여량은 약 106 CFU이었다. 사망은 14일 동안 기록되었으며, 투여 14일 후에 살아남은 마우스는 사망시킨 다음 S. 뉴모니애 미생물의 존재 여부를 혈액 샘플로 테스트하였다. 상기 생존율 데이타는 하기 표 6 및 7에 나타내었다.
실험 | 면역원 | 생존수:사망수a | 평균 생존일 |
1 | BVH-3M 없음 |
8:0 0:8 |
14일 이상 1일 |
2 | BVH-3M 없음 |
8:0 0:8 |
14일 이상 1일 |
a생존 마우스의 수 : 투여 후 14일에 죽은 마우스 수
실험 | 면역원 | 생존수:사망수a | 평균 생존일 |
1 | BVH-3M 없음 |
6:1 1:7 |
14일 이상 4.5일 |
2 | BVH-3M BVH-11M 없음 |
8:0 8:0 0:8 |
14일 이상 14일 이상 1일 |
a생존 마우스의 수 : 투여 후 14일에 죽은 마우스 수
재조합 BVH-3M 또는 BVH-11M 단백질로 면역된 모든 마우스는 WU2의 감염에 대해서 생존한 반면 애주번트를 투여한 대조군 마우스는 모두 사망하였다. 재조합 BVH-3M 또는 BVH-11M 단백질로 면역된 한 마리를 제외한 모든 마우스는 P4241의 감염에 대하여 생존한 반면 애주번트를 투여한 대조군 마우스는 한 마리만이 생존하였다. 생존한 마우스로 부터 얻은 모든 혈액 배양 결과 투여 14일 후에 음성으로 나타났다. 상기 결과는 BVH-3M 및 BVH-11M 모두가 마우스에 보호적인 항-폐렴 구균 면역 반응을 도출한다는 것을 명백하게 나타낸다. 상기 단백질이 S. 뉴모니애 분리체들 사이에서 매우 높게 공통적이라는 사실은 보편적인 백신 유력 물질로서 BVH-3 및 BVH-11의 효능을 강조하는 것이다. 실제로, 혈청군 6 S. 뉴모니애 균주 SP64로부터 얻은 상기 BVH-3 및 BVH-11 단백질은 다른 외피 혈청형의 균주에 의한 폐렴 구균 감염에 대하여 보호능을 나타낸다.
이상적으로, 폐렴 구균 질병을 막을 수 있는 백신은 뇌막염, 중이염, 박테리아 및 폐렴을 막을 수 있다. BVH-3 및 BVH-11은 치명적인 구조적- 및 폐렴 감염 모델에 대해 보호 효과를 가지며, 사람에게 있어서, BVH-3 및 BVH-11-단백질 기본의 백신은 외피 혈청형에 독립적으로 모든 S. 뉴모니애에 의해 실질적으로 유발된 질병의 광범위한 스펙트럼의 경우를 감소시킬 수 있다.
표 6 및 7의 데이터는 BVH-3 및 BVH-11 모두가 S. 뉴모니애의 보호-유발 물질이라는 것을 명백하게 나타낸다. 그러나, 알려지지는 않았지만 상기 보호 효과는 BVH-3 및 BVH-11 분자 상에 공유되지 않는 특이적 서열에 의해 매개될 수 있다. 암컷 BALB/c 마우스(Charles River) 군은 QuilA 애주번트(Cedarlane Laboratories Ltd, Hornby, Canada) 15 ㎍과 함께 친화 정제된 티오레독신-His 표지-BVH-3AD, -BVH-3B 또는 -BVH-3C 융합 단백질을 3주 간격으로 3번 피하주사하여 면역시켰다. 대조군 마우스는 PBS상에 있는 QuilA 애주번트 또는 QuilA와 함께 친화정제된 티오레독신-His 표지 또는 티오레독신-His 표지-융합 단백질(His-Thio)로 면역시켰다.
일련의 절단된 단백질 NEW4, NEW5, NEW6, NEW7, NEW8, NEW9, NEW10, NEW11, NEW14 및 BVH-11B의 보호능을 결정하기 위하여 암컷 BALB/c 마우스(Charles River)는 QuilA 애주번트 15 ㎍과 함께 친화 정제된 His 표지 융합 단백질 25 ㎍ 으로 3주 간격으로 2번 피하주사하여 면역시켰다. 마지막 면역 후 10 내지 14일 후, 상기 마우스에 독성 S. 뉴모니애를 접종하였다. 본 발명자들에 의한 결과가 지적하는 바와 같이, BVH-3 분자의 아미노산 512-1039로 이루어지는 절단된 BVH-3 분자인 BVH-3B는 마우스 독성 균주 WU2 및 P4241에 대한 보호능을 나타내었다. 동일하게 각각 BVH-11 분자의 아미노산 354-840, 286-840 및 286-713으로 이루어진 BVH-11 분자의 절단형인 BVH-11B, NEW4 및 NEW5 분자는 정맥주사된 WU2 및 비강 투여된 P4241에 대해 보호능을 나타내었다. 또한, BVH-11-2 분자의 아미노산 272-838 및 아미노산 227-699로 이루어지는 NEW10 및 NEW14의 백신화는 폐렴구균 균주로 인한 사망에 대하여 보호능을 유발하였다. 상기 결과는 각각 S. 뉴모니애 SP64 BVH-11 및 BVH-11-2 단백질 서열 상의 아미노산 286-713 및 272-699에 걸쳐있는 428개의 아미노산을 포함하는 영역이 보호능을 갖는 항원결정기를 포함한다는 사실을 나타낸다. 상기 영역은 13개 BVH-11 단백질 서열들과 전체적으로는 91% 동일성 및 94% 유사성을 가지며 높은 비율의 공통서열을 갖는다.
a생존한 마우스의 수 : 투여 14일 후 사망한 마우스의 수
bWU2 투여량은 105 CFU
c14일 이상 생존한 마우스는 평균 값의 결정을 위해 생존 기간을 14일로 정하였다.
실시예 12
본 실시예는 BVH-11의 카르복시 말단 영역 C' 말단에 융합된 BVH-3 카르복시 말단 영역에 상응하는 키메라성 폴리펩타이드를 코딩하는 키메라성 유전자의 클로닝 및 발현, 및 상기 키메라성 폴리펩타이드의 백신화 이후에 관찰된 부가적인 보호능에 관하여 기술한다.
전술한 연구에 따르면 BVH-3 및 BVH-11은 혈청군적으로 다른 분자이고, S. 뉴모니애 상에 동시에 존재하는 것이 명확하다. 마우스의 면역학적 연구 결과는 상기 두 단백질이 모두 좋은 백신 후보 물질이라는 것을 드러낸다. 이들 단백질은 혈청형에 관계없이 모든 폐렴구균에 대하여 보호능을 제공하는 잠재력을 가지고 있다. 심지어 상기 두 단백질이 항원 결정기 및 서열을 공유하여도, 상기 두 단백질은 다른 특성을 가지며 다른 생물학적 기능을 할 수 있다. 따라서, 상기 두 단백질의 면역은 각각 개별적인 면역에 의해 생성되는 것보다 높은 수준의 보호능을 제공할 수 있다. 이를 시험하기 위하여 전장 또는 절단된 BVH-3 및 BVH-11를 복합 또는 융합의 형태로 투여하는 여러 방법으로 실험하였다. 본 발명자들은 본 명세서에서 New-12라 명명된 BVH-3-BVH-11(서열번호 76 및 58로 각각 표시됨) 융합 유전자의 유전공학 및 상기 New-12 단백질의 백신으로서의 잠재적 용도에 관해 기술한다.
유전자의 3' 말단에 상응하는 BVH -3 및 BVH -11 유전자 절편은 각각 S. 뉴모니애 균주 SP64 BVH -3 및 BVH -11 유전자의 뉴클레오타이드 1414 내지 3117(서열번호 1) 및 뉴클레오타이드 1060 내지 2520 (서열번호 3) 부분의 절편은 증폭하기 위하여 제작된 여러쌍의 올리고뉴클레오타이드를 사용한 PCR로 증폭하였다. 사용한 프라이머 HAMJ278 및 HAMJ279; HAMJ282 및 HAMJ283은 5' 말단에 제한 엔도뉴클레아제를 가지며, 따라서 절단된 pET21b(+) 플라스미드 벡터에 증폭된 결과물의 방향성 원프레임 클로닝이 가능하다. PCR-증폭된 결과물은 제한효소로 절단되었고, 동일한 효소로 절단되어 선형화된 플라스미드 pET21b(+) 벡터에 접합시켰다. 결과적인 플라스미드 구조물은 뉴클레오타이드 서열 분석으로 확인되었다. NdeI-Hind III BVH-3 PCR 결과물을 포함하는 상기 재조합 pET21b(+) 플라스미드는 BVH -11 유전자 절편을 포함하는 재조합 pET21(+) 벡터로부터 얻은 Hind III-Not I DNA 절편의 프레임 내 클로닝을 위해 제한 효소 Hind III 및 Not I으로 절단되었다. 클론은 키메라성 폐렴 구균의 단백질 분자의 발현을 위해 E. Coli BL21(λDE3)로 삽입되기 전에 우선 E. Coli DH5α에서 안정화되었다. NEW12라 명명된 상기 재조합 키메라성 폴리펩타이드는 His 표지를 가지고 C-말단 융합의 형태로 발현된다. 상기 발현된 재조합 NEW12 단백질은 초음파 분쇄된 IPTG-증폭 E. Coli 배양액을 원심분리함으로써 얻어진 상층액 부분으로부터 His-결합 금속 킬레이트화 레진(QIAgen, Chatsworth, CA)을 사용하여 정제되었다.
전술한 내용과 동일한 과정에 따르면, 다른 키메라성 폴리펩타이드를 제조하는 것이 가능한데, 그 결과로는 New1 및 New4, New1 및 New5, New1 및 New10, 또는 New1 및 New14의 동시 발현이 가능하다. 상기 제조물은 New1 상류(upstream) 또는 New4, New5, New10, BVH-11B 또는 New14의 하류(downstream)를 포함할 수 있다. BVH-3, BVH-11 또는 BVH-11-2 각 유전자의 두 개 이상의 절편의 동시발현의 결과를 위해 다른 키메라성 폴리펩타이드의 제작도 가능하다.
8마리의 암컷 BALB/c 마우스(Charles River)의 그룹은 QuilA 애주번트 15 ㎍과 친화 정제된 His 표지-융합 NEW1, BVH-11B 또는 NEW12 단백질 25 ㎍을 3주 간격으로 3번 피하주사하여 면역시켰다. 마지막 면역 후 10일 내지 14일 후, 상기 마우스에 독성 S. 뉴모니애를 투여하였다. 전술한 바와 같이, 각각 BVH-3 단백질의 아미노산 472에서 1039 및 BVH-11 단백질 아미노산 354-840으로 구성되는 NEW1 및 BVH-11B 분자는 방어적인 면역 반응 도출능이 있는 단백질의 부분과 상응한다. 키메라성 폴리펩타이드가 개별적인 대응물에 대해 보여지는 효과와 비교하여 보호능을 현저하게 향상시키는 지를 결정하기 위하여 투여량은 보호능이 New1 및 BVH-11B 분자에서 기대되지 않는 수준으로 조정되었다. 흥미롭게도, 상기 키메라성 New12 단백질은 마우스에 독성을 나타내는 균주 WU2 및 P4241에 대한 보호능을 도출한다. New 12로 면역된 8마리의 마우스 중 7마리는 투여 후 10일 이후에도 여전히 생존하였으며, New 1, BVH-11B, BVH-3M 또는 애주번트 단독으로 면역된 32 마우스 중 28마리는 투여 후 5일까지 사망하였다. 따라서, NEW12를 사용한 마우스의 백신은 WU2 투여에 대하여 높은 수준의 보호능을 제공하였다. 상기 결과는 키메라성 폴리펩타이드 및 가능하게는 BVH -3 및 BVH -11 유전자 결과물을 결합한 면역은 BVH -3 또는 BVH-11 항원 단독 처리에 의해 얻어지는 것보다 부가적인 보호능을 제공한다.
실시예 13
본 실시예는 S. 뉴모니애 이외에 스트렙토코커스 종에 있어서 추가적인 BVH-3 및 BVH-11 관련 서열의 동정에 관하여 기술한다.
BVH-3, BVH-11 및 BVH-11-2이 공통의 서열을 공유하는 관련 단백질의 과라는 것은 이미 전술한 바 있다. 상기 유전자들의 공통적인 영역 뉴클레오타이드 서열을 가지고 동일성 조사를 수행하였으며, 진뱅크(GenBank) 및 FASTA를 이용하여 EMBL 서열과 비교하였다. 가장 현저한 유사성은 그룹 B 스트렙토코커스 또는 GBS라 명명되는 S. 아갈락티애의 알려지지 않는 기능을 가진 계산된 서열번호 81로 표시되는 92-kDa 단백질에 해당하는 2.469 kb 유전자 코딩 부분에서 관찰되었다. 상기 유전자는 BVH-71이라 명명하였다. GBS 단백질과 99.2% 동일성 및 99.5% 유사성을 나타내는 단백질은 또한 그룹 A 스트렙토코커스 또는 GAS라 명명되는 S. 피오젠(S. pyogenes)에서 동정되었다(서열번호 83). 서열번호 80 및 82로 표시되는 뉴클레오타이드 1 내지 717를 포함하는 BVH-71 서열의 5' 영역은 BVH-3(뉴클레오타이드 1 내지 675) 및 BVH-11(뉴클레오타이드 1 내지 684)의 공통 영역과는 각각 58% 및 60% 동일성을 나타내었다. 상기 GBS 및 GAS BVH-71 ORF의 전사된 서열의 첫번째 239 아미노산은 BVH-3 및 BVH-11의 첫번째 225 및 228 아미노산에 각각 51% 및 54% 동일하다. 또한, 구조적 동일성에 있어서, 스트렙토코커스 BVH-3, BVH-11 및 BVH-71 단백질은 또한, 항원성 항원 결정기를 공유한다. 97 kDa 밴드는 BVH-3 및 BVH-11의 공통된 영역에 반응하는 모노클로날 항체 H11-1.1-G11를 사용하여 수행한 GAS 또는 GBS 전체 세포에 대한 웨스턴 블럿상에 나타났다. 동일하게, GAS 및 GBS 재조합 BVH-71 단백질은 웨스턴 면역 블럿 분석상에 감지되었다.
도 1은 서열번호 1로 표시되는 BVH-3 유전자의 DNA 서열이다.
도 2는 서열번호 2로 표시되는 BVH-3 단백질의 아미노산 서열이다.
도 3은 서열번호 3으로 표시되는 BVH-11 유전자의 DNA 서열이다.
도 4는 서열번호 4로 표시되는 BVH-11 단백질의 아미노산 서열이다.
도 5는 서열번호 5로 표시되는 BVH-28 유전자의 DNA 서열이다.
도 6은 서열번호 6으로 표시되는 BVH-28 단백질의 아미노산 서열이다.
도 7은 BVH-3의 5' 말단에 상응하는 서열번호 7로 표시되는 BVH-3A 유전자의 DNA 서열이다.
도 8은 서열번호 8로 표시되는 BVH-3A 단백질의 아미노산 서열이다.
도 9는 BVH-3의 3' 말단에 상응하는 서열번호 9로 표시되는 BVH-3B 유전자의 DNA 서열이다.
도 10은 서열번호 10으로 표시되는 BVH-3B 단백질의 아미노산 서열이다.
도 11a, b, c 및 d는 맥벡터 서열 분석 소프트웨어(버전 6.5; MacVector sequence analysis software)의 Clustal W 프로그램을 사용하여 WU2, RX1, JNR. 7/87, SP64, P4241 및 A66 S. 뉴모니애 균주의 BVH-3 ORF(Open Reading Frame)의 예측된 아미노산 서열을 비교한 것이다. 비교 결과 * 로 표시되는 공통 부분, . 로 표시되는 동일 부분 및 유사한 아미노산 잔기가 존재한다.
도 12a, b, c 및 d는 맥벡터 서열 분석 소프트웨어(버전 6.5)의 Clustal W 프로그램을 사용하여 WU2, RX1, JNR. 7/87, SP64, P4241 및 A66 및 SP63 S. 뉴모 니애 균주의 BVH-11 ORF의 예측 아미노산 서열을 비교한 것이다. 비교 결과 * 로 표시되는 공통 부분, . 로 표시되는 동일 부분 및 유사한 아미노산 잔기가 존재한다.
도 13은 다양한 S. 뉴모니애 균주의 BVH-11 단백질의 예상된 아미노산 서열을 비교한 것이다. 동일성(I) 및 유사성(S)의 정도는 맥벡터 서열 분석 소프트웨어(버전 6.5)을 사용하여 결정되었다.
도 14a 및 b는 서열번호 11로 표시되는 완전한 BVH-3 유전자(뉴클레오타이드 1777 내지 4896의 ORF)를 포함하는 DNA 서열이다.
도 15는 서열번호 12로 표시되는 완전한 BVH-11 유전자(뉴클레오타이드 45 내지 2567의 ORF)를 포함하는 DNA 서열이다.
도 16은 서열번호 13으로 표시되는 완전한 BVH-11-2 유전자(뉴클레오타이드 114 내지 2630의 ORF)를 포함하는 DNA 서열이다.
도 17은 서열번호 14로 표시되는 BVH-11-2 단백질의 아미노산 서열이다.
도 18은 서열번호 15로 표시되는 SP63 BVH-3 유전자의 DNA 서열이다.
도 19는 서열번호 16으로 표시되는 SP63 BVH-3 단백질의 아미노산 서열이다.
도 20은 서열번호 55로 표시되는 BVH-3M 단백질의 아미노산 서열이다.
도 21은 서열번호 56으로 표시되는 BVH-3AD 단백질의 아미노산 서열이다.
도 22는 서열번호 57로 표시되는 L-BVH-3-AD 단백질의 아미노산 서열이다.
도 23은 서열번호 58로 표시되는 NEW12 단백질의 아미노산 서열이다.
도 24는 서열번호 59로 표시되는 BVH-3C 단백질의 아미노산 서열이다.
도 25는 서열번호 60으로 표시되는 BVH-11M 단백질의 아미노산 서열이다.
도 26은 서열번호 61로 표시되는 BVH-11A 단백질의 아미노산 서열이다.
도 27은 서열번호 62로 표시되는 BVH-11B(New13으로도 명명) 단백질의 아미노산 서열이다.
도 28은 서열번호 63으로 표시되는 BVH-11C 단백질의 아미노산 서열이다.
도 29는 서열번호 64로 표시되는 New1 단백질의 아미노산 서열이다.
도 30은 서열번호 65로 표시되는 New2 단백질의 아미노산 서열이다.
도 31은 서열번호 66으로 표시되는 New3 단백질의 아미노산 서열이다.
도 32는 서열번호 67로 표시되는 New4 단백질의 아미노산 서열이다.
도 33은 서열번호 68로 표시되는 New5 단백질의 아미노산 서열이다.
도 34는 서열번호 69로 표시되는 New6 단백질의 아미노산 서열이다.
도 35는 서열번호 70로 표시되는 New7 단백질의 아미노산 서열이다.
도 36은 서열번호 71로 표시되는 New8 단백질의 아미노산 서열이다.
도 37은 서열번호 72로 표시되는 New9 단백질의 아미노산 서열이다.
도 38은 서열번호 73으로 표시되는 BVH-11-2M 단백질의 아미노산 서열이다.
도 39는 서열번호 74로 표시되는 New10 단백질의 아미노산 서열이다.
도 40은 서열번호 75로 표시되는 New11 단백질의 아미노산 서열이다.
도 41a 및 b은 서열번호 76으로 표시되는 New12 유전자의 DNA 서열이다.
도 42는 서열번호 77로 표시되는 New14 단백질의 아미노산 서열이다.
도 43은 서열번호 78로 표시되는 New15 단백질의 아미노산 서열이다.
도 44는 서열번호 79로 표시되는 New16 단백질의 아미노산 서열이다.
도 45는 서열번호 80으로 표시되는 GBS BVH-71 유전자의 DNA 서열이다.
도 46은 서열번호 81로 표시되는 GBS BVH-71 단백질의 아미노산 서열이다.
도 47은 서열번호 82로 표시되는 GAS BVH-71 유전자의 DNA 서열이다.
도 48은 서열번호 83으로 표시되는 GAS BVH-71 단백질의 아미노산 서열이다.
<110> SHIRE BIOCHEM INC.
HAMEL, Josee
BRODEUR, Bernard R.
PINEAU, Isabelle
MARTIN, Denis
RIOUX, Clement
CHARLAND, Nathalie
<120> NOVEL STREPTOCOCCUS ANTIGENS
<130> 484112.438PC
<140> PCT/CA1999/01218
<141> 1999-12-20
<150> US 60/113,800
<151> 1998-12-23
<160> 102
<170> FastSEQ for Windows Version 3.0
<210> 1
<211> 3120
<212> DNA
<213> S. pneumoniae
<400> 1
atgaaattta gtaaaaaata tatagcagct ggatcagctg ttatcgtatc cttgagtcta 60
tgtgcctatg cactaaacca gcatcgttcg caggaaaata aggacaataa tcgtgtctct 120
tatgtggatg gcagccagtc aagtcagaaa agtgaaaact tgacaccaga ccaggttagc 180
cagaaagaag gaattcaggc tgagcaaatt gtaatcaaaa ttacagatca gggctatgta 240
acgtcacacg gtgaccacta tcattactat aatgggaaag ttccttatga tgccctcttt 300
agtgaagaac tcttgatgaa ggatccaaac tatcaactta aagacgctga tattgtcaat 360
gaagtcaagg gtggttatat catcaaggtc gatggaaaat attatgtcta cctgaaagat 420
gcagctcatg ctgataatgt tcgaactaaa gatgaaatca atcgtcaaaa acaagaacat 480
gtcaaagata atgagaaggt taactctaat gttgctgtag caaggtctca gggacgatat 540
acgacaaatg atggttatgt ctttaatcca gctgatatta tcgaagatac gggtaatgct 600
tatatcgttc ctcatggagg tcactatcac tacattccca aaagcgattt atctgctagt 660
gaattagcag cagctaaagc acatctggct ggaaaaaata tgcaaccgag tcagttaagc 720
tattcttcaa cagctagtga caataacacg caatctgtag caaaaggatc aactagcaag 780
ccagcaaata aatctgaaaa tctccagagt cttttgaagg aactctatga ttcacctagc 840
gcccaacgtt acagtgaatc agatggcctg gtctttgacc ctgctaagat tatcagtcgt 900
acaccaaatg gagttgcgat tccgcatggc gaccattacc actttattcc ttacagcaag 960
ctttctgctt tagaagaaaa gattgccaga atggtgccta tcagtggaac tggttctaca 1020
gtttctacaa atgcaaaacc taatgaagta gtgtctagtc taggcagtct ttcaagcaat 1080
ccttcttctt taacgacaag taaggagctc tcttcagcat ctgatggtta tatttttaat 1140
ccaaaagata tcgttgaaga aacggctaca gcttatattg taagacatgg tgatcatttc 1200
cattacattc caaaatcaaa tcaaattggg caaccgactc ttccaaacaa tagtctagca 1260
acaccttctc catctcttcc aatcaatcca ggaacttcac atgagaaaca tgaagaagat 1320
ggatacggat ttgatgctaa tcgtattatc gctgaagatg aatcaggttt tgtcatgagt 1380
cacggagacc acaatcatta tttcttcaag aaggacttga cagaagagca aattaaggct 1440
gcgcaaaaac atttagagga agttaaaact agtcataatg gattagattc tttgtcatct 1500
catgaacagg attatccagg taatgccaaa gaaatgaaag atttagataa aaaaatcgaa 1560
gaaaaaattg ctggcattat gaaacaatat ggtgtcaaac gtgaaagtat tgtcgtgaat 1620
aaagaaaaaa atgcgattat ttatccgcat ggagatcacc atcatgcaga tccgattgat 1680
gaacataaac cggttggaat tggtcattct cacagtaact atgaactgtt taaacccgaa 1740
gaaggagttg ctaaaaaaga agggaataaa gtttatactg gagaagaatt aacgaatgtt 1800
gttaatttgt taaaaaatag tacgtttaat aatcaaaact ttactctagc caatggtcaa 1860
aaacgcgttt cttttagttt tccgcctgaa ttggagaaaa aattaggtat caatatgcta 1920
gtaaaattaa taacaccaga tggaaaagta ttggagaaag tatctggtaa agtatttgga 1980
gaaggagtag ggaatattgc aaactttgaa ttagatcaac cttatttacc aggacaaaca 2040
tttaagtata ctatcgcttc aaaagattat ccagaagtaa gttatgatgg tacatttaca 2100
gttccaacct ctttagctta caaaatggcc agtcaaacga ttttctatcc tttccatgca 2160
ggggatactt atttaagagt gaaccctcaa tttgcagtgc ctaaaggaac tgatgcttta 2220
gtcagagtgt ttgatgaatt tcatggaaat gcttatttag aaaataacta taaagttggt 2280
gaaatcaaat taccgattcc gaaattaaac caaggaacaa ccagaacggc cggaaataaa 2340
attcctgtaa ccttcatggc aaatgcttat ttggacaatc aatcgactta tattgtggaa 2400
gtacctatct tggaaaaaga aaatcaaact gataaaccaa gtattctacc acaatttaaa 2460
aggaataaag cacaagaaaa ctcaaaactt gatgaaaagg tagaagaacc aaagactagt 2520
gagaaggtag aaaaagaaaa actttctgaa actgggaata gtactagtaa ttcaacgtta 2580
gaagaagttc ctacagtgga tcctgtacaa gaaaaagtag caaaatttgc tgaaagttat 2640
gggatgaagc tagaaaatgt cttgtttaat atggacggaa caattgaatt atatttacca 2700
tcaggagaag tcattaaaaa gaatatggca gattttacag gagaagcacc tcaaggaaat 2760
ggtgaaaata aaccatctga aaatggaaaa gtatctactg gaacagttga gaaccaacca 2820
acagaaaata aaccagcaga ttctttacca gaggcaccaa acgaaaaacc tgtaaaacca 2880
gaaaactcaa cggataatgg aatgttgaat ccagaaggga atgtggggag tgaccctatg 2940
ttagatccag cattagagga agctccagca gtagatcctg tacaagaaaa attagaaaaa 3000
tttacagcta gttacggatt aggcttagat agtgttatat tcaatatgga tggaacgatt 3060
gaattaagat tgccaagtgg agaagtgata aaaaagaatt tatctgattt catagcgtaa 3120
3120
<210> 2
<211> 1039
<212> PRT
<213> S. pneumoniae
<400> 2
Met Lys Phe Ser Lys Lys Tyr Ile Ala Ala Gly Ser Ala Val Ile Val
1 5 10 15
Ser Leu Ser Leu Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu
20 25 30
Asn Lys Asp Asn Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser
35 40 45
Gln Lys Ser Glu Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly
50 55 60
Ile Gln Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80
Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95
Asp Ala Leu Phe Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110
Leu Lys Asp Ala Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile
115 120 125
Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140
Asp Asn Val Arg Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His
145 150 155 160
Val Lys Asp Asn Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser
165 170 175
Gln Gly Arg Tyr Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp
180 185 190
Ile Ile Glu Asp Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His
195 200 205
Tyr His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala
210 215 220
Ala Lys Ala His Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser
225 230 235 240
Tyr Ser Ser Thr Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly
245 250 255
Ser Thr Ser Lys Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu
260 265 270
Lys Glu Leu Tyr Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp
275 280 285
Gly Leu Val Phe Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly
290 295 300
Val Ala Ile Pro His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys
305 310 315 320
Leu Ser Ala Leu Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly
325 330 335
Thr Gly Ser Thr Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser
340 345 350
Ser Leu Gly Ser Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys
355 360 365
Glu Leu Ser Ser Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile
370 375 380
Val Glu Glu Thr Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe
385 390 395 400
His Tyr Ile Pro Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn
405 410 415
Asn Ser Leu Ala Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr
420 425 430
Ser His Glu Lys His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg
435 440 445
Ile Ile Ala Glu Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His
450 455 460
Asn His Tyr Phe Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala
465 470 475 480
Ala Gln Lys His Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp
485 490 495
Ser Leu Ser Ser His Glu Gln Asp Tyr Pro Gly Asn Ala Lys Glu Met
500 505 510
Lys Asp Leu Asp Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys
515 520 525
Gln Tyr Gly Val Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn
530 535 540
Ala Ile Ile Tyr Pro His Gly Asp His His His Ala Asp Pro Ile Asp
545 550 555 560
Glu His Lys Pro Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu
565 570 575
Phe Lys Pro Glu Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr
580 585 590
Thr Gly Glu Glu Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr
595 600 605
Phe Asn Asn Gln Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser
610 615 620
Phe Ser Phe Pro Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu
625 630 635 640
Val Lys Leu Ile Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly
645 650 655
Lys Val Phe Gly Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp
660 665 670
Gln Pro Tyr Leu Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys
675 680 685
Asp Tyr Pro Glu Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser
690 695 700
Leu Ala Tyr Lys Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala
705 710 715 720
Gly Asp Thr Tyr Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly
725 730 735
Thr Asp Ala Leu Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr
740 745 750
Leu Glu Asn Asn Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys
755 760 765
Leu Asn Gln Gly Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr
770 775 780
Phe Met Ala Asn Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu
785 790 795 800
Val Pro Ile Leu Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu
805 810 815
Pro Gln Phe Lys Arg Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp Glu
820 825 830
Lys Val Glu Glu Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu
835 840 845
Ser Glu Thr Gly Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro
850 855 860
Thr Val Asp Pro Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr
865 870 875 880
Gly Met Lys Leu Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu
885 890 895
Leu Tyr Leu Pro Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe
900 905 910
Thr Gly Glu Ala Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn
915 920 925
Gly Lys Val Ser Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys
930 935 940
Pro Ala Asp Ser Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro
945 950 955 960
Glu Asn Ser Thr Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly
965 970 975
Ser Asp Pro Met Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp
980 985 990
Pro Val Gln Glu Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly
995 1000 1005
Leu Asp Ser Val Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu
1010 1015 1020
Pro Ser Gly Glu Val Ile Lys Lys Asn Leu Ser Asp Phe Ile Ala
1025 1030 1035
<210> 3
<211> 2523
<212> DNA
<213> S. pneumoniae
<220>
<221> CDS
<222> (1)..(2520)
<223> Coding region of BVH-11 gene
<400> 3
atg aaa atc aat aaa aaa tat cta gct ggg tca gta gct aca ctt gtt 48
Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Thr Leu Val
1 5 10 15
tta agt gtc tgt gct tat gaa cta ggt ttg cat caa gct caa act gta 96
Leu Ser Val Cys Ala Tyr Glu Leu Gly Leu His Gln Ala Gln Thr Val
20 25 30
aaa gaa aat aat cgt gtt tcc tat ata gat gga aaa caa gcg acg caa 144
Lys Glu Asn Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln
35 40 45
aaa acg gag aat ttg act cct gat gag gtt agc aag cgt gaa gga atc 192
Lys Thr Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile
50 55 60
aac gcc gaa caa atc gtc atc aag att acg gat caa ggt tat gtg acc 240
Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr
65 70 75 80
tct cat gga gac cat tat cat tac tat aat ggc aag gtc cct tat gat 288
Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp
85 90 95
gcc atc atc agt gaa gag ctc ctc atg aaa gat ccg aat tat cag ttg 336
Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu
100 105 110
aag gat tca gac att gtc aat gaa atc aag ggt ggt tat gtc att aag 384
Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys
115 120 125
gta aac ggt aaa tac tat gtt tac ctt aag gat gca gct cat gcg gat 432
Val Asn Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp
130 135 140
aat gtc cgt aca aaa gaa gaa atc aat cgg caa aaa caa gaa cat agt 480
Asn Val Arg Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser
145 150 155 160
cag cat cgt gaa gga ggg act tca gca aac gat ggt gcg gta gcc ttt 528
Gln His Arg Glu Gly Gly Thr Ser Ala Asn Asp Gly Ala Val Ala Phe
165 170 175
gca cgt tca cag gga cgc tac acc aca gat gat ggt tat atc ttc aat 576
Ala Arg Ser Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn
180 185 190
gca tct gat atc atc gaa gat acg ggc gat gcc tat atc gtt cct cat 624
Ala Ser Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His
195 200 205
gga gat cat tac cat tac att cct aag aat gag tta tca gct agc gag 672
Gly Asp His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu
210 215 220
ttg gct gct gca gaa gcc ttc cta tct ggt cgg gaa aat ctg tca aat 720
Leu Ala Ala Ala Glu Ala Phe Leu Ser Gly Arg Glu Asn Leu Ser Asn
225 230 235 240
tta aga acc tat cgc cga caa aat agc gat aac act cca aga aca aac 768
Leu Arg Thr Tyr Arg Arg Gln Asn Ser Asp Asn Thr Pro Arg Thr Asn
245 250 255
tgg gta cct tct gta agc aat cca gga act aca aat act aac aca agc 816
Trp Val Pro Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser
260 265 270
aac aac agc aac act aac agt caa gca agt caa agt aat gac att gat 864
Asn Asn Ser Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp
275 280 285
agt ctc ttg aaa cag ctc tac aaa ctg cct ttg agt caa cgc cat gta 912
Ser Leu Leu Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val
290 295 300
gaa tct gat ggc ctt att ttc gac cca gcg caa atc aca agt cga acc 960
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
305 310 315 320
gcc aga ggt gta gct gtc cct cat ggt aac cat tac cac ttt atc cct 1008
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
325 330 335
tat gaa caa atg tct gaa ttg gaa aaa cga att gct cgt att att ccc 1056
Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro
340 345 350
ctt cgt tat cgt tca aac cat tgg gta cca gat tca aga cca gaa gaa 1104
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu
355 360 365
cca agt cca caa ccg act cca gaa cct agt cca agt ccg caa cct gca 1152
Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala
370 375 380
cca aat cct caa cca gct cca agc aat cca att gat gag aaa ttg gtc 1200
Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val
385 390 395 400
aaa gaa gct gtt cga aaa gta ggc gat ggt tat gtc ttt gag gag aat 1248
Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn
405 410 415
gga gtt tct cgt tat atc cca gcc aag aat ctt tca gca gaa aca gca 1296
Gly Val Ser Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala
420 425 430
gca ggc att gat agc aaa ctg gcc aag cag gaa agt tta tct cat aag 1344
Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys
435 440 445
cta gga gct aag aaa act gac ctc cca tct agt gat cga gaa ttt tac 1392
Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr
450 455 460
aat aag gct tat gac tta cta gca aga att cac caa gat tta ctt gat 1440
Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp
465 470 475 480
aat aaa ggt cga caa gtt gat ttt gag gct ttg gat aac ctg ttg gaa 1488
Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu
485 490 495
cga ctc aag gat gtc tca agt gat aaa gtc aag tta gtg gat gat att 1536
Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile
500 505 510
ctt gcc ttc tta gct ccg att cgt cat cca gaa cgt tta gga aaa cca 1584
Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro
515 520 525
aat gcg caa att acc tac act gat gat gag att caa gta gcc aag ttg 1632
Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu
530 535 540
gca ggc aag tac aca aca gaa gac ggt tat atc ttt gat cct cgt gat 1680
Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp
545 550 555 560
ata acc agt gat gag ggg gat gcc tat gta act cca cat atg acc cat 1728
Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His
565 570 575
agc cac tgg att aaa aaa gat agt ttg tct gaa gct gag aga gcg gca 1776
Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala
580 585 590
gcc cag gct tat gct aaa gag aaa ggt ttg acc cct cct tcg aca gac 1824
Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp
595 600 605
cat cag gat tca gga aat act gag gca aaa gga gca gaa gct atc tac 1872
His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr
610 615 620
aac cgc gtg aaa gca gct aag aag gtg cca ctt gat cgt atg cct tac 1920
Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr
625 630 635 640
aat ctt caa tat act gta gaa gtc aaa aac ggt agt tta atc ata cct 1968
Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro
645 650 655
cat tat gac cat tac cat aac atc aaa ttt gag tgg ttt gac gaa ggc 2016
His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly
660 665 670
ctt tat gag gca cct aag ggg tat act ctt gag gat ctt ttg gcg act 2064
Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr
675 680 685
gtc aag tac tat gtc gaa cat cca aac gaa cgt ccg cat tca gat aat 2112
Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn
690 695 700
ggt ttt ggt aac gct agc gac cat gtt caa aga aac aaa aat ggt caa 2160
Gly Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln
705 710 715 720
gct gat acc aat caa acg gaa aaa cca agc gag gag aaa cct cag aca 2208
Ala Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr
725 730 735
gaa aaa cct gag gaa gaa acc cct cga gaa gag aaa cca caa agc gag 2256
Glu Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu
740 745 750
aaa cca gag tct cca aaa cca aca gag gaa cca gaa gaa gaa tca cca 2304
Lys Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro
755 760 765
gag gaa tca gaa gaa cct cag gtc gag act gaa aag gtt gaa gaa aaa 2352
Glu Glu Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys
770 775 780
ctg aga gag gct gaa gat tta ctt gga aaa atc cag gat cca att atc 2400
Leu Arg Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile
785 790 795 800
aag tcc aat gcc aaa gag act ctc aca gga tta aaa aat aat tta cta 2448
Lys Ser Asn Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu
805 810 815
ttt ggc acc cag gac aac aat act att atg gca gaa gct gaa aaa cta 2496
Phe Gly Thr Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu
820 825 830
ttg gct tta tta aag gag agt aag taa 2523
Leu Ala Leu Leu Lys Glu Ser Lys
835 840
<210> 4
<211> 840
<212> PRT
<213> S. pneumoniae
<400> 4
Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Thr Leu Val
1 5 10 15
Leu Ser Val Cys Ala Tyr Glu Leu Gly Leu His Gln Ala Gln Thr Val
20 25 30
Lys Glu Asn Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln
35 40 45
Lys Thr Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile
50 55 60
Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr
65 70 75 80
Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp
85 90 95
Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu
100 105 110
Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys
115 120 125
Val Asn Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp
130 135 140
Asn Val Arg Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser
145 150 155 160
Gln His Arg Glu Gly Gly Thr Ser Ala Asn Asp Gly Ala Val Ala Phe
165 170 175
Ala Arg Ser Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn
180 185 190
Ala Ser Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His
195 200 205
Gly Asp His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu
210 215 220
Leu Ala Ala Ala Glu Ala Phe Leu Ser Gly Arg Glu Asn Leu Ser Asn
225 230 235 240
Leu Arg Thr Tyr Arg Arg Gln Asn Ser Asp Asn Thr Pro Arg Thr Asn
245 250 255
Trp Val Pro Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser
260 265 270
Asn Asn Ser Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp
275 280 285
Ser Leu Leu Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val
290 295 300
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
305 310 315 320
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
325 330 335
Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro
340 345 350
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu
355 360 365
Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala
370 375 380
Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val
385 390 395 400
Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn
405 410 415
Gly Val Ser Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala
420 425 430
Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys
435 440 445
Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr
450 455 460
Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp
465 470 475 480
Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu
485 490 495
Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile
500 505 510
Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro
515 520 525
Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu
530 535 540
Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp
545 550 555 560
Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His
565 570 575
Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala
580 585 590
Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp
595 600 605
His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr
610 615 620
Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr
625 630 635 640
Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro
645 650 655
His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly
660 665 670
Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr
675 680 685
Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn
690 695 700
Gly Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln
705 710 715 720
Ala Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr
725 730 735
Glu Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu
740 745 750
Lys Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro
755 760 765
Glu Glu Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys
770 775 780
Leu Arg Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile
785 790 795 800
Lys Ser Asn Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu
805 810 815
Phe Gly Thr Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu
820 825 830
Leu Ala Leu Leu Lys Glu Ser Lys
835 840
<210> 5
<211> 1581
<212> DNA
<213> S. pneumoniae
<220>
<221> CDS
<222> (1)..(1578)
<400> 5
atg gag aat ata gac atg ttt aaa tca aat cat gag cga aga atg cgt 48
Met Glu Asn Ile Asp Met Phe Lys Ser Asn His Glu Arg Arg Met Arg
1 5 10 15
tat tcc att cgt aaa ttt agt gta gga gta gct agc gta gct gtt gcc 96
Tyr Ser Ile Arg Lys Phe Ser Val Gly Val Ala Ser Val Ala Val Ala
20 25 30
agt ctt ttt atg gga agt gtt gta cat gcg aca gag aaa gag gga agt 144
Ser Leu Phe Met Gly Ser Val Val His Ala Thr Glu Lys Glu Gly Ser
35 40 45
acc caa gca gcc act tct ttt aat agg gga aat gga agt cag gca gaa 192
Thr Gln Ala Ala Thr Ser Phe Asn Arg Gly Asn Gly Ser Gln Ala Glu
50 55 60
caa cgt gga gaa ctc gat tta gaa cga gat aag gca atg aaa gcg gtc 240
Gln Arg Gly Glu Leu Asp Leu Glu Arg Asp Lys Ala Met Lys Ala Val
65 70 75 80
agt gaa tat gta gga aaa atg gtg aga gat gcc tat gta aaa tca gat 288
Ser Glu Tyr Val Gly Lys Met Val Arg Asp Ala Tyr Val Lys Ser Asp
85 90 95
aga aaa cga cat aaa aat act gta gct cta gtt aac cag ttg gga aac 336
Arg Lys Arg His Lys Asn Thr Val Ala Leu Val Asn Gln Leu Gly Asn
100 105 110
att aag aac agg tat ttg aat gaa ata gtt cat tca acc tca aaa agc 384
Ile Lys Asn Arg Tyr Leu Asn Glu Ile Val His Ser Thr Ser Lys Ser
115 120 125
caa cta cag gaa ctg atg atg aag agt caa tca gaa gta gat gaa gct 432
Gln Leu Gln Glu Leu Met Met Lys Ser Gln Ser Glu Val Asp Glu Ala
130 135 140
gtg tct aaa ttt gaa aag gac tca ttt tct tcg tca agt tca gga tcc 480
Val Ser Lys Phe Glu Lys Asp Ser Phe Ser Ser Ser Ser Ser Gly Ser
145 150 155 160
tcc act aaa cca gaa act ccg cag ccg gaa aat cca gag cat caa aaa 528
Ser Thr Lys Pro Glu Thr Pro Gln Pro Glu Asn Pro Glu His Gln Lys
165 170 175
cca aca act cca tct ccg gat acc aaa cca agc cct caa cca gaa ggc 576
Pro Thr Thr Pro Ser Pro Asp Thr Lys Pro Ser Pro Gln Pro Glu Gly
180 185 190
aag aaa cca agc gta cca gac att aat cag gaa aaa gaa aaa gct aag 624
Lys Lys Pro Ser Val Pro Asp Ile Asn Gln Glu Lys Glu Lys Ala Lys
195 200 205
ctt gct gta gta acc tac atg agc aag att tta gat gat ata caa aaa 672
Leu Ala Val Val Thr Tyr Met Ser Lys Ile Leu Asp Asp Ile Gln Lys
210 215 220
cat cat ctg cag aaa gaa aaa cat cgt cag att gtt gct ctt att aag 720
His His Leu Gln Lys Glu Lys His Arg Gln Ile Val Ala Leu Ile Lys
225 230 235 240
gag ctt gat gag ctt aaa aag caa gct ctt tct gaa att gat aat gta 768
Glu Leu Asp Glu Leu Lys Lys Gln Ala Leu Ser Glu Ile Asp Asn Val
245 250 255
aat acc aaa gta gaa att gaa aat aca gtc cac aag ata ttt gca gac 816
Asn Thr Lys Val Glu Ile Glu Asn Thr Val His Lys Ile Phe Ala Asp
260 265 270
atg gat gca gtt gtg act aaa ttc aaa aaa ggc tta act cag gac aca 864
Met Asp Ala Val Val Thr Lys Phe Lys Lys Gly Leu Thr Gln Asp Thr
275 280 285
cca aaa gaa cca ggt aac aaa aaa cca tct gct cca aaa cca ggt atg 912
Pro Lys Glu Pro Gly Asn Lys Lys Pro Ser Ala Pro Lys Pro Gly Met
290 295 300
caa cca agt cct caa cca gag gtt aaa ccg cag ctg gaa aaa cca aaa 960
Gln Pro Ser Pro Gln Pro Glu Val Lys Pro Gln Leu Glu Lys Pro Lys
305 310 315 320
cca gag gtt aaa ccg caa cca gaa aaa cca aaa cca gag gtt aaa ccg 1008
Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro
325 330 335
cag ccg gaa aaa cca aaa cca gag gtt aaa ccg cag ccg gaa aaa cca 1056
Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro
340 345 350
aaa cca gag gtt aaa ccg cag ccg gaa aaa cca aaa cca gag gtt aaa 1104
Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys
355 360 365
ccg cag ccg gaa aaa cca aaa cca gag gtt aaa ccg cag ccg gaa aaa 1152
Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys
370 375 380
cca aaa cca gag gtt aaa ccg cag ccg gaa aaa cca aaa cca gag gtt 1200
Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val
385 390 395 400
aaa ccg cag ccg gaa aaa cca aaa cca gag gtt aaa ccg cag ccg gaa 1248
Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu
405 410 415
aaa cca aaa cca gag gtt aaa ccg cag ccg gaa aaa cca aaa cca gag 1296
Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu
420 425 430
gtt aaa ccg caa cca gaa aaa cca aaa cca gag gtt aaa ccg caa cca 1344
Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro
435 440 445
gaa aaa cca aaa cca gat aat agc aag cca caa gca gat gat aag aag 1392
Glu Lys Pro Lys Pro Asp Asn Ser Lys Pro Gln Ala Asp Asp Lys Lys
450 455 460
cca tca act aca aat aat tta agc aag gac aag caa cct tct aac caa 1440
Pro Ser Thr Thr Asn Asn Leu Ser Lys Asp Lys Gln Pro Ser Asn Gln
465 470 475 480
gct tca aca aac gaa aaa gca aca aat aaa ccg aag aag tca ttg cca 1488
Ala Ser Thr Asn Glu Lys Ala Thr Asn Lys Pro Lys Lys Ser Leu Pro
485 490 495
tca act gga tct att tca aat cta gca ctt gaa att gca ggt ctt ctt 1536
Ser Thr Gly Ser Ile Ser Asn Leu Ala Leu Glu Ile Ala Gly Leu Leu
500 505 510
acc ttg gcg ggg gca acc att ctt gct aag aaa aga atg aaa ta 1580
Thr Leu Ala Gly Ala Thr Ile Leu Ala Lys Lys Arg Met Lys
515 520 525
g 1581
<210> 6
<211> 526
<212> PRT
<213> S. pneumoniae
<400> 6
Met Glu Asn Ile Asp Met Phe Lys Ser Asn His Glu Arg Arg Met Arg
1 5 10 15
Tyr Ser Ile Arg Lys Phe Ser Val Gly Val Ala Ser Val Ala Val Ala
20 25 30
Ser Leu Phe Met Gly Ser Val Val His Ala Thr Glu Lys Glu Gly Ser
35 40 45
Thr Gln Ala Ala Thr Ser Phe Asn Arg Gly Asn Gly Ser Gln Ala Glu
50 55 60
Gln Arg Gly Glu Leu Asp Leu Glu Arg Asp Lys Ala Met Lys Ala Val
65 70 75 80
Ser Glu Tyr Val Gly Lys Met Val Arg Asp Ala Tyr Val Lys Ser Asp
85 90 95
Arg Lys Arg His Lys Asn Thr Val Ala Leu Val Asn Gln Leu Gly Asn
100 105 110
Ile Lys Asn Arg Tyr Leu Asn Glu Ile Val His Ser Thr Ser Lys Ser
115 120 125
Gln Leu Gln Glu Leu Met Met Lys Ser Gln Ser Glu Val Asp Glu Ala
130 135 140
Val Ser Lys Phe Glu Lys Asp Ser Phe Ser Ser Ser Ser Ser Gly Ser
145 150 155 160
Ser Thr Lys Pro Glu Thr Pro Gln Pro Glu Asn Pro Glu His Gln Lys
165 170 175
Pro Thr Thr Pro Ser Pro Asp Thr Lys Pro Ser Pro Gln Pro Glu Gly
180 185 190
Lys Lys Pro Ser Val Pro Asp Ile Asn Gln Glu Lys Glu Lys Ala Lys
195 200 205
Leu Ala Val Val Thr Tyr Met Ser Lys Ile Leu Asp Asp Ile Gln Lys
210 215 220
His His Leu Gln Lys Glu Lys His Arg Gln Ile Val Ala Leu Ile Lys
225 230 235 240
Glu Leu Asp Glu Leu Lys Lys Gln Ala Leu Ser Glu Ile Asp Asn Val
245 250 255
Asn Thr Lys Val Glu Ile Glu Asn Thr Val His Lys Ile Phe Ala Asp
260 265 270
Met Asp Ala Val Val Thr Lys Phe Lys Lys Gly Leu Thr Gln Asp Thr
275 280 285
Pro Lys Glu Pro Gly Asn Lys Lys Pro Ser Ala Pro Lys Pro Gly Met
290 295 300
Gln Pro Ser Pro Gln Pro Glu Val Lys Pro Gln Leu Glu Lys Pro Lys
305 310 315 320
Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro
325 330 335
Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro
340 345 350
Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys
355 360 365
Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys
370 375 380
Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val
385 390 395 400
Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu
405 410 415
Lys Pro Lys Pro Glu Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu
420 425 430
Val Lys Pro Gln Pro Glu Lys Pro Lys Pro Glu Val Lys Pro Gln Pro
435 440 445
Glu Lys Pro Lys Pro Asp Asn Ser Lys Pro Gln Ala Asp Asp Lys Lys
450 455 460
Pro Ser Thr Thr Asn Asn Leu Ser Lys Asp Lys Gln Pro Ser Asn Gln
465 470 475 480
Ala Ser Thr Asn Glu Lys Ala Thr Asn Lys Pro Lys Lys Ser Leu Pro
485 490 495
Ser Thr Gly Ser Ile Ser Asn Leu Ala Leu Glu Ile Ala Gly Leu Leu
500 505 510
Thr Leu Ala Gly Ala Thr Ile Leu Ala Lys Lys Arg Met Lys
515 520 525
<210> 7
<211> 1455
<212> DNA
<213> S. pneumoniae
<220>
<221> CDS
<222> (1)..(1452)
<400> 7
atg aaa ttt agt aaa aaa tat ata gca gct gga tca gct gtt atc gta 48
Met Lys Phe Ser Lys Lys Tyr Ile Ala Ala Gly Ser Ala Val Ile Val
1 5 10 15
tcc ttg agt cta tgt gcc tat gca cta aac cag cat cgt tcg cag gaa 96
Ser Leu Ser Leu Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu
20 25 30
aat aag gac aat aat cgt gtc tct tat gtg gat ggc agc cag tca agt 144
Asn Lys Asp Asn Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser
35 40 45
cag aaa agt gaa aac ttg aca cca gac cag gtt agc cag aaa gaa gga 192
Gln Lys Ser Glu Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly
50 55 60
att cag gct gag caa att gta atc aaa att aca gat cag ggc tat gta 240
Ile Gln Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80
acg tca cac ggt gac cac tat cat tac tat aat ggg aaa gtt cct tat 288
Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95
gat gcc ctc ttt agt gaa gaa ctc ttg atg aag gat cca aac tat caa 336
Asp Ala Leu Phe Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110
ctt aaa gac gct gat att gtc aat gaa gtc aag ggt ggt tat atc atc 384
Leu Lys Asp Ala Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile
115 120 125
aag gtc gat gga aaa tat tat gtc tac ctg aaa gat gca gct cat gct 432
Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140
gat aat gtt cga act aaa gat gaa atc aat cgt caa aaa caa gaa cat 480
Asp Asn Val Arg Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His
145 150 155 160
gtc aaa gat aat gag aag gtt aac tct aat gtt gct gta gca agg tct 528
Val Lys Asp Asn Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser
165 170 175
cag gga cga tat acg aca aat gat ggt tat gtc ttt aat cca gct gat 576
Gln Gly Arg Tyr Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp
180 185 190
att atc gaa gat acg ggt aat gct tat atc gtt cct cat gga ggt cac 624
Ile Ile Glu Asp Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His
195 200 205
tat cac tac att ccc aaa agc gat tta tct gct agt gaa tta gca gca 672
Tyr His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala
210 215 220
gct aaa gca cat ctg gct gga aaa aat atg caa ccg agt cag tta agc 720
Ala Lys Ala His Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser
225 230 235 240
tat tct tca aca gct agt gac aat aac acg caa tct gta gca aaa gga 768
Tyr Ser Ser Thr Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly
245 250 255
tca act agc aag cca gca aat aaa tct gaa aat ctc cag agt ctt ttg 816
Ser Thr Ser Lys Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu
260 265 270
aag gaa ctc tat gat tca cct agc gcc caa cgt tac agt gaa tca gat 864
Lys Glu Leu Tyr Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp
275 280 285
ggc ctg gtc ttt gac cct gct aag att atc agt cgt aca cca aat gga 912
Gly Leu Val Phe Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly
290 295 300
gtt gcg att ccg cat ggc gac cat tac cac ttt att cct tac agc aag 960
Val Ala Ile Pro His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys
305 310 315 320
ctt tct gct tta gaa gaa aag att gcc aga atg gtg cct atc agt gga 1008
Leu Ser Ala Leu Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly
325 330 335
act ggt tct aca gtt tct aca aat gca aaa cct aat gaa gta gtg tct 1056
Thr Gly Ser Thr Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser
340 345 350
agt cta ggc agt ctt tca agc aat cct tct tct tta acg aca agt aag 1104
Ser Leu Gly Ser Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys
355 360 365
gag ctc tct tca gca tct gat ggt tat att ttt aat cca aaa gat atc 1152
Glu Leu Ser Ser Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile
370 375 380
gtt gaa gaa acg gct aca gct tat att gta aga cat ggt gat cat ttc 1200
Val Glu Glu Thr Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe
385 390 395 400
cat tac att cca aaa tca aat caa att ggg caa ccg act ctt cca aac 1248
His Tyr Ile Pro Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn
405 410 415
aat agt cta gca aca cct tct cca tct ctt cca atc aat cca gga act 1296
Asn Ser Leu Ala Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr
420 425 430
tca cat gag aaa cat gaa gaa gat gga tac gga ttt gat gct aat cgt 1344
Ser His Glu Lys His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg
435 440 445
att atc gct gaa gat gaa tca ggt ttt gtc atg agt cac gga gac cac 1392
Ile Ile Ala Glu Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His
450 455 460
aat cat tat ttc ttc aag aag gac ttg aca gaa gag caa att aag gtg 1440
Asn His Tyr Phe Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Val
465 470 475 480
cgc aaa aac att tag 1455
Arg Lys Asn Ile
<210> 8
<211> 484
<212> PRT
<213> S. pneumoniae
<400> 8
Met Lys Phe Ser Lys Lys Tyr Ile Ala Ala Gly Ser Ala Val Ile Val
1 5 10 15
Ser Leu Ser Leu Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu
20 25 30
Asn Lys Asp Asn Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser
35 40 45
Gln Lys Ser Glu Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly
50 55 60
Ile Gln Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80
Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95
Asp Ala Leu Phe Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110
Leu Lys Asp Ala Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile
115 120 125
Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140
Asp Asn Val Arg Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His
145 150 155 160
Val Lys Asp Asn Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser
165 170 175
Gln Gly Arg Tyr Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp
180 185 190
Ile Ile Glu Asp Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His
195 200 205
Tyr His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala
210 215 220
Ala Lys Ala His Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser
225 230 235 240
Tyr Ser Ser Thr Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly
245 250 255
Ser Thr Ser Lys Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu
260 265 270
Lys Glu Leu Tyr Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp
275 280 285
Gly Leu Val Phe Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly
290 295 300
Val Ala Ile Pro His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys
305 310 315 320
Leu Ser Ala Leu Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly
325 330 335
Thr Gly Ser Thr Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser
340 345 350
Ser Leu Gly Ser Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys
355 360 365
Glu Leu Ser Ser Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile
370 375 380
Val Glu Glu Thr Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe
385 390 395 400
His Tyr Ile Pro Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn
405 410 415
Asn Ser Leu Ala Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr
420 425 430
Ser His Glu Lys His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg
435 440 445
Ile Ile Ala Glu Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His
450 455 460
Asn His Tyr Phe Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Val
465 470 475 480
Arg Lys Asn Ile
<210> 9
<211> 1587
<212> DNA
<213> S pneumoniae
<220>
<221> CDS
<222> (1)..(1584)
<400> 9
atg aaa gat tta gat aaa aaa atc gaa gaa aaa att gct ggc att atg 48
Met Lys Asp Leu Asp Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met
1 5 10 15
aaa caa tat ggt gtc aaa cgt gaa agt att gtc gtg aat aaa gaa aaa 96
Lys Gln Tyr Gly Val Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys
20 25 30
aat gcg att att tat ccg cat gga gat cac cat cat gca gat ccg att 144
Asn Ala Ile Ile Tyr Pro His Gly Asp His His His Ala Asp Pro Ile
35 40 45
gat gaa cat aaa ccg gtt gga att ggt cat tct cac agt aac tat gaa 192
Asp Glu His Lys Pro Val Gly Ile Gly His Ser His Ser Asn Tyr Glu
50 55 60
ctg ttt aaa ccc gaa gaa gga gtt gct aaa aaa gaa ggg aat aaa gtt 240
Leu Phe Lys Pro Glu Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val
65 70 75 80
tat act gga gaa gaa tta acg aat gtt gtt aat ttg tta aaa aat agt 288
Tyr Thr Gly Glu Glu Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser
85 90 95
acg ttt aat aat caa aac ttt act cta gcc aat ggt caa aaa cgc gtt 336
Thr Phe Asn Asn Gln Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val
100 105 110
tct ttt agt ttt ccg cct gaa ttg gag aaa aaa tta ggt atc aat atg 384
Ser Phe Ser Phe Pro Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met
115 120 125
cta gta aaa tta ata aca cca gat gga aaa gta ttg gag aaa gta tct 432
Leu Val Lys Leu Ile Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser
130 135 140
ggt aaa gta ttt gga gaa gga gta ggg aat att gca aac ttt gaa tta 480
Gly Lys Val Phe Gly Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu
145 150 155 160
gat caa cct tat tta cca gga caa aca ttt aag tat act atc gct tca 528
Asp Gln Pro Tyr Leu Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser
165 170 175
aaa gat tat cca gaa gta agt tat gat ggt aca ttt aca gtt cca acc 576
Lys Asp Tyr Pro Glu Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr
180 185 190
tct tta gct tac aaa atg gcc agt caa acg att ttc tat cct ttc cat 624
Ser Leu Ala Tyr Lys Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His
195 200 205
gca ggg gat act tat tta aga gtg aac cct caa ttt gca gtg cct aaa 672
Ala Gly Asp Thr Tyr Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys
210 215 220
gga act gat gct tta gtc aga gtg ttt gat gaa ttt cat gga aat gct 720
Gly Thr Asp Ala Leu Val Arg Val Phe Asp Glu Phe His Gly Asn Ala
225 230 235 240
tat tta gaa aat aac tat aaa gtt ggt gaa atc aaa tta ccg att ccg 768
Tyr Leu Glu Asn Asn Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro
245 250 255
aaa tta aac caa gga aca acc aga acg gcc gga aat aaa att cct gta 816
Lys Leu Asn Gln Gly Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val
260 265 270
acc ttc atg gca aat gct tat ttg gac aat caa tcg act tat att gtg 864
Thr Phe Met Ala Asn Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val
275 280 285
gaa gta cct atc ttg gaa aaa gaa aat caa act gat aaa cca agt att 912
Glu Val Pro Ile Leu Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile
290 295 300
cta cca caa ttt aaa agg aat aaa gca caa gaa aac tca aaa ctt gat 960
Leu Pro Gln Phe Lys Arg Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp
305 310 315 320
gaa aag gta gaa gaa cca aag act agt gag aag gta gaa aaa gaa aaa 1008
Glu Lys Val Glu Glu Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys
325 330 335
ctt tct gaa act ggg aat agt act agt aat tca acg tta gaa gaa gtt 1056
Leu Ser Glu Thr Gly Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val
340 345 350
cct aca gtg gat cct gta caa gaa aaa gta gca aaa ttt gct gaa agt 1104
Pro Thr Val Asp Pro Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser
355 360 365
tat ggg atg aag cta gaa aat gtc ttg ttt aat atg gac gga aca att 1152
Tyr Gly Met Lys Leu Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile
370 375 380
gaa tta tat tta cca tca gga gaa gtc att aaa aag aat atg gca gat 1200
Glu Leu Tyr Leu Pro Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp
385 390 395 400
ttt aca gga gaa gca cct caa gga aat ggt gaa aat aaa cca tct gaa 1248
Phe Thr Gly Glu Ala Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu
405 410 415
aat gga aaa gta tct act gga aca gtt gag aac caa cca aca gaa aat 1296
Asn Gly Lys Val Ser Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn
420 425 430
aaa cca gca gat tct tta cca gag gca cca aac gaa aaa cct gta aaa 1344
Lys Pro Ala Asp Ser Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys
435 440 445
cca gaa aac tca acg gat aat gga atg ttg aat cca gaa ggg aat gtg 1392
Pro Glu Asn Ser Thr Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val
450 455 460
ggg agt gac cct atg tta gat cca gca tta gag gaa gct cca gca gta 1440
Gly Ser Asp Pro Met Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val
465 470 475 480
gat cct gta caa gaa aaa tta gaa aaa ttt aca gct agt tac gga tta 1488
Asp Pro Val Gln Glu Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu
485 490 495
ggc tta gat agt gtt ata ttc aat atg gat gga acg att gaa tta aga 1536
Gly Leu Asp Ser Val Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg
500 505 510
ttg cca agt gga gaa gtg ata aaa aag aat tta tct gat ttc ata gcg 1584
Leu Pro Ser Gly Glu Val Ile Lys Lys Asn Leu Ser Asp Phe Ile Ala
515 520 525
taa 1587
<210> 10
<211> 528
<212> PRT
<213> S pneumoniae
<400> 10
Met Lys Asp Leu Asp Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met
1 5 10 15
Lys Gln Tyr Gly Val Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys
20 25 30
Asn Ala Ile Ile Tyr Pro His Gly Asp His His His Ala Asp Pro Ile
35 40 45
Asp Glu His Lys Pro Val Gly Ile Gly His Ser His Ser Asn Tyr Glu
50 55 60
Leu Phe Lys Pro Glu Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val
65 70 75 80
Tyr Thr Gly Glu Glu Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser
85 90 95
Thr Phe Asn Asn Gln Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val
100 105 110
Ser Phe Ser Phe Pro Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met
115 120 125
Leu Val Lys Leu Ile Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser
130 135 140
Gly Lys Val Phe Gly Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu
145 150 155 160
Asp Gln Pro Tyr Leu Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser
165 170 175
Lys Asp Tyr Pro Glu Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr
180 185 190
Ser Leu Ala Tyr Lys Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His
195 200 205
Ala Gly Asp Thr Tyr Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys
210 215 220
Gly Thr Asp Ala Leu Val Arg Val Phe Asp Glu Phe His Gly Asn Ala
225 230 235 240
Tyr Leu Glu Asn Asn Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro
245 250 255
Lys Leu Asn Gln Gly Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val
260 265 270
Thr Phe Met Ala Asn Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val
275 280 285
Glu Val Pro Ile Leu Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile
290 295 300
Leu Pro Gln Phe Lys Arg Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp
305 310 315 320
Glu Lys Val Glu Glu Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys
325 330 335
Leu Ser Glu Thr Gly Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val
340 345 350
Pro Thr Val Asp Pro Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser
355 360 365
Tyr Gly Met Lys Leu Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile
370 375 380
Glu Leu Tyr Leu Pro Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp
385 390 395 400
Phe Thr Gly Glu Ala Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu
405 410 415
Asn Gly Lys Val Ser Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn
420 425 430
Lys Pro Ala Asp Ser Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys
435 440 445
Pro Glu Asn Ser Thr Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val
450 455 460
Gly Ser Asp Pro Met Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val
465 470 475 480
Asp Pro Val Gln Glu Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu
485 490 495
Gly Leu Asp Ser Val Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg
500 505 510
Leu Pro Ser Gly Glu Val Ile Lys Lys Asn Leu Ser Asp Phe Ile Ala
515 520 525
<210> 11
<211> 5048
<212> DNA
<213> S. pneumoniae
<400> 11
aattccttgt cgggtaagtt ccgacccgca cgaaaggcgt aatgatttgg gcactgtctc 60
aacgagagac tcggtgaaat tttagtacct gtgaagatgc aggttacccg cgacaggacg 120
gaaagacccc atggagcttt actgcagttt gatattgagt gtctgtacca catgtacagg 180
ataggtagga gtctaagaga tcgggacgcc agtttcgaag gagacgctgt tgggatacta 240
cccttgtgtt atggccactc taacccagat aggtgatccc tatcggagac agtgtctgac 300
gggcagtttg actggggcgg tcgcctccta aaaggtaacg gaggcgccca aaggttccct 360
cagaatggtt ggaaatcatt cgcagagtgt aaaggtataa gggagcttga ctgcgagagc 420
tacaactcga gcagggacga aagtcgggct tagtgatccg gtggttccgt atggaagggc 480
catcgctcaa cggataaaag ctaccctggg gataacaggc ttatctcccc caagagttca 540
catcgacggg gaggtttggc acctcgatgt cggctcgtcg catcctgggg ctgtagtcgg 600
tcccaagggt tgggctgttc gcccattaaa gcggcacgcg agctgggttc agaacgtcgt 660
gagacagttc ggtccctatc cgtcgcgggc gtaggaaatt tgagaggatc tgctcctagt 720
acgagaggac cagagtggac ttaccgctgg tgtaccagtt gtcttgccaa aggcatcgct 780
gggtagctat gtagggaagg gataaacgct gaaagcatct aagtgtgaaa cccacctcaa 840
gatgagattt cccatgatta tatatcagta agagccctga gagatgatca ggtagatagg 900
ttagaagtgg aagtgtggcg acacatgtag cggactaata ctaatagctc gaggacttat 960
ccaaagtaac tgagaatatg aaagcgaacg gttttcttaa attgaataga tattcaattt 1020
tgagtaggta ttactcagag ttaagtgacg atagcctagg agatacacct gtacccatgc 1080
cgaacacaga agttaagccc tagaacgccg gaagtagttg ggggttgccc cctgtgagat 1140
agggaagtcg cttagctcta gggagtttag ctcagctggg agagcatctg ccttacaagc 1200
agagggtcag cggttcgatc ccgttaactc ccaaaggtcc cgtagtgtag cggttatcac 1260
gtcgccctgt cacggcgaag atcgcgggtt cgattcccgt cgggaccgtt taaggtaacg 1320
caagttattt tagactcgtt agctcagttg gtagagcaat tgacttttaa tcaatgggtc 1380
actggttcga gcccagtacg ggtcatatat gcgggtttgg cggaattcta atctctttga 1440
aatcatcttc tctcactttc caaaactcta ttacctctta ttataccaca tttcaatctt 1500
caacttccca gtaatataag cacctctggc gaaagaagtt tcaatgtcct aaagtaataa 1560
gtgaatccaa ttcaggaact ccaagaacaa aagaaacatc tggtgtcaca agtattggat 1620
ggcacagagt cacgtggtag tctgacccta gcagaaattt taaatagtaa actatttact 1680
ggttaattaa atggttaaat aaccggttta gaaaactatt taataaagta aaagaagttg 1740
agaaaaaact tcatcattta ttgaaatgag ggatttatga aatttagtaa aaaatatata 1800
gcagctggat cagctgttat cgtatccttg agtctatgtg cctatgcact aaaccagcat 1860
cgttcgcagg aaaataagga caataatcgt gtctcttatg tggatggcag ccagtcaagt 1920
cagaaaagtg aaaacttgac accagaccag gttagccaga aagaaggaat tcaggctgag 1980
caaattgtaa tcaaaattac agatcagggc tatgtaacgt cacacggtga ccactatcat 2040
tactataatg ggaaagttcc ttatgatgcc ctctttagtg aagaactctt gatgaaggat 2100
ccaaactatc aacttaaaga cgctgatatt gtcaatgaag tcaagggtgg ttatatcatc 2160
aaggtcgatg gaaaatatta tgtctacctg aaagatgcag ctcatgctga taatgttcga 2220
actaaagatg aaatcaatcg tcaaaaacaa gaacatgtca aagataatga gaaggttaac 2280
tctaatgttg ctgtagcaag gtctcaggga cgatatacga caaatgatgg ttatgtcttt 2340
aatccagctg atattatcga agatacgggt aatgcttata tcgttcctca tggaggtcac 2400
tatcactaca ttcccaaaag cgatttatct gctagtgaat tagcagcagc taaagcacat 2460
ctggctggaa aaaatatgca accgagtcag ttaagctatt cttcaacagc tagtgacaat 2520
aacacgcaat ctgtagcaaa aggatcaact agcaagccag caaataaatc tgaaaatctc 2580
cagagtcttt tgaaggaact ctatgattca cctagcgccc aacgttacag tgaatcagat 2640
ggcctggtct ttgaccctgc taagattatc agtcgtacac caaatggagt tgcgattccg 2700
catggcgacc attaccactt tattccttac agcaagcttt ctgctttaga agaaaagatt 2760
gccagaatgg tgcctatcag tggaactggt tctacagttt ctacaaatgc aaaacctaat 2820
gaagtagtgt ctagtctagg cagtctttca agcaatcctt cttctttaac gacaagtaag 2880
gagctctctt cagcatctga tggttatatt tttaatccaa aagatatcgt tgaagaaacg 2940
gctacagctt atattgtaag acatggtgat catttccatt acattccaaa atcaaatcaa 3000
attgggcaac cgactcttcc aaacaatagt ctagcaacac cttctccatc tcttccaatc 3060
aatccaggaa cttcacatga gaaacatgaa gaagatggat acggatttga tgctaatcgt 3120
attatcgctg aagatgaatc aggttttgtc atgagtcacg gagaccacaa tcattatttc 3180
ttcaagaagg acttgacaga agagcaaatt aaggctgcgc aaaaacattt agaggaagtt 3240
aaaactagtc ataatggatt agattctttg tcatctcatg aacaggatta tccaggtaat 3300
gccaaagaaa tgaaagattt agataaaaaa atcgaagaaa aaattgctgg cattatgaaa 3360
caatatggtg tcaaacgtga aagtattgtc gtgaataaag aaaaaaatgc gattatttat 3420
ccgcatggag atcaccatca tgcagatccg attgatgaac ataaaccggt tggaattggt 3480
cattctcaca gtaactatga actgtttaaa cccgaagaag gagttgctaa aaaagaaggg 3540
aataaagttt atactggaga agaattaacg aatgttgtta atttgttaaa aaatagtacg 3600
tttaataatc aaaactttac tctagccaat ggtcaaaaac gcgtttcttt tagttttccg 3660
cctgaattgg agaaaaaatt aggtatcaat atgctagtaa aattaataac accagatgga 3720
aaagtattgg agaaagtatc tggtaaagta tttggagaag gagtagggaa tattgcaaac 3780
tttgaattag atcaacctta tttaccagga caaacattta agtatactat cgcttcaaaa 3840
gattatccag aagtaagtta tgatggtaca tttacagttc caacctcttt agcttacaaa 3900
atggccagtc aaacgatttt ctatcctttc catgcagggg atacttattt aagagtgaac 3960
cctcaatttg cagtgcctaa aggaactgat gctttagtca gagtgtttga tgaatttcat 4020
ggaaatgctt atttagaaaa taactataaa gttggtgaaa tcaaattacc gattccgaaa 4080
ttaaaccaag gaacaaccag aacggccgga aataaaattc ctgtaacctt catggcaaat 4140
gcttatttgg acaatcaatc gacttatatt gtggaagtac ctatcttgga aaaagaaaat 4200
caaactgata aaccaagtat tctaccacaa tttaaaagga ataaagcaca agaaaactca 4260
aaacttgatg aaaaggtaga agaaccaaag actagtgaga aggtagaaaa agaaaaactt 4320
tctgaaactg ggaatagtac tagtaattca acgttagaag aagttcctac agtggatcct 4380
gtacaagaaa aagtagcaaa atttgctgaa agttatggga tgaagctaga aaatgtcttg 4440
tttaatatgg acggaacaat tgaattatat ttaccatcag gagaagtcat taaaaagaat 4500
atggcagatt ttacaggaga agcacctcaa ggaaatggtg aaaataaacc atctgaaaat 4560
ggaaaagtat ctactggaac agttgagaac caaccaacag aaaataaacc agcagattct 4620
ttaccagagg caccaaacga aaaacctgta aaaccagaaa actcaacgga taatggaatg 4680
ttgaatccag aagggaatgt ggggagtgac cctatgttag atccagcatt agaggaagct 4740
ccagcagtag atcctgtaca agaaaaatta gaaaaattta cagctagtta cggattaggc 4800
ttagatagtg ttatattcaa tatggatgga acgattgaat taagattgcc aagtggagaa 4860
gtgataaaaa agaatttatc tgatttcata gcgtaaggaa tagcagtaga aaaagtctga 4920
atcaaaaatg aagttctctc aaaagttaga aataaaactc tgactttggg agaatttcat 4980
tttattatta atatataaaa tttcttgaca tacaacttaa aaagaggtgg aatatttact 5040
agttaatt 5048
<210> 12
<211> 2647
<212> DNA
<213> S. pneumoniae
<400> 12
cagagatctt agtgaatcaa atatacttaa gaaaagagga aagaatgaaa atcaataaaa 60
aatatctagc tgggtcagta gctacacttg ttttaagtgt ctgtgcttat gaactaggtt 120
tgcatcaagc tcaaactgta aaagaaaata atcgtgtttc ctatatagat ggaaaacaag 180
cgacgcaaaa aacggagaat ttgactcctg atgaggttag caagcgtgaa ggaatcaacg 240
ccgaacaaat cgtcatcaag attacggatc aaggttatgt gacctctcat ggagaccatt 300
atcattacta taatggcaag gtcccttatg atgccatcat cagtgaagag ctcctcatga 360
aagatccgaa ttatcagttg aaggattcag acattgtcaa tgaaatcaag ggtggttatg 420
tcattaaggt aaacggtaaa tactatgttt accttaagga tgcagctcat gcggataatg 480
tccgtacaaa agaagaaatc aatcggcaaa aacaagaaca tagtcagcat cgtgaaggag 540
ggacttcagc aaacgatggt gcggtagcct ttgcacgttc acagggacgc tacaccacag 600
atgatggtta tatcttcaat gcatctgata tcatcgaaga tacgggcgat gcctatatcg 660
ttcctcatgg agatcattac cattacattc ctaagaatga gttatcagct agcgagttgg 720
ctgctgcaga agccttccta tctggtcggg aaaatctgtc aaatttaaga acctatcgcc 780
gacaaaatag cgataacact ccaagaacaa actgggtacc ttctgtaagc aatccaggaa 840
ctacaaatac taacacaagc aacaacagca acactaacag tcaagcaagt caaagtaatg 900
acattgatag tctcttgaaa cagctctaca aactgccttt gagtcaacgc catgtagaat 960
ctgatggcct tattttcgac ccagcgcaaa tcacaagtcg aaccgccaga ggtgtagctg 1020
tccctcatgg taaccattac cactttatcc cttatgaaca aatgtctgaa ttggaaaaac 1080
gaattgctcg tattattccc cttcgttatc gttcaaacca ttgggtacca gattcaagac 1140
cagaagaacc aagtccacaa ccgactccag aacctagtcc aagtccgcaa cctgcaccaa 1200
atcctcaacc agctccaagc aatccaattg atgagaaatt ggtcaaagaa gctgttcgaa 1260
aagtaggcga tggttatgtc tttgaggaga atggagtttc tcgttatatc ccagccaaga 1320
atctttcagc agaaacagca gcaggcattg atagcaaact ggccaagcag gaaagtttat 1380
ctcataagct aggagctaag aaaactgacc tcccatctag tgatcgagaa ttttacaata 1440
aggcttatga cttactagca agaattcacc aagatttact tgataataaa ggtcgacaag 1500
ttgattttga ggctttggat aacctgttgg aacgactcaa ggatgtctca agtgataaag 1560
tcaagttagt ggatgatatt cttgccttct tagctccgat tcgtcatcca gaacgtttag 1620
gaaaaccaaa tgcgcaaatt acctacactg atgatgagat tcaagtagcc aagttggcag 1680
gcaagtacac aacagaagac ggttatatct ttgatcctcg tgatataacc agtgatgagg 1740
gggatgccta tgtaactcca catatgaccc atagccactg gattaaaaaa gatagtttgt 1800
ctgaagctga gagagcggca gcccaggctt atgctaaaga gaaaggtttg acccctcctt 1860
cgacagacca tcaggattca ggaaatactg aggcaaaagg agcagaagct atctacaacc 1920
gcgtgaaagc agctaagaag gtgccacttg atcgtatgcc ttacaatctt caatatactg 1980
tagaagtcaa aaacggtagt ttaatcatac ctcattatga ccattaccat aacatcaaat 2040
ttgagtggtt tgacgaaggc ctttatgagg cacctaaggg gtatactctt gaggatcttt 2100
tggcgactgt caagtactat gtcgaacatc caaacgaacg tccgcattca gataatggtt 2160
ttggtaacgc tagcgaccat gttcaaagaa acaaaaatgg tcaagctgat accaatcaaa 2220
cggaaaaacc aagcgaggag aaacctcaga cagaaaaacc tgaggaagaa acccctcgag 2280
aagagaaacc acaaagcgag aaaccagagt ctccaaaacc aacagaggaa ccagaagaag 2340
aatcaccaga ggaatcagaa gaacctcagg tcgagactga aaaggttgaa gaaaaactga 2400
gagaggctga agatttactt ggaaaaatcc aggatccaat tatcaagtcc aatgccaaag 2460
agactctcac aggattaaaa aataatttac tatttggcac ccaggacaac aatactatta 2520
tggcagaagc tgaaaaacta ttggctttat taaaggagag taagtaaagg tagcagcatt 2580
ttctaactcc taaaaacagg ataggagaac gggaaaacga aaaatgagag cagaatgtga 2640
gttctag 2647
<210> 13
<211> 2639
<212> DNA
<213> S. pneumoniae
<220>
<221> CDS
<222> (114)..(2627)
<400> 13
gggtcttaaa actctgaatc ctttagaggc agacccacaa aatgacaaga cctatttaga 60
aaatctggaa gaaaatatga gtgttctagc agaagaatta aagtgaggaa aga 113
atg aaa atc aat aaa aaa tat cta gca ggt tca gtg gca gtc ctt gcc 161
Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Val Leu Ala
1 5 10 15
cta agt gtt tgt tcc tat gaa ctt ggt cgt cac caa gct ggt cag gtt 209
Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val
20 25 30
aag aaa gag tct aat cga gtt tct tat ata gat ggt gat cag gct ggt 257
Lys Lys Glu Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly
35 40 45
caa aag gca gaa aat ttg aca cca gat gaa gtc agt aag aga gag ggg 305
Gln Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly
50 55 60
atc aac gcc gaa caa att gtt atc aag att acg gat caa ggt tat gtg 353
Ile Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80
acc tct cat gga gac cat tat cat tac tat aat ggc aag gtt cct tat 401
Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95
gat gcc atc atc agt gaa gaa ctt ctc atg aaa gat ccg aat tat cag 449
Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110
ttg aag gat tca gac att gtc aat gaa atc aag ggt ggc tat gtg att 497
Leu Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile
115 120 125
aag gta gac gga aaa tac tat gtt tac ctt aaa gat gcg gcc cat gcg 545
Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140
gac aat att cgg aca aaa gaa gag att aaa cgt cag aag cag gaa cac 593
Asp Asn Ile Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His
145 150 155 160
agt cat aat cat aac tca aga gca gat aat gct gtt gct gca gcc aga 641
Ser His Asn His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg
165 170 175
gcc caa gga cgt tat aca acg gat gat ggg tat atc ttc aat gca tct 689
Ala Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser
180 185 190
gat atc att gag gac acg ggt gat gct tat atc gtt cct cac ggc gac 737
Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp
195 200 205
cat tac cat tac att cct aag aat gag tta tca gct agc gag tta gct 785
His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala
210 215 220
gct gca gaa gcc tat tgg aat ggg aag cag gga tct cgt cct tct tca 833
Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser
225 230 235 240
agt tct agt tat aat gca aat cca gtt caa cca aga ttg tca gag aac 881
Ser Ser Ser Tyr Asn Ala Asn Pro Val Gln Pro Arg Leu Ser Glu Asn
245 250 255
cac aat ctg act gtc act cca act tat cat caa aat caa ggg gaa aac 929
His Asn Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn
260 265 270
att tca agc ctt tta cgt gaa ttg tat gct aaa ccc tta tca gaa cgc 977
Ile Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg
275 280 285
cat gta gaa tct gat ggc ctt att ttc gac cca gcg caa atc aca agt 1025
His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser
290 295 300
cga acc gcc aga ggt gta gct gtc cct cat ggt aac cat tac cac ttt 1073
Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe
305 310 315 320
atc cct tat gaa caa atg tct gaa ttg gaa aaa cga att gct cgt att 1121
Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile
325 330 335
att ccc ctt cgt tat cgt tca aac cat tgg gta cca gat tca aga cca 1169
Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro
340 345 350
gaa caa cca agt cca caa tcg act ccg gaa cct agt cca agt ctg caa 1217
Glu Gln Pro Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Leu Gln
355 360 365
cct gca cca aat cct caa cca gct cca agc aat cca att gat gag aaa 1265
Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys
370 375 380
ttg gtc aaa gaa gct gtt cga aaa gta ggc gat ggt tat gtc ttt gag 1313
Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu
385 390 395 400
gag aat gga gtt tct cgt tat atc cca gcc aag gat ctt tca gca gaa 1361
Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu
405 410 415
aca gca gca ggc att gat agc aaa ctg gcc aag cag gaa agt tta tct 1409
Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser
420 425 430
cat aag cta gga gct aag aaa act gac ctc cca tct agt gat cga gaa 1457
His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu
435 440 445
ttt tac aat aag gct tat gac tta cta gca aga att cac caa gat tta 1505
Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu
450 455 460
ctt gat aat aaa ggt cga caa gtt gat ttt gag gtt ttg gat aac ctg 1553
Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Val Leu Asp Asn Leu
465 470 475 480
ttg gaa cga ctc aag gat gtc tca agt gat aaa gtc aag tta gtg gat 1601
Leu Glu Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp
485 490 495
gat att ctt gcc ttc tta gct ccg att cgt cat cca gaa cgt tta gga 1649
Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly
500 505 510
aaa cca aat gcg caa att acc tac act gat gat gag att caa gta gcc 1697
Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala
515 520 525
aag ttg gca ggc aag tac aca aca gaa gac ggt tat atc ttt gat cct 1745
Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro
530 535 540
cgt gat ata acc agt gat gag ggg gat gcc tat gta act cca cat atg 1793
Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met
545 550 555 560
acc cat agc cac tgg att aaa aaa gat agt ttg tct gaa gct gag aga 1841
Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg
565 570 575
gcg gca gcc cag gct tat gct aaa gag aaa ggt ttg acc cct cct tcg 1889
Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser
580 585 590
aca gac cac cag gat tca gga aat act gag gca aaa gga gca gaa gct 1937
Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala
595 600 605
atc tac aac cgc gtg aaa gca gct aag aag gtg cca ctt gat cgt atg 1985
Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met
610 615 620
cct tac aat ctt caa tat act gta gaa gtc aaa aac ggt agt tta atc 2033
Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile
625 630 635 640
ata cct cat tat gac cat tac cat aac atc aaa ttt gag tgg ttt gac 2081
Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp
645 650 655
gaa ggc ctt tat gag gca cct aag ggg tat agt ctt gag gat ctt ttg 2129
Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp Leu Leu
660 665 670
gcg act gtc aag tac tat gtc gaa cat cca aac gaa cgt ccg cat tca 2177
Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser
675 680 685
gat aat ggt ttt ggt aac gct agt gac cat gtt cgt aaa aat aag gca 2225
Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys Ala
690 695 700
gac caa gat agt aaa cct gat gaa gat aag gaa cat gat gaa gta agt 2273
Asp Gln Asp Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val Ser
705 710 715 720
gag cca act cac cct gaa tct gat gaa aaa gag aat cac gct ggt tta 2321
Glu Pro Thr His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly Leu
725 730 735
aat cct tca gca gat aat ctt tat aaa cca agc act gat acg gaa gag 2369
Asn Pro Ser Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu
740 745 750
aca gag gaa gaa gct gaa gat acc aca gat gag gct gaa att cct caa 2417
Thr Glu Glu Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln
755 760 765
gta gag aat tct gtt att aac gct aag ata gca gat gcg gag gcc ttg 2465
Val Glu Asn Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu
770 775 780
cta gaa aaa gta aca gat cct agt att aga caa aat gct atg gag aca 2513
Leu Glu Lys Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met Glu Thr
785 790 795 800
ttg act ggt cta aaa agt agt ctt ctt ctc gga acg aaa gat aat aac 2561
Leu Thr Gly Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn
805 810 815
act att tca gca gaa gta gat agt ctc ttg gct ttg tta aaa gaa agt 2609
Thr Ile Ser Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser
820 825 830
caa ccg gct cct ata cag tag taaaatgaa 2639
Gln Pro Ala Pro Ile Gln
835
<210> 14
<211> 838
<212> PRT
<213> S. pneumoniae
<400> 14
Met Lys Ile Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Val Leu Ala
1 5 10 15
Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val
20 25 30
Lys Lys Glu Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly
35 40 45
Gln Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly
50 55 60
Ile Asn Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80
Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95
Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110
Leu Lys Asp Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile
115 120 125
Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140
Asp Asn Ile Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His
145 150 155 160
Ser His Asn His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg
165 170 175
Ala Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser
180 185 190
Asp Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp
195 200 205
His Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala
210 215 220
Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser
225 230 235 240
Ser Ser Ser Tyr Asn Ala Asn Pro Val Gln Pro Arg Leu Ser Glu Asn
245 250 255
His Asn Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn
260 265 270
Ile Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg
275 280 285
His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser
290 295 300
Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe
305 310 315 320
Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile
325 330 335
Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro
340 345 350
Glu Gln Pro Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Leu Gln
355 360 365
Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys
370 375 380
Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu
385 390 395 400
Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu
405 410 415
Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser
420 425 430
His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu
435 440 445
Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu
450 455 460
Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Val Leu Asp Asn Leu
465 470 475 480
Leu Glu Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp
485 490 495
Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly
500 505 510
Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala
515 520 525
Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro
530 535 540
Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met
545 550 555 560
Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg
565 570 575
Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser
580 585 590
Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala
595 600 605
Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met
610 615 620
Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile
625 630 635 640
Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp
645 650 655
Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp Leu Leu
660 665 670
Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser
675 680 685
Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys Ala
690 695 700
Asp Gln Asp Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val Ser
705 710 715 720
Glu Pro Thr His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly Leu
725 730 735
Asn Pro Ser Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu
740 745 750
Thr Glu Glu Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln
755 760 765
Val Glu Asn Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu
770 775 780
Leu Glu Lys Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met Glu Thr
785 790 795 800
Leu Thr Gly Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn
805 810 815
Thr Ile Ser Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser
820 825 830
Gln Pro Ala Pro Ile Gln
835
<210> 15
<211> 2528
<212> DNA
<213> S. pneumoniae
<220>
<221> CDS
<222> (1)..(2520)
<400> 15
tgt gcc tat gca cta aac cag cat cgt tcg cag gaa aat aag gac aat 48
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
aat cgt gtc tct tat gtg gat ggc agc cag tca agt cag aaa agt gaa 96
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
aac ttg aca cca gac cag gtt agc cag aaa gaa gga att cag gct gag 144
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
caa att gta atc aaa att aca gat cag ggc tat gta acg tca cac ggt 192
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
gat cac tat cat tac tat aat ggg aaa gtt cct tat gat gcc ctc ttt 240
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
agt gaa gaa ctc ttg atg aag gat cca aac tat caa ctt aaa gac gct 288
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
gat att gtc aat gaa gtc aag ggt ggt tat atc atc aag gtc gat gga 336
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
aaa tat tat gtc tac ctg aaa gat gca gct cat gct gat aat gtt cga 384
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
act aaa gat gaa atc aat cgt caa aaa caa gaa cat gtc aaa gat aat 432
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
gag aag gtt aac tct aat gtt gct gta gca agg tct cag gga cga tat 480
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
acg aca aat gat ggt tat gtc ttt aat cca gct gat att atc gaa gat 528
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
acg ggt aat gct tat atc gtt cct cat gga ggt cac tat cac tac att 576
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
ccc aaa agc gat tta tct gct agt gaa tta gca gca gct aaa gca cat 624
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
ctg gct gga aaa aat atg caa ccg agt cag tta agc tat tct tca aca 672
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
cct tct cca tct ctt cca atc aat cca gga act tca cat gag aaa cat 720
Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys His
225 230 235 240
gaa gaa gat gga tac gga ttt gat gct aat cgt att atc gct gaa gat 768
Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu Asp
245 250 255
gaa tca ggt ttt gtc atg agt cac gga gac cac aat cat tat ttc ttc 816
Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe Phe
260 265 270
aag aag gac ttg aca gaa gag caa att aag gct gcg caa aaa cat tta 864
Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His Leu
275 280 285
gag gaa gtt aaa act agt cat aat gga tta gat tct ttg tca tct cat 912
Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser His
290 295 300
gaa cag gat tat cca agt aat gcc aaa gaa atg aaa gat tta gat aaa 960
Glu Gln Asp Tyr Pro Ser Asn Ala Lys Glu Met Lys Asp Leu Asp Lys
305 310 315 320
aaa atc gaa gaa aaa att gct ggc att atg aaa caa tat ggt gtc aaa 1008
Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val Lys
325 330 335
cgt gaa agt att gtc gtg aat aaa gaa aaa aat gcg att att tat ccg 1056
Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr Pro
340 345 350
cat gga gat cac cat cat gca gat ccg att gat gaa cat aaa ccg gtt 1104
His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro Val
355 360 365
gga att ggt cat tct cac agt aac tat gaa ctg ttt aaa ccc gaa gaa 1152
Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu Glu
370 375 380
gga gtt gct aaa aaa gaa ggg aat aaa gtt tat act gga gaa gaa tta 1200
Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu Leu
385 390 395 400
acg aat gtt gtt aat ttg tta aaa aat agt acg ttt aat aat caa aac 1248
Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln Asn
405 410 415
ttt act cta gcc aat ggt caa aaa cgc gtt tct ttt agt ttt ccg cct 1296
Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro Pro
420 425 430
gaa ttg gag aaa aaa tta ggt atc aat atg cta gta aaa tta ata aca 1344
Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile Thr
435 440 445
cca gat gga aaa gta ttg gag aaa gta tct ggt aaa gta ttt gga gaa 1392
Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly Glu
450 455 460
gga gta ggg aat att gca aac ttt gaa tta gat caa cct tat tta cca 1440
Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu Pro
465 470 475 480
gga caa aca ttt aag tat act atc gct tca aaa gat tat cca gaa gta 1488
Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu Val
485 490 495
agt tat gat ggt aca ttt aca gtt cca acc tct tta gct tac aaa atg 1536
Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys Met
500 505 510
gcc agt caa acg att ttc tat cct ttc cat gca ggg gat act tat tta 1584
Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr Leu
515 520 525
aga gtg aac cct caa ttt gca gtg cct aaa gga act gat gct tta gtc 1632
Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu Val
530 535 540
aga gtg ttt gat gaa ttt cat gga aat gct tat tta gaa aat aac tat 1680
Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn Tyr
545 550 555 560
aaa gtt ggt gaa atc aaa tta ccg att ccg aaa tta aac caa gga aca 1728
Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly Thr
565 570 575
acc aga acg gcc gga aat aaa att cct gta acc ttc atg gca aat gct 1776
Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn Ala
580 585 590
tat ttg gac aat caa tcg act tat att gtg gaa gta cct atc ttg gaa 1824
Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu Glu
595 600 605
aaa gaa aat caa act gat aaa cca agt att cta cca caa ttt aaa agg 1872
Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys Arg
610 615 620
aat aaa gca caa gaa aac tca aaa ctt gat gaa aag gta gaa gaa cca 1920
Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp Glu Lys Val Glu Glu Pro
625 630 635 640
aag act agt gag aag gta gaa aaa gaa aaa ctt tct gaa act ggg aat 1968
Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly Asn
645 650 655
agt act agt aat tca acg tta gaa gaa gtt cct aca gtg gat cct gta 2016
Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro Val
660 665 670
caa gaa aaa gta gca aaa ttt gct gaa agt tat ggg atg aag cta gaa 2064
Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu Glu
675 680 685
aat gtc ttg ttt aat atg gac gga aca att gaa tta tat tta cca tcg 2112
Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro Ser
690 695 700
gga gaa gtc att aaa aag aat atg gca gat ttt aca gga gaa gca cct 2160
Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala Pro
705 710 715 720
caa gga aat ggt gaa aat aaa cca tct gaa aat gga aaa gta tct act 2208
Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser Thr
725 730 735
gga aca gtt gag aac caa cca aca gaa aat aaa cca gca gat tct tta 2256
Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser Leu
740 745 750
cca gag gca cca aac gaa aaa cct gta aaa cca gaa aac tca acg gat 2304
Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr Asp
755 760 765
aat gga atg ttg aat cca gaa ggg aat gtg ggg agt gac cct atg tta 2352
Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met Leu
770 775 780
gat tca gca tta gag gaa gct cca gca gta gat cct gta caa gaa aaa 2400
Asp Ser Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu Lys
785 790 795 800
tta gaa aaa ttt aca gct agt tac gga tta ggc tta gat agt gtt ata 2448
Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val Ile
805 810 815
ttc aat atg gat gga acg att gaa tta aga ttg cca agt gga gaa gtg 2496
Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu Val
820 825 830
ata aaa aag aat tta ttg atc tca tagcgtaa 2528
Ile Lys Lys Asn Leu Leu Ile Ser
835 840
<210> 16
<211> 840
<212> PRT
<213> S. pneumoniae
<400> 16
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys His
225 230 235 240
Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu Asp
245 250 255
Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe Phe
260 265 270
Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His Leu
275 280 285
Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser His
290 295 300
Glu Gln Asp Tyr Pro Ser Asn Ala Lys Glu Met Lys Asp Leu Asp Lys
305 310 315 320
Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val Lys
325 330 335
Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr Pro
340 345 350
His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro Val
355 360 365
Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu Glu
370 375 380
Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu Leu
385 390 395 400
Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln Asn
405 410 415
Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro Pro
420 425 430
Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile Thr
435 440 445
Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly Glu
450 455 460
Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu Pro
465 470 475 480
Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu Val
485 490 495
Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys Met
500 505 510
Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr Leu
515 520 525
Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu Val
530 535 540
Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn Tyr
545 550 555 560
Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly Thr
565 570 575
Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn Ala
580 585 590
Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu Glu
595 600 605
Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys Arg
610 615 620
Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp Glu Lys Val Glu Glu Pro
625 630 635 640
Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly Asn
645 650 655
Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro Val
660 665 670
Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu Glu
675 680 685
Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro Ser
690 695 700
Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala Pro
705 710 715 720
Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser Thr
725 730 735
Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser Leu
740 745 750
Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr Asp
755 760 765
Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met Leu
770 775 780
Asp Ser Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu Lys
785 790 795 800
Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val Ile
805 810 815
Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu Val
820 825 830
Ile Lys Lys Asn Leu Leu Ile Ser
835 840
<210> 17
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 17
cagtagatct gtgcctatgc actaaac 27
<210> 18
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 18
gatctctaga ctactgctat tccttacgct atg 33
<210> 19
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 19
atcactcgag cattacctgg ataatcctgt 30
<210> 20
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 20
ctgctaagct tatgaaagat ttagat 26
<210> 21
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 21
gatactcgag ctgctattcc ttac 24
<210> 22
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 22
gaatctcgag ttaagctgct gctaattc 28
<210> 23
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 23
gacgctcgag cgctatgaaa tcagataaat tc 32
<210> 24
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 24
gacgctcgag ggcattacct ggataatcct gttcatg 37
<210> 25
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 25
cagtagatct cttcatcatt tattgaaaag agg 33
<210> 26
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 26
ttatttcttc catatggact tgacagaaga gcaaattaag 40
<210> 27
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 27
cgccaagctt cgctatgaaa tcagataaat tc 32
<210> 28
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 28
cgccaagctt ttccacaata taagtcgatt gatt 34
<210> 29
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 29
ttatttcttc catatggaag tacctatctt ggaaaaagaa 40
<210> 30
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 30
ttatttcttc catatggtgc ctatgcacta aaccagc 37
<210> 31
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 31
ataagaatgc ggccgcttcc acaatataag tcgattgatt 40
<210> 32
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 32
cagtagatct gtgcttatga actaggtttg c 31
<210> 33
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 33
gatcaagctt gctgctacct ttacttactc tc 32
<210> 34
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 34
ctgagatatc cgttatcgtt caaacc 26
<210> 35
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 35
ctgcaagctt ttaaagggga ataatacg 28
<210> 36
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 36
cagtagatct gcagaagcct tcctatctg 29
<210> 37
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 37
tcgccaagct tcgttatcgt tcaaaccatt ggg 33
<210> 38
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 38
ataagaatgc ggccgcctta ctctccttta ataaagccaa tagtt 45
<210> 39
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 39
catgccatgg acattgatag tctcttgaaa cagc 34
<210> 40
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 40
cgccaagctt cttactctcc tttaataaag ccaatag 37
<210> 41
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 41
cgacaagctt aacatggtcg ctagcgttac c 31
<210> 42
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 42
cataccatgg gcctttatga ggcacctaag 30
<210> 43
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 43
cgacaagctt aagtaaatct tcagcctctc tcag 34
<210> 44
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 44
gataccatgg ctagcgacca tgttcaaaga a 31
<210> 45
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 45
cgccaagctt atcatccact aacttgactt tatcac 36
<210> 46
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 46
cataccatgg atattcttgc cttcttagct ccg 33
<210> 47
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 47
catgccatgg tgcttatgaa ctaggtttgc 30
<210> 48
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 48
cgccaagctt tagcgttacc aaaaccatta tc 32
<210> 49
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 49
gtattagatc tgttcctatg aacttggtcg tcacca 36
<210> 50
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 50
cgcctctaga ctactgtata ggagccgg 28
<210> 51
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 51
catgccatgg aaaacatttc aagcctttta cgtg 34
<210> 52
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 52
cgacaagctt ctgtatagga gccggttgac tttc 34
<210> 53
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 53
catgccatgg ttcgtaaaaa taaggcagac caag 34
<210> 54
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> PCR oligonucleotide primer
<400> 54
catgccatgg aagcctattg gaatgggaag 30
<210> 55
<211> 1019
<212> PRT
<213> S. pneumoniae
<400> 55
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Gly Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu
770 775 780
Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys
785 790 795 800
Arg Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp Glu Lys Val Glu Glu
805 810 815
Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly
820 825 830
Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro
835 840 845
Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu
850 855 860
Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro
865 870 875 880
Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala
885 890 895
Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser
900 905 910
Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser
915 920 925
Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr
930 935 940
Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met
945 950 955 960
Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu
965 970 975
Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val
980 985 990
Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu
995 1000 1005
Val Ile Lys Lys Asn Leu Ser Asp Phe Ile Ala
1010 1015
<210> 56
<211> 489
<212> PRT
<213> S. pneumoniae
<400> 56
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Gly Asn Ala
485
<210> 57
<211> 509
<212> PRT
<213> S. pneumoniae
<400> 57
Met Lys Phe Ser Lys Lys Tyr Ile Ala Ala Gly Ser Ala Val Ile Val
1 5 10 15
Ser Leu Ser Leu Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu
20 25 30
Asn Lys Asp Asn Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser
35 40 45
Gln Lys Ser Glu Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly
50 55 60
Ile Gln Ala Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val
65 70 75 80
Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr
85 90 95
Asp Ala Leu Phe Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln
100 105 110
Leu Lys Asp Ala Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile
115 120 125
Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala
130 135 140
Asp Asn Val Arg Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His
145 150 155 160
Val Lys Asp Asn Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser
165 170 175
Gln Gly Arg Tyr Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp
180 185 190
Ile Ile Glu Asp Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His
195 200 205
Tyr His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala
210 215 220
Ala Lys Ala His Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser
225 230 235 240
Tyr Ser Ser Thr Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly
245 250 255
Ser Thr Ser Lys Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu
260 265 270
Lys Glu Leu Tyr Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp
275 280 285
Gly Leu Val Phe Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly
290 295 300
Val Ala Ile Pro His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys
305 310 315 320
Leu Ser Ala Leu Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly
325 330 335
Thr Gly Ser Thr Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser
340 345 350
Ser Leu Gly Ser Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys
355 360 365
Glu Leu Ser Ser Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile
370 375 380
Val Glu Glu Thr Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe
385 390 395 400
His Tyr Ile Pro Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn
405 410 415
Asn Ser Leu Ala Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr
420 425 430
Ser His Glu Lys His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg
435 440 445
Ile Ile Ala Glu Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His
450 455 460
Asn His Tyr Phe Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala
465 470 475 480
Ala Gln Lys His Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp
485 490 495
Ser Leu Ser Ser His Glu Gln Asp Tyr Pro Gly Asn Ala
500 505
<210> 58
<211> 1057
<212> PRT
<213> S. pneumoniae
<400> 58
Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His Leu Glu Glu
1 5 10 15
Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser His Glu Gln
20 25 30
Asp Tyr Pro Gly Asn Ala Lys Glu Met Lys Asp Leu Asp Lys Lys Ile
35 40 45
Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val Lys Arg Glu
50 55 60
Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr Pro His Gly
65 70 75 80
Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro Val Gly Ile
85 90 95
Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu Glu Gly Val
100 105 110
Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu Leu Thr Asn
115 120 125
Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln Asn Phe Thr
130 135 140
Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro Pro Glu Leu
145 150 155 160
Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile Thr Pro Asp
165 170 175
Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly Glu Gly Val
180 185 190
Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu Pro Gly Gln
195 200 205
Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu Val Ser Tyr
210 215 220
Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys Met Ala Ser
225 230 235 240
Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr Leu Arg Val
245 250 255
Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu Val Arg Val
260 265 270
Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn Tyr Lys Val
275 280 285
Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly Thr Thr Arg
290 295 300
Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn Ala Tyr Leu
305 310 315 320
Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu Glu Lys Glu
325 330 335
Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys Arg Asn Lys
340 345 350
Ala Gln Glu Asn Ser Lys Leu Asp Glu Lys Val Glu Glu Pro Lys Thr
355 360 365
Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly Asn Ser Thr
370 375 380
Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro Val Gln Glu
385 390 395 400
Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu Glu Asn Val
405 410 415
Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro Ser Gly Glu
420 425 430
Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala Pro Gln Gly
435 440 445
Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser Thr Gly Thr
450 455 460
Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser Leu Pro Glu
465 470 475 480
Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr Asp Asn Gly
485 490 495
Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met Leu Asp Pro
500 505 510
Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu Lys Leu Glu
515 520 525
Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val Ile Phe Asn
530 535 540
Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu Val Ile Lys
545 550 555 560
Lys Asn Leu Ser Asp Phe Ile Ala Lys Leu Arg Tyr Arg Ser Asn His
565 570 575
Trp Val Pro Asp Ser Arg Pro Glu Glu Pro Ser Pro Gln Pro Thr Pro
580 585 590
Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro Gln Pro Ala Pro
595 600 605
Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val Arg Lys Val
610 615 620
Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro
625 630 635 640
Ala Lys Asn Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu
645 650 655
Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Ala Lys Lys Thr Asp
660 665 670
Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu
675 680 685
Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg Gln Val Asp
690 695 700
Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp Val Ser Ser
705 710 715 720
Asp Lys Val Lys Leu Val Asp Asp Ile Leu Ala Phe Leu Ala Pro Ile
725 730 735
Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln Ile Thr Tyr Thr
740 745 750
Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu
755 760 765
Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp Glu Gly Asp
770 775 780
Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile Lys Lys Asp
785 790 795 800
Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr Ala Lys Glu
805 810 815
Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser Gly Asn Thr
820 825 830
Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys Ala Ala Lys
835 840 845
Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr Thr Val Glu
850 855 860
Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His Tyr His Asn
865 870 875 880
Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly
885 890 895
Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr Val Glu His
900 905 910
Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp
915 920 925
His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr Asn Gln Thr Glu
930 935 940
Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu Lys Pro Glu Glu Glu Thr
945 950 955 960
Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu Ser Pro Lys Pro
965 970 975
Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu Glu Ser Glu Glu Pro Gln
980 985 990
Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu Ala Glu Asp Leu
995 1000 1005
Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys Ser Asn Ala Lys Glu Thr
1010 1015 1020
Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr Gln Asp Asn Asn
1025 1030 1035 1040
Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu Lys Glu Ser
1045 1050 1055
Lys
<210> 59
<211> 205
<212> PRT
<213> S. pneumoniae
<400> 59
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
<210> 60
<211> 821
<212> PRT
<213> S. pneumoniae
<400> 60
Cys Ala Tyr Glu Leu Gly Leu His Gln Ala Gln Thr Val Lys Glu Asn
1 5 10 15
Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln Lys Thr Glu
20 25 30
Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile Ile
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ser
85 90 95
Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser Gln His Arg
130 135 140
Glu Gly Gly Thr Ser Ala Asn Asp Gly Ala Val Ala Phe Ala Arg Ser
145 150 155 160
Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp
165 170 175
Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His
180 185 190
Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala
195 200 205
Ala Glu Ala Phe Leu Ser Gly Arg Glu Asn Leu Ser Asn Leu Arg Thr
210 215 220
Tyr Arg Arg Gln Asn Ser Asp Asn Thr Pro Arg Thr Asn Trp Val Pro
225 230 235 240
Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser
245 250 255
Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp Ser Leu Leu
260 265 270
Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val Glu Ser Asp
275 280 285
Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala Arg Gly
290 295 300
Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr Glu Gln
305 310 315 320
Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu Arg Tyr
325 330 335
Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu Pro Ser Pro
340 345 350
Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro
355 360 365
Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala
370 375 380
Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser
385 390 395 400
Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala Ala Gly Ile
405 410 415
Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Ala
420 425 430
Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala
435 440 445
Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly
450 455 460
Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys
465 470 475 480
Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu Ala Phe
485 490 495
Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln
500 505 510
Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys
515 520 525
Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser
530 535 540
Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp
545 550 555 560
Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala
565 570 575
Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp
580 585 590
Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val
595 600 605
Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln
610 615 620
Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp
625 630 635 640
His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu
645 650 655
Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr
660 665 670
Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly
675 680 685
Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr
690 695 700
Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu Lys Pro
705 710 715 720
Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu
725 730 735
Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu Glu Ser
740 745 750
Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu
755 760 765
Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys Ser Asn
770 775 780
Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr
785 790 795 800
Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu
805 810 815
Leu Lys Glu Ser Lys
820
<210> 61
<211> 334
<212> PRT
<213> S. pneumoniae
<400> 61
Cys Ala Tyr Glu Leu Gly Leu His Gln Ala Gln Thr Val Lys Glu Asn
1 5 10 15
Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln Lys Thr Glu
20 25 30
Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile Ile
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ser
85 90 95
Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser Gln His Arg
130 135 140
Glu Gly Gly Thr Ser Ala Asn Asp Gly Ala Val Ala Phe Ala Arg Ser
145 150 155 160
Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp
165 170 175
Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His
180 185 190
Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala
195 200 205
Ala Glu Ala Phe Leu Ser Gly Arg Glu Asn Leu Ser Asn Leu Arg Thr
210 215 220
Tyr Arg Arg Gln Asn Ser Asp Asn Thr Pro Arg Thr Asn Trp Val Pro
225 230 235 240
Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser
245 250 255
Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp Ser Leu Leu
260 265 270
Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val Glu Ser Asp
275 280 285
Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala Arg Gly
290 295 300
Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr Glu Gln
305 310 315 320
Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu
325 330
<210> 62
<211> 487
<212> PRT
<213> S. pneumoniae
<400> 62
Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu Pro
1 5 10 15
Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro
20 25 30
Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys
35 40 45
Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly
50 55 60
Val Ser Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala Ala
65 70 75 80
Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu
85 90 95
Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn
100 105 110
Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn
115 120 125
Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg
130 135 140
Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu
145 150 155 160
Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn
165 170 175
Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala
180 185 190
Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile
195 200 205
Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser
210 215 220
His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala
225 230 235 240
Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His
245 250 255
Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn
260 265 270
Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn
275 280 285
Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His
290 295 300
Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu
305 310 315 320
Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val
325 330 335
Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly
340 345 350
Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala
355 360 365
Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu
370 375 380
Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys
385 390 395 400
Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu
405 410 415
Glu Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu
420 425 430
Arg Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys
435 440 445
Ser Asn Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe
450 455 460
Gly Thr Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu
465 470 475 480
Ala Leu Leu Lys Glu Ser Lys
485
<210> 63
<211> 613
<212> PRT
<213> S. pneumoniae
<400> 63
Ala Glu Ala Phe Leu Ser Gly Arg Glu Asn Leu Ser Asn Leu Arg Thr
1 5 10 15
Tyr Arg Arg Gln Asn Ser Asp Asn Thr Pro Arg Thr Asn Trp Val Pro
20 25 30
Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser
35 40 45
Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp Ser Leu Leu
50 55 60
Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val Glu Ser Asp
65 70 75 80
Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala Arg Gly
85 90 95
Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr Glu Gln
100 105 110
Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu Arg Tyr
115 120 125
Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu Pro Ser Pro
130 135 140
Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro
145 150 155 160
Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala
165 170 175
Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser
180 185 190
Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala Ala Gly Ile
195 200 205
Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Ala
210 215 220
Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala
225 230 235 240
Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly
245 250 255
Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys
260 265 270
Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu Ala Phe
275 280 285
Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln
290 295 300
Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys
305 310 315 320
Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser
325 330 335
Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp
340 345 350
Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala
355 360 365
Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp
370 375 380
Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val
385 390 395 400
Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln
405 410 415
Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp
420 425 430
His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu
435 440 445
Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr
450 455 460
Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly
465 470 475 480
Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr
485 490 495
Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu Lys Pro
500 505 510
Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu
515 520 525
Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu Glu Ser
530 535 540
Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu
545 550 555 560
Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys Ser Asn
565 570 575
Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr
580 585 590
Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu
595 600 605
Leu Lys Glu Ser Lys
610
<210> 64
<211> 568
<212> PRT
<213> S. pneumoniae
<400> 64
Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His Leu Glu Glu
1 5 10 15
Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser His Glu Gln
20 25 30
Asp Tyr Pro Gly Asn Ala Lys Glu Met Lys Asp Leu Asp Lys Lys Ile
35 40 45
Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val Lys Arg Glu
50 55 60
Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr Pro His Gly
65 70 75 80
Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro Val Gly Ile
85 90 95
Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu Glu Gly Val
100 105 110
Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu Leu Thr Asn
115 120 125
Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln Asn Phe Thr
130 135 140
Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro Pro Glu Leu
145 150 155 160
Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile Thr Pro Asp
165 170 175
Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly Glu Gly Val
180 185 190
Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu Pro Gly Gln
195 200 205
Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu Val Ser Tyr
210 215 220
Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys Met Ala Ser
225 230 235 240
Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr Leu Arg Val
245 250 255
Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu Val Arg Val
260 265 270
Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn Tyr Lys Val
275 280 285
Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly Thr Thr Arg
290 295 300
Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn Ala Tyr Leu
305 310 315 320
Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu Glu Lys Glu
325 330 335
Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys Arg Asn Lys
340 345 350
Ala Gln Glu Asn Ser Lys Leu Asp Glu Lys Val Glu Glu Pro Lys Thr
355 360 365
Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly Asn Ser Thr
370 375 380
Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro Val Gln Glu
385 390 395 400
Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu Glu Asn Val
405 410 415
Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro Ser Gly Glu
420 425 430
Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala Pro Gln Gly
435 440 445
Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser Thr Gly Thr
450 455 460
Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser Leu Pro Glu
465 470 475 480
Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr Asp Asn Gly
485 490 495
Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met Leu Asp Pro
500 505 510
Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu Lys Leu Glu
515 520 525
Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val Ile Phe Asn
530 535 540
Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu Val Ile Lys
545 550 555 560
Lys Asn Leu Ser Asp Phe Ile Ala
565
<210> 65
<211> 329
<212> PRT
<213> S. pneumoniae
<400> 65
Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His Leu Glu Glu
1 5 10 15
Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser His Glu Gln
20 25 30
Asp Tyr Pro Gly Asn Ala Lys Glu Met Lys Asp Leu Asp Lys Lys Ile
35 40 45
Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val Lys Arg Glu
50 55 60
Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr Pro His Gly
65 70 75 80
Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro Val Gly Ile
85 90 95
Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu Glu Gly Val
100 105 110
Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu Leu Thr Asn
115 120 125
Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln Asn Phe Thr
130 135 140
Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro Pro Glu Leu
145 150 155 160
Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile Thr Pro Asp
165 170 175
Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly Glu Gly Val
180 185 190
Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu Pro Gly Gln
195 200 205
Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu Val Ser Tyr
210 215 220
Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys Met Ala Ser
225 230 235 240
Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr Leu Arg Val
245 250 255
Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu Val Arg Val
260 265 270
Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn Tyr Lys Val
275 280 285
Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly Thr Thr Arg
290 295 300
Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn Ala Tyr Leu
305 310 315 320
Asp Asn Gln Ser Thr Tyr Ile Val Glu
325
<210> 66
<211> 240
<212> PRT
<213> S. pneumoniae
<400> 66
Glu Val Pro Ile Leu Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile
1 5 10 15
Leu Pro Gln Phe Lys Arg Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp
20 25 30
Glu Lys Val Glu Glu Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys
35 40 45
Leu Ser Glu Thr Gly Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val
50 55 60
Pro Thr Val Asp Pro Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser
65 70 75 80
Tyr Gly Met Lys Leu Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile
85 90 95
Glu Leu Tyr Leu Pro Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp
100 105 110
Phe Thr Gly Glu Ala Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu
115 120 125
Asn Gly Lys Val Ser Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn
130 135 140
Lys Pro Ala Asp Ser Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys
145 150 155 160
Pro Glu Asn Ser Thr Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val
165 170 175
Gly Ser Asp Pro Met Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val
180 185 190
Asp Pro Val Gln Glu Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu
195 200 205
Gly Leu Asp Ser Val Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg
210 215 220
Leu Pro Ser Gly Glu Val Ile Lys Lys Asn Leu Ser Asp Phe Ile Ala
225 230 235 240
<210> 67
<211> 555
<212> PRT
<213> S. pneumoniae
<400> 67
Asp Ile Asp Ser Leu Leu Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln
1 5 10 15
Arg His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr
20 25 30
Ser Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His
35 40 45
Phe Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg
50 55 60
Ile Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg
65 70 75 80
Pro Glu Glu Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro
85 90 95
Gln Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu
100 105 110
Lys Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe
115 120 125
Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala
130 135 140
Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu
145 150 155 160
Ser His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg
165 170 175
Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp
180 185 190
Leu Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn
195 200 205
Leu Leu Glu Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val
210 215 220
Asp Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu
225 230 235 240
Gly Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val
245 250 255
Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp
260 265 270
Pro Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His
275 280 285
Met Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu
290 295 300
Arg Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro
305 310 315 320
Ser Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu
325 330 335
Ala Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg
340 345 350
Met Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu
355 360 365
Ile Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe
370 375 380
Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu
385 390 395 400
Leu Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His
405 410 415
Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys
420 425 430
Asn Gly Gln Ala Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys
435 440 445
Pro Gln Thr Glu Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro
450 455 460
Gln Ser Glu Lys Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu
465 470 475 480
Glu Ser Pro Glu Glu Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val
485 490 495
Glu Glu Lys Leu Arg Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp
500 505 510
Pro Ile Ile Lys Ser Asn Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn
515 520 525
Asn Leu Leu Phe Gly Thr Gln Asp Asn Asn Thr Ile Met Ala Glu Ala
530 535 540
Glu Lys Leu Leu Ala Leu Leu Lys Glu Ser Lys
545 550 555
<210> 68
<211> 428
<212> PRT
<213> S. pneumoniae
<400> 68
Asp Ile Asp Ser Leu Leu Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln
1 5 10 15
Arg His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr
20 25 30
Ser Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His
35 40 45
Phe Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg
50 55 60
Ile Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg
65 70 75 80
Pro Glu Glu Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro
85 90 95
Gln Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu
100 105 110
Lys Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe
115 120 125
Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala
130 135 140
Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu
145 150 155 160
Ser His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg
165 170 175
Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp
180 185 190
Leu Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn
195 200 205
Leu Leu Glu Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val
210 215 220
Asp Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu
225 230 235 240
Gly Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val
245 250 255
Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp
260 265 270
Pro Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His
275 280 285
Met Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu
290 295 300
Arg Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro
305 310 315 320
Ser Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu
325 330 335
Ala Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg
340 345 350
Met Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu
355 360 365
Ile Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe
370 375 380
Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu
385 390 395 400
Leu Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His
405 410 415
Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val
420 425
<210> 69
<211> 121
<212> PRT
<213> S. pneumoniae
<400> 69
Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala
1 5 10 15
Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp
20 25 30
Asn Gly Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly
35 40 45
Gln Ala Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln
50 55 60
Thr Glu Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser
65 70 75 80
Glu Lys Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser
85 90 95
Pro Glu Glu Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu
100 105 110
Lys Leu Arg Glu Ala Glu Asp Leu Leu
115 120
<210> 70
<211> 132
<212> PRT
<213> S. pneumoniae
<400> 70
Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr Asn
1 5 10 15
Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu Lys Pro Glu
20 25 30
Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu Ser
35 40 45
Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu Glu Ser Glu
50 55 60
Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu Ala
65 70 75 80
Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys Ser Asn Ala
85 90 95
Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr Gln
100 105 110
Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu
115 120 125
Lys Glu Ser Lys
130
<210> 71
<211> 226
<212> PRT
<213> S. pneumoniae
<400> 71
Asp Ile Asp Ser Leu Leu Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln
1 5 10 15
Arg His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr
20 25 30
Ser Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His
35 40 45
Phe Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg
50 55 60
Ile Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg
65 70 75 80
Pro Glu Glu Pro Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro
85 90 95
Gln Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu
100 105 110
Lys Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe
115 120 125
Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala
130 135 140
Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu
145 150 155 160
Ser His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg
165 170 175
Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp
180 185 190
Leu Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn
195 200 205
Leu Leu Glu Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val
210 215 220
Asp Asp
225
<210> 72
<211> 203
<212> PRT
<213> S. pneumoniae
<400> 72
Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly
1 5 10 15
Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala
20 25 30
Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro
35 40 45
Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met
50 55 60
Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg
65 70 75 80
Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser
85 90 95
Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala
100 105 110
Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met
115 120 125
Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile
130 135 140
Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp
145 150 155 160
Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu
165 170 175
Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser
180 185 190
Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val
195 200
<210> 73
<211> 819
<212> PRT
<213> S. pneumoniae
<400> 73
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asp
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gln Gly
145 150 155 160
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile Ile
165 170 175
Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr His
180 185 190
Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala Glu
195 200 205
Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser Ser
210 215 220
Tyr Asn Ala Asn Pro Val Gln Pro Arg Leu Ser Glu Asn His Asn Leu
225 230 235 240
Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser Ser
245 250 255
Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu
260 265 270
Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala
275 280 285
Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr
290 295 300
Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu
305 310 315 320
Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln Pro
325 330 335
Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Leu Gln Pro Ala Pro
340 345 350
Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys
355 360 365
Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly
370 375 380
Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala
385 390 395 400
Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu
405 410 415
Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn
420 425 430
Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn
435 440 445
Lys Gly Arg Gln Val Asp Phe Glu Val Leu Asp Asn Leu Leu Glu Arg
450 455 460
Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu
465 470 475 480
Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn
485 490 495
Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala
500 505 510
Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile
515 520 525
Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser
530 535 540
His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala
545 550 555 560
Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His
565 570 575
Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn
580 585 590
Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn
595 600 605
Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His
610 615 620
Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu
625 630 635 640
Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp Leu Leu Ala Thr Val
645 650 655
Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly
660 665 670
Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp
675 680 685
Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val Ser Glu Pro Thr
690 695 700
His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser
705 710 715 720
Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu
725 730 735
Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu Asn
740 745 750
Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys
755 760 765
Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly
770 775 780
Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser
785 790 795 800
Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser Gln Pro Ala
805 810 815
Pro Ile Gln
<210> 74
<211> 568
<212> PRT
<213> S. pneumoniae
<400> 74
Glu Asn Ile Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser
1 5 10 15
Glu Arg His Val Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile
20 25 30
Thr Ser Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr
35 40 45
His Phe Ile Pro Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala
50 55 60
Arg Ile Ile Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser
65 70 75 80
Arg Pro Glu Gln Pro Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser
85 90 95
Leu Gln Pro Ala Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp
100 105 110
Glu Lys Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val
115 120 125
Phe Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser
130 135 140
Ala Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser
145 150 155 160
Leu Ser His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp
165 170 175
Arg Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln
180 185 190
Asp Leu Leu Asp Asn Lys Gly Arg Gln Val Asp Phe Glu Val Leu Asp
195 200 205
Asn Leu Leu Glu Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu
210 215 220
Val Asp Asp Ile Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg
225 230 235 240
Leu Gly Lys Pro Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln
245 250 255
Val Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe
260 265 270
Asp Pro Arg Asp Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro
275 280 285
His Met Thr His Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala
290 295 300
Glu Arg Ala Ala Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro
305 310 315 320
Pro Ser Thr Asp His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala
325 330 335
Glu Ala Ile Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp
340 345 350
Arg Met Pro Tyr Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser
355 360 365
Leu Ile Ile Pro His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp
370 375 380
Phe Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp
385 390 395 400
Leu Leu Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro
405 410 415
His Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn
420 425 430
Lys Ala Asp Gln Asp Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu
435 440 445
Val Ser Glu Pro Thr His Pro Glu Ser Asp Glu Lys Glu Asn His Ala
450 455 460
Gly Leu Asn Pro Ser Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr
465 470 475 480
Glu Glu Thr Glu Glu Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile
485 490 495
Pro Gln Val Glu Asn Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu
500 505 510
Ala Leu Leu Glu Lys Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met
515 520 525
Glu Thr Leu Thr Gly Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp
530 535 540
Asn Asn Thr Ile Ser Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys
545 550 555 560
Glu Ser Gln Pro Ala Pro Ile Gln
565
<210> 75
<211> 140
<212> PRT
<213> S. pneumoniae
<400> 75
Val Arg Lys Asn Lys Ala Asp Gln Asp Ser Lys Pro Asp Glu Asp Lys
1 5 10 15
Glu His Asp Glu Val Ser Glu Pro Thr His Pro Glu Ser Asp Glu Lys
20 25 30
Glu Asn His Ala Gly Leu Asn Pro Ser Ala Asp Asn Leu Tyr Lys Pro
35 40 45
Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu Ala Glu Asp Thr Thr Asp
50 55 60
Glu Ala Glu Ile Pro Gln Val Glu Asn Ser Val Ile Asn Ala Lys Ile
65 70 75 80
Ala Asp Ala Glu Ala Leu Leu Glu Lys Val Thr Asp Pro Ser Ile Arg
85 90 95
Gln Asn Ala Met Glu Thr Leu Thr Gly Leu Lys Ser Ser Leu Leu Leu
100 105 110
Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala Glu Val Asp Ser Leu Leu
115 120 125
Ala Leu Leu Lys Glu Ser Gln Pro Ala Pro Ile Gln
130 135 140
<210> 76
<211> 3171
<212> DNA
<213> S. pneumoniae
<400> 76
gacttgacag aagagcaaat taaggctgcg caaaaacatt tagaggaagt taaaactagt 60
cataatggat tagattcttt gtcatctcat gaacaggatt atccaggtaa tgccaaagaa 120
atgaaagatt tagataaaaa aatcgaagaa aaaattgctg gcattatgaa acaatatggt 180
gtcaaacgtg aaagtattgt cgtgaataaa gaaaaaaatg cgattattta tccgcatgga 240
gatcaccatc atgcagatcc gattgatgaa cataaaccgg ttggaattgg tcattctcac 300
agtaactatg aactgtttaa acccgaagaa ggagttgcta aaaaagaagg gaataaagtt 360
tatactggag aagaattaac gaatgttgtt aatttgttaa aaaatagtac gtttaataat 420
caaaacttta ctctagccaa tggtcaaaaa cgcgtttctt ttagttttcc gcctgaattg 480
gagaaaaaat taggtatcaa tatgctagta aaattaataa caccagatgg aaaagtattg 540
gagaaagtat ctggtaaagt atttggagaa ggagtaggga atattgcaaa ctttgaatta 600
gatcaacctt atttaccagg acaaacattt aagtatacta tcgcttcaaa agattatcca 660
gaagtaagtt atgatggtac atttacagtt ccaacctctt tagcttacaa aatggccagt 720
caaacgattt tctatccttt ccatgcaggg gatacttatt taagagtgaa ccctcaattt 780
gcagtgccta aaggaactga tgctttagtc agagtgtttg atgaatttca tggaaatgct 840
tatttagaaa ataactataa agttggtgaa atcaaattac cgattccgaa attaaaccaa 900
ggaacaacca gaacggccgg aaataaaatt cctgtaacct tcatggcaaa tgcttatttg 960
gacaatcaat cgacttatat tgtggaagta cctatcttgg aaaaagaaaa tcaaactgat 1020
aaaccaagta ttctaccaca atttaaaagg aataaagcac aagaaaactc aaaacttgat 1080
gaaaaggtag aagaaccaaa gactagtgag aaggtagaaa aagaaaaact ttctgaaact 1140
gggaatagta ctagtaattc aacgttagaa gaagttccta cagtggatcc tgtacaagaa 1200
aaagtagcaa aatttgctga aagttatggg atgaagctag aaaatgtctt gtttaatatg 1260
gacggaacaa ttgaattata tttaccatca ggagaagtca ttaaaaagaa tatggcagat 1320
tttacaggag aagcacctca aggaaatggt gaaaataaac catctgaaaa tggaaaagta 1380
tctactggaa cagttgagaa ccaaccaaca gaaaataaac cagcagattc tttaccagag 1440
gcaccaaacg aaaaacctgt aaaaccagaa aactcaacgg ataatggaat gttgaatcca 1500
gaagggaatg tggggagtga ccctatgtta gatccagcat tagaggaagc tccagcagta 1560
gatcctgtac aagaaaaatt agaaaaattt acagctagtt acggattagg cttagatagt 1620
gttatattca atatggatgg aacgattgaa ttaagattgc caagtggaga agtgataaaa 1680
aagaatttat ctgatttcat agcgaagctt cgttatcgtt caaaccattg ggtaccagat 1740
tcaagaccag aagaaccaag tccacaaccg actccagaac ctagtccaag tccgcaacct 1800
gcaccaaatc ctcaaccagc tccaagcaat ccaattgatg agaaattggt caaagaagct 1860
gttcgaaaag taggcgatgg ttatgtcttt gaggagaatg gagtttctcg ttatatccca 1920
gccaagaatc tttcagcaga aacagcagca ggcattgata gcaaactggc caagcaggaa 1980
agtttatctc ataagctagg agctaagaaa actgacctcc catctagtga tcgagaattt 2040
tacaataagg cttatgactt actagcaaga attcaccaag atttacttga taataaaggt 2100
cgacaagttg attttgaggc tttggataac ctgttggaac gactcaagga tgtctcaagt 2160
gataaagtca agttagtgga tgatattctt gccttcttag ctccgattcg tcatccagaa 2220
cgtttaggaa aaccaaatgc gcaaattacc tacactgatg atgagattca agtagccaag 2280
ttggcaggca agtacacaac agaagacggt tatatctttg atcctcgtga tataaccagt 2340
gatgaggggg atgcctatgt aactccacat atgacccata gccactggat taaaaaagat 2400
agtttgtctg aagctgagag agcggcagcc caggcttatg ctaaagagaa aggtttgacc 2460
cctccttcga cagaccatca ggattcagga aatactgagg caaaaggagc agaagctatc 2520
tacaaccgcg tgaaagcagc taagaaggtg ccacttgatc gtatgcctta caatcttcaa 2580
tatactgtag aagtcaaaaa cggtagttta atcatacctc attatgacca ttaccataac 2640
atcaaatttg agtggtttga cgaaggcctt tatgaggcac ctaaggggta tactcttgag 2700
gatcttttgg cgactgtcaa gtactatgtc gaacatccaa acgaacgtcc gcattcagat 2760
aatggttttg gtaacgctag cgaccatgtt caaagaaaca aaaatggtca agctgatacc 2820
aatcaaacgg aaaaaccaag cgaggagaaa cctcagacag aaaaacctga ggaagaaacc 2880
cctcgagaag agaaaccaca aagcgagaaa ccagagtctc caaaaccaac agaggaacca 2940
gaagaagaat caccagagga atcagaagaa cctcaggtcg agactgaaaa ggttgaagaa 3000
aaactgagag aggctgaaga tttacttgga aaaatccagg atccaattat caagtccaat 3060
gccaaagaga ctctcacagg attaaaaaat aatttactat ttggcaccca ggacaacaat 3120
actattatgg cagaagctga aaaactattg gctttattaa aggagagtaa g 3171
<210> 77
<211> 473
<212> PRT
<213> S. pneumoniae
<400> 77
Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
1 5 10 15
Ser Tyr Asn Ala Asn Pro Val Gln Pro Arg Leu Ser Glu Asn His Asn
20 25 30
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
35 40 45
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val
50 55 60
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
65 70 75 80
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
85 90 95
Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro
100 105 110
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
115 120 125
Pro Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Leu Gln Pro Ala
130 135 140
Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val
145 150 155 160
Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn
165 170 175
Gly Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala
180 185 190
Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys
195 200 205
Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr
210 215 220
Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp
225 230 235 240
Asn Lys Gly Arg Gln Val Asp Phe Glu Val Leu Asp Asn Leu Leu Glu
245 250 255
Arg Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile
260 265 270
Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro
275 280 285
Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu
290 295 300
Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp
305 310 315 320
Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His
325 330 335
Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala
340 345 350
Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp
355 360 365
His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr
370 375 380
Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr
385 390 395 400
Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro
405 410 415
His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly
420 425 430
Leu Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp Leu Leu Ala Thr
435 440 445
Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn
450 455 460
Gly Phe Gly Asn Ala Ser Asp His Val
465 470
<210> 78
<211> 780
<212> PRT
<213> S. pneumoniae
<400> 78
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Gly Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu
770 775 780
<210> 79
<211> 690
<212> PRT
<213> S. pneumoniae
<400> 79
Cys Ala Tyr Glu Leu Gly Leu His Gln Ala Gln Thr Val Lys Glu Asn
1 5 10 15
Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln Lys Thr Glu
20 25 30
Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile Ile
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ser
85 90 95
Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser Gln His Arg
130 135 140
Glu Gly Gly Thr Ser Ala Asn Asp Gly Ala Val Ala Phe Ala Arg Ser
145 150 155 160
Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp
165 170 175
Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His
180 185 190
Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala
195 200 205
Ala Glu Ala Phe Leu Ser Gly Arg Glu Asn Leu Ser Asn Leu Arg Thr
210 215 220
Tyr Arg Arg Gln Asn Ser Asp Asn Thr Pro Arg Thr Asn Trp Val Pro
225 230 235 240
Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser
245 250 255
Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp Ser Leu Leu
260 265 270
Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val Glu Ser Asp
275 280 285
Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala Arg Gly
290 295 300
Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr Glu Gln
305 310 315 320
Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu Arg Tyr
325 330 335
Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu Pro Ser Pro
340 345 350
Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro
355 360 365
Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala
370 375 380
Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser
385 390 395 400
Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala Ala Gly Ile
405 410 415
Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Ala
420 425 430
Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala
435 440 445
Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly
450 455 460
Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys
465 470 475 480
Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu Ala Phe
485 490 495
Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln
500 505 510
Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys
515 520 525
Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser
530 535 540
Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp
545 550 555 560
Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala
565 570 575
Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp
580 585 590
Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val
595 600 605
Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln
610 615 620
Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp
625 630 635 640
His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu
645 650 655
Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr
660 665 670
Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly
675 680 685
Asn Ala
690
<210> 80
<211> 2469
<212> DNA
<213> S. pneumoniae
<400> 80
gtgaagaaaa catatggtta tatcggctca gttgctgcca ttttactagc tactcatatt 60
ggaagttacc aacttggtaa gcatcatatg ggtctagcaa caaaggacaa tcagattgcc 120
tatattgatg acagcaaagg taaggcaaaa gcccctaaaa caaacaaaac gatggatcaa 180
atcagtgctg aagaaggcat ctctgctgaa cagatcgtag tcaaaattac tgaccaaggc 240
tatgtgacct cacacggtga ccattatcat ttttacaatg ggaaagttcc ttatgatgcg 300
attattagtg aagagttgtt gatgacggat cctaattacc gttttaaaca atcagacgtt 360
atcaatgaaa tcttagacgg ttacgttatt aaagtcaatg gcaactatta tgtttacctc 420
aagccaggta gtaagcgcaa aaacattcga accaaacaac aaattgctga gcaagtagcc 480
aaaggaacta aagaagctaa agaaaaaggt ttagctcaag tggcccatct cagtaaagaa 540
gaagttgcgg cagtcaatga agcaaaaaga caaggacgct atactacaga cgatggctat 600
atttttagtc cgacagatat cattgatgat ttaggagatg cttatttagt acctcatggt 660
aatcactatc attatattcc taaaaaggat ttgtctccaa gtgagctagc tgctgcacaa 720
gcctactgga gtcaaaaaca aggtcgaggt gctagaccgt ctgattaccg cccgacacca 780
gccccaggtc gtaggaaagc cccaattcct gatgtgacgc ctaaccctgg acaaggtcat 840
cagccagata acggtggcta tcatccagcg cctcctaggc caaatgatgc gtcacaaaac 900
aaacaccaaa gagatgagtt taaaggaaaa acctttaagg aacttttaga tcaactacac 960
cgtcttgatt tgaaataccg tcatgtggaa gaagatgggt tgatttttga accgactcaa 1020
gtgatcaaat caaacgcttt tgggtatgtg gtgcctcatg gagatcatta tcatattatc 1080
ccaagaagtc agttatcacc tcttgaaatg gaattagcag atcgatactt agctggccaa 1140
actgaggaca atgactcagg ttcagagcac tcaaaaccat cagataaaga agtgacacat 1200
acctttcttg gtcatcgcat caaagcttac ggaaaaggct tagatggtaa accatatgat 1260
acgagtgatg cttatgtttt tagtaaagaa tccattcatt cagtggataa atcaggagtt 1320
acagctaaac acggagatca tttccactat ataggatttg gagaacttga acaatatgag 1380
ttggatgagg tcgctaactg ggtgaaagca aaaggtcaag ctgatgagct tgctgctgct 1440
ttggatcagg aacaaggcaa agaaaaacca ctctttgaca ctaaaaaagt gagtcgcaaa 1500
gtaacaaaag atggtaaagt gggctatatg atgccaaaag atggtaagga ctatttctat 1560
gctcgtgatc aacttgattt gactcagatt gcctttgccg aacaagaact aatgcttaaa 1620
gataagaagc attaccgtta tgacattgtt gacacaggta ttgagccacg acttgctgta 1680
gatgtgtcaa gtctgccgat gcatgctggt aatgctactt acgatactgg aagttcgttt 1740
gttatcccac atattgatca tatccatgtc gttccgtatt catggttgac gcgcgatcag 1800
attgcaacag tcaagtatgt gatgcaacac cccgaagttc gtccggatgt atggtctaag 1860
ccagggcatg aagagtcagg ttcggtcatt ccaaatgtta cgcctcttga taaacgtgct 1920
ggtatgccaa actggcaaat tatccattct gctgaagaag ttcaaaaagc cctagcagaa 1980
ggtcgttttg caacaccaga cggctatatt ttcgatccac gagatgtttt ggccaaagaa 2040
acttttgtat ggaaagatgg ctcctttagc atcccaagag cagatggcag ttcattgaga 2100
accattaata aatctgatct atcccaagct gagtggcaac aagctcaaga gttattggca 2160
aagaaaaata ctggtgatgc tactgatacg gataaaccca aagaaaagca acaggcagat 2220
aagagcaatg aaaaccaaca gccaagtgaa gccagtaaag aagaaaaaga atcagatgac 2280
tttatagaca gtttaccaga ctatggtcta gatagagcaa ccctagaaga tcatatcaat 2340
caattagcac aaaaagctaa tatcgatcct aagtatctca ttttccaacc agaaggtgtc 2400
caattttata ataaaaatgg tgaattggta acttatgata tcaagacact tcaacaaata 2460
aacccttaa 2469
<210> 81
<211> 823
<212> PRT
<213> S. pneumoniae
<400> 81
Val Lys Lys Thr Tyr Gly Tyr Ile Gly Ser Val Ala Ala Ile Leu Leu
1 5 10 15
Ala Thr His Ile Gly Ser Tyr Gln Leu Gly Lys His His Met Gly Leu
20 25 30
Ala Thr Lys Asp Asn Gln Ile Ala Tyr Ile Asp Asp Ser Lys Gly Lys
35 40 45
Ala Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gln Ile Ser Ala Glu
50 55 60
Glu Gly Ile Ser Ala Glu Gln Ile Val Val Lys Ile Thr Asp Gln Gly
65 70 75 80
Tyr Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn Gly Lys Val
85 90 95
Pro Tyr Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Thr Asp Pro Asn
100 105 110
Tyr Arg Phe Lys Gln Ser Asp Val Ile Asn Glu Ile Leu Asp Gly Tyr
115 120 125
Val Ile Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys Pro Gly Ser
130 135 140
Lys Arg Lys Asn Ile Arg Thr Lys Gln Gln Ile Ala Glu Gln Val Ala
145 150 155 160
Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gln Val Ala His
165 170 175
Leu Ser Lys Glu Glu Val Ala Ala Val Asn Glu Ala Lys Arg Gln Gly
180 185 190
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Ser Pro Thr Asp Ile Ile
195 200 205
Asp Asp Leu Gly Asp Ala Tyr Leu Val Pro His Gly Asn His Tyr His
210 215 220
Tyr Ile Pro Lys Lys Asp Leu Ser Pro Ser Glu Leu Ala Ala Ala Gln
225 230 235 240
Ala Tyr Trp Ser Gln Lys Gln Gly Arg Gly Ala Arg Pro Ser Asp Tyr
245 250 255
Arg Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala Pro Ile Pro Asp Val
260 265 270
Thr Pro Asn Pro Gly Gln Gly His Gln Pro Asp Asn Gly Gly Tyr His
275 280 285
Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln Asn Lys His Gln Arg
290 295 300
Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu Leu Asp Gln Leu His
305 310 315 320
Arg Leu Asp Leu Lys Tyr Arg His Val Glu Glu Asp Gly Leu Ile Phe
325 330 335
Glu Pro Thr Gln Val Ile Lys Ser Asn Ala Phe Gly Tyr Val Val Pro
340 345 350
His Gly Asp His Tyr His Ile Ile Pro Arg Ser Gln Leu Ser Pro Leu
355 360 365
Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gln Thr Glu Asp Asn
370 375 380
Asp Ser Gly Ser Glu His Ser Lys Pro Ser Asp Lys Glu Val Thr His
385 390 395 400
Thr Phe Leu Gly His Arg Ile Lys Ala Tyr Gly Lys Gly Leu Asp Gly
405 410 415
Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys Glu Ser Ile
420 425 430
His Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly Asp His Phe
435 440 445
His Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr Glu Leu Asp Glu Val
450 455 460
Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp Glu Leu Ala Ala Ala
465 470 475 480
Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu Phe Asp Thr Lys Lys
485 490 495
Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr Met Met Pro
500 505 510
Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp Gln Leu Asp Leu Thr
515 520 525
Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu Lys Asp Lys Lys His
530 535 540
Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu Pro Arg Leu Ala Val
545 550 555 560
Asp Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr Tyr Asp Thr
565 570 575
Gly Ser Ser Phe Val Ile Pro His Ile Asp His Ile His Val Val Pro
580 585 590
Tyr Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr Val Lys Tyr Val Met
595 600 605
Gln His Pro Glu Val Arg Pro Asp Val Trp Ser Lys Pro Gly His Glu
610 615 620
Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro Leu Asp Lys Arg Ala
625 630 635 640
Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala Glu Glu Val Gln Lys
645 650 655
Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr Ile Phe Asp
660 665 670
Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys Asp Gly Ser
675 680 685
Phe Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr Ile Asn Lys
690 695 700
Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala Gln Glu Leu Leu Ala
705 710 715 720
Lys Lys Asn Thr Gly Asp Ala Thr Asp Thr Asp Lys Pro Lys Glu Lys
725 730 735
Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln Pro Ser Glu Ala Ser
740 745 750
Lys Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp Ser Leu Pro Asp Tyr
755 760 765
Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile Asn Gln Leu Ala Gln
770 775 780
Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe Gln Pro Glu Gly Val
785 790 795 800
Gln Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp Ile Lys Thr
805 810 815
Leu Gln Gln Ile Asn Pro Pro
820
<210> 82
<211> 2472
<212> DNA
<213> S. pneumoniae
<400> 82
gtgaagaaaa catatggtta tatcggctca gttgctgcca ttttactagc tactcatatt 60
ggaagttacc aacttggtaa gcatcatatg ggtctagcaa caaaggacaa tcagattgcc 120
tatattgatg atagcaaagg taaggcaaaa gcccctaaaa caaacaaaac gatggatcaa 180
atcagtgctg aagaaggcat ctctgctgaa cagatcgtag tcaaaattac tgaccaaggt 240
tatgtgacct cacacggtga ccattatcat ttttacaatg ggaaagttcc ttatgatgcg 300
attattagtg aagagttgtt gatgacggat cctaattacc attttaaaca atcagacgtt 360
atcaatgaaa tcttagacgg ttacgttatt aaagtcaatg gcaactatta tgtttacctc 420
aagccaggta gtaagcgcaa aaacattcga accaaacaac aaattgctga gcaagtagcc 480
aaaggaacta aagaagctaa agaaaaaggt ttagctcaag tggcccatct cagtaaagaa 540
gaagttgcgg cagtcaatga agcaaaaaga caaggacgct atactacaga cgatggctat 600
atttttagtc cgacagatat cattgatgat ttaggagacg cttatttagt acctcatggt 660
aatcactatc attatattcc taaaaaagat ttgtctccaa gtgagctagc tgctgcacaa 720
gcttactgga gtcaaaaaca aggtcgaggt gctagaccgt ctgattaccg cccgacacca 780
gccccaggtc gtaggaaagc tccaattcct gatgtgacgc ctaaccctgg acaaggtcat 840
cagccagata acggtggcta tcatccagcg cctcctaggc caaatgatgc gtcacaaaac 900
aaacaccaaa gagatgagtt taaaggaaaa acctttaagg aacttttaga tcaactacac 960
cgtcttgatt tgaaataccg tcatgtggaa gaagatgggt tgatttttga accgactcaa 1020
gtgatcaaat caaacgcttt tgggtatgtg gtgcctcatg gagatcatta tcatattatc 1080
ccaagaagtc agttatcacc tcttgaaatg gaattagcag atcgatactt agccggtcaa 1140
actgaggaca atgattcagg ttcagatcac tcaaaaccat cagataaaga agtgacacat 1200
acctttcttg gtcatcgcat caaagcttac ggaaaaggct tagatggtaa accatatgat 1260
acgagtgatg cttatgtttt tagtaaagaa tccattcatt cagtggataa atcaggagtt 1320
acagctaaac acggagatca tttccactat ataggatttg gagaacttga acaatatgag 1380
ttggatgagg tcgctaactg ggtgaaagca aaaggtcaag ctgatgagct tgctgctgct 1440
ttggatcagg aacaaggcaa agaaaaacca ctctttgaca ctaaaaaagt gagtcgcaaa 1500
gtaacaaaag atggtaaagt gggctatatt atgccaaaag atggcaagga ctatttctat 1560
gctcgtgatc aacttgattt gactcagatt gcctttgccg aacaagaact aatgcttaaa 1620
gataagaacc attaccgtta tgacattgtt gacacaggta ttgagccacg acttgctgta 1680
gatgtgtcaa gtctgccgat gcatgctggt aatgctactt acgatactgg aagttcgttt 1740
gttatccctc atattgatca tatccatgtc gttccgtatt catggttgac gcgcgatcag 1800
attgcaacaa tcaagtatgt gatgcaacac cccgaagttc gtccagatgt atggtctaag 1860
ccagggcatg aagagtcagg ttcggtcatt ccaaatgtta cgcctcttga taaacgtgct 1920
ggtatgccaa attggcaaat catccattct gctgaagaag ttcaaaaagc cctagcagaa 1980
ggtcgttttg caacaccaga cggctatatt ttcgatccac gagatgtttt ggccaaagaa 2040
acttttgtat ggaaagatgg ctcctttagc atcccaagag cagatggcag ttcattgaga 2100
accattaata aatctgatct atcccaagct gagtggcaac aagctcaaga gttattggca 2160
aagaaaaacg ctggtgatgc tactgatacg gataaaccca aagaaaagca acaggcagat 2220
aagagcaatg aaaaccaaca gccaagtgaa gccagtaaag aagaagaaaa agaatcagat 2280
gactttatag acagtttacc agactatggt ctagatagag caaccctaga agatcatatc 2340
aatcaattag cacaaaaagc taatatcgat cctaagtatc tcattttcca accagaaggt 2400
gtccaatttt ataataaaaa tggtgaatta gtaacttatg atatcaagac gcttcaacaa 2460
ataaaccctt aa 2472
<210> 83
<211> 824
<212> PRT
<213> S. pneumoniae
<400> 83
Val Lys Lys Thr Tyr Gly Tyr Ile Gly Ser Val Ala Ala Ile Leu Leu
1 5 10 15
Ala Thr His Ile Gly Ser Tyr Gln Leu Gly Lys His His Met Gly Leu
20 25 30
Ala Thr Lys Asp Asn Gln Ile Ala Tyr Ile Asp Asp Ser Lys Gly Lys
35 40 45
Ala Lys Ala Pro Lys Thr Asn Lys Thr Met Asp Gln Ile Ser Ala Glu
50 55 60
Glu Gly Ile Ser Ala Glu Gln Ile Val Val Lys Ile Thr Asp Gln Gly
65 70 75 80
Tyr Val Thr Ser His Gly Asp His Tyr His Phe Tyr Asn Gly Lys Val
85 90 95
Pro Tyr Asp Ala Ile Ile Ser Glu Glu Leu Leu Met Thr Asp Pro Asn
100 105 110
Tyr His Phe Lys Gln Ser Asp Val Ile Asn Glu Ile Leu Asp Gly Tyr
115 120 125
Val Ile Lys Val Asn Gly Asn Tyr Tyr Val Tyr Leu Lys Pro Gly Ser
130 135 140
Lys Arg Lys Asn Ile Arg Thr Lys Gln Gln Ile Ala Glu Gln Val Ala
145 150 155 160
Lys Gly Thr Lys Glu Ala Lys Glu Lys Gly Leu Ala Gln Val Ala His
165 170 175
Leu Ser Lys Glu Glu Val Ala Ala Val Asn Glu Ala Lys Arg Gln Gly
180 185 190
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Ser Pro Thr Asp Ile Ile
195 200 205
Asp Asp Leu Gly Asp Ala Tyr Leu Val Pro His Gly Asn His Tyr His
210 215 220
Tyr Ile Pro Lys Lys Asp Leu Ser Pro Ser Glu Leu Ala Ala Ala Gln
225 230 235 240
Ala Tyr Trp Ser Gln Lys Gln Gly Arg Gly Ala Arg Pro Ser Asp Tyr
245 250 255
Arg Pro Thr Pro Ala Pro Gly Arg Arg Lys Ala Pro Ile Pro Asp Val
260 265 270
Thr Pro Asn Pro Gly Gln Gly His Gln Pro Asp Asn Gly Gly Tyr His
275 280 285
Pro Ala Pro Pro Arg Pro Asn Asp Ala Ser Gln Asn Lys His Gln Arg
290 295 300
Asp Glu Phe Lys Gly Lys Thr Phe Lys Glu Leu Leu Asp Gln Leu His
305 310 315 320
Arg Leu Asp Leu Lys Tyr Arg His Val Glu Glu Asp Gly Leu Ile Phe
325 330 335
Glu Pro Thr Gln Val Ile Lys Ser Asn Ala Phe Gly Tyr Val Val Pro
340 345 350
His Gly Asp His Tyr His Ile Ile Pro Arg Ser Gln Leu Ser Pro Leu
355 360 365
Glu Met Glu Leu Ala Asp Arg Tyr Leu Ala Gly Gln Thr Glu Asp Asn
370 375 380
Asp Ser Gly Ser Asp His Ser Lys Pro Ser Asp Lys Glu Val Thr His
385 390 395 400
Thr Phe Leu Gly His Arg Ile Lys Ala Tyr Gly Lys Gly Leu Asp Gly
405 410 415
Lys Pro Tyr Asp Thr Ser Asp Ala Tyr Val Phe Ser Lys Glu Ser Ile
420 425 430
His Ser Val Asp Lys Ser Gly Val Thr Ala Lys His Gly Asp His Phe
435 440 445
His Tyr Ile Gly Phe Gly Glu Leu Glu Gln Tyr Glu Leu Asp Glu Val
450 455 460
Ala Asn Trp Val Lys Ala Lys Gly Gln Ala Asp Glu Leu Ala Ala Ala
465 470 475 480
Leu Asp Gln Glu Gln Gly Lys Glu Lys Pro Leu Phe Asp Thr Lys Lys
485 490 495
Val Ser Arg Lys Val Thr Lys Asp Gly Lys Val Gly Tyr Ile Met Pro
500 505 510
Lys Asp Gly Lys Asp Tyr Phe Tyr Ala Arg Asp Gln Leu Asp Leu Thr
515 520 525
Gln Ile Ala Phe Ala Glu Gln Glu Leu Met Leu Lys Asp Lys Asn His
530 535 540
Tyr Arg Tyr Asp Ile Val Asp Thr Gly Ile Glu Pro Arg Leu Ala Val
545 550 555 560
Asp Val Ser Ser Leu Pro Met His Ala Gly Asn Ala Thr Tyr Asp Thr
565 570 575
Gly Ser Ser Phe Val Ile Pro His Ile Asp His Ile His Val Val Pro
580 585 590
Tyr Ser Trp Leu Thr Arg Asp Gln Ile Ala Thr Ile Lys Tyr Val Met
595 600 605
Gln His Pro Glu Val Arg Pro Asp Val Trp Ser Lys Pro Gly His Glu
610 615 620
Glu Ser Gly Ser Val Ile Pro Asn Val Thr Pro Leu Asp Lys Arg Ala
625 630 635 640
Gly Met Pro Asn Trp Gln Ile Ile His Ser Ala Glu Glu Val Gln Lys
645 650 655
Ala Leu Ala Glu Gly Arg Phe Ala Thr Pro Asp Gly Tyr Ile Phe Asp
660 665 670
Pro Arg Asp Val Leu Ala Lys Glu Thr Phe Val Trp Lys Asp Gly Ser
675 680 685
Phe Ser Ile Pro Arg Ala Asp Gly Ser Ser Leu Arg Thr Ile Asn Lys
690 695 700
Ser Asp Leu Ser Gln Ala Glu Trp Gln Gln Ala Gln Glu Leu Leu Ala
705 710 715 720
Lys Lys Asn Ala Gly Asp Ala Thr Asp Thr Asp Lys Pro Lys Glu Lys
725 730 735
Gln Gln Ala Asp Lys Ser Asn Glu Asn Gln Gln Pro Ser Glu Ala Ser
740 745 750
Lys Glu Glu Glu Lys Glu Ser Asp Asp Phe Ile Asp Ser Leu Pro Asp
755 760 765
Tyr Gly Leu Asp Arg Ala Thr Leu Glu Asp His Ile Asn Gln Leu Ala
770 775 780
Gln Lys Ala Asn Ile Asp Pro Lys Tyr Leu Ile Phe Gln Pro Glu Gly
785 790 795 800
Val Gln Phe Tyr Asn Lys Asn Gly Glu Leu Val Thr Tyr Asp Ile Lys
805 810 815
Thr Leu Gln Gln Ile Asn Pro Pro
820
<210> 84
<211> 1019
<212> PRT
<213> S. pneumoniae
<400> 84
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Arg Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Ser Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu
770 775 780
Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys
785 790 795 800
Arg Asn Lys Ala Gln Glu Asn Ser Lys Phe Asp Glu Lys Val Glu Glu
805 810 815
Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly
820 825 830
Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro
835 840 845
Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu
850 855 860
Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro
865 870 875 880
Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala
885 890 895
Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser
900 905 910
Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser
915 920 925
Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr
930 935 940
Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met
945 950 955 960
Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu
965 970 975
Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val
980 985 990
Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu
995 1000 1005
Val Ile Lys Lys Asn Leu Ser Asp Leu Ile Ala
1010 1015
<210> 85
<211> 1019
<212> PRT
<213> S. pneumoniae
<400> 85
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Arg Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Ile Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Ile Met Ser His Gly Asn His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Gly Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu
770 775 780
Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys
785 790 795 800
Arg Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp Glu Lys Val Glu Glu
805 810 815
Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly
820 825 830
Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro
835 840 845
Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu
850 855 860
Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro
865 870 875 880
Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala
885 890 895
Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser
900 905 910
Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser
915 920 925
Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr
930 935 940
Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met
945 950 955 960
Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu
965 970 975
Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val
980 985 990
Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu
995 1000 1005
Val Ile Lys Lys Asn Leu Ser Asp Leu Ile Ala
1010 1015
<210> 86
<211> 1019
<212> PRT
<213> S. pneumoniae
<400> 86
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Ser Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu
770 775 780
Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys
785 790 795 800
Arg Asn Lys Ala Gln Glu Asn Leu Lys Leu Asp Glu Lys Val Glu Glu
805 810 815
Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly
820 825 830
Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro
835 840 845
Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu
850 855 860
Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro
865 870 875 880
Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala
885 890 895
Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser
900 905 910
Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser
915 920 925
Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr
930 935 940
Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met
945 950 955 960
Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu
965 970 975
Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val
980 985 990
Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu
995 1000 1005
Val Ile Lys Lys Asn Leu Ser Asp Leu Ile Ala
1010 1015
<210> 87
<211> 1019
<212> PRT
<213> S. pneumoniae
<400> 87
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Gly Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Gly Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu
770 775 780
Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys
785 790 795 800
Arg Asn Lys Ala Gln Glu Asn Ser Lys Leu Asp Glu Lys Val Glu Glu
805 810 815
Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly
820 825 830
Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro
835 840 845
Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu
850 855 860
Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro
865 870 875 880
Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala
885 890 895
Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser
900 905 910
Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser
915 920 925
Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr
930 935 940
Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met
945 950 955 960
Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu
965 970 975
Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val
980 985 990
Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu
995 1000 1005
Val Ile Lys Lys Asn Leu Ser Asp Phe Ile Ala
1010 1015
<210> 88
<211> 1019
<212> PRT
<213> S. pneumoniae
<400> 88
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Arg Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Ser Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu
770 775 780
Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys
785 790 795 800
Arg Asn Lys Ala Gln Glu Asn Ser Lys Phe Asp Glu Lys Val Glu Glu
805 810 815
Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly
820 825 830
Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro
835 840 845
Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu
850 855 860
Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro
865 870 875 880
Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala
885 890 895
Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser
900 905 910
Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser
915 920 925
Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr
930 935 940
Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met
945 950 955 960
Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu
965 970 975
Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val
980 985 990
Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu
995 1000 1005
Val Ile Lys Lys Asn Leu Ser Asp Leu Ile Ala
1010 1015
<210> 89
<211> 1019
<212> PRT
<213> S. pneumoniae
<400> 89
Cys Ala Tyr Ala Leu Asn Gln His Arg Ser Gln Glu Asn Lys Asp Asn
1 5 10 15
Asn Arg Val Ser Tyr Val Asp Gly Ser Gln Ser Ser Gln Lys Ser Glu
20 25 30
Asn Leu Thr Pro Asp Gln Val Ser Gln Lys Glu Gly Ile Gln Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ala
85 90 95
Asp Ile Val Asn Glu Val Lys Gly Gly Tyr Ile Ile Lys Val Asp Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Asp Glu Ile Asn Arg Gln Lys Gln Glu His Val Lys Asp Asn
130 135 140
Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser Gln Gly Arg Tyr
145 150 155 160
Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp Ile Ile Glu Asp
165 170 175
Thr Gly Asn Ala Tyr Ile Val Pro His Arg Gly His Tyr His Tyr Ile
180 185 190
Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His
195 200 205
Leu Ala Gly Lys Asn Met Gln Pro Ser Gln Leu Ser Tyr Ser Ser Thr
210 215 220
Ala Ser Asp Asn Asn Thr Gln Ser Val Ala Lys Gly Ser Thr Ser Lys
225 230 235 240
Pro Ala Asn Lys Ser Glu Asn Leu Gln Ser Leu Leu Lys Glu Leu Tyr
245 250 255
Asp Ser Pro Ser Ala Gln Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe
260 265 270
Asp Pro Ala Lys Ile Ile Ser Arg Thr Pro Asn Gly Val Ala Ile Pro
275 280 285
His Gly Asp His Tyr His Phe Ile Pro Tyr Ser Lys Leu Ser Ala Leu
290 295 300
Glu Glu Lys Ile Ala Arg Met Val Pro Ile Ser Gly Thr Gly Ser Thr
305 310 315 320
Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser
325 330 335
Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys Glu Leu Ser Ser
340 345 350
Ala Ser Asp Gly Tyr Ile Phe Asn Pro Lys Asp Ile Val Glu Glu Thr
355 360 365
Ala Thr Ala Tyr Ile Val Arg His Gly Asp His Phe His Tyr Ile Pro
370 375 380
Lys Ser Asn Gln Ile Gly Gln Pro Thr Leu Pro Asn Asn Ser Leu Ala
385 390 395 400
Thr Pro Ser Pro Ser Leu Pro Ile Asn Pro Gly Thr Ser His Glu Lys
405 410 415
His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg Ile Ile Ala Glu
420 425 430
Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His Asn His Tyr Phe
435 440 445
Phe Lys Lys Asp Leu Thr Glu Glu Gln Ile Lys Ala Ala Gln Lys His
450 455 460
Leu Glu Glu Val Lys Thr Ser His Asn Gly Leu Asp Ser Leu Ser Ser
465 470 475 480
His Glu Gln Asp Tyr Pro Ser Asn Ala Lys Glu Met Lys Asp Leu Asp
485 490 495
Lys Lys Ile Glu Glu Lys Ile Ala Gly Ile Met Lys Gln Tyr Gly Val
500 505 510
Lys Arg Glu Ser Ile Val Val Asn Lys Glu Lys Asn Ala Ile Ile Tyr
515 520 525
Pro His Gly Asp His His His Ala Asp Pro Ile Asp Glu His Lys Pro
530 535 540
Val Gly Ile Gly His Ser His Ser Asn Tyr Glu Leu Phe Lys Pro Glu
545 550 555 560
Glu Gly Val Ala Lys Lys Glu Gly Asn Lys Val Tyr Thr Gly Glu Glu
565 570 575
Leu Thr Asn Val Val Asn Leu Leu Lys Asn Ser Thr Phe Asn Asn Gln
580 585 590
Asn Phe Thr Leu Ala Asn Gly Gln Lys Arg Val Ser Phe Ser Phe Pro
595 600 605
Pro Glu Leu Glu Lys Lys Leu Gly Ile Asn Met Leu Val Lys Leu Ile
610 615 620
Thr Pro Asp Gly Lys Val Leu Glu Lys Val Ser Gly Lys Val Phe Gly
625 630 635 640
Glu Gly Val Gly Asn Ile Ala Asn Phe Glu Leu Asp Gln Pro Tyr Leu
645 650 655
Pro Gly Gln Thr Phe Lys Tyr Thr Ile Ala Ser Lys Asp Tyr Pro Glu
660 665 670
Val Ser Tyr Asp Gly Thr Phe Thr Val Pro Thr Ser Leu Ala Tyr Lys
675 680 685
Met Ala Ser Gln Thr Ile Phe Tyr Pro Phe His Ala Gly Asp Thr Tyr
690 695 700
Leu Arg Val Asn Pro Gln Phe Ala Val Pro Lys Gly Thr Asp Ala Leu
705 710 715 720
Val Arg Val Phe Asp Glu Phe His Gly Asn Ala Tyr Leu Glu Asn Asn
725 730 735
Tyr Lys Val Gly Glu Ile Lys Leu Pro Ile Pro Lys Leu Asn Gln Gly
740 745 750
Thr Thr Arg Thr Ala Gly Asn Lys Ile Pro Val Thr Phe Met Ala Asn
755 760 765
Ala Tyr Leu Asp Asn Gln Ser Thr Tyr Ile Val Glu Val Pro Ile Leu
770 775 780
Glu Lys Glu Asn Gln Thr Asp Lys Pro Ser Ile Leu Pro Gln Phe Lys
785 790 795 800
Arg Asn Lys Ala Gln Glu Asn Ser Lys Phe Asp Glu Lys Val Glu Glu
805 810 815
Pro Lys Thr Ser Glu Lys Val Glu Lys Glu Lys Leu Ser Glu Thr Gly
820 825 830
Asn Ser Thr Ser Asn Ser Thr Leu Glu Glu Val Pro Thr Val Asp Pro
835 840 845
Val Gln Glu Lys Val Ala Lys Phe Ala Glu Ser Tyr Gly Met Lys Leu
850 855 860
Glu Asn Val Leu Phe Asn Met Asp Gly Thr Ile Glu Leu Tyr Leu Pro
865 870 875 880
Ser Gly Glu Val Ile Lys Lys Asn Met Ala Asp Phe Thr Gly Glu Ala
885 890 895
Pro Gln Gly Asn Gly Glu Asn Lys Pro Ser Glu Asn Gly Lys Val Ser
900 905 910
Thr Gly Thr Val Glu Asn Gln Pro Thr Glu Asn Lys Pro Ala Asp Ser
915 920 925
Leu Pro Glu Ala Pro Asn Glu Lys Pro Val Lys Pro Glu Asn Ser Thr
930 935 940
Asp Asn Gly Met Leu Asn Pro Glu Gly Asn Val Gly Ser Asp Pro Met
945 950 955 960
Leu Asp Pro Ala Leu Glu Glu Ala Pro Ala Val Asp Pro Val Gln Glu
965 970 975
Lys Leu Glu Lys Phe Thr Ala Ser Tyr Gly Leu Gly Leu Asp Ser Val
980 985 990
Ile Phe Asn Met Asp Gly Thr Ile Glu Leu Arg Leu Pro Ser Gly Glu
995 1000 1005
Val Ile Lys Lys Asn Leu Ser Asp Leu Ile Ala
1010 1015
<210> 90
<211> 819
<212> PRT
<213> S. pneumoniae
<400> 90
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asp
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gln Gly
145 150 155 160
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile Ile
165 170 175
Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr His
180 185 190
Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala Glu
195 200 205
Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser Ser
210 215 220
Tyr Asn Ala Asn Pro Val Gln Pro Arg Leu Ser Glu Asn His Asn Leu
225 230 235 240
Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser Ser
245 250 255
Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu
260 265 270
Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala
275 280 285
Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr
290 295 300
Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu
305 310 315 320
Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln Pro
325 330 335
Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Leu Gln Pro Ala Pro
340 345 350
Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys
355 360 365
Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly
370 375 380
Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala
385 390 395 400
Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu
405 410 415
Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn
420 425 430
Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn
435 440 445
Lys Gly Arg Gln Val Asp Phe Glu Val Leu Asp Asn Leu Leu Glu Arg
450 455 460
Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu
465 470 475 480
Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn
485 490 495
Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala
500 505 510
Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile
515 520 525
Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser
530 535 540
His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala
545 550 555 560
Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His
565 570 575
Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn
580 585 590
Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn
595 600 605
Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His
610 615 620
Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu
625 630 635 640
Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp Leu Leu Ala Thr Val
645 650 655
Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly
660 665 670
Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp
675 680 685
Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val Ser Glu Pro Thr
690 695 700
His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser
705 710 715 720
Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu
725 730 735
Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu Asn
740 745 750
Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys
755 760 765
Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly
770 775 780
Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser
785 790 795 800
Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser Gln Pro Ala
805 810 815
Pro Ile Gln
<210> 91
<211> 820
<212> PRT
<213> S. pneumoniae
<400> 91
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asp
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala Arg Ala Gln
145 150 155 160
Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175
Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr
180 185 190
His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
Glu Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
210 215 220
Ser Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn
225 230 235 240
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
245 250 255
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val
260 265 270
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
275 280 285
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
290 295 300
Tyr Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro
305 310 315 320
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
325 330 335
Pro Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala
340 345 350
Pro Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val
355 360 365
Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn
370 375 380
Gly Val Ser Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala
385 390 395 400
Ala Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys
405 410 415
Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr
420 425 430
Asn Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp
435 440 445
Asn Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu
450 455 460
Arg Leu Lys Asp Val Pro Ser Asp Lys Val Lys Leu Val Asp Asp Ile
465 470 475 480
Leu Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro
485 490 495
Asn Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu
500 505 510
Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp
515 520 525
Ile Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His
530 535 540
Ser His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala
545 550 555 560
Ala Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp
565 570 575
His Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr
580 585 590
Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr
595 600 605
Asn Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro
610 615 620
His Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly
625 630 635 640
Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr
645 650 655
Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn
660 665 670
Gly Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys Val Asp Gln
675 680 685
Asp Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val Ser Glu Pro
690 695 700
Thr His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro
705 710 715 720
Ser Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu
725 730 735
Glu Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu
740 745 750
Asn Ser Val Ile Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu
755 760 765
Lys Val Thr Asp Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr
770 775 780
Gly Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile
785 790 795 800
Ser Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser Gln Pro
805 810 815
Ala Pro Ile Gln
820
<210> 92
<211> 816
<212> PRT
<213> S. pneumoniae
<400> 92
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Asp Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala Arg Ala Gln
145 150 155 160
Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175
Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asn His Phe
180 185 190
His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
Gln Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
210 215 220
Ser His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn
225 230 235 240
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
245 250 255
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val
260 265 270
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
275 280 285
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
290 295 300
Tyr Glu Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
305 310 315 320
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
325 330 335
Pro Ser Pro Gln Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro Gln
340 345 350
Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val
355 360 365
Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
370 375 380
Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
385 390 395 400
Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
405 410 415
Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
420 425 430
Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
435 440 445
Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
450 455 460
Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
465 470 475 480
Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
485 490 495
Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
500 505 510
Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp
515 520 525
Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile
530 535 540
Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr
545 550 555 560
Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Arg Asp Ser
565 570 575
Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys
580 585 590
Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr
595 600 605
Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His
610 615 620
Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala
625 630 635 640
Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr
645 650 655
Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn
660 665 670
Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp Ser Lys Pro
675 680 685
Asp Glu Asp Lys Gly His Asp Glu Val Ser Glu Pro Thr His Pro Glu
690 695 700
Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser Ala Asp Asn
705 710 715 720
Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu Ala Glu
725 730 735
Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu His Ser Val Ile
740 745 750
Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys Val Thr Asp
755 760 765
Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly Leu Lys Ser
770 775 780
Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala Glu Val
785 790 795 800
Asp Ser Leu Leu Ala Leu Leu Lys Lys Ser Gln Pro Ala Pro Ile Gln
805 810 815
<210> 93
<211> 816
<212> PRT
<213> S. pneumoniae
<400> 93
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Asp Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Arg Gln Glu His Ser His Asn
130 135 140
His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala Arg Ala Gln
145 150 155 160
Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175
Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asn His Phe
180 185 190
His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
Gln Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
210 215 220
Ser His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn
225 230 235 240
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
245 250 255
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val
260 265 270
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
275 280 285
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
290 295 300
Tyr Glu Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
305 310 315 320
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
325 330 335
Pro Ser Pro Gln Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro Gln
340 345 350
Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val
355 360 365
Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
370 375 380
Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
385 390 395 400
Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
405 410 415
Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
420 425 430
Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
435 440 445
Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
450 455 460
Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
465 470 475 480
Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
485 490 495
Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
500 505 510
Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp
515 520 525
Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile
530 535 540
Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr
545 550 555 560
Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser
565 570 575
Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys
580 585 590
Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr
595 600 605
Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His
610 615 620
Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala
625 630 635 640
Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr
645 650 655
Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn
660 665 670
Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp Ser Lys Pro
675 680 685
Asp Glu Asp Lys Gly His Asp Glu Val Ser Glu Pro Thr His Pro Glu
690 695 700
Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser Ala Asp Asn
705 710 715 720
Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu Ala Glu
725 730 735
Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu His Ser Val Ile
740 745 750
Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys Val Thr Asp
755 760 765
Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly Leu Lys Ser
770 775 780
Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala Glu Val
785 790 795 800
Asp Ser Leu Leu Ala Leu Leu Lys Lys Ser Gln Pro Ala Pro Ile Gln
805 810 815
<210> 94
<211> 816
<212> PRT
<213> S. pneumoniae
<400> 94
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Asp Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala Arg Ala Gln
145 150 155 160
Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175
Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro Arg Gly Asn His Phe
180 185 190
His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
Gln Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
210 215 220
Ser His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn
225 230 235 240
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
245 250 255
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg Arg Val
260 265 270
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
275 280 285
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
290 295 300
Tyr Glu Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
305 310 315 320
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
325 330 335
Pro Ser Pro Gln Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro Gln
340 345 350
Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val
355 360 365
Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
370 375 380
Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
385 390 395 400
Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
405 410 415
Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
420 425 430
Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
435 440 445
Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
450 455 460
Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
465 470 475 480
Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
485 490 495
Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
500 505 510
Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp
515 520 525
Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile
530 535 540
Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr
545 550 555 560
Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser
565 570 575
Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys
580 585 590
Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr
595 600 605
Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His
610 615 620
Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala
625 630 635 640
Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr
645 650 655
Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn
660 665 670
Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp Ser Lys Pro
675 680 685
Asp Glu Asp Lys Gly His Asp Glu Val Ser Glu Pro Thr His Pro Glu
690 695 700
Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser Ala Asp Asn
705 710 715 720
Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu Ala Glu
725 730 735
Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu His Ser Val Ile
740 745 750
Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys Val Thr Asp
755 760 765
Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly Leu Lys Ser
770 775 780
Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala Glu Val
785 790 795 800
Asp Ser Leu Leu Ala Leu Leu Lys Lys Ser Gln Pro Ala Pro Ile Gln
805 810 815
<210> 95
<211> 834
<212> PRT
<213> S. pneumoniae
<400> 95
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asp
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu Arg Ser His Asn
130 135 140
His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gln Gly
145 150 155 160
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile Ile
165 170 175
Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr His
180 185 190
Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Gln
195 200 205
Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser Ser
210 215 220
His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn Leu
225 230 235 240
Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser Ser
245 250 255
Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu
260 265 270
Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala
275 280 285
Asn Gly Val Ala Val Pro His Gly Asp His Tyr His Phe Ile Pro Tyr
290 295 300
Ser Gln Leu Ser Pro Leu Glu Glu Lys Leu Ala Arg Ile Ile Pro Leu
305 310 315 320
Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln Pro
325 330 335
Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro
340 345 350
Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys
355 360 365
Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly
370 375 380
Val Pro Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala
385 390 395 400
Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu
405 410 415
Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn
420 425 430
Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn
435 440 445
Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg
450 455 460
Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu
465 470 475 480
Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn
485 490 495
Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala
500 505 510
Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile
515 520 525
Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser
530 535 540
His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala
545 550 555 560
Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His
565 570 575
Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn
580 585 590
Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn
595 600 605
Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His
610 615 620
Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu
625 630 635 640
Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp Leu Leu Ala Thr Val
645 650 655
Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly
660 665 670
Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala
675 680 685
Asp Thr Asn Gln Thr Glu Lys Pro Asn Glu Glu Lys Pro Gln Thr Glu
690 695 700
Lys Pro Glu Glu Asp Lys Glu His Asp Glu Val Ser Glu Pro Thr His
705 710 715 720
Pro Glu Ser Asp Glu Lys Glu Asn His Val Gly Leu Asn Pro Ser Ala
725 730 735
Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu
740 745 750
Ala Glu Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu Tyr Ser
755 760 765
Val Ile Asn Ala Lys Ile Ala Glu Ala Glu Ala Leu Leu Glu Lys Val
770 775 780
Thr Asp Ser Ser Ile Arg Gln Asn Ala Val Glu Thr Leu Thr Gly Leu
785 790 795 800
Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala
805 810 815
Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser Gln Pro Ala Pro
820 825 830
Ile Gln
<210> 96
<211> 811
<212> PRT
<213> S. pneumoniae
<400> 96
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Asp Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala Arg Ala Gln
145 150 155 160
Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175
Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asn His Phe
180 185 190
His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
Gln Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
210 215 220
Ser His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn
225 230 235 240
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
245 250 255
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val
260 265 270
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
275 280 285
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
290 295 300
Tyr Glu Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
305 310 315 320
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
325 330 335
Pro Ser Pro Gln Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro Gln
340 345 350
Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val
355 360 365
Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
370 375 380
Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
385 390 395 400
Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
405 410 415
Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
420 425 430
Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
435 440 445
Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
450 455 460
Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
465 470 475 480
Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
485 490 495
Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
500 505 510
Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp
515 520 525
Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile
530 535 540
Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr
545 550 555 560
Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser
565 570 575
Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys
580 585 590
Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr
595 600 605
Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His
610 615 620
Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala
625 630 635 640
Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr
645 650 655
Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn
660 665 670
Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp Ser Lys Pro
675 680 685
Asp Glu Asp Lys Gly His Asp Glu Val Ser Glu Pro Thr His Pro Glu
690 695 700
Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser Ala Asp Asn
705 710 715 720
Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu Ala Glu
725 730 735
Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu His Ser Val Ile
740 745 750
Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys Val Thr Asp
755 760 765
Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly Leu Lys Ser
770 775 780
Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala Glu Val
785 790 795 800
Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser Lys
805 810
<210> 97
<211> 811
<212> PRT
<213> S. pneumoniae
<400> 97
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Asp Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110
Gly Lys Tyr Tyr Gly Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala Arg Ala Gln
145 150 155 160
Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175
Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asn His Phe
180 185 190
His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
Gln Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
210 215 220
Ser His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn
225 230 235 240
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
245 250 255
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val
260 265 270
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
275 280 285
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
290 295 300
Tyr Glu Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
305 310 315 320
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
325 330 335
Pro Ser Pro Gln Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro Gln
340 345 350
Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val
355 360 365
Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
370 375 380
Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
385 390 395 400
Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
405 410 415
Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
420 425 430
Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
435 440 445
Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
450 455 460
Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
465 470 475 480
Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
485 490 495
Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
500 505 510
Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp
515 520 525
Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile
530 535 540
Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr
545 550 555 560
Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser
565 570 575
Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys
580 585 590
Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr
595 600 605
Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His
610 615 620
Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala
625 630 635 640
Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr
645 650 655
Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn
660 665 670
Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp Ser Lys Pro
675 680 685
Asp Glu Asp Lys Gly His Asp Glu Val Ser Glu Pro Thr His Pro Glu
690 695 700
Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser Ala Asp Asn
705 710 715 720
Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu Ala Glu
725 730 735
Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu His Ser Val Ile
740 745 750
Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys Val Thr Asp
755 760 765
Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly Leu Lys Ser
770 775 780
Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala Glu Val
785 790 795 800
Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser Lys
805 810
<210> 98
<211> 811
<212> PRT
<213> S. pneumoniae
<400> 98
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Asp Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu His Ser His Asn
130 135 140
His Gly Gly Gly Ser Asn Asp Gln Ala Val Val Ala Ala Arg Ala Gln
145 150 155 160
Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile
165 170 175
Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asn His Phe
180 185 190
His Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala
195 200 205
Gln Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser
210 215 220
Ser His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn
225 230 235 240
Leu Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser
245 250 255
Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val
260 265 270
Glu Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr
275 280 285
Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro
290 295 300
Tyr Glu Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro
305 310 315 320
Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln
325 330 335
Pro Ser Pro Gln Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro Gln
340 345 350
Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val
355 360 365
Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg
370 375 380
Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp
385 390 395 400
Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Thr Lys
405 410 415
Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr
420 425 430
Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg
435 440 445
Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp
450 455 460
Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp Ile Leu Ala Phe Leu
465 470 475 480
Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gln Ile
485 490 495
Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr
500 505 510
Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp
515 520 525
Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile
530 535 540
Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr
545 550 555 560
Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser
565 570 575
Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys
580 585 590
Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr
595 600 605
Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His
610 615 620
Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala
625 630 635 640
Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr
645 650 655
Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn
660 665 670
Ala Ser Asp His Val Arg Lys Asn Lys Ala Asp Gln Asp Ser Lys Pro
675 680 685
Asp Glu Asp Lys Gly His Asp Glu Val Ser Glu Pro Thr His Pro Glu
690 695 700
Ser Asp Glu Lys Glu Asn His Ala Gly Leu Asn Pro Ser Ala Asp Asn
705 710 715 720
Leu Tyr Lys Pro Ser Thr Asp Thr Glu Glu Thr Glu Glu Glu Ala Glu
725 730 735
Asp Thr Thr Asp Glu Ala Glu Ile Pro Gln Val Glu His Ser Val Ile
740 745 750
Asn Ala Lys Ile Ala Asp Ala Glu Ala Leu Leu Glu Lys Val Thr Asp
755 760 765
Pro Ser Ile Arg Gln Asn Ala Met Glu Thr Leu Thr Gly Leu Lys Ser
770 775 780
Ser Leu Leu Leu Gly Thr Lys Asp Asn Asn Thr Ile Ser Ala Glu Val
785 790 795 800
Asp Ser Leu Leu Ala Leu Leu Lys Glu Ser Lys
805 810
<210> 99
<211> 811
<212> PRT
<213> S. pneumoniae
<400> 99
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asp
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu Arg Ser His Asn
130 135 140
His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gln Gly
145 150 155 160
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile Ile
165 170 175
Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr His
180 185 190
Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Gln
195 200 205
Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser Ser
210 215 220
His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn Leu
225 230 235 240
Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser Ser
245 250 255
Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu
260 265 270
Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala
275 280 285
Asn Gly Val Ala Val Pro His Gly Asp His Tyr His Phe Ile Pro Tyr
290 295 300
Ser Gln Leu Ser Pro Leu Glu Glu Lys Leu Ala Arg Ile Ile Pro Leu
305 310 315 320
Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln Pro
325 330 335
Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro
340 345 350
Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys
355 360 365
Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly
370 375 380
Val Pro Arg Tyr Ile Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala
385 390 395 400
Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu
405 410 415
Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn
420 425 430
Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn
435 440 445
Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg
450 455 460
Leu Lys Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu
465 470 475 480
Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn
485 490 495
Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala
500 505 510
Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile
515 520 525
Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser
530 535 540
His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala
545 550 555 560
Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His
565 570 575
Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn
580 585 590
Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn
595 600 605
Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His
610 615 620
Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu
625 630 635 640
Tyr Glu Ala Pro Lys Gly Tyr Ser Leu Glu Asp Leu Leu Ala Thr Val
645 650 655
Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly
660 665 670
Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala
675 680 685
Asp Thr Asn Gln Thr Glu Lys Pro Asn Glu Glu Lys Pro Gln Thr Glu
690 695 700
Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys
705 710 715 720
Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu
725 730 735
Glu Ser Pro Glu Glu Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val
740 745 750
Lys Glu Lys Leu Arg Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln Asn
755 760 765
Pro Ile Ile Lys Ser Asn Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn
770 775 780
Asn Leu Leu Phe Gly Thr Gln Asp Asn Asn Thr Ile Met Ala Glu Ala
785 790 795 800
Glu Lys Leu Leu Ala Leu Leu Lys Glu Ser Lys
805 810
<210> 100
<211> 805
<212> PRT
<213> S. pneumoniae
<400> 100
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Asp Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ala Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu Arg Ser His Asn
130 135 140
His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gln Gly
145 150 155 160
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile Ile
165 170 175
Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His Tyr His
180 185 190
Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala Glu
195 200 205
Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser Ser
210 215 220
Tyr Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn Leu
225 230 235 240
Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser Ser
245 250 255
Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu
260 265 270
Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala
275 280 285
Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr
290 295 300
Glu Gln Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu
305 310 315 320
Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu Pro
325 330 335
Ser Pro Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro
340 345 350
Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala Val Arg Lys Val
355 360 365
Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg Tyr Ile Pro
370 375 380
Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly Ile Asp Ser Lys Leu
385 390 395 400
Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Ala Lys Lys Thr Asp
405 410 415
Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu
420 425 430
Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly Arg Gln Val Asp
435 440 445
Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp Val Ser Ser
450 455 460
Asp Lys Val Lys Leu Val Asp Asp Ile Leu Ala Phe Leu Ala Pro Ile
465 470 475 480
Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln Ile Thr Tyr Thr
485 490 495
Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu
500 505 510
Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser Asp Glu Gly Asp
515 520 525
Ala Tyr Val Thr Pro His Met Thr His Ser His Trp Ile Lys Lys Asp
530 535 540
Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala Tyr Ala Lys Glu
545 550 555 560
Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp Ser Gly Asn Thr
565 570 575
Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val Lys Ala Ala Lys
580 585 590
Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln Tyr Thr Val Glu
595 600 605
Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp His Tyr His Asn
610 615 620
Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly
625 630 635 640
Tyr Ser Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr Val Glu His
645 650 655
Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp
660 665 670
His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr Asn Gln Thr Glu
675 680 685
Lys Pro Asn Glu Glu Lys Pro Gln Thr Glu Lys Pro Glu Glu Glu Thr
690 695 700
Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu Ser Pro Lys Pro
705 710 715 720
Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu Glu Ser Pro Glu Glu Ser
725 730 735
Glu Glu Pro Gln Val Glu Thr Glu Lys Val Lys Glu Lys Leu Arg Glu
740 745 750
Ala Glu Asp Leu Leu Gly Lys Ile Gln Asn Pro Ile Ile Lys Ser Asn
755 760 765
Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr
770 775 780
Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu
785 790 795 800
Leu Lys Glu Ser Lys
805
<210> 101
<211> 807
<212> PRT
<213> S. pneumoniae
<400> 101
Cys Ser Tyr Glu Leu Gly Arg His Gln Ala Gly Gln Val Lys Lys Glu
1 5 10 15
Ser Asn Arg Val Ser Tyr Ile Asp Gly Asp Gln Ala Gly Gln Lys Ala
20 25 30
Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala
35 40 45
Glu Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His
50 55 60
Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile
65 70 75 80
Ile Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp
85 90 95
Ser Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asp
100 105 110
Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Ile
115 120 125
Arg Thr Lys Glu Glu Ile Lys Arg Gln Lys Gln Glu Arg Ser His Asn
130 135 140
His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gln Gly
145 150 155 160
Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp Ile Ile
165 170 175
Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asn His Phe His
180 185 190
Tyr Ile Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala Ala Gln
195 200 205
Ala Tyr Trp Asn Gly Lys Gln Gly Ser Arg Pro Ser Ser Ser Ser Ser
210 215 220
His Asn Ala Asn Pro Ala Gln Pro Arg Leu Ser Glu Asn His Asn Leu
225 230 235 240
Thr Val Thr Pro Thr Tyr His Gln Asn Gln Gly Glu Asn Ile Ser Ser
245 250 255
Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg His Val Glu
260 265 270
Ser Asp Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala
275 280 285
Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr
290 295 300
Ser Gln Met Ser Glu Leu Glu Glu Arg Ile Ala Arg Ile Ile Pro Leu
305 310 315 320
Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gln Pro
325 330 335
Ser Pro Gln Ser Thr Pro Glu Pro Ser Pro Ser Pro Gln Ser Ala Pro
340 345 350
Asn Pro Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys
355 360 365
Glu Val Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Lys Asn Gly
370 375 380
Val Ser Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala Ala
385 390 395 400
Gly Ile Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu
405 410 415
Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn
420 425 430
Lys Ala Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn
435 440 445
Lys Gly Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg
450 455 460
Leu Glu Asp Val Pro Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu
465 470 475 480
Ala Phe Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn
485 490 495
Ala Gln Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala
500 505 510
Gly Lys Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile
515 520 525
Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser
530 535 540
His Trp Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala
545 550 555 560
Gln Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His
565 570 575
Gln Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn
580 585 590
Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn
595 600 605
Leu Gln Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His
610 615 620
Tyr Asp His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu
625 630 635 640
Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val
645 650 655
Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly
660 665 670
Phe Gly Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala
675 680 685
Asp Thr Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu
690 695 700
Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys
705 710 715 720
Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu
725 730 735
Glu Ser Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu
740 745 750
Arg Glu Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys
755 760 765
Ser Asn Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe
770 775 780
Gly Thr Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu
785 790 795 800
Ala Leu Leu Lys Glu Ser Lys
805
<210> 102
<211> 821
<212> PRT
<213> S. pneumoniae
<400> 102
Cys Ala Tyr Glu Leu Gly Leu His Gln Ala Gln Thr Val Lys Glu Asn
1 5 10 15
Asn Arg Val Ser Tyr Ile Asp Gly Lys Gln Ala Thr Gln Lys Thr Glu
20 25 30
Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly Ile Asn Ala Glu
35 40 45
Gln Ile Val Ile Lys Ile Thr Asp Gln Gly Tyr Val Thr Ser His Gly
50 55 60
Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Ile Ile
65 70 75 80
Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gln Leu Lys Asp Ser
85 90 95
Asp Ile Val Asn Glu Ile Lys Gly Gly Tyr Val Ile Lys Val Asn Gly
100 105 110
Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg
115 120 125
Thr Lys Glu Glu Ile Asn Arg Gln Lys Gln Glu His Ser Gln His Arg
130 135 140
Glu Gly Gly Thr Ser Ala Asn Asp Gly Ala Val Ala Phe Ala Arg Ser
145 150 155 160
Gln Gly Arg Tyr Thr Thr Asp Asp Gly Tyr Ile Phe Asn Ala Ser Asp
165 170 175
Ile Ile Glu Asp Thr Gly Asp Ala Tyr Ile Val Pro His Gly Asp His
180 185 190
Tyr His Tyr Ile Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala
195 200 205
Ala Glu Ala Phe Leu Ser Gly Arg Glu Asn Leu Ser Asn Leu Arg Thr
210 215 220
Tyr Arg Arg Gln Asn Ser Asp Asn Thr Pro Arg Thr Asn Trp Val Pro
225 230 235 240
Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser
245 250 255
Asn Thr Asn Ser Gln Ala Ser Gln Ser Asn Asp Ile Asp Ser Leu Leu
260 265 270
Lys Gln Leu Tyr Lys Leu Pro Leu Ser Gln Arg His Val Glu Ser Asp
275 280 285
Gly Leu Ile Phe Asp Pro Ala Gln Ile Thr Ser Arg Thr Ala Arg Gly
290 295 300
Val Ala Val Pro His Gly Asn His Tyr His Phe Ile Pro Tyr Glu Gln
305 310 315 320
Met Ser Glu Leu Glu Lys Arg Ile Ala Arg Ile Ile Pro Leu Arg Tyr
325 330 335
Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Glu Pro Ser Pro
340 345 350
Gln Pro Thr Pro Glu Pro Ser Pro Ser Pro Gln Pro Ala Pro Asn Pro
355 360 365
Gln Pro Ala Pro Ser Asn Pro Ile Asp Glu Lys Leu Val Lys Glu Ala
370 375 380
Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser
385 390 395 400
Arg Tyr Ile Pro Ala Lys Asn Leu Ser Ala Glu Thr Ala Ala Gly Ile
405 410 415
Asp Ser Lys Leu Ala Lys Gln Glu Ser Leu Ser His Lys Leu Gly Ala
420 425 430
Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala
435 440 445
Tyr Asp Leu Leu Ala Arg Ile His Gln Asp Leu Leu Asp Asn Lys Gly
450 455 460
Arg Gln Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys
465 470 475 480
Asp Val Ser Ser Asp Lys Val Lys Leu Val Asp Asp Ile Leu Ala Phe
485 490 495
Leu Ala Pro Ile Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gln
500 505 510
Ile Thr Tyr Thr Asp Asp Glu Ile Gln Val Ala Lys Leu Ala Gly Lys
515 520 525
Tyr Thr Thr Glu Asp Gly Tyr Ile Phe Asp Pro Arg Asp Ile Thr Ser
530 535 540
Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp
545 550 555 560
Ile Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gln Ala
565 570 575
Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gln Asp
580 585 590
Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala Ile Tyr Asn Arg Val
595 600 605
Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gln
610 615 620
Tyr Thr Val Glu Val Lys Asn Gly Ser Leu Ile Ile Pro His Tyr Asp
625 630 635 640
His Tyr His Asn Ile Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu
645 650 655
Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr
660 665 670
Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly
675 680 685
Asn Ala Ser Asp His Val Gln Arg Asn Lys Asn Gly Gln Ala Asp Thr
690 695 700
Asn Gln Thr Glu Lys Pro Ser Glu Glu Lys Pro Gln Thr Glu Lys Pro
705 710 715 720
Glu Glu Glu Thr Pro Arg Glu Glu Lys Pro Gln Ser Glu Lys Pro Glu
725 730 735
Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Glu Ser Pro Glu Glu Ser
740 745 750
Glu Glu Pro Gln Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu
755 760 765
Ala Glu Asp Leu Leu Gly Lys Ile Gln Asp Pro Ile Ile Lys Ser Asn
770 775 780
Ala Lys Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr
785 790 795 800
Gln Asp Asn Asn Thr Ile Met Ala Glu Ala Glu Lys Leu Leu Ala Leu
805 810 815
Leu Lys Glu Ser Lys
820
Claims (42)
- 서열번호 3 또는 서열번호 13에 나타난 뉴클레오타이드 서열과 적어도 95% 동일성을 갖는 뉴클레오타이드 서열을 갖는 폴리뉴클레오타이드로서, 상기 폴리뉴클레오타이드는 서열번호 4 또는 서열번호 14에 나타난 아미노산 서열로 이루어진 폴리펩타이드에 특이적으로 결합하는 항체를 유도할 수 있는 스트렙토코커스 뉴모니애 폴리펩타이드를 코딩하는 것을 특징으로 하는 폴리뉴클레오타이드.
- 제1항에 있어서, 상기 암호화된 스트렙토코커스 폴리펩타이드는 개체에서 항-스트렙토코커스 면역 반응을 유도할 수 있는 것을 특징으로 하는 폴리뉴클레오타이드.
- 제2항에 있어서, 상기 항-스트렙토코커스 반응은 항-스트렙토코커스 뉴모니애 면역 반응인 것을 특징으로 하는 폴리뉴클레오타이드.
- 삭제
- 삭제
- 제1항 내지 제3항 중 어느 한 항에 있어서, 상기 폴리뉴클레오타이드는 DNA인 것을 특징으로 하는 폴리뉴클레오타이드.
- 제1항 내지 제3항 중 어느 한 항에 있어서, 상기 폴리뉴클레오타이드는 RNA인 것을 특징으로 하는 폴리뉴클레오타이드.
- 제1항 내지 제3항 중 어느 한 항의 폴리뉴클레오타이드에 상보적인 폴리뉴클레오타이드.
- 제1항 내지 제3항 중 어느 한 항의 폴리뉴클레오타이드를 포함하는 벡터로서, 상기 폴리뉴클레오타이드가 발현 조절 영역에 조작적으로 연결되는 것을 특징으로 하는 벡터.
- 제9항에 따른 벡터로 형질전환된 숙주 세포.
- 제10항에 있어서, 상기 숙주 세포는 박테리아 세포인 것을 특징으로 하는 숙주 세포.
- 프로모터를 포함하는 발현 조절 영역을 활성화하기 위해 변형된 영양 배지에서 숙주 세포를 배양하는 것을 포함하는 폴리펩타이드의 발현에 적합한 조건 하에서 제10항에 따른 숙주 세포를 배양하는 것을 포함하는 스트렙토코커스 폴리펩타이드의 제조 방법.
- 제12항에 있어서, 상기 숙주 세포는 박테리아 세포인 것을 특징으로 하는 폴리펩타이드의 제조 방법.
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11380098P | 1998-12-23 | 1998-12-23 | |
US60/113,800 | 1998-12-23 | ||
PCT/CA1999/001218 WO2000039299A2 (en) | 1998-12-23 | 1999-12-20 | Streptococcus antigens |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020067025282A Division KR100891398B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020107009285A Division KR101078919B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20080036666A KR20080036666A (ko) | 2008-04-28 |
KR101170203B1 true KR101170203B1 (ko) | 2012-07-31 |
Family
ID=22351610
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020107009285A KR101078919B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
KR1020017007963A KR100802198B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
KR1020087008264A KR101170203B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
KR1020067025282A KR100891398B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020107009285A KR101078919B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
KR1020017007963A KR100802198B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020067025282A KR100891398B1 (ko) | 1998-12-23 | 1999-12-20 | 신규한 스트렙토코커스 항원 |
Country Status (30)
Country | Link |
---|---|
EP (2) | EP1141306B1 (ko) |
JP (2) | JP4761623B2 (ko) |
KR (4) | KR101078919B1 (ko) |
CN (3) | CN100398653C (ko) |
AP (1) | AP2001002199A0 (ko) |
AR (1) | AR029322A1 (ko) |
AT (1) | ATE394489T1 (ko) |
AU (1) | AU1764900A (ko) |
BR (1) | BR9916477A (ko) |
CA (1) | CA2356836C (ko) |
CL (1) | CL2009002037A1 (ko) |
CY (3) | CY1108223T1 (ko) |
CZ (2) | CZ303675B6 (ko) |
DE (1) | DE69938670D1 (ko) |
DK (3) | DK2261358T3 (ko) |
EA (1) | EA007409B1 (ko) |
ES (3) | ES2480417T3 (ko) |
HK (1) | HK1118575A1 (ko) |
HU (1) | HU229664B1 (ko) |
IL (3) | IL143905A0 (ko) |
MX (1) | MXPA01006427A (ko) |
NO (1) | NO330800B1 (ko) |
NZ (1) | NZ512574A (ko) |
OA (1) | OA11736A (ko) |
PL (3) | PL205041B1 (ko) |
PT (3) | PT2261358E (ko) |
TR (2) | TR200200633T2 (ko) |
UY (1) | UY25877A1 (ko) |
WO (1) | WO2000039299A2 (ko) |
ZA (1) | ZA200105114B (ko) |
Families Citing this family (63)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CZ301056B6 (cs) | 1998-02-20 | 2009-10-29 | Id Biomedical Corporation | Antigeny streptokoku skupiny B |
EP1801218A3 (en) * | 1998-07-27 | 2007-10-10 | Sanofi Pasteur Limited | Nucleic acids and proteins from streptococcus pneumoniae |
US20030134407A1 (en) | 1998-07-27 | 2003-07-17 | Le Page Richard William Falla | Nucleic acids and proteins from Streptococcus pneumoniae |
CA2355364C (en) * | 1998-12-21 | 2014-03-18 | Medimmune, Inc. | Streptococcus pneumoniae proteins and immunogenic fragments for vaccines |
US7128918B1 (en) | 1998-12-23 | 2006-10-31 | Id Biomedical Corporation | Streptococcus antigens |
CZ303675B6 (cs) * | 1998-12-23 | 2013-02-27 | Id Biomedical Corporation Of Quebec | Izolovaný polynukleotid, vektor, hostitelská bunka, zpusob produkce, izolovaný polypeptid, chimérový polypeptid, vakcinacní prostredek a pouzití |
EP1294771B1 (en) | 2000-06-12 | 2008-10-29 | University Of Saskatchewan | Chimeric GapC protein from Streptococcus and its use in vaccination and diagnosis |
US6833134B2 (en) | 2000-06-12 | 2004-12-21 | University Of Saskacthewan | Immunization of dairy cattle with GapC protein against Streptococcus infection |
US6866855B2 (en) | 2000-06-12 | 2005-03-15 | University Of Saskatchewan | Immunization of dairy cattle with GapC protein against Streptococcus infection |
JP5051959B2 (ja) * | 2000-06-20 | 2012-10-17 | アイディー バイオメディカル コーポレイション オブ ケベック | ストレプトコッカス抗原 |
US7074415B2 (en) | 2000-06-20 | 2006-07-11 | Id Biomedical Corporation | Streptococcus antigens |
WO2002079475A2 (en) * | 2001-03-30 | 2002-10-10 | Shire Biochem Inc. | Streptococcus pyogenes antigens and corresponding dna fragments |
AU2002317107B2 (en) | 2001-07-06 | 2008-06-12 | Id Biomedical Corporation | Group B streptococcus antigens and corresponding DNA fragments |
GB0130228D0 (en) * | 2001-12-18 | 2002-02-06 | Hansa Medica Ab | Protein |
EP1456231A2 (en) | 2001-12-20 | 2004-09-15 | Shire Biochem Inc. | Streptococcus antigens |
EP2287313A1 (en) * | 2003-03-04 | 2011-02-23 | Intercell AG | Streptococcus pyogenes antigens |
AU2004230244B2 (en) | 2003-04-15 | 2011-09-22 | Intercell Ag | S. pneumoniae antigens |
CA2816182C (en) | 2005-12-22 | 2018-02-20 | Glaxosmithkline Biologicals S.A. | Pneumococcal polysaccharide conjugate vaccine |
GB0607088D0 (en) | 2006-04-07 | 2006-05-17 | Glaxosmithkline Biolog Sa | Vaccine |
KR101579947B1 (ko) | 2007-06-26 | 2015-12-28 | 글락소스미스클라인 바이오로지칼즈 에스.에이. | 스트렙토코쿠스 뉴모니애 캡슐 다당류 컨쥬게이트를 포함하는 백신 |
EP2183366B1 (en) * | 2007-07-23 | 2012-10-24 | Sanofi Pasteur Limited | Immunogenic polypeptides and monoclonal antibodies |
MX2010011412A (es) | 2008-04-16 | 2010-11-12 | Glaxosmithkline Biolog Sa | Vacuna. |
US20100092526A1 (en) | 2008-09-26 | 2010-04-15 | Nanobio Corporation | Nanoemulsion therapeutic compositions and methods of using the same |
RU2536248C2 (ru) | 2009-04-30 | 2014-12-20 | Коули Фармасьютикал Груп, Инк. | Пневмококковая вакцина и ее применения |
WO2010132833A1 (en) | 2009-05-14 | 2010-11-18 | The Regents Of The University Of Michigan | Streptococcus vaccine compositions and methods of using the same |
PE20161551A1 (es) | 2009-09-03 | 2017-01-18 | Pfizer Vaccines Llc | Vacuna de pcsk9 |
GB201003920D0 (en) * | 2010-03-09 | 2010-04-21 | Glaxosmithkline Biolog Sa | Method of treatment |
BR112013013702A2 (pt) | 2010-12-03 | 2016-09-13 | Sanofi Pasteur Ltd | composição para imunização contra streptococcus pneumoniae |
CA2861313A1 (en) | 2011-01-20 | 2012-07-26 | Genocea Biosciences, Inc. | Vaccines and compositions against streptococcus pneumoniae |
WO2012131504A1 (en) | 2011-03-02 | 2012-10-04 | Pfizer Inc. | Pcsk9 vaccine |
CN103533953A (zh) | 2011-05-17 | 2014-01-22 | 葛兰素史密丝克莱恩生物有限公司 | 针对肺炎链球菌的疫苗 |
AU2013207191B2 (en) * | 2012-01-05 | 2017-10-26 | Deutsches Krebsforschungszentrum Stiftung Des Offentlichen Rechts | Means and methods for treating or diagnosing IDH1 R132H mutant-positive cancers |
CA2894903A1 (en) * | 2012-12-14 | 2014-06-19 | Sanofi Pasteur, Ltd. | Methods for assessing immunogenicity |
CN114887048A (zh) | 2014-01-21 | 2022-08-12 | 辉瑞公司 | 包含缀合荚膜糖抗原的免疫原性组合物及其用途 |
US11160855B2 (en) | 2014-01-21 | 2021-11-02 | Pfizer Inc. | Immunogenic compositions comprising conjugated capsular saccharide antigens and uses thereof |
PT3096783T (pt) | 2014-01-21 | 2021-08-16 | Pfizer | Polissacáridos capsulares de streptococcus pneumoniae e conjugados dos mesmos |
US20160324949A1 (en) | 2014-01-21 | 2016-11-10 | Pfizer Inc. | Streptococcus pneumoniae capsular polysaccharides and conjugates thereof |
EP3104886B1 (en) | 2014-02-14 | 2018-10-17 | Pfizer Inc | Immunogenic glycoprotein conjugates |
BR112017013891B1 (pt) | 2015-01-15 | 2024-01-30 | Pfizer Inc | Composições imunogênicas para uso em vacinas pneumocócicas |
WO2017013548A1 (en) | 2015-07-21 | 2017-01-26 | Pfizer Inc. | Immunogenic compositions comprising conjugated capsular saccharide antigens, kits comprising the same and uses thereof |
GB201518684D0 (en) | 2015-10-21 | 2015-12-02 | Glaxosmithkline Biolog Sa | Vaccine |
CA3005524C (en) | 2015-11-20 | 2023-10-10 | Pfizer Inc. | Immunogenic compositions for use in pneumococcal vaccines |
GB201610599D0 (en) | 2016-06-17 | 2016-08-03 | Glaxosmithkline Biologicals Sa | Immunogenic Composition |
HRP20220573T1 (hr) | 2017-01-20 | 2022-06-10 | Pfizer Inc. | Imunogeni pripravci, namijenjeni upotrebi u pneumokoknim cjepivima |
US11260119B2 (en) | 2018-08-24 | 2022-03-01 | Pfizer Inc. | Escherichia coli compositions and methods thereof |
WO2020121159A1 (en) | 2018-12-12 | 2020-06-18 | Pfizer Inc. | Immunogenic multiple hetero-antigen polysaccharide-protein conjugates and uses thereof |
JP7239509B6 (ja) | 2019-02-22 | 2023-03-28 | ファイザー・インク | 細菌多糖類を精製するための方法 |
JP2022528158A (ja) | 2019-04-10 | 2022-06-08 | ファイザー・インク | コンジュゲート化莢膜糖抗原を含む免疫原性組成物、それを含むキットおよびその使用 |
KR20220042378A (ko) | 2019-07-31 | 2022-04-05 | 사노피 파스퇴르 인코포레이티드 | 다가 폐렴구균 다당류-단백질 접합체 조성물 및 그 사용 방법 |
JP2021087420A (ja) | 2019-11-01 | 2021-06-10 | ファイザー・インク | Escherichia coli組成物およびその方法 |
AU2021224078B2 (en) | 2020-02-21 | 2024-01-18 | Pfizer Inc. | Purification of saccharides |
AU2021223184A1 (en) | 2020-02-23 | 2022-08-18 | Pfizer Inc. | Escherichia coli compositions and methods thereof |
US20230383324A1 (en) | 2020-10-22 | 2023-11-30 | Pfizer Inc. | Methods for purifying bacterial polysaccharides |
CA3199610A1 (en) | 2020-10-27 | 2022-05-05 | Pfizer Inc. | Escherichia coli compositions and methods thereof |
IL302413A (en) | 2020-11-04 | 2023-06-01 | Pfizer | Immunogenic preparations for use in pneumococcal vaccines |
US20220202923A1 (en) | 2020-12-23 | 2022-06-30 | Pfizer Inc. | E. coli fimh mutants and uses thereof |
WO2022234416A1 (en) | 2021-05-03 | 2022-11-10 | Pfizer Inc. | Vaccination against pneumoccocal and covid-19 infections |
CA3218544A1 (en) | 2021-05-03 | 2022-11-10 | Pfizer Inc. | Vaccination against bacterial and betacoronavirus infections |
CA3221075A1 (en) | 2021-05-28 | 2022-12-01 | Pfizer Inc. | Immunogenic compositions comprising conjugated capsular saccharide antigens and uses thereof |
CA3221074A1 (en) | 2021-05-28 | 2022-12-01 | Pfizer Inc. | Immunogenic compositions comprising conjugated capsular saccharide antigens and uses thereof |
WO2023135515A1 (en) | 2022-01-13 | 2023-07-20 | Pfizer Inc. | Immunogenic compositions comprising conjugated capsular saccharide antigens and uses thereof |
WO2023161817A1 (en) | 2022-02-25 | 2023-08-31 | Pfizer Inc. | Methods for incorporating azido groups in bacterial capsular polysaccharides |
WO2023218322A1 (en) | 2022-05-11 | 2023-11-16 | Pfizer Inc. | Process for producing of vaccine formulations with preservatives |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1998018930A3 (en) * | 1996-10-31 | 1998-10-08 | Human Genome Sciences Inc | Streptococcus pneumoniae antigens and vaccines |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4425437A (en) | 1979-11-05 | 1984-01-10 | Genentech, Inc. | Microbial polypeptide expression vehicle |
US4431739A (en) | 1979-11-05 | 1984-02-14 | Genentech, Inc. | Transformant bacterial culture capable of expressing heterologous protein |
US4338397A (en) | 1980-04-11 | 1982-07-06 | President And Fellows Of Harvard College | Mature protein synthesis |
SI0656014T1 (en) * | 1993-03-19 | 2003-12-31 | Gunnar Lindahl | Protein rib, a cell surface protein that confers immunity to many strains of the group b streptococcus; process for purification of the protein, reagent kit and pharmaceutical composition |
EA199800046A1 (ru) * | 1995-06-07 | 1998-06-25 | Байокем Вэксинс Инк. | Полипептид, последовательность днк, вакцинная композиция (варианты), антитело или его фрагмент, вакцина, применения указанных полипептида, последовательности днк и антитела или его фрагмента |
US5882896A (en) * | 1996-09-24 | 1999-03-16 | Smithkline Beecham Corporation | M protein |
US5928895A (en) * | 1996-09-24 | 1999-07-27 | Smithkline Beecham Corporation | IgA Fc binding protein |
AU9510598A (en) * | 1997-09-24 | 1999-04-12 | American Cyanamid Company | Human complement c3-degrading proteinase from (streptococcus pneumoniae) |
EP1073450A4 (en) * | 1998-04-23 | 2003-04-23 | Uab Research Foundation | PNEUMOCOCCAL SURFACE PROTEIN C (PSPC), EPITOPIC REGIONS, SELECTION OF CORRESPONDING STRES AND USES |
CN1318103A (zh) * | 1998-07-27 | 2001-10-17 | 微生物技术有限公司 | 肺炎链球菌的核酸和蛋白质 |
ATE361365T1 (de) * | 1998-07-27 | 2007-05-15 | Sanofi Pasteur Ltd | Streptococcus pneumoniae proteine und nukleinsäuren |
AU6060899A (en) * | 1998-09-24 | 2000-04-10 | American Cyanamid Company | Human complement c3-degrading polypeptide from (streptococcus pneumoniae) |
CA2355364C (en) * | 1998-12-21 | 2014-03-18 | Medimmune, Inc. | Streptococcus pneumoniae proteins and immunogenic fragments for vaccines |
CZ303675B6 (cs) * | 1998-12-23 | 2013-02-27 | Id Biomedical Corporation Of Quebec | Izolovaný polynukleotid, vektor, hostitelská bunka, zpusob produkce, izolovaný polypeptid, chimérový polypeptid, vakcinacní prostredek a pouzití |
-
1999
- 1999-12-20 CZ CZ20012161A patent/CZ303675B6/cs not_active IP Right Cessation
- 1999-12-20 KR KR1020107009285A patent/KR101078919B1/ko not_active IP Right Cessation
- 1999-12-20 ES ES10180372.4T patent/ES2480417T3/es not_active Expired - Lifetime
- 1999-12-20 DK DK10180372.4T patent/DK2261358T3/da active
- 1999-12-20 PL PL382437A patent/PL205041B1/pl unknown
- 1999-12-20 IL IL14390599A patent/IL143905A0/xx unknown
- 1999-12-20 TR TR2002/00633T patent/TR200200633T2/xx unknown
- 1999-12-20 AU AU17649/00A patent/AU1764900A/en not_active Abandoned
- 1999-12-20 EA EA200100565A patent/EA007409B1/ru not_active IP Right Cessation
- 1999-12-20 OA OA1200100165A patent/OA11736A/en unknown
- 1999-12-20 JP JP2000591190A patent/JP4761623B2/ja not_active Expired - Fee Related
- 1999-12-20 WO PCT/CA1999/001218 patent/WO2000039299A2/en active Application Filing
- 1999-12-20 PL PL382438A patent/PL204073B1/pl not_active IP Right Cessation
- 1999-12-20 MX MXPA01006427A patent/MXPA01006427A/es not_active IP Right Cessation
- 1999-12-20 NZ NZ512574A patent/NZ512574A/xx not_active IP Right Cessation
- 1999-12-20 PT PT101803724T patent/PT2261358E/pt unknown
- 1999-12-20 ES ES99960748T patent/ES2306528T3/es not_active Expired - Lifetime
- 1999-12-20 EP EP99960748A patent/EP1141306B1/en not_active Expired - Lifetime
- 1999-12-20 PT PT99960748T patent/PT1141306E/pt unknown
- 1999-12-20 PT PT81553174T patent/PT1950302E/pt unknown
- 1999-12-20 KR KR1020017007963A patent/KR100802198B1/ko not_active IP Right Cessation
- 1999-12-20 DK DK08155317.4T patent/DK1950302T3/da active
- 1999-12-20 DK DK99960748T patent/DK1141306T3/da active
- 1999-12-20 CN CNB2005100037492A patent/CN100398653C/zh not_active Expired - Fee Related
- 1999-12-20 ES ES08155317T patent/ES2400280T3/es not_active Expired - Lifetime
- 1999-12-20 BR BR9916477-9A patent/BR9916477A/pt active Search and Examination
- 1999-12-20 AT AT99960748T patent/ATE394489T1/de active
- 1999-12-20 EP EP10180372.4A patent/EP2261358B1/en not_active Expired - Lifetime
- 1999-12-20 HU HU0104774A patent/HU229664B1/hu not_active IP Right Cessation
- 1999-12-20 CZ CZ20100674A patent/CZ302790B6/cs not_active IP Right Cessation
- 1999-12-20 TR TR2001/02497T patent/TR200102497T2/xx unknown
- 1999-12-20 AP APAP/P/2001/002199A patent/AP2001002199A0/en unknown
- 1999-12-20 KR KR1020087008264A patent/KR101170203B1/ko not_active IP Right Cessation
- 1999-12-20 CA CA2356836A patent/CA2356836C/en not_active Expired - Fee Related
- 1999-12-20 CN CNB998149047A patent/CN1191362C/zh not_active Expired - Fee Related
- 1999-12-20 DE DE69938670T patent/DE69938670D1/de not_active Expired - Lifetime
- 1999-12-20 PL PL349777A patent/PL206576B1/pl unknown
- 1999-12-20 KR KR1020067025282A patent/KR100891398B1/ko not_active IP Right Cessation
- 1999-12-20 CN CNA2007101481600A patent/CN101134775A/zh active Pending
- 1999-12-23 UY UY25877A patent/UY25877A1/es unknown
- 1999-12-23 AR ARP990106746A patent/AR029322A1/es not_active Application Discontinuation
-
2001
- 2001-06-19 NO NO20013045A patent/NO330800B1/no not_active IP Right Cessation
- 2001-06-21 ZA ZA200105114A patent/ZA200105114B/en unknown
- 2001-06-21 IL IL143905A patent/IL143905A/en not_active IP Right Cessation
-
2008
- 2008-07-29 CY CY20081100787T patent/CY1108223T1/el unknown
- 2008-09-10 HK HK08110047.4A patent/HK1118575A1/xx not_active IP Right Cessation
- 2008-09-21 IL IL194230A patent/IL194230A/en not_active IP Right Cessation
-
2009
- 2009-11-05 CL CL2009002037A patent/CL2009002037A1/es unknown
-
2010
- 2010-02-09 JP JP2010026521A patent/JP5039802B2/ja not_active Expired - Fee Related
-
2012
- 2012-12-27 CY CY20121101268T patent/CY1114722T1/el unknown
-
2014
- 2014-07-16 CY CY20141100536T patent/CY1115340T1/el unknown
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1998018930A3 (en) * | 1996-10-31 | 1998-10-08 | Human Genome Sciences Inc | Streptococcus pneumoniae antigens and vaccines |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101170203B1 (ko) | 신규한 스트렙토코커스 항원 | |
JP5051959B2 (ja) | ストレプトコッカス抗原 | |
US8211437B2 (en) | Streptococcus antigens | |
US7262024B2 (en) | Streptococcus antigens | |
KR100771148B1 (ko) | 그룹 b 스트렙토코커스 항원 | |
US20060177465A1 (en) | Streptococcus antigens | |
EP1950302B1 (en) | Streptococcus antigens | |
AU2008229967B2 (en) | Novel streptococcus antigens |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A107 | Divisional application of patent | ||
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E902 | Notification of reason for refusal | ||
N231 | Notification of change of applicant | ||
E902 | Notification of reason for refusal | ||
A107 | Divisional application of patent | ||
E601 | Decision to refuse application | ||
J201 | Request for trial against refusal decision | ||
J301 | Trial decision |
Free format text: TRIAL DECISION FOR APPEAL AGAINST DECISION TO DECLINE REFUSAL REQUESTED 20100930 Effective date: 20120328 Free format text: TRIAL NUMBER: 2010101007573; TRIAL DECISION FOR APPEAL AGAINST DECISION TO DECLINE REFUSAL REQUESTED 20100930 Effective date: 20120328 |
|
S901 | Examination by remand of revocation | ||
GRNO | Decision to grant (after opposition) | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20160629 Year of fee payment: 5 |
|
LAPS | Lapse due to unpaid annual fee |