US20040115211A1 - TB diagnostic based on antigens from M. tuberculosis - Google Patents
TB diagnostic based on antigens from M. tuberculosis Download PDFInfo
- Publication number
- US20040115211A1 US20040115211A1 US10/620,246 US62024603A US2004115211A1 US 20040115211 A1 US20040115211 A1 US 20040115211A1 US 62024603 A US62024603 A US 62024603A US 2004115211 A1 US2004115211 A1 US 2004115211A1
- Authority
- US
- United States
- Prior art keywords
- seq
- polypeptide
- amino acid
- sequences
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 201000008827 tuberculosis Diseases 0.000 title claims abstract description 106
- 239000000427 antigen Substances 0.000 title claims description 147
- 108091007433 antigens Proteins 0.000 title claims description 146
- 102000036639 antigens Human genes 0.000 title claims description 146
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 271
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 267
- 229920001184 polypeptide Polymers 0.000 claims abstract description 260
- 239000012634 fragment Substances 0.000 claims abstract description 97
- 230000004927 fusion Effects 0.000 claims abstract description 46
- 239000000203 mixture Substances 0.000 claims abstract description 45
- 101710166488 6 kDa early secretory antigenic target Proteins 0.000 claims abstract description 44
- 230000001900 immune effect Effects 0.000 claims abstract description 22
- 238000012360 testing method Methods 0.000 claims abstract description 21
- 239000003153 chemical reaction reagent Substances 0.000 claims abstract description 7
- 150000001413 amino acids Chemical class 0.000 claims description 147
- 108010074328 Interferon-gamma Proteins 0.000 claims description 55
- 208000015181 infectious disease Diseases 0.000 claims description 52
- 102100037850 Interferon gamma Human genes 0.000 claims description 51
- 230000028993 immune response Effects 0.000 claims description 44
- 241000699670 Mus sp. Species 0.000 claims description 39
- 210000001744 T-lymphocyte Anatomy 0.000 claims description 38
- 241001465754 Metazoa Species 0.000 claims description 32
- 230000001681 protective effect Effects 0.000 claims description 32
- 125000000539 amino acid group Chemical group 0.000 claims description 27
- 238000006243 chemical reaction Methods 0.000 claims description 27
- 239000002671 adjuvant Substances 0.000 claims description 25
- 241000282414 Homo sapiens Species 0.000 claims description 23
- 230000002163 immunogen Effects 0.000 claims description 21
- 239000000725 suspension Substances 0.000 claims description 17
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 16
- 206010070834 Sensitisation Diseases 0.000 claims description 16
- 230000008313 sensitization Effects 0.000 claims description 16
- -1 TB10.4 Proteins 0.000 claims description 15
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 13
- 230000000763 evoking effect Effects 0.000 claims description 13
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 claims description 13
- 241000186359 Mycobacterium Species 0.000 claims description 12
- 239000003981 vehicle Substances 0.000 claims description 12
- 241000699666 Mus <mouse, genus> Species 0.000 claims description 11
- 102000037865 fusion proteins Human genes 0.000 claims description 10
- 108020001507 fusion proteins Proteins 0.000 claims description 10
- 101150014428 mpt64 gene Proteins 0.000 claims description 10
- 239000006228 supernatant Substances 0.000 claims description 10
- 101100291912 Mycobacterium bovis (strain ATCC BAA-935 / AF2122/97) mpb64 gene Proteins 0.000 claims description 9
- 230000006698 induction Effects 0.000 claims description 8
- 206010053613 Type IV hypersensitivity reaction Diseases 0.000 claims description 6
- 238000001514 detection method Methods 0.000 claims description 6
- 239000003937 drug carrier Substances 0.000 claims description 6
- 210000004989 spleen cell Anatomy 0.000 claims description 6
- 230000005951 type IV hypersensitivity Effects 0.000 claims description 6
- 208000027930 type IV hypersensitivity disease Diseases 0.000 claims description 6
- 241000283690 Bos taurus Species 0.000 claims description 5
- 210000003071 memory t lymphocyte Anatomy 0.000 claims description 4
- 229920000642 polymer Polymers 0.000 claims description 4
- 239000003085 diluting agent Substances 0.000 claims description 3
- 230000002209 hydrophobic effect Effects 0.000 claims description 3
- 230000003993 interaction Effects 0.000 claims description 3
- 239000000375 suspending agent Substances 0.000 claims description 3
- 101710088334 Diacylglycerol acyltransferase/mycolyltransferase Ag85B Proteins 0.000 claims description 2
- 101001043272 Mycobacterium tuberculosis (strain ATCC 25177 / H37Ra) Lipoprotein LpqH Proteins 0.000 claims description 2
- 101150079015 esxB gene Proteins 0.000 claims description 2
- 210000002443 helper t lymphocyte Anatomy 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 36
- 101710088335 Diacylglycerol acyltransferase/mycolyltransferase Ag85A Proteins 0.000 claims 1
- 101710088427 Diacylglycerol acyltransferase/mycolyltransferase Ag85C Proteins 0.000 claims 1
- 102000007362 alpha-Crystallins Human genes 0.000 claims 1
- 108010007908 alpha-Crystallins Proteins 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 abstract description 245
- 102000004169 proteins and genes Human genes 0.000 abstract description 155
- 229960005486 vaccine Drugs 0.000 abstract description 53
- 238000012512 characterization method Methods 0.000 abstract description 2
- 235000018102 proteins Nutrition 0.000 description 144
- 108020004414 DNA Proteins 0.000 description 127
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 116
- 235000001014 amino acid Nutrition 0.000 description 84
- 229940024606 amino acid Drugs 0.000 description 84
- 239000002773 nucleotide Substances 0.000 description 79
- 125000003729 nucleotide group Chemical group 0.000 description 79
- 210000004027 cell Anatomy 0.000 description 70
- 238000010367 cloning Methods 0.000 description 65
- 238000000034 method Methods 0.000 description 61
- 239000013615 primer Substances 0.000 description 58
- 101150047914 mpt51 gene Proteins 0.000 description 51
- 108700026244 Open Reading Frames Proteins 0.000 description 49
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 48
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 46
- 150000007523 nucleic acids Chemical group 0.000 description 46
- 238000003752 polymerase chain reaction Methods 0.000 description 41
- 108020004705 Codon Proteins 0.000 description 39
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 39
- 230000000295 complement effect Effects 0.000 description 39
- 229960000190 bacillus calmette–guérin vaccine Drugs 0.000 description 38
- 239000013598 vector Substances 0.000 description 38
- 101150069551 esxH gene Proteins 0.000 description 36
- 101000856404 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) Carboxylesterase Culp1 Proteins 0.000 description 32
- 108091028043 Nucleic acid sequence Proteins 0.000 description 32
- 239000000499 gel Substances 0.000 description 30
- 241001467552 Mycobacterium bovis BCG Species 0.000 description 29
- 101000941208 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) Probable carboxylesterase Culp2 Proteins 0.000 description 29
- 241000588724 Escherichia coli Species 0.000 description 28
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 27
- 239000000706 filtrate Substances 0.000 description 27
- 239000000872 buffer Substances 0.000 description 25
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 25
- 239000004202 carbamide Substances 0.000 description 23
- 241000894006 Bacteria Species 0.000 description 22
- 108091034117 Oligonucleotide Proteins 0.000 description 22
- 108010047495 alanylglycine Proteins 0.000 description 21
- 239000013612 plasmid Substances 0.000 description 21
- 239000000523 sample Substances 0.000 description 21
- 238000012163 sequencing technique Methods 0.000 description 21
- 239000013604 expression vector Substances 0.000 description 20
- 238000002360 preparation method Methods 0.000 description 20
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 18
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 18
- 238000004458 analytical method Methods 0.000 description 18
- 108010050848 glycylleucine Proteins 0.000 description 18
- 239000012528 membrane Substances 0.000 description 18
- 108010087924 alanylproline Proteins 0.000 description 17
- 238000005516 engineering process Methods 0.000 description 17
- 230000036039 immunity Effects 0.000 description 17
- 239000002953 phosphate buffered saline Substances 0.000 description 17
- 241001646725 Mycobacterium tuberculosis H37Rv Species 0.000 description 16
- 239000001166 ammonium sulphate Substances 0.000 description 16
- 235000011130 ammonium sulphate Nutrition 0.000 description 16
- 230000003053 immunization Effects 0.000 description 16
- 239000000047 product Substances 0.000 description 16
- 238000000746 purification Methods 0.000 description 16
- 230000004044 response Effects 0.000 description 16
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 15
- 108010093581 aspartyl-proline Proteins 0.000 description 15
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 15
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 14
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- 230000005847 immunogenicity Effects 0.000 description 14
- 239000011780 sodium chloride Substances 0.000 description 14
- 241000186366 Mycobacterium bovis Species 0.000 description 13
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 13
- 238000007792 addition Methods 0.000 description 13
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 13
- 238000005119 centrifugation Methods 0.000 description 13
- 238000009396 hybridization Methods 0.000 description 13
- 238000002649 immunization Methods 0.000 description 13
- 241000894007 species Species 0.000 description 13
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 12
- 108010079364 N-glycylalanine Proteins 0.000 description 12
- 239000011543 agarose gel Substances 0.000 description 12
- 238000003556 assay Methods 0.000 description 12
- 101150027583 cfp32 gene Proteins 0.000 description 12
- 101150022442 cysO gene Proteins 0.000 description 12
- 238000010828 elution Methods 0.000 description 12
- 239000007924 injection Substances 0.000 description 12
- 238000002347 injection Methods 0.000 description 12
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 12
- 230000015654 memory Effects 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 11
- 239000007983 Tris buffer Substances 0.000 description 11
- 150000003839 salts Chemical group 0.000 description 11
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 11
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 108010047857 aspartylglycine Proteins 0.000 description 10
- 230000027455 binding Effects 0.000 description 10
- 238000012217 deletion Methods 0.000 description 10
- 230000037430 deletion Effects 0.000 description 10
- PSLWZOIUBRXAQW-UHFFFAOYSA-M dimethyl(dioctadecyl)azanium;bromide Chemical compound [Br-].CCCCCCCCCCCCCCCCCC[N+](C)(C)CCCCCCCCCCCCCCCCCC PSLWZOIUBRXAQW-UHFFFAOYSA-M 0.000 description 10
- 238000000338 in vitro Methods 0.000 description 10
- 108091008146 restriction endonucleases Proteins 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- 238000001712 DNA sequencing Methods 0.000 description 9
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 9
- 239000002033 PVDF binder Substances 0.000 description 9
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 9
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 9
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 9
- 238000010171 animal model Methods 0.000 description 9
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 9
- 108010077515 glycylproline Proteins 0.000 description 9
- 238000002955 isolation Methods 0.000 description 9
- 238000004519 manufacturing process Methods 0.000 description 9
- 102000039446 nucleic acids Human genes 0.000 description 9
- 108020004707 nucleic acids Proteins 0.000 description 9
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 9
- 230000004224 protection Effects 0.000 description 9
- 108020005038 Terminator Codon Proteins 0.000 description 8
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 8
- 229960000723 ampicillin Drugs 0.000 description 8
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 8
- 239000002158 endotoxin Substances 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 231100000430 skin reaction Toxicity 0.000 description 8
- 238000000527 sonication Methods 0.000 description 8
- 210000000952 spleen Anatomy 0.000 description 8
- UMCMPZBLKLEWAF-BCTGSCMUSA-N 3-[(3-cholamidopropyl)dimethylammonio]propane-1-sulfonate Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCC[N+](C)(C)CCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 UMCMPZBLKLEWAF-BCTGSCMUSA-N 0.000 description 7
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 7
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 7
- 102000004127 Cytokines Human genes 0.000 description 7
- 108090000695 Cytokines Proteins 0.000 description 7
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 7
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 7
- 239000006137 Luria-Bertani broth Substances 0.000 description 7
- 241000186367 Mycobacterium avium Species 0.000 description 7
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 7
- 238000012408 PCR amplification Methods 0.000 description 7
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 7
- 238000002105 Southern blotting Methods 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 7
- 210000004369 blood Anatomy 0.000 description 7
- 239000008280 blood Substances 0.000 description 7
- 239000013611 chromosomal DNA Substances 0.000 description 7
- 108010037850 glycylvaline Proteins 0.000 description 7
- 229920006008 lipopolysaccharide Polymers 0.000 description 7
- 244000005700 microbiome Species 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 239000012460 protein solution Substances 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- 238000002741 site-directed mutagenesis Methods 0.000 description 7
- 230000000638 stimulation Effects 0.000 description 7
- 229940031626 subunit vaccine Drugs 0.000 description 7
- 238000002255 vaccination Methods 0.000 description 7
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 6
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 6
- 241000700198 Cavia Species 0.000 description 6
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 6
- 241000187492 Mycobacterium marinum Species 0.000 description 6
- SEQKRHFRPICQDD-UHFFFAOYSA-N N-tris(hydroxymethyl)methylglycine Chemical compound OCC(CO)(CO)[NH2+]CC([O-])=O SEQKRHFRPICQDD-UHFFFAOYSA-N 0.000 description 6
- 206010040914 Skin reaction Diseases 0.000 description 6
- 238000001042 affinity chromatography Methods 0.000 description 6
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 238000003745 diagnosis Methods 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010089804 glycyl-threonine Proteins 0.000 description 6
- 210000004698 lymphocyte Anatomy 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 239000011347 resin Substances 0.000 description 6
- 229920005989 resin Polymers 0.000 description 6
- 230000035483 skin reaction Effects 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 5
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 5
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 5
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 5
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 5
- 241001467553 Mycobacterium africanum Species 0.000 description 5
- 241000186363 Mycobacterium kansasii Species 0.000 description 5
- 241000186362 Mycobacterium leprae Species 0.000 description 5
- 241000187496 Mycobacterium szulgai Species 0.000 description 5
- 241000187494 Mycobacterium xenopi Species 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 5
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 5
- 239000004480 active ingredient Substances 0.000 description 5
- 108010041407 alanylaspartic acid Proteins 0.000 description 5
- 230000000692 anti-sense effect Effects 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 102000005936 beta-Galactosidase Human genes 0.000 description 5
- 108010005774 beta-Galactosidase Proteins 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010081551 glycylphenylalanine Proteins 0.000 description 5
- 235000014304 histidine Nutrition 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 238000001155 isoelectric focusing Methods 0.000 description 5
- 108010078274 isoleucylvaline Proteins 0.000 description 5
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 239000006166 lysate Substances 0.000 description 5
- 239000011159 matrix material Substances 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 230000010076 replication Effects 0.000 description 5
- 210000002966 serum Anatomy 0.000 description 5
- 229910052709 silver Inorganic materials 0.000 description 5
- 239000004332 silver Substances 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- 238000000108 ultra-filtration Methods 0.000 description 5
- 230000001018 virulence Effects 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- WURBVZBTWMNKQT-UHFFFAOYSA-N 1-(4-chlorophenoxy)-3,3-dimethyl-1-(1,2,4-triazol-1-yl)butan-2-one Chemical compound C1=NC=NN1C(C(=O)C(C)(C)C)OC1=CC=C(Cl)C=C1 WURBVZBTWMNKQT-UHFFFAOYSA-N 0.000 description 4
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 4
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 4
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 4
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 4
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 4
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 4
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 4
- 101100396524 Capsicum annuum eIF4E gene Proteins 0.000 description 4
- 102000014914 Carrier Proteins Human genes 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- 239000003155 DNA primer Substances 0.000 description 4
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 4
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 4
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 4
- 102000008070 Interferon-gamma Human genes 0.000 description 4
- 102000013462 Interleukin-12 Human genes 0.000 description 4
- 108010065805 Interleukin-12 Proteins 0.000 description 4
- 108010002350 Interleukin-2 Proteins 0.000 description 4
- 102000000588 Interleukin-2 Human genes 0.000 description 4
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 4
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 4
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 4
- 241000186365 Mycobacterium fortuitum Species 0.000 description 4
- 241000187490 Mycobacterium scrofulaceum Species 0.000 description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 4
- 239000000020 Nitrocellulose Substances 0.000 description 4
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 4
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 4
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 4
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 4
- 239000004098 Tetracycline Substances 0.000 description 4
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 4
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 4
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 4
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 4
- 238000001261 affinity purification Methods 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 108010011559 alanylphenylalanine Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 238000005349 anion exchange Methods 0.000 description 4
- 230000000890 antigenic effect Effects 0.000 description 4
- 239000003926 antimycobacterial agent Substances 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108091008324 binding proteins Proteins 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 210000002421 cell wall Anatomy 0.000 description 4
- 108010005400 cutinase Proteins 0.000 description 4
- KCFYHBSOLOXZIF-UHFFFAOYSA-N dihydrochrysin Natural products COC1=C(O)C(OC)=CC(C2OC3=CC(O)=CC(O)=C3C(=O)C2)=C1 KCFYHBSOLOXZIF-UHFFFAOYSA-N 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 4
- MSQACBWWAIBWIC-UHFFFAOYSA-N hydron;piperazine;chloride Chemical compound Cl.C1CNCCN1 MSQACBWWAIBWIC-UHFFFAOYSA-N 0.000 description 4
- 210000000987 immune system Anatomy 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 229910052751 metal Inorganic materials 0.000 description 4
- 239000002184 metal Substances 0.000 description 4
- 244000000010 microbial pathogen Species 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 4
- 238000010172 mouse model Methods 0.000 description 4
- BSOQXXWZTUDTEL-ZUYCGGNHSA-N muramyl dipeptide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)O[C@@H](O)[C@@H]1NC(C)=O BSOQXXWZTUDTEL-ZUYCGGNHSA-N 0.000 description 4
- 229920001220 nitrocellulos Polymers 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 238000010647 peptide synthesis reaction Methods 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 4
- 230000003393 splenic effect Effects 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- 229960002180 tetracycline Drugs 0.000 description 4
- 229930101283 tetracycline Natural products 0.000 description 4
- 235000019364 tetracycline Nutrition 0.000 description 4
- 150000003522 tetracyclines Chemical class 0.000 description 4
- 238000001262 western blot Methods 0.000 description 4
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 3
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 3
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 3
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 3
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 3
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 3
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 3
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 3
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 3
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 3
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 3
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 3
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 3
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 3
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 3
- 241000700199 Cavia porcellus Species 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 206010015150 Erythema Diseases 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 3
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 3
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 3
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 3
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 3
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 3
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 3
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 3
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 3
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- 108010074338 Lymphokines Proteins 0.000 description 3
- 102000008072 Lymphokines Human genes 0.000 description 3
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 3
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 241000983409 Microbacterium terrae Species 0.000 description 3
- 241000187486 Mycobacterium flavescens Species 0.000 description 3
- 241001302239 Mycobacterium tuberculosis complex Species 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 3
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 3
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 3
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 3
- 241000288906 Primates Species 0.000 description 3
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 3
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 3
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 3
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 3
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 3
- 241000589516 Pseudomonas Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 3
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 3
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 3
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 3
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 3
- UZMAPBJVXOGOFT-UHFFFAOYSA-N Syringetin Natural products COC1=C(O)C(OC)=CC(C2=C(C(=O)C3=C(O)C=C(O)C=C3O2)O)=C1 UZMAPBJVXOGOFT-UHFFFAOYSA-N 0.000 description 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 3
- 239000007997 Tricine buffer Substances 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 3
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 3
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 3
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 3
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 3
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 238000005377 adsorption chromatography Methods 0.000 description 3
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 3
- 238000003277 amino acid sequence analysis Methods 0.000 description 3
- 238000005571 anion exchange chromatography Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 210000000601 blood cell Anatomy 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 239000000470 constituent Substances 0.000 description 3
- 239000000287 crude extract Substances 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 108010054813 diprotin B Proteins 0.000 description 3
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 3
- 239000003596 drug target Substances 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 231100000321 erythema Toxicity 0.000 description 3
- 238000012869 ethanol precipitation Methods 0.000 description 3
- 238000013230 female C57BL/6J mice Methods 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 239000002054 inoculum Substances 0.000 description 3
- 229960003130 interferon gamma Drugs 0.000 description 3
- 238000001990 intravenous administration Methods 0.000 description 3
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 3
- OJMMVQQUTAEWLP-KIDUDLJLSA-N lincomycin Chemical compound CN1C[C@H](CCC)C[C@H]1C(=O)N[C@H]([C@@H](C)O)[C@@H]1[C@H](O)[C@H](O)[C@@H](O)[C@@H](SC)O1 OJMMVQQUTAEWLP-KIDUDLJLSA-N 0.000 description 3
- 108010047534 linocin Proteins 0.000 description 3
- 238000004811 liquid chromatography Methods 0.000 description 3
- 230000007787 long-term memory Effects 0.000 description 3
- 210000004072 lung Anatomy 0.000 description 3
- 230000001320 lysogenic effect Effects 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 244000052769 pathogen Species 0.000 description 3
- 239000008188 pellet Substances 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 229940068196 placebo Drugs 0.000 description 3
- 239000000902 placebo Substances 0.000 description 3
- 229920002401 polyacrylamide Polymers 0.000 description 3
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 3
- 239000012723 sample buffer Substances 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 239000001488 sodium phosphate Substances 0.000 description 3
- 229910000162 sodium phosphate Inorganic materials 0.000 description 3
- 239000000829 suppository Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 3
- 108010084932 tryptophyl-proline Proteins 0.000 description 3
- 210000003462 vein Anatomy 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 2
- UKPGFKQVRITNFM-KBPBESRZSA-N 2-[[2-[[(2s)-1-[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)NCC(O)=O)C1=CC=C(O)C=C1 UKPGFKQVRITNFM-KBPBESRZSA-N 0.000 description 2
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- YHOPXCAOTRUGLV-XAMCCFCMSA-N Ala-Ala-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YHOPXCAOTRUGLV-XAMCCFCMSA-N 0.000 description 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 2
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 2
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 2
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 2
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 2
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 2
- 229920000856 Amylose Polymers 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 2
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 2
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 2
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 2
- 241000304886 Bacilli Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 244000177578 Bacterium linens Species 0.000 description 2
- 235000012539 Bacterium linens Nutrition 0.000 description 2
- 102100021277 Beta-secretase 2 Human genes 0.000 description 2
- 101710150190 Beta-secretase 2 Proteins 0.000 description 2
- 238000011746 C57BL/6J (JAX™ mouse strain) Methods 0.000 description 2
- 238000011752 CBA/J (JAX™ mouse strain) Methods 0.000 description 2
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 2
- 208000003322 Coinfection Diseases 0.000 description 2
- 108010041986 DNA Vaccines Proteins 0.000 description 2
- 229940021995 DNA vaccine Drugs 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 241000701959 Escherichia virus Lambda Species 0.000 description 2
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 2
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 2
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 2
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 2
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 2
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 2
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 2
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108091027305 Heteroduplex Proteins 0.000 description 2
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 2
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 2
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 2
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 2
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 2
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 2
- 241000203407 Methanocaldococcus jannaschii Species 0.000 description 2
- 101100115705 Mus musculus Stfa1 gene Proteins 0.000 description 2
- 101100276102 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) cut2 gene Proteins 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 2
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 2
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- 108091036414 Polyinosinic:polycytidylic acid Proteins 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- TYMBHHITTMGGPI-NAKRPEOUSA-N Pro-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 TYMBHHITTMGGPI-NAKRPEOUSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- 102000004245 Proteasome Endopeptidase Complex Human genes 0.000 description 2
- 108090000708 Proteasome Endopeptidase Complex Proteins 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 208000035415 Reinfection Diseases 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 108010073443 Ribi adjuvant Proteins 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 230000005867 T cell response Effects 0.000 description 2
- 101150006914 TRP1 gene Proteins 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 2
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 2
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 2
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 2
- KZIQDVNORJKTMO-WDSOQIARSA-N Trp-Arg-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N KZIQDVNORJKTMO-WDSOQIARSA-N 0.000 description 2
- QTQNGBOKNQNQLS-PMVMPFDFSA-N Trp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N QTQNGBOKNQNQLS-PMVMPFDFSA-N 0.000 description 2
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 2
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 2
- 230000000240 adjuvant effect Effects 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 2
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 239000000032 diagnostic agent Substances 0.000 description 2
- 229940039227 diagnostic agent Drugs 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000001378 electrochemiluminescence detection Methods 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 241001237520 fungal sp. Species 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 108010037896 heparin-binding hemagglutinin Proteins 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 229960003350 isoniazid Drugs 0.000 description 2
- QRXWMOHMRWLFEY-UHFFFAOYSA-N isoniazide Chemical compound NNC(=O)C1=CC=NC=C1 QRXWMOHMRWLFEY-UHFFFAOYSA-N 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000029226 lipidation Effects 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000007791 liquid phase Substances 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 210000005087 mononuclear cell Anatomy 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 210000005105 peripheral blood lymphocyte Anatomy 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 229940115272 polyinosinic:polycytidylic acid Drugs 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 238000012257 pre-denaturation Methods 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 239000002987 primer (paints) Substances 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 230000008707 rearrangement Effects 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 150000003384 small molecules Chemical class 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 230000002195 synergetic effect Effects 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- GETQZCLCWQTVFV-UHFFFAOYSA-N trimethylamine Chemical compound CN(C)C GETQZCLCWQTVFV-UHFFFAOYSA-N 0.000 description 2
- 239000001226 triphosphate Substances 0.000 description 2
- 235000011178 triphosphate Nutrition 0.000 description 2
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- QZCJOXAIQXPLNS-UHFFFAOYSA-N 1,1,2,2,3,3,4,4,4a,5,5,6,6,7,7,8,8,8a-octadecafluoronaphthalene 4-(2-aminoethyl)benzene-1,2-diol Chemical compound NCCc1ccc(O)c(O)c1.FC1(F)C(F)(F)C(F)(F)C2(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C2(F)C1(F)F QZCJOXAIQXPLNS-UHFFFAOYSA-N 0.000 description 1
- 102100024341 10 kDa heat shock protein, mitochondrial Human genes 0.000 description 1
- MIJDSYMOBYNHOT-UHFFFAOYSA-N 2-(ethylamino)ethanol Chemical compound CCNCCO MIJDSYMOBYNHOT-UHFFFAOYSA-N 0.000 description 1
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 1
- QWTLUPDHBKBULE-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QWTLUPDHBKBULE-UHFFFAOYSA-N 0.000 description 1
- HRGUSFBJBOKSML-UHFFFAOYSA-N 3',5'-di-O-methyltricetin Chemical compound COC1=C(O)C(OC)=CC(C=2OC3=CC(O)=CC(O)=C3C(=O)C=2)=C1 HRGUSFBJBOKSML-UHFFFAOYSA-N 0.000 description 1
- UAIUNKRWKOVEES-UHFFFAOYSA-N 3,3',5,5'-tetramethylbenzidine Chemical compound CC1=C(N)C(C)=CC(C=2C=C(C)C(N)=C(C)C=2)=C1 UAIUNKRWKOVEES-UHFFFAOYSA-N 0.000 description 1
- DBCSOHBZRZWHQM-UHFFFAOYSA-N 3-[1-(3-amino-3-oxoprop-1-enyl)piperazin-2-yl]prop-2-enamide Chemical compound NC(=O)C=CC1CNCCN1C=CC(N)=O DBCSOHBZRZWHQM-UHFFFAOYSA-N 0.000 description 1
- 101710173133 50S ribosomal protein L7/L12 Proteins 0.000 description 1
- 102100038222 60 kDa heat shock protein, mitochondrial Human genes 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 description 1
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 description 1
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 description 1
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- UWIQWPWWZUHBAO-ZLIFDBKOSA-N Ala-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)CC(C)C)C(O)=O)=CNC2=C1 UWIQWPWWZUHBAO-ZLIFDBKOSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 1
- 108010031025 Alanine Dehydrogenase Proteins 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- RFNDQEWMNJMQHD-SZMVWBNQSA-N Arg-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RFNDQEWMNJMQHD-SZMVWBNQSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 1
- 102000016904 Armadillo Domain Proteins Human genes 0.000 description 1
- 108010014223 Armadillo Domain Proteins Proteins 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- CQMQJWRCRQSBAF-BPUTZDHNSA-N Asn-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N CQMQJWRCRQSBAF-BPUTZDHNSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- ANRZCQXIXGDXLR-CWRNSKLLSA-N Asn-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)N)N)C(=O)O ANRZCQXIXGDXLR-CWRNSKLLSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- KGAJCJXBEWLQDZ-UBHSHLNASA-N Asp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N KGAJCJXBEWLQDZ-UBHSHLNASA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- UKGGPJNBONZZCM-WDSKDSINSA-N Asp-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- DWOSGXZMLQNDBN-FXQIFTODSA-N Asp-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O DWOSGXZMLQNDBN-FXQIFTODSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- KCOPOPKJRHVGPE-AQZXSJQPSA-N Asp-Thr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O KCOPOPKJRHVGPE-AQZXSJQPSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 1
- IHZFGJLKDYINPV-XIRDDKMYSA-N Asp-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(O)=O)N)C(O)=O)C1=CN=CN1 IHZFGJLKDYINPV-XIRDDKMYSA-N 0.000 description 1
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 description 1
- 238000011725 BALB/c mouse Methods 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 description 1
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 description 1
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 description 1
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 description 1
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 description 1
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 description 1
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 description 1
- 101710201279 Biotin carboxyl carrier protein Proteins 0.000 description 1
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 description 1
- 101001086191 Borrelia burgdorferi Outer surface protein A Proteins 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 101001020583 Brevibacterium linens Linocin-M18 Proteins 0.000 description 1
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 108010059013 Chaperonin 10 Proteins 0.000 description 1
- 108010058432 Chaperonin 60 Proteins 0.000 description 1
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000037164 Collema parvum Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 101000690585 Corynebacterium striatum N(alpha)-acyl-glutamine aminoacylase Proteins 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 241000289632 Dasypodidae Species 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 101100364969 Dictyostelium discoideum scai gene Proteins 0.000 description 1
- 102000028526 Dihydrolipoamide Dehydrogenase Human genes 0.000 description 1
- 108010028127 Dihydrolipoamide Dehydrogenase Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 description 1
- 241001522878 Escherichia coli B Species 0.000 description 1
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 description 1
- 241001524679 Escherichia virus M13 Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 208000020545 Exposure to communicable disease Diseases 0.000 description 1
- 108010000916 Fimbriae Proteins Proteins 0.000 description 1
- 101710177917 Fimbrial protein Proteins 0.000 description 1
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 description 1
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- OVQXQLWWJSNYFV-XEGUGMAKSA-N Gln-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(N)=O)C)C(O)=O)=CNC2=C1 OVQXQLWWJSNYFV-XEGUGMAKSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- WBBVTGIFQIZBHP-JBACZVJFSA-N Gln-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N WBBVTGIFQIZBHP-JBACZVJFSA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- ZXQPJYWZSFGWJB-AVGNSLFASA-N Glu-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXQPJYWZSFGWJB-AVGNSLFASA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- 108010050006 Gly-Asp-Gly-Arg Proteins 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- PYFHPYDQHCEVIT-KBPBESRZSA-N Gly-Trp-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O PYFHPYDQHCEVIT-KBPBESRZSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 208000031886 HIV Infections Diseases 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 description 1
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 description 1
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 1
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- BILZDIPAKWZFSG-PYJNHQTQSA-N His-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BILZDIPAKWZFSG-PYJNHQTQSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- CMMBEMZGNGYJRJ-IHRRRGAJSA-N His-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N CMMBEMZGNGYJRJ-IHRRRGAJSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- SWBUZLFWGJETAO-KKUMJFAQSA-N His-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O SWBUZLFWGJETAO-KKUMJFAQSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 description 1
- 101000802660 Homo sapiens Histo-blood group ABO system transferase Proteins 0.000 description 1
- 101000878213 Homo sapiens Inactive peptidyl-prolyl cis-trans isomerase FKBP6 Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- 102000009490 IgG Receptors Human genes 0.000 description 1
- 108010073807 IgG Receptors Proteins 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 102100036984 Inactive peptidyl-prolyl cis-trans isomerase FKBP6 Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 description 1
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 description 1
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- 241000239218 Limulus Species 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- BPDXWKVZNCKUGG-BZSNNMDCSA-N Lys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N BPDXWKVZNCKUGG-BZSNNMDCSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- CFOLERIRBUAYAD-HOCLYGCPSA-N Lys-Trp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O CFOLERIRBUAYAD-HOCLYGCPSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- GVIVXNFKJQFTCE-YUMQZZPRSA-N Met-Gly-Gln Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O GVIVXNFKJQFTCE-YUMQZZPRSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- KZKVVWBOGDKHKE-QTKMDUPCSA-N Met-Thr-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 KZKVVWBOGDKHKE-QTKMDUPCSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- XLTSAUGGDYRFLS-UMPQAUOISA-N Met-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCSC)N)O XLTSAUGGDYRFLS-UMPQAUOISA-N 0.000 description 1
- CKAVKDJBSNTJDB-SRVKXCTJSA-N Met-Val-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCSC CKAVKDJBSNTJDB-SRVKXCTJSA-N 0.000 description 1
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 description 1
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 101100364971 Mus musculus Scai gene Proteins 0.000 description 1
- 101100115709 Mus musculus Stfa2 gene Proteins 0.000 description 1
- 241000327728 Mycobacterium bovis AN5 Species 0.000 description 1
- 101100011988 Mycobacterium leprae (strain TN) etfB gene Proteins 0.000 description 1
- 241000187480 Mycobacterium smegmatis Species 0.000 description 1
- 101100276105 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) cut3 gene Proteins 0.000 description 1
- 101100385666 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) cut7 gene Proteins 0.000 description 1
- 101100445652 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) esxK gene Proteins 0.000 description 1
- 101100013964 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) garA gene Proteins 0.000 description 1
- 101100331606 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) lpdC gene Proteins 0.000 description 1
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 241001644525 Nastus productus Species 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108700006640 OspA Proteins 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 108010087702 Penicillinase Proteins 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 241001442654 Percnon planissimum Species 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- KQUMFXGQTSAEJE-PMVMPFDFSA-N Phe-Trp-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KQUMFXGQTSAEJE-PMVMPFDFSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- 102000006335 Phosphate-Binding Proteins Human genes 0.000 description 1
- 108010058514 Phosphate-Binding Proteins Proteins 0.000 description 1
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 description 1
- IDDMFNIRSJVBHE-UHFFFAOYSA-N Piscigenin Natural products COC1=C(O)C(OC)=CC(C=2C(C3=C(O)C=C(O)C=C3OC=2)=O)=C1 IDDMFNIRSJVBHE-UHFFFAOYSA-N 0.000 description 1
- 101001002763 Plasmodium chabaudi Acidic phosphoprotein Proteins 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- NUZHSNLQJDYSRW-BZSNNMDCSA-N Pro-Arg-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NUZHSNLQJDYSRW-BZSNNMDCSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- BNUKRHFCHHLIGR-JYJNAYRXSA-N Pro-Trp-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC(=O)O)C(=O)O BNUKRHFCHHLIGR-JYJNAYRXSA-N 0.000 description 1
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 description 1
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 description 1
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 description 1
- 108010007131 Pulmonary Surfactant-Associated Protein B Proteins 0.000 description 1
- 102100032617 Pulmonary surfactant-associated protein B Human genes 0.000 description 1
- 108010053763 Pyruvate Carboxylase Proteins 0.000 description 1
- 102100039895 Pyruvate carboxylase, mitochondrial Human genes 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 description 1
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 description 1
- 241000187562 Rhodococcus sp. Species 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 229920002125 Sokalan® Polymers 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 description 1
- 241000194019 Streptococcus mutans Species 0.000 description 1
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 description 1
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 description 1
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 102000019197 Superoxide Dismutase Human genes 0.000 description 1
- 108010012715 Superoxide dismutase Proteins 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- VOGXLRKCWFLJBY-HSHDSVGOSA-N Thr-Arg-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VOGXLRKCWFLJBY-HSHDSVGOSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- VBPDMBAFBRDZSK-HOUAVDHOSA-N Thr-Asn-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VBPDMBAFBRDZSK-HOUAVDHOSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- BIENEHRYNODTLP-HJGDQZAQSA-N Thr-Glu-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O BIENEHRYNODTLP-HJGDQZAQSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- YDWLCDQXLCILCZ-BWAGICSOSA-N Thr-His-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YDWLCDQXLCILCZ-BWAGICSOSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- UGFSAPWZBROURT-IXOXFDKPSA-N Thr-Phe-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N)O UGFSAPWZBROURT-IXOXFDKPSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000005497 Thymidylate Synthase Human genes 0.000 description 1
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 1
- LAIUAVGWZYTBKN-VHWLVUOQSA-N Trp-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O LAIUAVGWZYTBKN-VHWLVUOQSA-N 0.000 description 1
- GKUROEIXVURAAO-BPUTZDHNSA-N Trp-Asp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GKUROEIXVURAAO-BPUTZDHNSA-N 0.000 description 1
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- NOFFAYIYPAUNRM-HKUYNNGSSA-N Trp-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NOFFAYIYPAUNRM-HKUYNNGSSA-N 0.000 description 1
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 1
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 1
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 1
- SUGLEXVWEJOCGN-ONUFPDRFSA-N Trp-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)O SUGLEXVWEJOCGN-ONUFPDRFSA-N 0.000 description 1
- YCEHCFIOIYNQTR-NYVOZVTQSA-N Trp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CO)C(=O)O)N YCEHCFIOIYNQTR-NYVOZVTQSA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 1
- VNRTXOUAOUZCFW-WDSOQIARSA-N Trp-Val-His Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O VNRTXOUAOUZCFW-WDSOQIARSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- WEFIPBYPXZYPHD-HJPIBITLSA-N Tyr-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WEFIPBYPXZYPHD-HJPIBITLSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- XDGPTBVOSHKDFT-KKUMJFAQSA-N Tyr-Met-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XDGPTBVOSHKDFT-KKUMJFAQSA-N 0.000 description 1
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- 108010046334 Urease Proteins 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 1
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 description 1
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- UZQJVUCHXGYFLQ-AYDHOLPZSA-N [(2s,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-4-[(2r,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-3,5-dihydroxy-6-(hydroxymethyl)-4-[(2s,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxy-3,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-3,5-dihydroxy-6-(hy Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O)O[C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O)O[C@H]1CC[C@]2(C)[C@H]3CC=C4[C@@]([C@@]3(CC[C@H]2[C@@]1(C=O)C)C)(C)CC(O)[C@]1(CCC(CC14)(C)C)C(=O)O[C@H]1[C@@H]([C@@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O[C@H]4[C@@H]([C@@H](O[C@H]5[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O5)O)[C@H](O)[C@@H](CO)O4)O)[C@H](O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UZQJVUCHXGYFLQ-AYDHOLPZSA-N 0.000 description 1
- LUXUAZKGQZPOBZ-SAXJAHGMSA-N [(3S,4S,5S,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] (Z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC1O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]1O LUXUAZKGQZPOBZ-SAXJAHGMSA-N 0.000 description 1
- ZWBTYMGEBZUQTK-PVLSIAFMSA-N [(7S,9E,11S,12R,13S,14R,15R,16R,17S,18S,19E,21Z)-2,15,17,32-tetrahydroxy-11-methoxy-3,7,12,14,16,18,22-heptamethyl-1'-(2-methylpropyl)-6,23-dioxospiro[8,33-dioxa-24,27,29-triazapentacyclo[23.6.1.14,7.05,31.026,30]tritriaconta-1(32),2,4,9,19,21,24,26,30-nonaene-28,4'-piperidine]-13-yl] acetate Chemical compound CO[C@H]1\C=C\O[C@@]2(C)Oc3c(C2=O)c2c4NC5(CCN(CC(C)C)CC5)N=c4c(=NC(=O)\C(C)=C/C=C/[C@H](C)[C@H](O)[C@@H](C)[C@@H](O)[C@@H](C)[C@H](OC(C)=O)[C@@H]1C)c(O)c2c(O)c3C ZWBTYMGEBZUQTK-PVLSIAFMSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 229960003767 alanine Drugs 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 230000001355 anti-mycobacterial effect Effects 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000004970 cd4 cell Anatomy 0.000 description 1
- 238000001516 cell proliferation assay Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 238000011098 chromatofocusing Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- WZHCOOQXZCIUNC-UHFFFAOYSA-N cyclandelate Chemical compound C1C(C)(C)CC(C)CC1OC(=O)C(O)C1=CC=CC=C1 WZHCOOQXZCIUNC-UHFFFAOYSA-N 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 231100000517 death Toxicity 0.000 description 1
- 108010025198 decaglycine Proteins 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 239000003651 drinking water Substances 0.000 description 1
- 235000020188 drinking water Nutrition 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 210000003162 effector t lymphocyte Anatomy 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 108010072542 endotoxin binding proteins Proteins 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- 239000000727 fraction Substances 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 229940044627 gamma-interferon Drugs 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 230000005182 global health Effects 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 102000005396 glutamine synthetase Human genes 0.000 description 1
- 108020002326 glutamine synthetase Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- KZNQNBZMBZJQJO-YFKPBYRVSA-N glyclproline Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 102000056538 human ABO Human genes 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 235000011167 hydrochloric acid Nutrition 0.000 description 1
- 150000004679 hydroxides Chemical class 0.000 description 1
- 230000002519 immonomodulatory effect Effects 0.000 description 1
- 230000036737 immune function Effects 0.000 description 1
- 230000008073 immune recognition Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical compound CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 239000006193 liquid solution Substances 0.000 description 1
- 239000006194 liquid suspension Substances 0.000 description 1
- 229940124590 live attenuated vaccine Drugs 0.000 description 1
- 229940023012 live-attenuated vaccine Drugs 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- ZLNQQNXFFQJAID-UHFFFAOYSA-L magnesium carbonate Chemical compound [Mg+2].[O-]C([O-])=O ZLNQQNXFFQJAID-UHFFFAOYSA-L 0.000 description 1
- 239000001095 magnesium carbonate Substances 0.000 description 1
- 229910000021 magnesium carbonate Inorganic materials 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229940127554 medical product Drugs 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 210000001616 monocyte Anatomy 0.000 description 1
- 229940126619 mouse monoclonal antibody Drugs 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- HEGSGKPQLMEBJL-RKQHYHRCSA-N octyl beta-D-glucopyranoside Chemical compound CCCCCCCCO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HEGSGKPQLMEBJL-RKQHYHRCSA-N 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 239000006179 pH buffering agent Substances 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 108020004410 pectinesterase Proteins 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 229950009506 penicillinase Drugs 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 235000011007 phosphoric acid Nutrition 0.000 description 1
- 150000003016 phosphoric acids Chemical class 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 101150105604 prcB gene Proteins 0.000 description 1
- MFDFERRIHVXMIY-UHFFFAOYSA-N procaine Chemical compound CCN(CC)CCOC(=O)C1=CC=C(N)C=C1 MFDFERRIHVXMIY-UHFFFAOYSA-N 0.000 description 1
- 229960004919 procaine Drugs 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 238000011867 re-evaluation Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000284 resting effect Effects 0.000 description 1
- 229960000885 rifabutin Drugs 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- YEENEYXBHNNNGV-XEHWZWQGSA-M sodium;3-acetamido-5-[acetyl(methyl)amino]-2,4,6-triiodobenzoate;(2r,3r,4s,5s,6r)-2-[(2r,3s,4s,5r)-3,4-dihydroxy-2,5-bis(hydroxymethyl)oxolan-2-yl]oxy-6-(hydroxymethyl)oxane-3,4,5-triol Chemical compound [Na+].CC(=O)N(C)C1=C(I)C(NC(C)=O)=C(I)C(C([O-])=O)=C1I.O[C@H]1[C@H](O)[C@@H](CO)O[C@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 YEENEYXBHNNNGV-XEHWZWQGSA-M 0.000 description 1
- 230000003381 solubilizing effect Effects 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 210000001768 subcellular fraction Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000013268 sustained release Methods 0.000 description 1
- 239000012730 sustained-release form Substances 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- TXEYQDLBPFQVAA-UHFFFAOYSA-N tetrafluoromethane Chemical compound FC(F)(F)F TXEYQDLBPFQVAA-UHFFFAOYSA-N 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- BMCJATLPEJCACU-UHFFFAOYSA-N tricin Natural products COc1cc(OC)c(O)c(c1)C2=CC(=O)c3c(O)cc(O)cc3O2 BMCJATLPEJCACU-UHFFFAOYSA-N 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010037335 tyrosyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 229940125575 vaccine candidate Drugs 0.000 description 1
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/35—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Mycobacteriaceae (F)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
- A61P31/06—Antibacterial agents for tuberculosis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Definitions
- the present invention relates to a number of immunologically active, novel polypeptide fragments derived from the Mycobacterium tuberculosis , vaccines and other immunologic compositions containing the fragments as immunogenic components, and methods of production and use of the polypeptides.
- the invention also relates to novel nucleic acid fragments derived from M. tuberculosis which are useful in the preparation of the polypeptide fragments of the invention or in the diagnosis of infection with M. tuberculosis .
- the invention further relates to certain fusion polypeptides, notably fusions between ESAT-6 and MPT59.
- TB Human tuberculosis caused by Mycobacterium tuberculosis is a severe global health problem responsible for approximately 3 million deaths annually, according to the WHO. The worldwide incidence of new TB cases has been progressively falling for the last decade but the recent years has markedly changed this trend due to the advent of AIDS and the appearance of multidrug resistant strains of M. tuberculosis .
- BCG The only vaccine presently available for clinical use is BCG, a vaccine which efficacy remains a matter of controversy. BCG generally induces a high level of acquired resistance in animal models of TB, but several human trials in developing countries have failed to demonstrate significant protection. Notably, BCG is not approved by the FDA for use in the United States.
- Immunity to M. tuberculosis is characterized by three basic features; i) Living bacilli efficiently induces a protective immune response in contrast to killed preparations; ii) Specifically sensitized T lymphocytes mediate this protection; iii) The most important mediator molecule seems to be interferon gamma (INF- ⁇ ).
- ST-CF Short term-culture filtrate
- the novel antigens may also be important drug targets.
- the present invention is i.a. based on the identification and characterization of a number of previously uncharacterized culture filtrate antigens from M. tuberculosis .
- T cells mediating immunity are focused predominantly to antigens in the regions 6-12 and 17-30 kDa of ST-CF.
- 8 antigens in the low molecular weight region (CFP7, CFP7A, CFP7B, CFP8A, CFP8B, CFP9, CFP10A, and CFP11) and 18 antigens (CFP16, CFP17, CFP19, CFP19B, CFP20, CFP21, CFP22, CFP22A, CFP23, CFP23A, CFP23B, CFP25, CFP26, CFP27, CFP28, CFP29, CFP30A, and CFP30B) in the 17-30 kDa region have been identified.
- CFP19A and CFP23 have been selected because they exhibit relatively high homologies with CFP21 and CFP25, respectively, in so far that a nucleotide homology sequence search in the Sanger Database (cf. below) with the genes encoding CFP21 and CFP25, (cfp25 and cfp21 respectively), shows homology to two M. tuberculosis DNA sequences, orf19A and orf23.
- the two sequences, orf19a and orf23 encode to putative proteins CFP19A and CFP23 with the molecular weights of approx. 19 and 23 kDa respectively.
- the identity, at amino acid level, to CFP21 and CFP25 is 46% and 50%, respectively, for both proteins.
- CFP21 and CFP25 have been shown to be dominant T-cell antigens, and it is therefore believed that CFP19A and CFP23 are possible new T-cell antigens.
- CFP50 50 kDa antigen
- CWP32 antigen isolated from the cell wall in the 30 kDa region.
- the present invention is also based on the identification of a number of putative antigens from M. tuberculosis which are not present in Mycobacterium bovis BCG strains.
- the nucleotide sequences encoding these putative antigens are: rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b.
- the invention is based on the surprising discovery that fusions between ESAT-6 and MPT59 are superior immunogens compared to the unfused proteins, respectively.
- the encoding genes for 33 of the antigens have been determined, the distribution of a number of the antigens in various mycobacterial strains investigated and the biological activity of the products characterized.
- the panel hold antigens with potential for vaccine purposes as well as for diagnostic purposes, since the antigens are all secreted by metabolizing mycobacteria.
- T-cell epitopes are responsible for the elicitation of the acquired immunity against TB, whereas B-cell epitopes are without any significant influence on acquired immunity and recognition of mycobacteria in vivo. Since such T-cell epitopes are linear and are known to have a minimum length of 6 amino acid residues, the present invention is especially concerned with the identification and utilisation of such T-cell epitopes.
- the invention relates to a substantially pure polypeptide fragment which
- a) comprises an amino acid sequence selected from the sequences shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, and any one of 168-171,
- b) comprises a subsequence of the polypeptide fragment defined in a) which has a length of at least 6 amino acid residues, said subsequence being immunologically equivalent to the polypeptide defined in a) with respect to the ability of evoking a protective immune response against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex, or
- c) comprises an amino acid sequence having a sequence identity with the polypeptide defined in a) or the subsequence defined in b) of at least 70% and at the same time being immunologically equivalent to the polypeptide defined in a) with respect to the ability of evoking a protective immune response against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex,
- polypeptide fragment is in essentially pure form when consisting of the amino acid sequence 1-96 of SEQ ID NO: 2 or when consisting of the amino acid sequence 87-108 of SEQ ID NO: 4 fused to ⁇ -galactosidase,
- the degree of sequence identity in c) is at least 95% when the polypeptide comprises a homologue of a polypeptide which has the amino acid sequence SEQ ID NO: 12 or a subsequence thereof as defined in b), and
- the polypeptide fragment contains a threonine residue corresponding to position 213 in SEQ ID NO: 42 when comprising an amino acid sequence of at least 6 amino acids in SEQ ID NO: 42.
- polypeptide fragment denotes both short peptides with a length of at least two amino acid residues and at most 10 amino acid residues, oligopeptides (11-100 amino acid residues), and longer peptides-(the usual interpretation of “polypeptide”, i.e. more than 100 amino acid residues in length) as well as proteins (the functional entity comprising at least one peptide, oligopeptide, or polypeptide which may be chemically modified by being glycosylated, by being lipidated, or by comprising prosthetic groups).
- polypeptides also comprises native forms of peptides/proteins in mycobacteria as well as recombinant proteins or peptides in any type of expression vectors transforming any kind of host, and also chemically synthesized peptides.
- substantially pure polypeptide fragment means a polypeptide preparation which contains at most 5% by weight of other polypeptide material with which it is natively associated (lower percentages of other polypeptide material are preferred, e.g. at most 4%, at most 3%, at most 2%, at most 1%, and at most 1 ⁇ 2%). It is preferred that the substantially pure polypeptide is at least 96% pure, i.e. that the polypeptide constitutes at least 96% by weight of total polypeptide material present in the preparation, and higher percentages are preferred, such as at least 97%, at least 98%, at least 99%, at least 99,25%, at least 99,5%, and at least 99,75%.
- the polypeptide fragment is in “essentially pure form”, i.e. that the polypeptide fragment is essentially free of any other antigen with which it is natively associated, i.e. free of any other antigen from bacteria belonging to the tuberculosis complex.
- This can be accomplished by preparing the polypeptide fragment by means of recombinant methods in a non-mycobacterial host cell as will be described in detail below, or by synthesizing the polypeptide fragment by the well-known methods of solid or liquid phase peptide synthesis, e.g. by the method described by Merrifield or variations thereof.
- sequence when used in connection with a polypeptide of the invention having a SEQ ID NO selected from 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, and any one of 168-171 denotes any continuous stretch of at least 6 amino acid residues taken from the M.
- a polypeptide from different sources such as other bacteria or even from eukaryotic cells.
- polypeptide when formulated in a vaccine or a diagnostic agent (i.e. together with a pharmaceutically acceptable carrier or vehicle and optionally an adjuvant), will
- I) confer, upon administration (either alone or as an immunologically active constituent together with other antigens), an acquired increased specific resistance in a mouse and/or in a guinea pig and/or in a primate such as a human being against infections with bacteria belonging to the tuberculosis complex which is at least 20% of the acquired increased resistance conferred by Mycobacterium bovis BCG and also at least 20% of the acquired increased resistance conferred by the parent polypeptide comprising SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 (said parent polypeptide having substantially the same relative location and pattern in a 2DE gel prepared as the 2DE gel shown in FIG.
- the acquired increased resistance being assessed by the observed reduction in mycobacterial counts from spleen, lung or other organ homogenates isolated from the mouse or guinea pig receiving a challenge infection with a virulent strain of M. tuberculosis , or, in a primate such as a human being, being assessed by determining the protection against development of clinical tuberculosis in a vaccinated group versus that observed in a control group receiving a placebo or BCG (preferably the increased resistance is higher and corresponds to at least 50% of the protective immune response elicited by M. bovis BCG, such as at least 60%, or even more preferred to at least 80% of the protective immune response elicited by M.
- bovis BCG such as at least 90%; in some cases it is expected that the increased resistance will supersede that conferred by M. bovis BCG, and hence it is preferred that the resistance will be at least 100%, such as at least 110% of said increased resistance); and/or
- this diagnostically significant immune response can be in the form of a delayed type hypersensitivity reaction which can e.g. be determined by a skin test, or can be in the form of IFN- ⁇ release determined e.g. by an IFN- ⁇ assay as described in detail below.
- a diagnostically significant response in a skin test setup will be a reaction which gives rise to a skin reaction which is at least 5 mm in diameter and which is at least 65% (preferably at least 75% such as at the least 85%) of the skin reaction (assessed as the skin reaction diameter) elicited by the parent polypeptide comprising SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171.
- the ability of the polypeptide fragment to confer increased immunity may thus be assessed by measuring in an experimental animal, e.g. a mouse or a guinea pig, the reduction in mycobacterial counts from the spleen, lung or other organ homogenates isolated from the experimental animal which have received a challenge infection with a virulent strain of mycobacteria belonging to the tuberculosis complex after previously having been immunized with the polypeptide, as compared to the mycobacterial counts in a control group of experimental animals infected with the same virulent strain, which experimental animals have not previously been immunized against tuberculosis.
- the comparison of the mycobacterial counts may also be carried out with mycobacterial counts from a group of experimental animals receiving a challenge infection with the same virulent strain after having been immunized with Mycobacterium bovis BCG.
- the mycobacterial counts in homogenates from the experimental animals immunized with a polypeptide fragment according to the present invention must at the most be 5 times the counts in the mice or guinea pigs immunized with Mycobacterium bovis BCG, such as at the most 3 times the counts, and preferably at the most 2 times the counts.
- a more relevant assessment of the ability of the polypeptide fragment of the invention to confer increased resistance is to compare the incidence of clinical tuberculosis in two groups of individuals (e.g. humans or other primates) where one group receives a vaccine as described herein which contains an antigen of the invention and the other group receives either a placebo or an other known TB vaccine (e.g. BCG).
- the antigen of the invention should give rise to a protective immunity which is significantly higher than the one provided by the administration of the placebo (as determined by statistical methods known to the skilled artisan).
- the “tuberculosis-complex” has its usual meaning, i.e. the complex of mycobacteria causing TB which are Mycobacterium tuberculosis, Mycobacterium bovis, Mycobacterium bovis BCG, and Mycobacterium africanum.
- metabolicizing mycobacteria means live mycobacteria that are multiplying logarithmically and releasing polypeptides into the culture medium wherein they are cultured.
- sequence identity indicates a quantitative measure of the degree of homology between two amino acid sequences or between two nucleotide sequences of equal length: The sequence identity can be calculated as ( N ref - N dif ) ⁇ 100 N ref ,
- Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences.
- sequence identity is used here to illustrate the degree of identity between the amino acid sequence of a given polypeptide and the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171.
- amino acid sequence to be compared with the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 may be deduced from a DNA sequence, e.g. obtained by hybridization as defined below, or may be obtained by conventional amino acid sequencing methods.
- the sequence identity is preferably determined on the amino acid sequence of a mature polypeptide, i.e. without taking any leader sequence into consideration.
- polypeptides which are not identical to the polypeptides having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 are embraced by the present invention.
- the invention allows for minor variations which do not have an adverse effect on immunogenicity compared to the parent sequences and which may give interesting and useful novel binding properties or biological functions and immunogenicities etc.
- Each polypeptide fragment may thus be characterized by specific amino acid and nucleic acid sequences. It will be understood that such sequences include analogues and variants produced by recombinant methods wherein such nucleic acid and polypeptide sequences have been modified by substitution, insertion, addition and/or deletion of one or more nucleotides in said nucleic acid sequences to cause the substitution, insertion, addition or deletion of one or more amino acid residues in the recombinant polypeptide.
- DNA is used in the following, it should be understood that for the number of purposes where DNA can be substituted with RNA, the term DNA should be read to include RNA embodiments which will be apparent for the man skilled in the art.
- PNA may be used instead of DNA, as PNA has been shown to exhibit a very dynamic hybridization profile (PNA is described in Nielsen P E et al., 1991, Science 254: 1497-1500).
- antigens from segments of a known immunogenic protein or polypeptide. Certain epitopic regions may be used to produce responses similar to those produced by the entire antigenic polypeptide. Potential antigenic or immunogenic regions may be identified by any of a number of approaches, e.g., Jameson-Wolf or Kyte-Doolittle antigenicity analyses or Hopp and Woods (1981) hydrophobicity analysis (see, e.g., Jameson and Wolf, 1988; Kyte and Doolittle, 1982; or U.S. Pat. No. 4,554,101).
- Hydrophobicity analysis assigns average hydrophilicity values to each amino acid residue from these values average hydrophilicities can be calculated and regions of greatest hydrophilicity determined. Using one or more of these methods, regions of predicted antigenicity may be derived from the amino acid sequence assigned to the polypeptides of the invention.
- T-cell epitopes are linear, deletion mutants of polypeptides having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 will, if constructed systematically, reveal what regions of the polypeptides are essential in immune recognition, e.g.
- Another method utilises overlapping oligomers (preferably synthetic having a length of e.g. 20 amino acid residues) derived from polypeptides having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171. Some of these will give a positive response in the IFN- ⁇ assay whereas others will not.
- the polypeptide fragment of the invention comprises an epitope for a T-helper cell.
- the minimum length of a T-cell epitope has been shown to be at least 6 amino acids, it is normal that such epitopes are constituted of longer stretches of amino acids.
- the polypeptide fragment of the invention has a length of at least 7 amino acid residues, such as at least 8, at least 9, at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, and at least 30 amino acid residues.
- polypeptides of the invention are natively translation products which include a leader sequence (or other short peptide sequences), whereas the product which can be isolated from short-term culture filtrates from bacteria belonging to the tuberculosis complex are free of these sequences.
- a leader sequence or other short peptide sequences
- the product which can be isolated from short-term culture filtrates from bacteria belonging to the tuberculosis complex are free of these sequences.
- the polypeptide fragment of the invention is free from any signal sequence; this is especially interesting when the polypeptide fragment is produced synthetically but even when the polypeptide fragments are produced recombinantly it is normally acceptable that they are not exported by the host cell to the periplasm or the extracellular space; the polypeptide fragments can be recovered by traditional methods (cf. the discussion below) from the cytoplasm after disruption of the host cells, and if there is need for refolding of the polypeptide fragments, general refolding schemes can be employed, cf. e.g. the disclosure in WO 94/18227 where such a general applicable refolding method is described.
- a suitable assay for the potential utility of a given polypeptide fragment derived from SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 is to assess the ability of the polypeptide fragment to effect IFN- ⁇ release from primed memory T-lymphocytes.
- Polypeptide fragments which have this capability are according to the invention especially interesting embodiments of the invention: It is contemplated that polypeptide fragments which stimulate T lymphocyte immune response shortly after the onset of the infection are important in the control of the mycobacteria causing the infection before the mycobacteria have succeeded in multiplying up to the number of bacteria that would have resulted in fulminant infection.
- an important embodiment of the invention is a polypeptide fragment defined above which
- [0050] 1) induces a release of IFN- ⁇ from primed memory T-lymphocytes withdrawn from a mouse within 2 weeks of primary infection or within 4 days after the mouse has been rechallenge infected with mycobacteria belonging to the tuberculosis complex, the induction performed by the addition of the polypeptide to a suspension comprising about 200,000 spleen cells per ml, the addition of the polypeptide resulting in a concentration of 1-4 ⁇ g polypeptide per ml suspension, the release of IFN- ⁇ being assessable by determination of IFN- ⁇ in supernatant harvested 2 days after the addition of the polypeptide to the suspension, and/or
- [0051] 2) induces a release of IFN- ⁇ of at least 1,500 pg/ml above background level from about 1,000,000 human PBMC (peripheral blood mononuclear cells) per ml isolated from TB patients in the first phase of infection, or from healthy BCG vaccinated donors, or from healthy contacts to TB patients, the induction being performed by the addition of the polypeptide to a suspension comprising the about 1,000,000 PBMC per ml, the addition of the polypeptide resulting in a concentration of 1-4 ⁇ g polypeptide per ml suspension, the release of IFN- ⁇ being assessable by determination of IFN- ⁇ in supernatant harvested 2 days after the addition of the polypeptide to the suspension; and/or
- the release effected by the polypeptide fragment gives rise to at least 1,500 pg/ml IFN- ⁇ in the supernatant but higher concentrations are preferred, e.g. at least 2,000 pg/ml and even at least 3,000 pg/ml IFN- ⁇ in the supernatant.
- the IFN- ⁇ release from bovine PBMC can e.g. be measured as the optical density (OD) index over background in a standard cytokine ELISA and should thus be at least two, but higher numbers such as at least 3, 5, 8, and 10 are preferred.
- the polypeptide fragments of the invention preferably comprises an amino acid sequence of at least 6 amino acid residues in length which has a higher sequence identity than 70 percent with SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171.
- a preferred minimum percentage of sequence identity is at least 80%, such as at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, and at least 99.5%.
- the invention also pertains to a fusion polypeptide comprising at least one polypeptide fragment defined above and at least one fusion partner.
- the fusion partner can, in order to enhance immunogenicity, e.g.
- T-cell epitopes e.g. derived from the polypeptides ESAT-6, MPB64, MPT64, or MPB59
- immunogenic epitopes enhancing the immunogenicity of the target gene product, e.g. lymphokines such as INF- ⁇ , IL-2 and IL-12.
- the fusion partner can e.g. be a bacterial fimbrial protein, e.g. the pilus components pilin and papA; protein A; the ZZ-peptide (ZZ-fusions are marketed by Pharmacia in Sweden); the maltose binding protein; gluthatione S-transferase; ⁇ -galactosidase; or poly-histidine.
- polypeptides which are lipidated and thereby effect that the immunogenic polypeptide is presented in a suitable manner to the immune system.
- This effect is e.g. known from vaccines based on the Borrelia burgdorferi OspA polypeptide, wherein the lipidated membrane anchor in the polypeptide confers a self-adjuvating effect to the polypeptide (which is natively lipidated) when isolated from cells producing it.
- the OspA polypeptide is relatively silent immunologically when prepared without the lipidation anchor.
- Example 6A the fusion polypeptide consisting of MPT59 fused directly N-terminally to ESAT-6 enhances the immunogenicity of ESAT-6 beyond what would be expected from the immunogenicities of MPT59 and ESAT-6 alone.
- the precise reason for this surprising finding is not yet known, but it is expected that either the presence of both antigens lead to a synergistic effect with respect to immunogenicity or the presence of a sequence N-terminally to the ESAT-6 sequence protects this immune dominant protein from loss of important epitopes known to be present in the N-terminus.
- a third, alternative, possibility is that the presence of a sequence C-terminally to the MPT59 sequence enhances the immunologic properties of this antigen.
- one part of the invention pertains to a fusion polypeptide fragment which comprises a first amino acid sequence including at least one stretch of amino acids constituting a T-cell epitope derived from the M. tuberculosis protein ESAT-6 or MPT59, and a second amino acid sequence including at least one T-cell epitope derived from a M. tuberculosis protein different from ESAT-6 (if the first stretch of amino acids are derived from ESAT-6) or MPT59 (if the first stretch of amino acids are derived from MPT59) and/or including a stretch of amino acids which protects the first amino acid sequence from in vivo degradation or posttranslational processing.
- the first amino acid sequence may be situated N- or C-terminally to the second amino acid sequence, but in line with the above considerations regarding protection of the ESAT-6 N-terminus it is preferred that the first amino acid sequence is C-terminal to the second when the first amino acid sequence is derived from ESAT-6.
- a fusion polypeptide fragment according of the invention is one, wherein the at least one T-cell epitope included in the second amino acid sequence is derived from a M. tuberculosis polypeptide (the “parent” polypeptide) selected from the group consisting of a polypeptide fragment according to the present invention and described in detail above and in the examples, or the amino acid sequence could be derived from any one of the M.
- tuberculosis proteins DnaK, GroEL, urease, glutamine synthetase, the proline rich complex, L-alanine dehydrogenase, phosphate binding protein, Ag 85 complex, HBHA (heparin binding hemagglutinin), MPT51, MPT64, superoxide dismutase, 19 kDa lipoprotein, a-crystallin, GroES, MPT59 (when the first amino acid sequence is derived from ESAT-6), and ESAT-6 (when the first amino acid sequence is derived from MPT59).
- first and second T-cell pitopes each have a sequence identity of at least 70% with the natively occurring sequence in the proteins from which they are derived and it is even further preferred that the first and/or second amino acid sequence has a sequence identity of at least 70 % with the protein from which they are derived.
- a most preferred embodiment of this fusion polypeptide is one wherein the first amino acid sequence is the amino acid sequence of ESAT-6 or MPT59 and/or the second amino acid sequence is the full-length amino acid sequence of the possible “parent” polypeptides listed above.
- the fusion polypeptide fragment comprises ESAT-6 fused to MPT59 (advantageously, ESAT-6 is fused to the C-terminus of MPT59) and in one special embodiment, there are no linkers introduced between the two amino acid sequences constituting the two parent polypeptide fragments.
- Another part of the invention pertains to a nucleic acid fragment in isolated form which
- [0062] 1) comprises a nucleic acid sequence which encodes a polypeptide or fusion polypeptide as defined above, or comprises a nucleic acid sequence complementary thereto, and/or
- SEQ ID NO: 65 or a sequence complementary thereto
- SEQ ID NO: 87 or a sequence complementary.thereto,
- SEQ ID NO: 140 or a sequence complementary thereto
- SEQ ID NO: 148 or a sequence complementary thereto
- SEQ ID NO: 150 or a sequence complementary thereto
- SEQ ID NO: 152 or a sequence complementary thereto
- nucleic acid fragment comprises a subsequence of SEQ ID NO: 41
- nucleic acid fragment contains an A corresponding to position 781 in SEQ ID NO: 41
- nucleic acid fragment comprises a subsequence of a nucleotide sequence exactly complementary to SEQ ID NO: 41
- nucleic acid fragment comprises a T corresponding to position 781 in SEQ ID NO: 41.
- the nucleic acid fragment is a DNA fragment.
- the preferred nucleic acid sequence when employed for hybridization studies or assays includes sequences that are complementary to at least a 10 to 40, or so, nucleotide stretch of the selected sequence.
- a size of at least 10 nucleotides in length helps to ensure that the fragment will be of sufficient length to form a duplex molecule that is both stable and selective.
- Molecules having complementary sequences over stretches greater than 10 bases in length are generally preferred, though, in order to increase stability and selectivity of the hybrid, and thereby improve the quality and degree of specific hybrid molecules obtained.
- sequence when used in connection with the nucleic acid fragments of the invention is intended to indicate a continuous stretch of at least 10 nucleotides exhibits the above hybridization pattern. Normally this will require a minimum sequence identity of at least 70% with a subsequence of the hybridization partner having SEQ ID NO: 1, 3, 5, 7, 9, 11, 12, 15, 21, 41, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 87, 89, 91, 93, 140, 142, 144, 146, 148, 150, or 152.
- the nucleic acid fragment is longer than 10 nucleotides, such as at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, and at least 80 nucleotides long, and the sequence identity should preferable also be higher than 70%, such as at least 75%, at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 96%, and at least 98%. It is most preferred that the sequence identity is 100%.
- Such fragments may be readily prepared by, for example, directly synthesizing the fragment by chemical means, by application of nucleic acid reproduction technology, such as the PCR technology of U.S. Pat. No. 4,603,102, or by introducing selected sequences into recombinant vectors for recombinant production.
- nucleic acid fragment of the invention may be exchanged by others which, when expressed, result in a polypeptide identical or substantially identical to the polypeptide encoded by the nucleic acid fragment in question.
- the invention thus allows for variations in the sequence such as substitution, insertion (including introns), addition, deletion and rearrangement of one or more nucleotides, which variations do not have any substantial effect on the polypeptide encoded by the nucleic acid fragment or a subsequence thereof.
- substitution is intended to mean the replacement of one or more nucleotides in the full nucleotide sequence with one or more different nucleotides
- addition is understood to mean the addition of one or more nucleotides at either end of the full nucleotide sequence
- insertion is intended to mean the introduction of one or more nucleotides within the full nucleotide sequence
- deletion is intended to indicate that one or more nucleotides have been deleted from the full nucleotide sequence whether at either end of the sequence or at any suitable point within it
- “rearrangement” is intended to mean that two or more nucleotide residues have been exchanged with each other.
- the nucleotide sequence to be modified may be of cDNA or genomic origin as discussed above, but may also be of synthetic origin. Furthermore, the sequence may be of mixed cDNA and genomic, mixed cDNA and synthetic or genomic and synthetic origin as discussed above. The sequence may have been modified, e.g. by site-directed mutagenesis, to result in the desired nucleic acid fragment encoding the desired polypeptide.
- the following discussion focused on modifications of nucleic acid encoding the polypeptide should be understood to encompass also such possibilities, as well as the possibility of building up the nucleic acid by ligation of two or more DNA fragments to obtain the desired nucleic acid fragment, and combinations of the above-mentioned principles.
- nucleotide sequence may be modified using any suitable technique which results in the production of a nucleic acid fragment encoding a polypeptide of the invention.
- the modification of the nucleotide sequence encoding the amino acid sequence of the polypeptide of the invention should be one which does not impair the immunological function of the resulting polypeptide.
- a preferred method of preparing variants of the antigens disclosed herein is site-directed mutagenesis. This technique is useful in the preparation of individual peptides, or biologically functional equivalent proteins or peptides, derived from the antigen sequences, through specific mutagenesis of the underlying nucleic acid. The technique further provides a ready ability to prepare and test sequence variants, for example, incorporating one or more of the foregoing considerations, by introducing one or more nucleotide sequence changes into the nucleic acid.
- Site-specific mutagenesis allows the production of mutants through the use of specific oligonucleotide sequences which encode the nucleotide sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed.
- a primer of about 17 to 25 nucleotides in length is preferred, with about 5 to 10 residues on both sides of the junction of the sequence being altered.
- the technique of site-specific mutagenesis is well known in the art as exemplified by publications (Adelman et al., 1983). As will be appreciated, the technique typically employs a phage vector which exists in both a single stranded and double stranded form.
- Typical vectors useful in site-directed mutagenesis include vectors such as the M13 phage (Messing et al., 1981). These phage are readily commercially available and their use is generally well known to those skilled in the art.
- site-directed mutagenesis in accordance herewith is performed by first obtaining a single-stranded vector which includes within its sequence a nucleic acid sequence which encodes the polypeptides of the invention.
- An oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically, for example by the method of Crea et al. (1978).
- This primer is then annealed with the single-stranded vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment, in order to complete the synthesis of the mutation-bearing strand.
- DNA polymerizing enzymes such as E. coli polymerase I Klenow fragment
- sequence variants of the selected nucleic acid fragments of the invention using site-directed mutagenesis is provided as a means of producing potentially useful species of the genes and is not meant to be limiting as there are other ways in which sequence variants of the nucleic acid fragments of the invention may be obtained.
- recombinant vectors encoding the desired genes may be treated with mutagenic agents to obtain sequence variants (see, e.g., a method described by Eichenlaub, 1979) for the mutagenesis of plasmid DNA using hydroxylamine.
- the invention also relates to a replicable expression vector which comprises a nucleic acid fragment defined above, especially a vector which comprises a nucleic acid fragment encoding a polypeptide fragment of the invention.
- the vector may be any vector which may conveniently be subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced.
- the vector may be an autonomously replicating vector, i.e. a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication; examples of such a vector are a plasmid, phage, cosmid, mini-chromosome or virus.
- the vector may be one which, when introduced in a host cell, is integrated in the host cell genome and replicated together with the chromosome(s) into which it has been integrated.
- the vector of the invention may be used to transform cells so as to allow propagation of the nucleic acid fragments of the invention or so as to allow expression of the polypeptide fragments of the invention.
- the invention also pertains to a transformed cell harbouring at least one such vector according to the invention, said cell being one which does not natively harbour the vector and/or the nucleic acid fragment of the invention contained therein.
- a transformed cell (which is also a part of the invention) may be any suitable bacterial host cell or any other type of cell such as a unicellular eukaryotic organism, a fungus or yeast, or a cell derived from a multicellular organism, e.g. an animal or a plant.
- a mammalian cell is used, although glycosylation of proteins is a rare event in prokaryotes.
- a prokaryotic cell is preferred such as a bacterium belonging to the genera Mycobacterium, Salmonella, Pseudomonas, Bacillus and Eschericia.
- the transformed cell is an E. coli, B. subtilis , or M. bovis BCG cell, and it is especially preferred that the transformed cell expresses a polypeptide according of the invention. The latter opens for the possibility to produce the polypeptide of the invention by simply recovering it from the culture containing the transformed cell.
- the transformed cell is Mycobacterium bovis BCG strain: Danish 1331, which is the Mycobacterium bovis strain Copenhagen from the Copenhagen BCG Laboratory, Statens Seruminstitut, Denmark.
- nucleic acid fragments of the invention allow for the recombinant production of the polypeptides fragments of the invention. However, also isolation from the natural source is a way of providing the polypeptide fragments as is peptide synthesis.
- the invention also pertains to a method for the preparation of a polypeptide fragment of the invention, said method comprising inserting a nucleic acid fragment as defined above into a vector which is able to replicate in a host cell, introducing the resulting recombinant vector into the host cell (transformed cells may be selected using various techniques, including screening by differential hybridization, identification of fused reporter gene products, resistance markers, anti-antigen antibodies and the like), culturing the host cell in a culture medium under conditions sufficient to effect expression of the polypeptide (of course the cell may be cultivated under conditions appropriate to the circumstances, and if DNA is desired, replication conditions are used), and recovering the polypeptide from the host cell or culture medium; or
- the medium used to grow the transformed cells may be any conventional medium suitable for the purpose.
- a suitable vector may be any of the vectors described above, and an appropriate host cell may be any of the cell types listed above.
- the methods employed to construct the vector and effect introduction thereof into the host cell may be any methods known for such purposes within the field of recombinant DNA. In the following a more detailed description of the possibilities will be given:
- prokaryotes are preferred for the initial cloning of nucleic sequences of the invention and constructing the vectors useful in the invention.
- prokaryotes are preferred for the initial cloning of nucleic sequences of the invention and constructing the vectors useful in the invention.
- strains such as E. coli K12 strain 294 (ATCC No. 31446), E. coli B, and E. coli X 1776 (ATCC No. 31537). These examples are, of course, intended to be illustrative rather than limiting.
- Prokaryotes are also preferred for expression.
- the aforementioned strains, as well as E. coli W3110 (F-, lambda-, prototrophic, ATCC No. 273325), bacilli such as Bacillus subtilis , or other enterobacteriaceae such as Salmonella typhimurium or Serratia marcesans , and various Pseudomonas species may be used.
- rapid-growing mycobacteria e.g. M. smegmatis , as these bacteria have a high degree of resemblance with mycobacteria of the tuberculosis complex and therefore stand a good chance of reducing the need of performing post-translational modifications of the expression product.
- plasmid vectors containing replicon and control sequences which are derived from species compatible with the host cell are used in connection with these hosts.
- the vector ordinarily carries a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells.
- E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species (see, e.g., Bolivar et al., 1977, Gene 2: 95).
- the pBR322 plasmid contains genes for ampicillin and tetracycline resistance and thus provides easy means for identifying transformed cells.
- the pBR plasmid, or other microbial plasmid or phage must also contain, or be modified to contain, promoters which can be used by the microorganism for expression.
- Those promoters most commonly used in recombinant DNA construction include the B-lactamase (penicillinase) and lactose promoter systems (Chang et al., 1978; Itakura et al., 1977; Goeddel et al., 1979) and a tryptophan (trp) promoter system (Goeddel et al., 1979; EPO Appl. Publ. No. 0036776).
- the isolation of the polypeptide may for instance be carried out by affinity chromatography (or other conventional biochemical procedures based on chromatography), using a monoclonal antibody which substantially specifically binds the polypeptide according to the invention.
- affinity chromatography or other conventional biochemical procedures based on chromatography
- Another possibility is to employ the simultaneous electroelution technique described by Andersen et al. in J. Immunol. Methods 161: 29-39.
- the post-translational modifications involves lipidation, glycosylation, cleavage, or elongation of the polypeptide.
- the DNA sequence information provided by this invention allows for the preparation of relatively short DNA (or RNA or PNA) sequences having the ability to specifically hybridize to mycobacterial gene sequences.
- nucleic acid probes of an appropriate length are prepared based on a consideration of the relevant sequence. The ability of such nucleic acid probes to specifically hybridize to the mycobacterial gene sequences lend them particular utility in a variety of embodiments.
- the probes can be used in a variety of diagnostic assays for detecting the presence of pathogenic organisms in a given sample. However, either uses are envisioned, including the use of the sequence information for the preparation of mutant species primers, or primers for use in preparing other genetic constructs.
- nucleic acid fragments of the invention may be used for effecting in vivo expression of antigens, i.e. the nucleic acid fragments may be used in so-called DNA vaccines.
- DNA vaccines a DNA fragment cloned in a vector which is non-replicative in eukaryotic cells may be introduced into an animal (including a human being) by e.g. intramuscular injection or percutaneous administration (the so-called “gene gun” approach). The DNA is taken up by e.g.
- muscle cells and the gene of interest is expressed by a promoter which is functioning in eukaryotes, e.g. a viral promoter, and the gene product thereafter stimulates the immune system.
- a promoter which is functioning in eukaryotes, e.g. a viral promoter, and the gene product thereafter stimulates the immune system.
- the invention also relates to a vaccine comprising a nucleic acid fragment according to the invention, the vaccine effecting in vivo expression of antigen by an animal, including a human being, to whom the vaccine has been administered, the amount of expressed antigen being effective to confer substantially increased resistance to infections with mycobacteria of the tuberculosis complex in an animal, including a human being.
- the efficacy of such a “DNA vaccine” can possibly be enhanced by administering the gene encoding the expression product together with a DNA fragment encoding a polypeptide which has the capability of modulating an immune response.
- a gene encoding lymphokine precursors or lymphokines e.g. IFN- ⁇ , IL-2, or IL-12
- lymphokines could be administered together with the gene encoding the immunogenic protein, either by administering two separate DNA fragments or by administering both DNA fragments included in the same vector.
- DNA fragments comprising a multitude of nucleotide sequences which each encode relevant epitopes of the polypeptides disclosed herein so as to effect a continuous sensitization of the immune system with a broad spectrum of these epitopes.
- polypeptide fragments of the invention are excellent candidates for vaccine constituents or for constituents in an immune diagnostic agent due to their extracellular presence in culture media containing metabolizing virulent mycobacteria belonging to the tuberculosis complex, or because of their high homologies with such extracellular antigens, or because of their absence in M. bovis BCG.
- an immunologic composition comprising a polypeptide or fusion polypeptide according to the invention.
- it is preferred that it comprises an immunologically and pharmaceutically acceptable carrier, vehicle or adjuvant.
- Suitable carriers are selected from the group consisting of a polymer to which the polypeptide(s) is/are bound by hydrophobic non-covalent interaction, such as a plastic, e.g. polystyrene, or a polymer to which the polypeptide(s) is/are covalently bound, such as a polysaccharide, or a polypeptide, e.g. bovine serum albumin, ovalbumin or keyhole limpet haemocyanin.
- Suitable vehicles are selected from the group consisting of a diluent and a suspending agent.
- the adjuvant is preferably selected from the group consisting of dimethyldioctadecylammonium bromide (DDA), Quil A, poly I:C, Freund's incomplete adjuvant, IFN- ⁇ , IL-2, IL-12, monophosphoryl lipid A (MPL), and muramyl dipeptide (MDP).
- DDA dimethyldioctadecylammonium bromide
- Quil A Quil A
- poly I:C poly I:C
- Freund's incomplete adjuvant IFN- ⁇
- IL-2 IL-2
- IL-12 monophosphoryl lipid A
- MPL monophosphoryl lipid A
- MDP muramyl dipeptide
- a preferred immunologic composition according to the present invention comprising at least two different polypeptide fragments, each different polypeptide fragment being a polypeptide or a fusion polypeptide defined above. It is preferred that the immunologic composition comprises between 3-20 different polypeptide fragments or fusion polypeptides.
- Such an immunologic composition may preferably be in the form of a vaccine or in the form of a skin test reagent.
- the invention therefore also pertain to a method for producing an immunologic composition according to the invention, the method comprising preparing, synthesizing or isolating a polypeptide according to the invention, and solubilizing or dispersing the polypeptide in a medium for a vaccine, and optionally adding other M. tuberculosis antigens and/or a carrier, vehicle and/or adjuvant substance.
- vaccines which contain peptide sequences as active ingredients are generally well understood in the art, as exemplified by U.S. Pat. Nos. 4,608,251; 4,601,903; 4,599,231; 4,599,230; 4,596,792; and 4,578,770, all incorporated herein by reference.
- such vaccines are prepared as injectables either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared.
- the preparation may also be emulsified.
- the active immunogenic ingredient is often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient.
- Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof.
- the vaccine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the effectiveness of the vaccines.
- the vaccines are conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly.
- Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, oral formulations.
- suppositories traditional binders and carriers may include, for example, polyalkalene glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10%, preferably 1-2%.
- Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or powders and contain 10-95% of active ingredient, preferably 25-70%.
- the proteins may be formulated into the vaccine as neutral or salt forms.
- Pharmaceutically acceptable salts include acid addition salts (formed with the free amino groups of the peptide) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups may also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.
- the vaccines are administered in a manner compatible with the dosage formulation, and in such amount as will be therapeutically effective and immunogenic.
- the quantity to be administered depends on the subject to be treated, including, e.g., the capacity of the individual's immune system to mount an immune response, and the degree of protection desired.
- Suitable dosage ranges are of the order of several hundred micrograms active ingredient per vaccination with a preferred range from about 0.1 ⁇ g to 1000 ⁇ g, such as in the range from about 1 ⁇ g to 300 ⁇ g, and especially in the range from about 10 ⁇ g to 50 ⁇ g.
- Suitable regimens for initial administration and booster shots are also variable but are typified by an initial administration followed by subsequent inoculations or other administrations.
- the manner of application may be varied widely. Any of the conventional methods for administration of a vaccine are applicable. These are believed to include oral application on a solid physiologically acceptable base or in a physiologically acceptable dispersion, parenterally, by injection or the like.
- the dosage of the vaccine will depend on the route of administration and will vary according to the age of the person to be vaccinated and, to a lesser degree, the size of the person to be vaccinated.
- polypeptides of the vaccine are sufficiently immunogenic in a vaccine, but for some of the others the immune response will be enhanced if the vaccine further comprises an adjuvant substance.
- Various methods of achieving adjuvant effect for the vaccine include use of agents such as aluminum hydroxide or phosphate (alum), commonly used as 0.05 to 0.1 percent solution in phosphate buffered saline, admixture with synthetic polymers of sugars (Carbopol) used as 0.25 percent solution, aggregation of the protein in the vaccine by heat treatment with temperatures ranging between 70° to 101° C. for 30 second to 2 minute periods respectively. Aggregation by reactivating with pepsin treated (Fab) antibodies to albumin, mixture with bacterial cells such as C.
- agents such as aluminum hydroxide or phosphate (alum), commonly used as 0.05 to 0.1 percent solution in phosphate buffered saline, admixture with synthetic polymers of sugars (Carbopol) used as 0.25 percent solution, aggregation of the protein in the vaccine by heat treatment with temperatures ranging between 70° to 101° C. for 30 second to 2 minute periods respectively. Aggregation by reactivating with pepsin treated (
- parvum or endotoxins or lipopolysaccharide components of gram-negative bacteria emulsion in physiologically acceptable oil vehicles such as mannide monooleate (Aracel A) or emulsion with 20 percent solution of a perfluorocarbon (Fluosol-DA) used as a block substitute may also be employed.
- DDA dimethyldioctadecylammonium bromide
- MPL monophosphoryl lipid A
- MDP muramyl dipeptide
- Another highly interesting (and thus, preferred) possibility of achieving adjuvant effect is to employ the technique described in Gosselin et al., 1992 (which is hereby incorporated by reference herein).
- the presentation of a relevant antigen such as an antigen of the present invention can be enhanced by conjugating the antigen to antibodies (or antigen binding antibody fragments) against the Fc ⁇ receptors on monocytes/macrophages.
- a relevant antigen such as an antigen of the present invention
- conjugates between antigen and anti-Fc ⁇ RI have been demonstrated to enhance immunogenicity for the purposes of vaccination.
- lymphokines e.g. IFN- ⁇ , IL-2 and IL-12
- synthetic IFN- ⁇ inducers such as poly I:C in combination with the above-mentioned adjuvants.
- lymphokines e.g. IFN- ⁇ , IL-2 and IL-12
- synthetic IFN- ⁇ inducers such as poly I:C
- the vaccine will be necessary to have multiple administrations of the vaccine, usually not exceeding six vaccinations, more usually not exceeding four vaccinations and preferably one or more, usually at least about three vaccinations.
- the vaccinations will normally be at from two to twelve week intervals, more usually from three to five week intervals. Periodic boosters at intervals of 1-5 years, usually three years, will be desirable to maintain the desired levels of protective immunity.
- the course of the immunization may be followed by in vitro proliferation assays of PBL (peripheral blood lymphocytes) co-cultured with ESAT-6 or ST-CF, and especially by measuring the levels of IFN- ⁇ released form the primed lymphocytes.
- PBL peripheral blood lymphocytes
- the assays may be performed using conventional labels, such as radionuclides, enzymes, fluorescers, and the like. These techniques are well known and may be found in a wide variety of patents, such as U.S. Pat. Nos. 3,791,932; 4,174,384 and 3,949,064, as illustrative of these types of assays.
- the vaccine according to the invention may comprise several different polypeptides in order to increase the immune response.
- the vaccine may comprise two or more polypeptides, where all of the polypeptides are as defined above, or some but not all of the peptides may be derived from a bacterium belonging to the M. tuberculosis complex.
- the polypeptides not necessarily fulfilling the criteria set forth above for polypeptides may either act due to their own immunogenicity or merely act as adjuvants.
- Examples of such interesting polypeptides are MPB64, MPT64, and MPB59, but any other substance which can be isolated from mycobacteria are possible candidates.
- the vaccine may comprise 3-20 different polypeptides, such as 3-10 different polypeptides.
- Another important aspect of the present invention is an improvement of the living BCG vaccine presently available, which is a vaccine for immunizing an animal, including a human being, against TB caused by mycobacteria belonging to the tuberculosis-complex, comprising as the effective component a microorganism, wherein one or more copies of a DNA sequence encoding a polypeptide as defined above has been incorporated into the genome of the microorganism in a manner allowing the microorganism to express and secrete the polypeptide.
- the term “genome” refers to the chromosome of the microorganisms as well as extrachromosomally DNA or RNA, such as plasmids. It is, however, preferred that the DNA sequence of the present invention has been introduced into the chromosome of the non-pathogenic microorganism, since this will prevent loss of the genetic material introduced.
- the non-pathogenic microorganism is a bacterium, e.g. selected from the group consisting of the genera Mycobacterium, Salmonella, Pseudomonas and Eschericia.
- the non-pathogenic microorganism is Mycobacterium bovis BCG, such as Mycobacterium bovis BCG strain: Danish 1331.
- incorporation of one or more copies of a nucleotide sequence encoding the polypeptide according to the invention in a mycobacterium from a M. bovis BCG strain will enhance the immunogenic effect of the BCG strain.
- the incorporation of more than one copy of a nucleotide sequence of the invention is contemplated to enhance the immune response even more, and consequently an aspect of the invention is a vaccine wherein at least 2 copies of a DNA sequence encoding a polypeptide is incorporated in the genome of the microorganism, such as at least 5 copies.
- the copies of DNA sequences may either be identical encoding identical polypeptides or be variants of the same DNA sequence encoding identical or homologues of a polypeptide, or in another embodiment be different DNA sequences encoding different polypeptides where at least one of the polypeptides is according to the present invention.
- the living vaccine of the invention can be prepared by cultivating a transformed non-pathogenic cell according to the invention, and transferring these cells to a medium for a vaccine, and optionally adding a carrier, vehicle and/or adjuvant substance.
- the invention also relates to a method of diagnosing TB caused by Mycobacterium tuberculosis, Mycobacterium africanum or Mycobacterium bovis in an animal, including a human being, comprising intradermally injecting, in the animal, a polypeptide according to the invention or a skin test reagent described above, a positive skin response at the location of injection being indicative of the animal having TB, and a negative skin response at the location of injection being indicative of the animal not having TB.
- a positive response is a skin reaction having a diameter of at least 5 mm, but larger reactions are preferred, such as at least 1 cm, 1.5 cm, and at least 2 cm in diameter.
- the composition used as the skin test reagent can be prepared in the same manner as described for the vaccines above.
- the invention also pertains to a method for immunising an animal, including a human being, against TB caused by mycobacteria belonging to the tuberculosis complex, comprising administering to the animal the polypeptide of the invention, or a vaccine composition of the invention as described above, or a living vaccine described above.
- Preferred routes of administration are the parenteral (such as intravenous and intraarterially), intraperitoneal, intramuscular, subcutaneous, intradermal, oral, buccal, sublingual, nasal, rectal or transdermal route.
- ESAT-6 which is present in short-term culture filtrates from mycobacteria as well as the esat-6 gene in the mycobacterial genome has been demonstrated to have a very limited distribution in other mycobacterial strains that M. tuberculosis , e.g. esat-6 is absent in both BCG and the majority of mycobacterial species isolated from the environment, such as M. avium and M. terrae .
- the diagnostic embodiments of the invention are especially well-suited for performing the diagnosis of on-going or previous infection with virulent mycobacterial strains of the tuberculosis complex, and it is contemplated that it will be possible to distinguish between 1) subjects (animal or human) which have been previously vaccinated with e.g. BCG vaccines or subjected to antigens from non-virulent mycobacteria and 2) subjects which have or have had active infection with virulent mycobacteria.
- a blood sample comprising mononuclear cells (i.a. T-lymphocytes) from a patient could be contacted with a sample of one or more polypeptides of the invention.
- This contacting can be performed in vitro and a positive reaction could e.g. be proliferation of the T-cells or release cytokines such as ⁇ -interferon into the extracellular phase (e.g. into a culture supernatant); a suitable in vivo test would be a skin test as described above.
- a serum sample from a subject to contact with a polypeptide of the invention the demonstration of a binding between antibodies in the serum sample and the polypeptide being indicative of previous or ongoing infection.
- the invention therefore also relates to an in vitro method for diagnosing ongoing or previous sensitization in an animal or a human being with bacteria belonging to the tuberculosis complex, the method comprising providing a blood sample from the animal or human being, and contacting the sample from the animal with the polypeptide of the invention, a significant release into the extracellular phase of at least one cytokine by mononuclear cells in the blood sample being indicative of the animal being sensitized.
- a significant release is herein meant that the release of the cytokine is significantly higher than the cytokine release from a blood sample derived from a non-tuberculous subject (e.g. a subject which does not react in a traditional skin test for TB). Normally, a significant release is at least two times the release observed from such a sample.
- a sample of a possibly infected organ may be contacted with an antibody raised against a polypeptide of the invention.
- the demonstration of the reaction by means of methods well-known in the art between the sample and the antibody will be indicative of ongoing infection. It is of course also a possibility to demonstrate the presence of anti-mycobacterial antibodies in serum by contacting a serum sample from a subject with at least one of the polypeptide fragments of the invention and using well-known methods for visualizing the reaction between the antibody and antigen.
- a method of determining the presence of mycobacterial nucleic acids in an animal comprising administering a nucleic acid fragment of the invention to the animal or incubating the sample with the nucleic acid fragment of the invention or a nucleic acid fragment complementary thereto, and detecting the presence of hybridized nucleic acids resulting from the incubation (by using the hybridization assays which are well-known in the art), is also included in the invention.
- Such a method of diagnosing TB might involve the use of a composition comprising at least a part of a nucleotide sequence as defined above and detecting the presence of nucleotide sequences in a sample from the animal or human being to be tested which hybridize with the nucleic acid fragment (or a complementary fragment) by the use of PCR technique.
- antigens are not present in M. bovis BCG but are present in virulent mycobacteria point them out as interesting drug targets; the antigens may constitute receptor molecules or toxins which facilitate the infection by the mycobacterium, and if such functionalities are blocked the infectivity of the mycobacterium will be diminshed.
- the gene encoding at least one of the polypeptides of the invention and the necessary control sequences can be introduced into avirulent strains of mycobacteria (e.g. BCG) so as to determine which of the polypeptides are critical for virulence.
- mycobacteria e.g. BCG
- anti-mycobacterial agents can be designed rationally to inhibit expression of the critical genes or to attack the critical gene products.
- antibodies or fragments thereof such as Fab and (Fab′) 2 fragments can be prepared against such critical polypeptides by methods known in the art and thereafter used as prophylactic or therapeutic agents.
- small molecules can be screened for their ability to selectively inhibit expression of the critical gene products, e.g. using recombinant expression systems which include the gene's endogenous promoter, or for their ability to directly interfere with the action of the target. These small molecules are then used as therapeutics or as prophylactic agents to inhibit mycobacterial virulence.
- anti-mycobacterial agents which render a virulent mycobacterium avirulent can be operably linked to expression control sequences and used to transform a virulent mycobacterium.
- Such anti-mycobacterial agents inhibit the replication of a specified mycobacterium upon transcription or translation of the agent in the mycobacterium.
- Such a “newly avirulent” mycobacterium would constitute a superb alternative to the above described modified BCG for vaccine purposes since it would be immunologically very similar to a virulent mycobacterium compared to e.g. BCG.
- a monoclonal or polyclonal antibody which is specifically reacting with a polypeptide of the invention in an immuno assay, or a specific binding fragment of said antibody, is also a part of the invention.
- the production of such polyclonal antibodies requires that a suitable animal be immunized with the polypeptide and that these antibodies are subsequently isolated, suitably by immune affinity chromatography.
- the production of monoclonals can be effected by methods well-known in the art, since the present invention provides for adequate amounts of antigen for both immunization and screening of positive hybridomas.
- FIG. 1 Long term memory immune mice are very efficiently protected towards an infection with M. tuberculosis . Mice were given a challenge of M. tuberculosis and spleens were isolated at different time points. Spleen lymphocytes were stimulated in vitro with ST-CF and the release of IFN- ⁇ investigated (panel A). The counts of CFU in the spleens of the two groups of mice are indicated in panel B. The memory immune mice control infection within the first week and produce large quantities of IFN- ⁇ in response to antigens in ST-CF.
- FIG. 2 T cells involved in protective immunity are predominantly directed to molecules from 6-12 and 17-38 kDa. Splenic T cells were isolated four days after the challenge with M. tuberculosis and stimulated in vitro with narrow molecular mass fractions of ST-CF. The release of IFN- ⁇ was investigated
- FIG. 3 Nucleotide sequence (SEQ ID NO: 1) of cfp7.
- the deduced amino acid sequence (SEQ ID NO: 2) of CFP7 is given in conventional one-letter code below the nucleotide sequence.
- the putative ribosome-binding site is written in underlined italics as are the putative ⁇ 10 and ⁇ 35 regions. Nucleotides written in bold are those encoding CFP7.
- FIG. 4 Nucleotide sequence (SEQ ID NO: 3) of cfp9.
- the deduced amino acid sequence (SEQ ID NO: 4) of CFP9 is given in conventional one-letter code below the nucleotide sequence.
- the putative ribosome-binding site Shine Delgarno sequence is written in underlined italics as are the putative ⁇ 10 and ⁇ 35 regions. Nucleotides in bold writing are those encoding CFP9.
- the nucleotide sequence obtained from the lambda 226 phage is double underlined.
- FIG. 5 Nucleotide sequence of mpt51.
- the deduced amino acid sequence of MPT51 is given in a one-letter code below the nucleotide sequence.
- the signal is indicated in italics the putative potential ribosome-binding site is underlined.
- the nucleotide difference and amino acid difference compared to the nucleotide sequence of MPB51 are underlined at position 780.
- the nucleotides given in italics are not present in M. tuberculosis H37Rv.
- FIG. 6 the position of the purified antigens in the 2DE system have been determined and mapped in a reference gel. The newly purified antigens are encircled and the position of well-known proteins are also indicated.
- mice A group of efficiently protected mice was generated by infecting 8-12 weeks old female C57Bl/6j mice with 5 ⁇ 10 4 M. tuberculosis i.v. After 30 days of infection the mice were subjected to 60 days of antibiotic treatment with isoniazid and were then left for 200-240 days to ensure the establishment of resting long-term memory immunity. Such memory immune mice are very efficiently protected against a secondary infection (FIG. 1). Long lasting immunity in this model is mediated by a population of highly reactive CD4 cells recruited to the site of infection and triggered to produce large amounts of IFN- ⁇ in response to ST-CF (FIG. 1) (Andersen et al. 1995).
- example 1 it was demonstrated that antigens in the low molecular mass fraction are recognized strongly by cells isolated from memory immune mice. Monoclonal antibodies (mAbs) to these antigens were therefore generated by immunizing with the low mass fraction in RIBI adjuvant (first and second immunization) followed by two injections with the fractions in aluminium hydroxide. Fusion and cloning of the reactive cell lines were done according to standard procedures (Kohler and Milstein 1975).
- the plates were overlaid with sheets of nitrocellulose saturated with isopropyl- ⁇ -D-thiogalactopyranoside and incubation was continued for 2,5 hours at 370C.
- the nitrocellulose was removed and incubated with samples of the monoclonal antibodies in PBS with Tween 20 added to a final concentration of 0.05%.
- Bound monoclonal antibodies were visualized by horseradish peroxidase-conjugated rabbit anti-mouse immunoglobulins (P260, Dako, Glostrup, DK) and a staining reaction involving 5,5′,3,3′-tetramethylbenzidine and H 2 O 2 .
- AA226 (expressing ST-3 reactive polypeptide CFP9) which has been deposited 28 Jun. 1993 with the collection of Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM) under the accession number DSM 8377 and in accordance with the provisions of the Budapest Treaty, and
- AA242 (expressing PV-2 reactive polypeptide CFP7) which has been deposited 28 Jun. 1993 with the collection of Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM) under the accession number DSM 8379 and in accordance with the provisions of the Budapest Treaty.
- DSM Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH
- the st-3 binding protein is expressed as a protein fused to ⁇ -galactosidase, whereas the pv-2 binding protein appears to be expressed in an unfused version.
- ORF open reading frame encoding a sequence of 96 amino acid residues was identified from an ATG start codon at position 91-93 extending to a TAG stop codon at position 379-381.
- the deduced amino acid sequence is shown in SEQ ID NO: 2 (and in FIG. 3 where conventional one-letter amino acid codes are used).
- CFP7 appear to be expressed in E. coli as an unfused version.
- the nucleotide sequence at position 78-84 is expected to be the Shine Delgarno sequence and the sequences from position 47-50 and 14-19 are expected to be the ⁇ 10 and -35 regions, respectively:
- the protein recognised by ST-3 was produced as a ⁇ -galactosidase fusion protein, when expressed from the AA226 lambda phage.
- the fusion protein had an approx. size of 116-117 kDa (Mw for ⁇ -galactosidase 116.25 kDa) which may suggest that only part of the CFP9 gene was included in the lambda clone (AA226).
- nucleotide sequence at position 123-130 is expected to be the Shine Delgarno sequence and the sequences from position 73-78 and 4-9 are expected to be the ⁇ 10 and -35 region respectively (FIG. 4).
- the ORF overlapping with the 5′-end of the sequence of AA229 is shown in FIG. 4 by double underlining.
- the PCR amplification was carried out in a thermal reactor (Rapid cycler, Idaho Technology, Idaho) by mixing 10 ng plasmid DNA with the mastermix (0.5 ⁇ M of each oligonucleotide primer, 0.25 ⁇ M BSA (Stratagene), low salt buffer (20 mM Tris-HCl, pH 8.8, 10 mM KCl, 10 mM (NH4) 2 SO 4 , 2 mM MgSO 4 and 0,1% Triton X-100) (Stratagene), 0.25 nM of each deoxynucleoside triphosphate and 0.5 U Taq Plus Long DNA polymerase (Stratagene)).
- Final volume was 10 ⁇ l (all concentrations given are concentrations in the final volume).
- Predenaturation was carried out at 94° C. for 30 s. 30 cycles of the following was performed; Denaturation at 94° C. for 30 s, annealing at 55° C. for 30 s and elongation at 72° C. for 1 min.
- oligonucleotide primers were synthesised automatically on a DNA synthesizer (Applied Biosystems, Forster City, Calif., ABI-391, PCR-mode), deblocked, and purified by ethanol precipitation.
- the cfp7 oligonucleotides (TABLE 1) were synthesised on the basis of the nucleotide sequence from the CFP7 sequence (FIG. 3).
- the oligonucleotides were engineered to include an SmaI restriction enzyme site at the 5′ end and a BamHI restriction enzyme site at the 3′ end for directed subcloning.
- the cfp9 oligonucleotides TABLE 1
- oligonucleotides were synthesized partly on the basis of the nucleotide sequence from the sequence of the AA229 clone and partly from the identical sequence found in the Sanger database cosmid MTCY48 (FIG. 4).
- the oligonucleotides were engineered to include a SmaI restriction enzyme site at the 5′ end and a HindIII restriction enzyme site at the 3′ end for directed subcloning.
- IPTG isopropyl- ⁇ -D-thiogalactoside
- a final concentration of 2 mM expression was regulated either by the strong IPTG inducible Ptac or the T5 promoter
- the cells were harvested by centrifugation at 4,200 ⁇ g at 4° C. for 8 min.
- the pelleted bacteria were stored overnight at ⁇ 20° C.
- the pellet was resuspended in BC 40/100 buffer (20 mM Tris-HCl pH 7.9, 20% glycerol, 100 mM KCl, 40 mM Imidazole) and cells were broken by sonication (5 times for 30 s with intervals of 30 s) at 4° C. followed by centrifugation at 12,000 ⁇ g for 30 min at 4° C., the supernatant (crude extract) was used for purification of the recombinant antigens.
- the two Histidine fusion proteins (His-rCFP7 and His-rCFP9) were purified from the crude extract by affinity chromatography on a Ni 2+ -NTA column from QIAGEN with a volume of 100 ml. His-rCFP7 and His-rCFP9 binds to Ni 2+ . After extensive washes of the column in BC 40/100 buffer, the fusion protein was eluted with a BC 1000/100 buffer containing 100 mM imidazole, 20 mM Tris pH 7.9, 20% glycerol and 1 M KCl. subsequently, the purified products were dialysed extensively against 10 mM Tris pH 8.0.
- His-rCFP7 and His-rCFP9 were then separated from contaminants by fast protein liquid chromatography (FPLC) over an anion-exchange column (Mono Q, Pharmacia, Sweden). in 10 mM Tris pH 8.0 with a linear gradient of NaCl from 0 to 1 M. Aliquots of the fractions were analyzed by 10%-20% gradient sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE). Fractions containing purified either purified His-rCFP7 or His-rCFP9 were pooled. TABLE 1 Sequence of the cfp7 and cfp9 oligonucleotides a .
- oligonucleotide Sequences (5′ ⁇ 3′) (nucleotide) Sense pvR3 GCAACACCCGGG ATGTCGCAAATCATG 91-105 (SEQ ID NO: 43) (SEQ ID NO: 1) stR2 GTAACACCCGGG GTGGCCGCCGACCCG 141-155 (SEQ ID NO: 44) (SEQ ID NO: 3) Antisense pvF4 CTACTAAGCTTGGATCC CTAGCCGCCCCATTTGGCGG 381-362 (SEQ ID NO: 45) (SEQ ID NO: 1) stF2 CTACTAAGCTTCCATGG TCAGGTCTTTTCGATGCTTAC 467-447 (SEQ ID NO: 46) (SEQ ID NO: 3)
- BCG Bacillus Calmette-Guérin
- the first mutations described are the loss of the gene encoding MPB64 in some BCG strains (Li et al., 1993, Oettinger and Andersen, 1994) and the gene encoding ESAT-6 in all BCG strain tested (Harboe et al., 1996), later 3 large deletions in BCG have been identified (Mahairas et al., 1996).
- the region named RD1 includes the gene encoding ESAT-6 and an other (RD2) the gene encoding MPT64. Both antigens have been shown to have diagnostic potential and ESAT-6 has been shown to have properties as a vaccine candidate (cf.
- the predicted ORFs include ESAT-6 (RD1-ORF7) and CFP10 (RD1-ORF6) described previously (Sorensen et al., 1995), as a positive control for the ability of the algorithm. In the present is described the potential of 7 of the predicted antigens for diagnosis of TB as well as potential as candidates for a new vaccine against TB.
- the nucleotide sequence of rd1-orf2 from M. tuberculosis H37Rv is set forth in SEQ ID NO: 71.
- the deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 72.
- nucleotide sequence of rd1-orf3 from M. tuberculosis H37Rv is set forth in SEQ ID NO: 87.
- the deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 88.
- nucleotide sequence of rd1-orf4 from M. tuberculosis H37Rv is set forth in SEQ ID NO: 89.
- the deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 90.
- nucleotide sequence of rd1-orf5 from M. tuberculosis H37Rv is set forth in SEQ ID NO: 91.
- the deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 92.
- nucleotide sequence of rd1-orf8 from M. tuberculosis H37Rv is set forth in SEQ ID NO: 67.
- the deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 68.
- nucleotide sequence of rd1-orf9a from M. tuberculosis H37Rv is set forth in SEQ ID NO: 93.
- the deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 94.
- nucleotide sequence of rd1-orf9b from M. tuberculosis H37Rv is set forth in SEQ ID NO: 69.
- the deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 70.
- the DNA sequence rd1-orf2 (SEQ ID NO: 71) contained an open reading frame starting with an ATG codon at position 889-891 and ending with a termination codon (TAA) at position 2662-2664 (position numbers referring to the location in RD1).
- the deduced amino acid sequence (SEQ ID NO: 72) contains 591 residues corresponding to a molecular weight of 64,525.
- the DNA sequence rd1-orf3 (SEQ ID NO: 87) contained an open reading frame starting with an ATG codon at position 2807-2809 and ending with a termination codon (TAA) at position 3101-3103 (position numbers referring to the location in RD1).
- the deduced amino acid sequence (SEQ ID NO: 88) contains 98 residues corresponding to a molecular weight of 9,799.
- the DNA sequence rd1-orf4 (SEQ ID NO: 89) contained an open reading frame starting with a GTG codon at position 4014-4012 and ending with a termination codon (TAG) at position 3597-3595 (position numbers referring to the location in RD1).
- the deduced amino acid sequence (SEQ ID NO: 90) contains 139 residues corresponding to a molecular weight of 14,210.
- the DNA sequence rd1-orf5 (SEQ ID NO: 91) contained an open reading frame starting with a GTG codon at position 3128-3130 and ending with a termination codon (TGA) at position 4241-4243 (position numbers referring to the location in RD1).
- the deduced amino acid sequence (SEQ ID NO: 92) contains 371 residues corresponding to a molecular weight of 37,647.
- the DNA sequence rd1-orf8 (SEQ ID NO: 67) contained an open reading frame starting with a GTG codon at position 5502-5500 and ending with a termination codon (TAG) at position 5084-5082 (position numbers referring to the location in RD1), and the deduced amino acid sequence (SEQ ID NO: 68) contains 139 residues with a molecular weight of 11,737.
- the DNA sequence rd1-orf9a (SEQ ID NO: 93) contained an open reading frame starting with a GTG codon at position 6146-6148 and ending with a termination codon (TAA) at position 7070-7072 (position numbers referring to the location in RD1).
- the deduced amino acid sequence (SEQ ID NO: 94) contains 308 residues corresponding to a molecular weight of 33,453.
- the DNA sequence rd1-orf9b (SEQ ID NO: 69) contained an open reading frame starting with an ATG codon at position 5072-5074 and ending with a termination codon (TAA) at position 7070-7072 (position numbers referring to the location in RD1).
- the deduced amino acid sequence (SEQ ID NO: 70) contains 666 residues corresponding to a molecular weight of 70,650.
- ORF's rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b were PCR cloned in the pMST24 (Theisen et al., 1995) (rd1-orf3) or the pQE32 (QIAGEN) (rd1-orf2, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b) expression vector.
- oligonucleotides Preparation of oligonucleotides and PCR amplification of the rd1-orf encoding genes, was carried out as described in example 2. Chromosomal DNA from M. tuberculosis H37Rv was used as template in the PCR reactions. Oligonucleotides were synthesized on the basis of the nucleotide sequence from the RD1 region (Accession no. U34848). The oligonucleotide primers were engineered to include an restriction enzyme site at the 5′ end and at the 3′ end by which a later subcloning was possible. Primers are listed in TABLE 2.
- rd1-orf2 A BamHI site was engineered immediately 5′ of the first codon of rd1-orf2, and a HindIII site was incorporated right after the stop codon at the 3′ end.
- the gene rd1-orf2 was subcloned in pQE32, giving pTO96.
- rd1-orf3 A SmaI site was engineered immediately 5′ of the first codon of rd1-orf3, and a NcoI site was incorporated right after the stop codon at the 3′ end.
- the gene rd1-orf3 was subcloned in pMST24, giving pTO87.
- rd1-orf4 A BamHI site was engineered immediately 5′ of the first codon of rd1-orf4, and a HindIII site was incorporated right after the stop codon at the 3′ end.
- the gene rd1-orf4 was subcloned in pQE32, giving pTO89.
- rd1-orf5 A BamHI site was engineered immediately 5′ of the first codon of rd1-orf5, and a HindIII site was incorporated right after the stop codon at the 3′ end.
- the gene rd1-orf5 was subcloned in pQE32, giving pTO88.
- rd1-orf8 A BamHI site was engineered immediately 5′ of the first codon of rd1-orf8, and a NcoI site was incorporated right after the stop codon at the 3′ end.
- the gene rd1-orf8 was subcloned in pMST24, giving pTO98.
- rd1-orf9a A BamHI site was engineered immediately 5′ of the first codon of rd1-orf9a, and a HindIII site was incorporated right after the stop codon at the 3′ end.
- the gene rd1-orf9a was subcloned in pQE32, giving pTO91.
- rd1-orf9b A ScaI site was engineered immediately 5′ of the first codon of rd1-orf9b, and a Hind III site was incorporated right after the stop codon at the 3′ end.
- the gene rdlorf9b was subcloned in pQE32, giving pTO90.
- PCR fragments were digested with the suitable restriction enzymes, purified from an agarose gel and cloned into either pMST24 or pQE-32.
- the seven constructs were used to transform the E. coli XL1-Blue. Endpoints of the gene fusions were determined by the dideoxy chain termination method. Both strands of the DNA were sequenced.
- the rRD1-ORFs were fused N-terminally to the (His) 6 -tag.
- Recombinant antigen was prepared as described in example 2 (with the exception that pTO91 was expressed at 300C and not at 37° C.), using a single colony of E. coli harbouring either the pTO87, pTO88, pTO89, pTO90, pTO91, pTO96 or pTO98 for inoculation. Purification of recombinant antigen by Ni 2+ affinity chromatography was also carried out as described in example 2.
- His-rRD1-ORF2 His-rRD1-ORF3 His-rRD1-ORF4, His-rRD1-ORF5, His-rRD1-ORF8, His-rRD1-ORF9a or His-rRD1-ORF9b were pooled.
- the His-rRD1-ORF's were extensively dialysed against 10 mM Tris/HCl, pH 8.5, 3 M urea followed by an additional purification step performed on an anion exchange column (Mono Q) using fast protein liquid chromatography (FPLC) (Pharmacia, Uppsala, Sweden).
- nucleotide sequences of rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b from M. tuberculosis H37Rv are set forth in SEQ ID NO: 71, 87, 89, 91, 67, 93, and 69, respectively.
- deduced amino acid sequences of rd1-orf2, rd1-orf3, rd1-orf4 rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b are set forth in SEQ ID NO: 72, 88, 90, 92, 68, 94, and 70, respectively.
- ST-CF was precipitated with ammonium sulphate at 80% saturation.
- the precipitated proteins were removed by centrifugation and after resuspension washed with 8 M urea.
- CHAPS and glycerol were added to a final concentration of 0.5% (w/v) and 5% (v/v) respectively and the protein solution was applied to a Rotofor isoelectrical Cell (BioRad).
- the Rotofor Cell had been equilibrated with an 8 M urea buffer containing 0.5% (w/v) CHAPS, 5% (v/v) glycerol, 3% (v/v) Biolyt 3/5 and 1% (v/v) Biolyt 4/6 (BioRad).
- Isoelectric focusing was performed in a pH gradient from 3-6. The fractions were analyzed on silver-stained 10-20% SDS-PAGE. Fractions with similar band patterns were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1-3 ml. An equal volume of SDS containing sample buffer was added and the protein solution boiled for 5 min before further separation on a Prep Cell (BioRad) in a matrix of 16% polyacrylamide under an electrical gradient. Fractions containing pure proteins with an molecular mass from 17-30 kDa were collected.
- Anti-CFP29 reacting with CFP29 was generated by immunization of BALB/c mice with crushed gel pieces in RIBI adjuvant (first and second immunization) or aluminium hydroxide (third immunization and boosting) with two week intervals. SDS-PAGE gel pieces containing 2-5 ⁇ g of CFP29 were used for each immunization. Mice were boosted with antigen 3 days before removal of the spleen. Generation of a monoclonal cell line producing antibodies against CFP29 was obtained essentially as described by Köhler and Milstein (1975). Screening of supernatants from growing clones was carried out by immunoblotting of nitrocellulose strips containing ST-CF separated by SDS-PAGE. Each strip contained approximately 50 ⁇ g of ST-CF. The antibody class of anti-CFP29 was identified as IgM by the mouse monoclonal antibody isotyping kit, RPN29 (Amersham) according to the manufacturer's instructions.
- CFP29 was purified by the following method: ST-CF was concentrated 10 fold by ultrafiltration, and ammonium sulphate precipitation in the 45 to 55% saturation range was performed. The pellet was redissolved in 50 mM sodium phosphate, 1.5 M ammonium sulphate, pH 8.5, and subjected to thiophilic adsorption chromatography (Porath et al., 1985) on an Affi-T gel column (Kem-En-Tec).
- CFP17, CFP20, CFP21, CFP22, CFP25, and CFP28 were washed with water on a Centricon concentrator (Amicon) with cutoff at 10 kDa and then applied to a ProSpin concentrator (Applied Biosystems) where the proteins were collected on a PVDF membrane. The membrane was washed 5 times with 20% methanol before sequencing on a Procise sequencer (Applied Biosystems).
- CFP29 containing fractions were blotted to PVDF membrane after tricine SDS-PAGE (Ploug et al., 1989). The relevant bands were excised and subjected to amino acid analysis (Barkholt and Jensen, 1989) and N-terminal sequence analysis on a Procise sequencer (Applied Biosystems).
- X denotes an amino acid which could not be determined by the sequencing method used, whereas a “/” between two amino acids denotes that the sequencing method could not determine which of the two amino acids is the one actually present.
- the N-terminal sequence of CFP29 was used for a homology search in the EMBL database using the TFASTA program of the Genetics Computer Group sequence analysis software package.
- the search identified a protein, Linocin M18, from Brevibacterium linens that shares 74% identity with the 19 N-terminal amino acids of CFP29.
- the fragment was excised from a 1% agarose gel, purified by Spin-X spinn columns (Costar), cloned into pBluescript SK II+-T vector (Stratagene) and finally sequenced with the Sequenase kit from United States Biochemical.
- This program identified a Mycobacterium tuberculosis sequence on cosmid cy444 in the database that is nearly 100% identical to the 150 bp sequence of the CFP29 protein.
- the sequence is contained within a 795 bp open reading frame of which the 5′ end translates into a sequence that is 100% identical to the N-terminally sequenced 19 amino acids of the purified CFP29 protein.
- CFP17, CFP20, CFP21, CFP22, CFP25, and CFP28 the N-terminal amino acid sequence from each of the proteins were used for a homology search using the blast program of the Sanger Mycobacterium tuberculosis database:
- CFP29 the first 150 bp of the DNA sequence was used for the search. Furthermore, the EMBL database was searched for proteins with homology to CFP29.
- the translated protein is 162 amino acids long.
- the N-terminal of the protein purified from culture filtrate starts at amino acid 31 in agreement with the presence of a signal sequence that has been cleaved off. This gives a length of the mature protein of 132 amino acids, which corresponds to a theoretical molecular mass of 13833 Da and a theoretical pI of 4.4.
- the observed mass in SDS-PAGE is 17 kDa.
- the translated protein is 217 amino acids long.
- the N-terminally determined sequence from the protein purified from culture filtrate starts at amino acid 33 in agreement with the presence of a signal sequence that has been cleaved off. This gives a length of the mature protein of 185 amino acids, which corresponds to a theoretical molecular weigh at 18657 Da, and a theoretical pI at 4,6.
- the observed weight in a SDS-PAGE is 21 kDa.
- the translated protein has 90,1% identity with E235739; a peptidyl-prolyl cis-trans isomerase.
- N-terminally determined sequence from the protein purified from culture filtrate begins at amino acid 33 in agreement with the presence of a signal sequence that has been cleaved off. This gives a length of the mature protein of 187 amino acids, which corresponds to a theoretical molecular weigh at 19665 Da, and a theoretical pI at 4.9. The observed weight in a SDS-PAGE is 25 kDa.
- EMBL database searching The open reading frame encodes a 265 amino acid protein that is 58% identical and 74% similar to the Linocin M18 protein (61% identity on DNA level). This is a 28.6 kDa protein with bacteriocin activity (Valdes-Stauber and Scherer, 1994; Valdés-Stauber and Scherer, 1996). The two proteins have the same length (except for 1 amino acid) and share the same theoretical physicochemical properties. We therefore suggest that CFP29 is a mycobacterial homolog to the Brevibacterium linens Linocin M18 protein.
- CFP17 (SEQ ID NO: 6): 1 MTDMNPDIEK DQTSDEVTVE TTSVFRADFL SELDAPAQAG TESAVSGVEG 51 LPPGSALLVV KRGPNAGSRF LLDQAITSAG RHPDSDIFLD DVTVSRRHAE 101 FRLENNEFNV VDVGSLNGTY VNREPVDSAV LANGDEVQIG KFRLVFLTGP 151 KQGEDDGSTG GP
- CFP20 (SEQ ID NO: 8): 1 MAQITLRGNA INTVGELPAV GSPAPAFTLT GGDLGVISSD QFRGKSVLLN 51 IFPSVDTPVC ATSVRTFDER AAASGATVLC VSKDLPFAQK RFCGAEGTEN 101 VMPASAFRDS FGEDYGVTIA DGPMAGLLAR AIVVIGADGN VAYTELV
- PCR reactions contained 10 ng of M. tuberculosis chromosomal DNA in 1 ⁇ low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 ⁇ l reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles according to the following program; 94° C. for 10 sec., 55° C. for 10 sec. and 72° C. for 90 sec, using thermocycler equipment from Idaho Technology.
- DNA fragments were subsequently run on lt agarose gels, the bands were excised and purified by Spin-X spin columns (Costar) and cloned into pBluescript SK II+-T vector (Stratagene). Plasmid DNA was thereafter prepared from clones harbouring the desired fragments, digested with suitable restriction enzymes and subcloned into the expression vector pMCT6 in frame with 8 histidine residues which are added to the N-terminal of the expressed proteins.
- the resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- CFP17 Primers used for cloning of cfp17: OPBR-51: ACAGATCTGTGACGGACATGAACCCG (SEQ ID NO: 117) OPBR-52: TTTTCCATGGTCACGGGCCCCCGGTACT (SEQ ID NO: 118)
- OPBR-51 and OPBR-52 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP20 Primers used for cloning of cfp20: OPBR-53: ACAGATCTGTGCCCATGGCACAGATA (SEQ ID NO: 119) OPBR-54: TTTAAGCTTCTAGGCGCCCAGCGCGGC (SEQ ID NO: 120)
- OPBR-53 and OPBR-54 create BglII and HinDIII sites, respectively, used for the cloning in pMCT6.
- CFP21 Primers used for cloning of cfp21: OPBR-55: ACAGATCTGCGCATGCGGATCCGTGT (SEQ ID NO: 121) OPBR-56: TTTTCCATGGTCATCCGGCGTGATCGAG (SEQ ID NO: 122)
- OPBR-55 and OPBR-56 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP22 Primers used for cloning of cfp22: OPBR-57: ACAGATCTGTAATGGCAGACTGTGAT (SEQ ID NO: 123) OPBR-58: TTTTCCATGGTCAGGAGATGGTGATCGA (SEQ ID NO: 124)
- OPBR-57 and OPBR-58 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP25 Primers used for cloning of cfp25: OPBR-59: ACAGATCTGCCGGCTACCCCGGTGCC (SEQ ID NO: 125) OPBR-60: TTTTCCATGGCTATTGCAGCTTTCCGGC (SEQ ID NO: 126)
- OPBR-59 and OPBR-60 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- the dialysed protein was further purified by FPLC (Pharmacia, Sweden) using a 6 ml Resource-Q column, eluted with a linear 0-1 M gradient of NaCl. Fractions were analyzed by SDS-PAGE and protein concentrations were estimated at OD 280 . Fractions containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5.
- ST-CF was precipitated with ammonium sulphate at 80% saturation.
- the precipitated proteins were removed by centrifugation and after resuspension washed with 8 M urea.
- CHAPS and glycerol were added to a final concentration of 0.5% (w/v) and 5% (v/v) respectively and the protein solution was applied to a Rotofor isoelectrical Cell (BioRad).
- the Rotofor Cell had been equilibrated with a 8M urea buffer containing 0.5% (w/v) CHAPS, 5% (v/v) glycerol, 3% (v/v) Biolyt 3/5 and 1% (v/v) Biolyt 4/6 (BioRad).
- Isoelectric focusing was performed in a pH gradient from 3-6.
- the frac-tions were analyzed on silver-stained 10-20% SDS-PAGE.
- Fractions with similar band patterns were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1-3 ml.
- An equal volume of SDS containing sample buffer was added and the protein solution boiled for 5 min before further separation on a Prep Cell (BioRad) in a matrix of 16% polyacrylamide under an electrical gradient.
- Fractions containing well separated bands in SDS-PAGE were selected for N-terminal sequencing after transfer to PVDF membrane.
- ST-CF was precipitated with ammonium sulphate at 80% saturation and redissolved in PBS, pH 7.4, and dialysed 3 times against 25 mM Piperazin-HCl, pH 5.5, and subjected to chromatofocusing on a matrix of PBE 94 (Pharmacia) in a column connected to an FPLC system (Pharmacia).
- the column was equilibrated with 25 mM Piperazin-HCl, pH 5.5, and the elution was performed with 10% PB74-HCl, pH 4.0 (Pharmacia).
- ST-CF was concentrated approximately 10 fold by ultrafiltration and proteins were precipitated at 80% saturation, redissolved in PBS, pH 7.4, and dialysed 3 times against PBS, pH 7.4.
- 5.1 ml of the dialysed ST-CF was treated with RNase (0.2 mg/ml, QUIAGEN) and DNase (0.2 mg/ml, Boehringer Mannheim) for 6 h and placed on top of 6.4 ml of 48% (w/v) sucrose in PBS, pH 7.4, in Sorvall tubes (Ultracrimp 03987, DuPont Medical Products) and ultracentrifuged for 20 h at 257,300 ⁇ g max , 10° C. The pellet was redissolved in 200 ⁇ l of 25 mM Tris-192 mM glycine, 0.1% SDS, pH 8.3.
- CFP27 CFP30A and CFP50 ST-CF was concentrated approximately 10 fold by ultrafiltration and ammonium sulphate precipitation in the 45 to 55% saturation range was performed. Proteins were redissolved in 50 mM sodium phosphate, 1.5 M ammonium sulphate, pH 8.5, and subjected to thiophilic adsorption chromatography on an Affi-T gel column (Kem-EnTec). Proteins were eluted by a 1.5 to 0 M decreasing gradient of ammonium sulphate. Fractions with similar band patterns in SDS-PAGE were pooled and anion exchange chromatography was performed on a Mono Q HR 5/5 column connected to an FPLC system (Pharmacia). The column was equilibrated with 10 mM Tris-HCl, pH 8.5, and the elution was performed with a gradient of NaCl from 0 to 1 M. Fractions containing well separated bands in SDS-PAGE were selected.
- CFP7A and CFP25A were obtained as described above except for the following modification: ST-CF was concentrated approximately 10 fold by ultrafiltration and proteins were precipitated at 80% saturation, redissolved in PBS, pH 7.4, and dialysed 3 times against PBS, pH 7.4. Ammonium sulphate was added to a concentration of 1.5 M, and ST-CF proteins were loaded on an Affi T-gel column. Elution from the Affi T-gel column and anion exchange were performed as described above.
- CFP7A AEDVRAEIVA SVLEVVVNEG DQIDKGDVVV LLESMYMEIP (SEQ ID NO: 81) VLAEAAGTVS CFP8A: DPVDDAFIAKLNTAG (SEQ ID NO: 73) CFP8B: DPVDAIINLDNYGX (SEQ ID NO: 74) CFP16: AKLSTDELLDAFKEM (SEQ ID NO: 79) CFP19: TTSPDPYAALPKLPS (SEQ ID NO: 82) CFP19B: DPAXAPDVPTAAQLT (SEQ ID NO: 80) CFP22A: TEYEGPKTKF HALMQ (SEQ ID NO: 83) CFP23A: VIQ/AGMVT/GHIHXVAG (SEQ ID NO: 76) CFP23B: AEMKXFKNAIVQEID (SEQ ID NO: 75) CFP25
- CFP8A A sequence 80% identical to the 15 N-terminal amino acids was found on contig TB — 1884. The N-terminally determined sequence from the protein purified from culture filtrate starts at amino acid 32. This gives a length of the mature protein of 98 amino acids corresponding to a theoretical MW of 9700 Da and a pI of 3.72 This is in good agreement with the observed MW on SDS-PAGE at approximately 8 kDa. The full length protein has a theoretical MW of 12989 Da and a pI of 4.38.
- CFP8B A sequence 71% identical to the 14 N-terminal amino acids was found on contig TB — 653. However, careful re-evaluation of the original N-terminal sequence data confirmed the identification of the protein.
- CFP16 The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid MTCY20H.
- CFP19 The 15 aa N-terminal sequence was found to be 100t identical to a sequence found on cosmid MTCY270.
- CFP22A The 15 aa N-terminal sequence was found to be 100t identical to a sequence found on cosmid MTCY1A6.
- CFP25A The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on contig 255.
- CFP27 The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid MTCY261.
- CFP30A Of the 13 determined amino acids in CFP30A, a 100% identical sequence was found on cosmid MTCY261.
- CWP32 The 15 amino acid N-terminal sequence was found to be 100% identical to a sequence found on contig 281. The identity was found within an open reading frame of 266 amino acids length, corresponding to a theoretical MW of CWP32 of 28083 Da and a pI of 4.563. The observed molecular weight in an SDS-PAGE gel is 32 kDa.
- CFP50 The 15 aa N-terminal sequence was found to be 100% identical to a sequence found in MTV038.06. The identity is found within an open reading frame of 464 amino acids length corresponding to a theoretical MW of CFP50 of 49244 Da and a pI of 5.66. The observed molecular weight in an SDS-PAGE gel is 50 kDa.
- the first reading frame encodes a 254 amino acid protein of which the first 26 aa constitute a putative leader peptide that strongly indicates an extracellular location of the protein.
- the mature protein is thus 228 aa in length corresponding to a theoretical MW of 23149.0 Da and a Pi of 5.80.
- the protein is named CFP23.
- the second reading frame encodes an 231 aa protein of which the first 44 aa constitute a putative leader peptide that strongly indicates an extracellular location of the protein.
- the mature protein is thus 187 aa in length corresponding to a theoretical MW of 19020.3 Da and a Pi of 7.03.
- the protein is named CFP19A.
- amino acid sequences derived from the translated genes of the individual antigens were used for homology searching in the EMBL and Genbank databases using the TFASTA algorithm, in order to find homologous proteins and to address eventual functional roles of the antigens.
- CFP7A has 44% identity and 70% similarity to hypothetical Methanococcus jannaschii protein ( M. jannaschii from base 1162199-1175341), as well as 43% and 38t identity and 68 and 64% similarity to the C-terminal part of B. stearothermophilus pyruvate carboxylase and Streptococcus mutans biotin carboxyl carrier protein.
- CFP7A contains a consensus sequence EAMKM for a biotin binding site motif which in this case was slightly modified (ESMKM in amino acid residues 34 to 38).
- ESMKM alkaline phosphatase conjugated streptavidin after SDS-PAGE and transfer to nitrocellulose it was demonstrated that native CFP7A was biotinylated.
- CFP16 RplL gene, 130 aa. Identical to the M. bovis 50s ribosomal protein L7/L12 (acc. No P37381).
- CFP19 has 47% identity and 55% similarity to E. coli pectinesterase homolog (ybhc gene) in a 150 aa overlap.
- CFP19A has between 38% and 45% identity to several cutinases from different fungal sp.
- CFP19A has 46t identity and 61% similarity to CFP25 as well as 50% identity and 64% similarity to CFP21 (both proteins are earlier isolated from the ST-CF).
- CFP22A No apparent homology
- CFP23 CFP23 has between 38% and 46% identity to several cutinases from different fungal sp.
- CFP23 has 46% identity and 61% similarity to CFP25 as well as 50t identity and 63% similarity to CFP21 (both proteins are earlier isolated from the ST-CF).
- CFP25A has 95% identity in a 241 aa overlap to a putative M. tuberculosis thymidylate synthase (450 aa accession No p28176).
- CFP27 has 81% identity to a hypothetical M. leprae protein and 64% identity and 78% similarity to Rhodococcus sp. proteasome beta-type subunit 2 (prcB(2) gene).
- CFP30A has 67t identity to Rhodococcus proteasome alfa-type 1 subunit.
- CWP32 The CWP32 N-terminal sequence is 100% identical to the Mycobacterium leprae sequence MLCB637.03.
- CFP50 The CFP50 N-terminal sequence is 100t identical to a putative lipoamide dehydrogenase from M. leprae (Accession 415183)
- PCR reactions contained 10 ng of M. tuberculosis chromosomal DNA in 1 ⁇ low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 ml reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 10 sec., 55° C. for 10 sec. and 72° C. for 90 sec, using thermocycler equipment from Idaho Technology.
- Plasmid DNA was hereafter prepared from clones harbouring the desired fragments, digested with suitable restriction enzymes and subcloned into the expression vector pMCT6 in frame with 8 histidines which are added to the N-terminal of the expressed proteins.
- the resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- CFP7A Primers used for cloning of cfp7A: OPBR-79: AAGAGTAGATCTATGATGGCCGAGGATGTTCGCG (SEQ ID NO: 95) OPBR-80: CGGCGACGACGGATCCTACCGCGTCGG (SEQ ID NO: 96)
- OPBR-79 and OPBR-80 create BglII and BamHI sites, respectively, used for the cloning in pMCT6.
- CFP8A Primers used for cloning of cfp8A: (SEQ ID NO: 154)
- CFP8A-F CTGAGATCTATGAACCTACGGCGCC (SEQ ID NO: 155)
- CFP8A-R CTCCCATGGTACCCTAGGACCCGGGCAGCCCCGGC
- CFP8A-F and CFP8A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP8B Primers used for cloning of cfp8B: CFP8B-F: CTGAGATCTATGAGGCTGTCGTTGACCGC (SEQ ID NO: 156) CFP8B-R: CTCCCCGGGCTTAATAGTTGTTGCAGGAGC (SEQ ID NO: 157)
- CFP8B-F and CFP8B-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP16 Primers used for cloning of cfp16: OPBR-104: CCGGGAGATCTATGGCAAAGCTCTCCACCGACG (SEQ ID NOs: 111 and 130) OPBR-105: CGCTGGGCAGAGCTACTTGACGGTGACGGTGG (SEQ ID NOS: 112 and 131)
- OPBR-104 and OPBR-105 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP19 Primers used for cloning of cfp19: OPBR-96: GAGGAAGATCTATGACAACTTCACCCGACCCG (SEQ ID NO: 107) OPBR-97. CATGAAGCCATGGCCCGCAGGCTGCATG (SEQ ID NO: 108)
- OPBR-96 and OPBR-97 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP19A Primers used for cloning of cfp19A: OPBR-88: CCCCCCAGATCTGCACCACCGGCATCGGCGGGC (SEQ ID NO: 99) OPBR-89. GCGGCGGATCCGTTGCTTAGCCGG (SEQ ID NO: 100)
- OPBR-88 and OPBR-89 create BglII and BamHI sites, respectively, used for the cloning in pMCT6.
- CFP22A Primers used for cloning of cfp22A: OPBR-90: CCGGCTGAGATCTATGACAGAATACGAAGGGC (SEQ ID NO:101) OPBR-91: CCCCGCCAGGGAACTAGAGGCGGC (SEQ ID NO:102)
- OPBR-90 and OPBR-91 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP23 Primers used for cloning of cfp23: OPBR- CCTTGGGAGATCTTTGGACCCCGGTTGC (SEQ ID NO:97) 86: OPBR- GACGAGATCTTATGGGCTTACTGAC (SEQ ID NO:98) 87:
- OPBR-86 and OPBR-87 both create a BglII site used for the cloning in pMCT6.
- CFP25A Primers used for cloning of cfp25A: OPBR-106: (SEQ ID NO:113) GGCCCAGATCTATGGCCATTGAGGTTTCGGTGTTGC OPBR-107: (SEQ ID NO:114) CGCCGTGTTGCATGGCAGCGCTGAGC
- OPBR-106 and OPBR-107 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP27 Primers used for cloning of cfp27: OPBR-92: (SEQ ID NO:103) CTGCCGAGATCTACCACCATTGTCGCGCTGAAATACCC OPBR-93: (SEQ ID NO:104) CGCCATGGCCTTACGCGCCAACTCG
- OPBR-92 and OPBR-93 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP30A Primers used for cloning of cfp30A: OPBR-94: GGCGGAGATCTGTGAGTTTTCCGTATTTCATC (SEQ ID NO:105) OPBR-95: CGCGTCGAGCCATGGTTAGGCGCAG (SEQ ID NO:106)
- OPBR-94 and OPBR-95 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CWP32 Primers used for cloning of cwp32: CWP32-F: GCTTAGATCTATGATTTTCTGGGCAACCAGGTA (SEQ ID NO:158) CW232-R: GCTTCCATGGGCGAGGCACAGGCGTGGGAA (SEQ ID NO:159)
- CWP32 F and CWP32—R create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP50 Primers used for cloning of cfp50: OPBR-100: GGCCGAGATCTGTGACCCACTATGACGTCGTCG (SEQ ID NO:109) OPBR-101: GGCGCCCATGGTCAGAAATTGATCATGTGGCCAA (SEQ ID NO:110)
- OPBR-100 and OPBR-101 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- ST-CF was precipitated with ammonium sulphate at 80% saturation and redissolved in PBS, pH 7.4, and dialyzed 3 times against 25 mM Piperazin-HCl, pH 5.5, and subjected to cromatofocusing on a matrix of PBE 94 (Pharmacia) in a column connected to an FPLC system (Pharmacia).
- the column was equilibrated with 25 mM Piperazin-HCl, pH 5.5, and the elution was performed with 10% PB74-HCl, pH 4.0 (Pharmacia).
- ST-CF was precipitated with ammonium sulphate at 80% saturation.
- the precipitated proteins were removed by centrifugation and after resuspension washed with 8 M urea.
- CHAPS and glycerol were added to a final concentration of 0.5% (w/v) and 5% (v/v) respectively and the protein solution was applied to a Rotofor isoelectrical Cell (BioRad).
- the Rotofor Cell had been equilibrated with an 8M urea buffer containing 0.5% (w/v) CHAPS, 5% (v/v) glycerol, 3% (v/v) Biolyt 3/5 and 1% (v/v) Biolyt 4/6 (BioRad).
- Isoelectric focusing was performed in a pH gradient from 3-6.
- the fractions were analyzed on silver-stained 10-20% SDS-PAGE.
- the fractions in the pH gradient 5.5 to 6 were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1 ml.
- 300 mg of the protein preparation was separated on a 10-20% Tricine SDS-PAGE (Ploug et al 1989) and transferred to a PVDF membrane and Coomassie stained. The lowest band occurring on the membrane was excised and submitted for N-terminal sequencing.
- ST-CF was concentrated approximately 10-fold by ultrafiltration and ammonium sulphate precipitation at 80% saturation. Proteins were redissolved in 50 mM sodium phosphate, 1.5 M ammonium sulphate, pH 8.5, and subjected to thiophilic adsorption chromatography on an Affi-T gel column (Kem-En-Tec). Proteins were eluted by a 1.5 to 0 M decreasing gradient of ammonium sulphate. Fractions with similar band patterns in SDS-PAGE were pooled and anion exchange chromatography was performed on a Mono Q HR 5/5 column connected to an FPLC system (Pharmacia). The column was equilibrated with 10 mM Tris-HCl, pH 8.5, and the elution was performed with a gradient of NaCl from 0 to 1 M. Fractions containing well separated bands in SDS-PAGE were selected.
- N-terminal amino acid sequence analysis was performed on a Procise 494 sequencer (applied Biosystems).
- CFP7B PQGTVKWFNAEKGFG (SEQ ID NO:168)
- CFP10A NVTVSIPTILRPXXX (SEQ ID NO:169)
- CFP11 TRFMTDPHANRDMAG (SEQ ID NO:170)
- CFP30B PKRSEYRQGTPNWVD (SEQ ID NO:171)
- X denotes an amino acid which could not be determined by the sequencing method used.
- CFP10A For CFP10A a sequence 100% identical to 12 N-terminal amino acids was found on contig TB — 752 and on locus CY130.20 with EMBL accession number: Q10646 and Z73902. The identity was found within an open reading frame of 93 amino acids length corresponding to a theoretical MW of 9557 Da and a pI of 4.78.
- PCR reactions contained 10 ng of M. tuberculosis chromosomal DNA in 1 ⁇ low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 ml reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 10 sec., 55° C. for 10 sec. and 72° C. for 90 sec., using thermocycler equipment from Idaho Technology.
- Plasmid DNA was hereafter prepared from clones harbouring the desired fragments, digested with suitable restriction enzymes and subcloned into the expression vector pMCT6 in frame with 8 histidines which are added to the N-terminal of the expressed proteins.
- the resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- CFP7B Primers used for cloning of cfp7B: CFP7B-F: CTGAGATCTAGAATGCCACAGGGAACTGTG (SEQ ID NO:160) CFP7B-R: TCTCCCGGGGGTAACTCAGAGCGAGCGGAC (SEQ ID NO:161)
- CFP7B-F and CFP7B-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP10A Primers used for cloning of cfp10A: CFP10A-F: CTGAGATCTATGAACGTCACCGTATCC (SEQ ID NO: 162) CFP10A-R: TCTCCCGGGGCTCACCCACCGGCCACG (SEQ ID NO: 163)
- CFP10A-F and CFP10A-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP11 Primers used for cloning of cfp11: CFP11-F: CTGAGATCTATGGCAACACGTTTTATGACG (SEQ ID NO: 164) CFP11-R: CTCCCCGGGTTAGCTGCTGAGGATCTGCTH (SEQ ID NO: 165)
- CFP11-F and CFP11-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP30B Primers used for cloning of cfp30B: CFP30B-F: CTGAAGATCTATGCCCAAGAGAAGCGAATAC (SEQ ID NO: 166) CFP30B-R: CGGCAGCTGCTAGCATTCTCCGAATCTGCCG (SEQ ID NO: 167)
- CFP30B-F and CFP30B-R create BglII and PvuII sites, respectively, used for the cloning in pMCT6.
- Oligonucleotide primers were synthesized automatically on a DNA synthesizer (Applied Biosystems, Forster City, Calif., ABI-391, PCR-mode) deblocked and purified by ethanol precipitation.
- oligonucleotides were synthesized (TABLE 3) on the basis of the nucleotide sequence from mpb51 described by Ohara et al. (1995).
- the oligonucleotides were engineered to include an EcoRI restriction enzyme site at the 5′ end and at the 3′ end by which a later subcloning was possible.
- nucleotide sequence of the cloned 952 bp M. tuberculosis H37Rv PCR fragment, pTO52, containing the Shine Dalgarno sequence, the signal peptide sequence and the structural gene of MPT51, and the nucleotide sequence of the cloned 815 bp PCR fragment containing the structural gene of MPT51, pTO53, were determined by the dideoxy chain termination method adapted for supercoiled DNA by use of the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., Cleveland, Ohio) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- the nucleotide sequences of pTO52 and pTO53 and the deduced amino acid sequence are shown in FIG. 5.
- the DNA sequence contained an open reading frame starting with a ATG codon at position 45-47 and ending with a termination codon (TAA) at position 942-944.
- TAA termination codon
- the nucleotide sequence of mpt51 differed with one nucleotide compared to the nucleotide sequence of MPB51 described by Ohara et al. (1995) (FIG. 5).
- mpt51 at position 780 was found a substitution of a guanine to an adenine. From the deduced amino acid sequence this change occurs at a first position of the codon giving a amino acid change from alanine to threonine.
- mpt51 consists of 801 bp and that the deduced amino acid sequence contains 266 residues with a molecular weight of 27,842, and MPT51 show 99,8% identity to MPB51.
- DNA of the recombinant plasmid pTO53 was cleaved at the EcoRI sites.
- the 815 bp fragment was purified from an agarose gel and subcloned into the EcoRI site of the pMAL-cR1 expression vector (New England Biolabs), pTO54.
- Vector DNA containing the gene fusion was used to transform the E. coli XL1-Blue by the standard procedures for DNA manipulation.
- Recombinant antigen was prepared in accordance with instructions provided by New England Biolabs. Briefly, single colonies of E. coli harbouring the pTO54 plasmid were inoculated into Luria-Bertani broth containing 50 ⁇ g/ml ampicillin and 12.5 ⁇ g/ml tetracycline and grown at 37° C. to 2 ⁇ 10 8 cells/ml. Isopropyl- ⁇ -D-thiogalactoside (IPTG) was then added to a final concentration of 0.3 mM and growth was continued for further 2 hours. The pelleted bacteria were stored overnight at ⁇ 20° C.
- IPTG Isopropyl- ⁇ -D-thiogalactoside
- MBP-rMPT51 maltose binding protein-MPT51fusion protein
- the fusion protein was eluted with 10 mM maltose. Aliquots of the fractions were analyzed on 10% SDS-PAGE. Fractions containing the fusion protein of interest were pooled and was dialysed extensively against physiological saline.
- a PCR fragment was produced from pTO52 using the primer combination MPT51-F and MPT51-R (TABLE 4).
- a BamHI site was engineered immediately 5′ of the first codon of mpt51 so that only the coding region of the gene encoding MPT51 would be expressed, and an NcoI site was incorporated right after the stop codon at the 3′ end.
- the PCR product was cleaved at the BamHI and the NcoI site.
- the 811 bp fragment was purified from an agarose gel and subcloned into the BamHI and the NcoI site of the pMST24 expression vector, pT086.
- Vector DNA containing the gene fusion was used to transform the E. coli XL1-Blue by the standard procedures for DNA manipulation.
- nucleotide sequence of complete gene fusion was determined by the dideoxy chain termination method as described under section DNA sequencing. Both strands of the DNA were sequenced.
- Recombinant antigen was prepared from single colonies of E. coli harbouring the pTO86 plasmid inoculated into LuriaBertani broth containing 50 ⁇ g/ml ampicillin and 12.5 ⁇ g/ml tetracycline and grown at 37° C. to 2 ⁇ 10 8 cells/ml. Isopropyl- ⁇ -D-thiogalactoside (IPTG) was then added to a final concentration of 1 mM and growth was continued for further 2 hours. The pelleted bacteria were resuspended in BC 100/20 buffer (100 mM KCl, 20 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol).
- the fusion protein was eluted with BC 100/40 buffer (100 mM KCl, 40 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol) with 8 M urea and BC 1000/40 buffer (1000 mM KCl, 40 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol) with 8 M urea.
- BC 100/40 buffer 100 mM KCl, 40 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol
- BC 1000/40 buffer 1000 mM KCl, 40 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol
- His-rMPT51 was extensive dialysed against 10 mM Tris/HCl, pH 8.5, 3 M urea followed by purification using fast protein liquid chromatography (FPLC) (Pharmacia, Uppsala, Sweden), over an anion exchange column (Mono Q) using 10 mM Tris/HCl, pH 8.5, 3 M urea with a 0-1 M NaCl linear gradient. Fractions containing rMPT51 were pooled and subsequently dialysed extensively against 25 mM Hepes, pH 8.0 before use.
- FPLC fast protein liquid chromatography
- Mono Q anion exchange column
- Protein concentration was determined by the BCA method supplied by Pierce (Pierce Chemical Company, Rockford, Ill.).
- the lipopolysaccharide (LPS) content was determined by the limulus amoebocyte lysate test (LAL) to be less than 0.004 ng/ ⁇ g rMPT51, and this concentration had no influence on cellular activity.
- LAL limulus amoebocyte lysate test
- ESAT-6 is a an immunogen which is relatively difficult to adjuvate in order to obtain consistent results when immunizing therewith.
- To detect an in vitro recognition of ESAT-6 after immunization with the antigen is very difficult compared to the strong recognition of the antigen that has been found during the recall of memory immunity to M. tuberculosis .
- ESAT-6 has been found in ST-CF in a truncated version were amino acids 1-15 have been deleted. The deletion includes the main T-cell epitopes recognized by C57BL/6j mice (Brandt et al., 1996). This result indicates that ESAT-6 either is N-terminally processed or proteolytically degraded in ST-CF.
- ESAT-6 In order to optimize ESAT-6 as an immunogen, a gene fusion between ESAT-6 and another major T cell antigen MPT59 has been constructed. Two different construct have been made: MPT59-ESAT-6 (SEQ ID NO: 172) and ESAT-6-MPT59 (SEQ ID NO: 173). In the first hybrid ESAT-6 is N-terminally protected by MPT59 and in the latter it is expected that the fusion of two dominant T-cell antigens can have a synergistic effect.
- ESAT6 OPBR-4: GGCGCCGGCAAGCTTGCCATGACAGAGCAGCAGTGG (SEQ ID NO: 132)
- OPBR-28 CGAACTCGCCGGATCCCGTGTTTCGC (SEQ ID NO: 133)
- OPBR-4 and OPBR-28 create HinDIII and BamHI sites, respectively.
- MPT59 OPBR-48: GGCAACCGCGAGATCTTTCTCCCGGCCGGGGC (SEQ ID NO: 134)
- OPBR-3 GGCAAGCTTGCCGGCGCCTAACGAACT (SEQ ID NO: 135)
- OPBR-48 and OPBR-3 create BglII and HinDIII, respectively. Additionally OPBR-3 deletes the stop codon of MPT59.
- PCR reactions contained 10 ng of M. tuberculosis chromosomal DNA in 1 ⁇ low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 ⁇ l reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 10 sec., 550C for 10 sec. and 72° C. for 90 sec, using thermocycler equipment from Idaho Technology.
- the DNA fragments were subsequently run on 1% agarose gels, the bands were excised and purified by Spin-X spin columns (Costar).
- the two PCR fragments were digested with HinDIII and ligated.
- a PCR amplification of the ligated PCR fragments encoding MPT59-ESAT6 was carried out using the primers OPBR-48 and OPBR-28.
- PCR reaction was initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 30 sec., 55° C. for 30 sec. and 72° C. for 90 sec.
- the resulting PCR fragment was digested with BglII and BamHI and cloned into the expression vector pMCT6 in frame with 8 histidines which are added to the N-terminal of the expressed protein hybrid.
- the resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- ESAT6 OPBR-75: GGACCCAGATCTATGACAGAGCAGCAGTGG (SEQ ID NO: 136)
- OPBR-76 CCGGCAGCCCCGGCCGGGAGAAAAGCTTTGCGAACATCCCAGTGACG (SEQ ID NO: 137)
- OPBR-75 and OPBR-76 create BglII and HinDIII sites, respectively. Additionally OPBR-76 deletes the stop codon of ESAT6.
- MPT59 OPBR-77: GTTCGCAAAGCTTTTCTCCCGGCCGGGGCTGCCGGTCGAGTACC (SEQ ID NO: 138)
- OPBR-18 CCTTCGGTGGATCCCGTCAG (SEQ ID NO: 139)
- OPBR-77 and OPBR-18 create HinDIII and BamHI sites, respectively.
- the lysate was applied to a column containing 25 ml of resuspended Talon resin (Clontech, Palo Alto, USA). The column was washed and eluted as described by the manufacturers.
- the first dimension was performed in acrylamide/piperazin diacrylamide tube gels in the presence of urea, the detergent CHAPS and the reducing agent DTT at 400 V for 18 hours and 800 V for 2 hours.
- the second dimension 10-20% SDS-PAGE was performed at 100 V for 18 hours and silver stained.
- CFP7, CFP7A, CFP7B, CFP8A, CFP8B, CFP9, CFP11, CFP16, CFP17, CFP19, CFP20, CFP21, CFP22, CFP25, CFP27, CFP28, CFP29, CFP30A, CFP50, and MPT51 in the 2DE reference gel were done by comparing the spot pattern of the purified antigen with ST-CF with and without the purified antigen. By the assistance of an analytical 2DE software system (Phoretix International, UK) the spots have been identified in FIG. 6. The position of MPT51 and CFP29 were confirmed by a Western blot of the 2DE gel using the Mab's anti-CFP29 and HBT 4.
- the recombinant antigens rCFP7A, rCFP17, rCFP21, rCFP25, and rCFP29 were all recognized in at least two strains of mice at a level comparable to ST-CF.
- rMPT51 and rCFP7 were only recognized in one or two strains respectively, at a level corresponding to no more than 1 ⁇ 3 of the response detected after ST-CF stimulation.
- Neither of the antigens rCFP20 and rCFP22 were recognized by any of the four mouse strains.
- mice (Bomholtegaard, Ry) were given intravenous infections via the lateral tail vein with an inoculum of 5 ⁇ 10 4 M. tuberculosis suspended in PBS in a vol. of 0.1 ml. After 1 month of infection the mice were treated with isoniazid (Merck and Co., Rahway, N.J.) and rifabutin (Farmatalia Carlo Erba, Milano, Italy) in the drinking water, for two months. The mice were rested for 4-6 months before being used in experiments. For the study of the recall of memory immunity, animals were infected with an inoculum of 1 ⁇ 10 6 bacteria i.v.
- PBMC Human donors: PBMC were obtained from healthy BCG vaccinated donors with no known exposure to patients with TB and from patients with culture or microscopy proven infection with Mycobacterium tuberculosis . Blood samples were drawn from the TB patients 1-4 months after diagnosis.
- Lymphocyte preparations and cell culture PBMC were freshly isolated by gradient centrifugation of heparinized blood on Lymphoprep (Nycomed, Oslo, Norway). The cells were resuspended in complete medium: RPMI 1640 (Gibco, Grand Island, N.Y.) supplemented with 40 ⁇ g/ml streptomycin, 40 U/ml penicillin, and 0.04 mM/ml glutamine, (all from Gibco Laboratories, Paisley, Scotland) and 10% normal human ABO serum (NHS) from the local blood bank. The number and the viability of the cells were determined by trypan blue staining.
- Cultures were established with 2,5 ⁇ 10 5 PBMC in 200 ⁇ l in microtitre plates (Nunc, Roskilde, Denmark) and stimulated with no antigen, ST-CF, PPD (2.5 ⁇ g/ml); rCFP7, rCFP7A, rCFP17, rCFP20, rCFP21, rCFP22, rCFP25, rCFP26, rCFP29, in a final concentration of 5 ⁇ g/ml.
- Phytohaemagglutinin, 1 ⁇ g/ml (PHA, Difco laboratories, Detroit, Mich. was used as a positive control.
- Supernatants for the detection of cytokines were harvested after 5 days of culture, pooled and stored at ⁇ 80° C. until use.
- Cytokine analysis Interferon- ⁇ (IFN- ⁇ ) was measured with a standard ELISA technique using a commercially available pair of mAb's from Endogen and used according to the instructions for use. Recombinant IFN- ⁇ (Gibco laboratories) was used as a standard. The detection level for the assay was 50 pg/ml. The variation between the duplicate wells did not exceed 10% of the mean. Responses of 9 individual donors are shown in TABLE 9.
- avium culture filtrate are shown for the comparison.
- donor no ag PHA PPD STCF CFP7 CFP17 CFP7A CFP20 CFP21 CFP22 CFP25 CFP26 CFP29 Controls, Healthy, BCG vaccinated, no known TB exposure 1 6 9564 6774 3966 7034 69 1799 58 152 73 182 946 86 2 48 12486 6603 8067 3146 10044 5267 29 6149 51 1937 526 2065 3 190 11929 10000 8299 8015 11563 8641 437 3194 669 2531 8076 6098 4 10 21029 4106 3537 1323 1939 5211 1 284 1 1344 20 125 5 1 18750 14209 13027 17725 8038 19002 1 3008 1 2103 974 8181 TB patients, 1-4 month after diagnosis 6 9 8973 5096 6145 852 4250 4019 284 1131 48 2400 1078 4584 7 1 12413 6281 3393 168 6375 45
- Group 2 10 ⁇ g MPT59/DDA (250 ⁇ g)
- mice were injected with a volume of 0.2 ml. Two weeks after the first injection and 3 weeks after the second injection the mice were boosted a little further up the back. One week after the last immunization the mice were bled and the blood cells were isolated. The immune response induced was monitored by release of IFN- ⁇ into the culture supernatants when stimulated in vitro with relevant antigens (see the following table).
- mice The recombinant antigens were tested individually as subunit vaccines in mice. Eleven groups of 6-8 weeks old, female C57Bl/6j mice (BomholtegArd, Denmark) were immunized subcutaneously at the base of the tail with vaccines of the following composition:
- mice 6 weeks after the last immunization the mice were aerosol challenged with 5 ⁇ 10 6 viable Mycobacterium tuberculosis /ml. After 6 weeks of infection the mice were killed and the number of viable bacteria in lung and spleen of infected mice was determined by plating serial 3-fold dilutions of organ homogenates on 7H11 plates. Colonies were counted after 2-3 weeks of incubation.
- the protective efficacy is expressed as the difference between log 10 values of the geometric mean of counts obtained from five mice of the relevant group and the geometric mean of counts obtained from five mouse of the relevant control group.
- PCR analyses were used in order to determine the distribution of the cfp7, cfp9 and mpt51 gene in species belonging to the tuberculosis-complex and in other mycobacteria.
- the bacterial strains used are listed in TABLE 10. PCR was performed on genomic DNA prepared from mycobacterial cells as described previously (Andersen et al., 1992).
- oligonucleotide primers used were synthesised automatically on a DNA synthesizer (Applied Biosystems, Forster City, Calif., ABI-391, PCR-mode), deblocked, and purified by ethanol precipitation.
- the primers used for the analyses are shown in TABLE 11.
- the PCR amplification was carried out in a thermal reactor (Rapid cycler, Idaho Technology, Idaho) by mixing 20 ng chromosomal with the mastermix (contained 0.5 ⁇ M of each oligonucleotide primer, 0.25 ⁇ M BSA (Stratagene), low salt buffer (20 mM Tris-HCl, pH8.8 , 10 mM KCl, 10 mM (NH4) 2 SO 4 , 2 mM MgSO 4 and 0, 1% Triton X-100) (Stratagene), 0.25 mM of each deoxynucleoside triphosphate and 0.5 U Taq Plus Long DNA polymerase (Stratagene)).
- the mastermix obtained 0.5 ⁇ M of each oligonucleotide primer, 0.25 ⁇ M BSA (Stratagene), low salt buffer (20 mM Tris-HCl, pH8.8 , 10 mM KCl, 10 mM (NH4) 2 SO 4 , 2 mM M
- Final volume was 10 ⁇ l (all concentrations given are concentrations in the final volume).
- Predenaturation was carried out at 94° C. for 30 s. 30 cycles of the following was performed: Denaturation at 94° C. for 30 s, annealing at 55° C. for 30 s and elongation at 72° C. for 1 min.
- mpt51 MPT51-3 and MPT51-2 (820 bp), MPT51-3 and MPT51-6 (108 bp), MPT51-5 and MPT51-4 (415 bp), MPT51-7 and MPT51-4 (325 bp).
- cfp7 pVF1 and PVR1 (274 bp), pVF1 and PVR2 (197 bp), pVF3 and PVR1 (302 bp), pVF3 and PVR2 (125 bp).
- cfp9 stR3 and stF1 (351 bp).
- the cfp7, cfp9, mpt51, rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b gene fragments were amplified by PCR from the plasmids pRVN01, pRVN02, pT052, pT087, pT088, pT089, pT090, pT091, pT096 or pT098 by using the primers shown in TABLE 11 and TABLE 2 (in Example 2a).
- cfp7, cfp9 and mpt51 were found in the M. tuberculosis complex including BCG and the environmental mycobacteria; M. avium, M. kansasii, M. marinum, M. intracellular and M. flavescens .
- cfp9 was additionally found in M. szulgai and mpt51 in M. xenopi.
- R1-ORF3 was recognized by 59% of smear positive TB patients but most importantly almost 50% of the HIV positive patients (compared to 15% for 38 kDa antigen), and as many as 70% of the smear negative patients (compared to 36% for 38 kDa antigen).
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Pulmonology (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention is based on the identification and characterization of a number of M. tuberculosis derived novel proteins and protein fragments (SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, and 168-171). The invention is directed to the polypeptides and immunologically active fragments thereof, the genes encoding them, immunological compositions such as vaccines and skin test reagents containing the polypeptides. Another part of the invention is based on the surprising discovery that fusions between ESAT-6 and MPT59 are superior immunogens compared to each of the unfused proteins, respectively.
Description
- The present invention relates to a number of immunologically active, novel polypeptide fragments derived from theMycobacterium tuberculosis, vaccines and other immunologic compositions containing the fragments as immunogenic components, and methods of production and use of the polypeptides. The invention also relates to novel nucleic acid fragments derived from M. tuberculosis which are useful in the preparation of the polypeptide fragments of the invention or in the diagnosis of infection with M. tuberculosis. The invention further relates to certain fusion polypeptides, notably fusions between ESAT-6 and MPT59.
- Human tuberculosis (hereinafter designated “TB”) caused byMycobacterium tuberculosis is a severe global health problem responsible for approximately 3 million deaths annually, according to the WHO. The worldwide incidence of new TB cases has been progressively falling for the last decade but the recent years has markedly changed this trend due to the advent of AIDS and the appearance of multidrug resistant strains of M. tuberculosis.
- The only vaccine presently available for clinical use is BCG, a vaccine which efficacy remains a matter of controversy. BCG generally induces a high level of acquired resistance in animal models of TB, but several human trials in developing countries have failed to demonstrate significant protection. Notably, BCG is not approved by the FDA for use in the United States.
- This makes the development of a new and improved vaccine against TB an urgent matter which has been given a very high priority by the WHO. Many attempts to define protective mycobacterial substances have been made, and from 1950 to 1970 several investigators reported an increased resistance after experimental vaccination. However, the demonstration of a specific long-term protective immune response with the potency of BCG has not yet been achieved by administration of soluble proteins or cell wall fragments, although progress is currently being made by relying on polypeptides derived from short term-culture filtrate, cf. the discussion below.
- Immunity toM. tuberculosis is characterized by three basic features; i) Living bacilli efficiently induces a protective immune response in contrast to killed preparations; ii) Specifically sensitized T lymphocytes mediate this protection; iii) The most important mediator molecule seems to be interferon gamma (INF-γ).
- Short term-culture filtrate (ST-CF) is a complex mixture of proteins released fromM. tuberculosis during the first few days of growth in a liquid medium (Andersen et al., 1991). Culture filtrates has been suggested to hold protective antigens recognized by the host in the first phase of TB infection (Andersen et al. 1991, Orme et al. 1993). Recent data from several laboratories have demonstrated that experimental subunit vaccines based on culture filtrate antigens can provide high levels of acquired resistance to TB (Pal and Horwitz, 1992; Roberts et al., 1995; Andersen, 1994; Lindblad et al., 1997). Culture filtrates are, however, complex protein mixtures and until now very limited information has been available on the molecules responsible for this protective immune response. In this regard, only two culture filtrate antigens have been described as involved in protective immunity, the low mass antigen ESAT-6 (Andersen et al., 1995 and EP-A-0 706 571) and the 31 kDa molecule Ag85B (EP-0 432 203).
- There is therefore a need for the identification of further antigens involved in the induction of protective immunity against TB in order to eventually produce an effective subunit vaccine.
- It is an object of the invention to provide novel antigens which are effective as components in a subunit vaccine against TB or which are useful as components in diagnostic compositions for the detection of infection with mycobacteria, especially virulence-associated mycobacteria. The novel antigens may also be important drug targets.
- The present invention is i.a. based on the identification and characterization of a number of previously uncharacterized culture filtrate antigens fromM. tuberculosis. In animal models of TB, T cells mediating immunity are focused predominantly to antigens in the regions 6-12 and 17-30 kDa of ST-CF. In the present invention 8 antigens in the low molecular weight region (CFP7, CFP7A, CFP7B, CFP8A, CFP8B, CFP9, CFP10A, and CFP11) and 18 antigens (CFP16, CFP17, CFP19, CFP19B, CFP20, CFP21, CFP22, CFP22A, CFP23, CFP23A, CFP23B, CFP25, CFP26, CFP27, CFP28, CFP29, CFP30A, and CFP30B) in the 17-30 kDa region have been identified. Of these, CFP19A and CFP23 have been selected because they exhibit relatively high homologies with CFP21 and CFP25, respectively, in so far that a nucleotide homology sequence search in the Sanger Database (cf. below) with the genes encoding CFP21 and CFP25, (cfp25 and cfp21 respectively), shows homology to two M. tuberculosis DNA sequences, orf19A and orf23. The two sequences, orf19a and orf23, encode to putative proteins CFP19A and CFP23 with the molecular weights of approx. 19 and 23 kDa respectively. The identity, at amino acid level, to CFP21 and CFP25 is 46% and 50%, respectively, for both proteins. CFP21 and CFP25 have been shown to be dominant T-cell antigens, and it is therefore believed that CFP19A and CFP23 are possible new T-cell antigens.
- Furthermore, a 50 kDa antigen (CFP50) has been isolated from culture filtrate and so has also an antigen (CWP32) isolated from the cell wall in the 30 kDa region.
- The present invention is also based on the identification of a number of putative antigens fromM. tuberculosis which are not present in Mycobacterium bovis BCG strains. The nucleotide sequences encoding these putative antigens are: rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b.
- Finally, the invention is based on the surprising discovery that fusions between ESAT-6 and MPT59 are superior immunogens compared to the unfused proteins, respectively.
- The encoding genes for 33 of the antigens have been determined, the distribution of a number of the antigens in various mycobacterial strains investigated and the biological activity of the products characterized. The panel hold antigens with potential for vaccine purposes as well as for diagnostic purposes, since the antigens are all secreted by metabolizing mycobacteria.
- The following table lists the antigens of the invention by the names used herein as well as by reference to relevant SEQ ID NOs of N-terminal sequences, full amino acid sequences and sequences of DNA encoding the antigens:
N-terminal Nucleotide Amino acid sequence sequence sequence Antigen SEQ ID NO: SEQ ID NO: SEQ ID NO: CFP7 1 2 CFP7A 81 47 48 CFP7B 168 146 147 CFP8A 73 148 149 CFP8B 74 150 151 CFP9 3 4 CFP10A 169 140 141 CFP11 170 142 143 CFP16 79 63 64 CFP17 17 5 6 CFP19 82 49 50 CFP19A 51 52 CFP19B 80 CFP20 18 7 8 CFP21 19 9 10 CFP22 20 11 12 CFP22A 83 53 54 CFP23 55 56 CFP23A 76 CFP23B 75 CFP25 21 13 14 CFP25A 78 65 66 CFP27 84 57 58 CFP28 22 CFP29 23 15 16 CFP30A 85 59 60 CFP30B 171 144 145 CFP50 86 61 62 MPT51 41 42 CWP32 77 152 153 RD1-ORF8 67 68 RD1-ORF2 71 72 RD1-ORF9B 69 70 RD1-ORF3 87 88 RD1-ORF9A 93 94 RD1-ORF4 89 90 RD1-ORF5 91 92 MPT59- 172 ESAT6 ESAT6- 173 MPT59 - It is well-known in the art that T-cell epitopes are responsible for the elicitation of the acquired immunity against TB, whereas B-cell epitopes are without any significant influence on acquired immunity and recognition of mycobacteria in vivo. Since such T-cell epitopes are linear and are known to have a minimum length of 6 amino acid residues, the present invention is especially concerned with the identification and utilisation of such T-cell epitopes.
- Hence, in its broadest aspect the invention relates to a substantially pure polypeptide fragment which
- a) comprises an amino acid sequence selected from the sequences shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, and any one of 168-171,
- b) comprises a subsequence of the polypeptide fragment defined in a) which has a length of at least 6 amino acid residues, said subsequence being immunologically equivalent to the polypeptide defined in a) with respect to the ability of evoking a protective immune response against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex, or
- c) comprises an amino acid sequence having a sequence identity with the polypeptide defined in a) or the subsequence defined in b) of at least 70% and at the same time being immunologically equivalent to the polypeptide defined in a) with respect to the ability of evoking a protective immune response against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex,
- with the proviso that
- i) the polypeptide fragment is in essentially pure form when consisting of the amino acid sequence 1-96 of SEQ ID NO: 2 or when consisting of the amino acid sequence 87-108 of SEQ ID NO: 4 fused to β-galactosidase,
- ii) the degree of sequence identity in c) is at least 95% when the polypeptide comprises a homologue of a polypeptide which has the amino acid sequence SEQ ID NO: 12 or a subsequence thereof as defined in b), and
- iii) the polypeptide fragment contains a threonine residue corresponding to position 213 in SEQ ID NO: 42 when comprising an amino acid sequence of at least 6 amino acids in SEQ ID NO: 42.
- Other parts of the invention pertains to the DNA fragments encoding a polypeptide with the above definition as well as to DNA fragments useful for determining the presence of DNA encoding such polypeptides.
- In the present specification and claims, the term “polypeptide fragment” denotes both short peptides with a length of at least two amino acid residues and at most 10 amino acid residues, oligopeptides (11-100 amino acid residues), and longer peptides-(the usual interpretation of “polypeptide”, i.e. more than 100 amino acid residues in length) as well as proteins (the functional entity comprising at least one peptide, oligopeptide, or polypeptide which may be chemically modified by being glycosylated, by being lipidated, or by comprising prosthetic groups). The definition of polypeptides also comprises native forms of peptides/proteins in mycobacteria as well as recombinant proteins or peptides in any type of expression vectors transforming any kind of host, and also chemically synthesized peptides.
- In the present context the term “substantially pure polypeptide fragment” means a polypeptide preparation which contains at most 5% by weight of other polypeptide material with which it is natively associated (lower percentages of other polypeptide material are preferred, e.g. at most 4%, at most 3%, at most 2%, at most 1%, and at most ½%). It is preferred that the substantially pure polypeptide is at least 96% pure, i.e. that the polypeptide constitutes at least 96% by weight of total polypeptide material present in the preparation, and higher percentages are preferred, such as at least 97%, at least 98%, at least 99%, at least 99,25%, at least 99,5%, and at least 99,75%. It is especially preferred that the polypeptide fragment is in “essentially pure form”, i.e. that the polypeptide fragment is essentially free of any other antigen with which it is natively associated, i.e. free of any other antigen from bacteria belonging to the tuberculosis complex. This can be accomplished by preparing the polypeptide fragment by means of recombinant methods in a non-mycobacterial host cell as will be described in detail below, or by synthesizing the polypeptide fragment by the well-known methods of solid or liquid phase peptide synthesis, e.g. by the method described by Merrifield or variations thereof.
- The term “subsequence” when used in connection with a polypeptide of the invention having a SEQ ID NO selected from 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, and any one of 168-171 denotes any continuous stretch of at least 6 amino acid residues taken from theM. tuberculosis derived polypeptides in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 and being immunological equivalent thereto with respect to the ability of conferring increased resistance to infections with bacteria belonging to the tuberculosis complex. Thus, included is also a polypeptide from different sources, such as other bacteria or even from eukaryotic cells.
- When referring to an “immunologically equivalent” polypeptide is herein meant that the polypeptide, when formulated in a vaccine or a diagnostic agent (i.e. together with a pharmaceutically acceptable carrier or vehicle and optionally an adjuvant), will
- I) confer, upon administration (either alone or as an immunologically active constituent together with other antigens), an acquired increased specific resistance in a mouse and/or in a guinea pig and/or in a primate such as a human being against infections with bacteria belonging to the tuberculosis complex which is at least 20% of the acquired increased resistance conferred byMycobacterium bovis BCG and also at least 20% of the acquired increased resistance conferred by the parent polypeptide comprising SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 (said parent polypeptide having substantially the same relative location and pattern in a 2DE gel prepared as the 2DE gel shown in FIG. 6, cf. the examples), the acquired increased resistance being assessed by the observed reduction in mycobacterial counts from spleen, lung or other organ homogenates isolated from the mouse or guinea pig receiving a challenge infection with a virulent strain of M. tuberculosis, or, in a primate such as a human being, being assessed by determining the protection against development of clinical tuberculosis in a vaccinated group versus that observed in a control group receiving a placebo or BCG (preferably the increased resistance is higher and corresponds to at least 50% of the protective immune response elicited by M. bovis BCG, such as at least 60%, or even more preferred to at least 80% of the protective immune response elicited by M. bovis BCG, such as at least 90%; in some cases it is expected that the increased resistance will supersede that conferred by M. bovis BCG, and hence it is preferred that the resistance will be at least 100%, such as at least 110% of said increased resistance); and/or
- II) elicit a diagnostically significant immune response in a mammal indicating previous or ongoing sensitization
- with antigens derived from mycobacteria belonging to the tuberculosis complex; this diagnostically significant immune response can be in the form of a delayed type hypersensitivity reaction which can e.g. be determined by a skin test, or can be in the form of IFN-γ release determined e.g. by an IFN-γ assay as described in detail below. A diagnostically significant response in a skin test setup will be a reaction which gives rise to a skin reaction which is at least 5 mm in diameter and which is at least 65% (preferably at least 75% such as at the least 85%) of the skin reaction (assessed as the skin reaction diameter) elicited by the parent polypeptide comprising SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171.
- The ability of the polypeptide fragment to confer increased immunity may thus be assessed by measuring in an experimental animal, e.g. a mouse or a guinea pig, the reduction in mycobacterial counts from the spleen, lung or other organ homogenates isolated from the experimental animal which have received a challenge infection with a virulent strain of mycobacteria belonging to the tuberculosis complex after previously having been immunized with the polypeptide, as compared to the mycobacterial counts in a control group of experimental animals infected with the same virulent strain, which experimental animals have not previously been immunized against tuberculosis. The comparison of the mycobacterial counts may also be carried out with mycobacterial counts from a group of experimental animals receiving a challenge infection with the same virulent strain after having been immunized withMycobacterium bovis BCG.
- The mycobacterial counts in homogenates from the experimental animals immunized with a polypeptide fragment according to the present invention must at the most be 5 times the counts in the mice or guinea pigs immunized withMycobacterium bovis BCG, such as at the most 3 times the counts, and preferably at the most 2 times the counts.
- A more relevant assessment of the ability of the polypeptide fragment of the invention to confer increased resistance is to compare the incidence of clinical tuberculosis in two groups of individuals (e.g. humans or other primates) where one group receives a vaccine as described herein which contains an antigen of the invention and the other group receives either a placebo or an other known TB vaccine (e.g. BCG). In such a setup, the antigen of the invention should give rise to a protective immunity which is significantly higher than the one provided by the administration of the placebo (as determined by statistical methods known to the skilled artisan).
- The “tuberculosis-complex” has its usual meaning, i.e. the complex of mycobacteria causing TB which areMycobacterium tuberculosis, Mycobacterium bovis, Mycobacterium bovis BCG, and Mycobacterium africanum.
- In the present context the term “metabolizing mycobacteria” means live mycobacteria that are multiplying logarithmically and releasing polypeptides into the culture medium wherein they are cultured.
-
- wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences. Hence, the DNA sequence AGTCAGTC will have a sequence identity of 75% with the sequence AATCAATC (Ndif=2 and Nref=8).
- The sequence identity is used here to illustrate the degree of identity between the amino acid sequence of a given polypeptide and the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171. The amino acid sequence to be compared with the amino acid sequence shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 may be deduced from a DNA sequence, e.g. obtained by hybridization as defined below, or may be obtained by conventional amino acid sequencing methods. The sequence identity is preferably determined on the amino acid sequence of a mature polypeptide, i.e. without taking any leader sequence into consideration.
- As appears from the above disclosure, polypeptides which are not identical to the polypeptides having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 are embraced by the present invention. The invention allows for minor variations which do not have an adverse effect on immunogenicity compared to the parent sequences and which may give interesting and useful novel binding properties or biological functions and immunogenicities etc.
- Each polypeptide fragment may thus be characterized by specific amino acid and nucleic acid sequences. It will be understood that such sequences include analogues and variants produced by recombinant methods wherein such nucleic acid and polypeptide sequences have been modified by substitution, insertion, addition and/or deletion of one or more nucleotides in said nucleic acid sequences to cause the substitution, insertion, addition or deletion of one or more amino acid residues in the recombinant polypeptide. When the term DNA is used in the following, it should be understood that for the number of purposes where DNA can be substituted with RNA, the term DNA should be read to include RNA embodiments which will be apparent for the man skilled in the art. For the purposes of hybridization, PNA may be used instead of DNA, as PNA has been shown to exhibit a very dynamic hybridization profile (PNA is described in Nielsen P E et al., 1991, Science 254: 1497-1500).
- In both immunodiagnostics and vaccine preparation, it is often possible and practical to prepare antigens from segments of a known immunogenic protein or polypeptide. Certain epitopic regions may be used to produce responses similar to those produced by the entire antigenic polypeptide. Potential antigenic or immunogenic regions may be identified by any of a number of approaches, e.g., Jameson-Wolf or Kyte-Doolittle antigenicity analyses or Hopp and Woods (1981) hydrophobicity analysis (see, e.g., Jameson and Wolf, 1988; Kyte and Doolittle, 1982; or U.S. Pat. No. 4,554,101). Hydrophobicity analysis assigns average hydrophilicity values to each amino acid residue from these values average hydrophilicities can be calculated and regions of greatest hydrophilicity determined. Using one or more of these methods, regions of predicted antigenicity may be derived from the amino acid sequence assigned to the polypeptides of the invention.
- Alternatively, in order to identify relevant T-cell epitopes which are recognized during an immune response, it is also possible to use a “brute force” method: Since T-cell epitopes are linear, deletion mutants of polypeptides having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 will, if constructed systematically, reveal what regions of the polypeptides are essential in immune recognition, e.g. by subjecting these deletion mutants to the IFN-γ assay described herein. Another method utilises overlapping oligomers (preferably synthetic having a length of e.g. 20 amino acid residues) derived from polypeptides having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171. Some of these will give a positive response in the IFN-γ assay whereas others will not.
- In a preferred embodiment of the invention, the polypeptide fragment of the invention comprises an epitope for a T-helper cell.
- Although the minimum length of a T-cell epitope has been shown to be at least 6 amino acids, it is normal that such epitopes are constituted of longer stretches of amino acids. Hence it is preferred that the polypeptide fragment of the invention has a length of at least 7 amino acid residues, such as at least 8, at least 9, at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, and at least 30 amino acid residues.
- As will appear from the examples, a number of the polypeptides of the invention are natively translation products which include a leader sequence (or other short peptide sequences), whereas the product which can be isolated from short-term culture filtrates from bacteria belonging to the tuberculosis complex are free of these sequences. Although it may in some applications be advantageous to produce these polypeptides recombinantly and in this connection facilitate export of the polypeptides from the host cell by including information encoding the leader sequence in the gene for the polypeptide, it is more often preferred to either substitute the leader sequence with one which has been shown to be superior in the host system for effecting export, or to totally omit the leader sequence (e.g. when producing the polypeptide by peptide synthesis. Hence, a preferred embodiment of the invention is a polypeptide which is free from amino acid residues −30 to −1 in SEQ ID NO: 6 and/or −32 to −1 in SEQ ID NO: 10 and/or −8 to −1 in SEQ ID NO: 12 and/or −32 to −1 in SEQ ID NO: 14 and/or −33 to −1 in SEQ ID NO: 42 and/or −38 to −1 in SEQ ID NO: 52 and/or −33 to −1 in SEQ ID NO: 56 and/or −56 to −1 in SEQ ID NO: 58 and/or −28 to −1 in SEQ ID NO: 151.
- In another preferred embodiment, the polypeptide fragment of the invention is free from any signal sequence; this is especially interesting when the polypeptide fragment is produced synthetically but even when the polypeptide fragments are produced recombinantly it is normally acceptable that they are not exported by the host cell to the periplasm or the extracellular space; the polypeptide fragments can be recovered by traditional methods (cf. the discussion below) from the cytoplasm after disruption of the host cells, and if there is need for refolding of the polypeptide fragments, general refolding schemes can be employed, cf. e.g. the disclosure in WO 94/18227 where such a general applicable refolding method is described.
- A suitable assay for the potential utility of a given polypeptide fragment derived from SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171 is to assess the ability of the polypeptide fragment to effect IFN-γ release from primed memory T-lymphocytes. Polypeptide fragments which have this capability are according to the invention especially interesting embodiments of the invention: It is contemplated that polypeptide fragments which stimulate T lymphocyte immune response shortly after the onset of the infection are important in the control of the mycobacteria causing the infection before the mycobacteria have succeeded in multiplying up to the number of bacteria that would have resulted in fulminant infection.
- Thus, an important embodiment of the invention is a polypeptide fragment defined above which
- 1) induces a release of IFN-γ from primed memory T-lymphocytes withdrawn from a mouse within 2 weeks of primary infection or within 4 days after the mouse has been rechallenge infected with mycobacteria belonging to the tuberculosis complex, the induction performed by the addition of the polypeptide to a suspension comprising about 200,000 spleen cells per ml, the addition of the polypeptide resulting in a concentration of 1-4 μg polypeptide per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 2 days after the addition of the polypeptide to the suspension, and/or
- 2) induces a release of IFN-γ of at least 1,500 pg/ml above background level from about 1,000,000 human PBMC (peripheral blood mononuclear cells) per ml isolated from TB patients in the first phase of infection, or from healthy BCG vaccinated donors, or from healthy contacts to TB patients, the induction being performed by the addition of the polypeptide to a suspension comprising the about 1,000,000 PBMC per ml, the addition of the polypeptide resulting in a concentration of 1-4 μg polypeptide per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 2 days after the addition of the polypeptide to the suspension; and/or
- 3) induces an IFN-γ release from bovine PBMC derived from animals previously sensitized with mycobacteria belonging to the tuberculosis complex, said release being at least two times the release observed from bovine PBMC derived from animals not previously sensitized with mycobacteria belonging to the tuberculosis complex.
- Preferably, in
alternatives - The polypeptide fragments of the invention preferably comprises an amino acid sequence of at least 6 amino acid residues in length which has a higher sequence identity than 70 percent with SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, any one of 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, any one of 72-86, 88, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, or any one of 168-171. A preferred minimum percentage of sequence identity is at least 80%, such as at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, and at least 99.5%.
- As mentioned above, it will normally be interesting to omit the leader sequences from the polypeptide fragments of the invention. However, by producing fusion polypeptides, superior characteristics of the polypeptide fragments of the invention can be achieved. For instance, fusion partners which facilitate export of the polypeptide when produced recombinantly, fusion partners which facilitate purification of the polypeptide, and fusion partners which enhance the immunogenicity of the polypeptide fragment of the invention are all interesting possibilities. Therefore, the invention also pertains to a fusion polypeptide comprising at least one polypeptide fragment defined above and at least one fusion partner. The fusion partner can, in order to enhance immunogenicity, e.g. be selected from the group consisting of another polypeptide fragment as defined above (so as to allow for multiple expression of relevant epitopes), and an other polypeptide derived from a bacterium belonging to the tuberculosis complex, such as ESAT-6, MPB64, MPT64, and MPB59 or at least one T-cell epitope of any of these antigens. Other immunogenicity enhancing polypeptides which could serve as fusion partners are T-cell epitopes (e.g. derived from the polypeptides ESAT-6, MPB64, MPT64, or MPB59) or other immunogenic epitopes enhancing the immunogenicity of the target gene product, e.g. lymphokines such as INF-γ, IL-2 and IL-12. In order to facilitate expression and/or purification the fusion partner can e.g. be a bacterial fimbrial protein, e.g. the pilus components pilin and papA; protein A; the ZZ-peptide (ZZ-fusions are marketed by Pharmacia in Sweden); the maltose binding protein; gluthatione S-transferase; β-galactosidase; or poly-histidine.
- Other interesting fusion partners are polypeptides which are lipidated and thereby effect that the immunogenic polypeptide is presented in a suitable manner to the immune system. This effect is e.g. known from vaccines based on theBorrelia burgdorferi OspA polypeptide, wherein the lipidated membrane anchor in the polypeptide confers a self-adjuvating effect to the polypeptide (which is natively lipidated) when isolated from cells producing it. In contrast, the OspA polypeptide is relatively silent immunologically when prepared without the lipidation anchor.
- As evidenced in Example 6A, the fusion polypeptide consisting of MPT59 fused directly N-terminally to ESAT-6 enhances the immunogenicity of ESAT-6 beyond what would be expected from the immunogenicities of MPT59 and ESAT-6 alone. The precise reason for this surprising finding is not yet known, but it is expected that either the presence of both antigens lead to a synergistic effect with respect to immunogenicity or the presence of a sequence N-terminally to the ESAT-6 sequence protects this immune dominant protein from loss of important epitopes known to be present in the N-terminus. A third, alternative, possibility is that the presence of a sequence C-terminally to the MPT59 sequence enhances the immunologic properties of this antigen.
- Hence, one part of the invention pertains to a fusion polypeptide fragment which comprises a first amino acid sequence including at least one stretch of amino acids constituting a T-cell epitope derived from theM. tuberculosis protein ESAT-6 or MPT59, and a second amino acid sequence including at least one T-cell epitope derived from a M. tuberculosis protein different from ESAT-6 (if the first stretch of amino acids are derived from ESAT-6) or MPT59 (if the first stretch of amino acids are derived from MPT59) and/or including a stretch of amino acids which protects the first amino acid sequence from in vivo degradation or posttranslational processing. The first amino acid sequence may be situated N- or C-terminally to the second amino acid sequence, but in line with the above considerations regarding protection of the ESAT-6 N-terminus it is preferred that the first amino acid sequence is C-terminal to the second when the first amino acid sequence is derived from ESAT-6.
- Although only the effect of fusion between MPT59 and ESAT6 has been investigated at present, it is believed that ESAT6 and MPT59 or epitopes derived therefrom could be advantageously be fused to other fusion partners having substantially the same effect on overall immunogenicity of the fusion construct. Hence, it is preferred that such a fusion polypeptide fragment according of the invention is one, wherein the at least one T-cell epitope included in the second amino acid sequence is derived from aM. tuberculosis polypeptide (the “parent” polypeptide) selected from the group consisting of a polypeptide fragment according to the present invention and described in detail above and in the examples, or the amino acid sequence could be derived from any one of the M. tuberculosis proteins DnaK, GroEL, urease, glutamine synthetase, the proline rich complex, L-alanine dehydrogenase, phosphate binding protein, Ag 85 complex, HBHA (heparin binding hemagglutinin), MPT51, MPT64, superoxide dismutase, 19 kDa lipoprotein, a-crystallin, GroES, MPT59 (when the first amino acid sequence is derived from ESAT-6), and ESAT-6 (when the first amino acid sequence is derived from MPT59). It is preferred that the first and second T-cell pitopes each have a sequence identity of at least 70% with the natively occurring sequence in the proteins from which they are derived and it is even further preferred that the first and/or second amino acid sequence has a sequence identity of at least 70 % with the protein from which they are derived. A most preferred embodiment of this fusion polypeptide is one wherein the first amino acid sequence is the amino acid sequence of ESAT-6 or MPT59 and/or the second amino acid sequence is the full-length amino acid sequence of the possible “parent” polypeptides listed above.
- In the most preferred embodiment, the fusion polypeptide fragment comprises ESAT-6 fused to MPT59 (advantageously, ESAT-6 is fused to the C-terminus of MPT59) and in one special embodiment, there are no linkers introduced between the two amino acid sequences constituting the two parent polypeptide fragments.
- Another part of the invention pertains to a nucleic acid fragment in isolated form which
- 1) comprises a nucleic acid sequence which encodes a polypeptide or fusion polypeptide as defined above, or comprises a nucleic acid sequence complementary thereto, and/or
- 2) has a length of at least 10 nucleotides and hybridizes readily under stringent hybridization conditions (as defined in the art, i.e. 5-10° C. under the melting point Tm, cf. Sambrook et al, 1989, pages 11.45-11.49) with a nucleic acid fragment which has a nucleotide sequence selected from
- SEQ ID NO: 1 or a sequence complementary thereto,
- SEQ ID NO: 3 or a sequence complementary thereto,
- SEQ ID NO: 5 or a sequence complementary thereto,
- SEQ ID NO: 7 or a sequence complementary thereto,
- SEQ ID NO: 9 or a sequence complementary thereto,
- SEQ ID NO: 11 or a sequence complementary thereto,
- SEQ ID NO: 13 or a sequence complementary thereto,
- SEQ ID NO: 15 or a sequence complementary thereto,
- SEQ ID NO: 41 or a sequence complementary thereto,
- SEQ ID NO: 47 or a sequence complementary thereto,
- SEQ ID NO: 49 or a sequence complementary thereto,
- SEQ ID NO: 51 or a sequence complementary thereto,
- SEQ ID NO: 53 or a sequence complementary thereto,
- SEQ ID NO: 55 or a sequence complementary thereto,
- SEQ ID NO: 57 or a sequence complementary thereto,
- SEQ ID NO: 59 or a sequence complementary thereto,
- SEQ ID NO: 61 or a sequence complementary thereto,
- SEQ ID NO: 63 or a sequence complementary thereto,
- SEQ ID NO: 65 or a sequence complementary thereto,
- SEQ ID NO: 67 or a sequence complementary thereto,
- SEQ ID NO: 69 or a sequence complementary thereto,
- SEQ ID NO: 71 or a sequence complementary thereto,
- SEQ ID NO: 87 or a sequence complementary.thereto,
- SEQ ID NO: 89 or a sequence complementary thereto,
- SEQ ID NO: 91 or a sequence complementary thereto,
- SEQ ID NO: 93 or a sequence complementary thereto,
- SEQ ID NO: 140 or a sequence complementary thereto,
- SEQ ID NO: 142 or a sequence complementary thereto,
- SEQ ID NO: 144 or a sequence complementary thereto,
- SEQ ID NO: 146 or a sequence complementary thereto,
- SEQ ID NO: 148 or a sequence complementary thereto,
- SEQ ID NO: 150 or a sequence complementary thereto, and
- SEQ ID NO: 152 or a sequence complementary thereto,
- with the proviso that when the nucleic acid fragment comprises a subsequence of SEQ ID NO: 41, then the nucleic acid fragment contains an A corresponding to position 781 in SEQ ID NO: 41 and when the nucleic acid fragment comprises a subsequence of a nucleotide sequence exactly complementary to SEQ ID NO: 41, then the nucleic acid fragment comprises a T corresponding to position 781 in SEQ ID NO: 41.
- It is preferred that the nucleic acid fragment is a DNA fragment.
- To provide certainty of the advantages in accordance with the invention, the preferred nucleic acid sequence when employed for hybridization studies or assays includes sequences that are complementary to at least a 10 to 40, or so, nucleotide stretch of the selected sequence. A size of at least 10 nucleotides in length helps to ensure that the fragment will be of sufficient length to form a duplex molecule that is both stable and selective. Molecules having complementary sequences over stretches greater than 10 bases in length are generally preferred, though, in order to increase stability and selectivity of the hybrid, and thereby improve the quality and degree of specific hybrid molecules obtained.
- Hence, the term “subsequence” when used in connection with the nucleic acid fragments of the invention is intended to indicate a continuous stretch of at least 10 nucleotides exhibits the above hybridization pattern. Normally this will require a minimum sequence identity of at least 70% with a subsequence of the hybridization partner having SEQ ID NO: 1, 3, 5, 7, 9, 11, 12, 15, 21, 41, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 87, 89, 91, 93, 140, 142, 144, 146, 148, 150, or 152. It is preferred that the nucleic acid fragment is longer than 10 nucleotides, such as at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, and at least 80 nucleotides long, and the sequence identity should preferable also be higher than 70%, such as at least 75%, at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 96%, and at least 98%. It is most preferred that the sequence identity is 100%. Such fragments may be readily prepared by, for example, directly synthesizing the fragment by chemical means, by application of nucleic acid reproduction technology, such as the PCR technology of U.S. Pat. No. 4,603,102, or by introducing selected sequences into recombinant vectors for recombinant production.
- It is well known that the same amino acid may be encoded by various codons, the codon usage being related, inter alia, to the preference of the organisms in question expressing the nucleotide sequence. Thus, at least one nucleotide or codon of a nucleic acid fragment of the invention may be exchanged by others which, when expressed, result in a polypeptide identical or substantially identical to the polypeptide encoded by the nucleic acid fragment in question. The invention thus allows for variations in the sequence such as substitution, insertion (including introns), addition, deletion and rearrangement of one or more nucleotides, which variations do not have any substantial effect on the polypeptide encoded by the nucleic acid fragment or a subsequence thereof. The term “substitution” is intended to mean the replacement of one or more nucleotides in the full nucleotide sequence with one or more different nucleotides, “addition” is understood to mean the addition of one or more nucleotides at either end of the full nucleotide sequence, “insertion” is intended to mean the introduction of one or more nucleotides within the full nucleotide sequence, “deletion” is intended to indicate that one or more nucleotides have been deleted from the full nucleotide sequence whether at either end of the sequence or at any suitable point within it, and “rearrangement” is intended to mean that two or more nucleotide residues have been exchanged with each other.
- The nucleotide sequence to be modified may be of cDNA or genomic origin as discussed above, but may also be of synthetic origin. Furthermore, the sequence may be of mixed cDNA and genomic, mixed cDNA and synthetic or genomic and synthetic origin as discussed above. The sequence may have been modified, e.g. by site-directed mutagenesis, to result in the desired nucleic acid fragment encoding the desired polypeptide. The following discussion focused on modifications of nucleic acid encoding the polypeptide should be understood to encompass also such possibilities, as well as the possibility of building up the nucleic acid by ligation of two or more DNA fragments to obtain the desired nucleic acid fragment, and combinations of the above-mentioned principles.
- The nucleotide sequence may be modified using any suitable technique which results in the production of a nucleic acid fragment encoding a polypeptide of the invention.
- The modification of the nucleotide sequence encoding the amino acid sequence of the polypeptide of the invention should be one which does not impair the immunological function of the resulting polypeptide.
- A preferred method of preparing variants of the antigens disclosed herein is site-directed mutagenesis. This technique is useful in the preparation of individual peptides, or biologically functional equivalent proteins or peptides, derived from the antigen sequences, through specific mutagenesis of the underlying nucleic acid. The technique further provides a ready ability to prepare and test sequence variants, for example, incorporating one or more of the foregoing considerations, by introducing one or more nucleotide sequence changes into the nucleic acid. Site-specific mutagenesis allows the production of mutants through the use of specific oligonucleotide sequences which encode the nucleotide sequence of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. Typically, a primer of about 17 to 25 nucleotides in length is preferred, with about 5 to 10 residues on both sides of the junction of the sequence being altered.
- In general, the technique of site-specific mutagenesis is well known in the art as exemplified by publications (Adelman et al., 1983). As will be appreciated, the technique typically employs a phage vector which exists in both a single stranded and double stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as the M13 phage (Messing et al., 1981). These phage are readily commercially available and their use is generally well known to those skilled in the art.
- In general, site-directed mutagenesis in accordance herewith is performed by first obtaining a single-stranded vector which includes within its sequence a nucleic acid sequence which encodes the polypeptides of the invention. An oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically, for example by the method of Crea et al. (1978). This primer is then annealed with the single-stranded vector, and subjected to DNA polymerizing enzymes such asE. coli polymerase I Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated sequence and the second strand bears the desired mutation. This heteroduplex vector is then used to transform appropriate cells, such as E. coli cells, and clones are selected which include recombinant vectors bearing the mutated sequence arrangement.
- The preparation of sequence variants of the selected nucleic acid fragments of the invention using site-directed mutagenesis is provided as a means of producing potentially useful species of the genes and is not meant to be limiting as there are other ways in which sequence variants of the nucleic acid fragments of the invention may be obtained. For example, recombinant vectors encoding the desired genes may be treated with mutagenic agents to obtain sequence variants (see, e.g., a method described by Eichenlaub, 1979) for the mutagenesis of plasmid DNA using hydroxylamine.
- The invention also relates to a replicable expression vector which comprises a nucleic acid fragment defined above, especially a vector which comprises a nucleic acid fragment encoding a polypeptide fragment of the invention.
- The vector may be any vector which may conveniently be subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced. Thus, the vector may be an autonomously replicating vector, i.e. a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication; examples of such a vector are a plasmid, phage, cosmid, mini-chromosome or virus. Alternatively, the vector may be one which, when introduced in a host cell, is integrated in the host cell genome and replicated together with the chromosome(s) into which it has been integrated.
- Expression vectors may be constructed to include any of the DNA segments disclosed herein. Such DNA might encode an antigenic protein specific for virulent strains of mycobacteria or even hybridization probes for detecting mycobacteria nucleic acids in samples. Longer or shorter DNA segments could be used, depending on the antigenic protein desired. Epitopic regions of the proteins expressed or encoded by the disclosed DNA could be included as relatively short segments of DNA. A wide variety of expression vectors is possible including, for example, DNA segments encoding reporter gene products useful for identification of heterologous gene products and/or resistance genes such as antibiotic resistance genes which may be useful in identifying transformed cells.
- The vector of the invention may be used to transform cells so as to allow propagation of the nucleic acid fragments of the invention or so as to allow expression of the polypeptide fragments of the invention. Hence, the invention also pertains to a transformed cell harbouring at least one such vector according to the invention, said cell being one which does not natively harbour the vector and/or the nucleic acid fragment of the invention contained therein. Such a transformed cell (which is also a part of the invention) may be any suitable bacterial host cell or any other type of cell such as a unicellular eukaryotic organism, a fungus or yeast, or a cell derived from a multicellular organism, e.g. an animal or a plant. It is especially in cases where glycosylation is desired that a mammalian cell is used, although glycosylation of proteins is a rare event in prokaryotes. Normally, however, a prokaryotic cell is preferred such as a bacterium belonging to the genera Mycobacterium, Salmonella, Pseudomonas, Bacillus and Eschericia. It is preferred that the transformed cell is anE. coli, B. subtilis, or M. bovis BCG cell, and it is especially preferred that the transformed cell expresses a polypeptide according of the invention. The latter opens for the possibility to produce the polypeptide of the invention by simply recovering it from the culture containing the transformed cell. In the most preferred embodiment of this part of the invention the transformed cell is Mycobacterium bovis BCG strain: Danish 1331, which is the Mycobacterium bovis strain Copenhagen from the Copenhagen BCG Laboratory, Statens Seruminstitut, Denmark.
- The nucleic acid fragments of the invention allow for the recombinant production of the polypeptides fragments of the invention. However, also isolation from the natural source is a way of providing the polypeptide fragments as is peptide synthesis.
- Therefore, the invention also pertains to a method for the preparation of a polypeptide fragment of the invention, said method comprising inserting a nucleic acid fragment as defined above into a vector which is able to replicate in a host cell, introducing the resulting recombinant vector into the host cell (transformed cells may be selected using various techniques, including screening by differential hybridization, identification of fused reporter gene products, resistance markers, anti-antigen antibodies and the like), culturing the host cell in a culture medium under conditions sufficient to effect expression of the polypeptide (of course the cell may be cultivated under conditions appropriate to the circumstances, and if DNA is desired, replication conditions are used), and recovering the polypeptide from the host cell or culture medium; or
- isolating the polypeptide from a short-term culture filtrate as defined in
claim 1; or - isolating the polypeptide from whole mycobacteria of the tuberculosis complex or from lysates or fractions thereof, e.g. cell wall containing fractions, or
- synthesizing the polypeptide by solid or liquid phase peptide synthesis.
- The medium used to grow the transformed cells may be any conventional medium suitable for the purpose. A suitable vector may be any of the vectors described above, and an appropriate host cell may be any of the cell types listed above. The methods employed to construct the vector and effect introduction thereof into the host cell may be any methods known for such purposes within the field of recombinant DNA. In the following a more detailed description of the possibilities will be given:
- In general, of course, prokaryotes are preferred for the initial cloning of nucleic sequences of the invention and constructing the vectors useful in the invention. For example, in addition to the particular strains mentioned in the more specific disclosure below, one may mention by way of example, strains such asE. coli K12 strain 294 (ATCC No. 31446), E. coli B, and E. coli X 1776 (ATCC No. 31537). These examples are, of course, intended to be illustrative rather than limiting.
- Prokaryotes are also preferred for expression. The aforementioned strains, as well asE. coli W3110 (F-, lambda-, prototrophic, ATCC No. 273325), bacilli such as Bacillus subtilis, or other enterobacteriaceae such as Salmonella typhimurium or Serratia marcesans, and various Pseudomonas species may be used. Especially interesting are rapid-growing mycobacteria, e.g. M. smegmatis, as these bacteria have a high degree of resemblance with mycobacteria of the tuberculosis complex and therefore stand a good chance of reducing the need of performing post-translational modifications of the expression product.
- In general, plasmid vectors containing replicon and control sequences which are derived from species compatible with the host cell are used in connection with these hosts. The vector ordinarily carries a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells. For example,E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species (see, e.g., Bolivar et al., 1977, Gene 2: 95). The pBR322 plasmid contains genes for ampicillin and tetracycline resistance and thus provides easy means for identifying transformed cells. The pBR plasmid, or other microbial plasmid or phage must also contain, or be modified to contain, promoters which can be used by the microorganism for expression.
- Those promoters most commonly used in recombinant DNA construction include the B-lactamase (penicillinase) and lactose promoter systems (Chang et al., 1978; Itakura et al., 1977; Goeddel et al., 1979) and a tryptophan (trp) promoter system (Goeddel et al., 1979; EPO Appl. Publ. No. 0036776). While these are the most commonly used, other microbial promoters have been discovered and utilized, and details concerning their nucleotide sequences have been published, enabling a skilled worker to ligate them functionally with plasmid vectors (Siebwenlist et al., 1980). Certain genes from prokaryotes may be expressed efficiently inE. coli from their own promoter sequences, precluding the need for addition of another promoter by artificial means.
- After the recombinant preparation of the polypeptide according to the invention, the isolation of the polypeptide may for instance be carried out by affinity chromatography (or other conventional biochemical procedures based on chromatography), using a monoclonal antibody which substantially specifically binds the polypeptide according to the invention. Another possibility is to employ the simultaneous electroelution technique described by Andersen et al. in J. Immunol. Methods 161: 29-39.
- According to the invention the post-translational modifications involves lipidation, glycosylation, cleavage, or elongation of the polypeptide.
- In certain aspects, the DNA sequence information provided by this invention allows for the preparation of relatively short DNA (or RNA or PNA) sequences having the ability to specifically hybridize to mycobacterial gene sequences. In these aspects, nucleic acid probes of an appropriate length are prepared based on a consideration of the relevant sequence. The ability of such nucleic acid probes to specifically hybridize to the mycobacterial gene sequences lend them particular utility in a variety of embodiments. Most importantly, the probes can be used in a variety of diagnostic assays for detecting the presence of pathogenic organisms in a given sample. However, either uses are envisioned, including the use of the sequence information for the preparation of mutant species primers, or primers for use in preparing other genetic constructs.
- Apart from their use as starting points for the synthesis of polypeptides of the invention and for hybridization probes (useful for direct hybridization assays or as primers in e.g. PCR or other molecular amplification methods) the nucleic acid fragments of the invention may be used for effecting in vivo expression of antigens, i.e. the nucleic acid fragments may be used in so-called DNA vaccines. Recent research have revealed that a DNA fragment cloned in a vector which is non-replicative in eukaryotic cells may be introduced into an animal (including a human being) by e.g. intramuscular injection or percutaneous administration (the so-called “gene gun” approach). The DNA is taken up by e.g. muscle cells and the gene of interest is expressed by a promoter which is functioning in eukaryotes, e.g. a viral promoter, and the gene product thereafter stimulates the immune system. These newly discovered methods are reviewed in Ulmer et al., 1993, which hereby is included by reference.
- Hence, the invention also relates to a vaccine comprising a nucleic acid fragment according to the invention, the vaccine effecting in vivo expression of antigen by an animal, including a human being, to whom the vaccine has been administered, the amount of expressed antigen being effective to confer substantially increased resistance to infections with mycobacteria of the tuberculosis complex in an animal, including a human being.
- The efficacy of such a “DNA vaccine” can possibly be enhanced by administering the gene encoding the expression product together with a DNA fragment encoding a polypeptide which has the capability of modulating an immune response. For instance, a gene encoding lymphokine precursors or lymphokines (e.g. IFN-γ, IL-2, or IL-12) could be administered together with the gene encoding the immunogenic protein, either by administering two separate DNA fragments or by administering both DNA fragments included in the same vector. It also is a possibility to administer DNA fragments comprising a multitude of nucleotide sequences which each encode relevant epitopes of the polypeptides disclosed herein so as to effect a continuous sensitization of the immune system with a broad spectrum of these epitopes.
- As explained above, the polypeptide fragments of the invention are excellent candidates for vaccine constituents or for constituents in an immune diagnostic agent due to their extracellular presence in culture media containing metabolizing virulent mycobacteria belonging to the tuberculosis complex, or because of their high homologies with such extracellular antigens, or because of their absence inM. bovis BCG.
- Thus, another part of the invention pertains to an immunologic composition comprising a polypeptide or fusion polypeptide according to the invention. In order to ensure optimum performance of such an immunologic composition it is preferred that it comprises an immunologically and pharmaceutically acceptable carrier, vehicle or adjuvant.
- Suitable carriers are selected from the group consisting of a polymer to which the polypeptide(s) is/are bound by hydrophobic non-covalent interaction, such as a plastic, e.g. polystyrene, or a polymer to which the polypeptide(s) is/are covalently bound, such as a polysaccharide, or a polypeptide, e.g. bovine serum albumin, ovalbumin or keyhole limpet haemocyanin. Suitable vehicles are selected from the group consisting of a diluent and a suspending agent. The adjuvant is preferably selected from the group consisting of dimethyldioctadecylammonium bromide (DDA), Quil A, poly I:C, Freund's incomplete adjuvant, IFN-γ, IL-2, IL-12, monophosphoryl lipid A (MPL), and muramyl dipeptide (MDP).
- A preferred immunologic composition according to the present invention comprising at least two different polypeptide fragments, each different polypeptide fragment being a polypeptide or a fusion polypeptide defined above. It is preferred that the immunologic composition comprises between 3-20 different polypeptide fragments or fusion polypeptides.
- Such an immunologic composition may preferably be in the form of a vaccine or in the form of a skin test reagent.
- In line with the above, the invention therefore also pertain to a method for producing an immunologic composition according to the invention, the method comprising preparing, synthesizing or isolating a polypeptide according to the invention, and solubilizing or dispersing the polypeptide in a medium for a vaccine, and optionally adding otherM. tuberculosis antigens and/or a carrier, vehicle and/or adjuvant substance.
- Preparation of vaccines which contain peptide sequences as active ingredients is generally well understood in the art, as exemplified by U.S. Pat. Nos. 4,608,251; 4,601,903; 4,599,231; 4,599,230; 4,596,792; and 4,578,770, all incorporated herein by reference. Typically, such vaccines are prepared as injectables either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The preparation may also be emulsified. The active immunogenic ingredient is often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the effectiveness of the vaccines.
- The vaccines are conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and carriers may include, for example, polyalkalene glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10%, preferably 1-2%. Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or powders and contain 10-95% of active ingredient, preferably 25-70%.
- The proteins may be formulated into the vaccine as neutral or salt forms. Pharmaceutically acceptable salts include acid addition salts (formed with the free amino groups of the peptide) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups may also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.
- The vaccines are administered in a manner compatible with the dosage formulation, and in such amount as will be therapeutically effective and immunogenic. The quantity to be administered depends on the subject to be treated, including, e.g., the capacity of the individual's immune system to mount an immune response, and the degree of protection desired. Suitable dosage ranges are of the order of several hundred micrograms active ingredient per vaccination with a preferred range from about 0.1 μg to 1000 μg, such as in the range from about 1 μg to 300 μg, and especially in the range from about 10 μg to 50 μg. Suitable regimens for initial administration and booster shots are also variable but are typified by an initial administration followed by subsequent inoculations or other administrations.
- The manner of application may be varied widely. Any of the conventional methods for administration of a vaccine are applicable. These are believed to include oral application on a solid physiologically acceptable base or in a physiologically acceptable dispersion, parenterally, by injection or the like. The dosage of the vaccine will depend on the route of administration and will vary according to the age of the person to be vaccinated and, to a lesser degree, the size of the person to be vaccinated.
- Some of the polypeptides of the vaccine are sufficiently immunogenic in a vaccine, but for some of the others the immune response will be enhanced if the vaccine further comprises an adjuvant substance.
- Various methods of achieving adjuvant effect for the vaccine include use of agents such as aluminum hydroxide or phosphate (alum), commonly used as 0.05 to 0.1 percent solution in phosphate buffered saline, admixture with synthetic polymers of sugars (Carbopol) used as 0.25 percent solution, aggregation of the protein in the vaccine by heat treatment with temperatures ranging between 70° to 101° C. for 30 second to 2 minute periods respectively. Aggregation by reactivating with pepsin treated (Fab) antibodies to albumin, mixture with bacterial cells such asC. parvum or endotoxins or lipopolysaccharide components of gram-negative bacteria, emulsion in physiologically acceptable oil vehicles such as mannide monooleate (Aracel A) or emulsion with 20 percent solution of a perfluorocarbon (Fluosol-DA) used as a block substitute may also be employed. According to the invention DDA (dimethyldioctadecylammonium bromide) is an interesting candidate for an adjuvant, but also Freund's complete and incomplete adjuvants as well as QuilA and RIBI are interesting possibilities. Further possibilities are monophosphoryl lipid A (MPL), and muramyl dipeptide (MDP).
- Another highly interesting (and thus, preferred) possibility of achieving adjuvant effect is to employ the technique described in Gosselin et al., 1992 (which is hereby incorporated by reference herein). In brief, the presentation of a relevant antigen such as an antigen of the present invention can be enhanced by conjugating the antigen to antibodies (or antigen binding antibody fragments) against the Fcγ receptors on monocytes/macrophages. Especially conjugates between antigen and anti-FcγRI have been demonstrated to enhance immunogenicity for the purposes of vaccination.
- Other possibilities involve the use of immune modulating substances such as lymphokines (e.g. IFN-γ, IL-2 and IL-12) or synthetic IFN-γ inducers such as poly I:C in combination with the above-mentioned adjuvants. As discussed in example 3, it is contemplated that such mixtures of antigen and adjuvant will lead to superior vaccine formulations.
- In many instances, it will be necessary to have multiple administrations of the vaccine, usually not exceeding six vaccinations, more usually not exceeding four vaccinations and preferably one or more, usually at least about three vaccinations. The vaccinations will normally be at from two to twelve week intervals, more usually from three to five week intervals. Periodic boosters at intervals of 1-5 years, usually three years, will be desirable to maintain the desired levels of protective immunity. The course of the immunization may be followed by in vitro proliferation assays of PBL (peripheral blood lymphocytes) co-cultured with ESAT-6 or ST-CF, and especially by measuring the levels of IFN-γ released form the primed lymphocytes. The assays may be performed using conventional labels, such as radionuclides, enzymes, fluorescers, and the like. These techniques are well known and may be found in a wide variety of patents, such as U.S. Pat. Nos. 3,791,932; 4,174,384 and 3,949,064, as illustrative of these types of assays.
- Due to genetic variation, different individuals may react with immune responses of varying strength to the same polypeptide. Therefore, the vaccine according to the invention may comprise several different polypeptides in order to increase the immune response. The vaccine may comprise two or more polypeptides, where all of the polypeptides are as defined above, or some but not all of the peptides may be derived from a bacterium belonging to theM. tuberculosis complex. In the latter example the polypeptides not necessarily fulfilling the criteria set forth above for polypeptides may either act due to their own immunogenicity or merely act as adjuvants. Examples of such interesting polypeptides are MPB64, MPT64, and MPB59, but any other substance which can be isolated from mycobacteria are possible candidates.
- The vaccine may comprise 3-20 different polypeptides, such as 3-10 different polypeptides.
- One reason for admixing the polypeptides of the invention with an adjuvant is to effectively activate a cellular immune response. However, this effect can also be achieved in other ways, for instance by expressing the effective antigen in a vaccine in a non-pathogenic microorganism. A well-known example of such a microorganism isMycobacterium bovis BCG.
- Therefore, another important aspect of the present invention is an improvement of the living BCG vaccine presently available, which is a vaccine for immunizing an animal, including a human being, against TB caused by mycobacteria belonging to the tuberculosis-complex, comprising as the effective component a microorganism, wherein one or more copies of a DNA sequence encoding a polypeptide as defined above has been incorporated into the genome of the microorganism in a manner allowing the microorganism to express and secrete the polypeptide.
- In the present context the term “genome” refers to the chromosome of the microorganisms as well as extrachromosomally DNA or RNA, such as plasmids. It is, however, preferred that the DNA sequence of the present invention has been introduced into the chromosome of the non-pathogenic microorganism, since this will prevent loss of the genetic material introduced.
- It is preferred that the non-pathogenic microorganism is a bacterium, e.g. selected from the group consisting of the genera Mycobacterium, Salmonella, Pseudomonas and Eschericia.
- It is especially preferred that the non-pathogenic microorganism isMycobacterium bovis BCG, such as Mycobacterium bovis BCG strain: Danish 1331.
- The incorporation of one or more copies of a nucleotide sequence encoding the polypeptide according to the invention in a mycobacterium from aM. bovis BCG strain will enhance the immunogenic effect of the BCG strain. The incorporation of more than one copy of a nucleotide sequence of the invention is contemplated to enhance the immune response even more, and consequently an aspect of the invention is a vaccine wherein at least 2 copies of a DNA sequence encoding a polypeptide is incorporated in the genome of the microorganism, such as at least 5 copies. The copies of DNA sequences may either be identical encoding identical polypeptides or be variants of the same DNA sequence encoding identical or homologues of a polypeptide, or in another embodiment be different DNA sequences encoding different polypeptides where at least one of the polypeptides is according to the present invention.
- The living vaccine of the invention can be prepared by cultivating a transformed non-pathogenic cell according to the invention, and transferring these cells to a medium for a vaccine, and optionally adding a carrier, vehicle and/or adjuvant substance.
- The invention also relates to a method of diagnosing TB caused byMycobacterium tuberculosis, Mycobacterium africanum or Mycobacterium bovis in an animal, including a human being, comprising intradermally injecting, in the animal, a polypeptide according to the invention or a skin test reagent described above, a positive skin response at the location of injection being indicative of the animal having TB, and a negative skin response at the location of injection being indicative of the animal not having TB. A positive response is a skin reaction having a diameter of at least 5 mm, but larger reactions are preferred, such as at least 1 cm, 1.5 cm, and at least 2 cm in diameter. The composition used as the skin test reagent can be prepared in the same manner as described for the vaccines above.
- In line with the disclosure above pertaining to vaccine preparation and use, the invention also pertains to a method for immunising an animal, including a human being, against TB caused by mycobacteria belonging to the tuberculosis complex, comprising administering to the animal the polypeptide of the invention, or a vaccine composition of the invention as described above, or a living vaccine described above. Preferred routes of administration are the parenteral (such as intravenous and intraarterially), intraperitoneal, intramuscular, subcutaneous, intradermal, oral, buccal, sublingual, nasal, rectal or transdermal route.
- The protein ESAT-6 which is present in short-term culture filtrates from mycobacteria as well as the esat-6 gene in the mycobacterial genome has been demonstrated to have a very limited distribution in other mycobacterial strains thatM. tuberculosis, e.g. esat-6 is absent in both BCG and the majority of mycobacterial species isolated from the environment, such as M. avium and M. terrae. It is believed that this is also the case for at least one of the antigens of the present invention and their genes and therefore, the diagnostic embodiments of the invention are especially well-suited for performing the diagnosis of on-going or previous infection with virulent mycobacterial strains of the tuberculosis complex, and it is contemplated that it will be possible to distinguish between 1) subjects (animal or human) which have been previously vaccinated with e.g. BCG vaccines or subjected to antigens from non-virulent mycobacteria and 2) subjects which have or have had active infection with virulent mycobacteria.
- A number of possible diagnostic assays and methods can be envisaged:
- When diagnosis of previous or ongoing infection with virulent mycobacteria is the aim, a blood sample comprising mononuclear cells (i.a. T-lymphocytes) from a patient could be contacted with a sample of one or more polypeptides of the invention. This contacting can be performed in vitro and a positive reaction could e.g. be proliferation of the T-cells or release cytokines such as γ-interferon into the extracellular phase (e.g. into a culture supernatant); a suitable in vivo test would be a skin test as described above. It is also conceivable to contact a serum sample from a subject to contact with a polypeptide of the invention, the demonstration of a binding between antibodies in the serum sample and the polypeptide being indicative of previous or ongoing infection.
- The invention therefore also relates to an in vitro method for diagnosing ongoing or previous sensitization in an animal or a human being with bacteria belonging to the tuberculosis complex, the method comprising providing a blood sample from the animal or human being, and contacting the sample from the animal with the polypeptide of the invention, a significant release into the extracellular phase of at least one cytokine by mononuclear cells in the blood sample being indicative of the animal being sensitized. By the term “significant release” is herein meant that the release of the cytokine is significantly higher than the cytokine release from a blood sample derived from a non-tuberculous subject (e.g. a subject which does not react in a traditional skin test for TB). Normally, a significant release is at least two times the release observed from such a sample.
- Alternatively, a sample of a possibly infected organ may be contacted with an antibody raised against a polypeptide of the invention. The demonstration of the reaction by means of methods well-known in the art between the sample and the antibody will be indicative of ongoing infection. It is of course also a possibility to demonstrate the presence of anti-mycobacterial antibodies in serum by contacting a serum sample from a subject with at least one of the polypeptide fragments of the invention and using well-known methods for visualizing the reaction between the antibody and antigen.
- Also a method of determining the presence of mycobacterial nucleic acids in an animal, including a human being, or in a sample, comprising administering a nucleic acid fragment of the invention to the animal or incubating the sample with the nucleic acid fragment of the invention or a nucleic acid fragment complementary thereto, and detecting the presence of hybridized nucleic acids resulting from the incubation (by using the hybridization assays which are well-known in the art), is also included in the invention. Such a method of diagnosing TB might involve the use of a composition comprising at least a part of a nucleotide sequence as defined above and detecting the presence of nucleotide sequences in a sample from the animal or human being to be tested which hybridize with the nucleic acid fragment (or a complementary fragment) by the use of PCR technique.
- The fact that certain of the disclosed antigens are not present inM. bovis BCG but are present in virulent mycobacteria point them out as interesting drug targets; the antigens may constitute receptor molecules or toxins which facilitate the infection by the mycobacterium, and if such functionalities are blocked the infectivity of the mycobacterium will be diminshed.
- To determine particularly suitable drug targets among the antigens of the invention, the gene encoding at least one of the polypeptides of the invention and the necessary control sequences can be introduced into avirulent strains of mycobacteria (e.g. BCG) so as to determine which of the polypeptides are critical for virulence. Once particular proteins are identified as critical for/contributory to virulence, anti-mycobacterial agents can be designed rationally to inhibit expression of the critical genes or to attack the critical gene products. For instance, antibodies or fragments thereof (such as Fab and (Fab′)2 fragments can be prepared against such critical polypeptides by methods known in the art and thereafter used as prophylactic or therapeutic agents. Alternatively, small molecules can be screened for their ability to selectively inhibit expression of the critical gene products, e.g. using recombinant expression systems which include the gene's endogenous promoter, or for their ability to directly interfere with the action of the target. These small molecules are then used as therapeutics or as prophylactic agents to inhibit mycobacterial virulence.
- Alternatively, anti-mycobacterial agents which render a virulent mycobacterium avirulent can be operably linked to expression control sequences and used to transform a virulent mycobacterium. Such anti-mycobacterial agents inhibit the replication of a specified mycobacterium upon transcription or translation of the agent in the mycobacterium. Such a “newly avirulent” mycobacterium would constitute a superb alternative to the above described modified BCG for vaccine purposes since it would be immunologically very similar to a virulent mycobacterium compared to e.g. BCG.
- Finally, a monoclonal or polyclonal antibody, which is specifically reacting with a polypeptide of the invention in an immuno assay, or a specific binding fragment of said antibody, is also a part of the invention. The production of such polyclonal antibodies requires that a suitable animal be immunized with the polypeptide and that these antibodies are subsequently isolated, suitably by immune affinity chromatography. The production of monoclonals can be effected by methods well-known in the art, since the present invention provides for adequate amounts of antigen for both immunization and screening of positive hybridomas.
- Legends to the Figures
- FIG. 1: Long term memory immune mice are very efficiently protected towards an infection withM. tuberculosis. Mice were given a challenge of M. tuberculosis and spleens were isolated at different time points. Spleen lymphocytes were stimulated in vitro with ST-CF and the release of IFN-γ investigated (panel A). The counts of CFU in the spleens of the two groups of mice are indicated in panel B. The memory immune mice control infection within the first week and produce large quantities of IFN-γ in response to antigens in ST-CF.
- FIG. 2: T cells involved in protective immunity are predominantly directed to molecules from 6-12 and 17-38 kDa. Splenic T cells were isolated four days after the challenge withM. tuberculosis and stimulated in vitro with narrow molecular mass fractions of ST-CF. The release of IFN-γ was investigated
- FIG. 3: Nucleotide sequence (SEQ ID NO: 1) of cfp7. The deduced amino acid sequence (SEQ ID NO: 2) of CFP7 is given in conventional one-letter code below the nucleotide sequence. The putative ribosome-binding site is written in underlined italics as are the putative −10 and −35 regions. Nucleotides written in bold are those encoding CFP7.
- FIG. 4. Nucleotide sequence (SEQ ID NO: 3) of cfp9. The deduced amino acid sequence (SEQ ID NO: 4) of CFP9 is given in conventional one-letter code below the nucleotide sequence. The putative ribosome-binding site Shine Delgarno sequence is written in underlined italics as are the putative −10 and −35 regions. Nucleotides in bold writing are those encoding CFP9. The nucleotide sequence obtained from the lambda 226 phage is double underlined.
- FIG. 5: Nucleotide sequence of mpt51. The deduced amino acid sequence of MPT51 is given in a one-letter code below the nucleotide sequence. The signal is indicated in italics the putative potential ribosome-binding site is underlined. The nucleotide difference and amino acid difference compared to the nucleotide sequence of MPB51 (Ohara et al., 1995) are underlined at position 780. The nucleotides given in italics are not present inM. tuberculosis H37Rv.
- FIG. 6: the position of the purified antigens in the 2DE system have been determined and mapped in a reference gel. The newly purified antigens are encircled and the position of well-known proteins are also indicated.
- Identification of Single Culture Filtrate Antigens Involved in Protective Immunity
- A group of efficiently protected mice was generated by infecting 8-12 weeks old female C57Bl/6j mice with 5×104 M. tuberculosis i.v. After 30 days of infection the mice were subjected to 60 days of antibiotic treatment with isoniazid and were then left for 200-240 days to ensure the establishment of resting long-term memory immunity. Such memory immune mice are very efficiently protected against a secondary infection (FIG. 1). Long lasting immunity in this model is mediated by a population of highly reactive CD4 cells recruited to the site of infection and triggered to produce large amounts of IFN-γ in response to ST-CF (FIG. 1) (Andersen et al. 1995).
- We have used this model to identify single antigens recognized by protective T cells. Memory immune mice were reinfected with 1×106 M. tuberculosis i.v. and splenic lymphocytes were harvested at day 4-6 of reinfection, a time point where this population is highly reactive to ST-CF. The antigens recognized by these T cells were mapped by the multi-elution technique (Andersen and Heron, 1993). This technique divides complex protein mixtures separated in SDS-PAGE into narrow fractions in a physiological buffer. These fractions were used to stimulate spleen lymphocytes in vitro and the release of IFN-γ was monitored (FIG. 2). Long-term memory immune mice did not recognize these fractions before TB infection, but splenic lymphocytes obtained during the recall of protective immunity recognized a range of culture filtrate antigens and peak production of IFN-γ was found in response to proteins of apparent molecular weight 6-12 and 17-30 kDa (FIG. 2). It is therefore concluded that culture filtrate antigens within these regions are the major targets recognized by memory effector T-cells triggered to release IFN-γ during the first phase of a protective immune response.
- Cloning of Genes Expressing Low Mass Culture Filtrate Antigens
- In example 1 it was demonstrated that antigens in the low molecular mass fraction are recognized strongly by cells isolated from memory immune mice. Monoclonal antibodies (mAbs) to these antigens were therefore generated by immunizing with the low mass fraction in RIBI adjuvant (first and second immunization) followed by two injections with the fractions in aluminium hydroxide. Fusion and cloning of the reactive cell lines were done according to standard procedures (Kohler and Milstein 1975). The procedure resulted in the provision of two mAbs: ST-3 directed to a 9 kDa culture filtrate antigen (CFP9) and PV-2 directed to a 7 kDa antigen (CFP7), when the molecular weight is estimated from migration of the antigens in an SDS-PAGE.
- In order to identify the antigens binding to the Mab's, the following experiments were carried out:
- The recombinant λgt11M. tuberculosis DNA library constructed by R. Young (Young, R. A. et al. 1985) and obtained through the World Health Organization IMMTUB programme (WHO.0032.wibr) was screened for phages expressing gene products which would bind the monoclonal antibodies ST-3 and PV-2.
- Approximately 1×105 pfu of the gene library (containing approximately 25% recombinant phages) were plated on Eschericia coli Y1090 (DlacU169, proA+, Dlon, araD139, supF, trpC22::tn10 [pMC9] ATCC#37197) in soft agar and incubated for 2,5 hours at 42° C.
- The plates were overlaid with sheets of nitrocellulose saturated with isopropyl-β-D-thiogalactopyranoside and incubation was continued for 2,5 hours at 370C. The nitrocellulose was removed and incubated with samples of the monoclonal antibodies in PBS with
Tween 20 added to a final concentration of 0.05%. Bound monoclonal antibodies were visualized by horseradish peroxidase-conjugated rabbit anti-mouse immunoglobulins (P260, Dako, Glostrup, DK) and a staining reaction involving 5,5′,3,3′-tetramethylbenzidine and H2O2. - Positive plaques were recloned and the phages originating from a single plaque were used to lysogenizeE. coli Y1089 (DlacU169, proA+, Dlon, araD139, strA, hfl150 [chr::tnlO] [pMC9] ATCC nr. 37196). The resultant lysogenic strains were used to propagate phage particles for DNA extraction. These lysogenic E. coli strains have been named:
- AA226 (expressing ST-3 reactive polypeptide CFP9) which has been deposited 28 Jun. 1993 with the collection of Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM) under the accession number DSM 8377 and in accordance with the provisions of the Budapest Treaty, and
- AA242 (expressing PV-2 reactive polypeptide CFP7) which has been deposited 28 Jun. 1993 with the collection of Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSM) under the accession number DSM 8379 and in accordance with the provisions of the Budapest Treaty.
- These two lysogenicE. coli strains are disclosed in WO 95/01441 as are the mycobacterial polypeptide products expressed thereby. However, no information concerning the amino acid sequences of these polypeptides or their genetic origin are given, and therefore only the direct expression products of AA226 and AA242 are made available to the public.
- The st-3 binding protein is expressed as a protein fused to β-galactosidase, whereas the pv-2 binding protein appears to be expressed in an unfused version.
- Secuencing of the Nucleotide Sequence Encoding the PV-2 and ST-3 Binding Protein
- In order to obtain the nucleotide sequence of the gene encoding the pv-2 binding protein, the approximately 3 kbM. tuberculosis derived EcoRI−EcoRI fragment from AA242 was subcloned in the EcoRI site in the pBluescriptSK+(Stratagene) and used to transform E. coli XL-1Blue (Stratagene).
- Similarly, to obtain the nucleotide sequence of the gene encoding the st-3 binding protein, the approximately 5 kbM. tuberculosis derived EcoRI−EcoRI fragment from AA226 was subcloned in the EcoRI site in the pBluescriptSK+(Stratagene) and used to transform E. coli XL-1Blue (Stratagene).
- The complete DNA sequence of both genes were obtained by the dideoxy chain termination method adapted for supercoiled DNA by use of the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., Cleveland, Ohio) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. The sequences DNA are shown in SEQ ID NO: 1 (CFP7) and in SEQ ID NO: 3 (CFP9) as well as in FIGS. 3 and 4, respectively. Both strands of the DNA were sequenced.
- CFP7
- An open reading frame (ORF) encoding a sequence of 96 amino acid residues was identified from an ATG start codon at position 91-93 extending to a TAG stop codon at position 379-381. The deduced amino acid sequence is shown in SEQ ID NO: 2 (and in FIG. 3 where conventional one-letter amino acid codes are used).
- CFP7 appear to be expressed inE. coli as an unfused version. The nucleotide sequence at position 78-84 is expected to be the Shine Delgarno sequence and the sequences from position 47-50 and 14-19 are expected to be the −10 and -35 regions, respectively:
- CFP9
- The protein recognised by ST-3 was produced as a β-galactosidase fusion protein, when expressed from the AA226 lambda phage. The fusion protein had an approx. size of 116-117 kDa (Mw for β-galactosidase 116.25 kDa) which may suggest that only part of the CFP9 gene was included in the lambda clone (AA226).
-
- Based on the 90 bp nucleotide sequence obtained on the insert from lambda phage AA226, a search of homology to the nucleotide sequence of theM. tuberculosis genome was performed in the Sanger database (Sanger Mycobacterium tuberculosis database):
- http://www.sanger.ac.uk/pathogens/TB-blast-server.html;
- Williams, 1996). 100% identity to the cloned sequence was found on the MTCY48 cosmid. An open reading frame (ORF) encoding a sequence of 109 amino acid residues was identified from a GTG start codon at position 141-143 extending to a TGA stop codon at position 465-467. The deduced amino acid sequence is shown in FIG. 4 using conventional one letter code.
- The nucleotide sequence at position 123-130 is expected to be the Shine Delgarno sequence and the sequences from position 73-78 and 4-9 are expected to be the −10 and -35 region respectively (FIG. 4). The ORF overlapping with the 5′-end of the sequence of AA229 is shown in FIG. 4 by double underlining.
- Subcloning CFP7 and CFP9 in Expression Vectors
- The two ORFs encoding CFP7 and CFP9 were PCR cloned into the pMST24 (Theisen et al., 1995) expression vector pRVN01 or the pQE-32 (QIAGEN) expression vector pRVNO2, respectively.
- The PCR amplification was carried out in a thermal reactor (Rapid cycler, Idaho Technology, Idaho) by mixing 10 ng plasmid DNA with the mastermix (0.5 μM of each oligonucleotide primer, 0.25 μM BSA (Stratagene), low salt buffer (20 mM Tris-HCl, pH 8.8, 10 mM KCl, 10 mM (NH4)2SO4, 2 mM MgSO4 and 0,1% Triton X-100) (Stratagene), 0.25 nM of each deoxynucleoside triphosphate and 0.5 U Taq Plus Long DNA polymerase (Stratagene)). Final volume was 10 μl (all concentrations given are concentrations in the final volume). Predenaturation was carried out at 94° C. for 30 s. 30 cycles of the following was performed; Denaturation at 94° C. for 30 s, annealing at 55° C. for 30 s and elongation at 72° C. for 1 min.
- The oligonucleotide primers were synthesised automatically on a DNA synthesizer (Applied Biosystems, Forster City, Calif., ABI-391, PCR-mode), deblocked, and purified by ethanol precipitation.
- The cfp7 oligonucleotides (TABLE 1) were synthesised on the basis of the nucleotide sequence from the CFP7 sequence (FIG. 3). The oligonucleotides were engineered to include an SmaI restriction enzyme site at the 5′ end and a BamHI restriction enzyme site at the 3′ end for directed subcloning. The cfp9 oligonucleotides (TABLE 1
- ) were synthesized partly on the basis of the nucleotide sequence from the sequence of the AA229 clone and partly from the identical sequence found in the Sanger database cosmid MTCY48 (FIG. 4). The oligonucleotides were engineered to include a SmaI restriction enzyme site at the 5′ end and a HindIII restriction enzyme site at the 3′ end for directed subcloning.
- CFP7
- By the use of PCR a SmaI site was engineered immediately 5′ of the first codon of the ORF of 291 bp, encoding the cfp7 gene, so that only the coding region would be expressed, and a BamHI site was incorporated right after the stop codon at the 3′ end. The 291 bp PCR fragment was cleaved by SmaI and BamHI, purified from an agarose gel and subcloned into the SmaI-BamHI sites of the pMST24 expression vector. Vector DNA containing the gene fusion was used to transform theE. coli XL1-Blue (pRVNO1).
- CFP9
- By the use of PCR a SmaI site was engineered immediately 5′ of the first codon of an ORF of 327 bp, encoding the cfp9 gene, so that only the coding region would be expressed, and a HindIII site was incorporated after the stop codon at the 3′ end. The 327 bp PCR fragment was cleaved by SmaI and HindIII, purified from an agarose gel, and subcloned into the SmaI-HindIII sites of the pQE-32 (QIAGEN) expression vector. Vector DNA containing the gene fusion was used to transform theE. coli XL1-Blue (pRVNO2).
- Purification of Recombinant CFP7 and CFP9
- The ORFs were fused N-terminally to the (His)6-tag (cf. EP-A-0 282 242). Recombinant antigen was prepared as follows: Briefly, a single colony of E. coli harbouring either the pRVN01 or the pRVNO2 plasmid, was inoculated into LuriaBertani broth containing 100 μg/ml ampicillin and 12.5 μg/ml tetracycline and grown at 37° C. to OD600 nm=0-5. IPTG (isopropyl-β-D-thiogalactoside) was then added to a final concentration of 2 mM (expression was regulated either by the strong IPTG inducible Ptac or the T5 promoter) and growth was continued for further 2 hours. The cells were harvested by centrifugation at 4,200× g at 4° C. for 8 min. The pelleted bacteria were stored overnight at −20° C. The pellet was resuspended in BC 40/100 buffer (20 mM Tris-HCl pH 7.9, 20% glycerol, 100 mM KCl, 40 mM Imidazole) and cells were broken by sonication (5 times for 30 s with intervals of 30 s) at 4° C. followed by centrifugation at 12,000× g for 30 min at 4° C., the supernatant (crude extract) was used for purification of the recombinant antigens.
- The two Histidine fusion proteins (His-rCFP7 and His-rCFP9) were purified from the crude extract by affinity chromatography on a Ni2+-NTA column from QIAGEN with a volume of 100 ml. His-rCFP7 and His-rCFP9 binds to Ni2+. After extensive washes of the column in BC 40/100 buffer, the fusion protein was eluted with a BC 1000/100 buffer containing 100 mM imidazole, 20 mM Tris pH 7.9, 20% glycerol and 1 M KCl. subsequently, the purified products were dialysed extensively against 10 mM Tris pH 8.0. His-rCFP7 and His-rCFP9 were then separated from contaminants by fast protein liquid chromatography (FPLC) over an anion-exchange column (Mono Q, Pharmacia, Sweden). in 10 mM Tris pH 8.0 with a linear gradient of NaCl from 0 to 1 M. Aliquots of the fractions were analyzed by 10%-20% gradient sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE). Fractions containing purified either purified His-rCFP7 or His-rCFP9 were pooled.
TABLE 1 Sequence of the cfp7 and cfp9 oligonucleotidesa. Orientation and Positionb oligonucleotide Sequences (5′ → 3′) (nucleotide) Sense pvR3 GCAACACCCGGGATGTCGCAAATCATG 91-105 (SEQ ID NO: 43) (SEQ ID NO: 1) stR2 GTAACACCCGGGGTGGCCGCCGACCCG 141-155 (SEQ ID NO: 44) (SEQ ID NO: 3) Antisense pvF4 CTACTAAGCTTGGATCCCTAGCCGCCCCATTTGGCGG 381-362 (SEQ ID NO: 45) (SEQ ID NO: 1) stF2 CTACTAAGCTTCCATGGTCAGGTCTTTTCGATGCTTAC 467-447 (SEQ ID NO: 46) (SEQ ID NO: 3) - Identification of Antigens which are not Expressed in BCG Strains.
- In an effort to control the treat of TB, attenuated bacillus Calmette-Guérin (BCG) has been used as a live attenuated vaccine. BCG is an attenuated derivative of a virulent Mycobacterium bovis. The original BCG from the Pasteur Institute in Paris, France was developed from 1908 to 1921 by 231 passages in liquid culture and has never been shown to revert to virulence in animals, indicating that the attenuating mutation(s) in BCG are stable deletions and/or multiple mutations which do not readily revert. While physiological differences between BCG andM. tuberculosis and M. bovis has been noted, the attenuating mutations which arose during serial passage of the original BCG strain has been unknown until recently. The first mutations described are the loss of the gene encoding MPB64 in some BCG strains (Li et al., 1993, Oettinger and Andersen, 1994) and the gene encoding ESAT-6 in all BCG strain tested (Harboe et al., 1996), later 3 large deletions in BCG have been identified (Mahairas et al., 1996). The region named RD1 includes the gene encoding ESAT-6 and an other (RD2) the gene encoding MPT64. Both antigens have been shown to have diagnostic potential and ESAT-6 has been shown to have properties as a vaccine candidate (cf. PCT/DK94/00273 and PCT/DK/00270). In order to find new M. tuberculosis specific diagnostic antigens as well as antigens for a new vaccine against TB, the RD1 region (17.499 bp) of M. tuberculosis H37Rv has been analyzed for Open Reading Frames (ORF). ORFs with a minimum length of 96 bp have been predicted using the algorithm described by Borodovsky and McIninch (1993), in total 27 ORFs have been predicted, 20 of these have possible diagnostic and/or vaccine potential, as they are deleted from all known BCG strains. The predicted ORFs include ESAT-6 (RD1-ORF7) and CFP10 (RD1-ORF6) described previously (Sorensen et al., 1995), as a positive control for the ability of the algorithm. In the present is described the potential of 7 of the predicted antigens for diagnosis of TB as well as potential as candidates for a new vaccine against TB.
- Seven open reading frames (ORF) from the 17,499 kb RD1 region (Accession no. U34848) with possible diagnostic and vaccine potential have been identified and cloned.
- Identification of the ORF's rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b.
- The nucleotide sequence of rd1-orf2 fromM. tuberculosis H37Rv is set forth in SEQ ID NO: 71. The deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 72.
- The nucleotide sequence of rd1-orf3 fromM. tuberculosis H37Rv is set forth in SEQ ID NO: 87. The deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 88.
- The nucleotide sequence of rd1-orf4 fromM. tuberculosis H37Rv is set forth in SEQ ID NO: 89. The deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 90.
- The nucleotide sequence of rd1-orf5 fromM. tuberculosis H37Rv is set forth in SEQ ID NO: 91. The deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 92.
- The nucleotide sequence of rd1-orf8 fromM. tuberculosis H37Rv is set forth in SEQ ID NO: 67. The deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 68.
- The nucleotide sequence of rd1-orf9a fromM. tuberculosis H37Rv is set forth in SEQ ID NO: 93. The deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 94.
- The nucleotide sequence of rd1-orf9b fromM. tuberculosis H37Rv is set forth in SEQ ID NO: 69. The deduced amino acid sequence of RD1-ORF2 is set forth in SEQ ID NO: 70.
- The DNA sequence rd1-orf2 (SEQ ID NO: 71) contained an open reading frame starting with an ATG codon at position 889-891 and ending with a termination codon (TAA) at position 2662-2664 (position numbers referring to the location in RD1). The deduced amino acid sequence (SEQ ID NO: 72) contains 591 residues corresponding to a molecular weight of 64,525.
- The DNA sequence rd1-orf3 (SEQ ID NO: 87) contained an open reading frame starting with an ATG codon at position 2807-2809 and ending with a termination codon (TAA) at position 3101-3103 (position numbers referring to the location in RD1). The deduced amino acid sequence (SEQ ID NO: 88) contains 98 residues corresponding to a molecular weight of 9,799.
- The DNA sequence rd1-orf4 (SEQ ID NO: 89) contained an open reading frame starting with a GTG codon at position 4014-4012 and ending with a termination codon (TAG) at position 3597-3595 (position numbers referring to the location in RD1). The deduced amino acid sequence (SEQ ID NO: 90) contains 139 residues corresponding to a molecular weight of 14,210.
- The DNA sequence rd1-orf5 (SEQ ID NO: 91) contained an open reading frame starting with a GTG codon at position 3128-3130 and ending with a termination codon (TGA) at position 4241-4243 (position numbers referring to the location in RD1). The deduced amino acid sequence (SEQ ID NO: 92) contains 371 residues corresponding to a molecular weight of 37,647.
- The DNA sequence rd1-orf8 (SEQ ID NO: 67) contained an open reading frame starting with a GTG codon at position 5502-5500 and ending with a termination codon (TAG) at position 5084-5082 (position numbers referring to the location in RD1), and the deduced amino acid sequence (SEQ ID NO: 68) contains 139 residues with a molecular weight of 11,737.
- The DNA sequence rd1-orf9a (SEQ ID NO: 93) contained an open reading frame starting with a GTG codon at position 6146-6148 and ending with a termination codon (TAA) at position 7070-7072 (position numbers referring to the location in RD1). The deduced amino acid sequence (SEQ ID NO: 94) contains 308 residues corresponding to a molecular weight of 33,453.
- The DNA sequence rd1-orf9b (SEQ ID NO: 69) contained an open reading frame starting with an ATG codon at position 5072-5074 and ending with a termination codon (TAA) at position 7070-7072 (position numbers referring to the location in RD1). The deduced amino acid sequence (SEQ ID NO: 70) contains 666 residues corresponding to a molecular weight of 70,650.
- Cloning of the ORF's rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b.
- The ORF's rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b were PCR cloned in the pMST24 (Theisen et al., 1995) (rd1-orf3) or the pQE32 (QIAGEN) (rd1-orf2, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b) expression vector. Preparation of oligonucleotides and PCR amplification of the rd1-orf encoding genes, was carried out as described in example 2. Chromosomal DNA fromM. tuberculosis H37Rv was used as template in the PCR reactions. Oligonucleotides were synthesized on the basis of the nucleotide sequence from the RD1 region (Accession no. U34848). The oligonucleotide primers were engineered to include an restriction enzyme site at the 5′ end and at the 3′ end by which a later subcloning was possible. Primers are listed in TABLE 2.
- rd1-orf2. A BamHI site was engineered immediately 5′ of the first codon of rd1-orf2, and a HindIII site was incorporated right after the stop codon at the 3′ end. The gene rd1-orf2 was subcloned in pQE32, giving pTO96.
- rd1-orf3. A SmaI site was engineered immediately 5′ of the first codon of rd1-orf3, and a NcoI site was incorporated right after the stop codon at the 3′ end. The gene rd1-orf3 was subcloned in pMST24, giving pTO87.
- rd1-orf4. A BamHI site was engineered immediately 5′ of the first codon of rd1-orf4, and a HindIII site was incorporated right after the stop codon at the 3′ end. The gene rd1-orf4 was subcloned in pQE32, giving pTO89.
- rd1-orf5. A BamHI site was engineered immediately 5′ of the first codon of rd1-orf5, and a HindIII site was incorporated right after the stop codon at the 3′ end. The gene rd1-orf5 was subcloned in pQE32, giving pTO88.
- rd1-orf8. A BamHI site was engineered immediately 5′ of the first codon of rd1-orf8, and a NcoI site was incorporated right after the stop codon at the 3′ end. The gene rd1-orf8 was subcloned in pMST24, giving pTO98.
- rd1-orf9a. A BamHI site was engineered immediately 5′ of the first codon of rd1-orf9a, and a HindIII site was incorporated right after the stop codon at the 3′ end. The gene rd1-orf9a was subcloned in pQE32, giving pTO91.
- rd1-orf9b. A ScaI site was engineered immediately 5′ of the first codon of rd1-orf9b, and a Hind III site was incorporated right after the stop codon at the 3′ end. The gene rdlorf9b was subcloned in pQE32, giving pTO90.
- The PCR fragments were digested with the suitable restriction enzymes, purified from an agarose gel and cloned into either pMST24 or pQE-32. The seven constructs were used to transform theE. coli XL1-Blue. Endpoints of the gene fusions were determined by the dideoxy chain termination method. Both strands of the DNA were sequenced.
- Purification of recombinant RD1-ORF2, RD1-ORF3, RD1-ORF4, RD1-ORF5, RD1-ORF8, RD1-ORF9a and RD1-ORF9b.
- The rRD1-ORFs were fused N-terminally to the (His)6-tag. Recombinant antigen was prepared as described in example 2 (with the exception that pTO91 was expressed at 300C and not at 37° C.), using a single colony of E. coli harbouring either the pTO87, pTO88, pTO89, pTO90, pTO91, pTO96 or pTO98 for inoculation. Purification of recombinant antigen by Ni2+ affinity chromatography was also carried out as described in example 2. Fractions containing purified His-rRD1-ORF2, His-rRD1-ORF3 His-rRD1-ORF4, His-rRD1-ORF5, His-rRD1-ORF8, His-rRD1-ORF9a or His-rRD1-ORF9b were pooled. The His-rRD1-ORF's were extensively dialysed against 10 mM Tris/HCl, pH 8.5, 3 M urea followed by an additional purification step performed on an anion exchange column (Mono Q) using fast protein liquid chromatography (FPLC) (Pharmacia, Uppsala, Sweden). The purification was carried out in 10 mM Tris/HCl, pH 8.5, 3 M urea and protein was eluted by a linear gradient of NaCl from 0 to 1 M. Fractions containing the His-rRD1-ORF's were pooled and subsequently dialysed extensively against 25 mM Hepes, pH 8.0 before use.
TABLE 2 Sequence of the rd1-orf's oligonucleotidesa. Orientation and oligonucleotide Sequences (5′ → 3′) Position (nt) Sense RD1-ORF2f CTGGGGATCCGCATGACTGCTGAACCG 886-903 RD1-ORF3f CTTCCCGGGATGGAAAAAATGTCAC 2807-2822 RD1-ORF4f GTAGGATCCTAGGAGACATCAGCGGC 4028-4015 RD1-ORF5f CTGGGGATCCGCGTGATCACCATGCTGTGG 3028-3045 RD1-ORF8f CTCGGATCCTGTGGGTGCAGGTCCGGCGATGGGC 5502-5479 RD1-ORF9af GTGATGTGAGCTCAGGTGAAGAAGGTGAAG 6144-6160 RD1-ORF9bf GTGATGTGAGCTCCTATGGCGGCCGACTACGAC 5072-5089 Antisense RD1-ORF2r TGCAAGCTTTTAACCGGCGCTTGGGGGTGC 2664-2644 RD1-ORF3r GATGCCATGGTTAGGCGAAGACGCCGGC 3103-3086 RD1-ORF4r CGATCTAAGCTTGGCAATGGAGGTCTA 3582-3597 RD1-ORF5r TGCAAGCTTTCACCAGTCGTCCTCTTCGTC 4243-4223 RD1-ORF8r CTCCCATGGCTACGACAAGCTCTTCCGGCCGC 5083-5105 RD1-ORF9a/br CGATCTAAGCTTTCAACGACGTCCAGCC 7073-7056 - The nucleotide sequences of rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b fromM. tuberculosis H37Rv are set forth in SEQ ID NO: 71, 87, 89, 91, 67, 93, and 69, respectively. The deduced amino acid sequences of rd1-orf2, rd1-orf3, rd1-orf4 rd1-orf5, rd1-orf8, rd1-orf9a, and rd1-orf9b are set forth in SEQ ID NO: 72, 88, 90, 92, 68, 94, and 70, respectively.
- Cloning of the Genes Expressing 17-30 kDa Antigens from ST-CF
- Isolation of CFP17, CFP20, CFP21, CFP22, CFP25, and CFP28
- ST-CF was precipitated with ammonium sulphate at 80% saturation. The precipitated proteins were removed by centrifugation and after resuspension washed with 8 M urea. CHAPS and glycerol were added to a final concentration of 0.5% (w/v) and 5% (v/v) respectively and the protein solution was applied to a Rotofor isoelectrical Cell (BioRad). The Rotofor Cell had been equilibrated with an 8 M urea buffer containing 0.5% (w/v) CHAPS, 5% (v/v) glycerol, 3% (v/v) Biolyt 3/5 and 1% (v/v) Biolyt 4/6 (BioRad). Isoelectric focusing was performed in a pH gradient from 3-6. The fractions were analyzed on silver-stained 10-20% SDS-PAGE. Fractions with similar band patterns were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1-3 ml. An equal volume of SDS containing sample buffer was added and the protein solution boiled for 5 min before further separation on a Prep Cell (BioRad) in a matrix of 16% polyacrylamide under an electrical gradient. Fractions containing pure proteins with an molecular mass from 17-30 kDa were collected.
- Isolation of CFP29
- Anti-CFP29, reacting with CFP29 was generated by immunization of BALB/c mice with crushed gel pieces in RIBI adjuvant (first and second immunization) or aluminium hydroxide (third immunization and boosting) with two week intervals. SDS-PAGE gel pieces containing 2-5 μg of CFP29 were used for each immunization. Mice were boosted with antigen 3 days before removal of the spleen. Generation of a monoclonal cell line producing antibodies against CFP29 was obtained essentially as described by Köhler and Milstein (1975). Screening of supernatants from growing clones was carried out by immunoblotting of nitrocellulose strips containing ST-CF separated by SDS-PAGE. Each strip contained approximately 50 μg of ST-CF. The antibody class of anti-CFP29 was identified as IgM by the mouse monoclonal antibody isotyping kit, RPN29 (Amersham) according to the manufacturer's instructions.
- CFP29 was purified by the following method: ST-CF was concentrated 10 fold by ultrafiltration, and ammonium sulphate precipitation in the 45 to 55% saturation range was performed. The pellet was redissolved in 50 mM sodium phosphate, 1.5 M ammonium sulphate, pH 8.5, and subjected to thiophilic adsorption chromatography (Porath et al., 1985) on an Affi-T gel column (Kem-En-Tec). Protein was eluted by a linear 1.5 to 0 M gradient of ammonium sulphate and fractions collected in the range 0.44 to 0.31 M ammonium sulphate were identified as CFP29 containing fractions in Western blot experiments with mAb Anti-CFP29. These fractions were pooled and anion exchange chromatography was performed on a
Mono Q HR 5/5 column connected to an FPLC system (Pharmacia). The column was equilibrated with 10 mM Tris-HCl, pH 8.5 and the elution was performed with a linear gradient from 0 to 500 mM NaCl. From 400 to 500 mM sodium chloride, rather pure CFP29 was eluted. As a final purification step the Mono Q fractions containing CFP29 were loaded on a 12.5% SDS-PAGE gel and pure CFP29 was obtained by the multi-elution technique (Andersen and Heron, 1993). - N-Terminal Sequencing and Amino Acid Analysis
- CFP17, CFP20, CFP21, CFP22, CFP25, and CFP28 were washed with water on a Centricon concentrator (Amicon) with cutoff at 10 kDa and then applied to a ProSpin concentrator (Applied Biosystems) where the proteins were collected on a PVDF membrane. The membrane was washed 5 times with 20% methanol before sequencing on a Procise sequencer (Applied Biosystems).
- CFP29 containing fractions were blotted to PVDF membrane after tricine SDS-PAGE (Ploug et al., 1989). The relevant bands were excised and subjected to amino acid analysis (Barkholt and Jensen, 1989) and N-terminal sequence analysis on a Procise sequencer (Applied Biosystems).
- The following N-terminal sequences were obtained:
For CFP17: A/S E L D A P A Q A G T E X A V (SEQ ID NO: 17) For CFP20: A Q I T L R G N A I N T V G E (SEQ ID NO: 18) For CFP21: D P X S D I A V V F A R G T H (SEQ ID NO: 19) For CFP22: T N S P L A T A T A T L H T N (SEQ ID NO: 20) For CFP25: A X P D A E V V F A R G R F E (SEQ ID NO: 21) For CFP28: X I/V Q K S L E L I V/T V/F T A D/Q E (SEQ ID NO: 22) For CFP29: M N N L Y R D L A P V T E A A W A E I (SEQ ID NO: 23) - “X” denotes an amino acid which could not be determined by the sequencing method used, whereas a “/” between two amino acids denotes that the sequencing method could not determine which of the two amino acids is the one actually present.
- Cloning the Gene Encoding CFP29
- The N-terminal sequence of CFP29 was used for a homology search in the EMBL database using the TFASTA program of the Genetics Computer Group sequence analysis software package. The search identified a protein, Linocin M18, from Brevibacterium linens that shares 74% identity with the 19 N-terminal amino acids of CFP29.
- Based on this identity between the N-terminal sequence of CFP29 and the sequence of the Linocin M18 protein from Brevibacterium linens, a set of degenerated primers were constructed for PCR cloning of theM. tuberculosis gene encoding CFP29. PCR reactions were containing 10 ng of M. tuberculosis chromosomal DNA in 1× low salt Taq+ buffer from Stratagene supplemented with 250 μM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% MSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+ DNA polymerase (Stratagene) in 10 μl reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 15 sec., 55° C. for 15 sec. and 72° C. for 90 sec, using thermocycler equipment from Idaho Technology.
- An approx. 300 bp fragment was obtained using primers with the sequences:
1: 5′-CCCGGCTCGAGAACCTSTACCGCGACCTSGCSCC (SEQ ID NO: 24) 2: 5′-GGGCCGGATCCGASGCSGCGTCCTTSACSGGYTGCCA (SEQ ID NO: 25) - where S=G/C and Y=T/C
- The fragment was excised from a 1% agarose gel, purified by Spin-X spinn columns (Costar), cloned into pBluescript SK II+-T vector (Stratagene) and finally sequenced with the Sequenase kit from United States Biochemical.
- The first 150 bp of this sequence was used for a homology search using the Blast program of the SangerMycobacterium tuberculosis database:
- (http//www.sanger.ac.uk/projects/M-tuberculosis/blast_server).
- This programidentified aMycobacterium tuberculosis sequence on cosmid cy444 in the database that is nearly 100% identical to the 150 bp sequence of the CFP29 protein. The sequence is contained within a 795 bp open reading frame of which the 5′ end translates into a sequence that is 100% identical to the N-terminally sequenced 19 amino acids of the purified CFP29 protein.
- Finally, the 795 bp open reading frame was PCR cloned under the same PCR conditions as described above using the primers:
3: 5′-GGAAGCCCCATATGAACAATCTCTACCG (SEQ ID NO: 26) 4: 5′-CGCGCTCAGCCCTTAGTGACTGAGCGCGACCG (SEQ ID NO: 27) - The resulting DNA fragments were purified from agarose gels as described above sequenced with primer 3 and 4 in addition to the following primers:
5: 5′-GGACGTTCAAGCGACACATCGCCG-3′ (SEQ ID NO: 115) 6: 5′-CAGCACGAACGCGCCGTCGATGGC-3′ (SEQ ID NO: 116) - Three independent cloned were sequenced. All three clones were in 100% agreement with the sequence on cosmid cy444.
- All other DNA manipulations were done according to Maniatis et al. (1989).
- All enzymes other than Taq polymerase were from New England Biolabs.
- Homology Searches in the Sanger Database
- For CFP17, CFP20, CFP21, CFP22, CFP25, and CFP28 the N-terminal amino acid sequence from each of the proteins were used for a homology search using the blast program of the SangerMycobacterium tuberculosis database:
- http://www.sanger.ac.uk/pathogens/TB-blast-server.html.
- For CFP29 the first 150 bp of the DNA sequence was used for the search. Furthermore, the EMBL database was searched for proteins with homology to CFP29.
- Thereby, the following information were obtained:
- CFP17
- Of the 14 determined amino acids in CFP17 a 93% identical sequence was found with MTCY1A11.16c. The difference between the two sequences is in the first amino acid: It is an A or an S in the N-terminal determined sequenced and a S in MTCY1A11. From the N-terminal sequencing it was not possible to determine amino acid number 13.
- Within the open reading frame the translated protein is 162 amino acids long. The N-terminal of the protein purified from culture filtrate starts at amino acid 31 in agreement with the presence of a signal sequence that has been cleaved off. This gives a length of the mature protein of 132 amino acids, which corresponds to a theoretical molecular mass of 13833 Da and a theoretical pI of 4.4. The observed mass in SDS-PAGE is 17 kDa.
- CFP20
- A
sequence 100% identical to the 15 determined amino acids of CFP20 was found on the translated cosmid cscy09F9. A stop codon is found at amino acid 166 from the amino acid M atposition 1. This gives a predicted length of 165 amino acids, which corresponds to a theoretical molecular mass of 16897 Da and a pI of 4.2. The observed molecular weight in a SDS-PAGE is 20 kDa. - Searching the GenEMBL database using the TFASTA algorithm (Pearson and Lipman, 1988) revealed a number of proteins with homology to the predicted 164 amino acids long translated protein.
- The highest homology, 51.5% identity in a 163 amino acid overlap, was found to a Haemophilus influenza Rd toxr reg.
- (HIHI0751)
- CFP21
- A
sequence 100% identical to the 14 determined amino acids of CFP21 was found at MTCY39. From the N-terminal sequencing it was not possible to determine amino acid number 3; this amino acid is a C in MTCY39. The amino acid C can not be detected on a Sequencer which is probably the explanation of this difference. - Within the open reading frame the translated protein is 217 amino acids long. The N-terminally determined sequence from the protein purified from culture filtrate starts at amino acid 33 in agreement with the presence of a signal sequence that has been cleaved off. This gives a length of the mature protein of 185 amino acids, which corresponds to a theoretical molecular weigh at 18657 Da, and a theoretical pI at 4,6. The observed weight in a SDS-PAGE is 21 kDa.
- In a 193 amino acids overlap the protein has 32,6% identity to a cutinase precursor with a length of 209 amino acids (CUTI_ALTBR P41744).
- A comparison of the 14 N-terminal determined amino acids with the translated region (RD2) deleted inM. bovis BCG revealed a 100% identical sequence (mb3484) (Mahairas et al. (1996)).
- CFP22
- A
sequence 100% identical to the 15 determined amino acids of CFP22 was found at MTCY10H4. Within the open reading frame the translated protein is 182 amino acids long. The N-terminal sequence of the protein purified from culture filtrate starts at amino acid 8 and therefore the length of the protein occurring in M. tuberculosis culture filtrate is 175 amino acids. This gives a theoretical molecular weigh at 18517 Da and a pI at 6.8. The observed weight in a SDS-PAGE is 22 kDa. - In an 182 amino acids overlap the translated protein has 90,1% identity with E235739; a peptidyl-prolyl cis-trans isomerase.
- CFP25
- A sequence 93% identical to the 15 determined amino acids was found on the cosmid MTCY339.08c. The one amino acid that differs between the two sequences is a C in MTCY339.08c and a X from the N-terminal sequence data. On a Sequencer a C can not be detected which is a probable explanation for this difference.
- The N-terminally determined sequence from the protein purified from culture filtrate begins at amino acid 33 in agreement with the presence of a signal sequence that has been cleaved off. This gives a length of the mature protein of 187 amino acids, which corresponds to a theoretical molecular weigh at 19665 Da, and a theoretical pI at 4.9. The observed weight in a SDS-PAGE is 25 kDa.
- In a 217 amino acids overlap the protein has 42.9% identity to CFP21 (MTCY39.35).
- CFP28
- No homology was found when using the 10 determined amino acid residues 2-8, 11, 12, and 14 of SEQ ID NO: 22 in the database search.
- CFP29
- Sanger database searching: A sequence nearly 100% identical to the 150 bp sequence of the CFP29 protein was found on cosmid cy444. The sequence is contained within a 795 bp open reading frame of which the 5′ end translates into a sequence that is 100% identical to the N-terminally sequenced 19 amino acids of the purified CFP29 protein. The open reading frame encodes a 265 amino acid protein.
- The amino acid analysis performed on the purified protein further confirmed the identity of CFP29 with the protein encoded in open reading frame on cosmid 444.
- EMBL database searching: The open reading frame encodes a 265 amino acid protein that is 58% identical and 74% similar to the Linocin M18 protein (61% identity on DNA level). This is a 28.6 kDa protein with bacteriocin activity (Valdes-Stauber and Scherer, 1994; Valdés-Stauber and Scherer, 1996). The two proteins have the same length (except for 1 amino acid) and share the same theoretical physicochemical properties. We therefore suggest that CFP29 is a mycobacterial homolog to the Brevibacterium linens Linocin M18 protein.
- The amino acid sequences of the purified antigens as picked from the Sanger database are shown in the following list. The amino acids determined by N-terminal sequencing are marked with bold.
CFP17 (SEQ ID NO: 6): 1 MTDMNPDIEK DQTSDEVTVE TTSVFRADFL SELDAPAQAG TESAVSGVEG 51 LPPGSALLVV KRGPNAGSRF LLDQAITSAG RHPDSDIFLD DVTVSRRHAE 101 FRLENNEFNV VDVGSLNGTY VNREPVDSAV LANGDEVQIG KFRLVFLTGP 151 KQGEDDGSTG GP CFP20 (SEQ ID NO: 8): 1 MAQITLRGNA INTVGELPAV GSPAPAFTLT GGDLGVISSD QFRGKSVLLN 51 IFPSVDTPVC ATSVRTFDER AAASGATVLC VSKDLPFAQK RFCGAEGTEN 101 VMPASAFRDS FGEDYGVTIA DGPMAGLLAR AIVVIGADGN VAYTELVPEI 151 AQEPNYEAAL AALGA CFP21 (SEQ ID NO: 10): 1 MTPRSLVRIV GVVVATTLAL VSAPAGGRAA HADPCSDIAV 41 VFARGTHQAS GLGDVGEAFV DSLTSQVGGR SIGVYAVNYP ASDDYRASAS 91 NGSDDASAHI QRTVASCPNT RIVLGGYSQG ATVIDLSTSA MPPAVADHVA 141 AVALFGEPSS GFSSMLWGGG SLPTIGPLYS SKTINLCAPD DPICTGGGNI 191 MAHVSYVQSG MTSQAATFAA NRLDHAG CFP22 (SEQ ID NO: 12): 1 MADCDSVTNS PLATATATLH TNRGDIKIAL FGNHAPKTVA NFVGLAQGTK 51 DYSTQNASGG PSGPFYDGAV FHRVIQGFMI QGGDPTGTGR GGPGYKFADE 101 FHPELQFDKP YLLAMANAGP GTNGSQFFIT VGKTPHLNRR HTIFGEVIDA 151 ESQRVVEAIS KTATDGNDRP TDPVVIESIT IS CFP25 (SEQ ID NO: 14): 1 MGAAAAMLAA VLLLTPITVP AGYPGAVAPA TAACPDAEVV FARGRFEPPG 51 IGTVGNAFVS ALRSKVNKNV GVYAVKYPAD NQIDVGANDM SAHIQSMANS 101 CPNTRLVPGG YSLGAAVTDV VLAVPTQMWG FTNPLPPGSD EHIAAVALFG 151 NGSQWVGPIT NFSPAYNDRT IELCHGDDPV CHPADPNTWE ANWPQHLAGA 201 YVSSGMVNQA ADFVAGKLQ CFP29 (SEQ ID NO: 16) 1 MNNLYRDLAP VTEAAWAEIE LEAARTFKRH IAGRRVVDVS DPGGPVTAAV 51 STGRLIDVKA PTNGVIAHLR ASKPLVRLRV PFTLSRNEID DVERGSKDSD 101 WEPVKEAAKK LAFVEDRTIF EGYSAASIEG IRSASSNPAL TLPEDPREIP 151 DVISQALSEL RLAGVDGPYS VLLSADVYTK VSETSDHGYP IREHLNRLVD 201 GDIIWAPAID GAFVLTTRGG DFDLQLGTDV AIGYASHDTD TVRLYLQETL 251 TFLCYTAEAS VALSH - For all six proteins the molecular weights predicted from the sequences are in agreement with the molecular weights observed on SDS-PAGE.
- Cloning of the Genes Encoding CFP17. CFP20, CFP21. CFP22 and CFP25.
- The genes encoding CFP17, CFP20, CFP21, CFP22 and CFP25 were all cloned into the expression vector pMCT6, by PCR amplification with gene specific primers, for recombinant expression inE. coli of the proteins.
- PCR reactions contained 10 ng ofM. tuberculosis chromosomal DNA in 1× low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 μl reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles according to the following program; 94° C. for 10 sec., 55° C. for 10 sec. and 72° C. for 90 sec, using thermocycler equipment from Idaho Technology.
- The DNA fragments were subsequently run on lt agarose gels, the bands were excised and purified by Spin-X spin columns (Costar) and cloned into pBluescript SK II+-T vector (Stratagene). Plasmid DNA was thereafter prepared from clones harbouring the desired fragments, digested with suitable restriction enzymes and subcloned into the expression vector pMCT6 in frame with 8 histidine residues which are added to the N-terminal of the expressed proteins. The resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- For cloning of the individual antigens, the following gene specific primers were used:
- CFP17: Primers used for cloning of cfp17:
OPBR-51: ACAGATCTGTGACGGACATGAACCCG (SEQ ID NO: 117) OPBR-52: TTTTCCATGGTCACGGGCCCCCGGTACT (SEQ ID NO: 118) - OPBR-51 and OPBR-52 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP20: Primers used for cloning of cfp20:
OPBR-53: ACAGATCTGTGCCCATGGCACAGATA (SEQ ID NO: 119) OPBR-54: TTTAAGCTTCTAGGCGCCCAGCGCGGC (SEQ ID NO: 120) - OPBR-53 and OPBR-54 create BglII and HinDIII sites, respectively, used for the cloning in pMCT6.
- CFP21: Primers used for cloning of cfp21:
OPBR-55: ACAGATCTGCGCATGCGGATCCGTGT (SEQ ID NO: 121) OPBR-56: TTTTCCATGGTCATCCGGCGTGATCGAG (SEQ ID NO: 122) - OPBR-55 and OPBR-56 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP22: Primers used for cloning of cfp22:
OPBR-57: ACAGATCTGTAATGGCAGACTGTGAT (SEQ ID NO: 123) OPBR-58: TTTTCCATGGTCAGGAGATGGTGATCGA (SEQ ID NO: 124) - OPBR-57 and OPBR-58 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP25: Primers used for cloning of cfp25:
OPBR-59: ACAGATCTGCCGGCTACCCCGGTGCC (SEQ ID NO: 125) OPBR-60: TTTTCCATGGCTATTGCAGCTTTCCGGC (SEQ ID NO: 126) - OPBR-59 and OPBR-60 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- Expression/Purification of Recombinant CFP17. CFP20. CFP21. CFP22 and CFP25 Proteins.
- Expression and metal affinity purification of recombinant proteins was undertaken essentially as described by the manufacturers. For each protein, 1 l LB-media containing 100 μg/ml ampicillin, was inoculated with 10 ml of an overnight culture of XL1-Blue cells harbouring recombinant pMCT6 plasmids. Cultures were shaken at 37° C. until they reached a density of OD600=0.4-0.6. IPTG was hereafter added to a final concentration of 1 mM and the cultures were further incubated 4-16 hours. Cells were harvested, resuspended in 1× sonication buffer+8 M urea and sonicated 5×30 sec. with 30 sec. pausing between the pulses.
- After centrifugation, the lysate was applied to a column containing 25 ml of resuspended Talon resin (Clontech, Palo Alto, USA). The column was washed and eluted as described by the manufacturers. After elution, all fractions (1.5 ml each) were subjected to analysis by SDS-PAGE using the Mighty Small (Hoefer Scientific Instruments, USA) system and the protein concentrations were estimated at 280 nm. Fractions containing recombinant protein were pooled and dialysed against 3 M urea in 10 mM Tris-HCl, pH 8.5. The dialysed protein was further purified by FPLC (Pharmacia, Sweden) using a 6 ml Resource-Q column, eluted with a linear 0-1 M gradient of NaCl. Fractions were analyzed by SDS-PAGE and protein concentrations were estimated at OD280. Fractions containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5.
- Finally the protein concentration and the LPS content were determined by the BCA (Pierce, Holland) and LAL (Endosafe, Charleston, USA) tests, respectively.
- Identification of CFP7A, CFPBA, CFPBB, CFP16, CFP19, CFP19B, CFP22A, CFP23A, CFP23B, CFP25A, CFP27, CFP30A, CWP32 and CFP50.
- Identification of CFP16 and CFP19B.
- ST-CF was precipitated with ammonium sulphate at 80% saturation. The precipitated proteins were removed by centrifugation and after resuspension washed with 8 M urea. CHAPS and glycerol were added to a final concentration of 0.5% (w/v) and 5% (v/v) respectively and the protein solution was applied to a Rotofor isoelectrical Cell (BioRad). The Rotofor Cell had been equilibrated with a 8M urea buffer containing 0.5% (w/v) CHAPS, 5% (v/v) glycerol, 3% (v/v) Biolyt 3/5 and 1% (v/v) Biolyt 4/6 (BioRad). Isoelectric focusing was performed in a pH gradient from 3-6. The frac-tions were analyzed on silver-stained 10-20% SDS-PAGE. Fractions with similar band patterns were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1-3 ml. An equal volume of SDS containing sample buffer was added and the protein solution boiled for 5 min before further separation on a Prep Cell (BioRad) in a matrix of 16% polyacrylamide under an electrical gradient. Fractions containing well separated bands in SDS-PAGE were selected for N-terminal sequencing after transfer to PVDF membrane.
- Isolation of CFP8A. CFP8B. CFP19. CFP23A. and CFP23B.
- ST-CF was precipitated with ammonium sulphate at 80% saturation and redissolved in PBS, pH 7.4, and dialysed 3 times against 25 mM Piperazin-HCl, pH 5.5, and subjected to chromatofocusing on a matrix of PBE 94 (Pharmacia) in a column connected to an FPLC system (Pharmacia). The column was equilibrated with 25 mM Piperazin-HCl, pH 5.5, and the elution was performed with 10% PB74-HCl, pH 4.0 (Pharmacia).
- Fractions with similar band patterns were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1-3 ml and separated on a Prepcell as described above.
- Identification of CFP22A
- ST-CF was concentrated approximately 10 fold by ultrafiltration and proteins were precipitated at 80% saturation, redissolved in PBS, pH 7.4, and dialysed 3 times against PBS, pH 7.4. 5.1 ml of the dialysed ST-CF was treated with RNase (0.2 mg/ml, QUIAGEN) and DNase (0.2 mg/ml, Boehringer Mannheim) for 6 h and placed on top of 6.4 ml of 48% (w/v) sucrose in PBS, pH 7.4, in Sorvall tubes (Ultracrimp 03987, DuPont Medical Products) and ultracentrifuged for 20 h at 257,300× gmax, 10° C. The pellet was redissolved in 200 μl of 25 mM Tris-192 mM glycine, 0.1% SDS, pH 8.3.
- Identification of CFP7A. CFP25A. CFP27. CFP30A and CFP50
- For CFP27, CFP30A and CFP50 ST-CF was concentrated approximately 10 fold by ultrafiltration and ammonium sulphate precipitation in the 45 to 55% saturation range was performed. Proteins were redissolved in 50 mM sodium phosphate, 1.5 M ammonium sulphate, pH 8.5, and subjected to thiophilic adsorption chromatography on an Affi-T gel column (Kem-EnTec). Proteins were eluted by a 1.5 to 0 M decreasing gradient of ammonium sulphate. Fractions with similar band patterns in SDS-PAGE were pooled and anion exchange chromatography was performed on a
Mono Q HR 5/5 column connected to an FPLC system (Pharmacia). The column was equilibrated with 10 mM Tris-HCl, pH 8.5, and the elution was performed with a gradient of NaCl from 0 to 1 M. Fractions containing well separated bands in SDS-PAGE were selected. - CFP7A and CFP25A were obtained as described above except for the following modification: ST-CF was concentrated approximately 10 fold by ultrafiltration and proteins were precipitated at 80% saturation, redissolved in PBS, pH 7.4, and dialysed 3 times against PBS, pH 7.4. Ammonium sulphate was added to a concentration of 1.5 M, and ST-CF proteins were loaded on an Affi T-gel column. Elution from the Affi T-gel column and anion exchange were performed as described above.
- Isolation of CWP32
- Heat treated H37Rv was subfractionated into subcellular fractions as described in Sorensen et al 1995. The Cell wall fraction was resuspended in 8 M urea, 0.2% (w/v) N-octyl β-D glucopyranoside (Sigma) and 5% (v/v) glycerol and the protein solution was applied to a Rotofor isoelectrical Cell (BioRad) which was equilibrated with the same buffer. Isoelectric focusing was performed in a pH gradient from 3-6. The fractions were analyzed by SDS-PAGE and fractions containing well separated bands were polled and subjected to N-terminal sequencing after transfer to PVDF membrane.
- N-Terminal Sequencing
- Fractions containing CFP7A, CFP8A, CFP8B, CFP16, CFP19, CFP19B, CFP22A, CFP23A, CFP23B, CFP27, CFP30A, CWP32, and CFP50A were blotted to PVDF membrane after Tricine SDS-PAGE (Ploug et al, 1989). The relevant bands were excised and subjected to N-terminal amino acid sequence analysis on a Procise 494 sequencer (Applied Biosystems). The fraction containing CFP25A was blotted to PVDF membrane after 2-DE PAGE (isoelectric focusing in the first dimension and Tricin SDS-PAGE in the second dimension). The relevant spot was excised and sequenced as described above.
- The following N-terminal sequences were obtained:
CFP7A: AEDVRAEIVA SVLEVVVNEG DQIDKGDVVV LLESMYMEIP (SEQ ID NO: 81) VLAEAAGTVS CFP8A: DPVDDAFIAKLNTAG (SEQ ID NO: 73) CFP8B: DPVDAIINLDNYGX (SEQ ID NO: 74) CFP16: AKLSTDELLDAFKEM (SEQ ID NO: 79) CFP19: TTSPDPYAALPKLPS (SEQ ID NO: 82) CFP19B: DPAXAPDVPTAAQLT (SEQ ID NO: 80) CFP22A: TEYEGPKTKF HALMQ (SEQ ID NO: 83) CFP23A: VIQ/AGMVT/GHIHXVAG (SEQ ID NO: 76) CFP23B: AEMKXFKNAIVQEID (SEQ ID NO: 75) CFP25A: AIEVSVLRVF TDSDG (SEQ ID NO: 78) CWP32: TNIVVLIKQVPDTWS (SEQ ID NO: 77) CFP27: TTIVALKYPG GVVMA (SEQ ID NO: 84) CFP30A: SFPYFISPEX AMRE (SEQ ID NO: 85) CFP50: THYDVVVLGA GPGGY (SEQ ID NO: 86) - N-Terminal Homology Searching in the Sanger Database and Identification of the Corresponding Genes.
- The N-terminal amino acid sequence from each of the proteins was used for a homology search using the blast program of the SangerMycobacterium tuberculosis database:
- http://www.sanger.ac.uk/projects/m-tuberculosis/TB-blast-server.
- For CFP23B, CFP23A, and CFP19B no similarities were found in the Sanger database. This could be due to the fact that only approximately 70% of theM. tuberculosis genome had been sequenced when the searches were performed. The genes encoding these proteins could be contained in the remaining 30% of the genome for which no sequence data is yet available.
- For CFP7A, CFP8A, CFP8B, CFP16, CFP19, CFP19B , CFP22A, CFP25A, CFP27, CFP30A, CWP32, and CFP50, the following information was obtained:
- CFP7A: Of the 50 determined amino acids in CFP7A a 98% identical sequence was found in cosmid csCY07Dl (contig 256): Score=226 (100.4 bits), Expect=1.4e-24, P=1.4e-24 Identities =49/50 (98%), Positives =49/50 (98%), Frame=−1
(SEQ ID NOs: 127, 128, and 129) Query: 1 AEDVRAEIVASVLEVVVNEGDQIDKGDVVVLLESMYMEIPVLAEAAGTVS 50 AEDVRAEIVASVLEVVVNEGDQIDKGDVVVLLESM MEIPVLAEAAGTVS Sbjct: 257679 AEDVRAEIVASVLEVVVNEGDQIDKGDVVVLLESMKMEIPVLAEAAGTVS 257530 - The identity is found within an open reading frame of 71 amino acids length corresponding to a theoretical MW of CFP7A of 7305.9 Da and a pI of 3.762. The observed molecular weight in an SDS-PAGE gel is 7 kDa.
- CFP8A: A sequence 80% identical to the 15 N-terminal amino acids was found on contig TB—1884. The N-terminally determined sequence from the protein purified from culture filtrate starts at amino acid 32. This gives a length of the mature protein of 98 amino acids corresponding to a theoretical MW of 9700 Da and a pI of 3.72 This is in good agreement with the observed MW on SDS-PAGE at approximately 8 kDa. The full length protein has a theoretical MW of 12989 Da and a pI of 4.38.
- CFP8B: A sequence 71% identical to the 14 N-terminal amino acids was found on contig TB—653. However, careful re-evaluation of the original N-terminal sequence data confirmed the identification of the protein. The N-terminally determined sequence from the protein purified from culture filtrate starts at
amino acid 29. This gives a length of the mature protein of 82 amino acids corresponding to a theoretical MW of 8337 Da and a pI of 4.23. This is in good agreement with the observed MW on SDS-PAGE at approximately 8 kDa. Analysis of the amino acid sequence predicts the presence of a signal peptide which has been cleaved of the mature protein found in culture filtrate. - CFP16: The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid MTCY20H.
- The identity is found within an open reading frame of 130 amino acids length corresponding to a theoretical MW of CFP16 of 13440.4 Da and a pI of 4.59. The observed molecular weight in an SDS-PAGE gel is 16 kDa.
- CFP19: The 15 aa N-terminal sequence was found to be 100t identical to a sequence found on cosmid MTCY270.
- The identity is found within an open reading frame of 176 amino acids length corresponding to a theoretical MW of CFP19 of 18633.9 Da and a pI of 5.41. The observed molecular weight in an SDS-PAGE gel is 19 kDa.
- CFP22A: The 15 aa N-terminal sequence was found to be 100t identical to a sequence found on cosmid MTCY1A6.
- The identity is found within an open reading frame of 181 amino acids length corresponding to a theoretical MW of CFP22A of 20441.9 Da and a pI of 4.73. The observed molecular weight in an SDS-PAGE gel is 22 kDa.
- CFP25A: The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on contig 255.
- The identity is found within an open reading frame of 228 amino acids length corresponding to a theoretical MW of CFP25A of 24574.3 Da and a pI of 4.95. The observed molecular weight in an SDS-PAGE gel is 25 kDa.
- CFP27: The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid MTCY261.
- The identity is found within an open reading frame of 291 amino acids length. The N-terminally determined sequence from the protein purified from culture filtrate starts at amino acid 58. This gives a length of the mature protein of 233 amino acids, which corresponds to a theoretical molecular weigh at 24422.4 Da, and a theoretical pI at 4.64. The observed weight in an SDS-PAGE gel is 27 kDa.
- CFP30A: Of the 13 determined amino acids in CFP30A, a 100% identical sequence was found on cosmid MTCY261.
- The identity is found within an open reading frame of 248 amino acids length corresponding to a theoretical MW of CFP30A of 26881.0 Da and a pI of 5.41. The observed molecular weight in an SDS-PAGE gel is 30 kDa.
- CWP32: The 15 amino acid N-terminal sequence was found to be 100% identical to a sequence found on contig 281. The identity was found within an open reading frame of 266 amino acids length, corresponding to a theoretical MW of CWP32 of 28083 Da and a pI of 4.563. The observed molecular weight in an SDS-PAGE gel is 32 kDa.
- CFP50: The 15 aa N-terminal sequence was found to be 100% identical to a sequence found in MTV038.06. The identity is found within an open reading frame of 464 amino acids length corresponding to a theoretical MW of CFP50 of 49244 Da and a pI of 5.66. The observed molecular weight in an SDS-PAGE gel is 50 kDa.
- Use of Homology Searching in the EMBL Database for Identification of CFP19A and CFP23.
- Homology searching in the EMBL database (using the GCG package of the Biobase, Århus-DK) with the amino acid sequences of two earlier identified highly immunoreactive ST-CF proteins, using the TFASTA algorithm, revealed that these proteins (CFP21 and CFP25, EXAMPLE 3) belong to a family of fungal cutinase homologs. Among the most homologous sequences were also twoMycobacterium tuberculosis sequences found on cosmid MTCY13E12. The first, MTCY13E12.04 has 46% and 50% identity to CFP25 and CFP21 respectively. The second, MTCY13E12.05, has also 46% and 50% identity to CFP25 and CFP21. The two proteins share 62.5t aa identity in a 184 residues overlap. On the basis of the high homology to the strong T-cell antigens CFP21 and CFP25, respectively, it is believed that CFP19A and CFP23 are possible new T-cell antigens.
- The first reading frame encodes a 254 amino acid protein of which the first 26 aa constitute a putative leader peptide that strongly indicates an extracellular location of the protein. The mature protein is thus 228 aa in length corresponding to a theoretical MW of 23149.0 Da and a Pi of 5.80. The protein is named CFP23.
- The second reading frame encodes an 231 aa protein of which the first 44 aa constitute a putative leader peptide that strongly indicates an extracellular location of the protein. The mature protein is thus 187 aa in length corresponding to a theoretical MW of 19020.3 Da and a Pi of 7.03. The protein is named CFP19A.
- The presence of putative leader peptides in both proteins (and thereby their presence in the ST-CF) is confirmed by theoretical sequence analysis using the signalP program at the Expasy molecular Biology server
- (http://expasy.hcuge.ch/www/tools.html).
- Searching for Homologies to CFP7A, CFP16, CFP19, CFP19A, CFP19B, CFP22A, CFP23, CFP25A, CFP27, CFP30A, CWP32 and CFP50 in the EMBL database.
- The amino acid sequences derived from the translated genes of the individual antigens were used for homology searching in the EMBL and Genbank databases using the TFASTA algorithm, in order to find homologous proteins and to address eventual functional roles of the antigens.
- CFP7A: CFP7A has 44% identity and 70% similarity to hypotheticalMethanococcus jannaschii protein (M. jannaschii from base 1162199-1175341), as well as 43% and 38t identity and 68 and 64% similarity to the C-terminal part of B. stearothermophilus pyruvate carboxylase and Streptococcus mutans biotin carboxyl carrier protein.
- CFP7A contains a consensus sequence EAMKM for a biotin binding site motif which in this case was slightly modified (ESMKM in amino acid residues 34 to 38). By incubation with alkaline phosphatase conjugated streptavidin after SDS-PAGE and transfer to nitrocellulose it was demonstrated that native CFP7A was biotinylated.
- CFP16: RplL gene, 130 aa. Identical to theM. bovis 50s ribosomal protein L7/L12 (acc. No P37381).
- CFP19: CFP19 has 47% identity and 55% similarity toE. coli pectinesterase homolog (ybhc gene) in a 150 aa overlap.
- CFP19A: CFP19A has between 38% and 45% identity to several cutinases from different fungal sp.
- In addition CFP19A has 46t identity and 61% similarity to CFP25 as well as 50% identity and 64% similarity to CFP21 (both proteins are earlier isolated from the ST-CF).
- CFP19B: No apparent homology
- CFP22A: No apparent homology CFP23: CFP23 has between 38% and 46% identity to several cutinases from different fungal sp.
- In addition CFP23 has 46% identity and 61% similarity to CFP25 as well as 50t identity and 63% similarity to CFP21 (both proteins are earlier isolated from the ST-CF).
- CFP25A: CFP25A has 95% identity in a 241 aa overlap to a putativeM. tuberculosis thymidylate synthase (450 aa accession No p28176).
- CFP27: CFP27 has 81% identity to a hypotheticalM. leprae protein and 64% identity and 78% similarity to Rhodococcus sp. proteasome beta-type subunit 2 (prcB(2) gene).
- CFP30A: CFP30A has 67t identity to Rhodococcus proteasome alfa-
type 1 subunit. - CWP32: The CWP32 N-terminal sequence is 100% identical to theMycobacterium leprae sequence MLCB637.03.
- CFP50: The CFP50 N-terminal sequence is 100t identical to a putative lipoamide dehydrogenase fromM. leprae (Accession 415183)
- Cloning of the Genes Encoding CFP7A, CFP8A, CFP8B, CFP16, CFP19, CFP19A, CFP22A CFP23, CFP25A, CFP27, CFP30A, CWP32, and CFP50.
- The genes encoding CFP7A, CFP8A, CFP8B, CFP16, CFP19, CFP19A, CFP22A, CFP23, CFP25A, CFP27, CFP30A, CWP32 and CFP50 were all cloned into the expression vector pMCT6, by PCR amplification with gene specific primers, for recombinant expression inE. coli of the proteins.
- PCR reactions contained 10 ng ofM. tuberculosis chromosomal DNA in 1× low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 ml reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 10 sec., 55° C. for 10 sec. and 72° C. for 90 sec, using thermocycler equipment from Idaho Technology.
- The DNA fragments were subsequently run on 1 agarose gels, the bands were excised and purified by Spin-X spin columns (Costar) and cloned into pBluescript SK II+-T vector (Stratagene). Plasmid DNA was hereafter prepared from clones harbouring the desired fragments, digested with suitable restriction enzymes and subcloned into the expression vector pMCT6 in frame with 8 histidines which are added to the N-terminal of the expressed proteins. The resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- For cloning of the individual antigens, the following gene specific primers were used:
- CFP7A: Primers used for cloning of cfp7A:
OPBR-79: AAGAGTAGATCTATGATGGCCGAGGATGTTCGCG (SEQ ID NO: 95) OPBR-80: CGGCGACGACGGATCCTACCGCGTCGG (SEQ ID NO: 96) - OPBR-79 and OPBR-80 create BglII and BamHI sites, respectively, used for the cloning in pMCT6.
- CFP8A: Primers used for cloning of cfp8A:
(SEQ ID NO: 154) CFP8A-F: CTGAGATCTATGAACCTACGGCGCC (SEQ ID NO: 155) CFP8A-R: CTCCCATGGTACCCTAGGACCCGGGCAGCCCCGGC - CFP8A-F and CFP8A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP8B: Primers used for cloning of cfp8B:
CFP8B-F: CTGAGATCTATGAGGCTGTCGTTGACCGC (SEQ ID NO: 156) CFP8B-R: CTCCCCGGGCTTAATAGTTGTTGCAGGAGC (SEQ ID NO: 157) - CFP8B-F and CFP8B-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP16: Primers used for cloning of cfp16:
OPBR-104: CCGGGAGATCTATGGCAAAGCTCTCCACCGACG (SEQ ID NOs: 111 and 130) OPBR-105: CGCTGGGCAGAGCTACTTGACGGTGACGGTGG (SEQ ID NOS: 112 and 131) - OPBR-104 and OPBR-105 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP19: Primers used for cloning of cfp19:
OPBR-96: GAGGAAGATCTATGACAACTTCACCCGACCCG (SEQ ID NO: 107) OPBR-97. CATGAAGCCATGGCCCGCAGGCTGCATG (SEQ ID NO: 108) - OPBR-96 and OPBR-97 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP19A: Primers used for cloning of cfp19A:
OPBR-88: CCCCCCAGATCTGCACCACCGGCATCGGCGGGC (SEQ ID NO: 99) OPBR-89. GCGGCGGATCCGTTGCTTAGCCGG (SEQ ID NO: 100) - OPBR-88 and OPBR-89 create BglII and BamHI sites, respectively, used for the cloning in pMCT6.
- CFP22A: Primers used for cloning of cfp22A:
OPBR-90: CCGGCTGAGATCTATGACAGAATACGAAGGGC (SEQ ID NO:101) OPBR-91: CCCCGCCAGGGAACTAGAGGCGGC (SEQ ID NO:102) - OPBR-90 and OPBR-91 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP23: Primers used for cloning of cfp23:
OPBR- CCTTGGGAGATCTTTGGACCCCGGTTGC (SEQ ID NO:97) 86: OPBR- GACGAGATCTTATGGGCTTACTGAC (SEQ ID NO:98) 87: - OPBR-86 and OPBR-87 both create a BglII site used for the cloning in pMCT6.
- CFP25A: Primers used for cloning of cfp25A:
OPBR-106: (SEQ ID NO:113) GGCCCAGATCTATGGCCATTGAGGTTTCGGTGTTGC OPBR-107: (SEQ ID NO:114) CGCCGTGTTGCATGGCAGCGCTGAGC - OPBR-106 and OPBR-107 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP27: Primers used for cloning of cfp27:
OPBR-92: (SEQ ID NO:103) CTGCCGAGATCTACCACCATTGTCGCGCTGAAATACCC OPBR-93: (SEQ ID NO:104) CGCCATGGCCTTACGCGCCAACTCG - OPBR-92 and OPBR-93 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP30A: Primers used for cloning of cfp30A:
OPBR-94: GGCGGAGATCTGTGAGTTTTCCGTATTTCATC (SEQ ID NO:105) OPBR-95: CGCGTCGAGCCATGGTTAGGCGCAG (SEQ ID NO:106) - OPBR-94 and OPBR-95 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CWP32: Primers used for cloning of cwp32:
CWP32-F: GCTTAGATCTATGATTTTCTGGGCAACCAGGTA (SEQ ID NO:158) CW232-R: GCTTCCATGGGCGAGGCACAGGCGTGGGAA (SEQ ID NO:159) - CWP32—F and CWP32—R create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- CFP50: Primers used for cloning of cfp50:
OPBR-100: GGCCGAGATCTGTGACCCACTATGACGTCGTCG (SEQ ID NO:109) OPBR-101: GGCGCCCATGGTCAGAAATTGATCATGTGGCCAA (SEQ ID NO:110) - OPBR-100 and OPBR-101 create BglII and NcoI sites, respectively, used for the cloning in pMCT6.
- Expression/Purification of Recombinant CFP7A, CFP8A, CFP8B, CFP16, CFP19, CFP19A, CFP22A, CFP23, CFP25A, CFP27, CFP30A, CWP32, and CFP50 proteins.
- Expression and metal affinity purification of recombinant proteins was undertaken essentially as described by the manufacturers. For each protein, 1 1 LB-media containing 100 μg/ml ampicillin, was inoculated with 10 ml of an overnight culture of XLI-Blue cells harbouring recombinant pMCT6 plasmids. Cultures were shaken at 37° C. until they reached a density of OD600=0.4-0.6. IPTG was hereafter added to a final concentration of 1 mM and the cultures were further incubated 4-16 hours. Cells were harvested, resuspended in 1×sonication buffer+8 M urea and sonicated 5×30 sec. with 30 sec. pausing between the pulses.
- After centrifugation, the lysate was applied to a column containing 25 ml of resuspended Talon resin (Clontech, Palo Alto, USA). The column was washed and eluted as described by the manufacturers.
- After elution, all fractions (1.5 ml each) were subjected to analysis by SDS-PAGE using the Mighty Small (Hoefer Scientific Instruments, USA) system and the protein concentrations were estimated at 280 nm. Fractions containing recombinant protein were pooled and dialysed against 3 M urea in 10 mM Tris-HCl, pH 8.5. The dialysed protein was further purified by FPLC (Pharmacia, Sweden) using a 6 ml Resource-Q column, eluted with a linear 0-1 M gradient of NaCl. Fractions were analyzed by SDS-PAGE and protein concentrations were estimated at OD280. Fractions containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5.
- Finally the protein concentration and the LPS content were determined by the BCA (Pierce, Holland) and LAL (Endosafe, Charleston, USA) tests, respectively.
- Identification of CFP7B, CFP10A, CFP11 and CFP30B.
- Isolation of CFP7B
- ST-CF was precipitated with ammonium sulphate at 80% saturation and redissolved in PBS, pH 7.4, and dialyzed 3 times against 25 mM Piperazin-HCl, pH 5.5, and subjected to cromatofocusing on a matrix of PBE 94 (Pharmacia) in a column connected to an FPLC system (Pharmacia). The column was equilibrated with 25 mM Piperazin-HCl, pH 5.5, and the elution was performed with 10% PB74-HCl, pH 4.0 (Pharmacia). Fractions with similar band patterns were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1-3 ml. An equal volume of SDS containing sample buffer was added and the protein solution boiled for 5 min before further separation on a MultiEluter (BioRad) in a matrix of 10-20% polyacrylamid (Andersen,P. & Heron,I., 1993). The fraction containing a well separated band below 10 kDa was selected for N-terminal sequencing after transfer to a PVDF membrane.
- Isolation of CFP11
- ST-CF was precipitated with ammonium sulphate at 80% saturation. The precipitated proteins were removed by centrifugation and after resuspension washed with 8 M urea. CHAPS and glycerol were added to a final concentration of 0.5% (w/v) and 5% (v/v) respectively and the protein solution was applied to a Rotofor isoelectrical Cell (BioRad). The Rotofor Cell had been equilibrated with an 8M urea buffer containing 0.5% (w/v) CHAPS, 5% (v/v) glycerol, 3% (v/v) Biolyt 3/5 and 1% (v/v) Biolyt 4/6 (BioRad). Isoelectric focusing was performed in a pH gradient from 3-6. The fractions were analyzed on silver-stained 10-20% SDS-PAGE. The fractions in the pH gradient 5.5 to 6 were pooled and washed three times with PBS on a Centriprep concentrator (Amicon) with a 3 kDa cut off membrane to a final volume of 1 ml. 300 mg of the protein preparation was separated on a 10-20% Tricine SDS-PAGE (Ploug et al 1989) and transferred to a PVDF membrane and Coomassie stained. The lowest band occurring on the membrane was excised and submitted for N-terminal sequencing.
- Isolation of CFP10A and CFP30B
- ST-CF was concentrated approximately 10-fold by ultrafiltration and ammonium sulphate precipitation at 80% saturation. Proteins were redissolved in 50 mM sodium phosphate, 1.5 M ammonium sulphate, pH 8.5, and subjected to thiophilic adsorption chromatography on an Affi-T gel column (Kem-En-Tec). Proteins were eluted by a 1.5 to 0 M decreasing gradient of ammonium sulphate. Fractions with similar band patterns in SDS-PAGE were pooled and anion exchange chromatography was performed on a
Mono Q HR 5/5 column connected to an FPLC system (Pharmacia). The column was equilibrated with 10 mM Tris-HCl, pH 8.5, and the elution was performed with a gradient of NaCl from 0 to 1 M. Fractions containing well separated bands in SDS-PAGE were selected. - Fractions containina CFP10A and CFP30B were blotted to PVDF membrane after 2-DE-PAGE (Ploug et al, 1989). The relevant spots were excised and subjected to N-terminal amino acid sequence analysis.
- N-Terminal Sequencing
- N-terminal amino acid sequence analysis was performed on a Procise 494 sequencer (applied Biosystems).
- The following N-terminal sequences were obtained:
CFP7B: PQGTVKWFNAEKGFG (SEQ ID NO:168) CFP10A: NVTVSIPTILRPXXX (SEQ ID NO:169) CFP11: TRFMTDPHANRDMAG (SEQ ID NO:170) CFP30B: PKRSEYRQGTPNWVD (SEQ ID NO:171) - “X” denotes an amino acid which could not be determined by the sequencing method used.
- N-Terminal Homology Searching in the Sanger Database and Identification of the Corresponding Genes.
- The N-terminal amino acid sequence from each of the proteins was used for a homology search using the blast program of the SangerMycobacterium tuberculosis genome database:
- http//www.sanger.ac.uk/projects/m-tuberculosis/TB-blast-server.
- For CFP11 a
sequence 100% identical to 15 N-terminal amino acids was found on contig TB—1314. The identity was found within an open reading frame of 98 amino acids length corresponding to a theoretical MW of 10977 Da and a pI of 5.14. - Amino acid number one can also be an Ala (insted of a Thr) as this sequence was also obtained (results not shown), and a 100% identical sequence to this N-terminal is found on contig TB—671 and on locus MTCI364.09.
- For CFP7B a
sequence 100% identical to 15 N-terminal amino acids was found on contig TB—2044 and on locus MTY15C10.04 with EMBL accession number: z95436. The identity was found within an open reading frame of 67 amino acids length corresponding to a theoretical MW of 7240 Da and a pI of 5.18. - For CFP10A a
sequence 100% identical to 12 N-terminal amino acids was found on contig TB—752 and on locus CY130.20 with EMBL accession number: Q10646 and Z73902. The identity was found within an open reading frame of 93 amino acids length corresponding to a theoretical MW of 9557 Da and a pI of 4.78. - For CFP30B a
sequence 100% identical to 15 N-terminal amino acids was found on contig TB—335. The identity was found within an open reading frame of 261 amino acids length corresponding to a theoretical MW of 27345 Da and a pI of 4.24. - The amino acid sequences of the purified antigens as picked from the Sanger database are shown in the following list.
CFP7B 1 MPQGTVKWFN AEKGFGFIAP EDGSADVFVH YTEIQGTGFR TLEENQKVEF (SEQ ID NO:147) 51 EIGHSPKGPQ ATGVRSL CFP10A 1 NNVTVSIPTI LRPHTGGQKS VSASGDTLGA VISDLEANYS GISERLMDPS (SEQ ID NO:141) 51 SPGKLHRFVN IYVNDEDVRF SGGLATAIAD GDSVTILPAV AGG CFP11 protein sequence 1 MATRFMTDPH AMRDMAGRFE VHAQTVEDEA RRNWASAQNI SGAGWSGMAE (SEQ ID NO:143) 51 ATSLDTMAQM NQAFRNIVNN LHGVRDGLVR DANNYEQQEQ ASQQILSS CFP30B 1 MPKRSEYRQG TPNWVDLQTT DQSAAKKFYT SLFGWGYDDN PVPGGGGVYS (SEQ ID NO:145) 51 MATLNGEAVA AIAPMPPGAP EGMPPIWNTY IAVDDVDAVV DKVVPGGGQV 101 MMPAFDIGDA GRMSFITDPT GAAVGLWQAN RHIGATLVNE TGTLIWNELL 151 TDKPDLALAF YEAVVGLTHS SMEIAAQQNY RVLKAGDAEV GGCMEPPMPG 201 VPNHWHVYFA VDDADATAAK AAAAGGQVIA EPADIPSVGR FAVLSDPQGA 251 IFSVLKPAPQ Q - Cloning of the Genes Encoding CFP7B, CFP10A, CFP11, and CFP30B.
- PCR reactions contained 10 ng ofM. tuberculosis chromosomal DNA in 1× low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 ml reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 10 sec., 55° C. for 10 sec. and 72° C. for 90 sec., using thermocycler equipment from Idaho Technology.
- The DNA fragments were subsequently run on 1% agarose gels, the bands were excised and purified by Spin-X spin columns (Costar) and cloned into pBluscript SK II+-T vector (Stratagene). Plasmid DNA was hereafter prepared from clones harbouring the desired fragments, digested with suitable restriction enzymes and subcloned into the expression vector pMCT6 in frame with 8 histidines which are added to the N-terminal of the expressed proteins. The resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- For cloning of the individual antigens, the following gene specific primers were used:
- CFP7B: Primers used for cloning of cfp7B:
CFP7B-F: CTGAGATCTAGAATGCCACAGGGAACTGTG (SEQ ID NO:160) CFP7B-R: TCTCCCGGGGGTAACTCAGAGCGAGCGGAC (SEQ ID NO:161) - CFP7B-F and CFP7B-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP10A: Primers used for cloning of cfp10A:
CFP10A-F: CTGAGATCTATGAACGTCACCGTATCC (SEQ ID NO: 162) CFP10A-R: TCTCCCGGGGCTCACCCACCGGCCACG (SEQ ID NO: 163) - CFP10A-F and CFP10A-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP11: Primers used for cloning of cfp11:
CFP11-F: CTGAGATCTATGGCAACACGTTTTATGACG (SEQ ID NO: 164) CFP11-R: CTCCCCGGGTTAGCTGCTGAGGATCTGCTH (SEQ ID NO: 165) - CFP11-F and CFP11-R create BglII and SmaI sites, respectively, used for the cloning in pMCT6.
- CFP30B: Primers used for cloning of cfp30B:
CFP30B-F: CTGAAGATCTATGCCCAAGAGAAGCGAATAC (SEQ ID NO: 166) CFP30B-R: CGGCAGCTGCTAGCATTCTCCGAATCTGCCG (SEQ ID NO: 167) - CFP30B-F and CFP30B-R create BglII and PvuII sites, respectively, used for the cloning in pMCT6.
- Expression/Purification of Recombinant CFP7B, CFP10A, CFP11 and CFP30B Protein.
- Expression and metal affinity purification of recombinant protein was undertaken essentially as described by the manufacturers. 1 1 LB-media containing 100 μg/ml ampicillin, was inoculated with 10 ml of an overnight culture of XL1-Blue cells harbouring recombinant pMCT6 plasmid. The culture was shaken at 37° C. until it reached a density of OD600=0.5. IPTG was hereafter added to a final concentration of 1 mM and the culture was further incubated 4 hours. Cells were harvested, resuspended in 1× sonication buffer+8 M urea and sonicated 5×30 sec. with 30 sec. pausing between the pulses.
- After centrifugation, the lysate was applied to a column containing 25 ml of resuspended Talon resin (Clontech, Palo Alto, USA). The column was washed and eluted as described by the manufacturers.
- After elution, all fractions (1.5 ml each) were subjected to analysis by SDS-PAGE using the Mighty Small (Hoefer Scientific Instruments, USA) system and the protein concentrations were estimated at 280 nm. Fractions containing recombinant protein were pooled and dialysed against 3 M urea in 10 mM Tris-HCl, pH 8.5. The dialysed protein was further purified by FPLC (Pharmacia, Sweden) using a 6 ml Resource-Q column, eluted with a linear 0-1 M gradient of NaCl. Fractions were analysed by SDS-PAGE and protein concentrations were estimated at OD280. Fractions containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5.
- Finally the protein concentration and the LPS content was determined by the BCA (Pierce, Holland) and LAL (Endosafe, Charleston, USA) tests, respectively.
- Cloning of the Gene Expressing CFP26 (MPT51)
- Synthesis and Design of Probes
- Oligonucleotide primers were synthesized automatically on a DNA synthesizer (Applied Biosystems, Forster City, Calif., ABI-391, PCR-mode) deblocked and purified by ethanol precipitation.
- Three oligonucleotides were synthesized (TABLE 3) on the basis of the nucleotide sequence from mpb51 described by Ohara et al. (1995). The oligonucleotides were engineered to include an EcoRI restriction enzyme site at the 5′ end and at the 3′ end by which a later subcloning was possible.
- Additional four oligonucleotides were synthesized on the basis of the nucleotide sequence from MPT51 (FIG. 5 and SEQ ID NO: 41). The four combinations of the primers were used for the PCR studies.
- DNA Cloning and PCR Technology
- Standard procedures were used for the preparation and handling of DNA (Sambrook et al., 1989). The gene mpt51 was cloned fromM. tuberculosis H37Rv chromosomal DNA by the use of the polymerase chain reactions (PCR) technology as described previously (Oettinger and Andersen, 1994). The PCR product was cloned in the pBluescriptSK +(Stratagene).
- Cloning of mpt51
- The gene, the signal sequence and the Shine Delgarno region of MPT51 was cloned by use of the PCR technology as two fragments of 952 bp and 815 bp in pBluescript SK+, designated pTO52 and pTO53.
- DNA Sequencing
- The nucleotide sequence of the cloned 952 bpM. tuberculosis H37Rv PCR fragment, pTO52, containing the Shine Dalgarno sequence, the signal peptide sequence and the structural gene of MPT51, and the nucleotide sequence of the cloned 815 bp PCR fragment containing the structural gene of MPT51, pTO53, were determined by the dideoxy chain termination method adapted for supercoiled DNA by use of the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., Cleveland, Ohio) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- The nucleotide sequences of pTO52 and pTO53 and the deduced amino acid sequence are shown in FIG. 5. The DNA sequence contained an open reading frame starting with a ATG codon at position 45-47 and ending with a termination codon (TAA) at position 942-944. The nucleotide sequence of the first 33 codons was expected to encode the signal sequence. On the basis of the known N-terminal amino acid sequence (Ala-Pro-Tyr-Glu-Asn) of the purified MPT51 (Nagai et al., 1991) and the features of the signal peptide, it is presumed that the signal peptidase recognition sequence (Ala-X-Ala) (von Heijne, 1984) is located in front of the N-terminal region of the mature protein at position 144. Therefore, a structural gene encoding MPT51, mpt51, derived fromM. tuberculosis H37Rv was found to be located at position 144-945 of the sequence shown in FIG. 5. The nucleotide sequence of mpt51 differed with one nucleotide compared to the nucleotide sequence of MPB51 described by Ohara et al. (1995) (FIG. 5). In mpt51 at position 780 was found a substitution of a guanine to an adenine. From the deduced amino acid sequence this change occurs at a first position of the codon giving a amino acid change from alanine to threonine. Thus it is concluded, that mpt51 consists of 801 bp and that the deduced amino acid sequence contains 266 residues with a molecular weight of 27,842, and MPT51 show 99,8% identity to MPB51.
- Subcloning of mpt51
- An EcoRI site was engineered immediately 5′ of the first codon of mpt51 so that only the coding region of the gene encoding MPT51 would be expressed, and an EcoRI site was incorporated right after the stop codon at the 3′ end.
- DNA of the recombinant plasmid pTO53 was cleaved at the EcoRI sites. The 815 bp fragment was purified from an agarose gel and subcloned into the EcoRI site of the pMAL-cR1 expression vector (New England Biolabs), pTO54. Vector DNA containing the gene fusion was used to transform theE. coli XL1-Blue by the standard procedures for DNA manipulation.
- The endpoints of the gene fusion were determined by the dideoxy chain termination method as described under section DNA sequencing. Both strands of the DNA were sequenced.
- Preparation and Purification of rMPT51
- Recombinant antigen was prepared in accordance with instructions provided by New England Biolabs. Briefly, single colonies ofE. coli harbouring the pTO54 plasmid were inoculated into Luria-Bertani broth containing 50 μg/ml ampicillin and 12.5 μg/ml tetracycline and grown at 37° C. to 2×108 cells/ml. Isopropyl-β-D-thiogalactoside (IPTG) was then added to a final concentration of 0.3 mM and growth was continued for further 2 hours. The pelleted bacteria were stored overnight at −20° C. in new column buffer (20 mM Tris/HCl, pH 7.4, 200 mM NaCl, 1 mM EDTA, 1 mM dithiothreitol (DTT))and thawed at 4° C. followed by incubation with 1 mg/ml lysozyme on ice for 30 min and sonication (20 times for 10 sec with intervals of 20 sec). After centrifugation at 9,000× g for 30 min at 4° C., the maltose binding protein-MPT51fusion protein (MBP-rMPT51) was purified from the crude extract by affinity chromatography on amylose resin column. MBP-rMPT51 binds to amylose. After extensive washes of the column, the fusion protein was eluted with 10 mM maltose. Aliquots of the fractions were analyzed on 10% SDS-PAGE. Fractions containing the fusion protein of interest were pooled and was dialysed extensively against physiological saline.
- Protein concentration was determined by the BCA method supplied by Pierce (Pierce Chemical Company, Rockford, Ill.).
TABLE 3 Sequence of the mpt51 oligonucleotidesa. Orientation and Positionb oligonucleotidea Sequences (5′ → 3′) (nucleotide) Sense MPT51-1 CTCGAATTCGCCGGGTGCACACAG 6-21 (SEQ ID NO: 28) (SEQ ID NO: 41) MPT51-3 CTCGAATTCGCCCCATACGAGAAC 143-158 (SEQ ID NO: 29) (SEQ ID NO: 41) MPT51-5 GTGTATCTGCTGGAC 228-242 (SEQ ID NO: 30) (SEQ ID NO: 41) MPT51-7 CCGACTGGCTGGCCG 418-432 (SEQ ID NO: 31) (SEQ ID NO: 41) Antisense MPT51-2 GAGGAATTCGCTTAGCGGATCGCA 946-932 (SEQ ID NO: 32) (SEQ ID NO: 41) MPT51-4 CCCACATTCCGTTGG 642-628 (SEQ ID NO: 33) (SEQ ID NO: 41) MPT51-6 GTCCAGCAGATACAC 242-228 (SEQ ID NO: 34) (SEQ ID NO: 41) - Cloning of mpt51 in the Expression Vector pMST24.
- A PCR fragment was produced from pTO52 using the primer combination MPT51-F and MPT51-R (TABLE 4). A BamHI site was engineered immediately 5′ of the first codon of mpt51 so that only the coding region of the gene encoding MPT51 would be expressed, and an NcoI site was incorporated right after the stop codon at the 3′ end.
- The PCR product was cleaved at the BamHI and the NcoI site. The 811 bp fragment was purified from an agarose gel and subcloned into the BamHI and the NcoI site of the pMST24 expression vector, pT086. Vector DNA containing the gene fusion was used to transform theE. coli XL1-Blue by the standard procedures for DNA manipulation.
- The nucleotide sequence of complete gene fusion was determined by the dideoxy chain termination method as described under section DNA sequencing. Both strands of the DNA were sequenced.
- Preparation and Purification of rMPT51.
- Recombinant antigen was prepared from single colonies ofE. coli harbouring the pTO86 plasmid inoculated into LuriaBertani broth containing 50 μg/ml ampicillin and 12.5 μg/ml tetracycline and grown at 37° C. to 2×108 cells/ml. Isopropyl-β-D-thiogalactoside (IPTG) was then added to a final concentration of 1 mM and growth was continued for further 2 hours. The pelleted bacteria were resuspended in
BC 100/20 buffer (100 mM KCl, 20 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol). Cells were broken by sonication (20 times for 10 sec with intervals of 20 sec). After centrifugation at 9,000× g for 30 min. at 4° C. the insoluble matter was resuspended inBC 100/20 buffer with 8 M urea followed by sonication and centrifugation as above. The 6× His tag-MPT51 fusion protein (His-rMPT51) was purified by affinity chromatography on Ni-NTA resin column (Qiagen, Hilden, Germany). His-rMPT51 binds to Ni-NTA. After extensive washes of the column, the fusion protein was eluted withBC 100/40 buffer (100 mM KCl, 40 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol) with 8 M urea and BC 1000/40 buffer (1000 mM KCl, 40 mM Imidazole, 20 mM Tris/HCl, pH 7.9, 20% glycerol) with 8 M urea. His-rMPT51 was extensive dialysed against 10 mM Tris/HCl, pH 8.5, 3 M urea followed by purification using fast protein liquid chromatography (FPLC) (Pharmacia, Uppsala, Sweden), over an anion exchange column (Mono Q) using 10 mM Tris/HCl, pH 8.5, 3 M urea with a 0-1 M NaCl linear gradient. Fractions containing rMPT51 were pooled and subsequently dialysed extensively against 25 mM Hepes, pH 8.0 before use. - Protein concentration was determined by the BCA method supplied by Pierce (Pierce Chemical Company, Rockford, Ill.). The lipopolysaccharide (LPS) content was determined by the limulus amoebocyte lysate test (LAL) to be less than 0.004 ng/μg rMPT51, and this concentration had no influence on cellular activity.
TABLE 4 Sequence of the mpt51 oligonucleotides. Orientation and oligo- Position nucleotide Sequences (5′ → 3′) (nt) Sense MPT51-F CTCGGATCCTGCCCCATACGAGAACCTG 139-156 Antisense MPT51-R CTCCCATGGTTAGCGGATCGCACCG 939-924 - Cloning of the ESAT6-MPT59 and the MPT59-ESAT6 Hybrides.
- Background for ESAT-MPT59 and MPT59-ESAT6 Fusion
- Several studies have demonstrated that ESAT-6 is a an immunogen which is relatively difficult to adjuvate in order to obtain consistent results when immunizing therewith. To detect an in vitro recognition of ESAT-6 after immunization with the antigen is very difficult compared to the strong recognition of the antigen that has been found during the recall of memory immunity toM. tuberculosis. ESAT-6 has been found in ST-CF in a truncated version were amino acids 1-15 have been deleted. The deletion includes the main T-cell epitopes recognized by C57BL/6j mice (Brandt et al., 1996). This result indicates that ESAT-6 either is N-terminally processed or proteolytically degraded in ST-CF. In order to optimize ESAT-6 as an immunogen, a gene fusion between ESAT-6 and another major T cell antigen MPT59 has been constructed. Two different construct have been made: MPT59-ESAT-6 (SEQ ID NO: 172) and ESAT-6-MPT59 (SEQ ID NO: 173). In the first hybrid ESAT-6 is N-terminally protected by MPT59 and in the latter it is expected that the fusion of two dominant T-cell antigens can have a synergistic effect.
- The genes encoding the ESAT6-MPT59 and the MPT59-ESAT6 hybrides were cloned into the expression vector pMCT6, by PCR amplification with gene specific primers, for recombinant expression inE. coli of the hybrid proteins.
- Construction of the Hybrid MPT59-ESAT6.
- The cloning was carried out in three steps. First the genes encoding the two components of the hybrid, ESAT6 and MPT59, were PCR amplified using the following primer constructions:
- ESAT6:
OPBR-4: GGCGCCGGCAAGCTTGCCATGACAGAGCAGCAGTGG (SEQ ID NO: 132) OPBR-28: CGAACTCGCCGGATCCCGTGTTTCGC (SEQ ID NO: 133) - OPBR-4 and OPBR-28 create HinDIII and BamHI sites, respectively.
- MPT59:
OPBR-48: GGCAACCGCGAGATCTTTCTCCCGGCCGGGGC (SEQ ID NO: 134) OPBR-3: GGCAAGCTTGCCGGCGCCTAACGAACT (SEQ ID NO: 135) - OPBR-48 and OPBR-3 create BglII and HinDIII, respectively. Additionally OPBR-3 deletes the stop codon of MPT59.
- PCR reactions contained 10 ng ofM. tuberculosis chromosomal DNA in 1× low salt Taq+ buffer from Stratagene supplemented with 250 mM of each of the four nucleotides (Boehringer Mannheim), 0,5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer and 0.5 unit Tag+DNA polymerase (Stratagene) in 10 μl reaction volume. Reactions were initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 10 sec., 550C for 10 sec. and 72° C. for 90 sec, using thermocycler equipment from Idaho Technology.
- The DNA fragments were subsequently run on 1% agarose gels, the bands were excised and purified by Spin-X spin columns (Costar). The two PCR fragments were digested with HinDIII and ligated. A PCR amplification of the ligated PCR fragments encoding MPT59-ESAT6 was carried out using the primers OPBR-48 and OPBR-28. PCR reaction was initially heated to 94° C. for 25 sec. and run for 30 cycles of the program; 94° C. for 30 sec., 55° C. for 30 sec. and 72° C. for 90 sec. The resulting PCR fragment was digested with BglII and BamHI and cloned into the expression vector pMCT6 in frame with 8 histidines which are added to the N-terminal of the expressed protein hybrid. The resulting clones were hereafter sequenced by use of the dideoxy chain termination method adapted for supercoiled DNA using the Sequenase DNA sequencing kit version 1.0 (United States Biochemical Corp., USA) and by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
- Construction of the Hybrid ESAT6-MPT59.
- Construction of the hybrid ESAT6-MPT59 was carried out as described for the hybrid MPT59-ESAT6. The primers used for the construction and cloning were:
- ESAT6:
OPBR-75: GGACCCAGATCTATGACAGAGCAGCAGTGG (SEQ ID NO: 136) OPBR-76: CCGGCAGCCCCGGCCGGGAGAAAAGCTTTGCGAACATCCCAGTGACG (SEQ ID NO: 137) - OPBR-75 and OPBR-76 create BglII and HinDIII sites, respectively. Additionally OPBR-76 deletes the stop codon of ESAT6.
- MPT59:
OPBR-77: GTTCGCAAAGCTTTTCTCCCGGCCGGGGCTGCCGGTCGAGTACC (SEQ ID NO: 138) OPBR-18: CCTTCGGTGGATCCCGTCAG (SEQ ID NO: 139) - OPBR-77 and OPBR-18 create HinDIII and BamHI sites, respectively.
- Expression/Purification of MPT59-ESAT6 and ESAT6-MPT59 Hybrid Proteins.
- Expression and metal affinity purification of recombinant proteins was undertaken essentially as described by the manufacturers. For each protein, 1 l LB-media containing 100 μg/ml ampicillin, was inoculated with 10 ml of an overnight culture of XL1-Blue cells harbouring recombinant pMCT6 plasmids. Cultures were shaken at 37° C. until they reached a density of OD600=0.4-0.6. IPTG was hereafter added to a final concentration of 1 mM and the cultures were further incubated 4-16 hours. Cells were harvested, resuspended in 1× sonication buffer+8 M urea and sonicated 5×30 sec. with 30 sec. pausing between the pulses.
- After centrifugation, the lysate was applied to a column containing 25 ml of resuspended Talon resin (Clontech, Palo Alto, USA). The column was washed and eluted as described by the manufacturers.
- After elution, all fractions (1.5 ml each) were subjected to analysis by SDS-PAGE using the Mighty Small (Hoefer Scientific Instruments, USA) system and the protein concentrations were estimated at 280 nm. Fractions containing recombinant protein were pooled and dialysed against 3 M urea in 10 mM Tris-HCl, pH 8.5. The dialysed protein was further purified by FPLC (Pharmacia, Sweden) using a 6 ml Resource-Q column, eluted with a linear 0-1 M gradient of NaCl. Fractions were analyzed by SDS-PAGE and protein concentrations were estimated at OD280. Fractions containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5.
- Finally the protein concentration and the LPS content were determined by the BCA (Pierce, Holland) and LAL (Endosafe, Charleston, USA) tests, respectively.
- The biological activity of the MPT59-ESAT6 fusion protein is described in Example 6A.
- Mapping of the Purified Antigens in a 2DE System.
- In order to characterize the purified antigens they were mapped in a 2-dimensional electrophoresis (2DE) reference system. This consists of a silver stained gel containing ST-CF proteins separated by isoelectrical focusing followed by a separation according to size in a polyacrylamide gel electrophoresis. The 2DE was performed according to Hochstrasser et al. (1988). 85 μg of ST-CF was applied to the isoelectrical focusing tubes where BioRad ampholytes BioLyt 4-6 (2 parts) and BioLyt 5-7 (3 parts) were included. The first dimension was performed in acrylamide/piperazin diacrylamide tube gels in the presence of urea, the detergent CHAPS and the reducing agent DTT at 400 V for 18 hours and 800 V for 2 hours. The second dimension 10-20% SDS-PAGE was performed at 100 V for 18 hours and silver stained. The identification of CFP7, CFP7A, CFP7B, CFP8A, CFP8B, CFP9, CFP11, CFP16, CFP17, CFP19, CFP20, CFP21, CFP22, CFP25, CFP27, CFP28, CFP29, CFP30A, CFP50, and MPT51 in the 2DE reference gel were done by comparing the spot pattern of the purified antigen with ST-CF with and without the purified antigen. By the assistance of an analytical 2DE software system (Phoretix International, UK) the spots have been identified in FIG. 6. The position of MPT51 and CFP29 were confirmed by a Western blot of the 2DE gel using the Mab's anti-CFP29 and HBT 4.
- Biological Activity of the Purified Antigens.
- IFN-γ Induction in the Mouse Model of TB Infection
- The recognition of the purified antigens in the mouse model of memory immunity to TB (described in example 1) was investigated. The results shown in TABLE 5 are representative for three experiments.
- A very high IFN-γ response was induced by two of the antigens CFP17 and CFP21 at almost the same high level as ST-CF.
TABLE 5 IFN-γ release from splenic memory effector cells from C57BL/6J mice isolated after reinfection with M. tuberculosis after stimulation with native antigens. Antigena IFN-γ (pg/ml)b ST-CF 12564 CFP7 NDd CFP9 ND CFP17 9251 CFP20 2388 CFP21 10732 CFP22 + CFP25c 5342 CFP26 (MPT51) ND CFP28 2818 CFP29 3700 - Skin Test Reaction in TB Infected Guinea Pigs
- The skin test activity of the purified proteins was tested inM. tuberculosis infected guinea pigs.
- 1 group of guinea pigs was infected via an ear vein with 1×104 CFU of M. tuberculosis H37Rv in 0,2 ml PBS. After 4 weeks skin tests were performed and 24 hours after injection erythema diameter was measured.
- As seen in TABLES 6 and 6a all of the antigens induced a significant Delayed Type Hypersensitivity (DTH) reaction.
TABLE 6 DTH erythema diameter in guinea pigs infected with 1 × 104 CFU of M. tuberculosis, after stimulation with native antigens. Antigena Skin reaction (mm)b Control 2.00 PPDc 15.40 (0.53) CFP7 NDe CFP9 ND CFP17 11.25 (0.84) CFP20 8.88 (0.13) CFP21 12.44 (0.79) CFP22 + CFP25d 9.19 (3.10) CFP26 (MPT51) ND CFP28 2.90 (1.28) CFP29 6.63 (0.88) - Together these analyses indicate that most of the antigens identified were highly biologically active and recognized during TB infection in different animal models.
TABLE 6a DTH erythema diameter of recombinant antigens in outbred guinea pigs infected with 1 × 104 CFU of M. Tuberculosis. Antigena Skin reaction (mm)b Control 2.9 (0.3) PPDc 14.5 (1.0) CFP 7a 13.6 (1.4) CFP 176.8 (1.9) CFP 206.4 (1.4) CFP 215.3 (0.7) CFP 2510.8 (0.8) CFP 297.4 (2.2) MPT 514.9 (1.1) - Biological Activity of the Purified Recombinant Antigens.
- Interferon-γInduction in the Mouse Model of TB Infection.
- Primary infections. 8 to 12 weeks old female C57BL/6j(H-2b), CBA/J(H-2k), DBA.2(H-2d) and A.SW(H-2s) mice (Bomholtegaard, Ry) were given intravenous infections via the lateral tail vein with an inoculum of 5×104 M. tuberculosis suspended in PBS in a vol. of 0.1 ml. 14 days postinfection the animals were sacrificed and spleen cells were isolated and tested for the recognition of recombinant antigen.
- As seen in TABLE 7 the recombinant antigens rCFP7A, rCFP17, rCFP21, rCFP25, and rCFP29 were all recognized in at least two strains of mice at a level comparable to ST-CF. rMPT51 and rCFP7 were only recognized in one or two strains respectively, at a level corresponding to no more than ⅓ of the response detected after ST-CF stimulation. Neither of the antigens rCFP20 and rCFP22 were recognized by any of the four mouse strains.
- Memory responses. 8-12 weeks old female C57BL/6j (H-2b) mice (Bomholtegaard, Ry) were given intravenous infections via the lateral tail vein with an inoculum of 5×104 M. tuberculosis suspended in PBS in a vol. of 0.1 ml. After 1 month of infection the mice were treated with isoniazid (Merck and Co., Rahway, N.J.) and rifabutin (Farmatalia Carlo Erba, Milano, Italy) in the drinking water, for two months. The mice were rested for 4-6 months before being used in experiments. For the study of the recall of memory immunity, animals were infected with an inoculum of 1×106 bacteria i.v. and sacrificed at day 4 postinfection. Spleen cells were isolated and tested for the recognition of recombinant antigen. As seen from TABLE 8, IFN-γ release after stimulation with rCFP17, rCFP21 and rCFP25 was at the same level as seen from spleen cells stimulated with ST-CF. Stimulation with rCFP7, rCFP7A and rCFP29 all resulted in an IFN-γ no higher than ⅓ of the response seen with ST-CF. rCFP22 was not recognized by IFN-γ producing cells. None of the antigens stimulated IFN-γ release in naive mice. Additionally non of the antigens were toxic to the cell cultures.
TABLE 7 T cell responses in primary TB infection. Name c57BL/6J(H2b) DBA.2(H2d) CBA/J(H2k) A.SW(H2s) rCFP7 + + − − rCFP7A +++ +++ +++ + rCFP17 +++ + +++ + rCFP20 − − − − rCFP21 +++ +++ +++ + rCFP22 − − − − rCFP25 +++ ++ +++ + rCFP29 +++ +++ +++ ++ rMPT51 + − − − -
TABLE 8 T cell responses in memory immune animals. Name Memory response rCFP7 + rCFP7A ++ rCFP17 +++ rCFP21 +++ rCFP22 − rCFP29 + rCFP25 +++ rMPT51 + - Interferon-γ Induction in Human TB Patients and BCG Vaccinated People.
- Human donors: PBMC were obtained from healthy BCG vaccinated donors with no known exposure to patients with TB and from patients with culture or microscopy proven infection withMycobacterium tuberculosis. Blood samples were drawn from the TB patients 1-4 months after diagnosis.
- Lymphocyte preparations and cell culture: PBMC were freshly isolated by gradient centrifugation of heparinized blood on Lymphoprep (Nycomed, Oslo, Norway). The cells were resuspended in complete medium: RPMI 1640 (Gibco, Grand Island, N.Y.) supplemented with 40 μg/ml streptomycin, 40 U/ml penicillin, and 0.04 mM/ml glutamine, (all from Gibco Laboratories, Paisley, Scotland) and 10% normal human ABO serum (NHS) from the local blood bank. The number and the viability of the cells were determined by trypan blue staining. Cultures were established with 2,5×105 PBMC in 200 μl in microtitre plates (Nunc, Roskilde, Denmark) and stimulated with no antigen, ST-CF, PPD (2.5 μg/ml); rCFP7, rCFP7A, rCFP17, rCFP20, rCFP21, rCFP22, rCFP25, rCFP26, rCFP29, in a final concentration of 5 μg/ml. Phytohaemagglutinin, 1 μg/ml (PHA, Difco laboratories, Detroit, Mich. was used as a positive control. Supernatants for the detection of cytokines were harvested after 5 days of culture, pooled and stored at −80° C. until use.
- Cytokine analysis: Interferon-γ (IFN-γ) was measured with a standard ELISA technique using a commercially available pair of mAb's from Endogen and used according to the instructions for use. Recombinant IFN-γ (Gibco laboratories) was used as a standard. The detection level for the assay was 50 pg/ml. The variation between the duplicate wells did not exceed 10% of the mean. Responses of 9 individual donors are shown in TABLE 9.
- A seen in TABLE 9 high levels of IFN-γ release are obtained after stimulation with several of the recombinant antigens. rCFP7a and rCFP17 gives rise to responses comparable to ST-CF in almost all donors. rCFP7 seems to be most strongly recognized by BCG vaccinated healthy donors. rCFP21, rCFP25, rCFP26, and rCFP29 gives rise to a mixed picture with intermediate responses in each group, whereas low responses are obtained by rCFP20 and rCFP22.
TABLE 9 Mean values of results from the stimulation of human blood cells from 7 BCG vaccinated and 7 TB patients with recombinant antigens. SE values are given for each antigen. ST-CF and M. avium culture filtrate are shown for the comparison. donor: no ag PHA PPD STCF CFP7 CFP17 CFP7A CFP20 CFP21 CFP22 CFP25 CFP26 CFP29 Controls, Healthy, BCG vaccinated, no known TB exposure 1 6 9564 6774 3966 7034 69 1799 58 152 73 182 946 86 2 48 12486 6603 8067 3146 10044 5267 29 6149 51 1937 526 2065 3 190 11929 10000 8299 8015 11563 8641 437 3194 669 2531 8076 6098 4 10 21029 4106 3537 1323 1939 5211 1 284 1 1344 20 125 5 1 18750 14209 13027 17725 8038 19002 1 3008 1 2103 974 8181 TB patients, 1-4 month after diagnosis 6 9 8973 5096 6145 852 4250 4019 284 1131 48 2400 1078 4584 7 1 12413 6281 3393 168 6375 4505 11 4335 16 3082 1370 5115 8 4 11915 7671 7375 104 2753 3356 119 407 437 2069 712 5284 9 32 22130 16417 17213 8450 9783 16319 91 5957 67 10043 13313 9953 - Four groups of 6-8 weeks old, female C57Bl/6J mice (Bomholtegard, Denmark) were immunized subcutaneously at the base of the tail with vaccines of the following compositions:
- Group 1: 10 μg ESAT-6/DDA (250 μg)
- Group 2: 10 μg MPT59/DDA (250 μg)
- Group 3: 10 μg MPT59-ESAT-6/DDA (250 μg)
- Group 4: Adjuvant control group: DDA (250 μg) in NaCl
- The animals were injected with a volume of 0.2 ml. Two weeks after the first injection and 3 weeks after the second injection the mice were boosted a little further up the back. One week after the last immunization the mice were bled and the blood cells were isolated. The immune response induced was monitored by release of IFN-γ into the culture supernatants when stimulated in vitro with relevant antigens (see the following table).
Immunogen For restimulationa): Ag in vitro 10 μg/dose no antigen ST-CF ESAT-6 MPT59 ESAT-6 219 ± 219 569 ± 569 835 ± 633 — MPT59 0 802 ± 182 — 5647 ± 159 Hybrid: 127 ± 127 7453 ± 581 15133 ± 861 16363 ± 1002 MPT59- ESAT-6 - The experiment demonstrates that immunization with the hybrid stimulates T cells which recognize ESAT-6 and MPT59 stronger than after single antigen immunization. Especially the recognition of ESAT-6 was enhanced by immunization with the MPT59-ESAT-6 hybrid. IFN-γ release in control mice immunized with DDA never exceeded 1000 pg/ml.
- The recombinant antigens were tested individually as subunit vaccines in mice. Eleven groups of 6-8 weeks old, female C57Bl/6j mice (BomholtegArd, Denmark) were immunized subcutaneously at the base of the tail with vaccines of the following composition:
- Group 1: 10 μg CFP7
- Group 2: 10 μg CFP17
- Group 3: 10 μg CFP21
- Group 4: 10 μg CFP22
- Group 5: 10 μg CFP25
- Group 6: 10 μg CFP29
- Group 7: 10 μg MPT51
- Group 8: 50 μg ST-CF
- Group 9: Adjuvant control group
- Group 10:
BCG - Group 11: Control group: Untreated
- All the subunit vaccines were given with DDA as adjuvant. The animals were vaccinated with a volume of 0.2 ml. Two weeks after the first injection and three weeks after the second injection group 1-9 were boosted a little further up the back. One week after the last injection the mice were bled and the blood cells were isolated. The immune response induced was monitored by release of IFN-γ into the culture supernatant when stimulated in vitro with the homologous protein.
- 6 weeks after the last immunization the mice were aerosol challenged with 5×106 viable Mycobacterium tuberculosis/ml. After 6 weeks of infection the mice were killed and the number of viable bacteria in lung and spleen of infected mice was determined by plating serial 3-fold dilutions of organ homogenates on 7H11 plates. Colonies were counted after 2-3 weeks of incubation. The protective efficacy is expressed as the difference between log10 values of the geometric mean of counts obtained from five mice of the relevant group and the geometric mean of counts obtained from five mouse of the relevant control group.
- The results from the experiments are presented in the following table.
- Immunogenicity and Protective Efficacy in Mice, of ST-CF and 7 Subunit Vaccines
Subunit Vaccine Immunogenicity Protective efficacy ST-CF +++ +++ CFP7 ++ − CFP17 +++ +++ CFP21 +++ ++ CFP22 − − CFP25 +++ +++ CFP29 +++ +++ MPT51 +++ ++ - In conclusion, we have identified a number of proteins inducing high levels of protection. Three of these CFP17, CFP25 and CFP29 giving rise to similar levels of protection as ST-CF and BCG while two proteins CFP21 and MPT51 induces protections around 2/3 the level of BCG and ST-CF. Two of the proteins CFP7 and CFP22 did not induce protection in the mouse model.
- Species distribution of cfp7, cfp9, mpt51, rd1-orf2, rdlorf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b as well as of cfp7a, cfp7b, cfp10a, cfp17, cfp20, cfp21, cfp22, cfp22a, cfp23, cfp25 and cfp25a.
- Presence of cfp7, cfp9, mpt51, rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b in different Mycobacterial Species.
- In order to determine the distribution of the cfp7, cfp9, mpt51, rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b genes in species belonging to theM. tuberculosis-complex and in other mycobacteria PCR and/or Southern blotting was used. The bacterial strains used are listed in TABLE 10. Genomic DNA was prepared from mycobacte-rial cells as described previously (Andersen et al. 1992).
- PCR analyses were used in order to determine the distribution of the cfp7, cfp9 and mpt51 gene in species belonging to the tuberculosis-complex and in other mycobacteria. The bacterial strains used are listed in TABLE 10. PCR was performed on genomic DNA prepared from mycobacterial cells as described previously (Andersen et al., 1992).
- The oligonucleotide primers used were synthesised automatically on a DNA synthesizer (Applied Biosystems, Forster City, Calif., ABI-391, PCR-mode), deblocked, and purified by ethanol precipitation. The primers used for the analyses are shown in TABLE 11.
- The PCR amplification was carried out in a thermal reactor (Rapid cycler, Idaho Technology, Idaho) by mixing 20 ng chromosomal with the mastermix (contained 0.5 μM of each oligonucleotide primer, 0.25 μM BSA (Stratagene), low salt buffer (20 mM Tris-HCl, pH8.8 , 10 mM KCl, 10 mM (NH4)2SO4, 2 mM MgSO4 and 0, 1% Triton X-100) (Stratagene), 0.25 mM of each deoxynucleoside triphosphate and 0.5 U Taq Plus Long DNA polymerase (Stratagene)). Final volume was 10 μl (all concentrations given are concentrations in the final volume). Predenaturation was carried out at 94° C. for 30 s. 30 cycles of the following was performed: Denaturation at 94° C. for 30 s, annealing at 55° C. for 30 s and elongation at 72° C. for 1 min.
- The following primer combinations were used (the length of the amplified products are given in parentheses):
- mpt51: MPT51-3 and MPT51-2 (820 bp), MPT51-3 and MPT51-6 (108 bp), MPT51-5 and MPT51-4 (415 bp), MPT51-7 and MPT51-4 (325 bp).
- cfp7: pVF1 and PVR1 (274 bp), pVF1 and PVR2 (197 bp), pVF3 and PVR1 (302 bp), pVF3 and PVR2 (125 bp). cfp9: stR3 and stF1 (351 bp).
TABLE 10 Mycobacterial strains used in this Example. Species and strain(s) Source 1. M. tuberculosis H37RvATCCa (ATCC 27294) 2. H37RaATCC (ATCC 25177) 3. Erdman Obtained from A. Lazlo, Ottawa, Canada 4. M. bovis BCG substrain: SSIb Danish 1331 5. Chinese SSI c6. Canadian SSI c7. Glaxo SSIc 8. Russia SSI c9. Pasteur SSI c10. Japan WHOe 11. M. bovis MNC 27 SSIc 12. M. africanum Isolated from a Danish patient 13. M. leprae (armadillo-derived) Obtained from J. M. Colston, London, UK 14. M. avium (ATCC 15769) ATCC 15. M. kansasii (ATCC 12478) ATCC 16. M. marinum (ATCC 927) ATCC 17. M. scrofulaceum (ATCC 19275) ATCC 18. M. intercellulare (ATCC 15985) ATCC 19. M. fortuitum (ATCC 6841) ATCC 20. M. xenopi Isolated from a Danish patient 21. M. flavescens Isolated from a Danish patient 22. M. szulgai Isolated from a Danish patient 23. M. terrae SSIc 24. E. coli SSI d 25. S. aureus SSId -
TABLE 11 Sequence of the mpt51, cfp7 and cfp9 oligonucleotides. Orientation and Positionb oligonucleotide Sequences (5′ → 3′)a (nucleotides) Sense MPT51-1 CTCGAATTCGCCGGGTGCACACAG 6-21 (SEQ ID NO: 28) (SEQ ID NO: 41) MPT51-3 CTCGAATTCGCCCCATACGAGAAC 143-158 (SEQ ID NO: 29) (SEQ ID NO: 41) MPT51-5 GTGTATCTGCTGGAC 228-242 (SEQ ID NO: 30) (SEQ ID NO: 41) MPT51-7 CCGACTGGCTGGCCG 418-432 (SEQ ID NO: 31) (SEQ ID NO: 41) pvR1 GTACGAGAATTCATGTCGCAAATCATG 91-105 (SEQ ID NO: 35) (SEQ ID NO: 1) pvR2 GTACGAGAATTCGAGCTTGGGGTGCCG 168-181 (SEQ ID NO: 36) (SEQ ID NO: 1) stR3 CGATTCCAAGCTTGTGGCCGCCGACCCG 141-155 (SEQ ID NO: 37) (SEQ ID NO: 3) Antisense MPT51-2 GAGGAATTCGCTTAGCGGATCGCA 946-932 (SEQ ID NO: 32) (SEQ ID NO: 41) MPT51-4 CCCACATTCCGTTGG 642-628 (SEQ ID NO: 33) (SEQ ID NO: 41) MPT51-6 GTCCAGCAGATACAC 242-228 (SEQ ID NO: 34) (SEQ ID NO: 41) pvF1 CGTTAGGGATCCTCATCGCCATGGTGTTGG 340-323 (SEQ ID NO: 38) (SEQ ID NO: 1) pvF3 CGTTAGGGATCCGGTTCCACTGTGCC 268-255 (SEQ ID NO: 39) (SEQ ID NO: 1) stF1 CGTTAGGGATCCTCAGGTCTTTTCGATG 467-452 (SEQ ID NO: 40) (SEQ ID NO: 3) - The Southern blotting was carried out as described previously (Oettinger and Andersen, 1994) with the following modifications: 2 μg of genomic DNA was digested with PvuII, electrophoresed in an 0.8% agarose gel, and transferred onto a nylon membrane (Hybond N-plus; Amersham International plc, Little Chalfont, United Kingdom) with a vacuum transfer device (Milliblot, TM-v; Millipore Corp., Bedford, Mass.). The cfp7, cfp9, mpt51, rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b gene fragments were amplified by PCR from the plasmids pRVN01, pRVN02, pT052, pT087, pT088, pT089, pT090, pT091, pT096 or pT098 by using the primers shown in TABLE 11 and TABLE 2 (in Example 2a). The probes were labelled non-radioactively with an enhanced chemiluminescence kit (ECL; Amersham International plc, Little Chalfont, United Kingdom). Hybridization and detection was performed according to the instructions provided by the manufacturer. The results are summarized in TABLES 12 and 13.
TABLE 12 Interspecies analysis of the cfp7, cfp9 and mpt51 genes by PCR and/or Southern blotting and of MPT51 protein by Western blotting. Western PCR Southern blot blot Species and strain cfp7 cfp9 mpt51 cfp7 cfp9 mpt51 MPT51 1. M. tub. H37Rv + + + + + + + 2. M. tub. H37Ra + + + N.D. N.D. + + 3. M. tub. Erdmann + + + + + + + 4. M. bovis + + + + + 5. M. bovis BCG Danish + + + + + + + 1331 6. M. bovis BCG + + N.D. + + + N.D. Japan 7. M. bovis BCG + + N.D. + + N.D. N.D. Chinese 8. M. bovis BCG Canadian + + N.D. + + N.D. N.D. 9. M. bovis BCG + + N.D. + + N.D. N.D. Glaxo 10. M. bovis BCG + + N.D. + + N.D. N.D. Russia 11. M. bovis BCG + + N.D. + + N.D. N.D. Pasteur 12. M. africanum + + + + + + + 13. M. leprae − − − − − − − 14. M. avium + + − + + + − 15. M. kansasii + − − + + + − 16. M. marinum − (+) − + + + − 17. M. scrofulaceum − − − − − − − 18. M. intercellulare + (+) − + + + − 19. M. fortuitum − − − − − − − 20. M. flavescens + (+) − + + + N.D. 21. M. xenopi − − − N.D. N.D. + − 22. M. szulgai (+) (+) − − + − − 23. M. terrae − − N.D. N.D. N.D. N.D. N.D. - cfp7, cfp9 and mpt51 were found in theM. tuberculosis complex including BCG and the environmental mycobacteria; M. avium, M. kansasii, M. marinum, M. intracellular and M. flavescens. cfp9 was additionally found in M. szulgai and mpt51 in M. xenopi.
- Furthermore the presence of native MPT51 in culture filtrates from different mycobacterial strains was investigated with western blots developed with Mab HBT4.
- There is a strong band at around 26 kDa inM. tuberculosis H37Rv, Ra, Erdman, M. bovis AN5, M. bovis BCG substrain Danish 1331 and M. africanum. No band was seen in the region in any other tested mycobacterial strains.
TABLE 13a Interspecies analysis of the rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b genes by Southern blotting. Species and strain rd1-orf2 rd1-orf3 rd1-orf4 rd1-orf5 rd1-orf8 rd1-orf9a rd1- orf9b 1. M. tub. H37Rv + + + + + + + 2. M. bovis + + + + N.D. + + 3. M. bovis BCG + − − − N.D. − − Danish 1331 4. M. bovis + − − − N.D. − − BCG Japan 5. M. avium − − − − N.D. − − 6. M. kansasii − − − − N.D. − − 7. M. marinum + − + − N.D. − − 8. M. scrofulaceum + − − − N.D. − − 9. M. intercellulare − − − − N.D. − − 10. M. fortuitum − − − − N.D. − − 11. M. xenopi − − − − N.D. − − 12. M. szulgai + − − − N.D. − − - Positive results for rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b were only obtained when using genomic DNA fromM. tuberculosis and M. bovis, and not from M. bovis BCG or other mycobacteria analyzed except rd1-orf4 which also was found in M. marinum.
- Presence of cfp7a, cfp7b, cfp10a, cfp17, cfp20, cfp21, cfp22. cfp22a, cfp23, cfp25 and cfp25a in Different Mycobacterial Species.
- Southern blotting was carried out as described for rd1-orf2, rd1-orf3, rd1-orf4, rd1-orf5, rd1-orf8, rd1-orf9a and rd1-orf9b. The cfp7a, cfp7b, cfp10a, cfp17, cfp20, cfp2l1, cfp22, cfp22a, cfp23, cfp25 and cfp25a gene fragments were amplified by PCR from the recombinant pMCT6 plasmids encoding the individual genes. The primers used (same as the primers used for cloning) are described in example 3, 3A and 3B. The results are summarized in Table 13b.
TABLE 13b Interspecies analysis of the cfp7a, cfp7b, cfp10a, cfp17, cfp20, cfp21, cfp22, cfp22a, cfp23, cfp25, and cfp25a genes by Southern blotting. Species and strain cfp7a cfp7b cfp10a cfp17 cfp20 cfp21 cfp22 cfp22a cfp23 cfp25 cfp25a 1. M. tub. H37Rv + + + + + + + + + + + 2. M. bovis + + + + + + + + + + + 3. M. bovis BCG + + + + + N.D. + + + + + Danish 1331 4. M. bovis + + + + + + + + + + + BCG Japan 5. M. avium + N.D. − + − + + + + + − 6. M. kansasii − N.D. + − − − + − + − − 7. M. marinum + + − + + + + + + + + 8. M. scrofulaceum − − + − + + − + + + − 9. M. intercellulare + + − + − + + − + + − 10. M. fortuitum − N.D. − − − − − − + − − 11. M. xenopi + + + + + + + + + + + 12. M. szulgai + + − + + + + + + + + - Table X: Sensitivity of the ORF3 Antigens in African Populations Divided Based on HIV Infections Status and Smear Result
Sens (%) Sens (%) Sens (%) (n = 72) (n = 79) (n = 13) Smear + + − HIV − + − RD1-ORF3 59% 49% 70% 38 kDa antigen 29% 15% 36% - In high endemic countries a serodiagnostic test would be ideal to supplement or replace current TB diagnostic approaches and in particular is smear negative TB patients and patients co-infected with HIV very difficult to diagnose with the traditional methods. We therefore evaluated the performance of the RD1-ORF3 antigen in sera collected from African patients groups including patients with a HIV co-infection and smear negative patients and the results were compared to the well known serodiagnostic antigen 38 kDa. As seen in table×RD1-ORF3 was recognized by 59% of smear positive TB patients but most importantly almost 50% of the HIV positive patients (compared to 15% for 38 kDa antigen), and as many as 70% of the smear negative patients (compared to 36% for 38 kDa antigen).
- This demonstrates the superior qualities of RD1-ORF3 antigen as a serodiagnostic reagent also to be used in TB patients groups that traditionally is difficult to diagnose.
- List of References
- Andersen, P. and Heron, I. 1993, J. Immunol. Methods 161: 29-39.
- Andersen, A. B. et al., 1992, Infect. Immuno. 60: 2317-2323.
- Andersen, P., 1994, Infect. Immun. 62: 2536-44.
- Barkholt, V. and Jensen A. L., 1989, Anal. Biochem. 177: 318-322.
- Borodovsky, M., and J. McIninch. 1993, Computers Chem. 17: 123-133.
- van Dyke M. W. et al., 1992. Gene pp. 99-104.
- Gosselin et al., 1992, J. Immunol. 149: 3477-3481.
- Harboe, M. et al., 1996, Infect. Immun. 64: 16-22.
- von Heijne, G., 1984, J. Mol. Bio. 173: 243-251.
- Hochstrasser, D. F. et al., 1988, Anal. Biochem. 173: 243-251.
- Köhler, G. and Milstein, C. 1975, Nature 256: 495-497.
- Li, H. et al., 1993, Infect. Immun. 61: 1730-1734.
- Lindblad, E. B. et al., 1997, Infect. Immun. 65: 623-629.
- Mahairas, G. G. et al., 1996, J. Bacteroil 178: 1274-1282.
- Manaitis, T. et al., 1989 “Molecular cloning: a laboratory manual”, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.
- Nagai, S. et al., 1991, Infect. Immun. 59: 372-382.
- Oettinger, T. and Andersen, A. B., 1994, Infect. Immun. 62: 2058-2064.
- Ohara, N. et al., 1995, Scand. 3. Immunol. 41: 233-442.
- Pal, P. G. and Horwitz, M. A., 1992, Infect. Immun. 60: 4781-92.
- Pearson, W. r. and Lipman, D. J., 1988, Proc. Natl. Acad. Sci. USA 85: 2444-2448.
- Ploug, M. et al., 1989, Anal. Biochem. 181: 33-39.
- Porath, J. et al., 1985 FEBS Lett. 185: 306-310.
- Roberts, A. D. et al., 1995, Immunol. 85: 502-508.
- Sorensen, A. L. et al., 1995, Infect. Immun. 63: 1710-1717.
- Theisen, M. et al., 1995, Clinical and Diagnostic Laboratory Immunology, 2: 30-34.
- Valdés-Stauber, N. and Scherer, S., 1994, Appl. Environ. Microbiol. 60: 3809-3814.
- Valdés-Stauber, N. and Scherer, S., 1996, Appl. Environ. Microbiol. 62: 1283-1286.
- Williams, N., 1996, Science 272: 27.
- Young, R. A. et al., 1985, Proc. Natl. Acad. Sci. USA 82: 2583-2587.
-
0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 173 <210> SEQ ID NO 1 <211> LENGTH: 381 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 1 ggccgccggt acctatgtgg ccgccgatgc tgcggacgcg tcgacctata ccgggttctg 60 atcgaaccct gctgaccgag aggacttgtg atgtcgcaaa tcatgtacaa ctaccccgcg 120 atgttgggtc acgccgggga tatggccgga tatgccggca cgctgcagag cttgggtgcc 180 gagatcgccg tggagcaggc cgcgttgcag agtgcgtggc agggcgatac cgggatcacg 240 tatcaggcgt ggcaggcaca gtggaaccag gccatggaag atttggtgcg ggcctatcat 300 gcgatgtcca gcacccatga agccaacacc atggcgatga tggcccgcga caccgccgaa 360 gccgccaaat ggggcggcta g 381 <210> SEQ ID NO 2 <211> LENGTH: 96 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 2 Met Ser Gln Ile Met Tyr Asn Tyr Pro Ala Met Leu Gly His Ala Gly 1 5 10 15 Asp Met Ala Gly Tyr Ala Gly Thr Leu Gln Ser Leu Gly Ala Glu Ile 20 25 30 Ala Val Glu Gln Ala Ala Leu Gln Ser Ala Trp Gln Gly Asp Thr Gly 35 40 45 Ile Thr Tyr Gln Ala Trp Gln Ala Gln Trp Asn Gln Ala Met Glu Asp 50 55 60 Leu Val Arg Ala Tyr His Ala Met Ser Ser Thr His Glu Ala Asn Thr 65 70 75 80 Met Ala Met Met Ala Arg Asp Thr Ala Glu Ala Ala Lys Trp Gly Gly 85 90 95 <210> SEQ ID NO 3 <211> LENGTH: 467 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 3 gggtagccgg accacggctg ggcaaagatg tgcaggccgc catcaaggcg gtcaaggccg 60 gcgacggcgt cataaacccg gacggcacct tgttggcggg ccccgcggtg ctgacgcccg 120 acgagtacaa ctcccggctg gtggccgccg acccggagtc caccgcggcg ttgcccgacg 180 gcgccgggct ggtcgttctg gatggcaccg tcactgccga actcgaagcc gagggctggg 240 ccaaagatcg catccgcgaa ctgcaagagc tgcgtaagtc gaccgggctg gacgtttccg 300 accgcatccg ggtggtgatg tcggtgcctg cggaacgcga agactgggcg cgcacccatc 360 gcgacctcat tgccggagaa atcttggcta ccgacttcga attcgccgac ctcgccgatg 420 gtgtggccat cggcgacggc gtgcgggtaa gcatcgaaaa gacctga 467 <210> SEQ ID NO 4 <211> LENGTH: 108 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 4 Met Ala Ala Asp Pro Glu Ser Thr Ala Ala Leu Pro Asp Gly Ala Gly 1 5 10 15 Leu Val Val Leu Asp Gly Thr Val Thr Ala Glu Leu Glu Ala Glu Gly 20 25 30 Trp Ala Lys Asp Arg Ile Arg Glu Leu Gln Glu Leu Arg Lys Ser Thr 35 40 45 Gly Leu Asp Val Ser Asp Arg Ile Arg Val Val Met Ser Val Pro Ala 50 55 60 Glu Arg Glu Asp Trp Ala Arg Thr His Arg Asp Leu Ile Ala Gly Glu 65 70 75 80 Ile Leu Ala Thr Asp Phe Glu Phe Ala Asp Leu Ala Asp Gly Val Ala 85 90 95 Ile Gly Asp Gly Val Arg Val Ser Ile Glu Lys Thr 100 105 <210> SEQ ID NO 5 <211> LENGTH: 889 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 5 cgggtctgca cggatccggg ccgggcaggg caatcgagcc tgggatccgc tggggtgcgc 60 acatcgcgga cccgtgcgcg gtacggtcga gacagcggca cgagaaagta gtaagggcga 120 taataggcgg taaagagtag cgggaagccg gccgaacgac tcggtcagac aacgccacag 180 cggccagtga ggagcagcgg gtgacggaca tgaacccgga tattgagaag gaccagacct 240 ccgatgaagt cacggtagag acgacctccg tcttccgcgc agacttcctc agcgagctgg 300 acgctcctgc gcaagcgggt acggagagcg cggtctccgg ggtggaaggg ctcccgccgg 360 gctcggcgtt gctggtagtc aaacgaggcc ccaacgccgg gtcccggttc ctactcgacc 420 aagccatcac gtcggctggt cggcatcccg acagcgacat atttctcgac gacgtgaccg 480 tgagccgtcg ccatgctgaa ttccggttgg aaaacaacga attcaatgtc gtcgatgtcg 540 ggagtctcaa cggcacctac gtcaaccgcg agcccgtgga ttcggcggtg ctggcgaacg 600 gcgacgaggt ccagatcggc aagttccggt tggtgttctt gaccggaccc aagcaaggcg 660 aggatgacgg gagtaccggg ggcccgtgag cgcacccgat agccccgcgc tggccgggat 720 gtcgatcggg gcggtcctcg acctgctacg accggatttt cctgatgtca ccatctccaa 780 gattcgattc ttggaggctg agggtctggt gacgccccgg cgggcctcat cggggtatcg 840 gcggttcacc gcatacgact gcgcacggct gcgattcatt ctcactgcc 889 <210> SEQ ID NO 6 <211> LENGTH: 162 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 6 Met Thr Asp Met Asn Pro Asp Ile Glu Lys Asp Gln Thr Ser Asp Glu 1 5 10 15 Val Thr Val Glu Thr Thr Ser Val Phe Arg Ala Asp Phe Leu Ser Glu 20 25 30 Leu Asp Ala Pro Ala Gln Ala Gly Thr Glu Ser Ala Val Ser Gly Val 35 40 45 Glu Gly Leu Pro Pro Gly Ser Ala Leu Leu Val Val Lys Arg Gly Pro 50 55 60 Asn Ala Gly Ser Arg Phe Leu Leu Asp Gln Ala Ile Thr Ser Ala Gly 65 70 75 80 Arg His Pro Asp Ser Asp Ile Phe Leu Asp Asp Val Thr Val Ser Arg 85 90 95 Arg His Ala Glu Phe Arg Leu Glu Asn Asn Glu Phe Asn Val Val Asp 100 105 110 Val Gly Ser Leu Asn Gly Thr Tyr Val Asn Arg Glu Pro Val Asp Ser 115 120 125 Ala Val Leu Ala Asn Gly Asp Glu Val Gln Ile Gly Lys Phe Arg Leu 130 135 140 Val Phe Leu Thr Gly Pro Lys Gln Gly Glu Asp Asp Gly Ser Thr Gly 145 150 155 160 Gly Pro <210> SEQ ID NO 7 <211> LENGTH: 898 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 7 tcgactccgg cgccaccggg caggatcacg gtgtcgacgg ggtcgccggg gaatcccacg 60 ataaccactc ttcgcgccat gaatgccagt gttggccagg cgctggcctg gcgtccacgc 120 cacacaccgc acagattagg acacgccggc ggcgcagccc tgcccgaaag accgtgcacc 180 ggtcttggca gactgtgccc atggcacaga taaccctgcg aggaaacgcg atcaataccg 240 tcggtgagct acctgctgtc ggatccccgg ccccggcctt caccctgacc gggggcgatc 300 tgggggtgat cagcagcgac cagttccggg gtaagtccgt gttgctgaac atctttccat 360 ccgtggacac accggtgtgc gcgacgagtg tgcgaacctt cgacgagcgt gcggcggcaa 420 gtggcgctac cgtgctgtgt gtctcgaagg atctgccgtt cgcccagaag cgcttctgcg 480 gcgccgaggg caccgaaaac gtcatgcccg cgtcggcatt ccgggacagc ttcggcgagg 540 attacggcgt gaccatcgcc gacgggccga tggccgggct gctcgcccgc gcaatcgtgg 600 tgatcggcgc ggacggcaac gtcgcctaca cggaattggt gccggaaatc gcgcaagaac 660 ccaactacga agcggcgctg gccgcgctgg gcgcctaggc tttcacaagc cccgcgcgtt 720 cggcgagcag cgcacgattt cgagcgctgc tcccgaaaag cgcctcggtg gtcttggccc 780 ggcggtaata caggtgcagg tcgtgctccc acgtgaaggc gatggcaccg tggatctgaa 840 gagcggagcc ggcgcataac acaaaggttt ccgcggtctg cgccttcgcc agcggcgc 898 <210> SEQ ID NO 8 <211> LENGTH: 165 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 8 Met Ala Gln Ile Thr Leu Arg Gly Asn Ala Ile Asn Thr Val Gly Glu 1 5 10 15 Leu Pro Ala Val Gly Ser Pro Ala Pro Ala Phe Thr Leu Thr Gly Gly 20 25 30 Asp Leu Gly Val Ile Ser Ser Asp Gln Phe Arg Gly Lys Ser Val Leu 35 40 45 Leu Asn Ile Phe Pro Ser Val Asp Thr Pro Val Cys Ala Thr Ser Val 50 55 60 Arg Thr Phe Asp Glu Arg Ala Ala Ala Ser Gly Ala Thr Val Leu Cys 65 70 75 80 Val Ser Lys Asp Leu Pro Phe Ala Gln Lys Arg Phe Cys Gly Ala Glu 85 90 95 Gly Thr Glu Asn Val Met Pro Ala Ser Ala Phe Arg Asp Ser Phe Gly 100 105 110 Glu Asp Tyr Gly Val Thr Ile Ala Asp Gly Pro Met Ala Gly Leu Leu 115 120 125 Ala Arg Ala Ile Val Val Ile Gly Ala Asp Gly Asn Val Ala Tyr Thr 130 135 140 Glu Leu Val Pro Glu Ile Ala Gln Glu Pro Asn Tyr Glu Ala Ala Leu 145 150 155 160 Ala Ala Leu Gly Ala 165 <210> SEQ ID NO 9 <211> LENGTH: 1054 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 9 ataatcagct caccgttggg accgacctcg accaggggtc ctttgtgact gccgggcttg 60 acgcggacga ccacagagtc ggtcatcgcc taaggctacc gttctgacct ggggctgcgt 120 gggcgccgac gacgtgaggc acgtcatgtc tcagcggccc accgccacct cggtcgccgg 180 cagtatgtca gcatgtgcag atgactccac gcagccttgt tcgcatcgtt ggtgtcgtgg 240 ttgcgacgac cttggcgctg gtgagcgcac ccgccggcgg tcgtgccgcg catgcggatc 300 cgtgttcgga catcgcggtc gttttcgctc gcggcacgca tcaggcttct ggtcttggcg 360 acgtcggtga ggcgttcgtc gactcgctta cctcgcaagt tggcgggcgg tcgattgggg 420 tctacgcggt gaactaccca gcaagcgacg actaccgcgc gagcgcgtca aacggttccg 480 atgatgcgag cgcccacatc cagcgcaccg tcgccagctg cccgaacacc aggattgtgc 540 ttggtggcta ttcgcagggt gcgacggtca tcgatttgtc cacctcggcg atgccgcccg 600 cggtggcaga tcatgtcgcc gctgtcgccc ttttcggcga gccatccagt ggtttctcca 660 gcatgttgtg gggcggcggg tcgttgccga caatcggtcc gctgtatagc tctaagacca 720 taaacttgtg tgctcccgac gatccaatat gcaccggagg cggcaatatt atggcgcatg 780 tttcgtatgt tcagtcgggg atgacaagcc aggcggcgac attcgcggcg aacaggctcg 840 atcacgccgg atgatcaaag actgttgtcc ctataccgct ggggctgtag tcgatgtaca 900 ccggctggaa tctgaagggc aagaacccgg tattcatcag gccggatgaa atgacggtcg 960 ggcggtaatc gtttgtgttg aacgcgtaga gccgatcacc gccggggctg gtgtagacct 1020 caatgtttgt gttcgccggc agggttccgg atcc 1054 <210> SEQ ID NO 10 <211> LENGTH: 217 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 10 Met Thr Pro Arg Ser Leu Val Arg Ile Val Gly Val Val Val Ala Thr 1 5 10 15 Thr Leu Ala Leu Val Ser Ala Pro Ala Gly Gly Arg Ala Ala His Ala 20 25 30 Asp Pro Cys Ser Asp Ile Ala Val Val Phe Ala Arg Gly Thr His Gln 35 40 45 Ala Ser Gly Leu Gly Asp Val Gly Glu Ala Phe Val Asp Ser Leu Thr 50 55 60 Ser Gln Val Gly Gly Arg Ser Ile Gly Val Tyr Ala Val Asn Tyr Pro 65 70 75 80 Ala Ser Asp Asp Tyr Arg Ala Ser Ala Ser Asn Gly Ser Asp Asp Ala 85 90 95 Ser Ala His Ile Gln Arg Thr Val Ala Ser Cys Pro Asn Thr Arg Ile 100 105 110 Val Leu Gly Gly Tyr Ser Gln Gly Ala Thr Val Ile Asp Leu Ser Thr 115 120 125 Ser Ala Met Pro Pro Ala Val Ala Asp His Val Ala Ala Val Ala Leu 130 135 140 Phe Gly Glu Pro Ser Ser Gly Phe Ser Ser Met Leu Trp Gly Gly Gly 145 150 155 160 Ser Leu Pro Thr Ile Gly Pro Leu Tyr Ser Ser Lys Thr Ile Asn Leu 165 170 175 Cys Ala Pro Asp Asp Pro Ile Cys Thr Gly Gly Gly Asn Ile Met Ala 180 185 190 His Val Ser Tyr Val Gln Ser Gly Met Thr Ser Gln Ala Ala Thr Phe 195 200 205 Ala Ala Asn Arg Leu Asp His Ala Gly 210 215 <210> SEQ ID NO 11 <211> LENGTH: 949 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 11 agccgctcgc gtggggtcaa ccgggtttcc acctgctcac tcattttgcc gcctttctgt 60 gtccgggccg aggcttgcgc tcaataactc ggtcaagttc cttcacagac tgccatcact 120 ggcccgtcgg cgggctcgtt gcgggtgcgc cgcgtgcggg tttgtgttcc gggcaccggg 180 tgggggcccg cccgggcgta atggcagact gtgattccgt gactaacagc ccccttgcga 240 ccgctaccgc cacgctgcac actaaccgcg gcgacatcaa gatcgccctg ttcggaaacc 300 atgcgcccaa gaccgtcgcc aattttgtgg gccttgcgca gggcaccaag gactattcga 360 cccaaaacgc atcaggtggc ccgtccggcc cgttctacga cggcgcggtc tttcaccggg 420 tgatccaggg cttcatgatc cagggtggcg atccaaccgg gacgggtcgc ggcggacccg 480 gctacaagtt cgccgacgag ttccaccccg agctgcaatt cgacaagccc tatctgctcg 540 cgatggccaa cgccggtccg ggcaccaacg gctcacagtt tttcatcacc gtcggcaaga 600 ctccgcacct gaaccggcgc cacaccattt tcggtgaagt gatcgacgcg gagtcacagc 660 gggttgtgga ggcgatctcc aagacggcca ccgacggcaa cgatcggccg acggacccgg 720 tggtgatcga gtcgatcacc atctcctgac ccgaagctac gtcggctcgt cgctcgaata 780 caccttgtgg acccgccagg gcacgtggcg gtacaccgac acgccgttgg ggccgttcaa 840 ccggacgccc tcacgccaag tccgctcacc tttggccgcg accggcgtaa ccggcagcgg 900 taagcgcatc gagcacctcc actgggtcgg tgccgagatc ccagcggga 949 <210> SEQ ID NO 12 <211> LENGTH: 182 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 12 Met Ala Asp Cys Asp Ser Val Thr Asn Ser Pro Leu Ala Thr Ala Thr 1 5 10 15 Ala Thr Leu His Thr Asn Arg Gly Asp Ile Lys Ile Ala Leu Phe Gly 20 25 30 Asn His Ala Pro Lys Thr Val Ala Asn Phe Val Gly Leu Ala Gln Gly 35 40 45 Thr Lys Asp Tyr Ser Thr Gln Asn Ala Ser Gly Gly Pro Ser Gly Pro 50 55 60 Phe Tyr Asp Gly Ala Val Phe His Arg Val Ile Gln Gly Phe Met Ile 65 70 75 80 Gln Gly Gly Asp Pro Thr Gly Thr Gly Arg Gly Gly Pro Gly Tyr Lys 85 90 95 Phe Ala Asp Glu Phe His Pro Glu Leu Gln Phe Asp Lys Pro Tyr Leu 100 105 110 Leu Ala Met Ala Asn Ala Gly Pro Gly Thr Asn Gly Ser Gln Phe Phe 115 120 125 Ile Thr Val Gly Lys Thr Pro His Leu Asn Arg Arg His Thr Ile Phe 130 135 140 Gly Glu Val Ile Asp Ala Glu Ser Gln Arg Val Val Glu Ala Ile Ser 145 150 155 160 Lys Thr Ala Thr Asp Gly Asn Asp Arg Pro Thr Asp Pro Val Val Ile 165 170 175 Glu Ser Ile Thr Ile Ser 180 <210> SEQ ID NO 13 <211> LENGTH: 1060 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 13 tggaccttca ccggcggtcc cttcgcttcg ggggcgacac ctaacatact ggtcgtcaac 60 ctaccgcgac accgctggga ctttgtgcca ttgccggcca ctcggggccg ctgcggcctg 120 gaaaaattgg tcgggcacgg gcggccgcgg gtcgctacca tcccactgtg aatgatttac 180 tgacccgccg actgctcacc atgggcgcgg ccgccgcaat gctggccgcg gtgcttctgc 240 ttactcccat caccgttccc gccggctacc ccggtgccgt tgcaccggcc actgcagcct 300 gccccgacgc cgaagtggtg ttcgcccgcg gccgcttcga accgcccggg attggcacgg 360 tcggcaacgc attcgtcagc gcgctgcgct cgaaggtcaa caagaatgtc ggggtctacg 420 cggtgaaata ccccgccgac aatcagatcg atgtgggcgc caacgacatg agcgcccaca 480 ttcagagcat ggccaacagc tgtccgaata cccgcctggt gcccggcggt tactcgctgg 540 gcgcggccgt caccgacgtg gtactcgcgg tgcccaccca gatgtggggc ttcaccaatc 600 ccctgcctcc cggcagtgat gagcacatcg ccgcggtcgc gctgttcggc aatggcagtc 660 agtgggtcgg ccccatcacc aacttcagcc ccgcctacaa cgatcggacc atcgagttgt 720 gtcacggcga cgaccccgtc tgccaccctg ccgaccccaa cacctgggag gccaactggc 780 cccagcacct cgccggggcc tatgtctcgt cgggcatggt caaccaggcg gctgacttcg 840 ttgccggaaa gctgcaatag ccacctagcc cgtgcgcgag tctttgcttc acgctttcgc 900 taaccgacca acgcgcgcac gatggagggg tccgtggtca tatcaagaca agaagggagt 960 aggcgatgca cgcaaaagtc ggcgactacc tcgtggtgaa gggcacaacc acggaacggc 1020 atgatcaaca tgctgagatc atcgaggtgc gctccgcaga 1060 <210> SEQ ID NO 14 <211> LENGTH: 219 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 14 Met Gly Ala Ala Ala Ala Met Leu Ala Ala Val Leu Leu Leu Thr Pro 1 5 10 15 Ile Thr Val Pro Ala Gly Tyr Pro Gly Ala Val Ala Pro Ala Thr Ala 20 25 30 Ala Cys Pro Asp Ala Glu Val Val Phe Ala Arg Gly Arg Phe Glu Pro 35 40 45 Pro Gly Ile Gly Thr Val Gly Asn Ala Phe Val Ser Ala Leu Arg Ser 50 55 60 Lys Val Asn Lys Asn Val Gly Val Tyr Ala Val Lys Tyr Pro Ala Asp 65 70 75 80 Asn Gln Ile Asp Val Gly Ala Asn Asp Met Ser Ala His Ile Gln Ser 85 90 95 Met Ala Asn Ser Cys Pro Asn Thr Arg Leu Val Pro Gly Gly Tyr Ser 100 105 110 Leu Gly Ala Ala Val Thr Asp Val Val Leu Ala Val Pro Thr Gln Met 115 120 125 Trp Gly Phe Thr Asn Pro Leu Pro Pro Gly Ser Asp Glu His Ile Ala 130 135 140 Ala Val Ala Leu Phe Gly Asn Gly Ser Gln Trp Val Gly Pro Ile Thr 145 150 155 160 Asn Phe Ser Pro Ala Tyr Asn Asp Arg Thr Ile Glu Leu Cys His Gly 165 170 175 Asp Asp Pro Val Cys His Pro Ala Asp Pro Asn Thr Trp Glu Ala Asn 180 185 190 Trp Pro Gln His Leu Ala Gly Ala Tyr Val Ser Ser Gly Met Val Asn 195 200 205 Gln Ala Ala Asp Phe Val Ala Gly Lys Leu Gln 210 215 <210> SEQ ID NO 15 <211> LENGTH: 1198 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 15 cagatgctgc gcaacatgtt tctcggcgat ccggcaggca acaccgatcg agtgcttgac 60 ttttccaccg cggtgaccgg cggactgttc ttctcaccca ccatcgactt tctcgaccat 120 ccaccgcccc taccgcaggc ggcgacgcca actctggcag ccgggtcgct atcgatcggc 180 agcttgaaag gaagcccccg atgaacaatc tctaccgcga tttggcaccg gtcaccgaag 240 ccgcttgggc ggaaatcgaa ttggaggcgg cgcggacgtt caagcgacac atcgccgggc 300 gccgggtggt cgatgtcagt gatcccgggg ggcccgtcac cgcggcggtc agcaccggcc 360 ggctgatcga tgttaaggca ccaaccaacg gcgtgatcgc ccacctgcgg gccagcaaac 420 cccttgtccg gctacgggtt ccgtttaccc tgtcgcgcaa cgagatcgac gacgtggaac 480 gtggctctaa ggactccgat tgggaaccgg taaaggaggc ggccaagaag ctggccttcg 540 tcgaggaccg cacaatattc gaaggctaca gcgccgcatc aatcgaaggg atccgcagcg 600 cgagttcgaa cccggcgctg acgttgcccg aggatccccg tgaaatccct gatgtcatct 660 cccaggcatt gtccgaactg cggttggccg gtgtggacgg accgtattcg gtgttgctct 720 ctgctgacgt ctacaccaag gttagcgaga cttccgatca cggctatccc atccgtgagc 780 atctgaaccg gctggtggac ggggacatca tttgggcccc ggccatcgac ggcgcgttcg 840 tgctgaccac tcgaggcggc gacttcgacc tacagctggg caccgacgtt gcaatcgggt 900 acgccagcca cgacacggac accgagcgcc tctacctgca ggagacgctg acgttccttt 960 gctacaccgc cgaggcgtcg gtcgcgctca gccactaagg cacgagcgcg agcaatagct 1020 cctatggcaa gcggccgcgg gttgggtgtg ttcggagctg ggctggtgga cggtgcgcag 1080 ggcctggaag acggtgcggg ctaggcggcg tttgaggcag cgtagtgctg cgcgtttggt 1140 tttcccggcg tcttgcagcc tttggtagta ggcctggccc cggctgtcgg tcatccgg 1198 1210> SEQ ID NO 16 <211> LENGTH: 265 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 16 Met Asn Asn Leu Tyr Arg Asp Leu Ala Pro Val Thr Glu Ala Ala Trp 1 5 10 15 Ala Glu Ile Glu Leu Glu Ala Ala Arg Thr Phe Lys Arg His Ile Ala 20 25 30 Gly Arg Arg Val Val Asp Val Ser Asp Pro Gly Gly Pro Val Thr Ala 35 40 45 Ala Val Ser Thr Gly Arg Leu Ile Asp Val Lys Ala Pro Thr Asn Gly 50 55 60 Val Ile Ala His Leu Arg Ala Ser Lys Pro Leu Val Arg Leu Arg Val 65 70 75 80 Pro Phe Thr Leu Ser Arg Asn Glu Ile Asp Asp Val Glu Arg Gly Ser 85 90 95 Lys Asp Ser Asp Trp Glu Pro Val Lys Glu Ala Ala Lys Lys Leu Ala 100 105 110 Phe Val Glu Asp Arg Thr Ile Phe Glu Gly Tyr Ser Ala Ala Ser Ile 115 120 125 Glu Gly Ile Arg Ser Ala Ser Ser Asn Pro Ala Leu Thr Leu Pro Glu 130 135 140 Asp Pro Arg Glu Ile Pro Asp Val Ile Ser Gln Ala Leu Ser Glu Leu 145 150 155 160 Arg Leu Ala Gly Val Asp Gly Pro Tyr Ser Val Leu Leu Ser Ala Asp 165 170 175 Val Tyr Thr Lys Val Ser Glu Thr Ser Asp His Gly Tyr Pro Ile Arg 180 185 190 Glu His Leu Asn Arg Leu Val Asp Gly Asp Ile Ile Trp Ala Pro Ala 195 200 205 Ile Asp Gly Ala Phe Val Leu Thr Thr Arg Gly Gly Asp Phe Asp Leu 210 215 220 Gln Leu Gly Thr Asp Val Ala Ile Gly Tyr Ala Ser His Asp Thr Asp 225 230 235 240 Thr Glu Arg Leu Tyr Leu Gln Glu Thr Leu Thr Phe Leu Cys Tyr Thr 245 250 255 Ala Glu Ala Ser Val Ala Leu Ser His 260 265 <210> SEQ ID NO 17 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (1) <223> OTHER INFORMATION: Ala is Ala or Ser <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (13) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 17 Ala Glu Leu Asp Ala Pro Ala Gln Ala Gly Thr Glu Xaa Ala Val 1 5 10 15 <210> SEQ ID NO 18 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 18 Ala Gln Ile Thr Leu Arg Gly Asn Ala Ile Asn Thr Val Gly Glu 1 5 10 15 <210> SEQ ID NO 19 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (3) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 19 Asp Pro Xaa Ser Asp Ile Ala Val Val Phe Ala Arg Gly Thr His 1 5 10 15 <210> SEQ ID NO 20 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 20 Thr Asn Ser Pro Leu Ala Thr Ala Thr Ala Thr Leu His Thr Asn 1 5 10 15 <210> SEQ ID NO 21 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (2) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 21 Ala Xaa Pro Asp Ala Glu Val Val Phe Ala Arg Gly Arg Phe Glu 1 5 10 15 <210> SEQ ID NO 22 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (1) <223> OTHER INFORMATION: Xaa is unknown <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (2) <223> OTHER INFORMATION: Ile is Ile or Val <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (10) <223> OTHER INFORMATION: Val is Val or Thr <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (11) <223> OTHER INFORMATION: Val is Val or Phe <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (14) <223> OTHER INFORMATION: Asp is Asp or Gln <400> SEQUENCE: 22 Xaa Ile Gln Lys Ser Leu Glu Leu Ile Val Val Thr Ala Asp Glu 1 5 10 15 <210> SEQ ID NO 23 <211> LENGTH: 19 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 23 Met Asn Asn Leu Tyr Arg Asp Leu Ala Pro Val Thr Glu Ala Ala Trp 1 5 10 15 Ala Glu Ile <210> SEQ ID NO 24 <211> LENGTH: 34 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 24 cccggctcga gaacctstac cgcgacctsg cscc 34 <210> SEQ ID NO 25 <211> LENGTH: 37 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 25 gggccggatc cgasgcsgcg tccttsacsg gytgcca 37 <210> SEQ ID NO 26 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 26 ggaagcccca tatgaacaat ctctaccg 28 <210> SEQ ID NO 27 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 27 cgcgctcagc ccttagtgac tgagcgcgac cg 32 <210> SEQ ID NO 28 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 28 ctcgaattcg ccgggtgcac acag 24 <210> SEQ ID NO 29 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 29 ctcgaattcg cccccatacg agaac 25 <210> SEQ ID NO 30 <211> LENGTH: 15 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 30 gtgtatctgc tggac 15 <210> SEQ ID NO 31 <211> LENGTH: 15 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 31 ccgactggct ggccg 15 <210> SEQ ID NO 32 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 32 gaggaattcg cttagcggat cgca 24 <210> SEQ ID NO 33 <211> LENGTH: 15 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 33 cccacattcc gttgg 15 <210> SEQ ID NO 34 <211> LENGTH: 15 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 34 gtccagcaga tacac 15 <210> SEQ ID NO 35 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 35 gtacgagaat tcatgtcgca aatcatg 27 <210> SEQ ID NO 36 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 36 gtacgagaat tcgagcttgg ggtgccg 27 <210> SEQ ID NO 37 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 37 cgattccaag cttgtggccg ccgacccg 28 <210> SEQ ID NO 38 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 38 cgttagggat cctcatcgcc atggtgttgg 30 <210> SEQ ID NO 39 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 39 cgttagggat ccggttccac tgtgcc 26 <210> SEQ ID NO 40 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 40 cgttagggat cctcaggtct tttcgatg 28 <210> SEQ ID NO 41 <211> LENGTH: 952 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 41 gaattcgccg ggtgcacaca gccttacacg acggaggtgg acacatgaag ggtcggtcgg 60 cgctgctgcg ggcgctctgg attgccgcac tgtcattcgg gttgggcggt gtcgcggtag 120 ccgcggaacc caccgccaag gccgccccat acgagaacct gatggtgccg tcgccctcga 180 tgggccggga catcccggtg gccttcctag ccggtgggcc gcacgcggtg tatctgctgg 240 acgccttcaa cgccggcccg gatgtcagta actgggtcac cgcgggtaac gcgatgaaca 300 cgttggcggg caaggggatt tcggtggtgg caccggccgg tggtgcgtac agcatgtaca 360 ccaactggga gcaggatggc agcaagcagt gggacacctt cttgtccgct gagctgcccg 420 actggctggc cgctaaccgg ggcttggccc ccggtggcca tgcggccgtt ggcgccgctc 480 agggcggtta cggggcgatg gcgctggcgg ccttccaccc cgaccgcttc ggcttcgctg 540 gctcgatgtc gggctttttg tacccgtcga acaccaccac caacggtgcg atcgcggcgg 600 gcatgcagca attcggcggt gtggacacca acggaatgtg gggagcacca cagctgggtc 660 ggtggaagtg gcacgacccg tgggtgcatg ccagcctgct ggcgcaaaac aacacccggg 720 tgtgggtgtg gagcccgacc aacccgggag ccagcgatcc cgccgccatg atcggccaaa 780 ccgccgaggc gatgggtaac agccgcatgt tctacaacca gtatcgcagc gtcggcgggc 840 acaacggaca cttcgacttc ccagccagcg gtgacaacgg ctggggctcg tgggcgcccc 900 agctgggcgc tatgtcgggc gatatcgtcg gtgcgatccg ctaagcgaat tc 952 <210> SEQ ID NO 42 <211> LENGTH: 299 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 42 Met Lys Gly Arg Ser Ala Leu Leu Arg Ala Leu Trp Ile Ala Ala Leu 1 5 10 15 Ser Phe Gly Leu Gly Gly Val Ala Val Ala Ala Glu Pro Thr Ala Lys 20 25 30 Ala Ala Pro Tyr Glu Asn Leu Met Val Pro Ser Pro Ser Met Gly Arg 35 40 45 Asp Ile Pro Val Ala Phe Leu Ala Gly Gly Pro His Ala Val Tyr Leu 50 55 60 Leu Asp Ala Phe Asn Ala Gly Pro Asp Val Ser Asn Trp Val Thr Ala 65 70 75 80 Gly Asn Ala Met Asn Thr Leu Ala Gly Lys Gly Ile Ser Val Val Ala 85 90 95 Pro Ala Gly Gly Ala Tyr Ser Met Tyr Thr Asn Trp Glu Gln Asp Gly 100 105 110 Ser Lys Gln Trp Asp Thr Phe Leu Ser Ala Glu Leu Pro Asp Trp Leu 115 120 125 Ala Ala Asn Arg Gly Leu Ala Pro Gly Gly His Ala Ala Val Gly Ala 130 135 140 Ala Gln Gly Gly Tyr Gly Ala Met Ala Leu Ala Ala Phe His Pro Asp 145 150 155 160 Arg Phe Gly Phe Ala Gly Ser Met Ser Gly Phe Leu Tyr Pro Ser Asn 165 170 175 Thr Thr Thr Asn Gly Ala Ile Ala Ala Gly Met Gln Gln Phe Gly Gly 180 185 190 Val Asp Thr Asn Gly Met Trp Gly Ala Pro Gln Leu Gly Arg Trp Lys 195 200 205 Trp His Asp Pro Trp Val His Ala Ser Leu Leu Ala Gln Asn Asn Thr 210 215 220 Arg Val Trp Val Trp Ser Pro Thr Asn Pro Gly Ala Ser Asp Pro Ala 225 230 235 240 Ala Met Ile Gly Gln Thr Ala Glu Ala Met Gly Asn Ser Arg Met Phe 245 250 255 Tyr Asn Gln Tyr Arg Ser Val Gly Gly His Asn Gly His Phe Asp Phe 260 265 270 Pro Ala Ser Gly Asp Asn Gly Trp Gly Ser Trp Ala Pro Gln Leu Gly 275 280 285 Ala Met Ser Gly Asp Ile Val Gly Ala Ile Arg 290 295 <210> SEQ ID NO 43 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 43 gcaacacccg ggatgtcgca aatcatg 27 <210> SEQ ID NO 44 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 44 gtaacacccg gggtggccgc cgacccg 27 <210> SEQ ID NO 45 <211> LENGTH: 37 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 45 ctactaagct tggatcccta gccgccccat ttggcgg 37 <210> SEQ ID NO 46 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 46 ctactaagct tccatggtca ggtcttttcg atgcttac 38 <210> SEQ ID NO 47 <211> LENGTH: 450 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 47 gtgccgcgct ccccagggtt cttatggttc gatatacctg agtttgatgg aagtccgatg 60 accagcagtc agcatacggc atggccgaaa agagtggggt gatgatggcc gaggatgttc 120 gcgccgagat cgtggccagc gttctcgaag tcgttgtcaa cgaaggcgat cagatcgaca 180 agggcgacgt cgtggtgctg ctggagtcga tgaagatgga gatccccgtc ctggccgaag 240 ctgccggaac ggtcagcaag gtggcggtat cggtgggcga tgtcattcag gccggcgacc 300 ttatcgcggt gatcagctag tcgttgatag tcactcatgt ccacactcgg tgatctgctc 360 gccgaacaca cggtgctgcc gggcagcgcg gtggaccacc tgcatgcggt ggtcggggag 420 tggcagctcc ttgccgactt gtcgtttgcc 450 <210> SEQ ID NO 48 <211> LENGTH: 71 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 48 Met Ala Glu Asp Val Arg Ala Glu Ile Val Ala Ser Val Leu Glu Val 1 5 10 15 Val Val Asn Glu Gly Asp Gln Ile Asp Lys Gly Asp Val Val Val Leu 20 25 30 Leu Glu Ser Met Lys Met Glu Ile Pro Val Leu Ala Glu Ala Ala Gly 35 40 45 Thr Val Ser Lys Val Ala Val Ser Val Gly Asp Val Ile Gln Ala Gly 50 55 60 Asp Leu Ile Ala Val Ile Ser 65 70 <210> SEQ ID NO 49 <211> LENGTH: 750 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 49 gggtacccat cgatgggttg cggttcggca ccgaggtgct aacgcacttg ctgacacact 60 gctagtcgaa aacgaggcta gtcgcaacgt cgatcacacg agaggactga ccatgacaac 120 ttcacccgac ccgtatgccg cgctgcccaa gctgccgtcc ttcagcctga cgtcaacctc 180 gatcaccgat gggcagccgc tggctacacc ccaggtcagc gggatcatgg gtgcgggcgg 240 ggcggatgcc agtccgcagc tgaggtggtc gggatttccc agcgagaccc gcagcttcgc 300 ggtaaccgtc tacgaccctg atgcccccac cctgtccggg ttctggcact gggcggtggc 360 caacctgcct gccaacgtca ccgagttgcc cgagggtgtc ggcgatggcc gcgaactgcc 420 gggcggggca ctgacattgg tcaacgacgc cggtatgcgc cggtatgtgg gtgcggcgcc 480 gcctcccggt catggggtgc atcgctacta cgtcgcggta cacgcggtga aggtcgaaaa 540 gctcgacctc cccgaggacg cgagtcctgc atatctggga ttcaacctgt tccagcacgc 600 gattgcacga gcggtcatct tcggcaccta cgagcagcgt tagcgcttta gctgggttgc 660 cgacgtcttg ccgagccgac cgcttcgtgc agcgagccga acccgccgtc atgcagcctg 720 cgggcaatgc cttcatggat gtccttggcc 750 <210> SEQ ID NO 50 <211> LENGTH: 176 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 50 Met Thr Thr Ser Pro Asp Pro Tyr Ala Ala Leu Pro Lys Leu Pro Ser 1 5 10 15 Phe Ser Leu Thr Ser Thr Ser Ile Thr Asp Gly Gln Pro Leu Ala Thr 20 25 30 Pro Gln Val Ser Gly Ile Met Gly Ala Gly Gly Ala Asp Ala Ser Pro 35 40 45 Gln Leu Arg Trp Ser Gly Phe Pro Ser Glu Thr Arg Ser Phe Ala Val 50 55 60 Thr Val Tyr Asp Pro Asp Ala Pro Thr Leu Ser Gly Phe Trp His Trp 65 70 75 80 Ala Val Ala Asn Leu Pro Ala Asn Val Thr Glu Leu Pro Glu Gly Val 85 90 95 Gly Asp Gly Arg Glu Leu Pro Gly Gly Ala Leu Thr Leu Val Asn Asp 100 105 110 Ala Gly Met Arg Arg Tyr Val Gly Ala Ala Pro Pro Pro Gly His Gly 115 120 125 Val His Arg Tyr Tyr Val Ala Val His Ala Val Lys Val Glu Lys Leu 130 135 140 Asp Leu Pro Glu Asp Ala Ser Pro Ala Tyr Leu Gly Phe Asn Leu Phe 145 150 155 160 Gln His Ala Ile Ala Arg Ala Val Ile Phe Gly Thr Tyr Glu Gln Arg 165 170 175 <210> SEQ ID NO 51 <211> LENGTH: 800 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 51 tcatgaggtt catcggggtg atcccacgcc cgcagccgca ttcgggccgc tggcgagccg 60 gtgccgcacg ccgcctcacc agcctggtgg ccgccgcctt tgcggcggcc acactgttgc 120 ttacccccgc gctggcacca ccggcatcgg cgggctgccc ggatgccgag gtggtgttcg 180 cccgcggaac cggcgaacca cctggcctcg gtcgggtagg ccaagctttc gtcagttcat 240 tgcgccagca gaccaacaag agcatcggga catacggagt caactacccg gccaacggtg 300 atttcttggc cgccgctgac ggcgcgaacg acgccagcga ccacattcag cagatggcca 360 gcgcgtgccg ggccacgagg ttggtgctcg gcggctactc ccagggtgcg gccgtgatcg 420 acatcgtcac cgccgcacca ctgcccggcc tcgggttcac gcagccgttg ccgcccgcag 480 cggacgatca catcgccgcg atcgccctgt tcgggaatcc ctcgggccgc gctggcgggc 540 tgatgagcgc cctgacccct caattcgggt ccaagaccat caacctctgc aacaacggcg 600 acccgatttg ttcggacggc aaccggtggc gagcgcacct aggctacgtg cccgggatga 660 ccaaccaggc ggcgcgtttc gtcgcgagca ggatctaacg cgagccgccc catagattcc 720 ggctaagcaa cggctgcgcc gccgcccggc cacgagtgac cgccgccgac tggcacaccg 780 cttaccacgg ccttatgctg 800 <210> SEQ ID NO 52 <211> LENGTH: 226 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 52 Met Ile Pro Arg Pro Gln Pro His Ser Gly Arg Trp Arg Ala Gly Ala 1 5 10 15 Ala Arg Arg Leu Thr Ser Leu Val Ala Ala Ala Phe Ala Ala Ala Thr 20 25 30 Leu Leu Leu Thr Pro Ala Leu Ala Pro Pro Ala Ser Ala Gly Cys Pro 35 40 45 Asp Ala Glu Val Val Phe Ala Arg Gly Thr Gly Glu Pro Pro Gly Leu 50 55 60 Gly Arg Val Gly Gln Ala Phe Val Ser Ser Leu Arg Gln Gln Thr Asn 65 70 75 80 Lys Ser Ile Gly Thr Tyr Gly Val Asn Tyr Pro Ala Asn Gly Asp Phe 85 90 95 Leu Ala Ala Ala Asp Gly Ala Asn Asp Ala Ser Asp His Ile Gln Gln 100 105 110 Met Ala Ser Ala Cys Arg Ala Thr Arg Leu Val Leu Gly Gly Tyr Ser 115 120 125 Gln Gly Ala Ala Val Ile Asp Ile Val Thr Ala Ala Pro Leu Pro Gly 130 135 140 Leu Gly Phe Thr Gln Pro Leu Pro Pro Ala Ala Asp Asp His Ile Ala 145 150 155 160 Ala Ile Ala Leu Phe Gly Asn Pro Ser Gly Arg Ala Gly Gly Leu Met 165 170 175 Ser Ala Leu Thr Pro Gln Phe Gly Ser Lys Thr Ile Asn Leu Cys Asn 180 185 190 Asn Gly Asp Pro Ile Cys Ser Asp Gly Asn Arg Trp Arg Ala His Leu 195 200 205 Gly Tyr Val Pro Gly Met Thr Asn Gln Ala Ala Arg Phe Val Ala Ser 210 215 220 Arg Ile 225 <210> SEQ ID NO 53 <211> LENGTH: 700 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 53 ctaggaaagc ctttcctgag taagtattgc cttcgttgca taccgccctt tacctgcgtt 60 aatctgcatt ttatgacaga atacgaaggg cctaagacaa aattccacgc gttaatgcag 120 gaacagattc ataacgaatt cacagcggca caacaatatg tcgcgatcgc ggtttatttc 180 gacagcgaag acctgccgca gttggcgaag catttttaca gccaagcggt cgaggaacga 140 aaccatgcaa tgatgctcgt gcaacacctg ctcgaccgcg accttcgtgt cgaaattccc 100 ggcgtagaca cggtgcgaaa ccagttcgac agaccccgcg aggcactggc gctggcgctc 160 gatcaggaac gcacagtcac cgaccaggtc ggtcggctga cagcggtggc ccgcgacgag 120 ggcgatttcc tcggcgagca gttcatgcag tggttcttgc aggaacagat cgaagaggtg 180 gccttgatgg caaccctggt gcgggttgcc gatcgggccg gggccaacct gttcgagcta 140 gagaacttcg tcgcacgtga agtggatgtg gcgccggccg catcaggcgc cccgcacgct 100 gccgggggcc gcctctagat ccctggcggg gatcagcgag tggtcccgtt cgcccgcccg 160 tcttccagcc aggccttggt gcggccgggg tggtgagtac 100 <210> SEQ ID NO 54 <211> LENGTH: 181 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 54 Met Thr Glu Tyr Glu Gly Pro Lys Thr Lys Phe His Ala Leu Met Gln 1 5 10 15 Glu Gln Ile His Asn Glu Phe Thr Ala Ala Gln Gln Tyr Val Ala Ile 20 25 30 Ala Val Tyr Phe Asp Ser Glu Asp Leu Pro Gln Leu Ala Lys His Phe 35 40 45 Tyr Ser Gln Ala Val Glu Glu Arg Asn His Ala Met Met Leu Val Gln 50 55 60 His Leu Leu Asp Arg Asp Leu Arg Val Glu Ile Pro Gly Val Asp Thr 65 70 75 80 Val Arg Asn Gln Phe Asp Arg Pro Arg Glu Ala Leu Ala Leu Ala Leu 85 90 95 Asp Gln Glu Arg Thr Val Thr Asp Gln Val Gly Arg Leu Thr Ala Val 100 105 110 Ala Arg Asp Glu Gly Asp Phe Leu Gly Glu Gln Phe Met Gln Trp Phe 115 120 125 Leu Gln Glu Gln Ile Glu Glu Val Ala Leu Met Ala Thr Leu Val Arg 130 135 140 Val Ala Asp Arg Ala Gly Ala Asn Leu Phe Glu Leu Glu Asn Phe Val 145 150 155 160 Ala Arg Glu Val Asp Val Ala Pro Ala Ala Ser Gly Ala Pro His Ala 165 170 175 Ala Gly Gly Arg Leu 180 <210> SEQ ID NO 55 <211> LENGTH: 950 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 55 tgggctcggc actggctctc ccacggtggc gcgctgattt ctccccacgg taggcgttgc 60 gacgcatgtt cttcaccgtc tatccacagc taccgacatt tgctccggct ggatcgcggg 120 taaaattccg tcgtgaacaa tcgacccatc cgcctgctga catccggcag ggctggtttg 180 ggtgcgggcg cattgatcac cgccgtcgtc ctgctcatcg ccttgggcgc tgtttggacc 240 ccggttgcct tcgccgatgg atgcccggac gccgaagtca cgttcgcccg cggcaccggc 300 gagccgcccg gaatcgggcg cgttggccag gcgttcgtcg actcgctgcg ccagcagact 360 ggcatggaga tcggagtata cccggtgaat tacgccgcca gccgcctaca gctgcacggg 420 ggagacggcg ccaacgacgc catatcgcac attaagtcca tggcctcgtc atgcccgaac 480 accaagctgg tcttgggcgg ctattcgcag ggcgcaaccg tgatcgatat cgtggccggg 540 gttccgttgg gcagcatcag ctttggcagt ccgctacctg cggcatacgc agacaacgtc 600 gcagcggtcg cggtcttcgg caatccgtcc aaccgcgccg gcggatcgct gtcgagcctg 660 agcccgctat tcggttccaa ggcgattgac ctgtgcaatc ccaccgatcc gatctgccat 720 gtgggccccg gcaacgaatt cagcggacac atcgacggct acatacccac ctacaccacc 780 caggcggcta gtttcgtcgt gcagaggctc cgcgccgggt cggtgccaca tctgcctgga 840 tccgtcccgc agctgcccgg gtctgtcctt cagatgcccg gcactgccgc accggctccc 900 gaatcgctgc acggtcgctg acgctttgtc agtaagccca taaaatcgcg 950 <210> SEQ ID NO 56 <211> LENGTH: 262 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 56 Met Asn Asn Arg Pro Ile Arg Leu Leu Thr Ser Gly Arg Ala Gly Leu 1 5 10 15 Gly Ala Gly Ala Leu Ile Thr Ala Val Val Leu Leu Ile Ala Leu Gly 20 25 30 Ala Val Trp Thr Pro Val Ala Phe Ala Asp Gly Cys Pro Asp Ala Glu 35 40 45 Val Thr Phe Ala Arg Gly Thr Gly Glu Pro Pro Gly Ile Gly Arg Val 50 55 60 Gly Gln Ala Phe Val Asp Ser Leu Arg Gln Gln Thr Gly Met Glu Ile 65 70 75 80 Gly Val Tyr Pro Val Asn Tyr Ala Ala Ser Arg Leu Gln Leu His Gly 85 90 95 Gly Asp Gly Ala Asn Asp Ala Ile Ser His Ile Lys Ser Met Ala Ser 100 105 110 Ser Cys Pro Asn Thr Lys Leu Val Leu Gly Gly Tyr Ser Gln Gly Ala 115 120 125 Thr Val Ile Asp Ile Val Ala Gly Val Pro Leu Gly Ser Ile Ser Phe 130 135 140 Gly Ser Pro Leu Pro Ala Ala Tyr Ala Asp Asn Val Ala Ala Val Ala 145 150 155 160 Val Phe Gly Asn Pro Ser Asn Arg Ala Gly Gly Ser Leu Ser Ser Leu 165 170 175 Ser Pro Leu Phe Gly Ser Lys Ala Ile Asp Leu Cys Asn Pro Thr Asp 180 185 190 Pro Ile Cys His Val Gly Pro Gly Asn Glu Phe Ser Gly His Ile Asp 195 200 205 Gly Tyr Ile Pro Thr Tyr Thr Thr Gln Ala Ala Ser Phe Val Val Gln 210 215 220 Arg Leu Arg Ala Gly Ser Val Pro His Leu Pro Gly Ser Val Pro Gln 225 230 235 240 Leu Pro Gly Ser Val Leu Gln Met Pro Gly Thr Ala Ala Pro Ala Pro 245 250 255 Glu Ser Leu His Gly Arg 260 <210> SEQ ID NO 57 <211> LENGTH: 1000 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 57 cgaggagacc gacgatctgc tcgacgaaat cgacgacgtc ctcgaggaga acgccgagga 60 cttcgtccgc gcatacgtcc aaaagggcgg acagtgacct ggccgttgcc cgatcgcctg 120 tccattaatt cactctctgg aacacccgct gtagacctat cttctttcac tgacttcctg 180 cgccgccagg cgccggagtt gctgccggca agcatcagcg gcggtgcgcc actcgcaggc 240 ggcgatgcgc aactgccgca cggcaccacc attgtcgcgc tgaaataccc cggcggtgtt 300 gtcatggcgg gtgaccggcg ttcgacgcag ggcaacatga tttctgggcg tgatgtgcgc 360 aaggtgtata tcaccgatga ctacaccgct accggcatcg ctggcacggc tgcggtcgcg 420 gttgagtttg cccggctgta tgccgtggaa cttgagcact acgagaagct cgagggtgtg 480 ccgctgacgt ttgccggcaa aatcaaccgg ctggcgatta tggtgcgtgg caatctggcg 540 gccgcgatgc agggtctgct ggcgttgccg ttgctggcgg gctacgacat tcatgcgtct 600 gacccgcaga gcgcgggtcg tatcgtttcg ttcgacgccg ccggcggttg gaacatcgag 660 gaagagggct atcaggcggt gggctcgggt tcgctgttcg cgaagtcgtc gatgaagaag 720 ttgtattcgc aggttaccga cggtgattcg gggctgcggg tggcggtcga ggcgctctac 780 gacgccgccg acgacgactc cgccaccggc ggtccggacc tggtgcgggg catctttccg 840 acggcggtga tcatcgacgc cgacggggcg gttgacgtgc cggagagccg gattgccgaa 900 ttggcccgcg cgatcatcga aagccgttcg ggtgcggata ctttcggctc cgatggcggt 960 gagaagtgag ttttccgtat ttcatctcgc ctgagcaggc 1000 <210> SEQ ID NO 58 <211> LENGTH: 291 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 58 Met Thr Trp Pro Leu Pro Asp Arg Leu Ser Ile Asn Ser Leu Ser Gly 1 5 10 15 Thr Pro Ala Val Asp Leu Ser Ser Phe Thr Asp Phe Leu Arg Arg Gln 20 25 30 Ala Pro Glu Leu Leu Pro Ala Ser Ile Ser Gly Gly Ala Pro Leu Ala 35 40 45 Gly Gly Asp Ala Gln Leu Pro His Gly Thr Thr Ile Val Ala Leu Lys 50 55 60 Tyr Pro Gly Gly Val Val Met Ala Gly Asp Arg Arg Ser Thr Gln Gly 65 70 75 80 Asn Met Ile Ser Gly Arg Asp Val Arg Lys Val Tyr Ile Thr Asp Asp 85 90 95 Tyr Thr Ala Thr Gly Ile Ala Gly Thr Ala Ala Val Ala Val Glu Phe 100 105 110 Ala Arg Leu Tyr Ala Val Glu Leu Glu His Tyr Glu Lys Leu Glu Gly 115 120 125 Val Pro Leu Thr Phe Ala Gly Lys Ile Asn Arg Leu Ala Ile Met Val 130 135 140 Arg Gly Asn Leu Ala Ala Ala Met Gln Gly Leu Leu Ala Leu Pro Leu 145 150 155 160 Leu Ala Gly Tyr Asp Ile His Ala Ser Asp Pro Gln Ser Ala Gly Arg 165 170 175 Ile Val Ser Phe Asp Ala Ala Gly Gly Trp Asn Ile Glu Glu Glu Gly 180 185 190 Tyr Gln Ala Val Gly Ser Gly Ser Leu Phe Ala Lys Ser Ser Met Lys 195 200 205 Lys Leu Tyr Ser Gln Val Thr Asp Gly Asp Ser Gly Leu Arg Val Ala 210 215 220 Val Glu Ala Leu Tyr Asp Ala Ala Asp Asp Asp Ser Ala Thr Gly Gly 225 230 235 240 Pro Asp Leu Val Arg Gly Ile Phe Pro Thr Ala Val Ile Ile Asp Ala 245 250 255 Asp Gly Ala Val Asp Val Pro Glu Ser Arg Ile Ala Glu Leu Ala Arg 260 265 270 Ala Ile Ile Glu Ser Arg Ser Gly Ala Asp Thr Phe Gly Ser Asp Gly 275 280 285 Gly Glu Lys 290 <210> SEQ ID NO 59 <211> LENGTH: 900 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 59 ttggcccgcg cgatcatcga aagccgttcg ggtgcggata ctttcggctc cgatggcggt 60 gagaagtgag ttttccgtat ttcatctcgc ctgagcaggc gatgcgcgag cgcagcgagt 120 tggcgcgtaa gggcattgcg cgggccaaaa gcgtggtggc gctggcctat gccggtggtg 180 tgctgttcgt cgcggagaat ccgtcgcggt cgctgcagaa gatcagtgag ctctacgatc 240 gggtgggttt tgcggctgcg ggcaagttca acgagttcga caatttgcgc cgcggcggga 300 tccagttcgc cgacacccgc ggttacgcct atgaccgtcg tgacgtcacg ggtcggcagt 360 tggccaatgt ctacgcgcag actctaggca ccatcttcac cgaacaggcc aagccctacg 420 aggttgagtt gtgtgtggcc gaggtggcgc attacggcga gacgaaacgc cctgagttgt 480 atcgtattac ctacgacggg tcgatcgccg acgagccgca tttcgtggtg atgggcggca 540 ccacggagcc gatcgccaac gcgctcaaag agtcgtatgc cgagaacgcc agcctgaccg 600 acgccctgcg tatcgcggtc gctgcattgc gggccggcag tgccgacacc tcgggtggtg 660 atcaacccac ccttggcgtg gccagcttag aggtggccgt tctcgatgcc aaccggccac 720 ggcgcgcgtt ccggcgcatc accggctccg ccctgcaagc gttgctggta gaccaggaaa 780 gcccgcagtc tgacggcgaa tcgtcgggct gagtccgaaa gtccgacgcg tgtctgggac 840 cccgctgcga cgttaactgc gcctaacccc ggctcgacgc gtcgccggcc gtcctgactt 900 <210> SEQ ID NO 60 <211> LENGTH: 248 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 60 Met Ser Phe Pro Tyr Phe Ile Ser Pro Glu Gln Ala Met Arg Glu Arg 1 5 10 15 Ser Glu Leu Ala Arg Lys Gly Ile Ala Arg Ala Lys Ser Val Val Ala 20 25 30 Leu Ala Tyr Ala Gly Gly Val Leu Phe Val Ala Glu Asn Pro Ser Arg 35 40 45 Ser Leu Gln Lys Ile Ser Glu Leu Tyr Asp Arg Val Gly Phe Ala Ala 50 55 60 Ala Gly Lys Phe Asn Glu Phe Asp Asn Leu Arg Arg Gly Gly Ile Gln 65 70 75 80 Phe Ala Asp Thr Arg Gly Tyr Ala Tyr Asp Arg Arg Asp Val Thr Gly 85 90 95 Arg Gln Leu Ala Asn Val Tyr Ala Gln Thr Leu Gly Thr Ile Phe Thr 100 105 110 Glu Gln Ala Lys Pro Tyr Glu Val Glu Leu Cys Val Ala Glu Val Ala 115 120 125 His Tyr Gly Glu Thr Lys Arg Pro Glu Leu Tyr Arg Ile Thr Tyr Asp 130 135 140 Gly Ser Ile Ala Asp Glu Pro His Phe Val Val Met Gly Gly Thr Thr 145 150 155 160 Glu Pro Ile Ala Asn Ala Leu Lys Glu Ser Tyr Ala Glu Asn Ala Ser 165 170 175 Leu Thr Asp Ala Leu Arg Ile Ala Val Ala Ala Leu Arg Ala Gly Ser 180 185 190 Ala Asp Thr Ser Gly Gly Asp Gln Pro Thr Leu Gly Val Ala Ser Leu 195 200 205 Glu Val Ala Val Leu Asp Ala Asn Arg Pro Arg Arg Ala Phe Arg Arg 210 215 220 Ile Thr Gly Ser Ala Leu Gln Ala Leu Leu Val Asp Gln Glu Ser Pro 225 230 235 240 Gln Ser Asp Gly Glu Ser Ser Gly 245 <210> SEQ ID NO 61<211> LENGTH: 1560 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 61 gagtcattgc ctggtcggcg tcattccgta ctagtcggtt gtcggacttg acctactggg 60 tcaggccgac gagcactcga ccattagggt aggggccgtg acccactatg acgtcgtcgt 120tctcggagcc ggtcccggcg ggtatgtcgc ggcgattcgc gccgcacagc tcggcctgag 180 cactgcaatc gtcgaaccca agtactgggg cggagtatgc ctcaatgtcg gctgtatccc 240atccaaggcg ctgttgcgca acgccgaact ggtccacatc ttcaccaagg acgccaaagc 300 atttggcatc agcggcgagg tgaccttcga ctacggcatc gcctatgacc gcagccgaaa 360 ggtagccgag ggcagggtgg ccggtgtgca cttcctgatg aagaagaaca agatcaccga 420 gatccacggg tacggcacat ttgccgacgc caacacgttg ttggttgatc tcaacgacgg 480 cggtacagaa tcggtcacgt tcgacaacgc catcatcgcg accggcagta gcacccggct 540 ggttcccggc acctcactgt cggccaacgt agtcacctac gaggaacaga tcctgtcccg 600 agagctgccg aaatcgatca ttattgccgg agctggtgcc attggcatgg agttcggcta 660 cgtgctgaag aactacggcg ttgacgtgac catcgtggaa ttccttccgc gggcgctgcc 720 caacgaggac gccgatgtgt ccaaggagat cgagaagcag ttcaaaaagc tgggtgtcac 780 gatcctgacc gccacgaagg tcgagtccat cgccgatggc gggtcgcagg tcaccgtgac 840 cgtcaccaag gacggcgtgg cgcaagagct taaggcggaa aaggtgttgc aggccatcgg 900 atttgcgccc aacgtcgaag ggtacgggct ggacaaggca ggcgtcgcgc tgaccgaccg 960 caaggctatc ggtgtcgacg actacatgcg taccaacgtg ggccacatct acgctatcgg 1020 cgatgtcaat ggattactgc agctggcgca cgtcgccgag gcacaaggcg tggtagccgc 1080 cgaaaccatt gccggtgcag agactttgac gctgggcgac catcggatgt tgccgcgcgc 1140 gacgttctgt cagccaaacg ttgccagctt cgggctcacc gagcagcaag cccgcaacga 1200 aggttacgac gtggtggtgg ccaagttccc gttcacggcc aacgccaagg cgcacggcgt 1260 gggtgacccc agtgggttcg tcaagctggt ggccgacgcc aagcacggcg agctactggg 1320 tgggcacctg gtcggccacg acgtggccga gctgctgccg gagctcacgc tggcgcagag 1380 gtgggacctg accgccagcg agctggctcg caacgtccac acccacccaa cgatgtctga 1440 ggcgctgcag gagtgcttcc acggcctggt tggccacatg atcaatttct gagcggctca 1500 tgacgaggcg cgcgagcact gacacccccc agatcatcat gggtgccatc ggtggtgtgg 1560 <210> SEQ ID NO 62 <211> LENGTH: 464 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 62 Met Thr His Tyr Asp Val Val Val Leu Gly Ala Gly Pro Gly Gly Tyr 1 5 10 15 Val Ala Ala Ile Arg Ala Ala Gln Leu Gly Leu Ser Thr Ala Ile Val 20 25 30 Glu Pro Lys Tyr Trp Gly Gly Val Cys Leu Asn Val Gly Cys Ile Pro 35 40 45 Ser Lys Ala Leu Leu Arg Asn Ala Glu Leu Val His Ile Phe Thr Lys 50 55 60 Asp Ala Lys Ala Phe Gly Ile Ser Gly Glu Val Thr Phe Asp Tyr Gly 65 70 75 80 Ile Ala Tyr Asp Arg Ser Arg Lys Val Ala Glu Gly Arg Val Ala Gly 85 90 95 Val His Phe Leu Met Lys Lys Asn Lys Ile Thr Glu Ile His Gly Tyr 100 105 110 Gly Thr Phe Ala Asp Ala Asn Thr Leu Leu Val Asp Leu Asn Asp Gly 115 120 125 Gly Thr Glu Ser Val Thr Phe Asp Asn Ala Ile Ile Ala Thr Gly Ser 130 135 140 Ser Thr Arg Leu Val Pro Gly Thr Ser Leu Ser Ala Asn Val Val Thr 145 150 155 160 Tyr Glu Glu Gln Ile Leu Ser Arg Glu Leu Pro Lys Ser Ile Ile Ile 165 170 175 Ala Gly Ala Gly Ala Ile Gly Met Glu Phe Gly Tyr Val Leu Lys Asn 180 185 190 Tyr Gly Val Asp Val Thr Ile Val Glu Phe Leu Pro Arg Ala Leu Pro 195 200 205 Asn Glu Asp Ala Asp Val Ser Lys Glu Ile Glu Lys Gln Phe Lys Lys 210 215 220 Leu Gly Val Thr Ile Leu Thr Ala Thr Lys Val Glu Ser Ile Ala Asp 225 230 235 240 Gly Gly Ser Gln Val Thr Val Thr Val Thr Lys Asp Gly Val Ala Gln 245 250 255 Glu Leu Lys Ala Glu Lys Val Leu Gln Ala Ile Gly Phe Ala Pro Asn 260 265 270 Val Glu Gly Tyr Gly Leu Asp Lys Ala Gly Val Ala Leu Thr Asp Arg 275 280 285 Lys Ala Ile Gly Val Asp Asp Tyr Met Arg Thr Asn Val Gly His Ile 290 295 300 Tyr Ala Ile Gly Asp Val Asn Gly Leu Leu Gln Leu Ala His Val Ala 305 310 315 320 Glu Ala Gln Gly Val Val Ala Ala Glu Thr Ile Ala Gly Ala Glu Thr 325 330 335 Leu Thr Leu Gly Asp His Arg Met Leu Pro Arg Ala Thr Phe Cys Gln 340 345 350 Pro Asn Val Ala Ser Phe Gly Leu Thr Glu Gln Gln Ala Arg Asn Glu 355 360 365 Gly Tyr Asp Val Val Val Ala Lys Phe Pro Phe Thr Ala Asn Ala Lys 370 375 380 Ala His Gly Val Gly Asp Pro Ser Gly Phe Val Lys Leu Val Ala Asp 385 390 395 400 Ala Lys His Gly Glu Leu Leu Gly Gly His Leu Val Gly His Asp Val 405 410 415 Ala Glu Leu Leu Pro Glu Leu Thr Leu Ala Gln Arg Trp Asp Leu Thr 420 425 430 Ala Ser Glu Leu Ala Arg Asn Val His Thr His Pro Thr Met Ser Glu 435 440 445 Ala Leu Gln Glu Cys Phe His Gly Leu Val Gly His Met Ile Asn Phe 450 455 460 <210> SEQ ID NO 63 <211> LENGTH: 550 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 63 ggcccggctc gcggccgccc tgcaggaaaa gaaggcctgc ccaggcccag actcagccga 60gtagtcaccc agtaccccac accaggaagg accgcccatc atggcaaagc tctccaccga 120 cgaactgctg gacgcgttca aggaaatgac cctgttggag ctctccgact tcgtcaagaa 180 gttcgaggag accttcgagg tcaccgccgc cgctccagtc gccgtcgccg ccgccggtgc 240cgccccggcc ggtgccgccg tcgaggctgc cgaggagcag tccgagttcg acgtgatcct 300 tgaggccgcc ggcgacaaga agatcggcgt catcaaggtg gtccgggaga tcgtttccgg 360 cctgggcctc aaggaggcca aggacctggt cgacggcgcg cccaagccgc tgctggagaa 420 ggtcgccaag gaggccgccg acgaggccaa ggccaagctg gaggccgccg gcgccaccgt 480 caccgtcaag tagctctgcc cagcgtgttc ttttgcgtct gctcggcccg tagcgaacac 540 tgcgcccgct 550<210> SEQ ID NO 64 <211> LENGTH: 130 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 64 Met Ala Lys Leu Ser Thr Asp Glu Leu Leu Asp Ala Phe Lys Glu Met 1 5 10 15 Thr Leu Leu Glu Leu Ser Asp Phe Val Lys Lys Phe Glu Glu Thr Phe 20 25 30 Glu Val Thr Ala Ala Ala Pro Val Ala Val Ala Ala Ala Gly Ala Ala 35 40 45 Pro Ala Gly Ala Ala Val Glu Ala Ala Glu Glu Gln Ser Glu Phe Asp 50 55 60 Val Ile Leu Glu Ala Ala Gly Asp Lys Lys Ile Gly Val Ile Lys Val 65 70 75 80 Val Arg Glu Ile Val Ser Gly Leu Gly Leu Lys Glu Ala Lys Asp Leu 85 90 95 Val Asp Gly Ala Pro Lys Pro Leu Leu Glu Lys Val Ala Lys Glu Ala 100 105 110 Ala Asp Glu Ala Lys Ala Lys Leu Glu Ala Ala Gly Ala Thr Val Thr 115 120 125 Val Lys 130 <210> SEQ ID NO 65 <211> LENGTH: 900 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 65 tgaacgccat cgggtccaac gaacgcagcg ctacctgatc accaccgggt ctgttagggc 60tcttccccag gtcgtacagt cgggccatgg ccattgaggt ttcggtgttg cgggttttca 120 ccgattcaga cgggaatttc ggtaatccgc tgggggtgat caacgccagc aaggtcgaac 180 accgcgacag gcagcagctg gcagcccaat cgggctacag cgaaaccata ttcgtcgatc 240ttcccagccc cggctcaacc accgcacacg ccaccatcca tactccccgc accgaaattc 300 cgttcgccgg acacccgacc gtgggagcgt cctggtggct gcgcgagagg gggacgccaa 360 ttaacacgct gcaggtgccg gccggcatcg tccaggtgag ctaccacggt gatctcaccg 420 ccatcagcgc ccgctcggaa tgggcacccg agttcgccat ccacgacctg gattcacttg 480 atgcgcttgc cgccgccgac cccgccgact ttccggacga catcgcgcac tacctctgga 540 cctggaccga ccgctccgct ggctcgctgc gcgcccgcat gtttgccgcc aacttgggcg 600 tcaccgaaga cgaagcgacc ggtgccgcgg ccatccggat taccgattac ctcagccgtg 660 acctcaccat cacccagggc aaaggatcgt tgatccacac cacctggagt cccgagggct 720 gggttcgggt agccggccga gttgtcagcg acggtgtggc acaactcgac tgacgtagag 780 ctcagcgctg ccgatgcaac acggcggcaa ggtgatcctg caggggttgc ccgaccgcgc 840 gcatctgcaa cgagtacgaa agctcgtcgc cgtcgatgcg gtaggaacgg tcaagggcgg 900 <210> SEQ ID NO 66 <211> LENGTH: 228 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 66 Met Ala Ile Glu Val Ser Val Leu Arg Val Phe Thr Asp Ser Asp Gly 1 5 10 15 Asn Phe Gly Asn Pro Leu Gly Val Ile Asn Ala Ser Lys Val Glu His 20 25 30 Arg Asp Arg Gln Gln Leu Ala Ala Gln Ser Gly Tyr Ser Glu Thr Ile 35 40 45 Phe Val Asp Leu Pro Ser Pro Gly Ser Thr Thr Ala His Ala Thr Ile 50 55 60 His Thr Pro Arg Thr Glu Ile Pro Phe Ala Gly His Pro Thr Val Gly 65 70 75 80 Ala Ser Trp Trp Leu Arg Glu Arg Gly Thr Pro Ile Asn Thr Leu Gln 85 90 95 Val Pro Ala Gly Ile Val Gln Val Ser Tyr His Gly Asp Leu Thr Ala 100 105 110 Ile Ser Ala Arg Ser Glu Trp Ala Pro Glu Phe Ala Ile His Asp Leu 115 120 125 Asp Ser Leu Asp Ala Leu Ala Ala Ala Asp Pro Ala Asp Phe Pro Asp 130 135 140 Asp Ile Ala His Tyr Leu Trp Thr Trp Thr Asp Arg Ser Ala Gly Ser 145 150 155 160 Leu Arg Ala Arg Met Phe Ala Ala Asn Leu Gly Val Thr Glu Asp Glu 165 170 175 Ala Thr Gly Ala Ala Ala Ile Arg Ile Thr Asp Tyr Leu Ser Arg Asp 180 185 190 Leu Thr Ile Thr Gln Gly Lys Gly Ser Leu Ile His Thr Thr Trp Ser 195 200 205 Pro Glu Gly Trp Val Arg Val Ala Gly Arg Val Val Ser Asp Gly Val 210 215 220 Ala Gln Leu Asp 225 <210> SEQ ID NO 67 <211> LENGTH: 500 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 67 gtttgtggtg tcggtggtct ggggggcgcc aactgggatt cggttggggt gggtgcaggt 60 ccggcgatgg gcatcggagg tgtgggtggt ttgggtgggg ccggttcggg tccggcgatg 120 ggcatggggg gtgtgggtgg tttgggtggg gccggttcgg gtccggcgat gggcatgggg 180 ggtgtgggtg gtttagatgc ggccggttcc ggcgagggcg gctctcctgc ggcgatcggc 240atcggagttg gcggaggcgg aggtgggggt gggggtggcg gcggcggggc cgacacgaac 300cgctccgaca ggtcgtcgga cgtcgggggc ggagtctggc cgttgggctt cggtaggttt 360 gccgatgcgg gcgccggcgg aaacgaagca ctggggtcga agaacggctg cgctgccata 420 tcgtccggag cttccatacc ttcgtgcggc cggaagagct tgtcgtagtc ggccgccatg 480 acaacctctc agagtgcgct 500 <210> SEQ ID NO 68 <211> LENGTH: 139 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 68 Met Gly Ala Gly Pro Ala Met Gly Ile Gly Gly Val Gly Gly Leu Gly 1 5 10 15 Gly Ala Gly Ser Gly Pro Ala Met Gly Met Gly Gly Val Gly Gly Leu 20 25 30 Gly Gly Ala Gly Ser Gly Pro Ala Met Gly Met Gly Gly Val Gly Gly 35 40 45 Leu Asp Ala Ala Gly Ser Gly Glu Gly Gly Ser Pro Ala Ala Ile Gly 50 55 60 Ile Gly Val Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 65 70 75 80 Ala Asp Thr Asn Arg Ser Asp Arg Ser Ser Asp Val Gly Gly Gly Val 85 90 95 Trp Pro Leu Gly Phe Gly Arg Phe Ala Asp Ala Gly Ala Gly Gly Asn 100 105 110 Glu Ala Leu Gly Ser Lys Asn Gly Cys Ala Ala Ile Ser Ser Gly Ala 115 120 125 Ser Ile Pro Ser Cys Gly Arg Lys Ser Leu Ser 130 135 <210> SEQ ID NO 69 <211> LENGTH: 2050 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 69 agcgcactct gagaggttgt catggcggcc gactacgaca agctcttccg gccgcacgaa 60 ggtatggaag ctccggacga tatggcagcg cagccgttct tcgaccccag tgcttcgttt 120 ccgccggcgc ccgcatcggc aaacctaccg aagcccaacg gccagactcc gcccccgacg 180 tccgacgacc tgtcggagcg gttcgtgtcg gccccgccgc cgccaccccc acccccacct 240 ccgcctccgc caactccgat gccgatcgcc gcaggagagc cgccctcgcc ggaaccggcc 300 gcatctaaac cacccacacc ccccatgccc atcgccggac ccgaaccggc cccacccaaa 360 ccacccacac cccccatgcc catcgccgga cccgaaccgg ccccacccaa accacccaca 420 cctccgatgc ccatcgccgg acctgcaccc accccaaccg aatcccagtt ggcgcccccc 480 agaccaccga caccacaaac gccaaccgga gcgccgcagc aaccggaatc accggcgccc 540 cacgtaccct cgcacgggcc acatcaaccc cggcgcaccg caccagcacc gccctgggca 600 aagatgccaa tcggcgaacc cccgcccgct ccgtccagac cgtctgcgtc cccggccgaa 660 ccaccgaccc ggcctgcccc ccaacactcc cgacgtgcgc gccggggtca ccgctatcgc 720 acagacaccg aacgaaacgt cgggaaggta gcaactggtc catccatcca ggcgcggctg 780 cgggcagagg aagcatccgg cgcgcagctc gcccccggaa cggagccctc gccagcgccg 840 ttgggccaac cgagatcgta tctggctccg cccacccgcc ccgcgccgac agaacctccc 900 cccagcccct cgccgcagcg caactccggt cggcgtgccg agcgacgcgt ccaccccgat 960 ttagccgccc aacatgccgc ggcgcaacct gattcaatta cggccgcaac cactggcggt 1020 cgtcgccgca agcgtgcagc gccggatctc gacgcgacac agaaatcctt aaggccggcg 1080 gccaaggggc cgaaggtgaa gaaggtgaag ccccagaaac cgaaggccac gaagccgccc 1140 aaagtggtgt cgcagcgcgg ctggcgacat tgggtgcatg cgttgacgcg aatcaacctg 1200 ggcctgtcac ccgacgagaa gtacgagctg gacctgcacg ctcgagtccg ccgcaatccc 1260 cgcgggtcgt atcagatcgc cgtcgtcggt ctcaaaggtg gggctggcaa aaccacgctg 1320 acagcagcgt tggggtcgac gttggctcag gtgcgggccg accggatcct ggctctagac 1380 gcggatccag gcgccggaaa cctcgccgat cgggtagggc gacaatcggg cgcgaccatc 1440 gctgatgtgc ttgcagaaaa agagctgtcg cactacaacg acatccgcgc acacactagc 1500 gtcaatgcgg tcaatctgga agtgctgccg gcaccggaat acagctcggc gcagcgcgcg 1560 ctcagcgacg ccgactggca tttcatcgcc gatcctgcgt cgaggtttta caacctcgtc 1620 ttggctgatt gtggggccgg cttcttcgac ccgctgaccc gcggcgtgct gtccacggtg 1680 tccggtgtcg tggtcgtggc aagtgtctca atcgacggcg cacaacaggc gtcggtcgcg 1740 ttggactggt tgcgcaacaa cggttaccaa gatttggcga gccgcgcatg cgtggtcatc 1800 aatcacatca tgccgggaga acccaatgtc gcagttaaag acctggtgcg gcatttcgaa 1860 cagcaagttc aacccggccg ggtcgtggtc atgccgtggg acaggcacat tgcggccgga 1920 accgagattt cactcgactt gctcgaccct atctacaagc gcaaggtcct cgaattggcc 1980 gcagcgctat ccgacgattt cgagagggct ggacgtcgtt gagcgcacct gctgttgctg 2040 ctggtcctac 2050 <210> SEQ ID NO 70 <211> LENGTH: 666 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 70 Met Ala Ala Asp Tyr Asp Lys Leu Phe Arg Pro His Glu Gly Met Glu 1 5 10 15 Ala Pro Asp Asp Met Ala Ala Gln Pro Phe Phe Asp Pro Ser Ala Ser 20 25 30 Phe Pro Pro Ala Pro Ala Ser Ala Asn Leu Pro Lys Pro Asn Gly Gln 35 40 45 Thr Pro Pro Pro Thr Ser Asp Asp Leu Ser Glu Arg Phe Val Ser Ala 50 55 60 Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Thr Pro Met 65 70 75 80 Pro Ile Ala Ala Gly Glu Pro Pro Ser Pro Glu Pro Ala Ala Ser Lys 85 90 95 Pro Pro Thr Pro Pro Met Pro Ile Ala Gly Pro Glu Pro Ala Pro Pro 100 105 110 Lys Pro Pro Thr Pro Pro Met Pro Ile Ala Gly Pro Glu Pro Ala Pro 115 120 125 Pro Lys Pro Pro Thr Pro Pro Met Pro Ile Ala Gly Pro Ala Pro Thr 130 135 140 Pro Thr Glu Ser Gln Leu Ala Pro Pro Arg Pro Pro Thr Pro Gln Thr 145 150 155 160 Pro Thr Gly Ala Pro Gln Gln Pro Glu Ser Pro Ala Pro His Val Pro 165 170 175 Ser His Gly Pro His Gln Pro Arg Arg Thr Ala Pro Ala Pro Pro Trp 180 185 190 Ala Lys Met Pro Ile Gly Glu Pro Pro Pro Ala Pro Ser Arg Pro Ser 195 200 205 Ala Ser Pro Ala Glu Pro Pro Thr Arg Pro Ala Pro Gln His Ser Arg 210 215 220 Arg Ala Arg Arg Gly His Arg Tyr Arg Thr Asp Thr Glu Arg Asn Val 225 230 235 240 Gly Lys Val Ala Thr Gly Pro Ser Ile Gln Ala Arg Leu Arg Ala Glu 245 250 255 Glu Ala Ser Gly Ala Gln Leu Ala Pro Gly Thr Glu Pro Ser Pro Ala 260 265 270 Pro Leu Gly Gln Pro Arg Ser Tyr Leu Ala Pro Pro Thr Arg Pro Ala 275 280 285 Pro Thr Glu Pro Pro Pro Ser Pro Ser Pro Gln Arg Asn Ser Gly Arg 290 295 300 Arg Ala Glu Arg Arg Val His Pro Asp Leu Ala Ala Gln His Ala Ala 305 310 315 320 Ala Gln Pro Asp Ser Ile Thr Ala Ala Thr Thr Gly Gly Arg Arg Arg 325 330 335 Lys Arg Ala Ala Pro Asp Leu Asp Ala Thr Gln Lys Ser Leu Arg Pro 340 345 350 Ala Ala Lys Gly Pro Lys Val Lys Lys Val Lys Pro Gln Lys Pro Lys 355 360 365 Ala Thr Lys Pro Pro Lys Val Val Ser Gln Arg Gly Trp Arg His Trp 370 375 380 Val His Ala Leu Thr Arg Ile Asn Leu Gly Leu Ser Pro Asp Glu Lys 385 390 395 400 Tyr Glu Leu Asp Leu His Ala Arg Val Arg Arg Asn Pro Arg Gly Ser 405 410 415 Tyr Gln Ile Ala Val Val Gly Leu Lys Gly Gly Ala Gly Lys Thr Thr 420 425 430 Leu Thr Ala Ala Leu Gly Ser Thr Leu Ala Gln Val Arg Ala Asp Arg 435 440 445 Ile Leu Ala Leu Asp Ala Asp Pro Gly Ala Gly Asn Leu Ala Asp Arg 450 455 460 Val Gly Arg Gln Ser Gly Ala Thr Ile Ala Asp Val Leu Ala Glu Lys 465 470 475 480 Glu Leu Ser His Tyr Asn Asp Ile Arg Ala His Thr Ser Val Asn Ala 485 490 495 Val Asn Leu Glu Val Leu Pro Ala Pro Glu Tyr Ser Ser Ala Gln Arg 500 505 510 Ala Leu Ser Asp Ala Asp Trp His Phe Ile Ala Asp Pro Ala Ser Arg 515 520 525 Phe Tyr Asn Leu Val Leu Ala Asp Cys Gly Ala Gly Phe Phe Asp Pro 530 535 540 Leu Thr Arg Gly Val Leu Ser Thr Val Ser Gly Val Val Val Val Ala 545 550 555 560 Ser Val Ser Ile Asp Gly Ala Gln Gln Ala Ser Val Ala Leu Asp Trp 565 570 575 Leu Arg Asn Asn Gly Tyr Gln Asp Leu Ala Ser Arg Ala Cys Val Val 580 585 590 Ile Asn His Ile Met Pro Gly Glu Pro Asn Val Ala Val Lys Asp Leu 595 600 605 Val Arg His Phe Glu Gln Gln Val Gln Pro Gly Arg Val Val Val Met 610 615 620 Pro Trp Asp Arg His Ile Ala Ala Gly Thr Glu Ile Ser Leu Asp Leu 625 630 635 640 Leu Asp Pro Ile Tyr Lys Arg Lys Val Leu Glu Leu Ala Ala Ala Leu 645 650 655 Ser Asp Asp Phe Glu Arg Ala Gly Arg Arg 660 665 <210> SEQ ID NO 71 <211> LENGTH: 1890 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 71 gcagcgatga ggaggagcgg cgccaacggc ccgcgccggc gacgatgcaa agcgcagcga 60 tgaggaggag cggcgcgcat gactgctgaa ccggaagtac ggacgctgcg cgaggttgtg 120 ctggaccagc tcggcactgc tgaatcgcgt gcgtacaaga tgtggctgcc gccgttgacc 180 aatccggtcc cgctcaacga gctcatcgcc cgtgatcggc gacaacccct gcgatttgcc 240 ctggggatca tggatgaacc gcgccgccat ctacaggatg tgtggggcgt agacgtttcc 300 ggggccggcg gcaacatcgg tattgggggc gcacctcaaa ccgggaagtc gacgctactg 360cagacgatgg tgatgtcggc cgccgccaca cactcaccgc gcaacgttca gttctattgc 420 atcgacctag gtggcggcgg gctgatctat ctcgaaaacc ttccacacgt cggtggggta 480 gccaatcggt ccgagcccga caaggtcaac cgggtggtcg cagagatgca agccgtcatg 540 cggcaacggg aaaccacctt caaggaacac cgagtgggct cgatcgggat gtaccggcag 600 ctgcgtgacg atccaagtca acccgttgcg tccgatccat acggcgacgt ctttctgatc 660 atcgacggat ggcccggttt tgtcggcgag ttccccgacc ttgaggggca ggttcaagat 720 ctggccgccc aggggctggg gttcggcgtc cacgtcatca tctccacgcc acgctggaca 780 gagctgaagt cgcgtgttcg cgactacctc ggcaccaaga tcgagttccg gcttggtgac 840 gtcaatgaaa cccagatcga ccggattacc cgcgagatcc cggcgaatcg tccgggtcgg 900 gcagtgtcga tggaaaagca ccatctgatg atcggcgtgc ccaggttcga cggcgtgcac 960 agcgccgata acctggtgga ggcgatcacc gcgggggtga cgcagatcgc ttcccagcac 1020 accgaacagg cacctccggt gcgggtcctg ccggagcgta tccacctgca cgaactcgac 1080 ccgaacccgc cgggaccaga gtccgactac cgcactcgct gggagattcc gatcggcttg 1140 cgcgagacgg acctgacgcc ggctcactgc cacatgcaca cgaacccgca cctactgatc 1200 ttcggtgcgg ccaaatcggg caagacgacc attgcccacg cgatcgcgcg cgccatttgt 1260 gcccgaaaca gtccccagca ggtgcggttc atgctcgcgg actaccgctc gggcctgctg 1320 gacgcggtgc cggacaccca tctgctgggc gccggcgcga tcaaccgcaa cagcgcgtcg 1380 ctagacgagg ccgctcaagc actggcggtc aacctgaaga agcggttgcc gccgaccgac 1440 ctgacgacgg cgcagctacg ctcgcgttcg tggtggagcg gatttgacgt cgtgcttctg 1500 gtcgacgatt ggcacatgat cgtgggtgcc gccgggggga tgccgccgat ggcaccgctg 1560 gccccgttat tgccggcggc ggcagatatc gggttgcaca tcattgtcac ctgtcagatg 1620 agccaggctt acaaggcaac catggacaag ttcgtcggcg ccgcattcgg gtcgggcgct 1680 ccgacaatgt tcctttcggg cgagaagcag gaattcccat ccagtgagtt caaggtcaag 1740 cggcgccccc ctggccaggc atttctcgtc tcgccagacg gcaaagaggt catccaggcc 1800 ccctacatcg agcctccaga agaagtgttc gcagcacccc caagcgccgg ttaagattat 1860 ttcattgccg gtgtagcagg acccgagctc 1890 <210> SEQ ID NO 72 <211> LENGTH: 591 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 72 Met Thr Ala Glu Pro Glu Val Arg Thr Leu Arg Glu Val Val Leu Asp 1 5 10 15 Gln Leu Gly Thr Ala Glu Ser Arg Ala Tyr Lys Met Trp Leu Pro Pro 20 25 30 Leu Thr Asn Pro Val Pro Leu Asn Glu Leu Ile Ala Arg Asp Arg Arg 35 40 45 Gln Pro Leu Arg Phe Ala Leu Gly Ile Met Asp Glu Pro Arg Arg His 50 55 60 Leu Gln Asp Val Trp Gly Val Asp Val Ser Gly Ala Gly Gly Asn Ile 65 70 75 80 Gly Ile Gly Gly Ala Pro Gln Thr Gly Lys Ser Thr Leu Leu Gln Thr 85 90 95 Met Val Met Ser Ala Ala Ala Thr His Ser Pro Arg Asn Val Gln Phe 100 105 110 Tyr Cys Ile Asp Leu Gly Gly Gly Gly Leu Ile Tyr Leu Glu Asn Leu 115 120 125 Pro His Val Gly Gly Val Ala Asn Arg Ser Glu Pro Asp Lys Val Asn 130 135 140 Arg Val Val Ala Glu Met Gln Ala Val Met Arg Gln Arg Glu Thr Thr 145 150 155 160 Phe Lys Glu His Arg Val Gly Ser Ile Gly Met Tyr Arg Gln Leu Arg 165 170 175 Asp Asp Pro Ser Gln Pro Val Ala Ser Asp Pro Tyr Gly Asp Val Phe 180 185 190 Leu Ile Ile Asp Gly Trp Pro Gly Phe Val Gly Glu Phe Pro Asp Leu 195 200 205 Glu Gly Gln Val Gln Asp Leu Ala Ala Gln Gly Leu Gly Phe Gly Val 210 215 220 His Val Ile Ile Ser Thr Pro Arg Trp Thr Glu Leu Lys Ser Arg Val 225 230 235 240 Arg Asp Tyr Leu Gly Thr Lys Ile Glu Phe Arg Leu Gly Asp Val Asn 245 250 255 Glu Thr Gln Ile Asp Arg Ile Thr Arg Glu Ile Pro Ala Asn Arg Pro 260 265 270 Gly Arg Ala Val Ser Met Glu Lys His His Leu Met Ile Gly Val Pro 275 280 285 Arg Phe Asp Gly Val His Ser Ala Asp Asn Leu Val Glu Ala Ile Thr 290 295 300 Ala Gly Val Thr Gln Ile Ala Ser Gln His Thr Glu Gln Ala Pro Pro 305 310 315 320 Val Arg Val Leu Pro Glu Arg Ile His Leu His Glu Leu Asp Pro Asn 325 330 335 Pro Pro Gly Pro Glu Ser Asp Tyr Arg Thr Arg Trp Glu Ile Pro Ile 340 345 350 Gly Leu Arg Glu Thr Asp Leu Thr Pro Ala His Cys His Met His Thr 355 360 365 Asn Pro His Leu Leu Ile Phe Gly Ala Ala Lys Ser Gly Lys Thr Thr 370 375 380 Ile Ala His Ala Ile Ala Arg Ala Ile Cys Ala Arg Asn Ser Pro Gln 385 390 395 400 Gln Val Arg Phe Met Leu Ala Asp Tyr Arg Ser Gly Leu Leu Asp Ala 405 410 415 Val Pro Asp Thr His Leu Leu Gly Ala Gly Ala Ile Asn Arg Asn Ser 420 425 430 Ala Ser Leu Asp Glu Ala Ala Gln Ala Leu Ala Val Asn Leu Lys Lys 435 440 445 Arg Leu Pro Pro Thr Asp Leu Thr Thr Ala Gln Leu Arg Ser Arg Ser 450 455 460 Trp Trp Ser Gly Phe Asp Val Val Leu Leu Val Asp Asp Trp His Met 465 470 475 480 Ile Val Gly Ala Ala Gly Gly Met Pro Pro Met Ala Pro Leu Ala Pro 485 490 495 Leu Leu Pro Ala Ala Ala Asp Ile Gly Leu His Ile Ile Val Thr Cys 500 505 510 Gln Met Ser Gln Ala Tyr Lys Ala Thr Met Asp Lys Phe Val Gly Ala 515 520 525 Ala Phe Gly Ser Gly Ala Pro Thr Met Phe Leu Ser Gly Glu Lys Gln 530 535 540 Glu Phe Pro Ser Ser Glu Phe Lys Val Lys Arg Arg Pro Pro Gly Gln 545 550 555 560 Ala Phe Leu Val Ser Pro Asp Gly Lys Glu Val Ile Gln Ala Pro Tyr 565 570 575 Ile Glu Pro Pro Glu Glu Val Phe Ala Ala Pro Pro Ser Ala Gly 580 585 590 <210> SEQ ID NO 73 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 73 Asp Pro Val Asp Asp Ala Phe Ile Ala Lys Leu Asn Thr Ala Gly 1 5 10 15 <210> SEQ ID NO 74 <211> LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (14) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 74 Asp Pro Val Asp Ala Ile Ile Asn Leu Asp Asn Tyr Gly Xaa 1 5 10 <210> SEQ ID NO 75 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (5) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 75 Ala Glu Met Lys Xaa Phe Lys Asn Ala Ile Val Gln Glu Ile Asp 1 5 10 15 <210> SEQ ID NO 76 <211> LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (3) <223> OTHER INFORMATION: Ala is Ala or Gln <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (7) <223> OTHER INFORMATION: Thr is Gly or Thr <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (11) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 76 Val Ile Ala Gly Met Val Thr His Ile His Xaa Val Ala Gly 1 5 10 <210> SEQ ID NO 77 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 77 Thr Asn Ile Val Val Leu Ile Lys Gln Val Pro Asp Thr Trp Ser 1 5 10 15 <210> SEQ ID NO 78 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 78 Ala Ile Glu Val Ser Val Leu Arg Val Phe Thr Asp Ser Asp Gly 1 5 10 15 <210> SEQ ID NO 79 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 79 Ala Lys Leu Ser Thr Asp Glu Leu Leu Asp Ala Phe Lys Glu Met 1 5 10 15 <210> SEQ ID NO 80 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (4) <223> OTHER INFORMATION: Asp is Asp or Glu <400> SEQUENCE: 80 Asp Pro Ala Asp Ala Pro Asp Val Pro Thr Ala Ala Gln Leu Thr 1 5 10 15 <210> SEQ ID NO 81 <211> LENGTH: 50 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 81 Ala Glu Asp Val Arg Ala Glu Ile Val Ala Ser Val Leu Glu Val Val 1 5 10 15 Val Asn Glu Gly Asp Gln Ile Asp Lys Gly Asp Val Val Val Leu Leu 20 25 30 Glu Ser Met Tyr Met Glu Ile Pro Val Leu Ala Glu Ala Ala Gly Thr 35 40 45 Val Ser 50 <210> SEQ ID NO 82 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 82 Thr Thr Ser Pro Asp Pro Tyr Ala Ala Leu Pro Lys Leu Pro Ser 1 5 10 15 <210> SEQ ID NO 83 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 83 Thr Glu Tyr Glu Gly Pro Lys Thr Lys Phe His Ala Leu Met Gln 1 5 10 15 <210> SEQ ID NO 84 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 84 Thr Thr Ile Val Ala Leu Lys Tyr Pro Gly Gly Val Val Met Ala 1 5 10 15 <210> SEQ ID NO 85 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (10) <223> OTHER INFORMATION: Xaa is unknown <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (15) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 85 Ser Phe Pro Tyr Phe Ile Ser Pro Glu Xaa Ala Met Arg Glu Xaa 1 5 10 15 <210> SEQ ID NO 86 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 86 Thr His Tyr Asp Val Val Val Leu Gly Ala Gly Pro Gly Gly Tyr 1 5 10 15 <210> SEQ ID NO 87 <211> LENGTH: 450 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 87 agcccggtaa tcgagttcgg gcaatgctga ccatcgggtt tgtttccggc tataaccgaa 60 cggtttgtgt acgggataca aatacaggga gggaagaagt aggcaaatgg aaaaaatgtc 120acatgatccg atcgctgccg acattggcac gcaagtgagc gacaacgctc tgcacggcgt 180 gacggccggc tcgacggcgc tgacgtcggt gaccgggctg gttcccgcgg gggccgatga 240 ggtctccgcc caagcggcga cggcgttcac atcggagggc atccaattgc tggcttccaa 300 tgcatcggcc caagaccagc tccaccgtgc gggcgaagcg gtccaggacg tcgcccgcac 360ctattcgcaa atcgacgacg gcgccgccgg cgtcttcgcc taataggccc ccaacacatc 420 ggagggagtg atcaccatgc tgtggcacgc 450<210> SEQ ID NO 88 <211> LENGTH: 98 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 88 Met Glu Lys Met Ser His Asp Pro Ile Ala Ala Asp Ile Gly Thr Gln 1 5 10 15 Val Ser Asp Asn Ala Leu His Gly Val Thr Ala Gly Ser Thr Ala Leu 20 25 30 Thr Ser Val Thr Gly Leu Val Pro Ala Gly Ala Asp Glu Val Ser Ala 35 40 45 Gln Ala Ala Thr Ala Phe Thr Ser Glu Gly Ile Gln Leu Leu Ala Ser 50 55 60 Asn Ala Ser Ala Gln Asp Gln Leu His Arg Ala Gly Glu Ala Val Gln 65 70 75 80 Asp Val Ala Arg Thr Tyr Ser Gln Ile Asp Asp Gly Ala Ala Gly Val 85 90 95 Phe Ala <210> SEQ ID NO 89 <211> LENGTH: 460 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 89 gcaaccggct tttcgatcag ctgagacatc agcggcgtgc gggtcaacga cccacctgcg 60ccaggtagcg actccgcgcg cagcaggccc gcgcccgcgc tggggcctga tccaccagcc 120 agcggatggt tcgacagcgg actggtgccg agcaggccca tctgcgcggc ttcctcgtcg 180gctgggttgc cgccgccggt gccgcccacc tggctgaaca acgacgtcac ctgctgcagc 240ggctgggtca gctgctgcat cgggccgctc atctcaccca gttggccgag ggtctgggta 300 gccgccggcg gcaactggcc aaccggtgtt gagctgccag gggagggcat tccgaagatc 360 gggttcgtcg tgctctggct cgcgccggga tcaaggatcg acgccatcgg ctcgagcttc 420 tcgaaaagcg tgttaaccgc ggtctcggcc tggtagacct 460 <210> SEQ ID NO 90 <211> LENGTH: 139 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 90 Met Arg Val Asn Asp Pro Pro Ala Pro Gly Ser Asp Ser Ala Arg Ser 1 5 10 15 Arg Pro Ala Pro Ala Leu Gly Pro Asp Pro Pro Ala Ser Gly Trp Phe 20 25 30 Asp Ser Gly Leu Val Pro Ser Arg Pro Ile Cys Ala Ala Ser Ser Ser 35 40 45 Ala Gly Leu Pro Pro Pro Val Pro Pro Thr Trp Leu Asn Asn Asp Val 50 55 60 Thr Cys Cys Ser Gly Trp Val Ser Cys Cys Ile Gly Pro Leu Ile Ser 65 70 75 80 Pro Ser Trp Pro Arg Val Trp Val Ala Ala Gly Gly Asn Trp Pro Thr 85 90 95 Gly Val Glu Leu Pro Gly Glu Gly Ile Pro Lys Ile Gly Phe Val Val 100 105 110 Leu Trp Leu Ala Pro Gly Ser Arg Ile Asp Ala Ile Gly Ser Ser Phe 115 120 125 Ser Lys Ser Val Leu Thr Ala Val Ser Ala Trp 130 135 <210> SEQ ID NO 91 <211> LENGTH: 1200 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 91 taataggccc ccaacacatc ggagggagtg atcaccatgc tgtggcacgc aatgccaccg 60gagctaaata ccgcacggct gatggccggc gcgggtccgg ctccaatgct tgcggcggcc 120gcgggatggc agacgctttc ggcggctctg gacgctcagg ccgtcgagtt gaccgcgcgc 180ctgaactctc tgggagaagc ctggactgga ggtggcagcg acaaggcgct tgcggctgca 240acgccgatgg tggtctggct acaaaccgcg tcaacacagg ccaagacccg tgcgatgcag 300 gcgacggcgc aagccgcggc atacacccag gccatggcca cgacgccgtc gctgccggag 360 atcgccgcca accacatcac ccaggccgtc cttacggcca ccaacttctt cggtatcaac 420 acgatcccga tcgcgttgac cgagatggat tatttcatcc gtatgtggaa ccaggcagcc 480 ctggcaatgg aggtctacca ggccgagacc gcggttaaca cgcttttcga gaagctcgag 540 ccgatggcgt cgatccttga tcccggcgcg agccagagca cgacgaaccc gatcttcgga 600 atgccctccc ctggcagctc aacaccggtt ggccagttgc cgccggcggc tacccagacc 660 ctcggccaac tgggtgagat gagcggcccg atgcagcagc tgacccagcc gctgcagcag 720 gtgacgtcgt tgttcagcca ggtgggcggc accggcggcg gcaacccagc cgacgaggaa 780 gccgcgcaga tgggcctgct cggcaccagt ccgctgtcga accatccgct ggctggtgga 840 tcaggcccca gcgcgggcgc gggcctgctg cgcgcggagt cgctacctgg cgcaggtggg 900 tcgttgaccc gcacgccgct gatgtctcag ctgatcgaaa agccggttgc cccctcggtg 960 atgccggcgg ctgctgccgg atcgtcggcg acgggtggcg ccgctccggt gggtgcggga 1020 gcgatgggcc agggtgcgca atccggcggc tccaccaggc cgggtctggt cgcgccggca 1080 ccgctcgcgc aggagcgtga agaagacgac gaggacgact gggacgaaga ggacgactgg 1140 tgagctcccg taatgacaac agacttcccg gccacccggg ccggaagact tgccaacatt 1200 <210> SEQ ID NO 92 <211> LENGTH: 371 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 92 Met Ile Thr Met Leu Trp His Ala Met Pro Pro Glu Leu Asn Thr Ala 1 5 10 15 Arg Leu Met Ala Gly Ala Gly Pro Ala Pro Met Leu Ala Ala Ala Ala 20 25 30 Gly Trp Gln Thr Leu Ser Ala Ala Leu Asp Ala Gln Ala Val Glu Leu 35 40 45 Thr Ala Arg Leu Asn Ser Leu Gly Glu Ala Trp Thr Gly Gly Gly Ser 50 55 60 Asp Lys Ala Leu Ala Ala Ala Thr Pro Met Val Val Trp Leu Gln Thr 65 70 75 80 Ala Ser Thr Gln Ala Lys Thr Arg Ala Met Gln Ala Thr Ala Gln Ala 85 90 95 Ala Ala Tyr Thr Gln Ala Met Ala Thr Thr Pro Ser Leu Pro Glu Ile 100 105 110 Ala Ala Asn His Ile Thr Gln Ala Val Leu Thr Ala Thr Asn Phe Phe 115 120 125 Gly Ile Asn Thr Ile Pro Ile Ala Leu Thr Glu Met Asp Tyr Phe Ile 130 135 140 Arg Met Trp Asn Gln Ala Ala Leu Ala Met Glu Val Tyr Gln Ala Glu 145 150 155 160 Thr Ala Val Asn Thr Leu Phe Glu Lys Leu Glu Pro Met Ala Ser Ile 165 170 175 Leu Asp Pro Gly Ala Ser Gln Ser Thr Thr Asn Pro Ile Phe Gly Met 180 185 190 Pro Ser Pro Gly Ser Ser Thr Pro Val Gly Gln Leu Pro Pro Ala Ala 195 200 205 Thr Gln Thr Leu Gly Gln Leu Gly Glu Met Ser Gly Pro Met Gln Gln 210 215 220 Leu Thr Gln Pro Leu Gln Gln Val Thr Ser Leu Phe Ser Gln Val Gly 225 230 235 240 Gly Thr Gly Gly Gly Asn Pro Ala Asp Glu Glu Ala Ala Gln Met Gly 245 250 255 Leu Leu Gly Thr Ser Pro Leu Ser Asn His Pro Leu Ala Gly Gly Ser 260 265 270 Gly Pro Ser Ala Gly Ala Gly Leu Leu Arg Ala Glu Ser Leu Pro Gly 275 280 285 Ala Gly Gly Ser Leu Thr Arg Thr Pro Leu Met Ser Gln Leu Ile Glu 290 295 300 Lys Pro Val Ala Pro Ser Val Met Pro Ala Ala Ala Ala Gly Ser Ser 305 310 315 320 Ala Thr Gly Gly Ala Ala Pro Val Gly Ala Gly Ala Met Gly Gln Gly 325 330 335 Ala Gln Ser Gly Gly Ser Thr Arg Pro Gly Leu Val Ala Pro Ala Pro 340 345 350 Leu Ala Gln Glu Arg Glu Glu Asp Asp Glu Asp Asp Trp Asp Glu Glu 355 360 365 Asp Asp Trp 370 <210> SEQ ID NO 93 <211> LENGTH: 1000 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 93 gacgcgacac agaaatcctt aaggccggcg gccaaggggc cgaaggtgaa gaaggtgaag 60 ccccagaaac cgaaggccac gaagccgccc aaagtggtgt cgcagcgcgg ctggcgacat 120 tgggtgcatg cgttgacgcg aatcaacctg ggcctgtcac ccgacgagaa gtacgagctg 180 gacctgcacg ctcgagtccg ccgcaatccc cgcgggtcgt atcagatcgc cgtcgtcggt 240ctcaaaggtg gggctggcaa aaccacgctg acagcagcgt tggggtcgac gttggctcag 300 gtgcgggccg accggatcct ggctctagac gcggatccag gcgccggaaa cctcgccgat 360 cgggtagggc gacaatcggg cgcgaccatc gctgatgtgc ttgcagaaaa agagctgtcg 420 cactacaacg acatccgcgc acacactagc gtcaatgcgg tcaatctgga agtgctgccg 480 gcaccggaat acagctcggc gcagcgcgcg ctcagcgacg ccgactggca tttcatcgcc 540 gatcctgcgt cgaggtttta caacctcgtc ttggctgatt gtggggccgg cttcttcgac 600ccgctgaccc gcggcgtgct gtccacggtg tccggtgtcg tggtcgtggc aagtgtctca 660 atcgacggcg cacaacaggc gtcggtcgcg ttggactggt tgcgcaacaa cggttaccaa 720 gatttggcga gccgcgcatg cgtggtcatc aatcacatca tgccgggaga acccaatgtc 780 gcagttaaag acctggtgcg gcatttcgaa cagcaagttc aacccggccg ggtcgtggtc 840 atgccgtggg acaggcacat tgcggccgga accgagattt cactcgactt gctcgaccct 900 atctacaagc gcaaggtcct cgaattggcc gcagcgctat ccgacgattt cgagagggct 960 ggacgtcgtt gagcgcacct gctgttgctg ctggtcctac 1000 <210> SEQ ID NO 94 <211> LENGTH: 308 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 94 Met Lys Lys Val Lys Pro Gln Lys Pro Lys Ala Thr Lys Pro Pro Lys 1 5 10 15 Val Val Ser Gln Arg Gly Trp Arg His Trp Val His Ala Leu Thr Arg 20 25 30 Ile Asn Leu Gly Leu Ser Pro Asp Glu Lys Tyr Glu Leu Asp Leu His 35 40 45 Ala Arg Val Arg Arg Asn Pro Arg Gly Ser Tyr Gln Ile Ala Val Val 50 55 60 Gly Leu Lys Gly Gly Ala Gly Lys Thr Thr Leu Thr Ala Ala Leu Gly 65 70 75 80 Ser Thr Leu Ala Gln Val Arg Ala Asp Arg Ile Leu Ala Leu Asp Ala 85 90 95 Asp Pro Gly Ala Gly Asn Leu Ala Asp Arg Val Gly Arg Gln Ser Gly 100 105 110 Ala Thr Ile Ala Asp Val Leu Ala Glu Lys Glu Leu Ser His Tyr Asn 115 120 125 Asp Ile Arg Ala His Thr Ser Val Asn Ala Val Asn Leu Glu Val Leu 130 135 140 Pro Ala Pro Glu Tyr Ser Ser Ala Gln Arg Ala Leu Ser Asp Ala Asp 145 150 155 160 Trp His Phe Ile Ala Asp Pro Ala Ser Arg Phe Tyr Asn Leu Val Leu 165 170 175 Ala Asp Cys Gly Ala Gly Phe Phe Asp Pro Leu Thr Arg Gly Val Leu 180 185 190 Ser Thr Val Ser Gly Val Val Val Val Ala Ser Val Ser Ile Asp Gly 195 200 205 Ala Gln Gln Ala Ser Val Ala Leu Asp Trp Leu Arg Asn Asn Gly Tyr 210 215 220 Gln Asp Leu Ala Ser Arg Ala Cys Val Val Ile Asn His Ile Met Pro 225 230 235 240 Gly Glu Pro Asn Val Ala Val Lys Asp Leu Val Arg His Phe Glu Gln 245 250 255 Gln Val Gln Pro Gly Arg Val Val Val Met Pro Trp Asp Arg His Ile 260 265 270 Ala Ala Gly Thr Glu Ile Ser Leu Asp Leu Leu Asp Pro Ile Tyr Lys 275 280 285 Arg Lys Val Leu Glu Leu Ala Ala Ala Leu Ser Asp Asp Phe Glu Arg 290 295 300 Ala Gly Arg Arg 305 <210> SEQ ID NO 95 <211> LENGTH: 34 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 95 aagagtagat ctatgatggc cgaggatgtt cgcg 34 <210> SEQ ID NO 96 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 96 cggcgacgac ggatcctacc gcgtcgg 27 <210> SEQ ID NO 97 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 97 ccttgggaga tctttggacc ccggttgc 28<210> SEQ ID NO 98 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 98 gacgagatct tatgggctta ctgac 25<210> SEQ ID NO 99 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 99 ccccccagat ctgcaccacc ggcatcggcg ggc 33 <210> SEQ ID NO 100<211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 100 gcggcggatc cgttgcttag ccgg 24 <210> SEQ ID NO 101<211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 101 ccggctgaga tctatgacag aatacgaagg gc 32 <210> SEQ ID NO 102 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 102 ccccgccagg gaactagagg cggc 24 <210> SEQ ID NO 103 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 103 ctgccgagat ctaccaccat tgtcgcgctg aaataccc 38 <210> SEQ ID NO 104<211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 104 cgccatggcc ttacgcgcca actcg 25<210> SEQ ID NO 105<211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 105 ggcggagatc tgtgagtttt ccgtatttca tc 32 <210> SEQ ID NO 106<211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 106 cgcgtcgagc catggttagg cgcag 25<210> SEQ ID NO 107 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 107 gaggaagatc tatgacaact tcacccgacc cg 32 <210> SEQ ID NO 108 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 108 catgaagcca tggcccgcag gctgcatg 28<210> SEQ ID NO 109 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 109 ggccgagatc tgtgacccac tatgacgtcg tcg 33 <210> SEQ ID NO 110 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 110 ggcgcccatg gtcagaaatt gatcatgtgg ccaacc 36 <210> SEQ ID NO 111 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 111 ccgggagatc tatggcaaag ctctccaccg acg 33 <210> SEQ ID NO 112 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 112 cgctgggcag agctacttga cggtgacggt gg 32 <210> SEQ ID NO 113 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 113 ggcccagatc tatggccatt gaggtttcgg tgttgc 36 <210> SEQ ID NO 114 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 114 cgccgtgttg catggcagcg ctgagc 26 <210> SEQ ID NO 115 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 115 ggacgttcaa gcgacacatc gccg 24 <210> SEQ ID NO 116 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 116 cagcacgaac gcgccgtcga tggc 24 <210> SEQ ID NO 117 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 117 acagatctgt gacggacatg aacccg 26 <210> SEQ ID NO 118 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 118 ttttccatgg tcacgggccc ccggtact 28<210> SEQ ID NO 119 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 119 acagatctgt gcccatggca cagata 26 <210> SEQ ID NO 120<211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 120 tttaagcttc taggcgccca gcgcggc 27 <210> SEQ ID NO 121<211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 121 acagatctgc gcatgcggat ccgtgt 26 <210> SEQ ID NO 122 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 122 ttttccatgg tcatccggcg tgatcgag 28<210> SEQ ID NO 123 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 123 acagatctgt aatggcagac tgtgat 26 <210> SEQ ID NO 124 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 124 ttttccatgg tcaggagatg gtgatcga 28<210> SEQ ID NO 125 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 125 acagatctgc cggctacccc ggtgcc 26 <210> SEQ ID NO 126 <211> LENGTH: 28 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 126 ttttccatgg ctattgcagc tttccggc 28<210> SEQ ID NO 127 <211> LENGTH: 50 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 127 Ala Glu Asp Val Arg Ala Glu Ile Val Ala Ser Val Leu Glu Val Val 1 5 10 15 Val Asn Glu Gly Asp Gln Ile Asp Lys Gly Asp Val Val Val Leu Leu 20 25 30 Glu Ser Met Tyr Met Glu Ile Pro Val Leu Ala Glu Ala Ala Gly Thr 35 40 45 Val Ser 50 <210> SEQ ID NO 128 <211> LENGTH: 49 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 128 Ala Glu Asp Val Arg Ala Glu Ile Val Ala Ser Val Leu Glu Val Val 1 5 10 15 Val Asn Glu Gly Asp Gln Ile Asp Lys Gly Asp Val Val Val Leu Leu 20 25 30 Glu Ser Met Met Glu Ile Pro Val Leu Ala Glu Ala Ala Gly Thr Val 35 40 45 Ser <210> SEQ ID NO 129 <211> LENGTH: 50 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 129 Ala Glu Asp Val Arg Ala Glu Ile Val Ala Ser Val Leu Glu Val Val 1 5 10 15 Val Asn Glu Gly Asp Gln Ile Asp Lys Gly Asp Val Val Val Leu Leu 20 25 30 Glu Ser Met Lys Met Glu Ile Pro Val Leu Ala Glu Ala Ala Gly Thr 35 40 45 Val Ser 50 <210> SEQ ID NO 130 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 130 ccgggagatc tatggcaaag ctctccaccg acg 33 <210> SEQ ID NO 131 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 131 cgctgggcag agctacttga cggtgacggt gg 32 <210> SEQ ID NO 132 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 132 ggcgccggca agcttgccat gacagagcag cagtgg 36 <210> SEQ ID NO 133 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 133 cgaactcgcc ggatcccgtg tttcgc 26 <210> SEQ ID NO 134 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 134 ggcaaccgcg agatctttct cccggccggg gc 32 <210> SEQ ID NO 135 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 135 ggcaagcttg ccggcgccta acgaact 27 <210> SEQ ID NO 136 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 136 ggacccagat ctatgacaga gcagcagtgg 30 <210> SEQ ID NO 137 <211> LENGTH: 47 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 137 ccggcagccc cggccgggag aaaagctttg cgaacatccc agtgacg 47 <210> SEQ ID NO 138 <211> LENGTH: 44 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 138 gttcgcaaag cttttctccc ggccggggct gccggtcgag tacc 44 <210> SEQ ID NO 139 <211> LENGTH: 20 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 139 ccttcggtgg atcccgtcag 20<210> SEQ ID NO 140 <211> LENGTH: 450 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 140 tggcgctgtc accgaggaac ctgtcaatgt cgtcgagcag tactgaaccg ttccgagaaa 60 ggccagcatg aacgtcaccg tatccattcc gaccatcctg cggccccaca ccggcggcca 120 gaagagtgtc tcggccagcg gcgatacctt gggtgccgtc atcagcgacc tggaggccaa 180 ctattcgggc atttccgagc gcctgatgga cccgtcttcc ccaggtaagt tgcaccgctt 240 cgtgaacatc tacgtcaacg acgaggacgt gcggttctcc ggcggcttgg ccaccgcgat 300 cgctgacggt gactcggtca ccatcctccc cgccgtggcc ggtgggtgag cggagcacat 360 gacacgatac gactcgctgt tgcaggcctt gggcaacacg ccgctggttg gcctgcagcg 420 attgtcgcca cgctgggatg acgggcgaga 450 <210> SEQ ID NO 141 <211> LENGTH: 93 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 141 Met Asn Val Thr Val Ser Ile Pro Thr Ile Leu Arg Pro His Thr Gly 1 5 10 15 Gly Gln Lys Ser Val Ser Ala Ser Gly Asp Thr Leu Gly Ala Val Ile 20 25 30 Ser Asp Leu Glu Ala Asn Tyr Ser Gly Ile Ser Glu Arg Leu Met Asp 35 40 45 Pro Ser Ser Pro Gly Lys Leu His Arg Phe Val Asn Ile Tyr Val Asn 50 55 60 Asp Glu Asp Val Arg Phe Ser Gly Gly Leu Ala Thr Ala Ile Ala Asp 65 70 75 80 Gly Asp Ser Val Thr Ile Leu Pro Ala Val Ala Gly Gly 85 90 <210> SEQ ID NO 142 <211> LENGTH: 480 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 142 ggtgttcccg cggccggcta tgacaacagt caatgtgcat gacaagttac aggtattagg 60tccaggttca acaaggagac aggcaacatg gcaacacgtt ttatgacgga tccgcacgcg 120 atgcgggaca tggcgggccg ttttgaggtg cacgcccaga cggtggagga cgaggctcgc 180cggatgtggg cgtccgcgca aaacatctcg ggcgcgggct ggagtggcat ggccgaggcg 240acctcgctag acaccatggc ccagatgaat caggcgtttc gcaacatcgt gaacatgctg 300cacggggtgc gtgacgggct ggttcgcgac gccaacaact acgagcagca agagcaggcc 360 tcccagcaga tcctcagcag ctaacgtcag ccgctgcagc acaatacttt tacaagcgaa 420 ggagaacagg ttcgatgacc atcaactatc agttcggtga tgtcgacgct catggcgcca 480 <210> SEQ ID NO 143 <211> LENGTH: 98 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 143 Met Ala Thr Arg Phe Met Thr Asp Pro His Ala Met Arg Asp Met Ala 1 5 10 15 Gly Arg Phe Glu Val His Ala Gln Thr Val Glu Asp Glu Ala Arg Arg 20 25 30 Met Trp Ala Ser Ala Gln Asn Ile Ser Gly Ala Gly Trp Ser Gly Met 35 40 45 Ala Glu Ala Thr Ser Leu Asp Thr Met Ala Gln Met Asn Gln Ala Phe 50 55 60 Arg Asn Ile Val Asn Met Leu His Gly Val Arg Asp Gly Leu Val Arg 65 70 75 80 Asp Ala Asn Asn Tyr Glu Gln Gln Glu Gln Ala Ser Gln Gln Ile Leu 85 90 95 Ser Ser <210> SEQ ID NO 144 <211> LENGTH: 940 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 144 gccccagtcc tcgatcgcct catcgccttc accggccgcc agccgaccgc aggccacgtg 60 tccgccacct aacgaaagga tgatcatgcc caagagaagc gaatacaggc aaggcacgcc 120 gaactgggtc gaccttcaga ccaccgatca gtccgccgcc aaaaagttct acacatcgtt 180 gttcggctgg ggttacgacg acaacccggt ccccggaggc ggtggggtct attccatggc 240 cacgctgaac ggcgaagccg tggccgccat cgcaccgatg cccccgggtg caccggaggg 300 gatgccgccg atctggaaca cctatatcgc ggtggacgac gtcgatgcgg tggtggacaa 360 ggtggtgccc gggggcgggc aggtgatgat gccggccttc gacatcggcg atgccggccg 420 gatgtcgttc atcaccgatc cgaccggcgc tgccgtgggc ctatggcagg ccaatcggca 480 catcggagcg acgttggtca acgagacggg cacgctcatc tggaacgaac tgctcacgga 540 caagccggat ttggcgctag cgttctacga ggctgtggtt ggcctcaccc actcgagcat 600 ggagatagct gcgggccaga actatcgggt gctcaaggcc ggcgacgcgg aagtcggcgg 660 ctgtatggaa ccgccgatgc ccggcgtgcc gaatcattgg cacgtctact ttgcggtgga 720 tgacgccgac gccacggcgg ccaaagccgc cgcagcgggc ggccaggtca ttgcggaacc 780 ggctgacatt ccgtcggtgg gccggttcgc cgtgttgtcc gatccgcagg gcgcgatctt 840 cagtgtgttg aagcccgcac cgcagcaata gggagcatcc cgggcaggcc cgccggccgg 900 cagattcgga gaatgctaga agctgccgcc ggcgccgccg 940 <210> SEQ ID NO 145 <211> LENGTH: 261 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 145 Met Pro Lys Arg Ser Glu Tyr Arg Gln Gly Thr Pro Asn Trp Val Asp 1 5 10 15 Leu Gln Thr Thr Asp Gln Ser Ala Ala Lys Lys Phe Tyr Thr Ser Leu 20 25 30 Phe Gly Trp Gly Tyr Asp Asp Asn Pro Val Pro Gly Gly Gly Gly Val 35 40 45 Tyr Ser Met Ala Thr Leu Asn Gly Glu Ala Val Ala Ala Ile Ala Pro 50 55 60 Met Pro Pro Gly Ala Pro Glu Gly Met Pro Pro Ile Trp Asn Thr Tyr 65 70 75 80 Ile Ala Val Asp Asp Val Asp Ala Val Val Asp Lys Val Val Pro Gly 85 90 95 Gly Gly Gln Val Met Met Pro Ala Phe Asp Ile Gly Asp Ala Gly Arg 100 105 110 Met Ser Phe Ile Thr Asp Pro Thr Gly Ala Ala Val Gly Leu Trp Gln 115 120 125 Ala Asn Arg His Ile Gly Ala Thr Leu Val Asn Glu Thr Gly Thr Leu 130 135 140 Ile Trp Asn Glu Leu Leu Thr Asp Lys Pro Asp Leu Ala Leu Ala Phe 145 150 155 160 Tyr Glu Ala Val Val Gly Leu Thr His Ser Ser Met Glu Ile Ala Ala 165 170 175 Gly Gln Asn Tyr Arg Val Leu Lys Ala Gly Asp Ala Glu Val Gly Gly 180 185 190 Cys Met Glu Pro Pro Met Pro Gly Val Pro Asn His Trp His Val Tyr 195 200 205 Phe Ala Val Asp Asp Ala Asp Ala Thr Ala Ala Lys Ala Ala Ala Ala 210 215 220 Gly Gly Gln Val Ile Ala Glu Pro Ala Asp Ile Pro Ser Val Gly Arg 225 230 235 240 Phe Ala Val Leu Ser Asp Pro Gln Gly Ala Ile Phe Ser Val Leu Lys 245 250 255 Pro Ala Pro Gln Gln 260 <210> SEQ ID NO 146 <211> LENGTH: 280 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 146 ccgaaaggcg gtgcaccgca cccagaagaa aaggaaagat cgagaaatgc cacagggaac 60tgtgaagtgg ttcaacgcgg agaaggggtt cggctttatc gcccccgaag acggttccgc 120 ggatgtattt gtccactaca cggagatcca gggaacgggc ttccgcaccc ttgaagaaaa 180 ccagaaggtc gagttcgaga tcggccacag ccctaagggc ccccaggcca ccggagtccg 240 ctcgctctga gttacccccg cgagcagacg caaaaagccc 280 <210> SEQ ID NO 147 <211> LENGTH: 67 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 147 Met Pro Gln Gly Thr Val Lys Trp Phe Asn Ala Glu Lys Gly Phe Gly 1 5 10 15 Phe Ile Ala Pro Glu Asp Gly Ser Ala Asp Val Phe Val His Tyr Thr 20 25 30 Glu Ile Gln Gly Thr Gly Phe Arg Thr Leu Glu Glu Asn Gln Lys Val 35 40 45 Glu Phe Glu Ile Gly His Ser Pro Lys Gly Pro Gln Ala Thr Gly Val 50 55 60 Arg Ser Leu 65 <210> SEQ ID NO 148 <211> LENGTH: 540 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 148 atcgtgtcgt atcgagaacc ccggccggta tcagaacgcg ccagagcgca aacctttata 60 acttcgtgtc ccaaatgtga cgaccatgga ccaaggttcc tgagatgaac ctacggcgcc 120 atcagaccct gacgctgcga ctgctggcgg catccgcggg cattctcagc gccgcggcct 180 tcgccgcgcc agcacaggca aaccccgtcg acgacgcgtt catcgccgcg ctgaacaatg 240ccggcgtcaa ctacggcgat ccggtcgacg ccaaagcgct gggtcagtcc gtctgcccga 300 tcctggccga gcccggcggg tcgtttaaca ccgcggtagc cagcgttgtg gcgcgcgccc 360aaggcatgtc ccaggacatg gcgcaaacct tcaccagtat cgcgatttcg atgtactgcc 420 cctcggtgat ggcagacgtc gccagcggca acctgccggc cctgccagac atgccggggc 480 tgcccgggtc ctaggcgtgc gcggctccta gccggtccct aacggatcga tcgtggatgc 540 <210> SEQ ID NO 149 <211> LENGTH: 129 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 149 Met Asn Leu Arg Arg His Gln Thr Leu Thr Leu Arg Leu Leu Ala Ala 1 5 10 15 Ser Ala Gly Ile Leu Ser Ala Ala Ala Phe Ala Ala Pro Ala Gln Ala 20 25 30 Asn Pro Val Asp Asp Ala Phe Ile Ala Ala Leu Asn Asn Ala Gly Val 35 40 45 Asn Tyr Gly Asp Pro Val Asp Ala Lys Ala Leu Gly Gln Ser Val Cys 50 55 60 Pro Ile Leu Ala Glu Pro Gly Gly Ser Phe Asn Thr Ala Val Ala Ser 65 70 75 80 Val Val Ala Arg Ala Gln Gly Met Ser Gln Asp Met Ala Gln Thr Phe 85 90 95 Thr Ser Ile Ala Ile Ser Met Tyr Cys Pro Ser Val Met Ala Asp Val 100 105 110 Ala Ser Gly Asn Leu Pro Ala Leu Pro Asp Met Pro Gly Leu Pro Gly 115 120 125 Ser <210> SEQ ID NO 150<211> LENGTH: 400 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 150 atagtttggg gaaggtgtcc ataaatgagg ctgtcgttga ccgcattgag cgccggtgta 60 ggcgccgtgg caatgtcgtt gaccgtcggg gccggggtcg cctccgcaga tcccgtggac 120 gcggtcatta acaccacctg caattacggg caggtagtag ctgcgctcaa cgcgacggat 180 ccgggggctg ccgcacagtt caacgcctca ccggtggcgc agtcctattt gcgcaatttc 240ctcgccgcac cgccacctca gcgcgctgcc atggccgcgc aattgcaagc tgtgccgggg 300 gcggcacagt acatcggcct tgtcgagtcg gttgccggct cctgcaacaa ctattaagcc 360 catgcgggcc ccatcccgcg acccggcatc gtcgccgggg 400<210> SEQ ID NO 151<211> LENGTH: 110 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 151 Met Arg Leu Ser Leu Thr Ala Leu Ser Ala Gly Val Gly Ala Val Ala 1 5 10 15 Met Ser Leu Thr Val Gly Ala Gly Val Ala Ser Ala Asp Pro Val Asp 20 25 30 Ala Val Ile Asn Thr Thr Cys Asn Tyr Gly Gln Val Val Ala Ala Leu 35 40 45 Asn Ala Thr Asp Pro Gly Ala Ala Ala Gln Phe Asn Ala Ser Pro Val 50 55 60 Ala Gln Ser Tyr Leu Arg Asn Phe Leu Ala Ala Pro Pro Pro Gln Arg 65 70 75 80 Ala Ala Met Ala Ala Gln Leu Gln Ala Val Pro Gly Ala Ala Gln Tyr 85 90 95 Ile Gly Leu Val Glu Ser Val Ala Gly Ser Cys Asn Asn Tyr 100 105 110 <210> SEQ ID NO 152 <211> LENGTH: 990 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 152 aatagtaata tcgctgtgcg gttgcaaaac gtgtgaccga ggttccgcag tcgagcgctg 60 cgggccgcct tcgaggagga cgaaccacag tcatgacgaa catcgtggtc ctgatcaagc 120aggtcccaga tacctggtcg gagcgcaagc tgaccgacgg cgatttcacg ctggaccgcg 180 aggccgccga cgcggtgctg gacgagatca acgagcgcgc cgtggaggaa gcgctacaga 240ttcgggagaa agaggccgcc gacggcatcg aagggtcggt aaccgtgctg acggcgggcc 300 ccgagcgcgc caccgaggcg atccgcaagg cgctgtcgat gggtgccgac aaggccgtcc 360 acctaaagga cgacggcatg cacggctcgg acgtcatcca aaccgggtgg gctttggcgc 420 gcgcgttggg caccatcgag ggcaccgagc tggtgatcgc aggcaacgaa tcgaccgacg 480 gggtgggcgg tgcggtgccg gccatcatcg ccgagtacct gggcctgccg cagctcaccc 540 acctgcgcaa agtgtcgatc gagggcggca agatcaccgg cgagcgtgag accgatgagg 600 gcgtattcac cctcgaggcc acgctgcccg cggtgatcag cgtgaacgag aagatcaacg 660 agccgcgctt cccgtccttc aaaggcatca tggccgccaa gaagaaggaa gttaccgtgc 720 tgaccctggc cgagatcggt gtcgagagcg acgaggtggg gctggccaac gccggatcca 780 ccgtgctggc gtcgacgccc aaaccggcca agactgccgg ggagaaggtc accgacgagg 840 gtgaaggcgg caaccagatc gtgcagtacc tggttgccca gaaaatcatc taagacatac 900 gcacctccca aagacgagag cgatataacc catggctgaa gtactggtgc tcgttgagca 960 cgctgaaggc gcgttaaaga aggtcagcgc 990 <210> SEQ ID NO 153 <211> LENGTH: 266 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 153 Met Thr Asn Ile Val Val Leu Ile Lys Gln Val Pro Asp Thr Trp Ser 1 5 10 15 Glu Arg Lys Leu Thr Asp Gly Asp Phe Thr Leu Asp Arg Glu Ala Ala 20 25 30 Asp Ala Val Leu Asp Glu Ile Asn Glu Arg Ala Val Glu Glu Ala Leu 35 40 45 Gln Ile Arg Glu Lys Glu Ala Ala Asp Gly Ile Glu Gly Ser Val Thr 50 55 60 Val Leu Thr Ala Gly Pro Glu Arg Ala Thr Glu Ala Ile Arg Lys Ala 65 70 75 80 Leu Ser Met Gly Ala Asp Lys Ala Val His Leu Lys Asp Asp Gly Met 85 90 95 His Gly Ser Asp Val Ile Gln Thr Gly Trp Ala Leu Ala Arg Ala Leu 100 105 110 Gly Thr Ile Glu Gly Thr Glu Leu Val Ile Ala Gly Asn Glu Ser Thr 115 120 125 Asp Gly Val Gly Gly Ala Val Pro Ala Ile Ile Ala Glu Tyr Leu Gly 130 135 140 Leu Pro Gln Leu Thr His Leu Arg Lys Val Ser Ile Glu Gly Gly Lys 145 150 155 160 Ile Thr Gly Glu Arg Glu Thr Asp Glu Gly Val Phe Thr Leu Glu Ala 165 170 175 Thr Leu Pro Ala Val Ile Ser Val Asn Glu Lys Ile Asn Glu Pro Arg 180 185 190 Phe Pro Ser Phe Lys Gly Ile Met Ala Ala Lys Lys Lys Glu Val Thr 195 200 205 Val Leu Thr Leu Ala Glu Ile Gly Val Glu Ser Asp Glu Val Gly Leu 210 215 220 Ala Asn Ala Gly Ser Thr Val Leu Ala Ser Thr Pro Lys Pro Ala Lys 225 230 235 240 Thr Ala Gly Glu Lys Val Thr Asp Glu Gly Glu Gly Gly Asn Gln Ile 245 250 255 Val Gln Tyr Leu Val Ala Gln Lys Ile Ile 260 265 <210> SEQ ID NO 154 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 154 ctgagatcta tgaacctacg gcgcc 25<210> SEQ ID NO 155 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 155 ctcccatggt accctaggac ccgggcagcc ccggc 35<210> SEQ ID NO 156 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 156 ctgagatcta tgaggctgtc gttgaccgc 29<210> SEQ ID NO 157 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 157 ctccccgggc ttaatagttg ttgcaggagc 30 <210> SEQ ID NO 158 <211> LENGTH: 33 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 158 gcttagatct atgattttct gggcaaccag gta 33 <210> SEQ ID NO 159 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 159 gcttccatgg gcgaggcaca ggcgtgggaa 30 <210> SEQ ID NO 160 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 160 ctgagatcta gaatgccaca gggaactgtg 30 <210> SEQ ID NO 161 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 161 tctcccgggg gtaactcaga gcgagcggac 30 <210> SEQ ID NO 162 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 162 ctgagatcta tgaacgtcac cgtatcc 27 <210> SEQ ID NO 163 <211> LENGTH: 27 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 163 tctcccgggg ctcacccacc ggccacg 27 <210> SEQ ID NO 164 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 164 ctgagatcta tggcaacacg ttttatgacg 30 <210> SEQ ID NO 165 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 165 ctccccgggt tagctgctga ggatctgcth 30 <210> SEQ ID NO 166 <211> LENGTH: 31 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 166 ctgaagatct atgcccaaga gaagcgaata c 31 <210> SEQ ID NO 167 <211> LENGTH: 31 <212> TYPE: DNA <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 167 cggcagctgc tagcattctc cgaatctgcc g 31 <210> SEQ ID NO 168 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 168 Pro Gln Gly Thr Val Lys Trp Phe Asn Ala Glu Lys Gly Phe Gly 1 5 10 15 <210> SEQ ID NO 169 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: UNSURE <222> LOCATION: (15) <223> OTHER INFORMATION: Xaa is unknown <400> SEQUENCE: 169 Asn Val Thr Val Ser Ile Pro Thr Ile Leu Arg Pro Xaa Xaa Xaa 1 5 10 15 <210> SEQ ID NO 170<211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <220> FEATURE: <221> NAME/KEY: VARIANT <222> LOCATION: (1) <223> OTHER INFORMATION: Thr could also be Ala <400> SEQUENCE: 170 Thr Arg Phe Met Thr Asp Pro His Ala Met Arg Asp Met Ala Gly 1 5 10 15 <210> SEQ ID NO 171<211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 171 Pro Lys Arg Ser Glu Tyr Arg Gln Gly Thr Pro Asn Trp Val Asp 1 5 10 15 <210> SEQ ID NO 172 <211> LENGTH: 404 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 172 Met Ala Thr Val Asn Arg Ser Arg His His His His His His His His 1 5 10 15 Ile Glu Gly Arg Ser Phe Ser Arg Pro Gly Leu Pro Val Glu Tyr Leu 20 25 30 Gln Val Pro Ser Pro Ser Met Gly Arg Asp Ile Lys Val Gln Phe Gln 35 40 45 Ser Gly Gly Asn Asn Ser Pro Ala Val Tyr Leu Leu Asp Gly Leu Arg 50 55 60 Ala Gln Asp Asp Tyr Asn Gly Trp Asp Ile Asn Thr Pro Ala Phe Glu 65 70 75 80 Trp Tyr Tyr Gln Ser Gly Leu Ser Ile Val Met Pro Val Gly Gly Gln 85 90 95 Ser Ser Phe Tyr Ser Asp Trp Tyr Ser Pro Ala Cys Gly Lys Ala Gly 100 105 110 Cys Gln Thr Tyr Lys Trp Glu Thr Phe Leu Thr Ser Glu Leu Pro Gln 115 120 125 Trp Leu Ser Ala Asn Arg Ala Val Lys Pro Thr Gly Ser Ala Ala Ile 130 135 140 Gly Leu Ser Met Ala Gly Ser Ser Ala Met Ile Leu Ala Ala Tyr His 145 150 155 160 Pro Gln Gln Phe Ile Tyr Ala Gly Ser Leu Ser Ala Leu Leu Asp Pro 165 170 175 Ser Gln Gly Met Gly Pro Ser Leu Ile Gly Leu Ala Met Gly Asp Ala 180 185 190 Gly Gly Tyr Lys Ala Ala Asp Met Trp Gly Pro Ser Ser Asp Pro Ala 195 200 205 Trp Glu Arg Asn Asp Pro Thr Gln Gln Ile Pro Lys Leu Val Ala Asn 210 215 220 Asn Thr Arg Leu Trp Val Tyr Cys Gly Asn Gly Thr Pro Asn Glu Leu 225 230 235 240 Gly Gly Ala Asn Ile Pro Ala Glu Phe Leu Glu Asn Phe Val Arg Ser 245 250 255 Ser Asn Leu Lys Phe Gln Asp Ala Tyr Asn Ala Ala Gly Gly His Asn 260 265 270 Ala Val Phe Asn Phe Pro Pro Asn Gly Thr His Ser Trp Glu Tyr Trp 275 280 285 Gly Ala Gln Leu Asn Ala Met Lys Gly Asp Leu Gln Ser Ser Leu Gly 290 295 300 Ala Gly Lys Leu Ala Met Thr Glu Gln Gln Trp Asn Phe Ala Gly Ile 305 310 315 320 Glu Ala Ala Ala Ser Ala Ile Gln Gly Asn Val Thr Ser Ile His Ser 325 330 335 Leu Leu Asp Glu Gly Lys Gln Ser Leu Thr Lys Leu Ala Ala Ala Trp 340 345 350 Gly Gly Ser Gly Ser Glu Ala Tyr Gln Gly Val Gln Gln Lys Trp Asp 355 360 365 Ala Thr Ala Thr Glu Leu Asn Asn Ala Leu Gln Asn Leu Ala Arg Thr 370 375 380 Ile Ser Glu Ala Gly Gln Ala Met Ala Ser Thr Glu Gly Asn Val Thr 385 390 395 400 Gly Met Phe Ala <210> SEQ ID NO 173 <211> LENGTH: 403 <212> TYPE: PRT <213> ORGANISM: Mycobacterium tuberculosis <400> SEQUENCE: 173 Met Ala Thr Val Asn Arg Ser Arg His His His His His His His His 1 5 10 15 Ile Glu Gly Arg Ser Met Thr Glu Gln Gln Trp Asn Phe Ala Gly Ile 20 25 30 Glu Ala Ala Ala Ser Ala Ile Gln Gly Asn Val Thr Ser Ile His Ser 35 40 45 Leu Leu Asp Glu Gly Lys Gln Ser Leu Thr Lys Leu Ala Ala Ala Trp 50 55 60 Gly Gly Ser Gly Ser Glu Ala Tyr Gln Gly Val Gln Gln Lys Trp Asp 65 70 75 80 Ala Thr Ala Thr Glu Leu Asn Asn Ala Leu Gln Asn Leu Ala Arg Thr 85 90 95 Ile Ser Glu Ala Gly Gln Ala Met Ala Ser Thr Glu Gly Asn Val Thr 100 105 110 Gly Met Phe Ala Lys Leu Phe Ser Arg Pro Gly Leu Pro Val Glu Tyr 115 120 125 Leu Gln Val Pro Ser Pro Ser Met Gly Arg Asp Ile Lys Val Gln Phe 130 135 140 Gln Ser Gly Gly Asn Asn Ser Pro Ala Val Tyr Leu Leu Asp Gly Leu 145 150 155 160 Arg Ala Gln Asp Asp Tyr Asn Gly Trp Asp Ile Asn Thr Pro Ala Phe 165 170 175 Glu Trp Tyr Tyr Gln Ser Gly Leu Ser Ile Val Met Pro Val Gly Gly 180 185 190 Gln Ser Ser Phe Tyr Ser Asp Trp Tyr Ser Pro Ala Cys Gly Lys Ala 195 200 205 Gly Cys Gln Thr Tyr Lys Trp Glu Thr Phe Leu Thr Ser Glu Leu Pro 210 215 220 Gln Trp Leu Ser Ala Asn Arg Ala Val Lys Pro Thr Gly Ser Ala Ala 225 230 235 240 Ile Gly Leu Ser Met Ala Gly Ser Ser Ala Met Ile Leu Ala Ala Tyr 245 250 255 His Pro Gln Gln Phe Ile Tyr Ala Gly Ser Leu Ser Ala Leu Leu Asp 260 265 270 Pro Ser Gln Gly Met Gly Pro Ser Leu Ile Gly Leu Ala Met Gly Asp 275 280 285 Ala Gly Gly Tyr Lys Ala Ala Asp Met Trp Gly Pro Ser Ser Asp Pro 290 295 300 Ala Trp Glu Arg Asn Asp Pro Thr Gln Gln Ile Pro Lys Leu Val Ala 305 310 315 320 Asn Asn Thr Arg Leu Trp Val Tyr Cys Gly Asn Gly Thr Pro Asn Glu 325 330 335 Leu Gly Gly Ala Asn Ile Pro Ala Glu Phe Leu Glu Asn Phe Val Arg 340 345 350 Ser Ser Asn Leu Lys Phe Gln Asp Ala Tyr Asn Ala Ala Gly Gly His 355 360 365 Asn Ala Val Phe Asn Phe Pro Pro Asn Gly Thr His Ser Trp Glu Tyr 370 375 380 Trp Gly Ala Gln Leu Asn Ala Met Lys Gly Asp Leu Gln Ser Ser Leu 385 390 395 400 Gly Ala Gly
Claims (47)
1. A substantially pure or isolated polypeptide which
a) consists of the amino acid sequence as shown in SEQ ID NO: 88, or
b) consists essentially of the amino acid sequence shown in SEQ ID NO: 88, and is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex, or
c) consists essentially of an amino acid sequence with a sequence identity of at least 80% with SEQ ID NO: 88, and which is at least 6 contiguous amino acid residues of SEQ ID NO: 88, and is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex; wherein “sequence identity” is a measure of the degree of similarity between two amino acid sequences of equal length calculated as (Nref−Ndif)*100/Nref, wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences.
2. A substantially pure or isolated polypeptide which consists essentially of:
a) at least 6 contiguous amino acid residues of SEQ ID NO:88, of
b) an amino acid sequence with a sequence identity of at least 80% with SEQ ID NO. 88, or
c) an amino acid sequence with a sequence identity of at least 80% with a);
wherein the polypeptide is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex; wherein “sequence identity” is a measure of the degree of similarity between two amino acid sequences of equal length calculated as (Nref−Ndif)*100/Nref, wherein Ndif is the total number of nonidentical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences.
3. A substantially pure or isolated polypeptide which consists of the amino acid equence as shown in SEQ ID NO: 88.
4. A substantially pure or isolated polypeptide which consists essentially of the amino acid sequence as shown in SEQ ID NO: 88.
5. A substantially pure or isolated polypeptide which consists essentially of an amino acid sequence with a sequence identity of at least 80% with SEQ ID NO: 88, and which is at least 6 contiguous amino acid residues of SEQ ID NO: 88 wherein the polypeptide isimmunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex; wherein “sequence identity” is a measure of the degree of similarity between two amino acid sequences of equal length calculated as (Nref−Ndif)*100/Nref, wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences.
6. A substantially pure or isolated polypeptide which consists essentially of at least 6 contiguous amino acid residues of SEQ ID NO: 88, wherein the polypeptide is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex.
7. The polypeptide of claim 1 or 2 consisting essentially of a T cell epitope of SEQ ID NO: 88 that is a non-naturally occurring polypeptide that induces a release of IFN-γ from primed memory T-lymphocytes withdrawn from a mouse within 2 weeks of primary infection or within 4 days after the mouse has been re-challenge infected with mycobacteria belonging to the tuberculosis complex, the induction performed by the addition of the polypeptide of a suspension comprising about 200,000 spleen cells per ml, the addition of the polypeptide resulting in a concentration of 1-4 μg polypeptide per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 2 days after the addition of the polypeptide to the suspension, and elicits a delayed type hypersensitivity reaction.
8. A substantially pure or isolated polypeptide which consists essentially of an amino acid sequence with a sequence identity of at least 80% with SEQ ID NO: 88, or wherein the polypeptide is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex; wherein “sequence identity” is a measure of the degree of similarity between two amino acid sequences of equal length calculated as (Nref−Ndif)*100/Nref, wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences.
9. A substantially pure or isolated polypeptide which consists essentially of a first amino acid sequence with a sequence identity of at least 80% with a second amino acid sequence that consists essentially of at least 6 contiguous amino acid residues of SEQ ID NO: 88, wherein the polypeptide is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex; wherein “sequence identity” is a measure of the degree of similarity between two amino acid sequences of equal length calculated as (Nref−Ndif)*100/Nref, wherein Ndif is the total number of non-identical residues in the two sequences when aligned and wherein Nref is the number of residues in one of the sequences.
10. The polypeptide according to any one of claims 1 or 2 in essentially pure form.
11. The polypeptide according to any one of claims 1 or 2 which consists essentially of an epitope for a T-helper cell.
12. A substantially pure or isolated polypeptide which consists essentially of at least 7 contiguous amino acid residues of SEQ ID NO: 88 and is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex.
13. A substantially pure or isolated polypeptide which consists essentially of at least 12 contiguous amino acid residues of SEQ ID NO: 88 and is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex.
14. A substantially pure or isolated polypeptide which consists essentially of at least 20 contiguous amino acid residues of SEQ ID NO: 88 and is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex.
15. A substantially pure or isolated polypeptide which consists essentially of at least 30 contiguous amino acid residues of SEQ ID NO: 88 and is immunologically equivalent to the amino acid sequence shown in SEQ ID NO: 88 with respect to the ability of evoking a protective immune response in mice against infections with mycobacteria belonging to the tuberculosis complex or with respect to the ability of eliciting a diagnostically significant immune response indicating previous or ongoing sensitization with antigens derived from mycobacteria belonging to the tuberculosis complex.
16. The polypeptide according to any one of claims 1 or 2 which is free from any signal sequence.
17. The polypeptide according to any one of claims 1 or 2 which
a) induces a release of IFN-γ from primed memory T-lymphocytes withdrawn from a mouse within 2 weeks of primary infection or within 4 days after the mouse has been re-challenge infected with mycobacteria belonging to the tuberculosis complex, the induction performed by the addition of the polypeptide to a suspension comprising about 200,000 spleen cells per ml, the addition of the polypeptide resulting in a concentration of 1-4 μg polypeptide per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 2 days after the addition of the polypeptide to the suspension, and/or
b) induces a release of IFN-γ of at least 300 pg above background level from about 1000,000 human peripheral blood mononuclear cells per ml isolated from TB patients in the first phase of infection, or from healthy BCG vaccinated donors, or from healthy contacts to TB patients, the induction being performed by the addition of the polypeptide to a suspension comprising the about 1,000,000 PBMC per ml, the addition of the polypeptide resulting in a concentration of 1-4 μg polypeptide per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 2 days after the addition of the polypeptide to the suspension; and/or
c) induces an IFN-γ release from bovine PBMC derived from animals previously sensitized with mycobacteria belonging to the tuberculosis complex, said release being at least two times the release observed from bovine PBMC derived from animals not previously sensitized with mycobacteria belonging to the tuberculosis complex.
18. The polypeptide according to any one of claims 1 or 2, wherein the sequence identity is at least 85%.
19. The polypeptide according to claim 18 , wherein the sequence identity is at least 90%, or at least 91%, or at least 92%, or at least 93%, or at least 94%, or at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99%, or at least 99.5%.
20. The polypeptide according to claim 18 , wherein the sequence identity is at least 95%.
21. The polypeptide according to claim 20 , wherein the sequence identity is at least 96%, or at least 97%, or at least 98%, or at least 99%, or at least 99.5%.
22. A fusion polypeptide comprising at least one polypeptide according to any of claims 1 or 2 and at least one fusion partner.
23. A fusion polypeptide, consisting essentially of at least one polypeptide according to any one of claims 1 or 2 and at least one fusion partner selected from the group consisting of ESAT-6, at least one T-cell epitope of ESAT-6, MPB64, at least one T-cell epitope of MPB64, MPT64 at least one T-cell epitope of MPT64, and MPB59 and at least one T-cell epitope of MPB59.
24. The polypeptide according to any one of claims 1 or 2 which is lipidated.
25. A composition comprising a polypeptide according to any one of claims 1 or 2 and pharmaceutically acceptable carrier, vehicle or adjuvant.
26. An immunological composition comprising a polypeptide according to any one of claims 1 or 2.
27. The immunological composition according to claim 26 , further comprising an immunologically and pharmaceutically acceptable carrier, vehicle or adjuvant.
28. The immunological composition according to claim 27 , wherein the carrier is a polymer to which the polypeptide(s) is/are bound by hydrophobic non-covalent interaction; the vehicle is selected from the group consisting of a diluent and a suspending agent; and the adjuvant is Freund's incomplete adjuvant.
29. An immunological composition comprising at least two different polypeptides according to any one of claims 1 or 2.
30. An immunological composition comprising 3-20 different polypeptides according to any one of claims 1 or 2.
31. A skin test reagent comprising the immunological composition of claim 26 .
32. A composition for diagnosing tuberculosis in an animal, including a human being, comprising a polypeptide according to any one of claims 1 or 2 optionally in combination with a means for detection.
33. A fusion polypeptide comprising at least one polypeptide according to any one of claims 3 or 4 and at least one fusion partner.
34. A fusion polypeptide, consisting essentially of at least one polypeptide according to any one of claims 3 or 4 and at least one fusion partner selected from the group consisting of ESAT-6, at least one T-cell epitope of ESAT-6, MPB64, at least one T-cell epitope of MPB64, MPT64 at least one T-cell epitope of MPT64, and MPB59 and at least one T-cell epitope of MPB59.
35. The polypeptide according to any one of claims 3 or 4 which is lipidated.
36. A composition comprising a polypeptide according to any one of claims 3 or 4 and pharmaceutically acceptable carrier, vehicle or adjuvant.
37. An immunological composition comprising a polypeptide according to any one of claims 3 or 4.
38. The immunological compositions according to claim 37 , further comprising an immunologically and pharmaceutically acceptable carrier, vehicle or. adjuvant.
39. The immunological composition according to claim 38 , wherein the carrier is a polymer to which the polypeptide(s) is/are bound by hydrophobic non-covalent interaction; the vehicle is selected from the group consisting of a diluent and a suspending agent; and the adjuvant is Freund's incomplete adjuvant.
40. An immunological composition comprising at least two different polypeptides according to claim 4 .
41. An immunological composition comprising 3-20 different polypeptides according to claim 4 .
42. A skin test reagent comprising the immunological composition of claim 37 .
43. A composition for diagnosing tuberculosis in an animal, including a human being, comprising a polypeptide according to any one of claims 3 or 4 optionally in combination with a means for detection.
44. A diagnostic tool comprising a combination of two or more substantially pure polypeptides, of which one or more comprises one or more amino acid sequences selected from
(a) SEQ ID NO: 88;
(b) an immunogenic portion of the sequence in (a); and /or
(c) an amino acid sequence analogue having at least 70% sequence identity to any one of the sequences in (a) or (b) and at the same time being immunogenic.
45. A serodiagnostic composition comprising a combination of two or more substantially pure polypeptides, of which one or more comprises one or more amino acid sequences selected from
(a) SEQ ID NO: 88;
(b) an immunogenic portion of the sequence in (a); and /or
(c) an amino acid sequence analogue having at least 70% sequence identity to any one of the sequences in (a) or (b) and at the same time being immunogenic.
46. A composition according to claim 32 or claim 43 , which further comprises one or more amino acid sequences selected from the group consisting of:
d) an amino acid sequence selected from the sequences shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72-86, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, and 168-171;
e) an immunogenic portion of any one of the sequences in (d); and
f) an amino acid sequence analogue having at least 70% sequence identity to any one of the sequences in (d) or (e) and at the same time being immunogenic.
47. A fusion protein according to claim 22 , comprising as a fusion partner a polypeptide which comprises one or more amino acid sequences selected from the group consisting of:
a) an amino acid sequence selected from the sequences shown in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 17-23, 42, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72-86, 90, 92, 94, 141, 143, 145, 147, 149, 151, 153, and 168-171;
b) a polypeptide fragment derived from a virulent mycobacterium, such as ESAT-6, MPB64, MPT64, TB10.4, CFP10, RD1-ORF5, RD1-ORF2, Rv1036, Ag85A, Ag85B, Ag85C, 19 kDa lipoprotein, MPT32, MPB59 and alpha-crystallin;
c) an immunogenic portion of any one of the sequences in (a) or (b); and
d) an amino acid sequence analogue having at least 70% sequence identity to any one of the sequences in (a), (b), or (c) and at the same time being immunogenic.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/620,246 US20040115211A1 (en) | 1997-04-02 | 2003-07-15 | TB diagnostic based on antigens from M. tuberculosis |
Applications Claiming Priority (10)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DKDK199700376 | 1997-04-02 | ||
DK37697 | 1997-04-02 | ||
US4462497P | 1997-04-18 | 1997-04-18 | |
DK127797 | 1997-11-10 | ||
DKDK199701277 | 1997-11-10 | ||
US7048898P | 1998-01-05 | 1998-01-05 | |
US09/050,739 US6641814B1 (en) | 1997-04-02 | 1998-03-30 | Nucleic acids fragments and polypeptide fragments derived from M. tuberculosis |
DKPA199801281 | 1998-10-08 | ||
DKDK199801281 | 1998-10-08 | ||
US10/620,246 US20040115211A1 (en) | 1997-04-02 | 2003-07-15 | TB diagnostic based on antigens from M. tuberculosis |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/050,739 Continuation-In-Part US6641814B1 (en) | 1993-07-02 | 1998-03-30 | Nucleic acids fragments and polypeptide fragments derived from M. tuberculosis |
Publications (1)
Publication Number | Publication Date |
---|---|
US20040115211A1 true US20040115211A1 (en) | 2004-06-17 |
Family
ID=46846876
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/138,473 Expired - Fee Related US6982085B2 (en) | 1997-04-02 | 2002-05-02 | TB diagnostic based on antigens from M. tuberculosis |
US10/620,246 Abandoned US20040115211A1 (en) | 1997-04-02 | 2003-07-15 | TB diagnostic based on antigens from M. tuberculosis |
US11/196,018 Abandoned US20070184073A1 (en) | 1997-04-02 | 2005-08-02 | TB diagnostic based on antigens from M. tuberculosis |
US11/934,048 Expired - Fee Related US8076469B2 (en) | 1997-04-02 | 2007-11-01 | TB diagnostic based on antigens from M. tuberculosis |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/138,473 Expired - Fee Related US6982085B2 (en) | 1997-04-02 | 2002-05-02 | TB diagnostic based on antigens from M. tuberculosis |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/196,018 Abandoned US20070184073A1 (en) | 1997-04-02 | 2005-08-02 | TB diagnostic based on antigens from M. tuberculosis |
US11/934,048 Expired - Fee Related US8076469B2 (en) | 1997-04-02 | 2007-11-01 | TB diagnostic based on antigens from M. tuberculosis |
Country Status (1)
Country | Link |
---|---|
US (4) | US6982085B2 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060024332A1 (en) * | 2004-08-02 | 2006-02-02 | Waters Wade R | Recombinant ESAT-6:CFP-10 fusion protein useful for specific diagnosis of tuberculosis |
US7632646B1 (en) * | 1998-11-04 | 2009-12-15 | Isis Innovation Limited | Tuberculosis diagnostic test |
US20100008955A1 (en) * | 2006-09-14 | 2010-01-14 | Ajit Lalvani | Method and kit for detecting if an individual is susceptible to progress to an active mycobacterial disease |
EP2385371A2 (en) | 2008-09-22 | 2011-11-09 | Oregon Health and Science University | Methods for detecting a mycobacterium tuberculosis infection |
US20120039925A1 (en) * | 2009-04-24 | 2012-02-16 | Statens Serum Institut | Tuberculosis tb vaccine to prevent reactivation |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2231037B1 (en) * | 2003-10-31 | 2005-12-16 | Archivel Technologies, Sl | USEFUL IMMUNOTHERAPIC AGENT FOR THE COMBINED TREATMENT OF TUBERCULOSIS IN ASSOCIATION WITH OTHER PHARMACOS. |
BRPI0811796A2 (en) * | 2007-06-26 | 2020-04-28 | Statens Serum Institut | USE OF A MONOMICOLYL GLYCEROL (MMG) OR SYNTHETIC HOMOLOGISTS OR ANALOGS AND MODIFIED VERSIONS, ADJUVANT OR IMMUNOMODULATOR, HOMOLOGIST OR SYNTHETIC MMG ANALOG OR A MODIFIED VERSION, VACCINE AND APPLICATION SYSTEM |
US20100015171A1 (en) * | 2008-07-15 | 2010-01-21 | Statens Serum Institute | Vaccines comprising tb 10.4 |
ES2335177B1 (en) * | 2008-09-19 | 2011-02-28 | Archivel Farma, S.L. | PROPER IMMUNOTHERAPEUTIC AGENT FOR THE PRIMARY PROFILAXIS OF TUBERCULOSIS. |
AU2010277222A1 (en) * | 2009-07-29 | 2012-03-08 | Bryce Malcolm Buddle | Polymer particles and uses thereof |
WO2011028897A1 (en) * | 2009-09-03 | 2011-03-10 | Ordway Research Institute, Inc. | Methods for identifying a virulent strain of virus |
WO2011063263A2 (en) | 2009-11-20 | 2011-05-26 | Oregon Health & Science University | Methods for producing an immune response to tuberculosis |
GB201007075D0 (en) | 2010-04-28 | 2010-06-09 | Sec Dep For Environment Food A | Diagnostic reagents |
KR101087033B1 (en) | 2010-05-18 | 2011-11-25 | 부산대학교 산학협력단 | Maturation method using m. tuberculosis rv0652 protein for dendritic cell |
WO2012164088A1 (en) * | 2011-06-03 | 2012-12-06 | Universite Montpellier 2 Sciences Et Techniques | Diagnosis method of active tuberculosis |
MX347154B (en) | 2012-02-24 | 2017-02-21 | Centro De Investigación Científica Y De Educación Superior De Ensenada Baja California (Cicese) | Diagnostic test for infectious diseases in cattle. |
WO2014207708A2 (en) | 2013-06-28 | 2014-12-31 | Auckland Uniservices Limited | Amino acid and peptide conjugates and conjugation process |
MX2019003288A (en) * | 2016-09-22 | 2019-11-05 | Pace Diagnostics Inc | Mycobacterium tuberculosis proteins in diagnostic assays and devices for tuberculosis detection and diagnosis. |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4891315A (en) * | 1982-10-25 | 1990-01-02 | American Cyanamid Company | Production of herpes simplex viral porteins |
ATE60999T1 (en) * | 1986-12-19 | 1991-03-15 | Duphar Int Res | STABILIZED ADJUVANT SUSPENSION CONTAINING DIMETHYLDIOCTADECYLAMMONIUM BROMIDE. |
US4952395A (en) * | 1987-02-26 | 1990-08-28 | Scripps Clinic And Research Foundation | Mycobacterial recombinants and peptides |
DE3728153A1 (en) * | 1987-08-24 | 1989-03-16 | Kozierowski Joachim Von | Vertical launch and landing missile |
EP0499003A1 (en) | 1991-02-14 | 1992-08-19 | N.V. Innogenetics S.A. | Polypeptides and peptides, particularly recombinant polypeptides and peptides, nucleic acids coding for the same and use of these polypeptides and peptides in the diagnosis of tuberculosis |
US5330754A (en) * | 1992-06-29 | 1994-07-19 | Archana Kapoor | Membrane-associated immunogens of mycobacteria |
DK79893D0 (en) | 1993-07-02 | 1993-07-02 | Statens Seruminstitut | NEW VACCINE |
ATE407945T1 (en) | 1993-11-23 | 2008-09-15 | Univ California | ABUNDANT EXTRACELLULAR PRODUCTS AND METHODS FOR THEIR PRODUCTION AND THEIR USE |
BR9608894A (en) | 1995-05-23 | 1999-12-07 | Univ California | Abundant extracellular products and processes for their production and uses |
DK1203817T3 (en) | 1995-09-01 | 2006-08-14 | Corixa Corp | Compounds and Methods for Immunotherapy and Diagnosis of Tuberculosis |
EP0850305B1 (en) | 1995-09-01 | 2006-04-26 | Corixa Corporation | Compounds and methods for diagnosis of tuberculosis |
US6290969B1 (en) | 1995-09-01 | 2001-09-18 | Corixa Corporation | Compounds and methods for immunotherapy and diagnosis of tuberculosis |
WO1998016645A2 (en) | 1996-10-11 | 1998-04-23 | Corixa Corporation | Compounds and methods for diagnosis of tuberculosis |
DK1449922T3 (en) | 1997-04-02 | 2007-12-03 | Statens Seruminstitut | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis |
US6555653B2 (en) | 1997-05-20 | 2003-04-29 | Corixa Corporation | Compounds for diagnosis of tuberculosis and methods for their use |
US6613881B1 (en) | 1997-05-20 | 2003-09-02 | Corixa Corporation | Compounds for immunotherapy and diagnosis of tuberculosis and methods of their use |
-
2002
- 2002-05-02 US US10/138,473 patent/US6982085B2/en not_active Expired - Fee Related
-
2003
- 2003-07-15 US US10/620,246 patent/US20040115211A1/en not_active Abandoned
-
2005
- 2005-08-02 US US11/196,018 patent/US20070184073A1/en not_active Abandoned
-
2007
- 2007-11-01 US US11/934,048 patent/US8076469B2/en not_active Expired - Fee Related
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7632646B1 (en) * | 1998-11-04 | 2009-12-15 | Isis Innovation Limited | Tuberculosis diagnostic test |
US7901898B2 (en) | 1998-11-04 | 2011-03-08 | Isis Innovation Limited | Tuberculosis diagnostic test |
US20110201044A1 (en) * | 1998-11-04 | 2011-08-18 | Isis Innovation Limited | Tuberculosis diagnostic test |
US8216795B2 (en) | 1998-11-04 | 2012-07-10 | Isis Innovation Limited | Tuberculosis diagnostic test |
US8507211B2 (en) | 1998-11-04 | 2013-08-13 | Isis Innovation Limtied | Tuberculosis diagnostic test |
US9005902B2 (en) | 1998-11-04 | 2015-04-14 | Oxford Immunotec Limited | Tuberculosis diagnostic test |
US20060024332A1 (en) * | 2004-08-02 | 2006-02-02 | Waters Wade R | Recombinant ESAT-6:CFP-10 fusion protein useful for specific diagnosis of tuberculosis |
US20100008955A1 (en) * | 2006-09-14 | 2010-01-14 | Ajit Lalvani | Method and kit for detecting if an individual is susceptible to progress to an active mycobacterial disease |
EP2385371A2 (en) | 2008-09-22 | 2011-11-09 | Oregon Health and Science University | Methods for detecting a mycobacterium tuberculosis infection |
US20120039925A1 (en) * | 2009-04-24 | 2012-02-16 | Statens Serum Institut | Tuberculosis tb vaccine to prevent reactivation |
US9074001B2 (en) * | 2009-04-24 | 2015-07-07 | Statens Serum Institut | Tuberculosis TB vaccine to prevent reactivation |
US10519202B2 (en) | 2009-04-24 | 2019-12-31 | Statens Serum Institut | Tuberculosis TB vaccine to prevent reactivation |
Also Published As
Publication number | Publication date |
---|---|
US20030165525A1 (en) | 2003-09-04 |
US20080267990A1 (en) | 2008-10-30 |
US8076469B2 (en) | 2011-12-13 |
US20070184073A1 (en) | 2007-08-09 |
US6982085B2 (en) | 2006-01-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6641814B1 (en) | Nucleic acids fragments and polypeptide fragments derived from M. tuberculosis | |
AU750173B2 (en) | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis | |
US20030147897A1 (en) | M. tuberculosis antigens | |
CA2165949C (en) | Tuberculosis vaccine | |
JP4544434B2 (en) | Nucleic acid fragments and polypeptide fragments derived from Mycobacterium tuberculosis | |
US5955077A (en) | Tuberculosis vaccine | |
US8076469B2 (en) | TB diagnostic based on antigens from M. tuberculosis | |
AU766093B2 (en) | Tuberculosis vaccine and diagnostic reagents based on antigens from the mycobacterium tuberculosis cell | |
US20040013685A1 (en) | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis | |
EP1484405A1 (en) | Nucleic acid fragments and polypeptide fragments derived from M. Tuberculosis | |
AU2011200353A1 (en) | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis | |
EP1787994A1 (en) | TB vaccine and diagnostic based on antigens from M. tuberculosis cell |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: STATENS SERUM INSTIUT, DENMARK Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ANDERSEN, PETER;SKJOT, RIKKE;OETTINGER, THOMAS;AND OTHERS;REEL/FRAME:015004/0877;SIGNING DATES FROM 20030715 TO 20030906 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |