KR102624718B1 - 변이형 rpoc 암호화 서열을 포함하는 핵산 분자 - Google Patents
변이형 rpoc 암호화 서열을 포함하는 핵산 분자 Download PDFInfo
- Publication number
- KR102624718B1 KR102624718B1 KR1020217006910A KR20217006910A KR102624718B1 KR 102624718 B1 KR102624718 B1 KR 102624718B1 KR 1020217006910 A KR1020217006910 A KR 1020217006910A KR 20217006910 A KR20217006910 A KR 20217006910A KR 102624718 B1 KR102624718 B1 KR 102624718B1
- Authority
- KR
- South Korea
- Prior art keywords
- leu
- glu
- ala
- gly
- val
- Prior art date
Links
- 108091026890 Coding region Proteins 0.000 title claims abstract description 65
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 39
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 37
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 37
- 239000013612 plasmid Substances 0.000 claims abstract description 139
- 244000005700 microbiome Species 0.000 claims abstract description 92
- 101150042391 rpoC gene Proteins 0.000 claims abstract description 91
- 101100145480 Prochlorococcus marinus (strain SARG / CCMP1375 / SS120) rpoC2 gene Proteins 0.000 claims abstract description 87
- 101150109946 rpo1C gene Proteins 0.000 claims abstract description 87
- 101150103066 rpoC1 gene Proteins 0.000 claims abstract description 87
- 239000013598 vector Substances 0.000 claims abstract description 71
- 238000004519 manufacturing process Methods 0.000 claims abstract description 42
- 238000000034 method Methods 0.000 claims abstract description 38
- 108090000623 proteins and genes Proteins 0.000 claims description 136
- 241000588724 Escherichia coli Species 0.000 claims description 63
- 102000004169 proteins and genes Human genes 0.000 claims description 60
- 238000006467 substitution reaction Methods 0.000 claims description 53
- 241000894006 Bacteria Species 0.000 claims description 43
- 102200007101 rs34049451 Human genes 0.000 claims description 38
- 102000004190 Enzymes Human genes 0.000 claims description 37
- 108090000790 Enzymes Proteins 0.000 claims description 37
- 239000002243 precursor Substances 0.000 claims description 37
- 230000010076 replication Effects 0.000 claims description 30
- 210000004899 c-terminal region Anatomy 0.000 claims description 28
- 210000000349 chromosome Anatomy 0.000 claims description 28
- 101710185074 DNA-directed RNA polymerase subunit beta' Proteins 0.000 claims description 23
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 19
- 125000003729 nucleotide group Chemical group 0.000 claims description 15
- 229920000642 polymer Polymers 0.000 claims description 15
- 239000002773 nucleotide Substances 0.000 claims description 14
- 239000001963 growth medium Substances 0.000 claims description 13
- 239000000825 pharmaceutical preparation Substances 0.000 claims description 9
- 229940127557 pharmaceutical product Drugs 0.000 claims description 9
- 229960005486 vaccine Drugs 0.000 claims description 9
- 150000001413 amino acids Chemical class 0.000 claims description 8
- 235000003599 food sweetener Nutrition 0.000 claims description 8
- 150000004676 glycans Chemical class 0.000 claims description 8
- 229920001282 polysaccharide Polymers 0.000 claims description 8
- 239000005017 polysaccharide Substances 0.000 claims description 8
- 239000003765 sweetening agent Substances 0.000 claims description 8
- 241000588722 Escherichia Species 0.000 claims description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 7
- 238000012258 culturing Methods 0.000 claims description 7
- 229920001222 biopolymer Polymers 0.000 claims description 4
- 230000021615 conjugation Effects 0.000 claims description 4
- 230000026683 transduction Effects 0.000 claims description 4
- 238000010361 transduction Methods 0.000 claims description 4
- 230000009466 transformation Effects 0.000 claims description 4
- 239000012620 biological material Substances 0.000 claims description 3
- 108020004414 DNA Proteins 0.000 description 54
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 46
- 108010047857 aspartylglycine Proteins 0.000 description 45
- 239000000047 product Substances 0.000 description 43
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 42
- 108010049041 glutamylalanine Proteins 0.000 description 42
- 235000018102 proteins Nutrition 0.000 description 42
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 41
- 108010050848 glycylleucine Proteins 0.000 description 37
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 35
- 108010064235 lysylglycine Proteins 0.000 description 35
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 34
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 34
- 108010005233 alanylglutamic acid Proteins 0.000 description 33
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 33
- 108010061238 threonyl-glycine Proteins 0.000 description 32
- 108010015792 glycyllysine Proteins 0.000 description 29
- 239000012634 fragment Substances 0.000 description 28
- 108010047495 alanylglycine Proteins 0.000 description 27
- 108010073969 valyllysine Proteins 0.000 description 27
- 108010079364 N-glycylalanine Proteins 0.000 description 26
- 238000003752 polymerase chain reaction Methods 0.000 description 26
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 25
- 108010034529 leucyl-lysine Proteins 0.000 description 25
- 108010013835 arginine glutamate Proteins 0.000 description 23
- 108010062796 arginyllysine Proteins 0.000 description 23
- 108010038633 aspartylglutamate Proteins 0.000 description 21
- 108010078144 glutaminyl-glycine Proteins 0.000 description 21
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 20
- 241000880493 Leptailurus serval Species 0.000 description 20
- 108010044940 alanylglutamine Proteins 0.000 description 20
- 108010054813 diprotin B Proteins 0.000 description 20
- 108010017391 lysylvaline Proteins 0.000 description 20
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 19
- 108010012581 phenylalanylglutamate Proteins 0.000 description 19
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 18
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 18
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 18
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 17
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 17
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 17
- 108010068265 aspartyltyrosine Proteins 0.000 description 17
- 108010009298 lysylglutamic acid Proteins 0.000 description 17
- 108010026333 seryl-proline Proteins 0.000 description 17
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 16
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 16
- 108010041407 alanylaspartic acid Proteins 0.000 description 16
- 210000004027 cell Anatomy 0.000 description 16
- 108010077515 glycylproline Proteins 0.000 description 16
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 15
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 15
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 15
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 15
- 108010054155 lysyllysine Proteins 0.000 description 15
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 14
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 14
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 14
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 14
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 14
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 14
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 13
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 13
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 13
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 13
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 13
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 13
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 13
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 13
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 13
- 108010037850 glycylvaline Proteins 0.000 description 13
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 12
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 12
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 12
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 12
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 12
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 12
- 108010065920 Insulin Lispro Proteins 0.000 description 12
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 12
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 12
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 12
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 12
- 108010016616 cysteinylglycine Proteins 0.000 description 12
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 230000009467 reduction Effects 0.000 description 12
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 11
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 11
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 11
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 11
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 11
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 11
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 11
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 11
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 11
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 11
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 11
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 11
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 11
- 235000001014 amino acid Nutrition 0.000 description 11
- 238000013459 approach Methods 0.000 description 11
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 11
- 108010053037 kyotorphin Proteins 0.000 description 11
- 101150066555 lacZ gene Proteins 0.000 description 11
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 11
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 10
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 10
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 10
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 10
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 10
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 10
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 10
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 10
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 10
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 10
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 10
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 10
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 10
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 10
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 10
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 10
- 108010047562 NGR peptide Proteins 0.000 description 10
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 10
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 10
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 10
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 10
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 10
- 108010008355 arginyl-glutamine Proteins 0.000 description 10
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 10
- 238000010276 construction Methods 0.000 description 10
- 108010085325 histidylproline Proteins 0.000 description 10
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 10
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 10
- 108010031719 prolyl-serine Proteins 0.000 description 10
- 101150025220 sacB gene Proteins 0.000 description 10
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 9
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 9
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 9
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 9
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 9
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 9
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 9
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 9
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 9
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 9
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 9
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 9
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 9
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 9
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 9
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 9
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 9
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 9
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 9
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 9
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 9
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 9
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 9
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 9
- 108010036533 arginylvaline Proteins 0.000 description 9
- 108010024607 phenylalanylalanine Proteins 0.000 description 9
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 9
- 108010029020 prolylglycine Proteins 0.000 description 9
- 108010015796 prolylisoleucine Proteins 0.000 description 9
- 108010053725 prolylvaline Proteins 0.000 description 9
- 108010071207 serylmethionine Proteins 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 8
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 8
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 8
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 8
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 8
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 8
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 8
- 108010090461 DFG peptide Proteins 0.000 description 8
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 8
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 8
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 8
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 8
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 8
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 8
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 8
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 8
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 8
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 8
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 8
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 8
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 8
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 8
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 8
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 8
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 8
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 8
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 8
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 8
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 8
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 8
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 8
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 8
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 8
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 8
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 8
- ZTVCLZLGHZXLOT-ULQDDVLXSA-N Pro-Glu-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O ZTVCLZLGHZXLOT-ULQDDVLXSA-N 0.000 description 8
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 8
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 8
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 8
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 8
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 8
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 8
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 8
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 8
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 8
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 8
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 8
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 8
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 8
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 8
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 8
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 8
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 8
- 229940024606 amino acid Drugs 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 230000001276 controlling effect Effects 0.000 description 8
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 8
- 108010036413 histidylglycine Proteins 0.000 description 8
- 108010085203 methionylmethionine Proteins 0.000 description 8
- 108010004914 prolylarginine Proteins 0.000 description 8
- 108010051110 tyrosyl-lysine Proteins 0.000 description 8
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 7
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 7
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 7
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 7
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 7
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 7
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 7
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 7
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Natural products CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 7
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 7
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 7
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 7
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 7
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 7
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 7
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 7
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 7
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 7
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 7
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 7
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 7
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 7
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 7
- 244000063299 Bacillus subtilis Species 0.000 description 7
- 235000014469 Bacillus subtilis Nutrition 0.000 description 7
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 7
- 102000053602 DNA Human genes 0.000 description 7
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 7
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 7
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 7
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 7
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 7
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 7
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 7
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 7
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 7
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 7
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 7
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 7
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 7
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 7
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 7
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 7
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 7
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 7
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 7
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 7
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 7
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 7
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 7
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 7
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 7
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 7
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 7
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 7
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 7
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 7
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 7
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 7
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 7
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 7
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 7
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 7
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 7
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 7
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 7
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 7
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 7
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 7
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 7
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 7
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 7
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 7
- GBRUQFBAJOKCTF-DCAQKATOSA-N Pro-His-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O GBRUQFBAJOKCTF-DCAQKATOSA-N 0.000 description 7
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 7
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 7
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 7
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 7
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 7
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 7
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 7
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 7
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 7
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 7
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 7
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 7
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 7
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 7
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 7
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 7
- 125000000539 amino acid group Chemical group 0.000 description 7
- 239000003925 fat Substances 0.000 description 7
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 7
- 108010091871 leucylmethionine Proteins 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 230000000813 microbial effect Effects 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- 241000589212 Acetobacter pasteurianus Species 0.000 description 6
- 241000186066 Actinomyces odontolyticus Species 0.000 description 6
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 6
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 6
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 6
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 6
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 6
- KEZVOBAKAXHMOF-GUBZILKMSA-N Arg-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N KEZVOBAKAXHMOF-GUBZILKMSA-N 0.000 description 6
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 6
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 6
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 6
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 6
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 6
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 6
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 6
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 6
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 6
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 6
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 6
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 6
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 6
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 6
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 6
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 6
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 6
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 6
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 6
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 6
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 6
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 6
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 6
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 6
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 6
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 6
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 6
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 6
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 6
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 6
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 6
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 6
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 6
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 6
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 6
- 241000589242 Legionella pneumophila Species 0.000 description 6
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 6
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 6
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 6
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 6
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 6
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 6
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 6
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 6
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 6
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 6
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 6
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 6
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 6
- 241000588653 Neisseria Species 0.000 description 6
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 6
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 6
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 6
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 6
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 6
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 6
- 101100309436 Streptococcus mutans serotype c (strain ATCC 700610 / UA159) ftf gene Proteins 0.000 description 6
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 6
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 6
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 6
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 6
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 6
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 6
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 6
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 6
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 6
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 6
- 108010091818 arginyl-glycyl-aspartyl-valine Proteins 0.000 description 6
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 229930027917 kanamycin Natural products 0.000 description 6
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 6
- 229960000318 kanamycin Drugs 0.000 description 6
- 229930182823 kanamycin A Natural products 0.000 description 6
- 229940115932 legionella pneumophila Drugs 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000003753 real-time PCR Methods 0.000 description 6
- 238000002864 sequence alignment Methods 0.000 description 6
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 5
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 5
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 5
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 5
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 5
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 5
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 5
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 5
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 5
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 5
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 5
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 5
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 5
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 5
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 5
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 5
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 5
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 5
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 5
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 5
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 5
- 241000606153 Chlamydia trachomatis Species 0.000 description 5
- 241000193155 Clostridium botulinum Species 0.000 description 5
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 5
- 241000194032 Enterococcus faecalis Species 0.000 description 5
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 5
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 5
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 5
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 5
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 5
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 5
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 5
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 5
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 5
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 5
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 5
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 5
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 5
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 5
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 5
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 5
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 5
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 5
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 5
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 5
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 5
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 5
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 5
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 5
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 5
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 5
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 5
- 240000001929 Lactobacillus brevis Species 0.000 description 5
- 235000013957 Lactobacillus brevis Nutrition 0.000 description 5
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 5
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 5
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 5
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 5
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 5
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 5
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 5
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 5
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 5
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 5
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 5
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 5
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 5
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 5
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 5
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 5
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 5
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 5
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 5
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 5
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 5
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 5
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 5
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 5
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 5
- IQAGKQWXVHTPOT-FHWLQOOXSA-N Pro-Lys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O IQAGKQWXVHTPOT-FHWLQOOXSA-N 0.000 description 5
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 5
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 5
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 5
- 238000011529 RT qPCR Methods 0.000 description 5
- 241000158504 Rhodococcus hoagii Species 0.000 description 5
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 5
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 5
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 5
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 5
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 5
- 241000193998 Streptococcus pneumoniae Species 0.000 description 5
- 241000187747 Streptomyces Species 0.000 description 5
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 5
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 5
- RIKLKPANMFNREP-FDARSICLSA-N Trp-Met-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 RIKLKPANMFNREP-FDARSICLSA-N 0.000 description 5
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 5
- 108010064997 VPY tripeptide Proteins 0.000 description 5
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 5
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 5
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 5
- 241000607626 Vibrio cholerae Species 0.000 description 5
- 238000000137 annealing Methods 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 229940038705 chlamydia trachomatis Drugs 0.000 description 5
- 230000002759 chromosomal effect Effects 0.000 description 5
- 238000004925 denaturation Methods 0.000 description 5
- 230000036425 denaturation Effects 0.000 description 5
- 229940032049 enterococcus faecalis Drugs 0.000 description 5
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 5
- 229940118696 vibrio cholerae Drugs 0.000 description 5
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 4
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 4
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 4
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 4
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 4
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 4
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 4
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 4
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 4
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 4
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 4
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 4
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 4
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 4
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 4
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 4
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 4
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 4
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 4
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 4
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 4
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 4
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 4
- DDNIZQDYXDENIT-FXQIFTODSA-N Gln-Glu-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DDNIZQDYXDENIT-FXQIFTODSA-N 0.000 description 4
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 4
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 4
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 4
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 4
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 4
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 4
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 4
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 4
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 4
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 4
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 4
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 4
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 4
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 4
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 4
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 4
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 4
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 4
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 4
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 4
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 4
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 4
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 4
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 4
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 4
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 4
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 4
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 4
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 4
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 4
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 4
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 4
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 4
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 4
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 4
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 4
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 4
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 4
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 4
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 4
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 4
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 4
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 4
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 4
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 4
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 4
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 4
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 4
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 4
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 4
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 4
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 4
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 4
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 4
- JQEBITVYKUCBMC-SRVKXCTJSA-N Met-Arg-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JQEBITVYKUCBMC-SRVKXCTJSA-N 0.000 description 4
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 4
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 4
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 4
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 4
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 4
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 4
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 4
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 4
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 4
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 4
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 4
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 4
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 4
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 4
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 4
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 4
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 4
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 4
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 4
- 241000589499 Thermus thermophilus Species 0.000 description 4
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 4
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 4
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 4
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 4
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 4
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 4
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 4
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 4
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 4
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 4
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 4
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 4
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 4
- KVMZNMYZCKORIG-UBHSHLNASA-N Trp-Cys-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KVMZNMYZCKORIG-UBHSHLNASA-N 0.000 description 4
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 4
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 4
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 4
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 4
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 4
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 4
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 4
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 4
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 4
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 4
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 4
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 4
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 4
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 4
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 210000004507 artificial chromosome Anatomy 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 238000010348 incorporation Methods 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 230000002503 metabolic effect Effects 0.000 description 4
- 238000002888 pairwise sequence alignment Methods 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 3
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 3
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 3
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 3
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 3
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 3
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 3
- RTDZQOFEGPWSJD-AVGNSLFASA-N Arg-Leu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O RTDZQOFEGPWSJD-AVGNSLFASA-N 0.000 description 3
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 3
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 3
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 3
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 3
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 3
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 3
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 3
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 3
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 3
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 3
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 3
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 3
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 3
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 3
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 3
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 3
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 3
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 3
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 3
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 3
- 108020004513 Bacterial RNA Proteins 0.000 description 3
- 238000007702 DNA assembly Methods 0.000 description 3
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 3
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 3
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 3
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 3
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 3
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 3
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 3
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 3
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 3
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 3
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 3
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 3
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 3
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 3
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 3
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 3
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 3
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 3
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 3
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 3
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 3
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 3
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 3
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 3
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 3
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 3
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 3
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 3
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 3
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 3
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 3
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 3
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 3
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 3
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 3
- NJZGEXYLSFGPHG-GUBZILKMSA-N His-Gln-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NJZGEXYLSFGPHG-GUBZILKMSA-N 0.000 description 3
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 3
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 3
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 3
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 3
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 3
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 3
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 3
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 3
- AYLAAGNJNVZDPY-CYDGBPFRSA-N Ile-Met-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N AYLAAGNJNVZDPY-CYDGBPFRSA-N 0.000 description 3
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 3
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 3
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 3
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 3
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 3
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 3
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 3
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 3
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 3
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 3
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 3
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 3
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 3
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 3
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 3
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 3
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 3
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 3
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 3
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 3
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 3
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 3
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 3
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 3
- GLUYKHMBGKQBHE-JYJNAYRXSA-N Phe-Val-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 GLUYKHMBGKQBHE-JYJNAYRXSA-N 0.000 description 3
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 3
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 3
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 3
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 3
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 3
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 3
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 3
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 3
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 3
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 3
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 3
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 3
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 3
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 3
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 3
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 3
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 3
- JWGRSJCYCXEIKH-QEJZJMRPSA-N Trp-Glu-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N JWGRSJCYCXEIKH-QEJZJMRPSA-N 0.000 description 3
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 3
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 3
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 3
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 3
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 3
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 3
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 3
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 3
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 3
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 3
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 3
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 3
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 3
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 3
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 3
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 3
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 3
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 3
- 108010087049 alanyl-alanyl-prolyl-valine Proteins 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- -1 for example Proteins 0.000 description 3
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 238000002887 multiple sequence alignment Methods 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 108700004896 tripeptide FEG Proteins 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 2
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 2
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 2
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 2
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 2
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 2
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 2
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 2
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 2
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 2
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 2
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 2
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 2
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 2
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 2
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 108020004638 Circular DNA Proteins 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 2
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 2
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 2
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- MRVYVEQPNDSWLH-XPUUQOCRSA-N Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O MRVYVEQPNDSWLH-XPUUQOCRSA-N 0.000 description 2
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 2
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 2
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 2
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 2
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 2
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 2
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 2
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 2
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 2
- XKJUFUPCHARJKX-UWVGGRQHSA-N Met-Gly-His Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 XKJUFUPCHARJKX-UWVGGRQHSA-N 0.000 description 2
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 2
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 2
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 2
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 2
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 2
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 2
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 2
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 241000186359 Mycobacterium Species 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- 241000588652 Neisseria gonorrhoeae Species 0.000 description 2
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 2
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 2
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 2
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 2
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 2
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 2
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 241000187432 Streptomyces coelicolor Species 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 241000053227 Themus Species 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 2
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 2
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 2
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 2
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 2
- VDCGPCSLAJAKBB-XIRDDKMYSA-N Trp-Ser-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VDCGPCSLAJAKBB-XIRDDKMYSA-N 0.000 description 2
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 2
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 2
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 2
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 2
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 2
- YYZPVPJCOGGQPC-JYJNAYRXSA-N Tyr-His-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYZPVPJCOGGQPC-JYJNAYRXSA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 102000005936 beta-Galactosidase Human genes 0.000 description 2
- 108010005774 beta-Galactosidase Proteins 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 210000000078 claw Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 108091036078 conserved sequence Proteins 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 238000004255 ion exchange chromatography Methods 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 244000000010 microbial pathogen Species 0.000 description 2
- 230000002906 microbiologic effect Effects 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 239000007320 rich medium Substances 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- XYWBPLHHAZLXAI-ASHKBJFXSA-N (2s)-2-[[(2s)-2-[[(2s)-4-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]-4-oxobutanoyl]amino]-3-carboxypropanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)C(C)C XYWBPLHHAZLXAI-ASHKBJFXSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 241000589220 Acetobacter Species 0.000 description 1
- 241000186046 Actinomyces Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- OARAZORWIMYUPO-FXQIFTODSA-N Ala-Met-Cys Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CS)C(O)=O OARAZORWIMYUPO-FXQIFTODSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- OAIGZYFGCNNVIE-ZPFDUUQYSA-N Ala-Val-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O OAIGZYFGCNNVIE-ZPFDUUQYSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- HJWQFFYRVFEWRM-SRVKXCTJSA-N Arg-Arg-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O HJWQFFYRVFEWRM-SRVKXCTJSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- WOPFJPHVBWKZJH-SRVKXCTJSA-N Arg-Arg-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O WOPFJPHVBWKZJH-SRVKXCTJSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- OKKMBOSPBDASEP-CYDGBPFRSA-N Arg-Ile-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O OKKMBOSPBDASEP-CYDGBPFRSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- CGXQUULXFWRJOI-SRVKXCTJSA-N Arg-Val-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O CGXQUULXFWRJOI-SRVKXCTJSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- ALKWEXBKAHPJAQ-NAKRPEOUSA-N Asn-Leu-Asp-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ALKWEXBKAHPJAQ-NAKRPEOUSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- CNKAZIGBGQIHLL-GUBZILKMSA-N Asp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N CNKAZIGBGQIHLL-GUBZILKMSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XSXVLWBWIPKUSN-UHFFFAOYSA-N Asp-Leu-Glu-Asp Chemical compound OC(=O)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(O)=O)C(O)=O XSXVLWBWIPKUSN-UHFFFAOYSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- SDDAYZYYUJILPB-UHFFFAOYSA-N Asp-Leu-Val-Tyr Chemical compound OC(=O)CC(N)C(=O)NC(CC(C)C)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SDDAYZYYUJILPB-UHFFFAOYSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 1
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 1
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- FOXXZZGDIAQPQI-XKNYDFJKSA-N Asp-Pro-Ser-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FOXXZZGDIAQPQI-XKNYDFJKSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- GXIUDSXIUSTSLO-QXEWZRGKSA-N Asp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N GXIUDSXIUSTSLO-QXEWZRGKSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 206010008631 Cholera Diseases 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 241000186226 Corynebacterium glutamicum Species 0.000 description 1
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 1
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 1
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- UCMIKRLLIOVDRJ-XKBZYTNZSA-N Cys-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)O UCMIKRLLIOVDRJ-XKBZYTNZSA-N 0.000 description 1
- CFQVGYWKSLKWFX-KBIXCLLPSA-N Cys-Glu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CFQVGYWKSLKWFX-KBIXCLLPSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- VCIIDXDOPGHMDQ-WDSKDSINSA-N Cys-Gly-Gln Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VCIIDXDOPGHMDQ-WDSKDSINSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- KPENUVBHAKRDQR-GUBZILKMSA-N Cys-His-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPENUVBHAKRDQR-GUBZILKMSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
- ZLFRUAFDAIFNHN-LKXGYXEUSA-N Cys-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O ZLFRUAFDAIFNHN-LKXGYXEUSA-N 0.000 description 1
- MJOYUXLETJMQGG-IHRRRGAJSA-N Cys-Tyr-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJOYUXLETJMQGG-IHRRRGAJSA-N 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 241000194033 Enterococcus Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- PZVJDMJHKUWSIV-AVGNSLFASA-N Gln-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)O PZVJDMJHKUWSIV-AVGNSLFASA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 1
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- ZXGLLNZQSBLQLT-SRVKXCTJSA-N Gln-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZXGLLNZQSBLQLT-SRVKXCTJSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- XFHMVFKCQSHLKW-HJGDQZAQSA-N Gln-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XFHMVFKCQSHLKW-HJGDQZAQSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 1
- CMBXOSFZCFGDLE-IHRRRGAJSA-N Gln-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O CMBXOSFZCFGDLE-IHRRRGAJSA-N 0.000 description 1
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- ZCFNZTVIDMLUQC-SXNHZJKMSA-N Glu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZCFNZTVIDMLUQC-SXNHZJKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- ZNOHKCPYDAYYDA-BPUTZDHNSA-N Glu-Trp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNOHKCPYDAYYDA-BPUTZDHNSA-N 0.000 description 1
- ZSIDREAPEPAPKL-XIRDDKMYSA-N Glu-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N ZSIDREAPEPAPKL-XIRDDKMYSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- YTKOTXRIWQHSAZ-GUBZILKMSA-N His-Glu-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N YTKOTXRIWQHSAZ-GUBZILKMSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- YBDOQKVAGTWZMI-XIRDDKMYSA-N His-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YBDOQKVAGTWZMI-XIRDDKMYSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- WKXVAXOSIPTXEC-HAFWLYHUSA-N Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O WKXVAXOSIPTXEC-HAFWLYHUSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- XLDYDEDTGMHUCZ-GHCJXIJMSA-N Ile-Asp-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N XLDYDEDTGMHUCZ-GHCJXIJMSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- CZWANIQKACCEKW-CYDGBPFRSA-N Ile-Pro-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N CZWANIQKACCEKW-CYDGBPFRSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- ZFWISYLMLXFBSX-KKPKCPPISA-N Ile-Trp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N ZFWISYLMLXFBSX-KKPKCPPISA-N 0.000 description 1
- YBHKCXNNNVDYEB-SPOWBLRKSA-N Ile-Trp-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N YBHKCXNNNVDYEB-SPOWBLRKSA-N 0.000 description 1
- OAQJOXZPGHTJNA-NGTWOADLSA-N Ile-Trp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N OAQJOXZPGHTJNA-NGTWOADLSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- HODVZHLJUUWPKY-STECZYCISA-N Ile-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=C(O)C=C1 HODVZHLJUUWPKY-STECZYCISA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- DXYBNWJZJVSZAE-GUBZILKMSA-N Leu-Gln-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DXYBNWJZJVSZAE-GUBZILKMSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 1
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- XBYKTPZCWQQSGB-IHRRRGAJSA-N Met-Cys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBYKTPZCWQQSGB-IHRRRGAJSA-N 0.000 description 1
- GXYYFDKJHLRNSI-SRVKXCTJSA-N Met-Gln-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GXYYFDKJHLRNSI-SRVKXCTJSA-N 0.000 description 1
- KLFPZIUIXZNEKY-DCAQKATOSA-N Met-Gln-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O KLFPZIUIXZNEKY-DCAQKATOSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- FTQOFRPGLYXRFM-CYDGBPFRSA-N Met-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCSC)N FTQOFRPGLYXRFM-CYDGBPFRSA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 1
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- LYCOGHUNJCETDK-JYJNAYRXSA-N Phe-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N LYCOGHUNJCETDK-JYJNAYRXSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 241001138501 Salmonella enterica Species 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 1
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- KZUJCMPVNXOBAF-LKXGYXEUSA-N Thr-Cys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KZUJCMPVNXOBAF-LKXGYXEUSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 1
- KZTLJLFVOIMRAQ-IHPCNDPISA-N Trp-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZTLJLFVOIMRAQ-IHPCNDPISA-N 0.000 description 1
- DVAAUUVLDFKTAQ-VHWLVUOQSA-N Trp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DVAAUUVLDFKTAQ-VHWLVUOQSA-N 0.000 description 1
- XEHGAHOCTDKOKP-XIRDDKMYSA-N Trp-Cys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XEHGAHOCTDKOKP-XIRDDKMYSA-N 0.000 description 1
- LGEPIBQBGZTBHL-SXNHZJKMSA-N Trp-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LGEPIBQBGZTBHL-SXNHZJKMSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 1
- ABEVJDLMFPTGPS-SZMVWBNQSA-N Trp-Met-Met Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ABEVJDLMFPTGPS-SZMVWBNQSA-N 0.000 description 1
- KXFYAQUYJKOQMI-QEJZJMRPSA-N Trp-Ser-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 KXFYAQUYJKOQMI-QEJZJMRPSA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 1
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 1
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- WDIJBEWLXLQQKD-ULQDDVLXSA-N Tyr-Arg-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O WDIJBEWLXLQQKD-ULQDDVLXSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- XOVDRAVPGHTYLP-JYJNAYRXSA-N Tyr-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O XOVDRAVPGHTYLP-JYJNAYRXSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- BUPRFDPUIJNOLS-UFYCRDLUSA-N Tyr-Tyr-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O BUPRFDPUIJNOLS-UFYCRDLUSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 1
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 1
- WGHVMKFREWGCGR-SRVKXCTJSA-N Val-Arg-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WGHVMKFREWGCGR-SRVKXCTJSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010047506 alanyl-glutaminyl-glycyl-valine Proteins 0.000 description 1
- 108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 239000012978 lignocellulosic material Substances 0.000 description 1
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010065320 prolyl-lysyl-glutamyl-lysine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 239000012089 stop solution Substances 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 201000008827 tuberculosis Diseases 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1247—DNA-directed RNA polymerase (2.7.7.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/04—Polysaccharides, i.e. compounds containing more than five saccharide radicals attached to each other by glycosidic bonds
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/34—Polynucleotides, e.g. nucleic acids, oligoribonucleotides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/07—Nucleotidyltransferases (2.7.7)
- C12Y207/07006—DNA-directed RNA polymerase (2.7.7.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
변이형 rpoC 암호화 서열을 포함하는 핵산 분자를 개시한다. 변이형 rpoC 암호화 서열은 플라스미드의 카피수를 조절하는 변이형 RpoC를 암호화한다. 또한, 핵산 분자를 포함하는 재조합 미생물, 재조합 미생물에서 대상 벡터의 카피수를 조절하는 방법, 및 재조합 미생물의 사용에 의한 표적 생성물의 제조 방법을 개시한다.
Description
본 발명은 변이형 rpoC RNA-폴리머라제 β' 서브유닛 단백질 암호화 서열(또한 "변이형 rpoC 암호화 서열"이라 지칭된다)을 포함하는 핵산 분자에 관한 것으로, 여기에서 변이형 rpoC 암호화 서열은 변이형 RpoC RNA-폴리머라제 β' 서브유닛 단백질(또한 "변이형 RopC"라 지칭된다)을 암호화하고, 변이형 RpoC는 플라스미드의 카피수를 조절한다.
플라스미드는 생물공학에서 중요한 역할을 하며, 미생물로부터 표적 유전자를 도입, 변형 및 제거하고, 표적 유전자에 의해 암호화된 상응하는 단백질을 생성시키기 위한 수단을 제공한다. 플라스미드는 그것이 발생하는 미생물의 염색체로부터 물리적으로 분리되며 염색체와 독립적으로 복제하는, 박테리아(Bacteria), 알카에아(Archaea) 및 유카리오타(Eukaryota) 도메인의 다양한 범위의 미생물에서 자연적으로 발생하는 핵산 분자이다. 플라스미드는 전형적으로 이중 가닥 원형 DNA 분자이지만, 선형 DNA 분자 및/또는 RNA 분자일 수도 있다. 플라스미드는 약 1 kb 내지 2 Mb 이상의 크기 범위로 발생한다. 예를 들어, 최근의 리뷰 논문[Shintani et al., Frontiers in Microbiology 6:242 (2015)]에 따르면, GenBank 데이터베이스에서 발견되는 4602개 플라스미드 중에서 광범위한 크기 변화가 관찰되었으며, 이때 플라스미드의 크기 범위는 744 bp 내지 2.58 Mb이고 평균 크기가 80 kb이다. 또한,플라스미드는 세포당 카피의 범위로 발생한다. 예를 들어, 플라스미드는 일반적으로 낮은 카피, 예를 들어 세포당 1-20 카피, 중간 카피, 예를 들어 세포당 20-100 카피, 또는 높은 카피, 예를 들어 세포당 500-700 또는 그 이상의 카피를 특징으로 한다. 플라스미드는 표적 유전자를 포함하도록 변형될 수 있다.
생명공학에서 플라스미드를 사용하는 것과 관련된 도전은 생물공학적 응용이 일반적으로 미생물내 표적 유전자의 안정적인 혼입을 필요로 하고, 배양 동안 표적 유전자 및 이의 상응하는 단백질 생성물의 수율의 신중한 조절을 필요로 하지만, 이를 달성하기 위한 노력이 다른 것을 달성하는데 불리하게 작용할 수 있다는 것이다. 표적 유전자를 갖는 플라스미드를 포함하는 미생물을 배양하는 동안, 미생물의 세포가 성장하고 분열함에 따라 플라스미드가 안정하게 분리되도록 하는 것이 일반적으로 유리하며, 따라서 높은 비율의 미생물 세포가 배양 전체에 걸쳐 표적 유전자를 포함하게 된다. 또한, 표적 유전자를 구조적으로 안정하게 유지시키고, 일정한 뉴클레오티드 서열을 유지하여, 의도된 생성물만을 제조하는 것이 일반적으로 유리하다. 또한, 목적하는 결과를 달성하기에 충분히 높은 수준으로, 예를 들어 충분한 양 및 활성 형태로 상응하는 단백질 생성물의 생성을 달성하기에 충분히 높은 수준으로 표적 유전자를 발현시키는 것이 일반적으로 유리하다. 불행하게도, 플라스미드를 복제하고 플라스미드로부터 표적 유전자를 발현시키는 기술은, 특히 높은 수준에서, 세포에 대사 부담을 가한다. 이는 플라스미드가 세포로부터 상실되게 하고/하거나 돌연변이가 표적 유전자의 발현 수준 또는 일치성을 변화되게 할 수 있다. 이는 또한 상응하는 단백질 생성물의 응집 및 비활성을 유도할 수 있다. 따라서, 생물학적 기술분야에서 플라스미드를 사용하는 동안 안정한 혼입과 수율 조절의 균형을 맞추는 것은 일반적으로 시행착오를 포함하는 경험적 과정이다.
플라스미드 카피수는 안정한 혼입과 수율의 조절 모두에 관하여 중요한 고려사항이다. 플라스미드의 카피수는 일반적으로 3개의 인자, 즉 플라스미드의 복제 기원, 포함된 표적 유전자를 포함한 플라스미드의 크기, 및 배양 조건에 의해 결정된다. 복제 기원에 관하여, 플라스미드는 복제, 특히 복제 기원의 특징에 기초하여 불화합성(incompatibility) 그룹으로 분류될 수 있다. 특히, 플라스미드는 일반적으로 단일 복제 기원으로부터 복제하는 플라스미드의 영역에 상응하는 레플리콘(replicon)을 포함한다. 플라스미드는 또한 일반적으로 플라스미드의 복제 기원을 인식하고 복제를 개시시키는 단백질을 암호화하는 유전자를 포함하였다. 플라스미드의 단백질과 복제 기원간의 상호작용은 복제의 특이성, 및 레플리콘, 그에 따른 플라스미드의 카피수를 결정한다. 동일한 복제 기원을 갖는 플라스미드는, 플라스미드가 분리 안정성에 관하여 서로 불화합성임에 기초하여 동일한 불화합성 그룹내에서 분류된다. 상이한 복제 기원을 갖는 플라스미드는 플라스미드가 서로 화합성이라면 상이한 불화합성 그룹내에서 분류될 수 있다. 플라스미드의 크기와 관련하여, 크기를 증가시키는 것은 일반적으로 플라스미드의 복제 및 플라스미드로부터 표적 유전자의 발현과 관련된 대사 부담을 증가시키고, 따라서 플라스미드의 카피수를 감소시킨다. 배양 조건에 관하여, 상기는 또한 대사 부담에 영향을 미치며, 특정 조건에 따라 카피수의 증가 또는 감소를 초래할 수 있다.
3개의 인자 중에서, 복제 기원은 카피수에 대한 기준선을 수립하기 때문에,복제 기원은 일반적으로 특정 용도를 위해 플라스미드를 선택하는데 있어서 주된 고려 사항이다. 다른 2개 인자, 즉 플라스미드의 크기 및 배양 조건의 변화는 항상 선택사항인 것은 아니다. 플라스미드의 크기는 표적 유전자의 크기에 의해 결정되고/되거나 제한될 수 있다. 배양 조건도 또한 상응하는 생성물을 충분한 양 및 충분한 활성으로 수득하기 위한 요건에 의해 결정되고/되거나 제한될 수 있다. 이 또한 경험적 과정일 것이다.
돌연변이 RNA 폴리머라제의 사용은 플라스미드의 카피수를 변경하기 위한 잠재적인 접근법이다. RNA 폴리머라제는 전사에 하나의 역할을 한다. RNA 폴리머라제는 또한 염색체 및 플라스미드의 복제에도 하나의 역할을 한다. RNA 폴리머라제 서열은 다수의 세균에서 측정되었으며, 이는 RNA 폴리머라제 내의 보존된 영역을 식별하기 위한 기초를 제공한다. 예를 들어, 문헌[Lee et al., Antimicrobial Agents and Chemotherapy 57:56-65 (2013)]은 21개 균주로부터의 RNA 폴리머라제 β' 서브유닛의 C-말단 도메인의 정렬을 제공한다. 세균 RNA 폴리머라제의 구조가 또한 측정되었다. 예를 들어, 문헌[Mukhopadhyay et al., Cell 135:295-307 (2008)]은 RNA 폴리머라제가 ~ 150 옹스트롬 x ~ 100 옹스트롬 x ~ 100 옹스트롬의 치수 및 집게발을 연상시키는 모양을 갖는 구조를 드러냄을 보고한다. RNA 폴리머라제 β' 서브유닛은 "클램프(clamp)"라 칭하는 집게발, 및 활성 중심 틈새 부분을 구성한다.
RNA 폴리머라제 β' 서브유닛을 암호화하는 에스케리키아 콜라이(Escherichia coli)의 rpoC 유전자의 2 개의 돌연변이는 ColE1-유형 플라스미드의 카피수의 감소를 야기하는 것으로 보고되었다. 구체적으로, 문헌[Ederth et al, Molecular Genetics and Genomics 267:587-592(2002)]은 단일 아미노산 치환(G1161R) 및 41-아미노산 결실을 확인하였다. 둘 다 rpoC 유전자의 3'-말단 영역 부근에 위치한다. 2개의 돌연변이는 각각 ColE1 플라스미드의 카피수의 10배 및 20배 감소를 유발한다(추정상 각각 90% 및 95% 이상의 감소에 상응한다). Ederth 등은 DNA 폴리머라제 I에 대한 프리프라이머(preprimer) 및 상기 프리프라이머의 안티센스 억제제를 암호화하는 RNA II 및 RNA I에 대한 프로모터로부터의 변경된 발현이 감소를 야기할 수 있음을 제안한다.
rpoC 에서의 돌연변이는 또한 플라스미드 pBR322의 카피수의 증가를 야기하는 것으로 보고되었다. 구체적으로, 문헌[Petersen et al., Journal of Bacteriology 173:5200-5206(1991)]은 rpoC 유전자의 3'-말단 영역 부근에 또한 위치하는 단일 아미노산 치환(G1033D)을 확인하였으며, 이는 39℃의 반-허용 생육 온도에서 pBR322의 카피수를 증가시킨다. Petersen은 돌연변이가 또한 염색체 카피수의 증가를 유발한다는 것을 주목한다.
불행하게도, 플라스미드의 카피수를 변화시키는 추가의 돌연변이체를 얻기 위해 RNA 폴리머라제 β' 서브유닛을 예측가능하게 변형시키는 일반적인 접근법이 존재하지 않는다. 특정 돌연변이가 플라스미드의 카피수를 변경시킬 수 있는 지의 여부 및 변경 정도를 결정하는 것은 또한 경험적 과정일 것이다. 또한, 염색체 카피수의 상응하는 변화를 일으키지 않는 상기와 같은 돌연변이체를 획득하기 위해 RNA 폴리머라제 β' 서브유닛을 예측가능하게 변형시키는 어떠한 일반적인 접근법도 존재하지 않는다.
따라서, 이상적으로는 염색체 카피수의 상응하는 변화를 유발하지 않으면서 플라스미드의 카피수를 변화시키도록 변형된 RNA 폴리머라제 β' 서브유닛의 돌연변이체가 필요하다.
본원의 하나의 양태에 따라, 변이형 rpoC RNA-폴리머라제 β' 서브유닛 단백질 암호화 서열(또한 "변이형 rpoC 암호화 서열"이라 칭한다)을 포함하는 핵산 분자를 개시한다. 변이형 rpoC 암호화 서열은 변이형 RpoC RNA-폴리머라제 β' 서브유닛 단백질(또한 "변이형 RpoC"라 칭한다)을 암호화한다. 변이형 RpoC는 R47C 치환을 포함하며, 이때 R47C 치환의 넘버링은 에스케리키아 콜라이의 야생형 RpoC RNA-폴리머라제 β' 서브유닛 단백질(또한 "야생형 RpoC"라 칭한다)에 기초하여 정의된다.
일부 예에서, 변이형 RpoC의 발현은 서열번호 26을 포함하는 야생형 RpoC의 발현에 비해 플라스미드의 카피수를 감소시킨다.
또한 일부 예에서, 변이형 RpoC는 (1) 서열번호 28을 포함하는 N-말단 도메인, (2) 서열번호 29를 포함하는 센트럴 도메인, 및 (3) 서열번호 30을 포함하는 C-말단 도메인을 포함한다. R47C 치환은 N-말단 도메인 내에 존재한다.
또한 일부 예에서, 변이형 RpoC는 (1) 서열번호 31을 포함하는 N-말단 도메인, (2) 서열번호 32를 포함하는 센트럴 도메인, 및 (3) 서열번호 33을 포함하는 C-말단 도메인을 포함한다. R47C 치환은 N-말단 도메인 내에 존재한다.
또한 일부 예에서, 변이형 RpoC는 서열번호 27에 적어도 90% 일치하는 아미노산 서열을 포함한다.
또한 일부 예에서, 변이형 RpoC는 서열번호 27을 포함한다.
본원의 또 다른 양태에 따라, 핵산 분자를 포함하는 벡터를 개시한다.
본원의 또 다른 양태에 따라, 핵산 분자를 포함하는 재조합 미생물을 개시한다.
본원의 또 다른 양태에 따라, 재조합 미생물에서 대상 벡터의 카피수를 조절하는 방법을 개시한다. 방법은 재조합 미생물을 대상 벡터의 복제에 충분한 조건하에 배양 배지에서 배양하는 단계를 포함하여, 대상 벡터의 카피수를 조절함을 포함한다.
본원의 또 다른 양태에 따라, 재조합 미생물의 사용에 의한 표적 생성물의 제조 방법을 개시한다. 재조합 미생물은 표적 유전자 벡터를 포함한다. 표적 유전자 벡터는 표적 생성물을 제조하기 위한 표적 유전자를 포함한다. 방법은 (1) 재조합 미생물을, 재조합 미생물이 표적 유전자를 발현하는 조건하에 배양 배지에서 배양하여, 표적 생성물을 생성시키는 단계, 및 (2) 재조합 미생물 및/또는 배양 배지로부터 표적 생성물을 회수하는 단계를 포함한다.
본원의 또 다른 양태에 따라, 변이형 rpoC 암호화 서열을 포함하는 유전자 교체 벡터 및 유전자 교체 서열을 개시한다. 변이형 rpoC 암호화 서열은 서열번호 28을 포함하는 변이형 RpoC N-말단 도메인을 암호화한다. 유전자 교체 서열은 미생물의 염색체 중의 내인성 rpoC 암호화 서열을 변이형 rpoC 암호화 서열로 교체하기 위한 단백질을 암호화한다.
본원에 개시된 변이형 rpoC 암호화 서열을 포함하는 핵산 분자는 벡터, 예를 들어 플라스미드의 카피수를 조절하는데 유용하며, 따라서 벡터의 사용에 의해 표적 생성물의 상업적인 생산을 개선시키기에 유용하다.
도 1은 써무스 써모필루스(Thermus thermophilus) RpoC(Q8RQE8)(서열번호 34), 아세토박터 파스퇴리아누스(Acetobacter pasteurianus) RpoC(BAH99075.1)(서열번호 35), 네이세리아 고노로에아에(Neisseria gonorrhoeae) RpoC(Q5F5R6)(서열번호 36), 레지오넬라 뉴모필라(Legionella pneumophila) RpoC(Q5X865)(서열번호 37), 슈도모나스 아에루기노사(Pseudomonas aeruginosa) RpoC(Q9HWC9)(서열번호 38), 비브리오 콜레라에(Vibrio cholerae) RpoC(Q9KV29)(서열번호 39), 에스케리키아 콜라이(P0A8T7)(서열번호 26), 살모넬라 엔테리카(Salmonella enterica) 혈청형 티피뮤리움(Typhimurium) RpoC(P0A2R4)(서열번호 40), 액티노마이세스 오돈토리티쿠스(Actinomyces odontolyticus) RpoC(EDN79927.1)(서열번호 41), 스트렙토마이세스 코엘리콜라(Streptomyces coelicolor) RpoC(Q8CJT1)(서열번호 42), 코리네박테리움 디프테리아에(Corynebacterium diphtheriae) RpoC(Q6NJF6)(서열번호 43), 마이코박테리움 튜베르큘로시스(Mycobacterium tuberculosis) RpoC(A5U053)(서열번호 44), 로도코커스 에퀴(Rhodococcus equi) RpoC(CBH49656.1)(서열번호 45), 클라미디아 트라코마티스(Chlamydia trachomatis) RpoC(O84316)(서열번호 46), 클로스트리디움 보툴리늄(Clostridium botulinum) RpoC(A7FZ76)(서열번호 47), 바실러스 서브틸리스(Bacillus subtilis) RpoC(P37871)(서열번호 48), 스트렙토코커스 뉴모니아에(Streptococcus pneumoniae) RpoC(Q97NQ8)(서열번호 49), 엔테로코커스 파에칼리스(Enterococcus faecalis) RpoC(Q82Z41)(서열번호 50), 및 락토바실러스 브레비스(Lactobacillus brevis) RpoC(Q03PV0)(서열번호 51)의 N-말단 도메인의, CLUSTAL O(1.2.4)에 의한 다중 서열 정렬을 도시한다.
도 2는 도 1에 도시된 바와 같은 RpoC 단백질(각각 서열번호 34-39, 26, 및 40-51)의 센트럴 도메인의, CLUSTAL O(1.2.4)에 의한 다중 서열 정렬을 도시한다.
도 3A-B는 도 1에 도시된 바와 같은 RpoC 단백질(각각 서열번호 34-39, 26, 및 40-51)의 C-말단 도메인의, CLUSTAL O(1.2.4)에 의한 다중 서열 정렬을 도시한다.
도 4는 E.coli의 RpoC 단백질(서열번호 26) 및 변이형 RpoC(서열번호 27)의 N-말단 도메인의 서열 정렬을 도시한다.
도 5는 E.coli의 RpoC 단백질(서열번호 26) 및 변이형 RpoC(서열번호 27)의 정렬된 N-말단 도메인의 차이를 도시하며, 이때 전체가 제공된 E.coli의 RpoC의 N-말단 도메인의 서열 및 제공된 변이형 RpoC의 서열이 차이를 보인다.
도 6A-B는 염색체상의 rpoC 서열을 대체하기 위한, pJSL47이라 칭하는 재조합 플라스미드의 구성 과정을 예시한다.
도 7A-B는 야생형 RepFIC 레플리콘을 포함하는, pJSL48이라 칭하는 재조합 플라스미드의 구성 과정을 예시한다.
도 8A-B는 변형된 RepFIC 레플리콘을 포함하는, pJSL49라 칭하는 재조합 플라스미드의 구성 과정을 예시한다.
도 2는 도 1에 도시된 바와 같은 RpoC 단백질(각각 서열번호 34-39, 26, 및 40-51)의 센트럴 도메인의, CLUSTAL O(1.2.4)에 의한 다중 서열 정렬을 도시한다.
도 3A-B는 도 1에 도시된 바와 같은 RpoC 단백질(각각 서열번호 34-39, 26, 및 40-51)의 C-말단 도메인의, CLUSTAL O(1.2.4)에 의한 다중 서열 정렬을 도시한다.
도 4는 E.coli의 RpoC 단백질(서열번호 26) 및 변이형 RpoC(서열번호 27)의 N-말단 도메인의 서열 정렬을 도시한다.
도 5는 E.coli의 RpoC 단백질(서열번호 26) 및 변이형 RpoC(서열번호 27)의 정렬된 N-말단 도메인의 차이를 도시하며, 이때 전체가 제공된 E.coli의 RpoC의 N-말단 도메인의 서열 및 제공된 변이형 RpoC의 서열이 차이를 보인다.
도 6A-B는 염색체상의 rpoC 서열을 대체하기 위한, pJSL47이라 칭하는 재조합 플라스미드의 구성 과정을 예시한다.
도 7A-B는 야생형 RepFIC 레플리콘을 포함하는, pJSL48이라 칭하는 재조합 플라스미드의 구성 과정을 예시한다.
도 8A-B는 변형된 RepFIC 레플리콘을 포함하는, pJSL49라 칭하는 재조합 플라스미드의 구성 과정을 예시한다.
변이형 rpoC 암호화 서열을 포함하는 핵산 분자를 개시한다. 변이형 rpoC 암호화 서열은 변이형 RpoC를 암호화한다. 변이형 RpoC는 R47C 치환을 포함하며, 이때 R47C 치환의 넘버링은 에스케리키아 콜라이의 야생형 RpoC RNA-폴리머라제 β' 서브유닛 단백질(또한 "야생형 RpoC"라 칭한다)에 기초하여 정의된다.
놀랍게도, 변이형 rpoC 암호화 서열을 포함하는 핵산 분자(여기에서 변이형 rpoC 암호화 서열은 변이형 RpoC를 암호화하고, 변이형 RpoC는 R47C 치환을 포함한다)를 사용하여 염색체 카피수의 상응하는 감소를 야기하지 않으면서 뉴클레오티드 서열을 포함하는 재조합 미생물에서 플라스미드의 카피수를 상당히 감소시킬 수 있음이 밝혀졌다. 이는 다른 이유들 가운데에서도, 본원에 개시된 바와 같은 R47C 치환이 RpoC의 N-말단 도메인 서열에서 발생하는 반면, Ederth 등 및 Petersen 등에 의해 기재된 바와 같은 단일 치환을 포함하는 E.coli RpoC의 돌연변이체는 오직 rpoC 유전자의 3'-말단 영역 부근, 및 따라서 RpoC의 C-말단 도메인 서열에서만 돌연변이를 포함하였기 때문에 놀라운 것이다. 이는 또한 본원에 개시된 바와 같은 R47C 치환이 9 아미노산 잔기 N-말단 도메인 서열내에서 발생하는 반면(그 외에는 다양한 세균의 RpoC들 간에 엄격하게 보존된다), Ederth 등 및 Petersen 등에 의해 기재된 바와 같은 E.coli RpoC의 돌연변이체에서의 단일 치환은, 보존된 잔기에 의해 둘러싸이지 않고 따라서 보존된 서열내에 존재하지 않는 RpoC 서열내의 위치에서 발생하기 때문에 놀라운 것이다.
이론에 얽매이고하 하는 것은 아니지만, 다양한 세균으로부터의 야생형 RpoC는, 야생형 RpoC들간에 엄격하게 보존되는 서열번호 26의 잔기 40-48에 상응하는 N-말단 도메인 서열을 포함하는 것으로 여겨진다. 도 1에 도시된 바와 같이, 상기 서열은 다양한 계통발생, 대사 및 환경을 나타내는 19개의 다양한 세균, 즉 써무스 써모필루스, 아세토박터 파스퇴리아누스, 네이세리아 고노로에아에, 레지오넬라 뉴모필라, 슈도모나스 아에루기노사, 비브리오 콜레라에, 에스케리키아 콜라이, 살모넬라 엔테리카 혈청형 티피뮤리움, 액티노마이세스 오돈토리티쿠스, 스트렙토마이세스 코엘리콜라, 코리네박테리움 디프테리아에, 마이코박테리움 튜베르큘로시스, 로도코커스 에퀴, 클라미디아 트라코마티스, 클로스트리디움 보툴리늄, 바실러스 서브틸리스, 스트렙토코커스 뉴모니아에, 엔테로코커스 파에칼리스, 및 락토바실러스 브레비스들간에 엄격하게 보존된다. 참조를 위해, 이들 서열은 Lee 등에 의해 제공된 바와 같은 21개 균주의 RpoC의 C-말단 도메인의 정렬로부터 입수할 수 있는 전장 RpoC 서열에 상응한다. 또한, 다양한 범위의 세균으로부터의 야생형 RpoC가 또한 서열번호 29에 상응하는 센트럴 도메인 서열(이는 또한 야생형 RpoC들간에 엄격하게 보존된다)을 포함하는 것으로 여겨진다. 도 2에 도시된 바와 같이, 상기 서열은 또한 19개의 다양한 세균들간에 엄격하게 보존된다. 또한, 다양한 범위의 세균으로부터의 야생형 RpoC가 또한 서열번호 30에 상응하는 C-말단 도메인 서열(이는 또한 야생형 RpoC들간에 엄격하게 보존된다)을 포함하는 것으로 여겨진다. 도 3A-B에 도시된 바와 같이, 상기 서열은 또한 19개의 다양한 세균들간에 엄격하게 보존된다.
추가로, 각각 엄격하게 보존된 N-말단, 중심 및 C-말단 도메인 서열을 포함하여, 서열번호 26의 잔기 33-57에 상응하는 보다 긴 N-말단 도메인 서열, 서열번호 32에 상응하는 보다 긴 센트럴 도메인 서열, 및 서열번호 33에 상응하는 보다 긴 C-말단 도메인 서열은 19개의 다양한 세균의 RpoC들간에 일반적으로 보존되는 다수의 잔기를 포함하는 것으로 여겨진다. 도 1, 도 2 및 도 3A-B에 도시된 바와 같이, E.coli RpoC는 이들 보다 긴 서열을 포함한다. 또한, 다른 세균으로부터의 RpoC는 이들 보다 긴 서열에 고도로 유사한 서열을 포함한다.
다양한 세균으로부터의 야생형 RpoC는 엄격하게 보존된 N-말단, 중심 및 C-말단 도메인 서열을 포함하기 때문에, 이들 서열 또한 다른 세균의 야생형 RpoC들간에 엄격하게 보존되는 것으로 여겨진다. 맥락상, 표 1에 나타낸 바와 같이, E.coli의 RpoC의 짝짓기식 서열 정렬에 대한 결과는 다른 18개의 다양한 세균의 RpoC에 비해, 심지어 E.coli와 계통발생적으로, 대사적으로, 및 환경적으로 동떨어진 극호열성 써무스 써모필루스와 같은 세균의 RpoC에 대해서조차 비교적 고도의 서열 일치성 및 유사성을 나타낸다. 이는 RpoC가 전사 및 복제에서 하는 근본적인 역할과 일치한다.
세균 | 등록번호 | 길이 | 동일성 | 유사성 | 갭 | 점수 | 서열 |
써무스 써모필루스 | sp|Q8RQE8.1|RPOC_THET8 | 1765 | 36.1% | 48.3% | 33.9% | 2851.0 | 서열번호 34 |
아세토박터 파스퇴리아누스 | BAH99075.1 | 1439 | 59.6% | 74.6% | 5.6% | 4334.5 | 서열번호 35 |
네이세리아 고노로에아에 | sp|Q5F5R6.1|RPOC_NEIG1 | 1412 | 66.1% | 80.4% | 1.8% | 4851.0 | 서열번호 36 |
레지오넬라 뉴모필라 | sp|Q5X865.1|RPOC_LEGPA | 1413 | 71.8% | 83.3% | 1.3% | 5223.0 | 서열번호 37 |
슈도모나스 아에루기노사 | sp|Q9HWC9.1|RPOC_PSEAE | 1408 | 75.4% | 85.6% | 0.7% | 5477.0 | 서열번호 38 |
비브리아 콜레라에 | sp|Q9KV29.1|RPOC_VIBCH | 1407 | 82.4% | 89.8% | 0.4% | 5941.0 | 서열번호 39 |
에스케리키아 콜라이 | sp|P0A8T7.1|RPOC_ECOLI | 1407 | 100.0% | 100.0% | 0.0% | 7139.0 | 서열번호 26 |
살모넬라 엔테리카 혈청형 티피뮤리움 | sp|P0A2R4.1|RPOC_SALTY | 1407 | 98.6% | 99.3% | 0.0% | 7057.0 | 서열번호 40 |
액티노마이세스 오돈토리티쿠스 | EDN79927.1 | 1529 | 40.9% | 54.7% | 23.0% | 2883.5 | 서열번호 41 |
스트렙토마이세스 코엘리콜라 | sp|Q8CJT1.1|RPOC_STRCO | 1539 | 42.6% | 55.7% | 24.2% | 3048.0 | 서열번호 42 |
코리네박테리움 디프테리아에 | sp|Q6NJF6.1|RPOC_CORDI | 1566 | 40.9% | 54.1% | 24.8% | 2933.0 | 서열번호 43 |
마이코박테리움 튜베르큘로시스 | sp|A5U053.1|RPOC_MYCTA | 1552 | 41.0% | 55.0% | 24.5% | 2944.0 | 서열번호 44 |
로도코커스 에퀴 | CBH49656.1 | 1542 | 41.6% | 55.6% | 23.2% | 2986.5 | 서열번호 45 |
클라미디아 트라코마티스 | sp|O84316.1|RPOC_CHLTR | 1468 | 47.8% | 65.6% | 9.1% | 3447.0 | 서열번호 46 |
클로스트리디움 보툴리늄 | sp|A7FZ76.1|RPOC_CLOB1 | 1420 | 47.5% | 62.7% | 18.0% | 3344.5 | 서열번호 47 |
바실러스 서브틸리스 | sp|P37871.4|RPOC_BACSU | 1451 | 45.7% | 59.4% | 20.4% | 3248.5 | 서열번호 48 |
스트렙토코커스 뉴모니아에 | sp|Q97NQ8.1|RPOC_STRPN | 1460 | 44.5% | 58.4% | 19.7% | 3111.0 | 서열번호 49 |
엔테로코커스 파에칼리스 | sp|Q82Z41.1|RPOC_ENTFA | 1454 | 44.8% | 59.8% | 19.5% | 3199.0 | 서열번호 50 |
락토바실러스 브레비스 | sp|Q03PV0.1|RPOC_LACBA | 1449 | 44.4% | 59.4% | 19.0% | 3174.5 | 서열번호 51 |
*짝짓기식 서열 정렬을 디폴트 설정(행렬: BLOSUM62; 끊김: 10; 끊김 확장: 0.5; 출력 포맷; 쌍; 말단 끊김 벌점: 거짓; 말단 끊김: 10; 말단 끊김 확장: 0.5)(웹사이트: ebi.ac.uk/Tools/psa/emboss_needle/)을 사용하여 EMBOSS 니들 짝짓기식 서열 정렬(PROTEIN) 도구로 수행하였다.
또한, 다양한 세균으로부터의 야생형 RpoC는 보다 긴 N-말단, 중심 및 C-말단 도메인 서열과 매우 유사한 서열을 포함하기 때문에, 다른 세균의 야생형 RpoC는 이들 서열과도 또한 매우 유사한 서열을 포함하는 것으로 여겨진다. 또한, 다양한 서열이 엄격하게 또는 일반적으로 보존되는 것에 기초하여, 상응하는 N-말단, 중심 및 C-말단 도메인 서열은 RpoC가 염색체 및 플라스미드의 전사 및 복제에서 하는 역할에 구조적으로 및/또는 기능적으로 중요한 기여를 하는 것으로 여겨진다.
또한 이론에 얽매이고자 하는 것은 아니지만, RpoC에서 특히 N-말단 도메인은 플라스미드의 카피수 측정에 중요한 역할을 하는 것으로 여겨진다. 도 4 및 도 5에 도시된 바와 같이, 변형 rpoC 암호화 서열은 서열번호 28을 포함하는 N-말단 도메인을 포함하는데, 이는 47번 위치 아미노산에서 단일 치환, 즉 R(즉 아르기닌)에서 C(즉 시스테인)로의 치환(또한 "R47C"라 칭한다)에 의해 야생형 N-말단 도메인의 엄격하게 보존된 서열과 상이하며, 이때 넘버링은 E.coli의 야생형 RpoC에 기초하여 정의된다. 상기 R47C 치환을 포함하고 그 외에는 E.coli의 야생형 RpoC와 동일한 변이형 RpoC는 플라스미드 카피수의 감소, 예를 들어 25% 내지 75%의 감소를 나타낸다. N-말단 도메인 중 R47C 치환이 변이형 RpoC와 E.coli의 야생형 RpoC간의 유일한 차이이고, R47C 치환은 19개의 다양한 세균들간에 일반적으로 보존되는 다양한 잔기를 포함하는 보다 긴 N-말단 도메인 서열내에 위치하며, 19개의 다양한 세균으로부터의 야생형 RpoC들간에 서열번호 26의 잔기 40-48에 상응하는 엄격하게 보존된 N-말단 도메인 서열내에 다른 치환은 존재하지 않기 때문에, N-말단 도메인이 플라스미드 카피수의 측정에 중요한 것으로 보인다.
본원에 사용되는 바와 같이, 용어 핵산 분자는 예를 들어 이중가닥 DNA 분자, 단일가닥 DNA 분자, 이중가닥 RNA 분자, 단일가닥 RNA 분자, 또는 DNA/RNA 하이브리드 분자를 포함하여 DNA 및/또는 RNA의 분자를 의미하며, 이때 핵산 분자의 구조는 핵산 분자가 DNA 서열, RNA 서열, 또는 이 둘 모두를 포함하는 지의 여부에 따라 변한다.
본원에 사용되는 바와 같이, 용어 RNA-폴리머라제 β' 서브유닛 단백질은 RNA 폴리머라제의 RNA 폴리머라제 β' 서브유닛을 의미한다. 상기에 논의된 바와 같이, RNA 폴리머라제는 전사에서 하나의 역할을 한다. RNA 폴리머라제는 또한 염색체 및 플라스미드의 복제에서 하나의 역할을 한다. RNA-폴리머라제 β' 서브유닛 단백질은 공지된 RNA-폴리머라제 β' 서브유닛 단백질과 구조적 및/또는 기능적 유사성을 근거로, 예를 들어 Lee 등에 의해 도시된 바와 같은 서열 정렬 및/또는 Mukhopadhyay 등에 의해 논의된 바와 같은 구조적 특징을 근거로 식별될 수 있다. RNA 폴리머라제 활성을 예를 들어 문헌[Chamberlin et al., The Journal of Biological Chemistry 254(20):10061-10069 (1979)]에 기재된 바와 같이 측정할 수 있다.
본원에 사용되는 바와 같이, 용어 rpoC RNA-폴리머라제 β' 서브유닛 단백질 암호화 서열은 RNA-폴리머라제 β' 서브유닛 단백질의 서열을 암호화하는 DNA 분자 가닥, 또는 DNA 분자 가닥의 부분을 의미한다.
본원에 사용되는 바와 같이, 용어 야생형 RNA-폴리머라제 β' 서브유닛 단백질은 자연 조건하에서 하나의 종에 개별적으로 존재하는 RNA-폴리머라제 β' 서브유닛 단백질을 의미한다.
본원에 사용되는 바와 같이, 용어 N-말단 도메인은 단백질의 N-말단에 또는 그 부근에, 예를 들어 단백질의 아미노산 서열의 처음 1/3내에 존재하는 단백질의 일부를 의미한다.
본원에 사용되는 바와 같이, 용어 센트럴 도메인은 단백질의 중심에 또는 그 부근에, 예를 들어 단백질의 아미노산 서열의 중간 1/3내에 존재하는 단백질의 일부를 의미한다.
본원에 사용되는 바와 같이, 용어 C-말단 도메인은 단백질의 C-말단에 또는 그 부근에, 예를 들어 단백질의 아미노산 서열의 마지막 1/3내에 존재하는 단백질의 일부를 의미한다.
본원에 사용되는 바와 같이, 용어 레플리콘은 단일의 복제 기원으로부터 복제하는 DNA 분자의 영역을 의미한다.
본원에 사용되는 바와 같이, 용어 벡터는 미생물에서, 자연적으로 또는 미생물내로의 도입에 의해 발생할 수 있는 핵산 분자, 예를 들어 플라스미드, 바이러스 벡터, 코스미드, 또는 인공 염색체를 의미한다.
본원에 사용되는 바와 같이, 용어 플라스미드는 미생물에서, 자연적으로 또는 미생물내로의 도입에 의해 발생할 수 있는, 즉 미생물의 염색체(들)와 물리적으로 분리되고 염색체(들)와 독립적으로 복제할 수 있는 핵산 분자를 의미한다. 상기에서 논의된 바와 같이, 플라스미드는 전형적으로 이중가닥 환상 DNA 분자이나, 또한 선형 DNA 분자 및/또는 RNA 분자일 수도 있다. 플라스미드는 약 1 kb 내지 2 Mb 이상의 크기 범위로 존재한다. 플라스미드는 또한 세포당 카피의 범위, 즉 낮은 카피수에서부터 높은 카피수까지 존재한다. 플라스미드를 표적 유전자를 포함하도록 변형시킬 수 있다.
본원에 사용되는 바와 같이, 용어 플라스미드 카피수는 미생물 세포 내 플라스미드의 카피수를 의미한다. 플라스미드 카피수를, 다른 접근법 중에서도, 예를 들어 실시간 PCR을 사용하여 미생물의 염색체상에서 단일 카피로 존재하는 유전자에 비해, 플라스미드상에 단일 카피로 존재하는 유전자의 카피수를 비교함으로써 미생물 중의 플라스미드에 대해 측정할 수 있다.
본원에 사용되는 바와 같이, 용어 플라스미드 카피수의 조절인자는 미생물에서 플라스미드의 카피수의 변화를 야기하는, 예를 들어 인자가 미생물 중에 존재하는 경우 대 인자가 미생물 중에 존재하지 않는 경우에 증가 또는 감소를 야기하는 RNA 분자 또는 단백질과 같은 인자를 의미한다. 플라스미드 카피수를 조절함으로써, 플라스미드를 안정하게 발현시키고, 이에 의해 상기 플라스미드를 포함하는 미생물의 안정한 생육을 가능하게 할 수 있다.
나타낸 바와 같이, 변이형 rpoC 암호화 서열을 포함하는 핵산 분자를 개시한다. 핵산 분자는 예를 들어 이중가닥 DNA 분자, 예를 들어, 변이형 rpoC 암호화 서열이 도입된 염색체 DNA, 또는 변이형 rpoC 암호화 서열이 클로닝된 플라스미드일 수 있다.
또한 나타낸 바와 같이, 변이형 rpoC 암호화 서열은 변이형 RpoC를 암호화한다. 변이형 RpoC는 R47C 치환을 포함함을 기본으로 하는 변이체이며, 이때 R47C 치환의 넘버링은 E.coli의 야생형 RpoC에 기초하여 정의된다. 변이형 RpoC는 RpoC이며, 따라서 염색체의 전사, 복제, 및 플라스미드의 복제에 역할을 한다. 19개의 다양한 세균의 야생형 RpoC가, 예를 들어 보존되지 않은 아미노산 위치에서 서로에 대해 변하는 것처럼, 변이형 RpoC도 또한 변이형 rpoC 암호화 서열의 소스에 따라 변할 수 있다. 따라서, 예를 들어 변이형 RpoC는 R47C 치환을 포함할 수 있으며 그 외에는 E.coli의 야생형 RpoC에 적어도 90% 일치할 수 있다. 또한 예를 들어, 변이형 RpoC는 R47C 치환을 포함할 수 있으며 그 외에는 다른 18개의 상이한 세균, 즉 써무스 써모필루스, 아세토박터 파스퇴리아누스, 네이세리아 고노로에아에, 레지오넬라 뉴모필라, 슈도모나스 아에루기노사, 비브리오 콜레라에, 살모넬라 엔테리카 혈청형 티피뮤리움, 액티노마이세스 오돈토리티쿠스, 스트렙토마이세스 코엘리콜라, 코리네박테리움 디프테리아에, 마이코박테리움 튜베르큘로시스, 로도코커스 에퀴, 클라미디아 트라코마티스, 클로스트리디움 보툴리늄, 바실러스 서브틸리스, 스트렙토코커스 뉴모니아에, 엔테로코커스 파에칼리스, 및 락토바실러스 브레비스 중 어느 하나의 야생형 RpoC에 적어도 90% 일치할 수 있다. 또한 예를 들어, 변이형 RpoC는 R47C 치환 및 E.coli 또는 다른 18개의 다양한 세균의 야생형 RpoC 중 하나 이상의 하나 이상의 부분을 포함할 수 있다.
일부 예에서, 변이형 RpoC는 (1) 서열번호 28을 포함하는 N-말단 도메인, (2) 서열번호 29를 포함하는 센트럴 도메인, 및 (3) 서열번호 30을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 이들 예에서, 변이형 RpoC는 R47C 치환을 포함하는, 서열번호 28에 상응하는 N-말단 도메인을 포함한다. 변이형 RpoC는 또한, RpoC가 염색체 및 플라스미드의 전사 및 복제에서 하는 역할과 일치하는, 서열번호 29에 상응하는 엄격하게 보존된 센트럴 도메인 서열 및 서열번호 30에 상응하는 엄격하게 보존된 C-말단 도메인 서열을 포함한다.
일부 예에서, 변이형 RpoC는 (1) 서열번호 31를 포함하는 N-말단 도메인, (2) 서열번호 32을 포함하는 센트럴 도메인, 및 (3) 서열번호 33을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 이들 예에서, 변이형 RpoC는 R47C 치환을 포함하는, 서열번호 31에 상응하는 N-말단 도메인을 포함한다. 변이형 RpoC는 또한, RpoC가 염색체 및 플라스미드의 전사 및 복제에서 하는 역할과 일치하는, 서열번호 32에 상응하는 엄격하게 보존된 보다 긴 센트럴 도메인 서열 및 서열번호 33에 상응하는 엄격하게 보존된 보다 긴 C-말단 도메인 서열을 포함한다.
일부 예에서, 변이형 RpoC는 서열번호 27에 적어도 90% 일치하는 아미노산 서열을 포함한다. 참조를 위해서, 서열번호 27은, R47C 치환을 포함하며 그 외에는 E.coli의 야생형 RpoC와 일치하는 변이형 RpoC에 상응한다. 또한 참조를 위해서, 변이형 RpoC의 아미노산 서열과 서열번호 27간의 서열 일치성 백분율을 짝짓기식 서열 정렬을 수행하여 측정할 수 있다. 이를 디폴트 설정(행렬: BLOSUM62; 끊김: 10; 끊김 확장: 0.5; 출력 포맷; 쌍; 말단 끊김 벌점: 거짓; 말단 끊김: 10; 말단 끊김 확장: 0.5)(웹사이트: ebi.ac.uk/Tools/psa/emboss_needle/)을 사용하여 EMBOSS 니들 짝짓기식 서열 정렬(PROTEIN) 도구로 수행할 수 있다. 이를 또한 유사한 다른 짝짓기식 서열 정렬 도구로 수행할 수 있다.
변이형 RpoC의 아미노산 서열은, 예를 들어 우세하게 또는 전적으로 E.coli의 야생형 RpoC와 다른 18개의 다양한 세균의 RpoC간에 보존되지 않은 아미노산 잔기의 치환을 근거로 서열번호 27과와 상이할 수 있다. 표 1을 참조하여, 다른 18개의 다양한 세균의 RpoC와 비교된 E.coli의 RpoC의 작짓기식 서열 정렬의 결과는 비교적 높은 정도의 서열 일치성 및 유사성을 가리키지만, 상기 결과는 또한 다양한 세균 중 17개의 RpoC가 E.coli의 야생형 RpoC와 비교하여 36.1% 내지 82.4% 범위, 및 따라서 충분히 90% 아래인 서열 일치성을 가짐을 가리킨다. 유사한 단백질들간에 보존되지 않는 아미노산 잔기의 치환은 일반적으로 더 잘 허용되는 듯하다, 예를 들어 보존되는 아미노산 잔기의 치환과 비교하여 구조 및/또는 기능을 붕괴시키지 않는 듯하다. 표 1의 결과는 RpoC가 보존되지 않는 다수의 아미노산 잔기를 포함하고 따라서 치환을 잘 받아들일 수 있음을 가리킨다.
변이형 RpoC의 아미노산 서열은 또한, 예를 들어 일부 또는 다수의 보존적 치환을 포함함을 근거로 서열번호 27과 상이할 수 있는데, 이는 서열번호 27에 비해, 또 다른 구조적으로 유사한 아미노산 잔기에 의한 아미노산 잔기의 교체를 의미한다. 보존적 치환은 전형적으로 하기 그룹내의 치환을 포함한다: (1) 글리신 및 알라닌, (2) 발린, 이소류신, 및 류신, (3) 아스파트산 및 글루탐산, (4) 아스파라진 및 글루타민, (5) 세린 및 쓰레오닌, (6) 리신 및 아르기닌, 및 (7) 페닐알라닌 및 티로신. 보존적 치환은 일반적으로 보존적이지 않은 치환과 비교하여 더 잘 허용되는 듯 하다.
따라서, 이들 예에서 변이형 RpoC는 R47C 치환을 포함한다. 변이형 RpoC는 또한 서열번호 27에 적어도 90% 일치하는 아미노산 서열을 포함한다. 예를 들어 변이형 RpoC는 서열번호 27에 적어도 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 또는 100% 일치하는 아미노산 서열을 포함할 수 있다.
일부 예에서, 변이형 RpoC는 서열번호 27을 포함한다. 일부 예에서, 변이형 RpoC는 서열번호 27로 이루어진다.
일부 예에서, 변이형 RpoC의 발현은 서열번호 26을 포함하는 야생형 RpoC의 발현에 비해 플라스미드의 카피수를 감소시킨다. 참조를 위해, 서열번호 26은 E.coli의 야생형 RpoC에 상응한다. 상기에 나타낸 바와 같이, 변이형 RpoC는 RpoC이며, 따라서 염색체의 전사, 복제, 및 플라스미드의 복제에 역할을 한다. 또한 나타낸 바와 같이, R47C 치환을 포함하고 그 외에는 E.coli의 야생형 RpoC와 일치하는 변이형 RpoC는 예를 들어 약 25% 내지 75%의 플라스미드 카피수의 감소를 나타낸다. 일부 예에서, 변이형 RpoC의 발현은 플라스미드의 카피수를 서열번호 26을 포함하는 야생형 RpoC의 발현에 비해 10% 내지 80%까지, 예를 들어 25% 내지 75%, 30% 내지 70%, 35% 내지 65%, 40% 내지 60%, 45% 내지 55%, 10% 내지 40%, 20% 내지 50%, 30% 내지 60%, 40% 내지 70%, 또는 50% 내지 80%까지 감소시킨다.
핵산 분자를 포함하는 벡터를 또한 개시한다. 핵산 분자는 상술한 바와 같을 수 있다. 일부 예에서, 벡터는 플라스미드, 바이러스 벡터, 코스미드, 또는 인공 염색체 중 하나 이상에 상응할 수 있다.
핵산 분자를 포함하는 재조합 미생물을 또한 개시한다. 핵산 분자는 상술한 바와 같을 수 있다. 따라서, 일부 예에서, 변이형 RpoC는 상기에 논의된 바와 같이, (1) 서열번호 28을 포함하는 N-말단 도메인, (2) 서열번호 29를 포함하는 센트럴 도메인, 및 (3) 서열번호 30을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 일부 예에서, 변이형 RpoC는 상기에 논의된 바와 같이, (1) 서열번호 31을 포함하는 N-말단 도메인, (2) 서열번호 32를 포함하는 센트럴 도메인, 및 (3) 서열번호 33을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 일부 예에서 변이형 RpoC는 상기에 논의된 바와 같이, 서열번호 27에 적어도 90% 일치하는 아미노산 서열을 포함한다. 일부 예에서 변이형 RpoC는 상기에 논의된 바와 같이, 서열번호 27을 포함한다.
일부 예에서, 재조합 미생물에서 변이형 RpoC의 발현은 플라스미드의 카피수를, 대조용 미생물에서 서열번호 26을 포함하는 야생형 RpoC의 발현에 비해 감소시킨다. 대조용 미생물은 예를 들어 재조합 미생물과 동일한 속, 종, 및/또는 균주로부터 유래될 수 있으며, 유사하거나 동일한 플라스미드를 포함할 수 있고, 따라서 재조합 미생물의 변이형 rpoC 암호화 서열과 대조용 미생물의 상응하는 rpoC 서열간의 차이에 대한 것 외에는 계통발생적으로 유사하고, 밀접하게 관련되고, 및/또는 유전학적으로 일치할 수 있다. 상기에 논의된 바와 유사하게, 감소는 10% 내지 80%까지, 예를 들어 25% 내지 75%, 30% 내지 70%, 35% 내지 65%, 40% 내지 60%, 45% 내지 55%, 10% 내지 40%, 20% 내지 50%, 30% 내지 60%, 40% 내지 70%, 또는 50% 내지 80%까지 일 수 있다.
핵산 분자를 포함하는 재조합 미생물을, 예를 들어 벡터 중에 클로닝된 완전한 변이형 rpoC 암호화 서열을, 예를 들어 형질전환, 접합, 또는 형질도입에 의해 전구체 미생물내에 도입시켜 재조합 미생물을 수득하고, 이어서 재조합 미생물 중에 완전한 변이형 rpoC 암호화 서열을, 예를 들어 벡터의 선택에 의해 유지시킴으로써 수득할 수 있다. 이를 분자 생물학의 표준 기법에 의해 수행할 수 있다. 참조를 위해서, 벡터는 플라스미드, 바이러스 벡터, 코스미드 또는 인공 염색체 중 하나 이상에 상응할 수 있다. 따라서, 일부 예에서 재조합 미생물을, 형질전환, 접합 또는 형질도입 중 하나 이상에 의해 전구체 미생물내에 핵산 서열을 포함하는 변이형 rpoC 암호화 서열 벡터, 예를 들어 플라스미드를 도입시킴으로써 제조할 수 있다.
핵산 분자를 포함하는 재조합 미생물을, 예를 들어 벡터 중에 클로닝된 변이형 rpoC 암호화 서열의 일부를 도입시키고, 상기 부분을 사용하여, 예를 들어 sacB 벡터를 사용함으로써 상동성 재조합에 의한 유전자 교체에 의해, 내인성 염색체 야생형 rpoC 암호화 서열의 상응하는 부분을 교체시킴으로써 또한 수득할 수 있다. 이를 또한 분자 생물학의 표준 기법에 의해 수행할 수 있다. 따라서, 일부 예에서 재조합 미생물은 염색체를 포함하며, 변이형 rpoC 암호화 서열은 변이형 rpoC 암호화 서열에 의한 내인성 rpoC 암호화 서열의 교체에 기초하여 앰색체 중에 존재한다.
재조합 미생물을 다양한 세균의 세균으로부터 제조할 수 있다. 상기에 논의된 바와 같이, 19개의 다양한 세균들간에 엄격하게 또는 일반적으로 보존되는 다양한 N-말단, 중심 및 C-말단 도메인 서열에 기초하여, 상응하는 서열은 RpoC가 다양한 세균들간에 염색체 및 플라스미드의 전사 및 복제에서 하는 역할에 구조적으로 및/또는 기능적으로 중요한 기여를 하는 것으로 여겨진다. 또한, RpoC에서 특히 N-말단 도메인은 플라스미드의 카피수 측정에 중요한 역할을 하는 것으로 여겨진다. 따라서, 일부 예에서, 재조합 미생물을 써무스 속, 예를 들어 써무스 써모필루스, 아세토박터 속, 예를 들어 아세토박터 파스퇴리아누스, 네이세리아 속, 예를 들어 네이세리아 고노로에아에, 레지오넬라 속, 예를 들어 레지오넬라 뉴모필라, 슈도모나스 속, 예를 들어 슈도모나스 아에루기노사, 비브리오 속, 예를 들어 비브리오 콜레라에, 에스케리키아 속, 예를 들어 에스케리키아 콜라이, 살모넬라 속, 예를 들어 살모넬라 엔테리카 혈청형 티피뮤리움, 액티노마이세스 속, 예를 들어 액티노마이세스 오돈토리티쿠스, 스트렙토마이세스 속, 예를 들어 스트렙토마이세스 코엘리콜라, 코리네박테리움 속, 예를 들어 코리네박테리움 디프테리아에, 마이코박테리움 속, 예를 들어 마이코박테리움 튜베르큘로시스, 로도코커스 속, 예를 들어 로도코커스 에퀴, 클라미디아 속, 예를 들어 클라미디아 트라코마티스, 클로스트리디움 속, 예를 들어 클로스트리디움 보툴리늄, 바실러스 속, 예를 들어 바실러스 서브틸리스, 스트렙토코커스 속, 예를 들어 스트렙토코커스 뉴모니아에, 엔테로코커스 속, 예를 들어 엔테로코커스 파에칼리스, 및 락토바실러스 속, 예를 들어 락토바실러스 브레비스의 세균 중 하나 이상으로부터 제조할 수 있다.
상기에 나타낸 바와 같이, R47C 치환을 포함하고 그 외에는 E.coli의 야생형 RpoC와 일치하는 변이형 RpoC는 예를 들어 약 25% 내지 75%의 플라스미드 카피수의 감소를 나타낸다. 하기에 논의되는 바와 같이, 이는 다양한 E.coli 균주에서 성취되었다. 상응하게, 변이형 rpoC 암호화 가닥을 포함하는 핵산 분자는 플라스미드 카피수를 특별히 에스케리키아 속의 세균, 특히 에스케리키아 콜라이 종의 세균에서 조절할 수 있다. 따라서, 일부 예에서 재조합 미생물을 에스케리키아 속의 세균 또는 에스케리키아 콜라이 종의 세균 중 하나 이상으로부터 제조할 수 있다.
재조합 미생물에서 대상 벡터의 카피수를 조절하는 방법을 또한 개시한다. 재조합 미생물에서 대상 벡터의 카피수를 조절하기 위한 변이형 rpoC 암호화 서열의 용도를 또한 개시한다. 재조합 미생물은 상술한 바와 같을 수 있다. 따라서, 일부 예에서, 변이형 RpoC는 상기에 논의된 바와 같이, (1) 서열번호 28을 포함하는 N-말단 도메인, (2) 서열번호 29를 포함하는 센트럴 도메인, 및 (3) 서열번호 30을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 일부 예에서, 변이형 RpoC는 상기에 논의된 바와 같이, (1) 서열번호 31을 포함하는 N-말단 도메인, (2) 서열번호 32를 포함하는 센트럴 도메인, 및 (3) 서열번호 33을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 일부 예에서 변이형 RpoC는 상기에 논의된 바와 같이, 서열번호 27에 적어도 90% 일치하는 아미노산 서열을 포함한다. 일부 예에서 변이형 RpoC는 상기에 논의된 바와 같이, 서열번호 27을 포함한다. 일부 예에서 재조합 미생물을 에스케리키아 속의 세균 또는 에스케리키아 콜라이 종의 세균 중 하나 이상으로부터 제조할 수 있다. 일부 예에서, 재조합 미생물에서 변이형 RpoC의 발현은 플라스미드의 카피수를 서열번호 26을 포함하는 야생형 RpoC의 발현에 비해 예를 들어 10% 내지 80%까지, 예를 들어 25% 내지 75%, 30% 내지 70%, 35% 내지 65%, 40% 내지 60%, 45% 내지 55%, 10% 내지 40%, 20% 내지 50%, 30% 내지 60%, 40% 내지 70%, 또는 50% 내지 80% 까지 감소시킨다.
방법은 재조합 미생물을 대상 벡터의 복제에 충분한 조건하에 배양 배지에서 배양하여, 대상 벡터의 카피수를 조절함을 포함한다. 상기에 나타낸 바와 같이, 벡터는 플라스미드, 바이러스 벡터, 코스미드 또는 인공 염색체 중 하나 이상에 상응할 수 있다. 또한 논의된 바와 같이, RpoC는 플라스미드의 복제에 한 역할을 한다. 따라서, 일부 예에서 대상 벡터는 플라스미드를 포함한다.
배양을 예를 들어 배양 튜브, 플라스크, 및/또는 생물반응기(이들에 대한 세부사항은 당해 분야의 통상적인 숙련가에게 자명할 것이다)에서 표준 미생물학 기법에 의해 수행할 수 있다. 배양을 적합한 배양 배지, 예를 들어 영양분 풍부 배지 또는 최소 배지(이들에 대한 세부사항은 당해 분야의 통상적인 숙련가에게 자명할 것이다)에서 수행할 수 있다. 배양을 적합한 배양 온도에서, 예를 들어 25-38℃, 28-37℃, 또는 37℃에서 또는 약 상기 온도(이에 대한 세부사항은 당해 분야의 통상적인 숙련가에게 자명할 것이다)에서 수행할 수 있다. 재조합 미생물이 배양 중 생육하고 분열함에 따라, 벡터가 복제될 것이다. 따라서, 예를 들어 에스케리키아 콜라이로부터 제조된 재조합 미생물에 관하여, 상기 재조합 미생물을 생물반응기에서 배치식으로 또는 연속적으로 발효 기법에 의해 배양할 수 있다. 배양을 최소 배지, 예를 들어 다른 것들 중에서도, 한정된 양의 염, 예를 들어 M9 최소염 배지, 및 하나 이상의 탄소원, 예를 들어 글루코스, 슈크로스, 또는 리그노셀룰로스 물질을 포함하는 배지에서 수행할 수 있다. 배양을 약 37℃에서 수행할 수 있다. 상기와 같은 조건은 에스케리키아 콜라이의 생육 및 분열을 지지하며, 따라서 벡터의 복제를 지지할 것이다. 에스케리키아 콜라이뿐만 아니라 다른 미생물로부터 제조된 재조합 미생물의 배양에 적합한 다른 조건은 공지되어 있으며 당해 분야의 통상적인 숙련가에게 자명할 것이다.
재조합 미생물의 사용에 의한 표적 생성물의 제조 방법을 또한 개시한다. 표적 생성물의 제조를 위한 재조합 미생물의 용도를 또한 개시한다. 다시, 재조합 미생물은 상술한 바와 같을 수 있다. 따라서, 일부 예에서, 변이형 RpoC는 상기에 논의된 바와 같이, (1) 서열번호 28을 포함하는 N-말단 도메인, (2) 서열번호 29를 포함하는 센트럴 도메인, 및 (3) 서열번호 30을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 일부 예에서, 변이형 RpoC는 상기에 논의된 바와 같이, (1) 서열번호 31을 포함하는 N-말단 도메인, (2) 서열번호 32를 포함하는 센트럴 도메인, 및 (3) 서열번호 33을 포함하는 C-말단 도메인을 포함하며, 여기에서 R47C 치환은 N-말단 도메인 내에 존재한다. 일부 예에서 변이형 RpoC는 상기에 논의된 바와 같이, 서열번호 27에 적어도 90% 일치하는 아미노산 서열을 포함한다. 일부 예에서 변이형 RpoC는 상기에 논의된 바와 같이, 서열번호 27을 포함한다. 일부 예에서 재조합 미생물을 에스케리키아 속의 세균 또는 에스케리키아 콜라이 종의 세균 중 하나 이상으로부터 제조할 수 있다. 일부 예에서, 재조합 미생물에서 변이형 RpoC의 발현은 플라스미드의 카피수를 서열번호 26을 포함하는 야생형 RpoC의 발현에 비해 예를 들어 10% 내지 80%까지, 예를 들어 25% 내지 75%, 30% 내지 70%, 35% 내지 65%, 40% 내지 60%, 45% 내지 55%, 10% 내지 40%, 20% 내지 50%, 30% 내지 60%, 40% 내지 70%, 또는 50% 내지 80% 까지 감소시킨다.
상기 방법에 따라, 재조합 미생물은 표적 유전자 벡터를 포함하고, 표적 유전자 벡터는 표적 생성물의 제조를 위한 표적 유전자를 포함한다. 벡터는 예를 들어 유기체, 예를 들어 다른 유기체 중에서도 박테리아, 알카에아, 또는 유카리오타 도메인의 미생물, 동물, 및/또는 식물로부터의 표적 유전자를 포함하는 재조합 플라스미드일 수 있다. 특히 박테리아 도메인의 미생물에 관하여, 표적 유전자는 예를 들어 다른 것들 중에서도, 에스케리키아 속의 세균, 예를 들어 에스케리키아 콜라이, 코리네박테리움 속의 세균, 예를 들어 코리네박테리움 글루타미쿰, 또는 바실러스 속의 세균, 예를 들어 바실러스 서브틸리스로부터 유래할 수 있다. 다른 유기체 중에서도 박테리아, 알카에아, 및 유카리오타 도메인의 미생물뿐만 아니라, 동물 및 식물을 포함한 다양한 유기체의 유전자의 재조합 발현을 허용하는 다수의 유전 공학 기법이 개발되었다. 상기와 같은 기법을 적용하여, 당해 분야의 통상적인 숙련가에게 자명한 바와 같이, 본 개시내용에 따라 재조합 미생물에서 다양한 유기체로부터의 표적 유전자를 클로닝하고 발현시킬 수 있다.
일부 예에서, 표적 생성물은 (i) 표적 RNA, (ii) 표적 단백질, (iii) 표적 생물물질, (iv) 표적 중합체, 그의 전구체, 및/또는 그의 생성을 위한 효소, (v) 표적 감미제, 그의 전구체, 및/또는 그의 생성을 위한 효소, (vi) 표적 오일, 그의 전구체, 및/또는 그의 생성을 위한 효소, (vii) 표적 지방, 그의 전구체, 및/또는 그의 생성을 위한 효소, (viii) 표적 폴리사카라이드, 그의 전구체, 및/또는 그의 생성을 위한 효소, (ix) 표적 아미노산, 그의 전구체, 및/또는 그의 생성을 위한 효소, (x) 표적 뉴클레오티드, 그의 전구체, 및/또는 그의 생성을 위한 효소, (xi) 표적 백신, 그의 전구체, 및/또는 그의 생성을 위한 효소, 또는 (xii) 표적 약학 생성물, 그의 전구체, 및/또는 그의 생성을 위한 효소 중 하나 이상을 포함한다. 따라서, 일부 예에서 표적 유전자는 표적 RNA를 암호화하고, 표적 생성물은 표적 RNA에 상응한다. 또한 일부 예에서, 표적 유전자는 표적 단백질을 암호화하고, 표적 생성물은 표적 단백질에 상응한다. 또한 일부 예에서 표적 유전자는 표적 RNA 및/또는 표적 단백질을 암호화하고, 표적 RNA 및/또는 표적 단백질은 차례로 표적 생물물질, 표적 중합체, 그의 전구체, 및/또는 그의 생성을 위한 효소, 표적 감미제, 그의 전구체, 및/또는 그의 생성을 위한 효소, 표적 오일, 그의 전구체, 및/또는 그의 생성을 위한 효소, 표적 지방, 그의 전구체, 및/또는 그의 생성을 위한 효소, 표적 폴리사카라이드, 그의 전구체, 및/또는 그의 생성을 위한 효소, 표적 아미노산, 그의 전구체, 및/또는 그의 생성을 위한 효소, 표적 뉴클레오티드, 그의 전구체, 및/또는 그의 생성을 위한 효소, 표적 백신, 그의 전구체, 및/또는 그의 생성을 위한 효소, 또는 표적 약학 생성물, 그의 전구체, 및/또는 그의 생성을 위한 효소의 생성에 한 역할을 한다. 예를 들어 표적 중합체, 표적 감미제, 표적 오일, 표적 지방, 표적 폴리사카라이드, 표적 아미노산, 및/또는 표적 뉴클레오티드에 관하여, 표적 유전자는 상기 표적 중합체, 표적 감미제, 표적 오일, 표적 지방, 표적 폴리사카라이드, 표적 아미노산, 및/또는 표적 뉴클레오티드를 생산하는 효소에 상응하는 표적 단백질을 직접적으로 또는 전구체를 통해 암호화할 수 있다. 또한 예를 들어, 백신에 관하여, 표적 유전자는 병원성 미생물에 대한 백신의 성분으로서 사용될 수 있는 항원 또는 항원 단편, 예를 들어 단백질 서브유닛, 수용체, 또는 병원성 미생물의 다른 단백질, 또는 그의 단편에 상응하는 표적 단백질을 암호화할 수 있다. 또한 예를 들어, 표적 약학 생성물에 관하여, 표적 유전자는 약학 생성물의 성분으로서 사용될 수 있는 표적 단백질, 예를 들어 항체, 수용체 또는 호르몬을 암호화할 수 있다.
일부 예에서, 벡터는 다수의 표적 유전자, 예를 들어 특정 유기체로부터의 다수의 표적 유전자, 및/또는 다수의 유기체 각각으로부터의 하나 이상의 표적 유전자를 포함한다. 또한 일부 예에서, 표적 유전자는 다수의 표적 생성물을 제조하기 위한 것이다.
일부 예에서, 표적 유전자 벡터, 예를 들어 플라스미드는 3 내지 120 kb의 크기를 갖는다. 재조합 벡터는 종종 3 내지 120 kb의 크기로 발생하는데, 그 이유는 이것이 표적 유전자가 클로닝된 벡터에 전형적인 크기이기 때문이다.
방법은 또한 (1) 재조합 미생물을 상기 재조합 미생물이 표적 유전자를 발현하는 조건하에 배양 배지에서 배양하여, 표적 생성물을 생성시키는 단계를 포함한다. 다시, 배양을 예를 들어 배양 튜브, 플라스크, 및/또는 생물반응기에서, 적합한 배양 배지, 예를 들어 영양분 풍부 배지 또는 최소 배지에서, 적합한 배양 온도에서, 예를 들어 25-38℃, 28-37℃, 또는 37℃에서 또는 약 상기 온도에서 표준 미생물학 기법에 의해 수행할 수 있으며, 이에 대한 세부사항은 당해 분야의 통상적인 숙련가에게 자명할 것이다.
방법은 또한 (2) 표적 생성물을 재조합 미생물 및/또는 배양 배지로부터 회수하는 단계를 포함한다. 표적 생성물의 회수에 적합한 접근법은 다른 접근법들 중에서도, 표적 생성물의 세부사항에 기초하여, 예를 들어 표적 생성물, 예를 들어 RNA, 단백질, 중합체 등의 유형, 표적 생성물의 구체적인 세부사항, 예를 들어 화학 구조, 분자량, 친화성 태그 등 및 목적하는 순도, 예를 들어 높고 낮은 순도에 따라, 표적 단백질에 상응하는 표적 생성물에 대한 표준 단백질 정제 기법 또는 표적 중합체에 상응하는 표적 생성물에 대한 표준 중합체 추출 및 침전 기법(세부사항은 당해 분야의 통상적인 숙련가에게 자명할 것이다)을 전개할 수 있다. 예를 들어, 표적 RNA에 상응하는 표적 생성물에 관하여, 재조합 미생물의 배양에 이어서, 표적 RNA를, 당해 분야에 공지된 다른 방법들 중에서도, 문헌[Stead et al., Nucleic Acids Research 40(20), e156:1-9 (2012)]에 기재된 바와 같은 RNAsnap(TM) 방법의 사용에 의해, 또는 상업적으로 입수할 수 있는 방법, 예를 들어 TRIzol(R) Max(TM) 세균 RNA 단리 키트(ThermoFisher Scientific), RNeasy(R) 보호 세균 단리 키트(Qiagen), 또는 RiboPure(TM) 세균 RNA 단리(ThermoFisher Scientific)에 의해 재조합 미생물로부터 회수할 수 있다. 표적 단백질에 상응하는 표적 생성물의 경우, 표적 단백질을 당해 분야에 주지된 과정에 따라, 추출, 이온-교환 크로마토그래피, 친화성 크로마토그래피, 및/또는 침전에 의한 농축에 의해 재조합 미생물로부터 회수할 수 있다. 표적 중합체에 상응하는 표적 생성물의 경우, 표적 중합체를, 또한 당해 분야에 주지된 과정에 따라, 화학 구조 및 분자량에 기초하여 선택된 세척을 위한 조성물 및 농축을 위한 침전물과 함께 추출, 세척 및 농축에 의해 회수할 수 있다. 표적 감미제, 표적 오일, 표적 지방, 표적 폴리사카라이드, 표적 아미노산, 또는 표적 뉴클레오티드에 상응하는 표적 생성물의 경우, 상기 표적 생성물이 재조합 미생물내에 축적되는 경우, 유사한 접근법을 또한 사용할 수 있는 반면, 상기 표적 생성물이 세포외에 축적되는 경우, 표적 생성물을, 예를 들어 다시 당해 분야에 주지된 과정에 따라, 배양 배지로부터의 침전에 의해 회수할 수 있다. 표적 백신 또는 표적 약학 생성물에 상응하는 표적 생성물의 경우, 표적 생성물을 다시 당해 분야에 주지된 과정에 따라, 다른 접근법들 중에서도, 예를 들어 표적 단백질에 대해 상술한 바와 같이, 예를 들어 항원 또는 항원 단편에 상응하는 표적 백신의 경우 이온-교환 크로마토그래피에 기초하여, 또는 단클론 항체에 상응하는 약학 생성물의 경우 친화성 크로마토그래피에 기초하여 회수할 수 있다.
일부 예에서 재조합 미생물을 상기에서 논의된 바와 같이, 표적 유전자 벡터를 형질전환, 접합, 또는 형질도입 중 하나 이상에 의해 재조합 미생물내에 도입시킴으로써 제조할 수 있다.
표적 생성물을 보다 상세히 고려하면, 일부 예에서, 표적 생성물은 (i) 표적 중합체, 그의 전구체, 및/또는 그의 생성을 위한 효소, 또는 (ii) 표적 생물중합체, 그의 전구체, 및/또는 그의 생성을 위한 효소 중 하나 이상을 포함한다. 생물중합체를 포함하여, 중합체의 생물학적 생성은, 특히 복잡한 화학 구조를 갖는 단량체를 기본으로 하는 중합체 및/또는 2개 이상의 단량체를 포함하는 공중합체의 경우, 재조합 미생물에서 다수의 표적 유전자의 조화된 도입 및 발현에 대한 필요성을 근거로, 도전일 수 있다. 유사한 고려사항을 또한 상기에 논의된 바와 같은 다른 표적 생성물, 특히 표적 감미제, 표적 지방, 표적 폴리사카라이드, 표적 아미노산, 표적 뉴클레오티드, 표적 백신, 및 표적 약학 생성물에 관하여 적용한다.
방법은, 예를 들어 표적 중합체, 표적 생물중합체, 표적 감미제, 표적 지방, 표적 폴리사카라이드, 표적 아미노산, 표적 뉴클레오티드, 표적 백신, 또는 표적 약학 생성물의 생성을 위해, 재조합 미생물 중의 벡터의 안정한 혼입과 표적 유전자의 생성물의 수율 조절과의 균형을 위해, 다수의 표적 유전자를 포함하는 벡터에 적합한 카피수를 신속하게 측정하는데 유용할 수 있다. 벡터의 세트를 제조할 수 있다. 벡터는 하나 이상의 표적 유전자를 포함할 수 있다. 벡터는 그의 기준선 카피수에 관하여 변할 수 있다. 벡터를 야생형 rpoC 암호화 서열을 포함하는(따라서 야생형 RpoC를 발현한다) 제1 세균 균주, 예를 들어 E.coli 균주내에 도입시켜, 벡터를 포함하고 야생형 RpoC를 발현하는 제1 E.coli 균주 세트를 수득할 수 있다. 벡터를 또한 R47C 치환을 포함하는 변이형 RpoC를 암호화하는 변이형 rpoC 암호화 서열을 포함하는(따라서 변이형 RpoC를 발현한다) 상응하는 제2 재조합 세균 균주(제2 균주는 그 외에는 제1 균주와 동일하다), 예를 들어 재조합 E.coli 균주에 도입시켜, 벡터를 포함하고 변이형 RpoC를 발현하는 상응하는 제2 E.coli 균주 세트를 수득할 수 있다. 방법을, 제1 및 제2 균주 세트를 이들 균주가 하나 이상의 표적 유전자를 발현하는 조건하에 배양 배지에서 배양하여, 표적 생성물을 생성시키고 표적 생성물을 회수함으로써 수행할 수 있다. 카피수를 배양 중 각각의 벡터에 대해 측정할 수 있다. 표적 생성물의 수율 또는 다른 관련된 특징을 회수 중에 측정할 수 있다. 상기 접근법은 제1 균주 세트에 대해 성취될 수 있는 카피수의 하한을 실질적으로 감소시킬 수 있으며, 이는 표적 유전자의 발현이 유해한 경우에, 예를 들어 세포에서 일정 수준을 초과하여 발현시 세포에 독성인 표적 RNA 및/또는 표적 단백질을 암호화하는 표적 유전자의 경우에, 상기 균주의 세포의 생육성을 유지하는데 유리할 수 있다. 상기 접근법은 또한 표적 생성물의 수율에 대한 벡터 카피수의 영향을 시험함에 관하여 샘플 크기를 유효하게 배가할 수 있다. 상기 접근법을, 특히 3 내지 120 kb 크기의 플라스미드를 포함하고/하거나 다수의 표적 유전자를 포함하는 플라스미드에 상응하는 벡터에 사용할 수 있다.
변이형 rpoC 암호화 서열 및 유전자 교체 서열을 포함하는 유전자 교체 벡터를 또한 개시한다. 변이형 rpoC 암호화 서열은 서열번호 28을 포함하는 변이형 RpoC N-말단 도메인을 암호화한다. 이 경우에 변이형 rpoC 암호화 서열은 전장 변이형 rpoC 암호화 서열일 필요가 없으며, 바람직하게는 단지 유전자 교체 벡터가 도입되는 미생물의 염색체에서 내인성 rpoC 암호화 서열과 상동성 재조합을 수행하기에 충분한 변이형 rpoC 암호화 서열만을 포함한다. 변이형 rpoC 암호화 서열은 예를 들어 0.2 내지 5 kb, 0.5 내지 3 kb, 0.7 내지 2.5 kb, 0.8 내지 2 kb, 0.9 내지 1.5 kb, 또는 약 1 kb의 전장 변이형 rpoC 암호화 서열을 포함할 수 있다.
유전자 교체 서열은 변이형 rpoC 암호화 서열에 의한, 미생물 염색체 중의 내인성 rpoC 암호화 서열의 교체를 위한 단백질을 암호화한다. 일부 예에서 유전자 교체 서열은 sacB 유전자를 포함하고 단백질은 SacB를 포함한다. 예시적인 sacB 벡터는 문헌[Link et al., Journal of Bacteriology 179:6228-6237 (1997)], 및 하기의 웹사이트: arep.med.harvard.edu/labgc/pko3.html에 기재된 바와 같이 pKO3 및 pKOV를 포함한다.
유전자 교체 벡터를, 변이형 rpoC 암호화 서열, 예를 들어 0.2 내지 5 kb, 0.5 내지 3 kb, 0.7 내지 2.5 kb, 0.8 내지 2 kb, 0.9 내지 1.5 kb, 또는 약 1 kb의 전장 변이형 rpoC 암호화 서열을 전구체 벡터내에 클로닝함으로써 제조할 수 있다. 전구체 벡터는 변이형 rpoC 암호화 서열에 의한, 미생물 염색체 중의 내인성 rpoC 암호화 서열의 교체를 위한 유전자 교체 서열, 예를 들어 단백질, 예를 들어 SacB를 암호화하는 sacB 유전자를 포함할 수 있다.
유전자 교체 벡터를 사용하여, 표준 분자 생물학 기법에 의해 서열번호 28을 포함하는 변이형 rpoC 암호화 서열로 미생물 염색체 중의 내인성 rpoC 암호화 서열을 교체시킬 수 있다. 유전자 교체를 위한 sacB 벡터의 용도는 또한 문헌[Link et al.] 및 웹사이트: arep.med.harvard.edu/labgc/pko3.html에 기재되어 있다.
발명의 방식
실시예
실시예 1: 염색체상의
rpoC
서열을 교체하기 위한 플라스미드의 구성
(1) rpoC 단편 및 sacB 벡터의 제조
변형된 뉴클레오티드 서열(서열번호 1)을 갖는 부분적인 변이형 rpoC 서열을 함유하는 2개의 0.5 kb DNA 단편을 증폭시키기 위해서, 에스케리키아 콜라이 균주 LS5218(Coli Genetic Stock Center(CGSC)(균주 6966)로부터 수득되었다)의 게놈 DNA(gDNA)를 QIAGEN 게노믹-팁 시스템을 사용하여 추출하고, 폴리머라제 연쇄 반응(PCR)을 PfuUltra II 융합 HS DNA 폴리머라제(Agilent에 의해 제조)와 함께 주형으로서 gDNA를 사용하여 수행하였다. 뉴클레오티드 서열로부터 추론된 상응하는 변형된 RpoC 단백질은 서열번호 27이다. 변형된 RpoC 단백질 서열은 rpoC 뉴클레오티드 서열이 ATG 대신에 대안의 개시 코돈 GTG를 포함하는 것으로 간주된다. GTG는 일반적으로 발린을 암호화하지만, 대안의 개시 코돈으로서 GTG는 메티오닌을 암호화한다. PCR을 서열번호 3 및 서열번호 4의 프라이머를 사용하여 하기와 같이 수행하였다: 30주기의 95℃에서 30초간의 변성, 56℃에서 30초간의 어닐링, 및 72℃에서 30초간의 연장. 또 다른 PCR을 72℃에서 30초간 연장을 위해 서열번호 5 및 서열번호 6의 프라이머를 사용하여 수행하였다. 혼합물을 QIAGEN 정제 키트로 정제하고 이어서 용출시켜 2개의 상이한 0.5 kb DNA 단편을 수득하였다.
sacB 유전자 및 R6K 기원을 함유하는 유전자 교체 벡터(도 6A-B)를 제조하기 위해서, pSKH130을 제한 효소 EcoRV로 절단하였다. PCR 혼합물 및 EcoRV 절단의 반응 혼합물을 QIAGEN 정제 키트로 정제하고 이어서 용출시켜 첫 번째 0.5 kb DNA 단편, 두 번째 0.5 kb DNA 단편, 및 4.7 kb 벡터 DNA 단편(또한 "sacB 벡터 컷"으로서 지칭된다)을 수득하였다.
(2) rpoC 서열을 교체하기 위한 플라스미드의 구성
실시예: 1-(1)에 기재된 첫 번째 0.5 kb DNA 단편, 두 번째 0.5 kb DNA 단편, 및 sacB 벡터 컷을 pJSL47의 구성에 사용하였다. pJSL47 플라스미드를 NEBuilder HiFi DNA 조립 마스터 믹스(NEB에 의해 제조) 및 BW25113(Coli Genetic Stock Center(CGSC)의 균주 7636이다)을 사용하여 구성하였다.
(3) 재조합 E.coli CC06-9642의 제조
에스케리키아 콜라이 LS5218(서열번호 7)의 염색체상의 rpoC를 변이형 rpoC 서열(서열번호 1)로 치환시키기 위해서, pJSL47 플라스미드를 E.coli 균주 LS5218 내로 일렉트로포레이션에 의해 도입시킨 다음 50 ㎎/L의 카나마이신을 함유하는 루리아 베르타니(LB) 아가 플레이트상에서 생육된 단일 콜로니를 선택하였다. 선택된 콜로니의 염색체내로의 pJSL47의 삽입을 서열번호 8 및 서열번호 9의 프라이머를 사용하여 PCR에 의해 확인하였다. 선택된 균주를, sacB 유전자 및 R6K 기원을 "팝 아웃(pop out)"시키기 위해서, NaCl은 없지만 10% 슈크로스를 함유하는 LB 아가 플레이트상에서 생육시켰다. 형질전환체를, PCR 및 서열 확인에 의해 LS5218 rpoC(서열번호 7)의 변이형 rpoC 서열(서열번호 1)에 의한 교체에 대해 확인하였다. 정확한 유전자형을 갖는 생성 균주를 E.coli CC06-9642로서 표시하였다.
실시예 2: 플라스미드 카피수의 측정
야생형 균주 LS5218, 및 균주 CC06-9642는 모두 F-유사 플라스미드(67,502 bp)를 함유한다. CC06-9642를 생성시킨 후에, F-유사 플라스미드의 존재를 PCR 방법에 의해 확인하였다. 사용된 프라이머는 예를 들어 서열번호 10 및 11의 것이었다. CC06-9642가 생성되었을 때, CC06-9637이 또한 생성되었다. 이들간의 차이는 CC06-9642가 F-유사 플라스미드를 함유하지만, CC06-9637은 이를 함유하지 않는다는 것이다. CC06-9637의 RpoC는 또한 변이형 rpoC이다.
상기 두 균주의 플라스미드 카피수를, PCR 도중 형성된 이중가닥 DNA를 결합시킴으로써 PCR 생성물을 검출하는 SYBR (R) Green I 염료를 사용하는 실시간 PCR(또한 "qPCR"이라 지칭된다)을 사용하여 측정하였다. 프로토콜은 Applied Biosystems 7500 Fast 실시간 PCR 시스템상에서 "단일-용기"로 세포 용해 및 PCR 반응을 수행하기 위해 Fast SYBR (R) Green Cells-to-Ct(TM) 키트를 사용하였다.
세포 용해물을 제조하기 위해서, LB 브로쓰 중의 각각의 균주의 밤새 배양물을 저온(4℃) 1xPBS로 희석한 다음 용해 용액, 정지 용액(Fast SYBR (R) Green Cells-to-Ct(TM) 키트, Cat. # 4402956) 및 RNase A(Life Technologies, Cat. # 12091-021, 20 ㎎/㎖)를 첨가하였다. qPCR 반응 혼합물을, 4 ㎕의 세포 용해물을 16 ㎕의 PCR 칵테일에 가함으로써 제조하였으며, 그의 조성을 표 2에 나타낸다.
성분 | 부피 |
Fast SYBR (R) Green PCR 마스터 믹스 | 10 ㎕ |
순방향 프라이머(50μM 스톡) | 0.12 ㎕ |
역방향 프라이머(50μM 스톡) | 0.12 ㎕ |
무-뉴클레아제 수 | 5.76 ㎕ |
20 ㎕ qPCR 반응 혼합물을 위한 PCR 칵테일의 최종 부피 | 16 ㎕ |
LS5218 및 CC06-9642 세포 샘플 중의 F-유사 플라스미드의 카피수를, β-갈락토시다제를 암호화하는 단일 카피 염색체 lacZ 유전자의 경우에 대한, 플라스미드상의 마커 DNA 서열, 구체적으로 RepFIA 및 RepFIC의 상대적인 풍부성으로부터 추정하였다. RepFIA에 사용된 프라이머는 서열번호 24 및 서열번호 25이었다. RepFIC에 사용된 프라이머는 서열번호 10 및 서열번호 11이었다. lacZ에 대해 사용된 프라이머는 서열번호 12 및 서열번호 13이었다.
실시간 PCR 반응을 하기와 같은 7500 Fast 실시간 PCR 디폴트 프로그램: 1 주기의 95℃에서 20초간의 효소 활성화, 40주기의 95℃에서 3초간의 변성, 60℃에서 30초간의 어닐링 및 연장, 및 해리 곡선을 사용하여 수행하였다.
플라스미드 카피수를 2DCt를 계산함으로써 측정하였으며, 여기에서 DCt는 lacZ 유전자 Ct 값으로부터 RepFIC Ct 값을 공제함으로써 계산되었다(DCt = Ct_lacZ - Ct_repFIC).
균주 | lacZ에 대한 RepFIA 상대적인 풍부성 | lacZ에 대한 RepFIC 상대적인 풍부성 |
LS5218 | 9.2 | 11.5 |
CC06-9642 | 6.2 | 4.3 |
표 3에 나타낸 바와 같이, CC06-9642의 F-유사 플라스미드의 카피수는 대조용 LS5218의 경우보다 더 낮았다. 따라서, 변이형 rpoC 서열은 F-유사 플라스미드의 카피수의 감소를 생성시킴이 확인되었다. 플라스미드 카피수가 과도한 경우, 대사 부담이 미생물의 세포에 가해질 수 있다. 상기 결과는 변이형 rpoC가 플라스미드 카피수의 조절의 함수를 가질 수 있음을 가리키며, 따라서 상기 결과로부터 균주가 안정하게 생육될 수 있고 플라스미드가 안정하게 발현될 수 있음을 알 수 있다.
실시예 3: RepFIC 레플리콘의 변형된 DNA 서열을 함유하는 플라스미드의 구성
(1) RepFIC 단편 및 카나마이신-내성 유전자 단편의 제조
RepFIC 레플리콘(서열번호 14)을 함유하는 5.2 kb DNA 단편을 증폭시키기 위해서, E.coli LS5218의 게놈 DNA(gDNA)를 QIAGEN 게노믹-팁 시스템을 사용하여 추출하고, 폴리머라제 연쇄 반응(PCR)을 PfuUltra II 융합 HS DNA 폴리머라제(Agilent에 의해 제조)와 함께 주형으로서 gDNA를 사용하여 수행하였다. PCR을 서열번호 15 및 서열번호 16의 프라이머를 사용하여 하기와 같이 수행하였다: 30주기의 95℃에서 30초간의 변성, 56℃에서 30초간의 어닐링, 및 72℃에서 5분간의 연장.
카나마이신 내성 유전자를 함유하는 1.4 kb DNA 단편을 증폭시키기 위해서, PCR을 PfuUltra II 융합 HS DNA 폴리머라제와 함께 주형으로서 플라스미드 pKD4를 사용하여 수행하였다. PCR을 서열번호 17 및 서열번호 18의 프라이머를 사용하여 하기와 같이 수행하였다: 30주기의 94℃에서 30초간의 변성, 56℃에서 30초간의 어닐링, 및 72℃에서 1분 30초간의 연장.
PCR 반응을 완료하고 이어서 혼합한 후에, 1.3 ㎖의 DpnI 및 5.7 ㎖의 10X 완충제 Tango(Thermo Fisher Scientific으로부터)(Cat No. ER1701)를 각각의 50 ㎕의 PCR 혼합물에 가하고, 이어서 이를 37℃에서 1시간 동안 배양하여 주형 DNA를 제거하였다. 혼합물을 QIAGEN 정제 키트로 정제하고 이어서 용출시켜 5.2 kb DNA 단편(또한 "RepFIC 단편"이라 지칭됨) 및 1.4 kb DNA 단편(또한 "KanR 단편"이라 지칭됨)을 수득하였다.
(2) 변형된 서열을 함유하는 RepFIC 단편의 제조
E.coli LS5218의 RepFIC 레플리콘에 비해 단일의 뉴클레오티드 치환을 포함하고 플라스미드 카피수의 증가를 생성시키는 변형된 RepFIC 레플리콘(서열번호 19)을 수득하였다. 변형된 RepFIC 레플리콘(서열번호 19)을 함유하는 5.2 kb DNA 단편을 증폭시키기 위해서, PCR을 PfuUltra II 융합 HS DNA 폴리머라제와 함께 주형으로서 E.coli LS5218의 gDNA를 사용하여 수행하였다. PCR을 서열번호 15 및 서열번호 20의 프라이머를 사용하여 하기와 같이 수행하였다: 30주기의 95℃에서 30초간의 변성, 56℃에서 30초간의 어닐링, 및 72℃에서 3분간의 연장. 또 다른 PCR을 72℃에서 2분 30초간의 연장을 위해 서열번호 21 및 서열번호 16의 프라이머를 사용하여 수행하였다.
PCR 반응의 완료 후에, 1.3 ㎖의 DpnI 및 5.7 ㎖의 10X 완충제 Tango(Thermo Fisher Scientific으로부터)(Cat No. ER1701)를 50 ㎕의 PCR 반응 혼합물에 가하고, 이어서 이를 37℃에서 1시간 동안 배양하여 주형 DNA를 제거하였다. 혼합물을 QIAGEN 정제 키트로 정제하고 이어서 용출시켜 2.8 kb DNA 단편 및 2.4 kb DNA 단편을 수득하였다.
(3) RepFIC 레플리콘의 야생형 또는 변형된 서열을 함유하는 플라스미드의 구성
실시예: 3-(1)에 기재된 RepFIC 단편 및 KanR 단편을 pJSL48이라 칭하는 플라스미드의 구성에 사용하였다. pJSL48 플라스미드를, 도 7A-B에 도시된 바와 같이 NEBuilder HiFi DNA 조립 마스터 믹스(NEB에 의해 제조)를 사용하여 구성하였다.
실시예: 3-(2)에 기재된 2.8 kb 및 2.4 kb DNA 단편 및 실시예: 3-(1)에 기재된 KanR 단편을 pJSL49라 칭하는 또 다른 플라스미드의 구성에 사용하였다. 3개 단편의 깁슨 조립을 도 8A-B에 도시된 바와 같이 NEBuilder HiFi DNA 조립 마스터 믹스로 수행하였다.
실시예 4: 플라스미드 카피수의 측정
플라스미드 카피수를 측정하기 위해서, 플라스미드 pJSL48 및 pJSL49를 E.coli LS5218에 도입시켜, E.coli 균주 CC06-9665 및 CC06-9666을 생성시켰다. 플라스미드 pJSL48 및 pJSL49를 또한 CC06-9642에 도입시켜, 각각 E.coli 균주 CC06-9638 및 CC06-9639를 생성시켰다.
E.coli 균주 CC06-9665, 9666, 9638 및 9639 중의 플라스미드의 카피수를 실시예 2에 기재된 바와 같이, 실시간 PCR 방법을 사용하여 측정하였다.
세포 샘플 중 플라스미드의 카피수를 β-갈락토시다제를 암호화하는 단일 카피 염색체 lacZ의 경우에 비해, 플라스미드상의 마커 DNA 서열, RepFIC 레플리콘의 상대적인 풍부성으로부터 평가하였다. RepFIC에 사용된 프라이머는 서열번호 10 및 서열번호 11이었다. lacZ에 사용된 프라이머는 서열번호 12 및 서열번호 13이었다. 카나마이신-내성 유전자에 사용된 프라이머는 서열번호 22 및 서열번호 23이었다.
균주 | lacZ에 대한 KanR 상대적인 풍부성 | lacZ에 대한 repFIC 상대적인 풍부성 |
CC06-9665 | 5.4 | 8.8 |
CC06-9666 | 51.7 | 71.0 |
CC06-9638 | 4.0 | 4.8 |
CC06-9639 | 18.4 | 19.6 |
표 4에 나타낸 바와 같이, 플라스미드 pJSL49내에 도입된 변형된 RepFIC 서열을 함유하는 균주인 균주 CC06-9666 및 9639의 플라스미드 카피수는 균주 CC06-9665 및 9638의 경우보다 더 높았다. 균주 CC06-9665의 플라스미드 카피수는 CC06-9638의 경우보다 더 높았고, 균주 CC06-9666의 플라스미드 카피수는 CC06-9639의 경우보다 더 높았다. 따라서, 교체된 rpoC 서열은 RepFIC 레플리콘, 즉 E.coli LS5218의 RepFIC 레플리콘 또는 변형된 RepFIC가 사용된 것과 독립적으로 플라스미드 카피수의 감소를 생성시킴이 확인되었다.
본원에 개시된 변이형 rpoC 암호화 서열을 포함하는 핵산 분자는 벡터, 예를 들어 플라스미드의 카피수를 조절하는데 유용하며, 따라서 벡터의 사용에 의한 표적 생성물의 상업적인 생산을 개선시키는데 유용하다.
생물학적 기탁에 관한 정보
R47C 치환을 포함하는 변이형 RpoC를 암호화하는 변이형 rpoC 암호화 서열을 포함하는 핵산 분자를 포함하도록 형질전환된 E.coli 균주를 상술한 바와 같이 제조하였으며, 에스케리키아 콜라이 CC06-9637로서 표시하였고, 2018년 6월 15일자로, 부다페스트 조약에 따라 국제 공인 기탁 기관인 한국 미생물 배양 센터에 수납번호 KCCM12276P로 기탁하였다. 상기 균주는 부다페스트 조약에 따라 국제 공인 기탁 기관에 의해 기탁된다.
<110> CJ CHEILJEDANG CORPORATION
<120> Nucleic Acid Molecules Comprising a Variant RpoC Coding Sequence
<130> IKPA201775-KR
<150> US 62/715,530
<151> 2018-08-07
<160> 51
<170> PatentIn version 3.5
<210> 1
<211> 4224
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct modified rpoC
<400> 1
gtgaaagatt tattaaagtt tctgaaagcg cagactaaaa ccgaagagtt tgatgcgatc 60
aaaattgctc tggcttcgcc agacatgatc cgttcatggt ctttcggtga agttaaaaag 120
ccggaaacca tcaactactg tacgttcaaa ccagaacgtg acggcctttt ctgcgcccgt 180
atctttgggc cggtaaaaga ttacgagtgc ctgtgcggta agtacaagcg cctgaaacac 240
cgtggcgtca tctgtgagaa gtgcggcgtt gaagtgaccc agactaaagt acgccgtgag 300
cgtatgggcc acatcgaact ggcttccccg actgcgcaca tctggttcct gaaatcgctg 360
ccgtcccgta tcggtctgct gctcgatatg ccgctgcgcg atatcgaacg cgtactgtac 420
tttgaatcct atgtggttat cgaaggcggt atgaccaacc tggaacgtca gcagatcctg 480
actgaagagc agtatctgga cgcgctggaa gagttcggtg acgaattcga cgcgaagatg 540
ggggcggaag caatccaggc tctgctgaag agcatggatc tggagcaaga gtgcgaacag 600
ctgcgtgaag agctgaacga aaccaactcc gaaaccaagc gtaaaaagct gaccaagcgt 660
atcaaactgc tggaagcgtt cgttcagtct ggtaacaaac cagagtggat gatcctgacc 720
gttctgccgg tactgccgcc agatctgcgt ccgctggttc cgctggatgg tggtcgtttc 780
gcgacttctg acctgaacga tctgtatcgt cgcgtcatta accgtaacaa ccgtctgaaa 840
cgtctgctgg atctggctgc gccggacatc atcgtacgta acgaaaaacg tatgctgcag 900
gaagcggtag acgccctgct ggataacggt cgtcgcggtc gtgcgatcac cggttctaac 960
aagcgtcctc tgaaatcttt ggccgacatg atcaaaggta aacagggtcg tttccgtcag 1020
aacctgctcg gtaagcgtgt tgactactcc ggtcgttctg taatcaccgt aggtccatac 1080
ctgcgtctgc atcagtgcgg tctgccgaag aaaatggcac tggagctgtt caaaccgttc 1140
atctacggca agctggaact gcgtggtctt gctaccacca ttaaagctgc gaagaaaatg 1200
gttgagcgcg aagaagctgt cgtttgggat atcctggacg aagttatccg cgaacacccg 1260
gtactgctga accgtgcacc gactctgcac cgtctgggta tccaggcatt tgaaccggta 1320
ctgatcgaag gtaaagctat ccagctgcac ccgctggttt gtgcggcata taacgccgac 1380
ttcgatggtg accagatggc tgttcacgta ccgctgacgc tggaagccca gctggaagcg 1440
cgtgcgctga tgatgtctac caacaacatc ctgtccccgg cgaacggcga accaatcatc 1500
gttccgtctc aggacgttgt actgggtctg tactacatga cccgtgactg tgttaacgcc 1560
aaaggcgaag gcatggtgct gactggcccg aaagaagcag aacgtctgta tcgctctggt 1620
ctggcttctc tgcatgcgcg cgttaaagtg cgtatcaccg agtatgaaaa agatgctaac 1680
ggtgaattag tagcgaaaac cagcctgaaa gacacgactg ttggccgtgc cattctgtgg 1740
atgattgtac cgaaaggtct gccttactcc atcgtcaacc aggcgctggg taaaaaagca 1800
atctccaaaa tgctgaacac ctgctaccgc attctcggtc tgaaaccgac cgttattttt 1860
gcggaccaga tcatgtacac cggcttcgcc tatgcagcgc gttctggtgc atctgttggt 1920
atcgatgaca tggtcatccc ggagaagaaa cacgaaatca tctccgaggc agaagcagaa 1980
gttgctgaaa ttcaggagca gttccagtct ggtctggtaa ctgcgggcga acgctacaac 2040
aaagttatcg atatctgggc tgcggcgaac gatcgtgtat ccaaagcgat gatggataac 2100
ctgcaaactg aaaccgtgat taaccgtgac ggtcaggaag agaagcaggt ttccttcaac 2160
agcatctaca tgatggccga ctccggtgcg cgtggttctg cggcacagat tcgtcagctt 2220
gctggtatgc gtggtctgat ggcgaagccg gatggctcca tcatcgaaac gccaatcacc 2280
gcgaacttcc gtgaaggtct gaacgtactc cagtacttca tctccaccca cggtgctcgt 2340
aaaggtctgg cggataccgc actgaaaact gcgaactccg gttacctgac tcgtcgtctg 2400
gttgacgtgg cgcaggacct ggtggttacc gaagacgatt gtggtaccca tgaaggtatc 2460
atgatgactc cggttatcga gggtggtgac gttaaagagc cgctgcgcga tcgcgtactg 2520
ggtcgtgtaa ctgctgaaga cgttctgaag ccgggtactg ctgatatcct cgttccgcgc 2580
aacacgctgc tgcacgaaca gtggtgtgac ctgctggaag agaactctgt cgacgcggtt 2640
aaagtacgtt ctgttgtatc ttgtgacacc gactttggtg tatgtgcgca ctgctacggt 2700
cgtgacctgg cgcgtggcca catcatcaac aagggtgaag caatcggtgt tatcgcggca 2760
cagtccatcg gtgaaccggg tacacagctg accatgcgta cgttccacat cggtggtgcg 2820
gcatctcgtg cggctgctga atccagcatc caagtgaaaa acaaaggtag catcaagctc 2880
agcaacgtga agtcggttgt gaactccagc ggtaaactgg ttatcacttc ccgtaatact 2940
gaactgaaac tgatcgacga attcggtcgt actaaagaaa gctacaaagt accttacggt 3000
gcggtactgg cgaaaggcga tggcgaacag gttgctggcg gcgaaaccgt tgcaaactgg 3060
gacccgcaca ccatgccggt tatcaccgaa gtaagcggtt ttgtacgctt tactgacatg 3120
atcgacggcc agaccattac gcgtcagacc gacgaactga ccggtctgtc ttcgctggtg 3180
gttctggatt ccgcagaacg taccgcaggt ggtaaagatc tgcgtccggc actgaaaatc 3240
gttgatgctc agggtaacga cgttctgatc ccaggtaccg atatgccagc gcagtacttc 3300
ctgccgggta aagcgattgt tcagctggaa gatggcgtac agatcagctc tggtgacacc 3360
ctggcgcgta ttccgcagga atccggcggt accaaggaca tcaccggtgg tctgccgcgc 3420
gttgcggacc tgttcgaagc acgtcgtccg aaagagccgg caatcctggc tgaaatcagc 3480
ggtatcgttt ccttcggtaa agaaaccaaa ggtaaacgtc gtctggttat caccccggta 3540
gacggtagcg atccgtacga agagatgatt ccgaaatggc gtcagctcaa cgtgttcgaa 3600
ggtgaacgtg tagaacgtgg tgacgtaatt tccgacggtc cggaagcgcc gcacgacatt 3660
ctgcgtctgc gtggtgttca tgctgttact cgttacatcg ttaacgaagt acaggacgta 3720
taccgtctgc agggcgttaa gattaacgat aaacacatcg aagttatcgt tcgtcagatg 3780
ctgcgtaaag ctaccatcgt taacgcgggt agctccgact tcctggaagg cgaacaggtt 3840
gaatactctc gcgtcaagat cgcaaaccgc gaactggaag cgaacggcaa agtgggtgca 3900
acttactccc gcgatctgct gggtatcacc aaagcgtctc tggcaaccga gtccttcatc 3960
tccgcggcat cgttccagga gaccactcgc gtgctgaccg aagcagccgt tgcgggcaaa 4020
cgcgacgaac tgcgcggcct gaaagagaac gttatcgtgg gtcgtctgat cccggcaggt 4080
accggttacg cgtaccacca ggatcgtatg cgtcgccgtg ctgcgggtga agctccggct 4140
gcaccgcagg tgactgcaga agacgcatct gccagcctgg cagaactgct gaacgcaggt 4200
ctgggcggtt ctgataacga gtaa 4224
<210> 2
<211> 1407
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC
<400> 2
Val Lys Asp Leu Leu Lys Phe Leu Lys Ala Gln Thr Lys Thr Glu Glu
1 5 10 15
Phe Asp Ala Ile Lys Ile Ala Leu Ala Ser Pro Asp Met Ile Arg Ser
20 25 30
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Cys Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Arg Ile Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Val Thr Gln Thr Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ser Pro Thr Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Leu
115 120 125
Asp Met Pro Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Ser Tyr
130 135 140
Val Val Ile Glu Gly Gly Met Thr Asn Leu Glu Arg Gln Gln Ile Leu
145 150 155 160
Thr Glu Glu Gln Tyr Leu Asp Ala Leu Glu Glu Phe Gly Asp Glu Phe
165 170 175
Asp Ala Lys Met Gly Ala Glu Ala Ile Gln Ala Leu Leu Lys Ser Met
180 185 190
Asp Leu Glu Gln Glu Cys Glu Gln Leu Arg Glu Glu Leu Asn Glu Thr
195 200 205
Asn Ser Glu Thr Lys Arg Lys Lys Leu Thr Lys Arg Ile Lys Leu Leu
210 215 220
Glu Ala Phe Val Gln Ser Gly Asn Lys Pro Glu Trp Met Ile Leu Thr
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Asp Leu Ala Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ser Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Thr Val Gly Pro Tyr Leu Arg Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Tyr Gly Lys
370 375 380
Leu Glu Leu Arg Gly Leu Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Glu Ala Val Val Trp Asp Ile Leu Asp Glu Val Ile
405 410 415
Arg Glu His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Ala Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Asn Gly
485 490 495
Glu Pro Ile Ile Val Pro Ser Gln Asp Val Val Leu Gly Leu Tyr Tyr
500 505 510
Met Thr Arg Asp Cys Val Asn Ala Lys Gly Glu Gly Met Val Leu Thr
515 520 525
Gly Pro Lys Glu Ala Glu Arg Leu Tyr Arg Ser Gly Leu Ala Ser Leu
530 535 540
His Ala Arg Val Lys Val Arg Ile Thr Glu Tyr Glu Lys Asp Ala Asn
545 550 555 560
Gly Glu Leu Val Ala Lys Thr Ser Leu Lys Asp Thr Thr Val Gly Arg
565 570 575
Ala Ile Leu Trp Met Ile Val Pro Lys Gly Leu Pro Tyr Ser Ile Val
580 585 590
Asn Gln Ala Leu Gly Lys Lys Ala Ile Ser Lys Met Leu Asn Thr Cys
595 600 605
Tyr Arg Ile Leu Gly Leu Lys Pro Thr Val Ile Phe Ala Asp Gln Ile
610 615 620
Met Tyr Thr Gly Phe Ala Tyr Ala Ala Arg Ser Gly Ala Ser Val Gly
625 630 635 640
Ile Asp Asp Met Val Ile Pro Glu Lys Lys His Glu Ile Ile Ser Glu
645 650 655
Ala Glu Ala Glu Val Ala Glu Ile Gln Glu Gln Phe Gln Ser Gly Leu
660 665 670
Val Thr Ala Gly Glu Arg Tyr Asn Lys Val Ile Asp Ile Trp Ala Ala
675 680 685
Ala Asn Asp Arg Val Ser Lys Ala Met Met Asp Asn Leu Gln Thr Glu
690 695 700
Thr Val Ile Asn Arg Asp Gly Gln Glu Glu Lys Gln Val Ser Phe Asn
705 710 715 720
Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln
725 730 735
Ile Arg Gln Leu Ala Gly Met Arg Gly Leu Met Ala Lys Pro Asp Gly
740 745 750
Ser Ile Ile Glu Thr Pro Ile Thr Ala Asn Phe Arg Glu Gly Leu Asn
755 760 765
Val Leu Gln Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Leu Val Val Thr Glu Asp Asp Cys Gly Thr
805 810 815
His Glu Gly Ile Met Met Thr Pro Val Ile Glu Gly Gly Asp Val Lys
820 825 830
Glu Pro Leu Arg Asp Arg Val Leu Gly Arg Val Thr Ala Glu Asp Val
835 840 845
Leu Lys Pro Gly Thr Ala Asp Ile Leu Val Pro Arg Asn Thr Leu Leu
850 855 860
His Glu Gln Trp Cys Asp Leu Leu Glu Glu Asn Ser Val Asp Ala Val
865 870 875 880
Lys Val Arg Ser Val Val Ser Cys Asp Thr Asp Phe Gly Val Cys Ala
885 890 895
His Cys Tyr Gly Arg Asp Leu Ala Arg Gly His Ile Ile Asn Lys Gly
900 905 910
Glu Ala Ile Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
915 920 925
Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser Arg Ala
930 935 940
Ala Ala Glu Ser Ser Ile Gln Val Lys Asn Lys Gly Ser Ile Lys Leu
945 950 955 960
Ser Asn Val Lys Ser Val Val Asn Ser Ser Gly Lys Leu Val Ile Thr
965 970 975
Ser Arg Asn Thr Glu Leu Lys Leu Ile Asp Glu Phe Gly Arg Thr Lys
980 985 990
Glu Ser Tyr Lys Val Pro Tyr Gly Ala Val Leu Ala Lys Gly Asp Gly
995 1000 1005
Glu Gln Val Ala Gly Gly Glu Thr Val Ala Asn Trp Asp Pro His Thr
1010 1015 1020
Met Pro Val Ile Thr Glu Val Ser Gly Phe Val Arg Phe Thr Asp Met
1025 1030 1035 1040
Ile Asp Gly Gln Thr Ile Thr Arg Gln Thr Asp Glu Leu Thr Gly Leu
1045 1050 1055
Ser Ser Leu Val Val Leu Asp Ser Ala Glu Arg Thr Ala Gly Gly Lys
1060 1065 1070
Asp Leu Arg Pro Ala Leu Lys Ile Val Asp Ala Gln Gly Asn Asp Val
1075 1080 1085
Leu Ile Pro Gly Thr Asp Met Pro Ala Gln Tyr Phe Leu Pro Gly Lys
1090 1095 1100
Ala Ile Val Gln Leu Glu Asp Gly Val Gln Ile Ser Ser Gly Asp Thr
1105 1110 1115 1120
Leu Ala Arg Ile Pro Gln Glu Ser Gly Gly Thr Lys Asp Ile Thr Gly
1125 1130 1135
Gly Leu Pro Arg Val Ala Asp Leu Phe Glu Ala Arg Arg Pro Lys Glu
1140 1145 1150
Pro Ala Ile Leu Ala Glu Ile Ser Gly Ile Val Ser Phe Gly Lys Glu
1155 1160 1165
Thr Lys Gly Lys Arg Arg Leu Val Ile Thr Pro Val Asp Gly Ser Asp
1170 1175 1180
Pro Tyr Glu Glu Met Ile Pro Lys Trp Arg Gln Leu Asn Val Phe Glu
1185 1190 1195 1200
Gly Glu Arg Val Glu Arg Gly Asp Val Ile Ser Asp Gly Pro Glu Ala
1205 1210 1215
Pro His Asp Ile Leu Arg Leu Arg Gly Val His Ala Val Thr Arg Tyr
1220 1225 1230
Ile Val Asn Glu Val Gln Asp Val Tyr Arg Leu Gln Gly Val Lys Ile
1235 1240 1245
Asn Asp Lys His Ile Glu Val Ile Val Arg Gln Met Leu Arg Lys Ala
1250 1255 1260
Thr Ile Val Asn Ala Gly Ser Ser Asp Phe Leu Glu Gly Glu Gln Val
1265 1270 1275 1280
Glu Tyr Ser Arg Val Lys Ile Ala Asn Arg Glu Leu Glu Ala Asn Gly
1285 1290 1295
Lys Val Gly Ala Thr Tyr Ser Arg Asp Leu Leu Gly Ile Thr Lys Ala
1300 1305 1310
Ser Leu Ala Thr Glu Ser Phe Ile Ser Ala Ala Ser Phe Gln Glu Thr
1315 1320 1325
Thr Arg Val Leu Thr Glu Ala Ala Val Ala Gly Lys Arg Asp Glu Leu
1330 1335 1340
Arg Gly Leu Lys Glu Asn Val Ile Val Gly Arg Leu Ile Pro Ala Gly
1345 1350 1355 1360
Thr Gly Tyr Ala Tyr His Gln Asp Arg Met Arg Arg Arg Ala Ala Gly
1365 1370 1375
Glu Ala Pro Ala Ala Pro Gln Val Thr Ala Glu Asp Ala Ser Ala Ser
1380 1385 1390
Leu Ala Glu Leu Leu Asn Ala Gly Leu Gly Gly Ser Asp Asn Glu
1395 1400 1405
<210> 3
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct rpoC primer
<400> 3
ctgcaggaat tcgatcggtt cttacagcct ggttac 36
<210> 4
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct rpoC primer
<400> 4
gaacgtacag tagttgatgg 20
<210> 5
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct rpoC primer
<400> 5
ccatcaacta ctgtacgttc 20
<210> 6
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct rpoC primer
<400> 6
gtcgactagc gtgatcttgg tttcggagtt ggtttc 36
<210> 7
<211> 4224
<212> DNA
<213> Escherichia coli
<400> 7
gtgaaagatt tattaaagtt tctgaaagcg cagactaaaa ccgaagagtt tgatgcgatc 60
aaaattgctc tggcttcgcc agacatgatc cgttcatggt ctttcggtga agttaaaaag 120
ccggaaacca tcaactaccg tacgttcaaa ccagaacgtg acggcctttt ctgcgcccgt 180
atctttgggc cggtaaaaga ttacgagtgc ctgtgcggta agtacaagcg cctgaaacac 240
cgtggcgtca tctgtgagaa gtgcggcgtt gaagtgaccc agactaaagt acgccgtgag 300
cgtatgggcc acatcgaact ggcttccccg actgcgcaca tctggttcct gaaatcgctg 360
ccgtcccgta tcggtctgct gctcgatatg ccgctgcgcg atatcgaacg cgtactgtac 420
tttgaatcct atgtggttat cgaaggcggt atgaccaacc tggaacgtca gcagatcctg 480
actgaagagc agtatctgga cgcgctggaa gagttcggtg acgaattcga cgcgaagatg 540
ggggcggaag caatccaggc tctgctgaag agcatggatc tggagcaaga gtgcgaacag 600
ctgcgtgaag agctgaacga aaccaactcc gaaaccaagc gtaaaaagct gaccaagcgt 660
atcaaactgc tggaagcgtt cgttcagtct ggtaacaaac cagagtggat gatcctgacc 720
gttctgccgg tactgccgcc agatctgcgt ccgctggttc cgctggatgg tggtcgtttc 780
gcgacttctg acctgaacga tctgtatcgt cgcgtcatta accgtaacaa ccgtctgaaa 840
cgtctgctgg atctggctgc gccggacatc atcgtacgta acgaaaaacg tatgctgcag 900
gaagcggtag acgccctgct ggataacggt cgtcgcggtc gtgcgatcac cggttctaac 960
aagcgtcctc tgaaatcttt ggccgacatg atcaaaggta aacagggtcg tttccgtcag 1020
aacctgctcg gtaagcgtgt tgactactcc ggtcgttctg taatcaccgt aggtccatac 1080
ctgcgtctgc atcagtgcgg tctgccgaag aaaatggcac tggagctgtt caaaccgttc 1140
atctacggca agctggaact gcgtggtctt gctaccacca ttaaagctgc gaagaaaatg 1200
gttgagcgcg aagaagctgt cgtttgggat atcctggacg aagttatccg cgaacacccg 1260
gtactgctga accgtgcacc gactctgcac cgtctgggta tccaggcatt tgaaccggta 1320
ctgatcgaag gtaaagctat ccagctgcac ccgctggttt gtgcggcata taacgccgac 1380
ttcgatggtg accagatggc tgttcacgta ccgctgacgc tggaagccca gctggaagcg 1440
cgtgcgctga tgatgtctac caacaacatc ctgtccccgg cgaacggcga accaatcatc 1500
gttccgtctc aggacgttgt actgggtctg tactacatga cccgtgactg tgttaacgcc 1560
aaaggcgaag gcatggtgct gactggcccg aaagaagcag aacgtctgta tcgctctggt 1620
ctggcttctc tgcatgcgcg cgttaaagtg cgtatcaccg agtatgaaaa agatgctaac 1680
ggtgaattag tagcgaaaac cagcctgaaa gacacgactg ttggccgtgc cattctgtgg 1740
atgattgtac cgaaaggtct gccttactcc atcgtcaacc aggcgctggg taaaaaagca 1800
atctccaaaa tgctgaacac ctgctaccgc attctcggtc tgaaaccgac cgttattttt 1860
gcggaccaga tcatgtacac cggcttcgcc tatgcagcgc gttctggtgc atctgttggt 1920
atcgatgaca tggtcatccc ggagaagaaa cacgaaatca tctccgaggc agaagcagaa 1980
gttgctgaaa ttcaggagca gttccagtct ggtctggtaa ctgcgggcga acgctacaac 2040
aaagttatcg atatctgggc tgcggcgaac gatcgtgtat ccaaagcgat gatggataac 2100
ctgcaaactg aaaccgtgat taaccgtgac ggtcaggaag agaagcaggt ttccttcaac 2160
agcatctaca tgatggccga ctccggtgcg cgtggttctg cggcacagat tcgtcagctt 2220
gctggtatgc gtggtctgat ggcgaagccg gatggctcca tcatcgaaac gccaatcacc 2280
gcgaacttcc gtgaaggtct gaacgtactc cagtacttca tctccaccca cggtgctcgt 2340
aaaggtctgg cggataccgc actgaaaact gcgaactccg gttacctgac tcgtcgtctg 2400
gttgacgtgg cgcaggacct ggtggttacc gaagacgatt gtggtaccca tgaaggtatc 2460
atgatgactc cggttatcga gggtggtgac gttaaagagc cgctgcgcga tcgcgtactg 2520
ggtcgtgtaa ctgctgaaga cgttctgaag ccgggtactg ctgatatcct cgttccgcgc 2580
aacacgctgc tgcacgaaca gtggtgtgac ctgctggaag agaactctgt cgacgcggtt 2640
aaagtacgtt ctgttgtatc ttgtgacacc gactttggtg tatgtgcgca ctgctacggt 2700
cgtgacctgg cgcgtggcca catcatcaac aagggtgaag caatcggtgt tatcgcggca 2760
cagtccatcg gtgaaccggg tacacagctg accatgcgta cgttccacat cggtggtgcg 2820
gcatctcgtg cggctgctga atccagcatc caagtgaaaa acaaaggtag catcaagctc 2880
agcaacgtga agtcggttgt gaactccagc ggtaaactgg ttatcacttc ccgtaatact 2940
gaactgaaac tgatcgacga attcggtcgt actaaagaaa gctacaaagt accttacggt 3000
gcggtactgg cgaaaggcga tggcgaacag gttgctggcg gcgaaaccgt tgcaaactgg 3060
gacccgcaca ccatgccggt tatcaccgaa gtaagcggtt ttgtacgctt tactgacatg 3120
atcgacggcc agaccattac gcgtcagacc gacgaactga ccggtctgtc ttcgctggtg 3180
gttctggatt ccgcagaacg taccgcaggt ggtaaagatc tgcgtccggc actgaaaatc 3240
gttgatgctc agggtaacga cgttctgatc ccaggtaccg atatgccagc gcagtacttc 3300
ctgccgggta aagcgattgt tcagctggaa gatggcgtac agatcagctc tggtgacacc 3360
ctggcgcgta ttccgcagga atccggcggt accaaggaca tcaccggtgg tctgccgcgc 3420
gttgcggacc tgttcgaagc acgtcgtccg aaagagccgg caatcctggc tgaaatcagc 3480
ggtatcgttt ccttcggtaa agaaaccaaa ggtaaacgtc gtctggttat caccccggta 3540
gacggtagcg atccgtacga agagatgatt ccgaaatggc gtcagctcaa cgtgttcgaa 3600
ggtgaacgtg tagaacgtgg tgacgtaatt tccgacggtc cggaagcgcc gcacgacatt 3660
ctgcgtctgc gtggtgttca tgctgttact cgttacatcg ttaacgaagt acaggacgta 3720
taccgtctgc agggcgttaa gattaacgat aaacacatcg aagttatcgt tcgtcagatg 3780
ctgcgtaaag ctaccatcgt taacgcgggt agctccgact tcctggaagg cgaacaggtt 3840
gaatactctc gcgtcaagat cgcaaaccgc gaactggaag cgaacggcaa agtgggtgca 3900
acttactccc gcgatctgct gggtatcacc aaagcgtctc tggcaaccga gtccttcatc 3960
tccgcggcat cgttccagga gaccactcgc gtgctgaccg aagcagccgt tgcgggcaaa 4020
cgcgacgaac tgcgcggcct gaaagagaac gttatcgtgg gtcgtctgat cccggcaggt 4080
accggttacg cgtaccacca ggatcgtatg cgtcgccgtg ctgcgggtga agctccggct 4140
gcaccgcagg tgactgcaga agacgcatct gccagcctgg cagaactgct gaacgcaggt 4200
ctgggcggtt ctgataacga gtaa 4224
<210> 8
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct pJSL47 primer
<400> 8
gagcgtccgg taaccgttgg 20
<210> 9
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct pJSL47 primer
<400> 9
gtcaggatca tccactctgg 20
<210> 10
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIC primer
<400> 10
cgggtgggat tgaatcagat 20
<210> 11
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIC primer
<400> 11
tacgtttgcc atgcgctatt a 21
<210> 12
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct lacZ primer
<400> 12
ccttactgcc gcctgttttg 20
<210> 13
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct lacZ primer
<400> 13
ccactggtgt gggccataat 20
<210> 14
<211> 2459
<212> DNA
<213> Escherichia coli
<400> 14
atgtcgcaga cagaaaatgc agtgacttcc tcatcaggta acaagcgtgc ataccggaaa 60
ggtaaccctg ttccggccag agagaggcaa agggcttctc tagctcgcag aagcaacact 120
cataaggctt ttcatgcggt tatccaggcc cggttaaaag acaggctgag tgaactggca 180
gatgaggaag gtattaccca ggcgcagatg cttgaaaaac tgattgaatc agagctgaaa 240
cgcagagcga ctttgtaaat attcacattc ttgcttatct caggcgtgag tgatagattg 300
ctgatcgttt aaggaatttt gtggctggcc acgccataag gtggcaggga actggttctg 360
atgtggattt acaggagcca gaaaagcaaa aaccccgata atcttcatct agttttgcga 420
cgaggagaag attaccggga tccacttaaa ccgtatagcc aacaattcag ctatgcgggg 480
agtatagtta tatgcccgga aaagttcaag acttctttct gtgctcactc cttctgtgca 540
ttgtaagtgc aggatggtgt ggctaatcat gaaacacatt cagtaatagc gggtgggatt 600
gaatcagatc ttcacattga ttccagcaag tatcctcacc cgttttgcag ccttctccag 660
aaaagggctc attttgactc cttcaagcat ctgatcttca tcagaggttt gcttgtaata 720
gcgcatggca aacgtaaaaa taaaatcagc gcgtcgatgg ttagttttta tgtttccctc 780
gtacaagtaa tgtgcgcaca ctacatccct gatacgaaca aagttaactt atctgttaaa 840
gagctttcag tttgtagtgg aggttcatac actcgtgttt ggagagccct caaaactctt 900
gataatgatc ttcatctcat cgcttttgat ggaggaacca tctggttcag accagatatg 960
ttcgaaactt tacgtgtcgg cccagacgag ctagttgccg cccgtaggag ggggaatagt 1020
gttggagggg gacatggctg atctccttca aaaatactat tcacaggtta aaaacccgaa 1080
tccggtgttc acaccccgtg aaggtgccgg aacgctgaag ttctgcgaaa aactgatgga 1140
aaaggcggtg ggcttcacct cccgttttga tttcgccatt catgtggcac atgctcgttc 1200
gaagggactg cgtcggcgca tgccaccggt actgcgtcgc cgggctattg atgcgctgct 1260
gcaggggctg tgttttcact atgacccgct ggccaaccgc gtccagtgct ccatcactac 1320
gctggccatt gagtgcggac tggcgacgga gtctgctgcc ggaaaactct ccatcacccg 1380
ggccacccga gccttgacgt tccttgcaga gctgggactg attacctacc agacggaata 1440
tgatccgctt atcgggtgct acattccgac cgatatcacg ttcacaccgg cgctatttgc 1500
cgcccttgat gtgtctgagg atgcagtggt tgctgcgcgc cgcagtcgtg ttgaatggga 1560
aaacagacag cgcaaaaagc agggactgga taccctgggt atggatgaac tgatagcgaa 1620
agcctggcgt tttgtgcgtg agcgtttccg cagttaccag acagagctta agtcccgtgg 1680
aataaagcgt gcccgtgcgc gtcgtgatgc gaacagggaa cgtcaggata tcgtcaccct 1740
ggtgaaacgg cagctgacgc gtgaaatctc ggaagggcgc ttcactgcca atcgtgaggc 1800
ggtaaaacgc gaagtggagc gtcgtgtgaa agatcgcatg attctgtcac gtaaccgtaa 1860
ttacagccgg ctggccacag cttccccctg aaagtgacct cctctgaata atccggcccg 1920
caccggaggc atctgcacgc ctgaagcctg tcggcgaaca aaaaaacagc accgcataca 1980
aaaaacaacc tcatcatcca ccttcaggtg catccggtcc ctcctgtttt tgatacaaaa 2040
cacgcctcac agacgaggaa ttttgcttat ccacatttaa ctgcaaggga cttccccata 2100
aggttacaac cgttcatgtc ataaagcgcc atccgccagc cttacagggt gcaatgtatc 2160
ttttaaacac ctgtttatat ctcctttaaa ctacttaact acattcattt aaaaagaaaa 2220
cctattcact gcctgtcctg tggacagaca ggtatgcacc tcccaccgca agcggcgggc 2280
cccgaccgga gccactttaa ttacaacact cagatacaac caccagaaaa accccggtcc 2340
cgcgcagaac tgaaaccaca aagccccccc tcataactga aaagcggccc cgccccggcc 2400
caaagggccg gaacagagtc gcttttaatt atgaatgttg taactacaca gcatcatcg 2459
<210> 15
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIC primer
<400> 15
ttccggaagt gtgaggcggc cgcacttgtg tataa 35
<210> 16
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIC primer
<400> 16
gtcgacaagc tttacgcggc cagatctgat caaga 35
<210> 17
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct kanamycin primer
<400> 17
atcagatctg gccgcgtaaa gcttgtcgac gaatt 35
<210> 18
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct kanamycin primer
<400> 18
cacaagtgcg gccgcctcac acttccggaa agcgg 35
<210> 19
<211> 98
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIC sequence
<400> 19
aagcaaaaac cccgataatc ttcatctagt tttgcgacaa ggagaagatt accgggatcc 60
acttaaaccg tatagccaac aattcagcta tgcgggga 98
<210> 20
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIC primer
<400> 20
tcttctcctt gtcgcaaaac 20
<210> 21
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIC primer
<400> 21
gttttgcgac aaggagaaga 20
<210> 22
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct kanamycin primer
<400> 22
gcagccgatt gtctgttgtg 20
<210> 23
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct kanamycin primer
<400> 23
atggattgca cgcaggttct 20
<210> 24
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIA primer
<400> 24
atggctcagg catcgtctct 20
<210> 25
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic construct repFIA primer
<400> 25
agaggcgcat tggagttctg 20
<210> 26
<211> 1407
<212> PRT
<213> Escherichia coli
<400> 26
Met Lys Asp Leu Leu Lys Phe Leu Lys Ala Gln Thr Lys Thr Glu Glu
1 5 10 15
Phe Asp Ala Ile Lys Ile Ala Leu Ala Ser Pro Asp Met Ile Arg Ser
20 25 30
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Arg Ile Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Val Thr Gln Thr Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ser Pro Thr Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Leu
115 120 125
Asp Met Pro Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Ser Tyr
130 135 140
Val Val Ile Glu Gly Gly Met Thr Asn Leu Glu Arg Gln Gln Ile Leu
145 150 155 160
Thr Glu Glu Gln Tyr Leu Asp Ala Leu Glu Glu Phe Gly Asp Glu Phe
165 170 175
Asp Ala Lys Met Gly Ala Glu Ala Ile Gln Ala Leu Leu Lys Ser Met
180 185 190
Asp Leu Glu Gln Glu Cys Glu Gln Leu Arg Glu Glu Leu Asn Glu Thr
195 200 205
Asn Ser Glu Thr Lys Arg Lys Lys Leu Thr Lys Arg Ile Lys Leu Leu
210 215 220
Glu Ala Phe Val Gln Ser Gly Asn Lys Pro Glu Trp Met Ile Leu Thr
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Asp Leu Ala Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ser Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Thr Val Gly Pro Tyr Leu Arg Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Tyr Gly Lys
370 375 380
Leu Glu Leu Arg Gly Leu Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Glu Ala Val Val Trp Asp Ile Leu Asp Glu Val Ile
405 410 415
Arg Glu His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Ala Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Asn Gly
485 490 495
Glu Pro Ile Ile Val Pro Ser Gln Asp Val Val Leu Gly Leu Tyr Tyr
500 505 510
Met Thr Arg Asp Cys Val Asn Ala Lys Gly Glu Gly Met Val Leu Thr
515 520 525
Gly Pro Lys Glu Ala Glu Arg Leu Tyr Arg Ser Gly Leu Ala Ser Leu
530 535 540
His Ala Arg Val Lys Val Arg Ile Thr Glu Tyr Glu Lys Asp Ala Asn
545 550 555 560
Gly Glu Leu Val Ala Lys Thr Ser Leu Lys Asp Thr Thr Val Gly Arg
565 570 575
Ala Ile Leu Trp Met Ile Val Pro Lys Gly Leu Pro Tyr Ser Ile Val
580 585 590
Asn Gln Ala Leu Gly Lys Lys Ala Ile Ser Lys Met Leu Asn Thr Cys
595 600 605
Tyr Arg Ile Leu Gly Leu Lys Pro Thr Val Ile Phe Ala Asp Gln Ile
610 615 620
Met Tyr Thr Gly Phe Ala Tyr Ala Ala Arg Ser Gly Ala Ser Val Gly
625 630 635 640
Ile Asp Asp Met Val Ile Pro Glu Lys Lys His Glu Ile Ile Ser Glu
645 650 655
Ala Glu Ala Glu Val Ala Glu Ile Gln Glu Gln Phe Gln Ser Gly Leu
660 665 670
Val Thr Ala Gly Glu Arg Tyr Asn Lys Val Ile Asp Ile Trp Ala Ala
675 680 685
Ala Asn Asp Arg Val Ser Lys Ala Met Met Asp Asn Leu Gln Thr Glu
690 695 700
Thr Val Ile Asn Arg Asp Gly Gln Glu Glu Lys Gln Val Ser Phe Asn
705 710 715 720
Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln
725 730 735
Ile Arg Gln Leu Ala Gly Met Arg Gly Leu Met Ala Lys Pro Asp Gly
740 745 750
Ser Ile Ile Glu Thr Pro Ile Thr Ala Asn Phe Arg Glu Gly Leu Asn
755 760 765
Val Leu Gln Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Leu Val Val Thr Glu Asp Asp Cys Gly Thr
805 810 815
His Glu Gly Ile Met Met Thr Pro Val Ile Glu Gly Gly Asp Val Lys
820 825 830
Glu Pro Leu Arg Asp Arg Val Leu Gly Arg Val Thr Ala Glu Asp Val
835 840 845
Leu Lys Pro Gly Thr Ala Asp Ile Leu Val Pro Arg Asn Thr Leu Leu
850 855 860
His Glu Gln Trp Cys Asp Leu Leu Glu Glu Asn Ser Val Asp Ala Val
865 870 875 880
Lys Val Arg Ser Val Val Ser Cys Asp Thr Asp Phe Gly Val Cys Ala
885 890 895
His Cys Tyr Gly Arg Asp Leu Ala Arg Gly His Ile Ile Asn Lys Gly
900 905 910
Glu Ala Ile Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
915 920 925
Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser Arg Ala
930 935 940
Ala Ala Glu Ser Ser Ile Gln Val Lys Asn Lys Gly Ser Ile Lys Leu
945 950 955 960
Ser Asn Val Lys Ser Val Val Asn Ser Ser Gly Lys Leu Val Ile Thr
965 970 975
Ser Arg Asn Thr Glu Leu Lys Leu Ile Asp Glu Phe Gly Arg Thr Lys
980 985 990
Glu Ser Tyr Lys Val Pro Tyr Gly Ala Val Leu Ala Lys Gly Asp Gly
995 1000 1005
Glu Gln Val Ala Gly Gly Glu Thr Val Ala Asn Trp Asp Pro His Thr
1010 1015 1020
Met Pro Val Ile Thr Glu Val Ser Gly Phe Val Arg Phe Thr Asp Met
1025 1030 1035 1040
Ile Asp Gly Gln Thr Ile Thr Arg Gln Thr Asp Glu Leu Thr Gly Leu
1045 1050 1055
Ser Ser Leu Val Val Leu Asp Ser Ala Glu Arg Thr Ala Gly Gly Lys
1060 1065 1070
Asp Leu Arg Pro Ala Leu Lys Ile Val Asp Ala Gln Gly Asn Asp Val
1075 1080 1085
Leu Ile Pro Gly Thr Asp Met Pro Ala Gln Tyr Phe Leu Pro Gly Lys
1090 1095 1100
Ala Ile Val Gln Leu Glu Asp Gly Val Gln Ile Ser Ser Gly Asp Thr
1105 1110 1115 1120
Leu Ala Arg Ile Pro Gln Glu Ser Gly Gly Thr Lys Asp Ile Thr Gly
1125 1130 1135
Gly Leu Pro Arg Val Ala Asp Leu Phe Glu Ala Arg Arg Pro Lys Glu
1140 1145 1150
Pro Ala Ile Leu Ala Glu Ile Ser Gly Ile Val Ser Phe Gly Lys Glu
1155 1160 1165
Thr Lys Gly Lys Arg Arg Leu Val Ile Thr Pro Val Asp Gly Ser Asp
1170 1175 1180
Pro Tyr Glu Glu Met Ile Pro Lys Trp Arg Gln Leu Asn Val Phe Glu
1185 1190 1195 1200
Gly Glu Arg Val Glu Arg Gly Asp Val Ile Ser Asp Gly Pro Glu Ala
1205 1210 1215
Pro His Asp Ile Leu Arg Leu Arg Gly Val His Ala Val Thr Arg Tyr
1220 1225 1230
Ile Val Asn Glu Val Gln Asp Val Tyr Arg Leu Gln Gly Val Lys Ile
1235 1240 1245
Asn Asp Lys His Ile Glu Val Ile Val Arg Gln Met Leu Arg Lys Ala
1250 1255 1260
Thr Ile Val Asn Ala Gly Ser Ser Asp Phe Leu Glu Gly Glu Gln Val
1265 1270 1275 1280
Glu Tyr Ser Arg Val Lys Ile Ala Asn Arg Glu Leu Glu Ala Asn Gly
1285 1290 1295
Lys Val Gly Ala Thr Tyr Ser Arg Asp Leu Leu Gly Ile Thr Lys Ala
1300 1305 1310
Ser Leu Ala Thr Glu Ser Phe Ile Ser Ala Ala Ser Phe Gln Glu Thr
1315 1320 1325
Thr Arg Val Leu Thr Glu Ala Ala Val Ala Gly Lys Arg Asp Glu Leu
1330 1335 1340
Arg Gly Leu Lys Glu Asn Val Ile Val Gly Arg Leu Ile Pro Ala Gly
1345 1350 1355 1360
Thr Gly Tyr Ala Tyr His Gln Asp Arg Met Arg Arg Arg Ala Ala Gly
1365 1370 1375
Glu Ala Pro Ala Ala Pro Gln Val Thr Ala Glu Asp Ala Ser Ala Ser
1380 1385 1390
Leu Ala Glu Leu Leu Asn Ala Gly Leu Gly Gly Ser Asp Asn Glu
1395 1400 1405
<210> 27
<211> 1407
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC
<400> 27
Met Lys Asp Leu Leu Lys Phe Leu Lys Ala Gln Thr Lys Thr Glu Glu
1 5 10 15
Phe Asp Ala Ile Lys Ile Ala Leu Ala Ser Pro Asp Met Ile Arg Ser
20 25 30
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Cys Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Arg Ile Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Val Thr Gln Thr Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ser Pro Thr Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Leu
115 120 125
Asp Met Pro Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Ser Tyr
130 135 140
Val Val Ile Glu Gly Gly Met Thr Asn Leu Glu Arg Gln Gln Ile Leu
145 150 155 160
Thr Glu Glu Gln Tyr Leu Asp Ala Leu Glu Glu Phe Gly Asp Glu Phe
165 170 175
Asp Ala Lys Met Gly Ala Glu Ala Ile Gln Ala Leu Leu Lys Ser Met
180 185 190
Asp Leu Glu Gln Glu Cys Glu Gln Leu Arg Glu Glu Leu Asn Glu Thr
195 200 205
Asn Ser Glu Thr Lys Arg Lys Lys Leu Thr Lys Arg Ile Lys Leu Leu
210 215 220
Glu Ala Phe Val Gln Ser Gly Asn Lys Pro Glu Trp Met Ile Leu Thr
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Asp Leu Ala Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ser Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Thr Val Gly Pro Tyr Leu Arg Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Tyr Gly Lys
370 375 380
Leu Glu Leu Arg Gly Leu Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Glu Ala Val Val Trp Asp Ile Leu Asp Glu Val Ile
405 410 415
Arg Glu His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Ala Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Asn Gly
485 490 495
Glu Pro Ile Ile Val Pro Ser Gln Asp Val Val Leu Gly Leu Tyr Tyr
500 505 510
Met Thr Arg Asp Cys Val Asn Ala Lys Gly Glu Gly Met Val Leu Thr
515 520 525
Gly Pro Lys Glu Ala Glu Arg Leu Tyr Arg Ser Gly Leu Ala Ser Leu
530 535 540
His Ala Arg Val Lys Val Arg Ile Thr Glu Tyr Glu Lys Asp Ala Asn
545 550 555 560
Gly Glu Leu Val Ala Lys Thr Ser Leu Lys Asp Thr Thr Val Gly Arg
565 570 575
Ala Ile Leu Trp Met Ile Val Pro Lys Gly Leu Pro Tyr Ser Ile Val
580 585 590
Asn Gln Ala Leu Gly Lys Lys Ala Ile Ser Lys Met Leu Asn Thr Cys
595 600 605
Tyr Arg Ile Leu Gly Leu Lys Pro Thr Val Ile Phe Ala Asp Gln Ile
610 615 620
Met Tyr Thr Gly Phe Ala Tyr Ala Ala Arg Ser Gly Ala Ser Val Gly
625 630 635 640
Ile Asp Asp Met Val Ile Pro Glu Lys Lys His Glu Ile Ile Ser Glu
645 650 655
Ala Glu Ala Glu Val Ala Glu Ile Gln Glu Gln Phe Gln Ser Gly Leu
660 665 670
Val Thr Ala Gly Glu Arg Tyr Asn Lys Val Ile Asp Ile Trp Ala Ala
675 680 685
Ala Asn Asp Arg Val Ser Lys Ala Met Met Asp Asn Leu Gln Thr Glu
690 695 700
Thr Val Ile Asn Arg Asp Gly Gln Glu Glu Lys Gln Val Ser Phe Asn
705 710 715 720
Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln
725 730 735
Ile Arg Gln Leu Ala Gly Met Arg Gly Leu Met Ala Lys Pro Asp Gly
740 745 750
Ser Ile Ile Glu Thr Pro Ile Thr Ala Asn Phe Arg Glu Gly Leu Asn
755 760 765
Val Leu Gln Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Leu Val Val Thr Glu Asp Asp Cys Gly Thr
805 810 815
His Glu Gly Ile Met Met Thr Pro Val Ile Glu Gly Gly Asp Val Lys
820 825 830
Glu Pro Leu Arg Asp Arg Val Leu Gly Arg Val Thr Ala Glu Asp Val
835 840 845
Leu Lys Pro Gly Thr Ala Asp Ile Leu Val Pro Arg Asn Thr Leu Leu
850 855 860
His Glu Gln Trp Cys Asp Leu Leu Glu Glu Asn Ser Val Asp Ala Val
865 870 875 880
Lys Val Arg Ser Val Val Ser Cys Asp Thr Asp Phe Gly Val Cys Ala
885 890 895
His Cys Tyr Gly Arg Asp Leu Ala Arg Gly His Ile Ile Asn Lys Gly
900 905 910
Glu Ala Ile Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
915 920 925
Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser Arg Ala
930 935 940
Ala Ala Glu Ser Ser Ile Gln Val Lys Asn Lys Gly Ser Ile Lys Leu
945 950 955 960
Ser Asn Val Lys Ser Val Val Asn Ser Ser Gly Lys Leu Val Ile Thr
965 970 975
Ser Arg Asn Thr Glu Leu Lys Leu Ile Asp Glu Phe Gly Arg Thr Lys
980 985 990
Glu Ser Tyr Lys Val Pro Tyr Gly Ala Val Leu Ala Lys Gly Asp Gly
995 1000 1005
Glu Gln Val Ala Gly Gly Glu Thr Val Ala Asn Trp Asp Pro His Thr
1010 1015 1020
Met Pro Val Ile Thr Glu Val Ser Gly Phe Val Arg Phe Thr Asp Met
1025 1030 1035 1040
Ile Asp Gly Gln Thr Ile Thr Arg Gln Thr Asp Glu Leu Thr Gly Leu
1045 1050 1055
Ser Ser Leu Val Val Leu Asp Ser Ala Glu Arg Thr Ala Gly Gly Lys
1060 1065 1070
Asp Leu Arg Pro Ala Leu Lys Ile Val Asp Ala Gln Gly Asn Asp Val
1075 1080 1085
Leu Ile Pro Gly Thr Asp Met Pro Ala Gln Tyr Phe Leu Pro Gly Lys
1090 1095 1100
Ala Ile Val Gln Leu Glu Asp Gly Val Gln Ile Ser Ser Gly Asp Thr
1105 1110 1115 1120
Leu Ala Arg Ile Pro Gln Glu Ser Gly Gly Thr Lys Asp Ile Thr Gly
1125 1130 1135
Gly Leu Pro Arg Val Ala Asp Leu Phe Glu Ala Arg Arg Pro Lys Glu
1140 1145 1150
Pro Ala Ile Leu Ala Glu Ile Ser Gly Ile Val Ser Phe Gly Lys Glu
1155 1160 1165
Thr Lys Gly Lys Arg Arg Leu Val Ile Thr Pro Val Asp Gly Ser Asp
1170 1175 1180
Pro Tyr Glu Glu Met Ile Pro Lys Trp Arg Gln Leu Asn Val Phe Glu
1185 1190 1195 1200
Gly Glu Arg Val Glu Arg Gly Asp Val Ile Ser Asp Gly Pro Glu Ala
1205 1210 1215
Pro His Asp Ile Leu Arg Leu Arg Gly Val His Ala Val Thr Arg Tyr
1220 1225 1230
Ile Val Asn Glu Val Gln Asp Val Tyr Arg Leu Gln Gly Val Lys Ile
1235 1240 1245
Asn Asp Lys His Ile Glu Val Ile Val Arg Gln Met Leu Arg Lys Ala
1250 1255 1260
Thr Ile Val Asn Ala Gly Ser Ser Asp Phe Leu Glu Gly Glu Gln Val
1265 1270 1275 1280
Glu Tyr Ser Arg Val Lys Ile Ala Asn Arg Glu Leu Glu Ala Asn Gly
1285 1290 1295
Lys Val Gly Ala Thr Tyr Ser Arg Asp Leu Leu Gly Ile Thr Lys Ala
1300 1305 1310
Ser Leu Ala Thr Glu Ser Phe Ile Ser Ala Ala Ser Phe Gln Glu Thr
1315 1320 1325
Thr Arg Val Leu Thr Glu Ala Ala Val Ala Gly Lys Arg Asp Glu Leu
1330 1335 1340
Arg Gly Leu Lys Glu Asn Val Ile Val Gly Arg Leu Ile Pro Ala Gly
1345 1350 1355 1360
Thr Gly Tyr Ala Tyr His Gln Asp Arg Met Arg Arg Arg Ala Ala Gly
1365 1370 1375
Glu Ala Pro Ala Ala Pro Gln Val Thr Ala Glu Asp Ala Ser Ala Ser
1380 1385 1390
Leu Ala Glu Leu Leu Asn Ala Gly Leu Gly Gly Ser Asp Asn Glu
1395 1400 1405
<210> 28
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC N terminal domain
<400> 28
Lys Pro Glu Thr Ile Asn Tyr Cys Thr
1 5
<210> 29
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC central domain
<400> 29
Asn Ala Asp Phe Asp Gly Asp
1 5
<210> 30
<211> 6
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC C-terminal domain
<400> 30
Ser Ala Ala Ser Phe Gln
1 5
<210> 31
<211> 25
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC N terminal domain extended
<400> 31
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Cys Thr
1 5 10 15
Phe Lys Pro Glu Arg Asp Gly Leu Phe
20 25
<210> 32
<211> 23
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC central domain extended
<400> 32
His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp Gln
1 5 10 15
Met Ala Val His Val Pro Leu
20
<210> 33
<211> 55
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct modified RpoC C-terminal domain extended
<400> 33
Gly Ile Thr Lys Ala Ser Leu Ala Thr Glu Ser Phe Ile Ser Ala Ala
1 5 10 15
Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Glu Ala Ala Val Ala Gly
20 25 30
Lys Arg Asp Glu Leu Arg Gly Leu Lys Glu Asn Val Ile Val Gly Arg
35 40 45
Leu Ile Pro Ala Gly Thr Gly
50 55
<210> 34
<211> 1524
<212> PRT
<213> Thermus thermophilus
<400> 34
Met Lys Lys Glu Val Arg Lys Val Arg Ile Ala Leu Ala Ser Pro Glu
1 5 10 15
Lys Ile Arg Ser Trp Ser Tyr Gly Glu Val Glu Lys Pro Glu Thr Ile
20 25 30
Asn Tyr Arg Thr Leu Lys Pro Glu Arg Asp Gly Leu Phe Asp Glu Arg
35 40 45
Ile Phe Gly Pro Ile Lys Asp Tyr Glu Cys Ala Cys Gly Lys Tyr Lys
50 55 60
Arg Gln Arg Phe Glu Gly Lys Val Cys Glu Arg Cys Gly Val Glu Val
65 70 75 80
Thr Lys Ser Ile Val Arg Arg Tyr Arg Met Gly His Ile Glu Leu Ala
85 90 95
Thr Pro Ala Ala His Ile Trp Phe Val Lys Asp Val Pro Ser Lys Ile
100 105 110
Gly Thr Leu Leu Asp Leu Ser Ala Thr Glu Leu Glu Gln Val Leu Tyr
115 120 125
Phe Ser Lys Tyr Ile Val Leu Asp Pro Lys Gly Ala Ile Leu Asn Gly
130 135 140
Val Pro Val Glu Lys Arg Gln Leu Leu Thr Asp Glu Glu Tyr Arg Glu
145 150 155 160
Leu Arg Tyr Gly Lys Gln Glu Thr Tyr Pro Leu Pro Pro Gly Val Asp
165 170 175
Ala Leu Val Lys Asp Gly Glu Glu Val Val Lys Gly Gln Glu Leu Ala
180 185 190
Pro Gly Val Val Ser Arg Leu Asp Gly Val Ala Leu Tyr Arg Phe Pro
195 200 205
Arg Arg Val Arg Val Glu Tyr Val Lys Lys Glu Arg Ala Gly Leu Arg
210 215 220
Leu Pro Leu Ala Ala Trp Val Glu Lys Glu Ala Tyr Lys Pro Gly Glu
225 230 235 240
Ile Leu Ala Glu Leu Pro Glu Pro Tyr Leu Phe Arg Ala Glu Glu Glu
245 250 255
Gly Val Val Glu Leu Lys Glu Leu Glu Glu Gly Ala Phe Leu Val Leu
260 265 270
Arg Arg Glu Asp Glu Pro Val Ala Thr Tyr Phe Leu Pro Val Gly Met
275 280 285
Thr Pro Leu Val Val His Gly Glu Ile Val Glu Lys Gly Gln Pro Leu
290 295 300
Ala Glu Ala Lys Gly Leu Leu Arg Met Pro Arg Gln Val Arg Ala Ala
305 310 315 320
Gln Val Glu Ala Glu Glu Glu Gly Glu Thr Val Tyr Leu Thr Leu Phe
325 330 335
Leu Glu Trp Thr Glu Pro Lys Asp Tyr Arg Val Gln Pro His Met Asn
340 345 350
Val Val Val Pro Glu Gly Ala Arg Val Glu Ala Gly Asp Lys Ile Val
355 360 365
Ala Ala Ile Asp Pro Glu Glu Glu Val Ile Ala Glu Ala Glu Gly Val
370 375 380
Val His Leu His Glu Pro Ala Ser Ile Leu Val Val Lys Ala Arg Val
385 390 395 400
Tyr Pro Phe Glu Asp Asp Val Glu Val Ser Thr Gly Asp Arg Val Ala
405 410 415
Pro Gly Asp Val Leu Ala Asp Gly Gly Lys Val Lys Ser Asp Val Tyr
420 425 430
Gly Arg Val Glu Val Asp Leu Val Arg Asn Val Val Arg Val Val Glu
435 440 445
Ser Tyr Asp Ile Asp Ala Arg Met Gly Ala Glu Ala Ile Gln Gln Leu
450 455 460
Leu Lys Glu Leu Asp Leu Glu Ala Leu Glu Lys Glu Leu Leu Glu Glu
465 470 475 480
Met Lys His Pro Ser Arg Ala Arg Arg Ala Lys Ala Arg Lys Arg Leu
485 490 495
Glu Val Val Arg Ala Phe Leu Asp Ser Gly Asn Arg Pro Glu Trp Met
500 505 510
Ile Leu Glu Ala Val Pro Val Leu Pro Pro Asp Leu Arg Pro Met Val
515 520 525
Gln Val Asp Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr
530 535 540
Arg Arg Leu Ile Asn Arg Asn Asn Arg Leu Lys Lys Leu Leu Ala Gln
545 550 555 560
Gly Ala Pro Glu Ile Ile Ile Arg Asn Glu Lys Arg Met Leu Gln Glu
565 570 575
Ala Val Asp Ala Leu Leu Asp Asn Gly Arg Arg Gly Ala Pro Val Thr
580 585 590
Asn Pro Gly Ser Asp Arg Pro Leu Arg Ser Leu Thr Asp Ile Leu Ser
595 600 605
Gly Lys Gln Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp
610 615 620
Tyr Ser Gly Arg Ser Val Ile Val Val Gly Pro Gln Leu Lys Leu His
625 630 635 640
Gln Cys Gly Leu Pro Lys Arg Met Ala Leu Glu Leu Phe Lys Pro Phe
645 650 655
Leu Leu Lys Lys Met Glu Glu Lys Gly Ile Ala Pro Asn Val Lys Ala
660 665 670
Ala Arg Arg Met Leu Glu Arg Gln Arg Asp Ile Lys Asp Glu Val Trp
675 680 685
Asp Ala Leu Glu Glu Val Ile His Gly Lys Val Val Leu Leu Asn Arg
690 695 700
Ala Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe Gln Pro Val Leu
705 710 715 720
Val Glu Gly Gln Ser Ile Gln Leu His Pro Leu Val Cys Glu Ala Phe
725 730 735
Asn Ala Asp Phe Asp Gly Asp Gln Met Ala Val His Val Pro Leu Ser
740 745 750
Ser Phe Ala Gln Ala Glu Ala Arg Ile Gln Met Leu Ser Ala His Asn
755 760 765
Leu Leu Ser Pro Ala Ser Gly Glu Pro Leu Ala Lys Pro Ser Arg Asp
770 775 780
Ile Ile Leu Gly Leu Tyr Tyr Ile Thr Gln Val Arg Lys Glu Lys Lys
785 790 795 800
Gly Ala Gly Leu Glu Phe Ala Thr Pro Glu Glu Ala Leu Ala Ala His
805 810 815
Glu Arg Gly Glu Val Ala Leu Asn Ala Pro Ile Lys Val Ala Gly Arg
820 825 830
Glu Thr Ser Val Gly Arg Leu Lys Tyr Val Phe Ala Asn Pro Asp Glu
835 840 845
Ala Leu Leu Ala Val Ala His Gly Ile Val Asp Leu Gln Asp Val Val
850 855 860
Thr Val Arg Tyr Met Gly Lys Arg Leu Glu Thr Ser Pro Gly Arg Ile
865 870 875 880
Leu Phe Ala Arg Ile Val Ala Glu Ala Val Glu Asp Glu Lys Val Ala
885 890 895
Trp Glu Leu Ile Gln Leu Asp Val Pro Gln Glu Lys Asn Ser Leu Lys
900 905 910
Asp Leu Val Tyr Gln Ala Phe Leu Arg Leu Gly Met Glu Lys Thr Ala
915 920 925
Arg Leu Leu Asp Ala Leu Lys Tyr Tyr Gly Phe Thr Phe Ser Thr Thr
930 935 940
Ser Gly Ile Thr Ile Gly Ile Asp Asp Ala Val Ile Pro Glu Glu Lys
945 950 955 960
Lys Gln Tyr Leu Glu Glu Ala Asp Arg Lys Leu Leu Gln Ile Glu Gln
965 970 975
Ala Tyr Glu Met Gly Phe Leu Thr Asp Arg Glu Arg Tyr Asp Gln Ile
980 985 990
Leu Gln Leu Trp Thr Glu Thr Thr Glu Lys Val Thr Gln Ala Val Phe
995 1000 1005
Lys Asn Phe Glu Glu Asn Tyr Pro Phe Asn Pro Leu Tyr Val Met Ala
1010 1015 1020
Gln Ser Gly Ala Arg Gly Asn Pro Gln Gln Ile Arg Gln Leu Cys Gly
1025 1030 1035 1040
Leu Arg Gly Leu Met Gln Lys Pro Ser Gly Glu Thr Phe Glu Val Pro
1045 1050 1055
Val Arg Ser Ser Phe Arg Glu Gly Leu Thr Val Leu Glu Tyr Phe Ile
1060 1065 1070
Ser Ser His Gly Ala Arg Lys Gly Gly Ala Asp Thr Ala Leu Arg Thr
1075 1080 1085
Ala Asp Ser Gly Tyr Leu Thr Arg Lys Leu Val Asp Val Thr His Glu
1090 1095 1100
Ile Val Val Arg Glu Ala Asp Cys Gly Thr Thr Asn Tyr Ile Ser Val
1105 1110 1115 1120
Pro Leu Phe Gln Pro Asp Glu Val Thr Arg Ser Leu Arg Leu Arg Lys
1125 1130 1135
Arg Ala Asp Ile Glu Ala Gly Leu Tyr Gly Arg Val Leu Ala Arg Glu
1140 1145 1150
Val Glu Val Leu Gly Val Arg Leu Glu Glu Gly Arg Tyr Leu Ser Met
1155 1160 1165
Asp Asp Val His Leu Leu Ile Lys Ala Ala Glu Ala Gly Glu Ile Gln
1170 1175 1180
Glu Val Pro Val Arg Ser Pro Leu Thr Cys Gln Thr Arg Tyr Gly Val
1185 1190 1195 1200
Cys Gln Lys Cys Tyr Gly Tyr Asp Leu Ser Met Ala Arg Pro Val Ser
1205 1210 1215
Ile Gly Glu Ala Val Gly Ile Val Ala Ala Gln Ser Ile Gly Glu Pro
1220 1225 1230
Gly Thr Gln Leu Thr Met Arg Thr Phe His Thr Gly Gly Val Ala Gly
1235 1240 1245
Ala Ala Asp Ile Thr Gln Gly Leu Pro Arg Val Ile Glu Leu Phe Glu
1250 1255 1260
Ala Arg Arg Pro Lys Ala Lys Ala Val Ile Ser Glu Ile Asp Gly Val
1265 1270 1275 1280
Val Arg Ile Glu Glu Thr Glu Glu Lys Leu Ser Val Phe Val Glu Ser
1285 1290 1295
Glu Gly Phe Ser Lys Glu Tyr Lys Leu Pro Lys Glu Ala Arg Leu Leu
1300 1305 1310
Val Lys Asp Gly Asp Tyr Val Glu Ala Gly Gln Pro Leu Thr Arg Gly
1315 1320 1325
Ala Ile Asp Pro His Gln Leu Leu Glu Ala Lys Gly Pro Glu Ala Val
1330 1335 1340
Glu Arg Tyr Leu Val Glu Glu Ile Gln Lys Val Tyr Arg Ala Gln Gly
1345 1350 1355 1360
Val Lys Leu His Asp Lys His Ile Glu Ile Val Val Arg Gln Met Met
1365 1370 1375
Lys Tyr Val Glu Val Thr Asp Pro Gly Asp Ser Arg Leu Leu Glu Gly
1380 1385 1390
Gln Val Leu Glu Lys Trp Asp Val Glu Ala Leu Asn Glu Arg Leu Ile
1395 1400 1405
Ala Glu Gly Lys Thr Pro Val Ala Trp Lys Pro Leu Leu Met Gly Val
1410 1415 1420
Thr Lys Ser Ala Leu Ser Thr Lys Ser Trp Leu Ser Ala Ala Ser Phe
1425 1430 1435 1440
Gln Asn Thr Thr His Val Leu Thr Glu Ala Ala Ile Ala Gly Lys Lys
1445 1450 1455
Asp Glu Leu Ile Gly Leu Lys Glu Asn Val Ile Leu Gly Arg Leu Ile
1460 1465 1470
Pro Ala Gly Thr Gly Ser Asp Phe Val Arg Phe Thr Gln Val Val Asp
1475 1480 1485
Gln Lys Thr Leu Lys Ala Ile Glu Glu Ala Arg Lys Glu Ala Val Glu
1490 1495 1500
Ala Lys Glu Arg Pro Ala Ala Arg Arg Gly Val Lys Arg Glu Gln Pro
1505 1510 1515 1520
Gly Lys Gln Ala
<210> 35
<211> 1391
<212> PRT
<213> Acetobacter pasteurianus
<400> 35
Met Asn Glu Leu Met Lys Ile Leu Gly Gln Thr Gly Gln Ala Met Thr
1 5 10 15
Phe Asp Gln Ile Lys Ile Gln Leu Ala Ser Pro Glu Gln Ile Arg Ser
20 25 30
Trp Ser Tyr Gly Glu Ile Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Arg Ile Phe Gly Pro
50 55 60
Ile Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Met Lys Phe
65 70 75 80
Arg Gly Ile Ile Cys Glu Lys Cys Gly Val Glu Val Thr Leu Ala Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Gln Leu Ala Ser Pro Val Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Met Val
115 120 125
Asp Met Thr Leu Lys Asp Leu Glu Lys Val Leu Tyr Phe Glu Ser Tyr
130 135 140
Leu Val Leu Glu Pro Gly Thr Ser Pro Leu Lys Gln Tyr Ser Leu Leu
145 150 155 160
Thr Glu Glu Gln Tyr Leu Asp Ala Met Asp Glu Tyr Gly Asp Glu Gly
165 170 175
Val Glu Val Gly Ile Gly Ala Glu Ala Ile Lys Lys Val Leu Glu Arg
180 185 190
Ile Asp Cys Asp Ala Glu Lys Val Glu Leu Arg Gln Glu Leu Lys Glu
195 200 205
Thr Thr Ser Glu Ala Lys Arg Lys Lys Leu Val Lys Arg Leu Lys Leu
210 215 220
Ile Glu Ala Phe Ala Glu Ser Gly Ser Arg Pro Glu Trp Met Ile Leu
225 230 235 240
Asp Leu Val Pro Val Ile Pro Pro Asp Leu Arg Pro Leu Val Pro Leu
245 250 255
Asp Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg
260 265 270
Val Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Met Glu Leu Arg Ala
275 280 285
Pro Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val
290 295 300
Asp Ala Leu Phe Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ala
305 310 315 320
Asn Lys Arg Pro Leu Lys Ser Leu Ser Asp Met Leu Lys Gly Lys Gln
325 330 335
Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly
340 345 350
Arg Ser Val Ile Val Val Gly Pro Glu Leu Lys Leu His Gln Cys Gly
355 360 365
Leu Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Tyr Ala
370 375 380
Lys Leu Glu Lys Tyr Gly His Ala Thr Thr Ile Lys Ala Ala Lys Arg
385 390 395 400
Met Val Glu Lys Glu Arg Pro Glu Val Trp Asp Ile Leu Glu Glu Val
405 410 415
Ile Arg Glu His Pro Val Met Leu Asn Arg Ala Pro Thr Leu His Arg
420 425 430
Leu Gly Ile Gln Ala Phe Glu Pro Val Leu Val Glu Gly Lys Ala Ile
435 440 445
Gln Leu His Pro Leu Val Cys Thr Ala Phe Asn Ala Asp Phe Asp Gly
450 455 460
Asp Gln Met Ala Val His Val Pro Leu Ser Leu Glu Ala Gln Leu Glu
465 470 475 480
Ala Arg Val Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Asn
485 490 495
Gly Lys Pro Ile Ile Val Pro Ser Gln Asp Ile Val Leu Gly Leu Tyr
500 505 510
Tyr Leu Ser Leu Glu Thr Pro Glu Phe Lys Val Thr Pro Asp Arg Cys
515 520 525
Glu Tyr Asp Glu Thr Thr Gly Ala Leu Thr Lys Glu Gly Ala Pro Ser
530 535 540
Phe Ser Ser Ile Gly Glu Val Glu Tyr Ala Leu Ser Ala Gly Ala Leu
545 550 555 560
Lys Leu His Asp Lys Ile Arg Ala Arg Phe Gln Lys Ile Gly Ala Asp
565 570 575
Gly Lys Val Thr Tyr Glu Thr Ala Val Thr Thr Pro Gly Arg Val Leu
580 585 590
Ile Ala Gln Ile Leu Pro Gln His Glu Ala Val Pro Phe Ser Leu Ile
595 600 605
Asn Arg Gln Leu Thr Lys Lys Ala Val Ser Asp Val Ile Asp Thr Val
610 615 620
Tyr Arg His Cys Gly Gln Lys Glu Ala Val Ile Phe Cys Asp Arg Leu
625 630 635 640
Met Ala Leu Gly Phe Arg His Ala Ala Lys Ala Gly Ile Ser Phe Gly
645 650 655
Lys Asp Asp Met Ile Ile Pro Pro Glu Lys Lys Glu Leu Val Asp Arg
660 665 670
Thr Ala Ala Glu Val Lys Glu Phe Glu Gln Gln Tyr Gln Asp Gly Leu
675 680 685
Ile Thr Ala Gly Glu Arg Tyr Asn Lys Val Val Asp Ala Trp Ser Arg
690 695 700
Cys Thr Asp Glu Val Gln Ala Ala Met Thr Lys Glu Ile Ser Arg Gln
705 710 715 720
Glu Val Gly Lys Gln Ile Asn Ser Val Trp Met Met Ser His Ser Gly
725 730 735
Ala Arg Gly Ser Pro Ala Gln Met Lys Gln Leu Ala Gly Met Arg Gly
740 745 750
Leu Met Ala Lys Pro Ser Gly Glu Ile Ile Glu Gln Pro Ile Ile Ala
755 760 765
Asn Phe Lys Glu Gly Leu Ser Val Leu Asp Tyr Phe Thr Ser Thr His
770 775 780
Gly Ala Arg Lys Gly Leu Ala Asp Thr Ala Leu Lys Thr Ala Asn Ser
785 790 795 800
Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ala Gln Asp Ser Ile Ile
805 810 815
Ile Glu Glu Asp Cys Gly Ser Glu Arg Gly Leu Thr Val Arg Ala Val
820 825 830
Met Asp Gly Gly Glu Val Val Ala Ser Leu Ser Glu Arg Ile Leu Gly
835 840 845
Arg Thr Val Ala Ser Asp Val Val Val Pro Gly Thr Gly Glu Val Ile
850 855 860
Val Pro Arg Asn His Leu Ile Asp Glu Ala Asp Ala Glu Arg Ile Glu
865 870 875 880
Lys Ser Gly Val Glu Thr Val His Ile Arg Ser Val Leu Thr Cys Asp
885 890 895
Ser Arg Val Gly Val Cys Gly Arg Cys Tyr Gly Arg Asp Leu Ala Arg
900 905 910
Gly Thr Pro Val Asn Ile Gly Glu Ala Val Gly Val Ile Ala Ala Gln
915 920 925
Ser Ile Gly Glu Pro Gly Thr Gln Leu Thr Met Arg Thr Phe His Ile
930 935 940
Gly Gly Ala Ala Gln Arg Gly Ala Glu Gln Ser Met Ile Glu Ala Ser
945 950 955 960
Arg Asp Gly His Val Val Ile Arg Asn Arg Asn Val Val His Asn Ser
965 970 975
Gln Asn Val Pro Ile Val Met Ala Arg Asn Cys Glu Ile Leu Leu Ser
980 985 990
Asp Asp Asn Gly Val Glu Lys Ala Arg Tyr Arg Val Pro Tyr Gly Ala
995 1000 1005
Arg Leu Leu Thr Glu Glu Gly Ala Lys Val Ala Arg Gly Gln Lys Leu
1010 1015 1020
Ala Glu Trp Asp Pro Tyr Thr Leu Pro Ile Ile Thr Glu Lys Ala Gly
1025 1030 1035 1040
Lys Val Glu Tyr Leu Asp Leu Ile Asp Ser Ile Thr Leu Val Glu Arg
1045 1050 1055
Met Asp Glu Val Thr Gly Leu Thr Ser Lys Val Val Val Asp Tyr Lys
1060 1065 1070
Gln Ala Gly Lys Gly Val Asp Leu Arg Pro Arg Leu Gln Leu Lys Asp
1075 1080 1085
Ala Asn Gly Asp Val Val Lys Leu Asp Asn Gly Ala Asp Ala Arg Tyr
1090 1095 1100
Phe Leu Ser Pro Glu Thr Leu Leu Ser Val Glu Asn Gly Thr Glu Val
1105 1110 1115 1120
Asn Ala Gly Asp Val Leu Ala Arg Leu Pro Arg Glu Gly Ser Lys Thr
1125 1130 1135
Arg Asp Ile Thr Gly Gly Leu Pro Arg Val Ala Glu Leu Phe Glu Ala
1140 1145 1150
Arg Arg Pro Lys Asp His Ala Ile Ile Ala Glu Met Glu Gly Arg Val
1155 1160 1165
Glu Phe Gly Lys Asp Tyr Lys Ser Lys Arg Arg Val Ile Val Lys Asn
1170 1175 1180
Asp Glu Thr Gly Glu Glu Gln Glu Tyr Leu Ile Pro Lys Gly Lys His
1185 1190 1195 1200
Ile Ser Val Gln Glu Gly Asp Phe Val Glu Lys Gly Asp Pro Leu Val
1205 1210 1215
Asp Gly Pro Arg Val Pro His Asp Ile Leu Lys Val Met Gly Val Glu
1220 1225 1230
Ala Leu Ser Asp Tyr Leu Ile Asn Glu Ile Gln Asp Val Tyr Arg Leu
1235 1240 1245
Gln Gly Val Lys Ile Asn Asp Lys His Ile Glu Val Ile Val Arg Gln
1250 1255 1260
Met Leu Gln Lys Val Glu Ile Leu Glu Pro Gly Asp Thr Thr Tyr Leu
1265 1270 1275 1280
Ile Gly Glu Thr Val Asp Arg Ile Glu Phe Glu Ala Glu Asn Ala Lys
1285 1290 1295
Cys Leu Lys Ala Gly Glu Arg Pro Ala Gln Gly Met Pro Val Leu Gln
1300 1305 1310
Gly Ile Thr Lys Ala Ser Leu Gln Thr Gln Ser Phe Ile Ser Ala Ala
1315 1320 1325
Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Glu Ala Ala Thr Ala Gly
1330 1335 1340
Lys Val Asp Lys Leu Met Gly Leu Lys Glu Asn Val Ile Val Gly Arg
1345 1350 1355 1360
Leu Ile Pro Ala Gly Thr Gly Ser Val Met Lys Arg Leu Arg Ala Ile
1365 1370 1375
Ala Ala Glu Gln Asp Arg Gln Arg Val Gly Arg Ser Ala Ala Glu
1380 1385 1390
<210> 36
<211> 1391
<212> PRT
<213> Neisseria gonorrhoeae
<400> 36
Met Asn Leu Leu Asn Leu Phe Asn Pro Leu Gln Thr Ala Gly Met Glu
1 5 10 15
Glu Glu Phe Asp Ala Ile Lys Ile Gly Ile Ala Ser Pro Glu Thr Ile
20 25 30
Arg Ser Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr
35 40 45
Arg Thr Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Lys Ile Phe
50 55 60
Gly Pro Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu
65 70 75 80
Lys Phe Lys Gly Val Thr Cys Glu Lys Cys Gly Val Glu Val Thr Leu
85 90 95
Ser Lys Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ala Pro
100 105 110
Val Ala His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Leu Gly Met
115 120 125
Val Leu Asn Met Thr Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu
130 135 140
Ala Phe Val Val Thr Asp Pro Gly Met Thr Pro Leu Gln Arg Arg Gln
145 150 155 160
Leu Leu Thr Glu Asp Asp Tyr Tyr Asn Lys Leu Asp Glu Tyr Gly Asp
165 170 175
Asp Phe Asp Ala Lys Met Gly Ala Glu Gly Ile Arg Glu Leu Leu Arg
180 185 190
Thr Leu Asp Val Ala Gly Glu Ile Glu Ile Leu Arg Gln Glu Leu Glu
195 200 205
Ser Thr Gly Ser Asp Thr Lys Ile Lys Lys Ile Ala Lys Arg Leu Lys
210 215 220
Val Leu Glu Ala Phe His Arg Ser Gly Met Lys Leu Glu Trp Met Ile
225 230 235 240
Met Asp Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro
245 250 255
Leu Asp Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg
260 265 270
Arg Val Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Glu Leu His
275 280 285
Ala Pro Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala
290 295 300
Val Asp Ser Leu Leu Asp Asn Gly Arg Arg Gly Lys Ala Met Thr Gly
305 310 315 320
Ala Asn Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys
325 330 335
Gly Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser
340 345 350
Gly Arg Ser Val Ile Thr Val Gly Pro Tyr Leu Arg Leu His Gln Cys
355 360 365
Gly Leu Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Phe
370 375 380
His Lys Leu Glu Lys Gln Gly Leu Ala Ser Thr Val Lys Ala Ala Lys
385 390 395 400
Lys Leu Val Glu Gln Glu Val Pro Glu Val Trp Asp Ile Leu Glu Glu
405 410 415
Val Ile Arg Glu His Pro Ile Met Leu Asn Arg Ala Pro Thr Leu His
420 425 430
Arg Leu Gly Ile Gln Ala Phe Glu Pro Ile Leu Ile Glu Gly Lys Ala
435 440 445
Ile Gln Leu His Pro Leu Val Cys Ala Ala Phe Asn Ala Asp Phe Asp
450 455 460
Gly Asp Gln Met Ala Val His Val Pro Leu Ser Leu Glu Ala Gln Met
465 470 475 480
Glu Ala Arg Thr Leu Met Leu Ala Ser Asn Asn Val Leu Ser Pro Ala
485 490 495
Asn Gly Glu Pro Ile Ile Val Pro Ser Gln Asp Ile Val Leu Gly Leu
500 505 510
Tyr Tyr Met Thr Arg Asp Arg Ile Asn Ala Lys Gly Glu Gly Ser Leu
515 520 525
Phe Ala Asp Val Lys Glu Val His Arg Ala Tyr His Thr Lys Gln Val
530 535 540
Glu Leu Gly Thr Lys Ile Thr Val Arg Leu Arg Glu Trp Val Lys Asn
545 550 555 560
Glu Ala Gly Glu Phe Glu Pro Val Val Asn Arg Tyr Glu Thr Thr Val
565 570 575
Gly Arg Ala Leu Leu Ser Glu Ile Leu Pro Lys Gly Leu Pro Phe Glu
580 585 590
Tyr Val Asn Lys Ala Leu Lys Lys Lys Glu Ile Ser Lys Leu Ile Asn
595 600 605
Ala Ser Phe Arg Leu Cys Gly Leu Arg Asp Thr Val Ile Phe Ala Asp
610 615 620
His Leu Met Tyr Thr Gly Phe Gly Phe Ala Ala Lys Gly Gly Ile Ser
625 630 635 640
Ile Ala Val Asp Asp Met Glu Ile Pro Lys Glu Lys Ala Ala Leu Leu
645 650 655
Ala Glu Ala Asn Ala Glu Val Lys Glu Ile Glu Asp Gln Tyr Arg Gln
660 665 670
Gly Leu Val Thr Asn Gly Glu Arg Tyr Asn Lys Val Val Asp Ile Trp
675 680 685
Gly Arg Ala Gly Asp Lys Ile Ala Lys Ala Met Met Asp Asn Leu Ser
690 695 700
Lys Gln Lys Val Ile Asp Arg Asp Gly Asn Glu Val Asp Gln Glu Ser
705 710 715 720
Phe Asn Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala
725 730 735
Ala Gln Ile Lys Gln Leu Ser Gly Met Arg Gly Leu Met Ala Lys Pro
740 745 750
Asp Gly Ser Ile Ile Glu Thr Pro Ile Thr Ser Asn Phe Arg Glu Gly
755 760 765
Leu Thr Val Leu Gln Tyr Phe Ile Ala Thr His Gly Ala Arg Lys Gly
770 775 780
Leu Ala Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg
785 790 795 800
Arg Leu Val Asp Val Thr Gln Asp Leu Val Val Val Glu Asp Asp Cys
805 810 815
Gly Thr Ser Asp Gly Phe Val Met Lys Ala Val Val Gln Gly Gly Asp
820 825 830
Val Ile Glu Ala Leu Arg Asp Arg Ile Leu Gly Arg Val Thr Ala Ser
835 840 845
Asp Val Val Asp Pro Ser Ser Gly Glu Thr Leu Val Glu Ala Gly Thr
850 855 860
Leu Leu Thr Glu Lys Leu Val Asp Met Ile Asp Gln Ser Gly Val Asp
865 870 875 880
Glu Val Lys Val Arg Thr Pro Ile Thr Cys Lys Thr Arg His Gly Leu
885 890 895
Cys Ala His Cys Tyr Gly Arg Asp Leu Ala Arg Gly Lys Leu Val Asn
900 905 910
Ala Gly Glu Ala Val Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro
915 920 925
Gly Thr Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser
930 935 940
Arg Ala Ala Ala Ala Ser Gln Val Glu Ala Lys Ser Asn Gly Thr Ala
945 950 955 960
Arg Phe Ser Ser Gln Met Arg Tyr Val Ala Asn Asn Lys Gly Glu Leu
965 970 975
Val Val Ile Gly Arg Ser Cys Glu Val Val Ile His Asp Asp Ile Gly
980 985 990
Arg Glu Arg Glu Arg His Lys Val Pro Tyr Gly Ala Ile Leu Leu Val
995 1000 1005
Gln Asp Gly Met Ala Ile Lys Ala Gly Gln Thr Leu Ala Thr Trp Asp
1010 1015 1020
Pro His Thr Arg Pro Met Ile Thr Glu His Ala Gly Met Val Lys Phe
1025 1030 1035 1040
Glu Asn Met Glu Glu Gly Val Thr Val Ala Lys Gln Thr Asp Asp Val
1045 1050 1055
Thr Gly Leu Ser Thr Leu Val Val Ile Asp Gly Lys Arg Arg Ser Ser
1060 1065 1070
Ser Ala Ser Lys Leu Leu Arg Pro Thr Val Lys Leu Leu Asp Glu Asn
1075 1080 1085
Gly Val Glu Ile Cys Ile Pro Gly Thr Ser Thr Pro Val Ser Met Ala
1090 1095 1100
Phe Pro Val Gly Ala Val Ile Thr Val Arg Glu Gly Gln Glu Ile Gly
1105 1110 1115 1120
Lys Gly Asp Val Leu Ala Arg Ile Pro Gln Ala Ser Ser Lys Thr Arg
1125 1130 1135
Asp Ile Thr Gly Gly Leu Pro Arg Val Ala Glu Leu Phe Glu Ala Arg
1140 1145 1150
Val Pro Lys Asp Ala Gly Met Leu Ala Glu Ile Thr Gly Thr Val Ser
1155 1160 1165
Phe Gly Lys Glu Thr Lys Gly Lys Gln Arg Leu Ile Ile Thr Asp Val
1170 1175 1180
Asp Gly Val Ala Tyr Glu Thr Leu Ile Ser Lys Glu Lys Gln Ile Leu
1185 1190 1195 1200
Val His Asp Gly Gln Val Val Asn Arg Gly Glu Thr Ile Val Asp Gly
1205 1210 1215
Ala Val Asp Pro His Asp Ile Leu Arg Leu Gln Gly Ile Glu Ala Leu
1220 1225 1230
Ala Arg Tyr Ile Val Gln Glu Val Gln Glu Val Tyr Arg Leu Gln Gly
1235 1240 1245
Val Lys Ile Ser Asp Lys His Ile Glu Val Ile Ile Arg Gln Met Leu
1250 1255 1260
Arg Arg Val Asn Ile Ala Asp Ala Gly Glu Thr Gly Phe Ile Thr Gly
1265 1270 1275 1280
Glu Gln Val Glu Arg Gly Asp Val Met Ala Ala Asn Glu Lys Ala Leu
1285 1290 1295
Glu Glu Gly Lys Glu Pro Ala Arg Tyr Glu Asn Ile Leu Leu Gly Ile
1300 1305 1310
Thr Lys Ala Ser Leu Ser Thr Asp Ser Phe Ile Ser Ala Ala Ser Phe
1315 1320 1325
Gln Glu Thr Thr Arg Val Leu Thr Glu Ala Ala Ile Met Gly Lys Gln
1330 1335 1340
Asp Glu Leu Arg Gly Leu Lys Glu Asn Val Ile Val Gly Arg Leu Ile
1345 1350 1355 1360
Pro Ala Gly Thr Gly Leu Thr Tyr His Arg Ser Arg His Gln Gln Trp
1365 1370 1375
Gln Gly Val Glu Gln Glu Thr Ala Glu Thr Gln Val Thr Asp Glu
1380 1385 1390
<210> 37
<211> 1401
<212> PRT
<213> Legionella pneumophila
<400> 37
Met Ser Asp Leu Leu Gly Ile Leu Lys Gln Gln Gly Gln Ser Glu Glu
1 5 10 15
Phe Asp Ala Ile Lys Ile Ala Leu Ala Ser Pro Glu Leu Ile Arg Ser
20 25 30
Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Lys Thr Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Leu Ala Leu Ala Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ser Pro Val Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Leu
115 120 125
Asp Met Thr Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Ala Phe
130 135 140
Val Val Val Asp Pro Gly Met Thr Glu Leu Glu Arg Gly Gln Leu Leu
145 150 155 160
Asn Asp Glu Ala Tyr Leu Asp Ala Met Glu Gln Tyr Gly Asp Glu Phe
165 170 175
Asp Ala Arg Met Gly Ala Glu Ala Ile Arg Asp Leu Leu Arg Gln Ile
180 185 190
Asp Leu Glu Asp Glu Ile Arg Asn Leu Arg Glu Glu Leu Pro Thr Thr
195 200 205
Asn Ser Glu Thr Lys Ile Lys Lys Ile Thr Lys Arg Leu Lys Leu Leu
210 215 220
Glu Ala Phe Tyr Glu Ser Gly Asn Lys Pro Glu Trp Met Ile Met Asp
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Asp Leu Asn Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ser Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Thr Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Val Val Gly Pro Thr Leu Lys Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Phe Ser Lys
370 375 380
Leu Glu Phe Arg Gly Leu Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Glu Ser Val Val Trp Asp Ile Leu Asp Asp Val Ile
405 410 415
Arg Glu His Pro Ile Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Thr Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Ser Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Ser Gly
485 490 495
Glu Pro Ile Ile Val Pro Ser Gln Asp Val Val Leu Gly Leu Tyr Tyr
500 505 510
Leu Thr Arg Glu Lys Val Asn Ala Leu Gly Glu Gly Lys Ile Tyr Ser
515 520 525
Ser Ala Gln Glu Ala Gln Asn Phe Tyr Glu Ala Gly His Leu Asp Ile
530 535 540
His Ala Lys Ile Lys Ile Arg Met Pro Lys Glu Asp Gly Glu Thr Gly
545 550 555 560
Tyr His Leu Val Glu Thr Thr Val Gly Arg Ala Ile Leu Ala Glu Ile
565 570 575
Leu Pro Lys Gly Met Pro Phe Asp Tyr Ile Asn Arg Thr Met Thr Lys
580 585 590
Lys Val Ile Ser Lys Val Ile Asp Ser Cys Tyr Arg Lys Phe Gly Leu
595 600 605
Lys Glu Thr Val Ile Phe Ala Asp Gln Leu Met Tyr Thr Gly Phe Lys
610 615 620
Tyr Ala Thr Arg Ser Gly Ala Ser Ile Gly Ile Glu Asp Met Glu Ile
625 630 635 640
Pro Asp Asp Lys Ser Ser Ile Ile Glu His Ala Asp Asn Glu Val Arg
645 650 655
Glu Ile Glu Ser Gln Phe Arg Ser Gly Leu Val Thr Asn Gly Glu Arg
660 665 670
Tyr Asn Lys Val Ile Asp Ile Trp Ser Arg Thr Asn Glu Leu Val Ala
675 680 685
Lys Ser Met Met Ser Lys Ile Ala Thr Glu Glu Val Thr Asp Ala Lys
690 695 700
Gly Asn Lys Val Arg Gln Glu Ser Phe Asn Pro Ile Phe Met Met Ala
705 710 715 720
Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln Ile Arg Gln Leu Ala Gly
725 730 735
Met Arg Gly Leu Met Ala Ala Pro Asp Gly Ser Ile Ile Glu Thr Pro
740 745 750
Ile Thr Ala Asn Phe Arg Glu Gly Leu Asn Val Phe Gln Tyr Phe Ile
755 760 765
Ser Thr His Gly Ala Arg Lys Gly Leu Ala Asp Thr Ala Leu Lys Thr
770 775 780
Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ala Gln Asp
785 790 795 800
Val Val Ile Thr Glu Asp Asp Cys Gly Thr Asp Thr Gly Ile Leu Met
805 810 815
Gln Pro Leu Ile Glu Gly Gly Asp Ile Val Glu Pro Leu His Glu Arg
820 825 830
Val Leu Gly Arg Val Val Ala Ser Asp Val Tyr Ile Pro Thr Gln Thr
835 840 845
Glu Pro Val Val Lys Ala Gly Thr Leu Leu Asp Glu Glu Trp Val Glu
850 855 860
Lys Leu Glu Lys His Gly Val Asp Gln Val Met Val Arg Ser Pro Ile
865 870 875 880
Thr Cys Gln Thr Arg Phe Gly Leu Cys Ala Lys Cys Tyr Gly Arg Asp
885 890 895
Leu Ala Arg Gly His Leu Val Asn Thr Gly Glu Ala Val Gly Ile Ile
900 905 910
Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr Gln Leu Thr Met Arg Thr
915 920 925
Phe His Ile Gly Gly Ala Ala Ser Arg Ala Thr Ala Ala Asn Asn Ile
930 935 940
Gln Ile Lys Thr Lys Gly Val Ile Arg Leu His Asn Ile Lys Thr Val
945 950 955 960
Thr His Glu Asn Lys Asn Leu Val Ala Val Ser Arg Ser Gly Glu Val
965 970 975
Thr Ile Val Asp Glu Phe Gly Arg Glu Arg Glu Arg Tyr Lys Val Pro
980 985 990
Tyr Gly Ala Val Ile Ser Ala Gln Asp Asn Ser Pro Val Glu Ala Gly
995 1000 1005
Gln Val Ile Ala Thr Trp Asp Pro His Thr His Pro Val Ile Ser Glu
1010 1015 1020
Val Ser Gly Arg Leu Lys Phe Val Asp Leu Ile Asp Gly Ile Thr Met
1025 1030 1035 1040
Asn Arg Gln Thr Asp Glu Leu Thr Gly Leu Ser Asn Ile Val Ile Ile
1045 1050 1055
Asp Ala Lys Gln Arg Ser Ala Ala Gly Arg Asp Leu Arg Pro Met Val
1060 1065 1070
Lys Leu Val Thr Asp Glu Gly Asp Asp Ile Tyr Leu Ala Gly Thr Asn
1075 1080 1085
Val Pro Ala Gln Tyr Tyr Leu Pro Val Asp Ala Ile Val Asn Phe Glu
1090 1095 1100
Asp Gly Ser Leu Val Gly Ile Gly Asp Val Ile Ala Arg Ile Pro Gln
1105 1110 1115 1120
Glu Arg Ser Lys Thr Arg Asp Ile Thr Gly Gly Leu Pro Arg Val Ala
1125 1130 1135
Asp Leu Phe Glu Ala Arg Lys Pro Lys Asp Ser Ala Val Met Ala Glu
1140 1145 1150
Val Ser Gly Leu Val Asn Phe Gly Lys Glu Thr Lys Gly Lys Arg Arg
1155 1160 1165
Leu Ile Ile Asn Val Ser Glu Asp Gln Cys His Glu Glu Leu Ile Pro
1170 1175 1180
Lys Trp Arg His Ile Ser Val Phe Glu Gly Glu His Val Glu Arg Gly
1185 1190 1195 1200
Glu Ile Ile Ala Glu Gly Ala Leu Asn Pro His Asp Ile Leu Arg Leu
1205 1210 1215
Leu Gly Val Gly Ala Leu Ala Asn Tyr Ile Val Asn Glu Val Gln Asp
1220 1225 1230
Val Tyr Arg Leu Gln Gly Val Lys Ile Asn Asp Lys His Ile Glu Val
1235 1240 1245
Ile Val Arg Gln Met Leu Arg Lys Arg Val Ile Thr Phe Ala Gly Asp
1250 1255 1260
Ser Lys Phe Leu Val Gly Glu Gln Val Glu Glu Ser Ala Met Leu Gln
1265 1270 1275 1280
Glu Asn Asp Lys Leu Leu Ala Glu Gly Lys Gln Ile Ala Arg Gly Thr
1285 1290 1295
Pro Ile Leu Leu Gly Ile Thr Lys Ala Ser Leu Ala Thr Glu Ser Phe
1300 1305 1310
Ile Ser Ala Ala Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Glu Ala
1315 1320 1325
Ala Val Ser Gly Lys Val Asp Glu Leu Arg Gly Leu Lys Glu Asn Val
1330 1335 1340
Met Val Gly Arg Leu Ile Pro Ala Gly Thr Gly Tyr Thr Tyr His Gln
1345 1350 1355 1360
Ser Arg Lys Ala Lys Arg Ala Arg Ala Ala Ala Gly Gly Asp Ser Ser
1365 1370 1375
Ala Thr His Thr Val Thr Ala Ser Asp Val Glu His Ala Leu Ser Glu
1380 1385 1390
Ala Leu Asn Ala Asp Asn His Glu His
1395 1400
<210> 38
<211> 1399
<212> PRT
<213> Pseudomonas aeruginosa
<400> 38
Met Lys Asp Leu Leu Asn Leu Leu Lys Asn Gln Gly Gln Ile Glu Glu
1 5 10 15
Phe Asp Ala Ile Arg Ile Gly Leu Ala Ser Pro Glu Met Ile Arg Ser
20 25 30
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Lys Ile Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Val Ala Leu Ala Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ser Pro Val Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Leu
115 120 125
Asp Met Thr Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Ser Tyr
130 135 140
Val Val Ile Asp Pro Gly Met Thr Thr Leu Glu Lys Gly Gln Leu Leu
145 150 155 160
Asn Asp Glu Gln Tyr Phe Glu Ala Leu Glu Glu Phe Gly Asp Asp Phe
165 170 175
Asp Ala Arg Met Gly Ala Glu Ala Val His Glu Leu Leu Asn Ala Ile
180 185 190
Asp Leu Glu His Glu Ile Gly Arg Leu Arg Glu Glu Ile Pro Gln Thr
195 200 205
Asn Ser Glu Thr Lys Ile Lys Lys Leu Ser Lys Arg Leu Lys Leu Met
210 215 220
Glu Ala Phe Gln Gly Ser Gly Asn Lys Pro Glu Trp Met Val Leu Thr
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Asp Leu Ala Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ser Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Thr Val Gly Pro Thr Leu Arg Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Phe Gly Lys
370 375 380
Leu Glu Gly Arg Gly Met Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Leu Pro Glu Val Trp Asp Val Leu Ala Glu Val Ile
405 410 415
Arg Glu His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Ala Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Asn Gly
485 490 495
Glu Pro Ile Ile Val Pro Ser Gln Asp Val Val Met Gly Leu Tyr Tyr
500 505 510
Met Thr Arg Glu Ala Ile Asn Ala Lys Gly Glu Gly Met Ala Phe Ala
515 520 525
Asp Leu Gln Glu Val Asp Arg Ala Tyr Arg Ser Gly Gln Ala Ser Leu
530 535 540
His Ala Arg Val Lys Val Arg Ile Asn Glu Lys Ile Lys Gly Glu Asp
545 550 555 560
Gly Gln Leu Thr Ala Asn Thr Arg Ile Val Asp Thr Thr Val Gly Arg
565 570 575
Ala Leu Leu Phe Gln Val Val Pro Ala Gly Leu Pro Phe Asp Val Val
580 585 590
Asn Gln Ser Met Lys Lys Lys Ala Ile Ser Lys Leu Ile Asn His Cys
595 600 605
Tyr Arg Val Val Gly Leu Lys Asp Thr Val Ile Phe Ala Asp Gln Leu
610 615 620
Met Tyr Thr Gly Phe Ala Tyr Ser Thr Ile Ser Gly Val Ser Ile Gly
625 630 635 640
Val Asn Asp Phe Val Ile Pro Asp Glu Lys Ala Arg Ile Ile Asn Ala
645 650 655
Ala Thr Asp Glu Val Lys Glu Ile Glu Ser Gln Tyr Ala Ser Gly Leu
660 665 670
Val Thr Gln Gly Glu Lys Tyr Asn Lys Val Ile Asp Leu Trp Ser Lys
675 680 685
Ala Asn Asp Glu Val Ser Lys Ala Met Met Ala Asn Leu Ser Lys Glu
690 695 700
Lys Val Val Asp Arg Glu Gly Lys Glu Val Asp Gln Glu Ser Phe Asn
705 710 715 720
Ser Met Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln
725 730 735
Ile Arg Gln Leu Ala Gly Met Arg Gly Leu Met Ala Lys Pro Asp Gly
740 745 750
Ser Ile Ile Glu Thr Pro Ile Thr Ala Asn Phe Arg Glu Gly Leu Asn
755 760 765
Val Leu Gln Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Leu Val Val Thr Glu Ile Asp Cys Gly Thr
805 810 815
Glu His Gly Leu Leu Met Ser Pro His Ile Glu Gly Gly Asp Val Val
820 825 830
Glu Pro Leu Gly Glu Arg Val Leu Gly Arg Val Ile Ala Arg Asp Val
835 840 845
Phe Lys Pro Gly Ser Asp Glu Val Ile Val Pro Ala Gly Thr Leu Ile
850 855 860
Asp Glu Lys Trp Val Asp Phe Leu Glu Val Met Ser Val Asp Glu Val
865 870 875 880
Val Val Arg Ser Pro Ile Thr Cys Glu Thr Arg His Gly Ile Cys Ala
885 890 895
Met Cys Tyr Gly Arg Asp Leu Ala Arg Gly His Arg Val Asn Ile Gly
900 905 910
Glu Ala Val Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
915 920 925
Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser Arg Thr
930 935 940
Ser Ala Ala Asp Asn Val Gln Val Lys Asn Gly Gly Thr Ile Arg Leu
945 950 955 960
His Asn Leu Lys His Val Val Arg Ala Asp Gly Ala Leu Val Ala Val
965 970 975
Ser Arg Ser Gly Glu Leu Ala Val Ala Asp Asp Phe Gly Arg Glu Arg
980 985 990
Glu Arg Tyr Lys Leu Pro Tyr Gly Ala Val Ile Ser Val Lys Glu Gly
995 1000 1005
Asp Lys Val Asp Pro Gly Ala Ile Val Ala Lys Trp Asp Pro His Thr
1010 1015 1020
His Pro Ile Val Thr Glu Val Asp Gly Thr Val Ala Phe Val Gly Met
1025 1030 1035 1040
Glu Glu Gly Ile Thr Val Lys Arg Gln Thr Asp Glu Leu Thr Gly Leu
1045 1050 1055
Thr Asn Ile Glu Val Met Asp Pro Lys Asp Arg Pro Ala Ala Gly Lys
1060 1065 1070
Asp Ile Arg Pro Ala Val Lys Leu Ile Asp Ala Ala Gly Lys Asp Leu
1075 1080 1085
Leu Leu Pro Gly Thr Asp Val Pro Ala Gln Tyr Phe Leu Pro Ala Asn
1090 1095 1100
Ala Leu Val Asn Leu Thr Asp Gly Ala Lys Val Ser Ile Gly Asp Val
1105 1110 1115 1120
Val Ala Arg Ile Pro Gln Glu Thr Ser Lys Thr Arg Asp Ile Thr Gly
1125 1130 1135
Gly Leu Pro Arg Val Ala Asp Leu Phe Glu Ala Arg Arg Pro Lys Glu
1140 1145 1150
Pro Ser Ile Leu Ala Glu Ile Ser Gly Thr Ile Ser Phe Gly Lys Glu
1155 1160 1165
Thr Lys Gly Lys Arg Arg Leu Val Ile Thr Pro Asn Asp Gly Ser Asp
1170 1175 1180
Pro Tyr Glu Glu Leu Ile Pro Lys Trp Arg His Leu Asn Val Phe Glu
1185 1190 1195 1200
Gly Glu Gln Val Asn Arg Gly Glu Val Ile Ser Asp Gly Pro Ser Asn
1205 1210 1215
Pro His Asp Ile Leu Arg Leu Leu Gly Val Ser Ser Leu Ala Lys Tyr
1220 1225 1230
Ile Val Asn Glu Ile Gln Asp Val Tyr Arg Leu Gln Gly Val Lys Ile
1235 1240 1245
Asn Asp Lys His Ile Glu Thr Ile Leu Arg Gln Met Leu Arg Lys Val
1250 1255 1260
Glu Val Ser Glu Ser Gly Asp Ser Ser Phe Ile Lys Gly Asp Gln Val
1265 1270 1275 1280
Glu Leu Thr Gln Val Leu Glu Glu Asn Glu Gln Leu Gly Thr Glu Asp
1285 1290 1295
Lys Phe Pro Ala Lys Tyr Glu Arg Val Leu Leu Gly Ile Thr Lys Ala
1300 1305 1310
Ser Leu Ser Thr Glu Ser Phe Ile Ser Ala Ala Ser Phe Gln Glu Thr
1315 1320 1325
Thr Arg Val Leu Thr Glu Ala Ala Val Thr Gly Lys Arg Asp Phe Leu
1330 1335 1340
Arg Gly Leu Lys Glu Asn Val Val Val Gly Arg Leu Ile Pro Ala Gly
1345 1350 1355 1360
Thr Gly Leu Ala Tyr His Ser Glu Arg Lys Arg Gln Arg Asp Leu Gly
1365 1370 1375
Lys Pro Gln Arg Val Ser Ala Ser Glu Ala Glu Ala Ala Leu Thr Glu
1380 1385 1390
Ala Leu Asn Ser Ser Gly Asn
1395
<210> 39
<211> 1401
<212> PRT
<213> Vibrio cholerae
<400> 39
Met Lys Asp Leu Leu Asn Phe Leu Lys Ala Gln His Lys Thr Glu Glu
1 5 10 15
Phe Asp Ala Ile Lys Ile Gly Leu Ala Ser Pro Asp Met Ile Arg Ser
20 25 30
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Arg Ile Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Val Thr Gln Thr Lys
85 90 95
Val Arg Arg Asp Arg Met Gly His Ile Glu Leu Ala Ser Pro Val Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Met
115 120 125
Asp Met Pro Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Met Tyr
130 135 140
Val Val Thr Glu Pro Gly Met Thr Asp Leu Glu Arg Gly Gln Met Leu
145 150 155 160
Thr Glu Glu Glu Tyr Leu Asp Arg Leu Glu Glu Trp Gly Asp Glu Phe
165 170 175
Thr Ala Lys Met Gly Ala Glu Ala Ile Lys Asp Leu Leu Ala Ser Met
180 185 190
Asp Leu Pro Ala Glu Ala Glu Gln Met Arg Glu Glu Leu Asp Thr Thr
195 200 205
Asn Ser Glu Thr Lys Arg Lys Lys Leu Thr Lys Arg Leu Lys Leu Val
210 215 220
Glu Ala Phe Val Ala Ser Gly Asn Lys Pro Glu Trp Met Ile Leu Thr
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Glu Leu Ala Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ser Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ser Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Thr Val Gly Pro Tyr Leu Arg Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Tyr Ser Lys
370 375 380
Leu Glu Thr Arg Gly Leu Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Glu Ala Val Val Trp Asp Ile Leu Asp Glu Val Ile
405 410 415
Arg Glu His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Thr Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Ser Gly
485 490 495
Asp Pro Ile Ile Val Pro Ser Gln Asp Val Val Leu Gly Leu Tyr Tyr
500 505 510
Met Thr Arg Glu Lys Ile Asn Ala Lys Gly Glu Gly Met Tyr Leu Thr
515 520 525
Gly Pro Ala Glu Ala Glu Lys Ala Tyr Arg Thr Lys Thr Ala Glu Leu
530 535 540
His Ala Arg Val Lys Val Arg Ile Thr Glu Thr Ile Lys His Glu Asn
545 550 555 560
Gly Lys Leu Thr Thr Glu Thr Lys Met Ile Asp Thr Thr Val Gly Arg
565 570 575
Ala Met Leu Trp Gln Ile Val Pro Lys Gly Leu Pro Tyr Ser Leu Val
580 585 590
Asn Gln Lys Leu Gly Lys Lys Gln Ile Ser Asn Leu Leu Asn Glu Ala
595 600 605
Tyr Arg Lys Leu Gly Leu Lys Asp Thr Val Ile Phe Ala Asp Gln Ile
610 615 620
Met Tyr Thr Gly Phe Ala Tyr Ala Ala Leu Ser Gly Val Ser Val Gly
625 630 635 640
Ile Asp Asp Met Val Val Pro Ala Ala Lys Tyr Thr Glu Ile Ala Glu
645 650 655
Ala Glu Glu Glu Val Arg Glu Ile Gln Glu Gln Phe Gln Ser Gly Leu
660 665 670
Val Thr Ala Gly Glu Arg Tyr Asn Lys Val Ile Asp Ile Trp Ala Ser
675 680 685
Thr Asn Asp Arg Val Ala Lys Ala Met Met Glu Asn Leu Ser Ser Glu
690 695 700
Gln Val Ile Asn Arg Gln Gly Glu Gln Glu Lys Gln Glu Ser Phe Asn
705 710 715 720
Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln
725 730 735
Ile Arg Gln Leu Ala Gly Met Arg Gly Leu Met Ala Arg Pro Asp Gly
740 745 750
Ser Ile Ile Glu Thr Pro Ile Thr Ala Asn Phe Lys Glu Gly Leu Asn
755 760 765
Val Leu Gln Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Val Val Val Thr Glu His Asp Cys Gly Thr
805 810 815
Leu Glu Gly Val Val Met Thr Pro His Ile Glu Gly Gly Asp Val Lys
820 825 830
Val Ala Leu Thr Glu Leu Ala Leu Gly Arg Val Val Ser Glu Asp Ile
835 840 845
Leu Lys Pro Gly Thr Asp Glu Val Leu Ile Pro Arg Asn Thr Leu Leu
850 855 860
Asp Glu Lys Trp Cys Lys Val Ile Asn Asp Asn Ser Val Asp Gln Ile
865 870 875 880
Lys Val Arg Ser Val Val Thr Cys Asp Ser Asp Phe Gly Cys Cys Ala
885 890 895
Gln Cys Tyr Gly Arg Asp Leu Ala Arg Gly His Leu Val Asn Gln Gly
900 905 910
Glu Ala Val Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
915 920 925
Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser Thr Ala
930 935 940
Ala Ala Glu Asn Ser Ile Gln Ala Lys Asn Asn Gly Ser Val Lys Leu
945 950 955 960
His Asn Ala Lys Phe Val Thr Asn Lys Asp Gly Lys Leu Val Ile Thr
965 970 975
Ser Arg Ala Ser Glu Leu Thr Ile Ile Asp Glu Phe Gly Arg Thr Lys
980 985 990
Glu Lys His Lys Leu Pro Tyr Gly Ser Met Leu Ser Lys Ala Asp Gly
995 1000 1005
Asp Ala Val Ala Ala Gly Glu Thr Val Ala Asn Trp Glu Ala His Thr
1010 1015 1020
Met Pro Ile Ile Thr Glu Val Ala Gly Arg Val Gln Phe Val Asp Met
1025 1030 1035 1040
Ile Asp Gly Val Thr Val Ser Arg Gln Thr Asp Asp Leu Thr Gly Leu
1045 1050 1055
Ser Ser Ser Glu Val Thr Glu Ala Ala Ala Arg Pro Ala Ala Gly Lys
1060 1065 1070
Asp Met Arg Pro Ala Ile Lys Leu Val Asp Ala Asn Gly Lys Asp Val
1075 1080 1085
Leu Ile Pro Gly Thr Asp Met Pro Ala Gln Tyr Phe Leu Pro Gly Lys
1090 1095 1100
Ala Ile Val Asn Leu Asp Asp Gly Ala Glu Val Asn Val Gly Asp Thr
1105 1110 1115 1120
Leu Ala Arg Ile Pro Gln Lys Ser Gly Gly Asn Lys Asp Ile Thr Gly
1125 1130 1135
Gly Leu Pro Arg Val Ala Asp Leu Phe Glu Ala Arg Lys Pro Lys Glu
1140 1145 1150
Pro Ala Ile Leu Ala Glu His Ser Gly Thr Val Ser Phe Gly Lys Glu
1155 1160 1165
Thr Lys Gly Lys Arg Arg Leu Ile Ile Thr Arg Asp Ser Gly Asp Thr
1170 1175 1180
Tyr Glu Glu Met Ile Pro Lys His Arg Gln Leu Asn Val Phe Glu Gly
1185 1190 1195 1200
Glu Arg Ile Glu Arg Gly Asp Val Ile Ala Asp Gly Pro Glu Ser Pro
1205 1210 1215
His Asp Ile Leu Arg Leu Arg Gly Ile His Ala Val Thr Thr Tyr Ile
1220 1225 1230
Ala Asn Glu Val Gln Glu Val Tyr Arg Leu Gln Gly Val Lys Ile Asn
1235 1240 1245
Asp Lys His Ile Glu Thr Ile Val Arg Gln Met Leu Arg Lys Cys Thr
1250 1255 1260
Ile Thr Phe Ala Gly Asp Ser Glu Phe Leu Pro Gly Glu Thr Val Glu
1265 1270 1275 1280
Tyr Ser Gln Val Lys Ile Ala Asn Arg Lys Leu Val Glu Glu Gly Lys
1285 1290 1295
Glu Pro Ala Arg Phe Glu Arg Glu Leu Leu Gly Ile Thr Lys Ala Ser
1300 1305 1310
Leu Ala Thr Glu Ser Phe Ile Ser Ala Ala Ser Phe Gln Glu Thr Thr
1315 1320 1325
Arg Val Leu Thr Glu Ala Ala Val Ser Gly Lys Arg Asp Asp Leu Arg
1330 1335 1340
Gly Leu Lys Glu Asn Val Ile Val Gly Arg Leu Ile Pro Ala Gly Thr
1345 1350 1355 1360
Gly Phe Ala Tyr His Gln Asp Arg Gln Ala Lys Arg Ala Gln Glu Gln
1365 1370 1375
Gln Gly Pro Ser Ala Glu Gln Ala Thr Asp Asn Leu Ala Ala Leu Leu
1380 1385 1390
Asn Ala Gly Phe Ser Ser Asp Asp Glu
1395 1400
<210> 40
<211> 1407
<212> PRT
<213> Salmonella enterica
<400> 40
Met Lys Asp Leu Leu Lys Phe Leu Lys Ala Gln Thr Lys Thr Glu Glu
1 5 10 15
Phe Asp Ala Ile Lys Ile Ala Leu Ala Ser Pro Asp Met Ile Arg Ser
20 25 30
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Arg Ile Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Val Thr Gln Thr Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ser Pro Thr Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Leu
115 120 125
Asp Met Pro Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Ser Tyr
130 135 140
Val Val Ile Glu Gly Gly Met Thr Asn Leu Glu Arg Gln Gln Ile Leu
145 150 155 160
Thr Glu Glu Gln Tyr Leu Asp Ala Leu Glu Glu Phe Gly Asp Glu Phe
165 170 175
Asp Ala Lys Met Gly Ala Glu Ala Ile Gln Ala Leu Leu Lys Ser Met
180 185 190
Asp Leu Glu Gln Glu Cys Glu Thr Leu Arg Glu Glu Leu Asn Glu Thr
195 200 205
Asn Ser Glu Thr Lys Arg Lys Lys Leu Thr Lys Arg Ile Lys Leu Leu
210 215 220
Glu Ala Phe Val Gln Ser Gly Asn Lys Pro Glu Trp Met Ile Leu Thr
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Asp Leu Ala Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ser Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Thr Val Gly Pro Tyr Leu Arg Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Tyr Gly Lys
370 375 380
Leu Glu Leu Arg Gly Leu Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Glu Ala Val Val Trp Asp Ile Leu Asp Glu Val Ile
405 410 415
Arg Glu His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Ala Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Asn Gly
485 490 495
Glu Pro Ile Ile Val Pro Ser Gln Asp Val Val Leu Gly Leu Tyr Tyr
500 505 510
Met Thr Arg Asp Cys Val Asn Ala Lys Gly Glu Gly Met Val Leu Thr
515 520 525
Gly Pro Lys Glu Ala Glu Arg Ile Tyr Arg Ala Gly Leu Ala Ser Leu
530 535 540
His Ala Arg Val Lys Val Arg Ile Thr Glu Tyr Glu Lys Asp Glu Asn
545 550 555 560
Gly Glu Phe Val Ala His Thr Ser Leu Lys Asp Thr Thr Val Gly Arg
565 570 575
Ala Ile Leu Trp Met Ile Val Pro Lys Gly Leu Pro Phe Ser Ile Val
580 585 590
Asn Gln Ala Leu Gly Lys Lys Ala Ile Ser Lys Met Leu Asn Thr Cys
595 600 605
Tyr Arg Ile Leu Gly Leu Lys Pro Thr Val Ile Phe Ala Asp Gln Thr
610 615 620
Met Tyr Thr Gly Phe Ala Tyr Ala Ala Arg Ser Gly Ala Ser Val Gly
625 630 635 640
Ile Asp Asp Met Val Ile Pro Glu Lys Lys His Glu Ile Ile Ser Glu
645 650 655
Ala Glu Ala Glu Val Ala Glu Ile Gln Glu Gln Phe Gln Ser Gly Leu
660 665 670
Val Thr Ala Gly Glu Arg Tyr Asn Lys Val Ile Asp Ile Trp Ala Ala
675 680 685
Ala Asn Asp Arg Val Ser Lys Ala Met Met Asp Asn Leu Gln Thr Glu
690 695 700
Thr Val Ile Asn Arg Asp Gly Gln Glu Glu Gln Gln Val Ser Phe Asn
705 710 715 720
Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln
725 730 735
Ile Arg Gln Leu Ala Gly Met Arg Gly Leu Met Ala Lys Pro Asp Gly
740 745 750
Ser Ile Ile Glu Thr Pro Ile Thr Ala Asn Phe Arg Glu Gly Leu Asn
755 760 765
Val Leu Gln Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Leu Val Val Thr Glu Asp Asp Cys Gly Thr
805 810 815
His Glu Gly Ile Leu Met Thr Pro Val Ile Glu Gly Gly Asp Val Lys
820 825 830
Glu Pro Leu Arg Asp Arg Val Leu Gly Arg Val Thr Ala Glu Asp Val
835 840 845
Leu Lys Pro Gly Thr Ala Asp Ile Leu Val Pro Arg Asn Thr Leu Leu
850 855 860
His Glu Gln Trp Cys Asp Leu Leu Glu Ala Asn Ser Val Asp Ala Val
865 870 875 880
Lys Val Arg Ser Val Val Ser Cys Asp Thr Asp Phe Gly Val Cys Ala
885 890 895
His Cys Tyr Gly Arg Asp Leu Ala Arg Gly His Ile Ile Asn Lys Gly
900 905 910
Glu Ala Ile Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
915 920 925
Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser Arg Ala
930 935 940
Ala Ala Glu Ser Ser Ile Gln Val Lys Asn Lys Gly Ser Ile Lys Leu
945 950 955 960
Ser Asn Val Lys Ser Val Val Asn Ser Ser Gly Lys Leu Val Ile Thr
965 970 975
Ser Arg Asn Thr Glu Leu Lys Leu Ile Asp Glu Phe Gly Arg Thr Lys
980 985 990
Glu Ser Tyr Lys Val Pro Tyr Gly Ala Val Met Ala Lys Gly Asp Gly
995 1000 1005
Glu Gln Val Ala Gly Gly Glu Thr Val Ala Asn Trp Asp Pro His Thr
1010 1015 1020
Met Pro Val Ile Thr Glu Val Ser Gly Phe Ile Arg Phe Thr Asp Met
1025 1030 1035 1040
Ile Asp Gly Gln Thr Ile Thr Arg Gln Thr Asp Glu Leu Thr Gly Leu
1045 1050 1055
Ser Ser Leu Val Val Leu Asp Ser Ala Glu Arg Thr Thr Gly Gly Lys
1060 1065 1070
Asp Leu Arg Pro Ala Leu Lys Ile Val Asp Ala Gln Gly Asn Asp Val
1075 1080 1085
Leu Ile Pro Gly Thr Asp Met Pro Ala Gln Tyr Phe Leu Pro Gly Lys
1090 1095 1100
Ala Ile Val Gln Leu Glu Asp Gly Val Gln Ile Ser Ser Gly Asp Thr
1105 1110 1115 1120
Leu Ala Arg Ile Pro Gln Glu Ser Gly Gly Thr Lys Asp Ile Thr Gly
1125 1130 1135
Gly Leu Pro Arg Val Ala Asp Leu Phe Glu Ala Arg Arg Pro Lys Glu
1140 1145 1150
Pro Ala Ile Leu Ala Glu Ile Ala Gly Ile Val Ser Phe Gly Lys Glu
1155 1160 1165
Thr Lys Gly Lys Arg Arg Leu Val Ile Thr Pro Val Asp Gly Ser Asp
1170 1175 1180
Pro Tyr Glu Glu Met Ile Pro Lys Trp Arg Gln Leu Asn Val Phe Glu
1185 1190 1195 1200
Gly Glu Arg Val Glu Arg Gly Asp Val Ile Ser Asp Gly Pro Glu Ala
1205 1210 1215
Pro His Asp Ile Leu Arg Leu Arg Gly Val His Ala Val Thr Arg Tyr
1220 1225 1230
Ile Val Asn Glu Val Gln Asp Val Tyr Arg Leu Gln Gly Val Lys Ile
1235 1240 1245
Asn Asp Lys His Ile Glu Val Ile Val Arg Gln Met Leu Arg Lys Ala
1250 1255 1260
Thr Ile Glu Ser Ala Gly Ser Ser Asp Phe Leu Glu Gly Glu Gln Val
1265 1270 1275 1280
Glu Tyr Ser Arg Val Lys Ile Ala Asn Arg Glu Leu Glu Ala Asn Gly
1285 1290 1295
Lys Val Gly Ala Thr Phe Ser Arg Asp Leu Leu Gly Ile Thr Lys Ala
1300 1305 1310
Ser Leu Ala Thr Glu Ser Phe Ile Ser Ala Ala Ser Phe Gln Glu Thr
1315 1320 1325
Thr Arg Val Leu Thr Glu Ala Ala Val Ala Gly Lys Arg Asp Glu Leu
1330 1335 1340
Arg Gly Leu Lys Glu Asn Val Ile Val Gly Arg Leu Ile Pro Ala Gly
1345 1350 1355 1360
Thr Gly Tyr Ala Tyr His Gln Asp Arg Met Arg Arg Arg Ala Ala Gly
1365 1370 1375
Glu Gln Pro Ala Thr Pro Gln Val Thr Ala Glu Asp Ala Ser Ala Ser
1380 1385 1390
Leu Ala Glu Leu Leu Asn Ala Gly Leu Gly Gly Ser Asp Asn Glu
1395 1400 1405
<210> 41
<211> 1300
<212> PRT
<213> Actinomyces odontolyticus
<400> 41
Met Leu Asp Ala Lys Thr Phe Asp Ser Leu Lys Ile Thr Leu Ala Thr
1 5 10 15
Gly Asp Asp Ile Ala Glu Trp Ser His Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Arg Asp Gly Leu Phe Gly
35 40 45
Glu Gln Ile Phe Gly Pro Thr Arg Asp Trp Glu Cys Ala Cys Gly Lys
50 55 60
Tyr Lys Arg Val Arg Tyr Lys Gly Ile Val Cys Glu Lys Cys Gly Val
65 70 75 80
Glu Val Thr Arg Ser Arg Val Arg Arg Glu Arg Met Gly His Ile Asp
85 90 95
Leu Ala Ala Pro Val Thr His Ile Trp Tyr Phe Lys Gly Val Pro Ser
100 105 110
Arg Leu Gly Tyr Val Leu Asn Leu Ala Pro Lys Asp Leu Glu Lys Val
115 120 125
Ile Tyr Phe Ala Ala Tyr Met Ile Thr Glu Val Asp Glu Lys Gly Arg
130 135 140
His Glu Asp Leu Ala Glu Leu Arg Ala Glu Leu Glu Val Gln Lys Lys
145 150 155 160
Gln Met Glu Asn Asn Arg Asp Ala Thr Ile Asn Asp Phe Ala Glu Gln
165 170 175
Leu Glu Ser Asp Met Ala Ala Leu Glu Lys Asp Gly Ala Ser Gln Ala
180 185 190
Glu Arg Glu Arg Ala Arg Lys Gln Gly Glu Arg Glu Met Ala Lys Ile
195 200 205
Arg Arg Arg Phe Asp Gly Asp Ile Glu Gly Leu Glu Ala Val Trp Glu
210 215 220
Arg Phe Lys Asp Leu Lys Val Gly Asp Leu Glu Gly Asp Glu Arg Leu
225 230 235 240
Tyr Arg Ala Met Val Ala Arg Tyr Gly Thr Tyr Phe Lys Gly Asp Met
245 250 255
Gly Ala Ala Ala Ile Gln Lys Arg Leu Glu Thr Phe Asp Leu Glu Ala
260 265 270
Glu Val Ala Ala Leu Arg Gln Thr Ile Gly Ser Asp Ser Gly Pro Arg
275 280 285
Lys Ala Arg Ala Ile Lys Arg Leu Lys Val Ile Asn Ala Phe Val Ala
290 295 300
Thr Gly Asn Ser Pro Ala Ser Met Val Leu Thr Lys Ile Pro Val Ile
305 310 315 320
Pro Pro Asp Leu Arg Pro Met Val Gln Leu Asp Gly Gly Arg Phe Ala
325 330 335
Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Thr
340 345 350
Arg Leu Lys Arg Leu Leu Glu Leu Gly Ala Pro Glu Ile Ile Val Asn
355 360 365
Asn Glu Lys Arg Met Leu Gln Glu Ser Val Asp Ala Leu Phe Asp Asn
370 375 380
Gly Arg Arg Gly Arg Pro Val Ala Gly Pro Gly Asn Arg Pro Leu Lys
385 390 395 400
Ser Ile Ser Asp Met Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn
405 410 415
Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg Ser Val Ile Val Val
420 425 430
Gly Pro Gln Leu Gln Leu His Gln Cys Gly Leu Pro Lys Gln Met Ala
435 440 445
Leu Glu Leu Phe Lys Pro Phe Val Met Lys Arg Leu Val Glu Lys Asn
450 455 460
Tyr Ala Gln Asn Val Lys Ala Ala Lys Arg Lys Val Glu Arg Gln Arg
465 470 475 480
Pro Glu Val Trp Asp Val Leu Asp Asp Val Ile Arg Glu His Pro Val
485 490 495
Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe
500 505 510
Glu Pro Gln Leu Ile Glu Gly Lys Ala Ile Gln Leu His Pro Leu Ala
515 520 525
Cys Gly Ala Phe Asn Ala Asp Phe Asp Gly Asp Gln Met Ala Val His
530 535 540
Leu Pro Leu Gly Ala Glu Ala Gln Ala Glu Ala Arg Ile Leu Met Leu
545 550 555 560
Ser Thr Asn Asn Ile Leu Lys Pro Ser Asp Gly Arg Pro Val Ala Met
565 570 575
Pro Ser Gln Asp Met Ile Ile Gly Leu Phe His Leu Thr Ser Thr Pro
580 585 590
Asp Pro Ser Val Pro Val Glu Lys Asp Glu Asp Gly Asn Pro Val Ile
595 600 605
Pro Tyr Phe Ser Ser Gln Ala Glu Ala Gln Met Ala Tyr Asp Ala Gly
610 615 620
Asn Leu His Leu Asn Ala Thr Ala Arg Ile Arg Phe Ala Asp Gly Thr
625 630 635 640
Val Pro Pro Glu Gly Trp Glu Ala Pro Glu Gly Trp Glu Pro Gly Asp
645 650 655
Glu Leu Ile Leu Glu Thr Ser Leu Gly Arg Ala Ile Phe Asn Glu Gln
660 665 670
Leu Pro Thr Asp Tyr Pro Phe Ile Asn Glu Val Val Gly Lys Lys Gln
675 680 685
Leu Gly Asn Ile Val Asn Thr Leu Thr Gln Arg Tyr Pro Asn Val Leu
690 695 700
Val Ala Asp Cys Leu Asp Ala Leu Lys Ser Ala Gly Phe His Trp Ser
705 710 715 720
Thr Trp Ser Gly Ile Thr Ile Ala Phe Ser Asp Ile Gln Ala Ser Pro
725 730 735
Arg Lys Arg Glu Ile Leu Ala Arg Tyr Glu Ala Lys Ala Ala Glu Ile
740 745 750
Val Glu Gln Phe Glu Thr Gly Ile Ile Leu Glu Glu Thr Arg Tyr Glu
755 760 765
Glu Leu Val Lys Leu Trp Leu Gln Cys Thr Glu Glu Val Ala Asp Asp
770 775 780
Met Arg Ala Asn Phe Asp Glu Arg Asn Thr Val Tyr Arg Met Val Asn
785 790 795 800
Ser Gly Ala Arg Gly Asn Trp Ser Gln Val Gln Gln Ile Ala Gly Met
805 810 815
Arg Gly Leu Val Ser Asp Pro Lys Gln Lys Leu Ile Glu Gln Pro Ile
820 825 830
Lys Ala Asn Tyr Arg Glu Gly Leu Thr Val Leu Glu Tyr Phe Ile Ala
835 840 845
Thr His Gly Ala Arg Lys Gly Leu Val Asp Thr Ala Leu Arg Thr Ala
850 855 860
Glu Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ser Gln Asp Val
865 870 875 880
Ile Val Arg Glu Gly Asp Cys Gly Thr Arg Ala Gly Leu Lys Ile Asp
885 890 895
Ile Ala His Lys Asn Glu Phe Gly Glu Trp Glu Ala Ser Glu Thr Ile
900 905 910
Glu Thr Thr Ala Tyr Ala Arg Asn Leu Ala Arg Asp Ala Val Asn Glu
915 920 925
Ala Gly Glu Val Val Met Pro Ala Gly Thr Asp Leu Gly Asp Asp Gln
930 935 940
Leu Ala Glu Leu Val Ala Ala Gly Val Glu Gln Ile Val Cys Arg Ser
945 950 955 960
Val Leu Thr Cys Glu Ser Gln Val Gly Thr Cys Ala Ala Cys Tyr Gly
965 970 975
Arg Ser Leu Ala Thr Gly Lys Gln Val Asp Ile Gly Glu Ala Val Gly
980 985 990
Ile Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr Gln Leu Thr Met
995 1000 1005
Arg Thr Phe His Thr Gly Gly Ala Ala Ser Ala Ala Asp Ile Thr Gln
1010 1015 1020
Gly Leu Pro Arg Val Gln Glu Leu Phe Glu Ala Arg Ser Pro Lys Val
1025 1030 1035 1040
Glu Ala Lys Met Asn Glu Ala Ala Gly Arg Val His Ile Asp Asp Glu
1045 1050 1055
Asp Pro Ser Ala Arg Lys Val Val Ile Thr Arg Asp Asp Gly Lys Glu
1060 1065 1070
Asp Leu Val Ile Glu Val Ser Arg Arg Gln Lys Leu Leu Val Ser Glu
1075 1080 1085
Gly Gln His Ile Glu Ala Gly Thr Pro Leu Thr Glu Gly Gln Leu Asp
1090 1095 1100
Pro Lys Glu Ile Leu Arg Ile Met Gly Arg Asn Val Ser Gln Lys Met
1105 1110 1115 1120
Leu Val Asp Glu Val Gln Lys Val Tyr Arg Asp Gln Gly Val Gly Ile
1125 1130 1135
His Ala Lys His Ile Glu Val Ile Val Arg Gln Met Leu Arg Arg Val
1140 1145 1150
Thr Ile Leu Glu Pro Gly Asp Thr Thr Phe Met Pro Gly Glu Leu Val
1155 1160 1165
Asp Arg Met Ala Tyr Leu Thr Gln Asn Arg Arg Val Ala Ala Glu Gly
1170 1175 1180
Gly Gln Pro Ala Ser Gly Arg Gln Met Leu Met Gly Ile Thr Lys Ala
1185 1190 1195 1200
Ser Leu Ala Thr Asp Ser Trp Leu Ser Ala Ala Ser Phe Gln Glu Thr
1205 1210 1215
Thr Lys Val Leu Thr Glu Ala Ala Met Asn Gly Lys Ser Asp Ser Leu
1220 1225 1230
Val Gly Leu Lys Glu Asn Val Ile Leu Gly Lys Leu Ile Pro Ala Gly
1235 1240 1245
Thr Gly Leu Ser Arg Tyr Asn Asp Val Ile Val Glu Pro Thr Ala Glu
1250 1255 1260
Ala Met Ala Asn Ser Asn Tyr Ser Glu Ala Asp Phe Gly Asp Gly Ser
1265 1270 1275 1280
Val Ser Glu Asp Phe Leu Asp Ala Leu Gly Ala Ile Asp Phe Gly Met
1285 1290 1295
Asn Phe Arg Glu
1300
<210> 42
<211> 1299
<212> PRT
<213> Streptomyces coelicolor
<400> 42
Met Leu Asp Val Asn Phe Phe Asp Glu Leu Arg Ile Gly Leu Ala Thr
1 5 10 15
Ala Asp Asp Ile Arg Gln Trp Ser His Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Cys
35 40 45
Glu Lys Ile Phe Gly Pro Thr Arg Asp Trp Glu Cys Tyr Cys Gly Lys
50 55 60
Tyr Lys Arg Val Arg Phe Lys Gly Ile Ile Cys Glu Arg Cys Gly Val
65 70 75 80
Glu Val Thr Arg Ala Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Thr His Ile Trp Tyr Phe Lys Gly Val Pro Ser
100 105 110
Arg Leu Gly Tyr Leu Leu Asp Leu Ala Pro Lys Asp Leu Glu Lys Val
115 120 125
Ile Tyr Phe Ala Ala Tyr Met Ile Thr Phe Val Asp Glu Glu Arg Arg
130 135 140
Thr Arg Asp Leu Pro Ser Leu Glu Ala His Val Ser Val Glu Arg Gln
145 150 155 160
Gln Ile Glu Gln Arg Arg Asp Ser Asp Leu Glu Ala Arg Ala Lys Lys
165 170 175
Leu Glu Thr Asp Leu Ala Glu Leu Glu Ala Glu Gly Ala Lys Ala Asp
180 185 190
Val Arg Arg Lys Val Arg Glu Gly Ala Glu Arg Glu Met Lys Gln Leu
195 200 205
Arg Asp Arg Ala Gln Arg Glu Ile Asp Arg Leu Asp Glu Val Trp Asn
210 215 220
Arg Phe Lys Asn Leu Lys Val Gln Asp Leu Glu Gly Asp Glu Leu Leu
225 230 235 240
Tyr Arg Glu Leu Arg Asp Arg Phe Gly Thr Tyr Phe Asp Gly Ser Met
245 250 255
Gly Ala Ala Ala Leu Gln Lys Arg Leu Glu Ser Phe Asp Leu Asp Glu
260 265 270
Glu Ala Glu Arg Leu Arg Glu Ile Ile Arg Thr Gly Lys Gly Gln Lys
275 280 285
Lys Thr Arg Ala Leu Lys Arg Leu Lys Val Val Ser Ala Phe Leu Gln
290 295 300
Thr Ser Asn Ser Pro Lys Gly Met Val Leu Asp Cys Val Pro Val Ile
305 310 315 320
Pro Pro Asp Leu Arg Pro Met Val Gln Leu Asp Gly Gly Arg Phe Ala
325 330 335
Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn
340 345 350
Arg Leu Lys Arg Leu Leu Asp Leu Gly Ala Pro Glu Ile Ile Val Asn
355 360 365
Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp Ala Leu Phe Asp Asn
370 375 380
Gly Arg Arg Gly Arg Pro Val Thr Gly Pro Gly Asn Arg Pro Leu Lys
385 390 395 400
Ser Leu Ser Asp Met Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn
405 410 415
Leu Leu Gly Lys Arg Val Asp Tyr Ser Ala Arg Ser Val Ile Val Val
420 425 430
Gly Pro Gln Leu Lys Leu His Gln Cys Gly Leu Pro Lys Ala Met Ala
435 440 445
Leu Glu Leu Phe Lys Pro Phe Val Met Lys Arg Leu Val Asp Leu Asn
450 455 460
His Ala Gln Asn Ile Lys Ser Ala Lys Arg Met Val Glu Arg Gly Arg
465 470 475 480
Thr Val Val Tyr Asp Val Leu Glu Glu Val Ile Ala Glu His Pro Val
485 490 495
Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe
500 505 510
Glu Pro Gln Leu Val Glu Gly Lys Ala Ile Gln Ile His Pro Leu Val
515 520 525
Cys Thr Ala Phe Asn Ala Asp Phe Asp Gly Asp Gln Met Ala Val His
530 535 540
Leu Pro Leu Ser Ala Glu Ala Gln Ala Glu Ala Arg Ile Leu Met Leu
545 550 555 560
Ser Ser Asn Asn Ile Leu Lys Pro Ala Asp Gly Arg Pro Val Thr Met
565 570 575
Pro Thr Gln Asp Met Val Leu Gly Leu Phe Phe Leu Thr Thr Asp Ser
580 585 590
Glu Gly Arg Ser Pro Lys Gly Glu Gly Arg Ala Phe Gly Ser Ser Ala
595 600 605
Glu Ala Ile Met Ala Phe Asp Ala Gly Asp Leu Thr Leu Gln Ala Lys
610 615 620
Ile Asp Ile Arg Phe Pro Val Gly Thr Ile Pro Pro Arg Gly Phe Glu
625 630 635 640
Pro Pro Ala Arg Glu Glu Gly Glu Pro Glu Trp Gln Gln Gly Asp Thr
645 650 655
Phe Thr Leu Lys Thr Thr Leu Gly Arg Ala Leu Phe Asn Glu Leu Leu
660 665 670
Pro Glu Asp Tyr Pro Phe Val Asp Tyr Glu Val Gly Lys Lys Gln Leu
675 680 685
Ser Glu Ile Val Asn Asp Leu Ala Glu Arg Tyr Pro Lys Val Ile Val
690 695 700
Ala Ala Thr Leu Asp Asn Leu Lys Ala Ala Gly Phe Phe Trp Ala Thr
705 710 715 720
Arg Ser Gly Val Thr Val Ala Ile Ser Asp Ile Val Val Pro Asp Ala
725 730 735
Lys Lys Glu Ile Val Lys Gly Tyr Glu Gly Gln Asp Glu Lys Val Gln
740 745 750
Lys Gln Tyr Glu Arg Gly Leu Ile Thr Lys Glu Glu Arg Thr Gln Glu
755 760 765
Leu Ile Ala Ile Trp Thr Lys Ala Thr Asn Glu Val Ala Glu Ala Met
770 775 780
Asn Asp Asn Phe Pro Lys Thr Asn Pro Val Ser Met Met Val Asn Ser
785 790 795 800
Gly Ala Arg Gly Asn Met Met Gln Met Arg Gln Ile Ala Gly Met Arg
805 810 815
Gly Leu Val Ser Asn Ala Lys Asn Glu Thr Ile Pro Arg Pro Ile Lys
820 825 830
Ala Ser Phe Arg Glu Gly Leu Ser Val Leu Glu Tyr Phe Ile Ser Thr
835 840 845
His Gly Ala Arg Lys Gly Leu Ala Asp Thr Ala Leu Arg Thr Ala Asp
850 855 860
Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ser Gln Asp Val Ile
865 870 875 880
Ile Arg Glu Glu Asp Cys Gly Thr Glu Arg Gly Leu Lys Leu Pro Ile
885 890 895
Ala Thr Arg Asp Ala Asp Gly Thr Leu Arg Lys Ala Glu Asp Val Glu
900 905 910
Thr Ser Val Tyr Ala Arg Met Leu Ala Glu Asp Val Val Ile Asp Gly
915 920 925
Lys Val Ile Ala Pro Ala Asn Val Asp Leu Gly Asp Val Leu Ile Asp
930 935 940
Ala Leu Val Ala His Gly Val Glu Glu Val Lys Thr Arg Ser Ile Leu
945 950 955 960
Thr Cys Glu Ser Gln Val Gly Thr Cys Ala Met Cys Tyr Gly Arg Ser
965 970 975
Leu Ala Thr Gly Lys Leu Val Asp Ile Gly Glu Ala Val Gly Ile Ile
980 985 990
Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr Gln Leu Thr Met Arg Thr
995 1000 1005
Phe His Thr Gly Gly Val Ala Gly Asp Asp Ile Thr Gln Gly Leu Pro
1010 1015 1020
Arg Val Val Glu Leu Phe Glu Ala Arg Thr Pro Lys Gly Val Ala Pro
1025 1030 1035 1040
Ile Ser Glu Ala Ser Gly Arg Val Arg Ile Glu Glu Thr Glu Lys Thr
1045 1050 1055
Lys Lys Ile Val Val Thr Pro Asp Asp Gly Ser Asp Glu Thr Ala Phe
1060 1065 1070
Pro Ile Ser Lys Arg Ala Arg Leu Leu Val Gly Glu Gly Asp His Val
1075 1080 1085
Glu Val Gly Gln Lys Leu Thr Val Gly Ala Thr Asn Pro His Asp Val
1090 1095 1100
Leu Arg Ile Leu Gly Gln Arg Ala Val Gln Val His Leu Val Gly Glu
1105 1110 1115 1120
Val Gln Lys Val Tyr Asn Ser Gln Gly Val Ser Ile His Asp Lys His
1125 1130 1135
Ile Glu Ile Ile Ile Arg Gln Met Leu Arg Arg Val Thr Ile Ile Glu
1140 1145 1150
Ser Gly Asp Ala Glu Leu Leu Pro Gly Glu Leu Val Glu Arg Thr Lys
1155 1160 1165
Phe Glu Thr Glu Asn Arg Arg Val Val Gln Glu Gly Gly His Pro Ala
1170 1175 1180
Ser Gly Arg Pro Gln Leu Met Gly Ile Thr Lys Ala Ser Leu Ala Thr
1185 1190 1195 1200
Glu Ser Trp Leu Ser Ala Ala Ser Phe Gln Glu Thr Thr Arg Val Leu
1205 1210 1215
Thr Asp Ala Ala Ile Asn Ala Lys Ser Asp Ser Leu Ile Gly Leu Lys
1220 1225 1230
Glu Asn Val Ile Ile Gly Lys Leu Ile Pro Ala Gly Thr Gly Leu Ser
1235 1240 1245
Arg Tyr Arg Asn Ile Arg Val Glu Pro Thr Glu Glu Ala Lys Ala Ala
1250 1255 1260
Met Tyr Ser Ala Val Gly Tyr Asp Asp Ile Asp Tyr Ser Pro Phe Gly
1265 1270 1275 1280
Thr Gly Ser Gly Gln Ala Val Pro Leu Glu Asp Tyr Asp Tyr Gly Pro
1285 1290 1295
Tyr Asn Gln
<210> 43
<211> 1336
<212> PRT
<213> Corynebacterium diphtheriae
<400> 43
Met Ile Asp Val Asn Phe Phe Asp Glu Leu Arg Ile Gly Leu Ala Thr
1 5 10 15
Ala Asp Asp Ile Arg Arg Trp Ser Lys Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Cys
35 40 45
Glu Arg Ile Phe Gly Pro Thr Arg Asp Trp Glu Cys Ala Cys Gly Lys
50 55 60
Tyr Lys Arg Val Arg Tyr Lys Gly Ile Ile Cys Glu Arg Cys Gly Val
65 70 75 80
Glu Val Thr Lys Ser Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Thr His Ile Trp Tyr Phe Lys Gly Val Pro Ser
100 105 110
Arg Leu Gly Tyr Leu Leu Asp Leu Ala Pro Lys Asp Leu Glu Arg Ile
115 120 125
Ile Tyr Phe Ala Ala Asn Ile Ile Thr Ser Val Asp Asp Glu Ala Arg
130 135 140
His Ala Asp Gln Thr Thr Leu Glu Ala Glu Met Leu Leu Glu Lys Lys
145 150 155 160
Asp Val Glu Ala Asp Met Glu Ser Glu Ile Ala Glu Arg Ala Ala Lys
165 170 175
Leu Glu Glu Asp Leu Ala Glu Leu Glu Ala Ala Gly Ala Lys Ala Asp
180 185 190
Ala Arg Asn Lys Val Lys Lys Ala Ala Glu Lys Glu Met Gln His Ile
195 200 205
Arg Glu Arg Ala Glu Arg Glu Ile Asp Arg Leu Glu Glu Ile Trp Gln
210 215 220
Thr Phe Ile Lys Leu Ala Pro Lys Gln Met Ile Ile Asp Glu Thr Ile
225 230 235 240
Tyr Glu Glu Leu Val Asp Arg Tyr Glu Asp Tyr Phe Thr Gly Gly Met
245 250 255
Gly Ala Glu Ala Ile Gln Thr Leu Ile Arg Asn Phe Asp Leu Asp Ser
260 265 270
Glu Ala Glu Glu Leu Arg Glu Ile Ile Asn Asn Gly Lys Gly Gln Lys
275 280 285
Lys Met Arg Ala Leu Lys Arg Leu Lys Val Val Ala Ala Phe Gln Arg
290 295 300
Ser Gly Asn Asp Pro Ala Gly Met Val Leu Asp Cys Ile Pro Val Ile
305 310 315 320
Pro Pro Glu Leu Arg Pro Met Val Gln Leu Asp Gly Gly Arg Phe Ala
325 330 335
Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn
340 345 350
Arg Leu Lys Arg Met Ile Glu Leu Gly Ala Pro Glu Ile Ile Val Asn
355 360 365
Asn Glu Lys Arg Met Leu Gln Glu Ser Val Asp Ala Leu Phe Asp Asn
370 375 380
Gly Arg Arg Gly Arg Pro Val Thr Gly Pro Gly Asn Arg Pro Leu Lys
385 390 395 400
Ser Leu Ser Asp Leu Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn
405 410 415
Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg Ser Val Ile Ile Val
420 425 430
Gly Pro Gln Leu Lys Leu His Glu Cys Gly Leu Pro Lys Leu Met Ala
435 440 445
Leu Glu Leu Phe Lys Pro Phe Val Met Lys Arg Leu Val Glu Asn Asp
450 455 460
Tyr Ala Gln Asn Ile Lys Ser Ala Lys Arg Met Val Glu Arg Gln Arg
465 470 475 480
Pro Glu Val Trp Asp Val Leu Glu Glu Ala Ile Ser Glu His Pro Val
485 490 495
Met Leu Asn Arg Ala Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe
500 505 510
Glu Pro Lys Leu Val Glu Gly Lys Ala Ile Gln Leu His Pro Leu Ala
515 520 525
Cys Glu Ala Phe Asn Ala Asp Phe Asp Gly Asp Gln Met Ala Val His
530 535 540
Leu Pro Leu Ser Ala Glu Ala Gln Ala Glu Ala Arg Ile Leu Met Leu
545 550 555 560
Ala Ser Asn Asn Ile Leu Ser Pro Ala Ser Gly Lys Pro Leu Ala Met
565 570 575
Pro Arg Leu Asp Met Val Thr Gly Leu Tyr Tyr Leu Thr Met Asp Lys
580 585 590
Asn Glu Asn Glu Ile Gly Gly Gln Gly Ala Tyr Ala Ser Ala Thr Glu
595 600 605
Glu Gly Pro Ala Gln Gly Val Tyr Ser Ser Tyr Ala Glu Ala Ile Met
610 615 620
Ala Arg Asp Arg Gly Val Leu Gly Leu Gln Ala Lys Ile Lys Val Arg
625 630 635 640
Ile Ser His Leu Arg Pro Pro Val Asp Ile Glu Ala Glu Gln Phe Pro
645 650 655
Glu Gly Trp Asn Lys Gly Asp Val Trp Leu Ala Asp Thr Thr Leu Gly
660 665 670
Arg Ile Met Phe Asn Glu Leu Leu Pro Trp Asn Tyr Pro Tyr Leu Glu
675 680 685
Gly Val Met Val Arg Lys Gly Gly Gly Thr Gly Lys Ile Met Leu Gly
690 695 700
Asp Val Ile Asn Asp Leu Ala Ala Thr Tyr Pro Met Ile Thr Val Ala
705 710 715 720
Gln Thr Met Asp Lys Met Lys Asp Ala Gly Phe Tyr Trp Ala Thr Arg
725 730 735
Ser Gly Val Thr Ile Thr Met Ser Asp Val Leu Val Leu Pro Asn Lys
740 745 750
Glu Glu Ile Leu Asp Arg Tyr Glu Ala Glu Ala Arg Lys Ile Glu Arg
755 760 765
Lys Tyr Trp Glu Gln Gly Ala Leu Thr Glu Arg Glu Arg Tyr Asp Arg
770 775 780
Leu Val Glu Leu Trp Lys Asp Ala Thr Asp Glu Val Gly Asn Ala Val
785 790 795 800
Glu Lys Leu Tyr Pro Asp Asp Asn Pro Ile Pro Met Ile Val Lys Ser
805 810 815
Gly Ala Ala Gly Asn Met Arg Gln Ile Trp Thr Leu Ala Gly Met Lys
820 825 830
Gly Met Val Val Asn Ser Lys Gly Asp Tyr Ile Thr Arg Pro Ile Lys
835 840 845
Thr Ser Phe Arg Glu Gly Leu Ser Val Leu Glu Tyr Phe Asn Asn Ser
850 855 860
His Gly Ser Arg Lys Gly Leu Ala Asp Thr Ala Leu Arg Thr Ala Asp
865 870 875 880
Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ala Gln Asp Val Ile
885 890 895
Val Arg Glu Asp Asp Cys Gly Thr Lys Gln Gly Ile Arg Val Pro Val
900 905 910
Ala Val Glu Val Lys Asp Ala Glu Gly Asn Val Thr Gly Tyr Thr Gly
915 920 925
His Ser Leu Ile Glu Thr Ser Val Ala Gly Arg Val Ala Ala Thr Ala
930 935 940
Val Lys Asp Ala Glu Gly Asn Val Met Val Glu Pro Gly Glu Asn Leu
945 950 955 960
Thr Asp Gln Leu Ile Asp Glu Leu Ile Ala Ala Gly Val Lys Glu Val
965 970 975
Lys Val Arg Ser Val Leu Thr Cys Gln Thr Pro Thr Gly Val Cys Ala
980 985 990
Lys Cys Tyr Gly Lys Ser Met Ala Thr Gly Lys Leu Val Asp Ile Gly
995 1000 1005
Glu Ala Val Gly Ile Val Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
1010 1015 1020
Gln Leu Thr Met Arg Thr Phe His Gln Gly Gly Val Gly Gly Asp Ile
1025 1030 1035 1040
Thr Gly Gly Leu Pro Arg Val Gln Glu Leu Phe Glu Ala Arg Val Pro
1045 1050 1055
Lys Asn Arg Ala Pro Ile Ala Ser Val Ala Gly Thr Val His Leu Asp
1060 1065 1070
Asp Glu Gly Asn Phe Tyr Thr Leu Thr Ile Asn Pro Asp Asp Gly Ser
1075 1080 1085
Asp Val Val Val Tyr Glu Lys Leu Ser Lys Arg Gln Gly Leu Ala Thr
1090 1095 1100
Val Arg Val Pro Met Glu Ser Asn Pro Gly Ala Met Ile Glu Arg Thr
1105 1110 1115 1120
Leu Ala Glu Gly Asp His Val Glu Val Gly Asp Arg Leu Leu Arg Gly
1125 1130 1135
Pro Ala Asp Pro His Asp Val Leu Glu Val Leu Gly Arg Arg Gly Val
1140 1145 1150
Glu Gln His Leu Val Asp Glu Val Gln Asp Val Tyr Arg Ala Gln Gly
1155 1160 1165
Val Ala Ile His Asp Lys His Ile Glu Ile Ile Ile Arg Gln Met Leu
1170 1175 1180
Arg Arg Gly Thr Val Ile Glu Ser Gly Ser Thr Glu Phe Leu Pro Gly
1185 1190 1195 1200
Thr Leu Val Asp Leu Ser Glu Ala Lys Ala Ala Asn Ala Glu Ala Leu
1205 1210 1215
Ala Asn Gly Gly Gln Pro Ala Glu Leu Arg Ser Glu Ile Met Gly Ile
1220 1225 1230
Thr Lys Ala Ser Leu Ala Thr Glu Ser Trp Leu Ser Ala Ala Ser Phe
1235 1240 1245
Gln Glu Thr Thr Arg Val Leu Thr Asp Ala Ala Ile Asn Lys Arg Ser
1250 1255 1260
Asp Lys Leu Ile Gly Leu Lys Glu Asn Val Ile Ile Gly Lys Leu Ile
1265 1270 1275 1280
Pro Ala Gly Thr Gly Ile Ser Arg Tyr Arg Asn Ile Ser Val Lys Pro
1285 1290 1295
Thr Glu Ala Ala Arg Asn Ala Ala Tyr Ser Ile Pro Thr Tyr Gly Asp
1300 1305 1310
Ser Ile Tyr Gly Asp Asp Gly Tyr Gly Glu Phe Thr Gly Ala Ser Val
1315 1320 1325
Pro Leu Asp Glu Ala Tyr Asp Leu
1330 1335
<210> 44
<211> 1316
<212> PRT
<213> Mycobacterium tuberculosis
<400> 44
Met Leu Asp Val Asn Phe Phe Asp Glu Leu Arg Ile Gly Leu Ala Thr
1 5 10 15
Ala Glu Asp Ile Arg Gln Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Cys
35 40 45
Glu Lys Ile Phe Gly Pro Thr Arg Asp Trp Glu Cys Tyr Cys Gly Lys
50 55 60
Tyr Lys Arg Val Arg Phe Lys Gly Ile Ile Cys Glu Arg Cys Gly Val
65 70 75 80
Glu Val Thr Arg Ala Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Thr His Ile Trp Tyr Phe Lys Gly Val Pro Ser
100 105 110
Arg Leu Gly Tyr Leu Leu Asp Leu Ala Pro Lys Asp Leu Glu Lys Ile
115 120 125
Ile Tyr Phe Ala Ala Tyr Val Ile Thr Ser Val Asp Glu Glu Met Arg
130 135 140
His Asn Glu Leu Ser Thr Leu Glu Ala Glu Met Ala Val Glu Arg Lys
145 150 155 160
Ala Val Glu Asp Gln Arg Asp Gly Glu Leu Glu Ala Arg Ala Gln Lys
165 170 175
Leu Glu Ala Asp Leu Ala Glu Leu Glu Ala Glu Gly Ala Lys Ala Asp
180 185 190
Ala Arg Arg Lys Val Arg Asp Gly Gly Glu Arg Glu Met Arg Gln Ile
195 200 205
Arg Asp Arg Ala Gln Arg Glu Leu Asp Arg Leu Glu Asp Ile Trp Ser
210 215 220
Thr Phe Thr Lys Leu Ala Pro Lys Gln Leu Ile Val Asp Glu Asn Leu
225 230 235 240
Tyr Arg Glu Leu Val Asp Arg Tyr Gly Glu Tyr Phe Thr Gly Ala Met
245 250 255
Gly Ala Glu Ser Ile Gln Lys Leu Ile Glu Asn Phe Asp Ile Asp Ala
260 265 270
Glu Ala Glu Ser Leu Arg Asp Val Ile Arg Asn Gly Lys Gly Gln Lys
275 280 285
Lys Leu Arg Ala Leu Lys Arg Leu Lys Val Val Ala Ala Phe Gln Gln
290 295 300
Ser Gly Asn Ser Pro Met Gly Met Val Leu Asp Ala Val Pro Val Ile
305 310 315 320
Pro Pro Glu Leu Arg Pro Met Val Gln Leu Asp Gly Gly Arg Phe Ala
325 330 335
Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn
340 345 350
Arg Leu Lys Arg Leu Ile Asp Leu Gly Ala Pro Glu Ile Ile Val Asn
355 360 365
Asn Glu Lys Arg Met Leu Gln Glu Ser Val Asp Ala Leu Phe Asp Asn
370 375 380
Gly Arg Arg Gly Arg Pro Val Thr Gly Pro Gly Asn Arg Pro Leu Lys
385 390 395 400
Ser Leu Ser Asp Leu Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn
405 410 415
Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg Ser Val Ile Val Val
420 425 430
Gly Pro Gln Leu Lys Leu His Gln Cys Gly Leu Pro Lys Leu Met Ala
435 440 445
Leu Glu Leu Phe Lys Pro Phe Val Met Lys Arg Leu Val Asp Leu Asn
450 455 460
His Ala Gln Asn Ile Lys Ser Ala Lys Arg Met Val Glu Arg Gln Arg
465 470 475 480
Pro Gln Val Trp Asp Val Leu Glu Glu Val Ile Ala Glu His Pro Val
485 490 495
Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe
500 505 510
Glu Pro Met Leu Val Glu Gly Lys Ala Ile Gln Leu His Pro Leu Val
515 520 525
Cys Glu Ala Phe Asn Ala Asp Phe Asp Gly Asp Gln Met Ala Val His
530 535 540
Leu Pro Leu Ser Ala Glu Ala Gln Ala Glu Ala Arg Ile Leu Met Leu
545 550 555 560
Ser Ser Asn Asn Ile Leu Ser Pro Ala Ser Gly Arg Pro Leu Ala Met
565 570 575
Pro Arg Leu Asp Met Val Thr Gly Leu Tyr Tyr Leu Thr Thr Glu Val
580 585 590
Pro Gly Asp Thr Gly Glu Tyr Gln Pro Ala Ser Gly Asp His Pro Glu
595 600 605
Thr Gly Val Tyr Ser Ser Pro Ala Glu Ala Ile Met Ala Ala Asp Arg
610 615 620
Gly Val Leu Ser Val Arg Ala Lys Ile Lys Val Arg Leu Thr Gln Leu
625 630 635 640
Arg Pro Pro Val Glu Ile Glu Ala Glu Leu Phe Gly His Ser Gly Trp
645 650 655
Gln Pro Gly Asp Ala Trp Met Ala Glu Thr Thr Leu Gly Arg Val Met
660 665 670
Phe Asn Glu Leu Leu Pro Leu Gly Tyr Pro Phe Val Asn Lys Gln Met
675 680 685
His Lys Lys Val Gln Ala Ala Ile Ile Asn Asp Leu Ala Glu Arg Tyr
690 695 700
Pro Met Ile Val Val Ala Gln Thr Val Asp Lys Leu Lys Asp Ala Gly
705 710 715 720
Phe Tyr Trp Ala Thr Arg Ser Gly Val Thr Val Ser Met Ala Asp Val
725 730 735
Leu Val Pro Pro Arg Lys Lys Glu Ile Leu Asp His Tyr Glu Glu Arg
740 745 750
Ala Asp Lys Val Glu Lys Gln Phe Gln Arg Gly Ala Leu Asn His Asp
755 760 765
Glu Arg Asn Glu Ala Leu Val Glu Ile Trp Lys Glu Ala Thr Asp Glu
770 775 780
Val Gly Gln Ala Leu Arg Glu His Tyr Pro Asp Asp Asn Pro Ile Ile
785 790 795 800
Thr Ile Val Asp Ser Gly Ala Thr Gly Asn Phe Thr Gln Thr Arg Thr
805 810 815
Leu Ala Gly Met Lys Gly Leu Val Thr Asn Pro Lys Gly Glu Phe Ile
820 825 830
Pro Arg Pro Val Lys Ser Ser Phe Arg Glu Gly Leu Thr Val Leu Glu
835 840 845
Tyr Phe Ile Asn Thr His Gly Ala Arg Lys Gly Leu Ala Asp Thr Ala
850 855 860
Leu Arg Thr Ala Asp Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val
865 870 875 880
Ser Gln Asp Val Ile Val Arg Glu His Asp Cys Gln Thr Glu Arg Gly
885 890 895
Ile Val Val Glu Leu Ala Glu Arg Ala Pro Asp Gly Thr Leu Ile Arg
900 905 910
Asp Pro Tyr Ile Glu Thr Ser Ala Tyr Ala Arg Thr Leu Gly Thr Asp
915 920 925
Ala Val Asp Glu Ala Gly Asn Val Ile Val Glu Arg Gly Gln Asp Leu
930 935 940
Gly Asp Pro Glu Ile Asp Ala Leu Leu Ala Ala Gly Ile Thr Gln Val
945 950 955 960
Lys Val Arg Ser Val Leu Thr Cys Ala Thr Ser Thr Gly Val Cys Ala
965 970 975
Thr Cys Tyr Gly Arg Ser Met Ala Thr Gly Lys Leu Val Asp Ile Gly
980 985 990
Glu Ala Val Gly Ile Val Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
995 1000 1005
Gln Leu Thr Met Arg Thr Phe His Gln Gly Gly Val Gly Glu Asp Ile
1010 1015 1020
Thr Gly Gly Leu Pro Arg Val Gln Glu Leu Phe Glu Ala Arg Val Pro
1025 1030 1035 1040
Arg Gly Lys Ala Pro Ile Ala Asp Val Thr Gly Arg Val Arg Leu Glu
1045 1050 1055
Asp Gly Glu Arg Phe Tyr Lys Ile Thr Ile Val Pro Asp Asp Gly Gly
1060 1065 1070
Glu Glu Val Val Tyr Asp Lys Ile Ser Lys Arg Gln Arg Leu Arg Val
1075 1080 1085
Phe Lys His Glu Asp Gly Ser Glu Arg Val Leu Ser Asp Gly Asp His
1090 1095 1100
Val Glu Val Gly Gln Gln Leu Met Glu Gly Ser Ala Asp Pro His Glu
1105 1110 1115 1120
Val Leu Arg Val Gln Gly Pro Arg Glu Val Gln Ile His Leu Val Arg
1125 1130 1135
Glu Val Gln Glu Val Tyr Arg Ala Gln Gly Val Ser Ile His Asp Lys
1140 1145 1150
His Ile Glu Val Ile Val Arg Gln Met Leu Arg Arg Val Thr Ile Ile
1155 1160 1165
Asp Ser Gly Ser Thr Glu Phe Leu Pro Gly Ser Leu Ile Asp Arg Ala
1170 1175 1180
Glu Phe Glu Ala Glu Asn Arg Arg Val Val Ala Glu Gly Gly Glu Pro
1185 1190 1195 1200
Ala Ala Gly Arg Pro Val Leu Met Gly Ile Thr Lys Ala Ser Leu Ala
1205 1210 1215
Thr Asp Ser Trp Leu Ser Ala Ala Ser Phe Gln Glu Thr Thr Arg Val
1220 1225 1230
Leu Thr Asp Ala Ala Ile Asn Cys Arg Ser Asp Lys Leu Asn Gly Leu
1235 1240 1245
Lys Glu Asn Val Ile Ile Gly Lys Leu Ile Pro Ala Gly Thr Gly Ile
1250 1255 1260
Asn Arg Tyr Arg Asn Ile Ala Val Gln Pro Thr Glu Glu Ala Arg Ala
1265 1270 1275 1280
Ala Ala Tyr Thr Ile Pro Ser Tyr Glu Asp Gln Tyr Tyr Ser Pro Asp
1285 1290 1295
Phe Gly Ala Ala Thr Gly Ala Ala Val Pro Leu Asp Asp Tyr Gly Tyr
1300 1305 1310
Ser Asp Tyr Arg
1315
<210> 45
<211> 1319
<212> PRT
<213> Rhodococcus equi
<400> 45
Met Leu Asp Val Asn Phe Phe Asp Glu Leu Arg Ile Gly Leu Ala Thr
1 5 10 15
Ala Glu Asp Ile Arg Asn Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Cys
35 40 45
Glu Lys Ile Phe Gly Pro Thr Arg Asp Trp Glu Cys Tyr Cys Gly Lys
50 55 60
Tyr Lys Arg Val Arg Phe Lys Gly Ile Ile Cys Glu Arg Cys Gly Val
65 70 75 80
Glu Val Thr Arg Ala Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Thr His Ile Trp Tyr Phe Lys Gly Val Pro Ser
100 105 110
Arg Leu Gly Tyr Leu Leu Asp Leu Ala Pro Lys Asp Leu Glu Lys Ile
115 120 125
Ile Tyr Phe Ala Ala Tyr Val Ile Val Gly Val Asp Glu Glu Leu Arg
130 135 140
His Asn Glu Leu Ser Thr Leu Glu Ala Glu Met Glu Val Glu Lys Lys
145 150 155 160
Thr Val Ala Asp Gln Arg Asp Ala Asp Leu Glu Ala Arg Ala Gln Lys
165 170 175
Leu Glu Ala Asp Ile Ala Glu Leu Glu Ala Glu Gly Ala Lys Ser Asp
180 185 190
Val Arg Arg Lys Val Lys Asp Gly Gly Glu Arg Glu Met Arg Gln Leu
195 200 205
Arg Asp Arg Ala Gln Arg Glu Leu Asp Arg Leu Asp Glu Ile Trp Thr
210 215 220
Thr Phe Thr Lys Leu Ser Val Lys Gln Leu Ile Val Asp Glu Ser Leu
225 230 235 240
Tyr Arg Glu Leu Val Asp Arg Tyr Gly Glu Tyr Phe Thr Gly Ala Met
245 250 255
Gly Ala Glu Ser Ile Gln Lys Leu Met Glu Asn Phe Asp Ile Glu Ala
260 265 270
Glu Ala Glu Ser Leu Arg Glu Thr Ile Arg Ser Gly Lys Gly Gln Lys
275 280 285
Lys Leu Arg Ala Leu Lys Arg Leu Lys Val Val Ala Ala Phe Gln Gln
290 295 300
Ser Gly Asn Ser Pro Met Gly Met Val Leu Asp Ala Val Pro Val Ile
305 310 315 320
Pro Pro Glu Leu Arg Pro Met Val Gln Leu Asp Gly Gly Arg Phe Ala
325 330 335
Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn
340 345 350
Arg Leu Lys Arg Leu Ile Asp Leu Gly Ala Pro Glu Ile Ile Val Asn
355 360 365
Asn Glu Lys Arg Met Leu Gln Glu Ser Val Asp Ala Leu Phe Asp Asn
370 375 380
Gly Arg Arg Gly Arg Pro Val Thr Gly Pro Gly Asn Arg Pro Leu Lys
385 390 395 400
Ser Leu Ser Asp Leu Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn
405 410 415
Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg Ser Val Ile Val Val
420 425 430
Gly Pro Gln Leu Lys Leu His Gln Cys Gly Leu Pro Lys Leu Met Ala
435 440 445
Leu Glu Leu Phe Lys Pro Phe Val Met Lys Arg Leu Val Asp Leu Asn
450 455 460
His Ala Gln Asn Ile Lys Ser Ala Lys Arg Met Val Glu Arg Gln Arg
465 470 475 480
Pro Gln Val Trp Asp Val Leu Glu Glu Val Ile Asn Glu His Pro Val
485 490 495
Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe
500 505 510
Glu Pro Gln Leu Val Glu Gly Lys Ala Ile Gln Leu His Pro Leu Val
515 520 525
Cys Glu Ala Phe Asn Ala Asp Phe Asp Gly Asp Gln Met Ala Val His
530 535 540
Leu Pro Leu Ser Ala Glu Ala Gln Ala Glu Ala Arg Ile Leu Met Leu
545 550 555 560
Ser Ser Asn Asn Ile Leu Ser Pro Ala Ser Gly Arg Pro Leu Ala Met
565 570 575
Pro Arg Leu Asp Met Val Thr Gly Leu Phe His Leu Thr Arg Glu Val
580 585 590
Glu Gly Ala Ile Gly Ala Tyr Gln Pro Ala Ala Asp Gly Gln Pro Glu
595 600 605
Gln Gly Val Tyr Ser Ser Pro Ala Glu Ala Gln Met Ala Val Asp Arg
610 615 620
Gly Val Leu Ser Val Gln Ala Lys Ile Lys Val Arg Leu Thr His Gln
625 630 635 640
Arg Pro Pro Arg Glu Ile Glu Ala Glu Leu Phe Pro Glu Gly Trp Asn
645 650 655
Phe Gly Asp Gly Trp Met Val Glu Thr Thr Leu Gly Arg Val Met Phe
660 665 670
Asn Asp Leu Leu Pro Ala Asp Tyr Pro Phe Ile Asn Glu Gln Met Pro
675 680 685
Lys Lys Arg Gln Ala Thr Ile Ile Asn Asp Leu Ala Glu Arg Tyr Pro
690 695 700
Met Ile Val Val Ala Gln Thr Val Asp Lys Met Lys Asp Thr Gly Phe
705 710 715 720
Tyr Trp Ala Thr Arg Ser Gly Val Thr Val Ser Ile Ser Asp Val Leu
725 730 735
Val Pro Pro Glu Lys Ala Gln Ile Met Glu Gln Phe Glu Ala Gln Ala
740 745 750
Asp Gln Ile Glu Lys Lys Tyr Gln Arg Gly Ala Leu Asn His Thr Glu
755 760 765
Arg Asn Ser Ala Leu Val Lys Ile Trp Ser Glu Ala Thr Asp Glu Val
770 775 780
Gly Lys Ala Met Glu Ala His Phe Pro Asp Asp Asn Pro Ile Pro Met
785 790 795 800
Ile Val Lys Ser Gly Ala Ala Gly Asn Met Thr Gln Val Arg Ser Leu
805 810 815
Ala Gly Met Lys Gly Leu Val Thr Asn Pro Lys Gly Glu Phe Ile Pro
820 825 830
Arg Pro Ile Lys Ser Ser Phe Lys Glu Gly Leu Thr Val Leu Glu Tyr
835 840 845
Phe Ile Asn Thr His Gly Ala Arg Lys Gly Leu Ala Asp Thr Ala Leu
850 855 860
Arg Thr Ala Asp Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ser
865 870 875 880
Gln Asp Val Ile Val Arg Glu Val Asp Cys Gly Thr Glu Arg Gly Ile
885 890 895
Leu Thr Thr Ile Ala Glu Lys Ala Ala Asp Gly Thr Met Ile Arg Asp
900 905 910
Ala His Val Glu Thr Ser Thr Tyr Ala Arg Thr Leu Ala Ala Asp Ala
915 920 925
Ile Asp Glu Asn Gly Asn Val Val Val Glu Arg Gly His Asp Leu Gly
930 935 940
Asp Pro Ala Ile Asp Ala Leu Leu Ala Ala Gly Ile Thr Gln Val Lys
945 950 955 960
Val Arg Ser Val Leu Thr Cys Thr Thr Ala Thr Gly Val Cys Ala Thr
965 970 975
Cys Tyr Gly Arg Ser Met Ala Thr Gly Lys Leu Val Asp Ile Gly Glu
980 985 990
Ala Val Gly Ile Val Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr Gln
995 1000 1005
Leu Thr Met Arg Thr Phe His Gln Gly Gly Ala Ala Gly Ala Ala Asp
1010 1015 1020
Ile Thr Gly Gly Leu Pro Arg Val Gln Glu Leu Phe Glu Ala Arg Val
1025 1030 1035 1040
Pro Lys Gly Lys Ala Pro Ile Thr Glu Val Ser Gly Arg Val Gln Leu
1045 1050 1055
Asp Asp Asp Asp Arg Phe Tyr Lys Ile Thr Val Val Pro Asp Asp Gly
1060 1065 1070
Gly Glu Glu Val Val Tyr Asp Lys Leu Ser Lys Arg Gln Arg Leu Arg
1075 1080 1085
Val Phe Lys His Asp Asp Gly Ser Glu Arg Leu Leu Ser Asp Gly Asp
1090 1095 1100
His Val Asp Val Gly Gln Gln Leu Leu Glu Gly Ala Ala Asp Pro His
1105 1110 1115 1120
Asp Val Leu Arg Val Met Gly Pro Arg Gln Val Gln Ile His Leu Val
1125 1130 1135
Asn Glu Val Gln Glu Val Tyr Arg Ser Gln Gly Val Ser Ile His Asp
1140 1145 1150
Lys His Ile Glu Val Ile Val Arg Gln Met Leu Arg Arg Val Thr Ile
1155 1160 1165
Ile Asp Ser Gly Ser Thr Glu Phe Leu Pro Gly Ser Leu Val Glu Arg
1170 1175 1180
Ala Glu Phe Glu Ala Ser Asn Arg Arg Val Val Ala Glu Gly Gly Glu
1185 1190 1195 1200
Pro Ala Ala Gly Arg Pro Val Leu Met Gly Ile Thr Lys Ala Ser Leu
1205 1210 1215
Ala Thr Asp Ser Trp Leu Ser Ala Ala Ser Phe Gln Glu Thr Thr Arg
1220 1225 1230
Val Leu Thr Asp Ala Ala Ile Asn Cys Arg Ser Asp Lys Leu Ile Gly
1235 1240 1245
Leu Lys Glu Asn Val Ile Ile Gly Lys Leu Ile Pro Ala Gly Thr Gly
1250 1255 1260
Ile Asn Arg Tyr Arg Asn Ile Gln Val Gln Pro Thr Glu Glu Ala Arg
1265 1270 1275 1280
Ala Ala Ala Tyr Ala Val Pro Ser Tyr Asp Asp Gln Tyr Tyr Ser Pro
1285 1290 1295
Glu Gly Phe Gly Thr Gly Thr Gly Ala Ala Val Pro Leu Asp Asp Tyr
1300 1305 1310
Gly Phe Gly Ser Asp Tyr Arg
1315
<210> 46
<211> 1396
<212> PRT
<213> Chlamydia trachomatis
<400> 46
Met Phe Arg Glu Gly Ser Arg Asp Asp Ala Ala Leu Val Lys Glu Gly
1 5 10 15
Leu Phe Asp Lys Leu Glu Ile Gly Ile Ala Ser Asp Val Thr Ile Arg
20 25 30
Asp Lys Trp Ser Cys Gly Glu Ile Lys Lys Pro Glu Thr Ile Asn Tyr
35 40 45
Arg Thr Phe Lys Pro Glu Lys Gly Gly Leu Phe Cys Glu Lys Ile Phe
50 55 60
Gly Pro Thr Lys Asp Trp Glu Cys Tyr Cys Gly Lys Tyr Lys Lys Ile
65 70 75 80
Lys His Lys Gly Ile Val Cys Asp Arg Cys Gly Val Glu Val Thr Leu
85 90 95
Ser Lys Val Arg Arg Glu Arg Met Ala His Ile Glu Leu Ala Val Pro
100 105 110
Ile Val His Ile Trp Phe Phe Lys Thr Thr Pro Ser Arg Ile Gly Asn
115 120 125
Val Leu Gly Met Thr Ala Ser Asp Leu Glu Arg Val Ile Tyr Tyr Glu
130 135 140
Glu Tyr Val Val Ile Asp Pro Gly Asn Thr Asp Leu Val Lys Lys Gln
145 150 155 160
Leu Leu Asn Asp Ala Lys Tyr Arg Glu Val Val Glu Lys Trp Gly Lys
165 170 175
Asp Ala Phe Val Ala Lys Met Gly Gly Glu Ala Val Tyr Asp Leu Leu
180 185 190
Lys Ser Glu Asp Leu Glu Ser Leu Leu Gly Glu Leu Lys Glu Arg Leu
195 200 205
Arg Lys Thr Lys Ser Gln Gln Ala Arg Met Lys Leu Ala Lys Arg Leu
210 215 220
Lys Ile Val Glu Gly Phe Val Ser Ser Ser Asn Arg Pro Glu Trp Met
225 230 235 240
Val Leu Lys Asn Ile Pro Val Val Pro Pro Asp Leu Arg Pro Leu Val
245 250 255
Pro Leu Asp Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr
260 265 270
Arg Arg Val Ile Asn Arg Asn Asn Arg Leu Lys Ala Ile Leu Arg Leu
275 280 285
Lys Thr Pro Glu Val Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu
290 295 300
Ala Val Asp Ala Leu Phe Asp Asn Gly Arg His Gly His Pro Val Met
305 310 315 320
Gly Ala Gly Asn Arg Pro Leu Lys Ser Leu Ser Glu Met Leu Lys Gly
325 330 335
Lys Asn Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr
340 345 350
Ser Gly Arg Ser Val Ile Ile Val Gly Pro Glu Leu Lys Phe Asn Gln
355 360 365
Cys Gly Leu Pro Lys Glu Met Ala Leu Glu Leu Phe Glu Pro Phe Ile
370 375 380
Ile Lys Arg Leu Lys Asp Gln Gly Ser Val Tyr Thr Ile Arg Ser Ala
385 390 395 400
Lys Lys Met Ile Gln Arg Gly Ala Pro Glu Val Trp Asp Val Leu Glu
405 410 415
Glu Ile Ile Lys Gly His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu
420 425 430
His Arg Leu Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys
435 440 445
Ala Ile Arg Val His Pro Leu Val Cys Ala Ala Phe Asn Ala Asp Phe
450 455 460
Asp Gly Asp Gln Met Ala Val His Val Pro Leu Ser Ile Glu Ala Gln
465 470 475 480
Leu Glu Ala Lys Val Leu Met Met Ala Pro Asp Asn Ile Phe Leu Pro
485 490 495
Ser Ser Gly Lys Pro Val Ala Thr Pro Ser Lys Asp Met Thr Leu Gly
500 505 510
Ile Tyr Tyr Leu Met Ala Asp Pro Thr Tyr Phe Pro Glu Glu His Gly
515 520 525
Gly Lys Thr Lys Ala Phe Lys Asp Glu Val Glu Val Leu Arg Ala Leu
530 535 540
Asn Ala Gly Gly Phe Ile Leu Lys Asp Glu Ile Cys Gly Ser Arg Arg
545 550 555 560
Asp Glu Thr Gly Arg Gly Ile His Ile His Glu Lys Ile Lys Val Arg
565 570 575
Ile Asp Gly Gln Ile Ile Glu Thr Thr Pro Gly Arg Val Phe Phe Asn
580 585 590
Thr Ile Val Pro Lys Glu Leu Gly Phe Gln Asn Tyr Ser Met Pro Ser
595 600 605
Lys Arg Ile Ser Glu Leu Ile Leu Gln Cys Tyr Lys Lys Val Gly Leu
610 615 620
Glu Ala Thr Val Arg Phe Leu Asp Asp Leu Lys Glu Leu Gly Phe Val
625 630 635 640
Gln Ser Thr Lys Ala Ala Ile Ser Met Gly Leu Lys Asp Val Lys Ile
645 650 655
Pro Glu Ile Lys Lys Glu Ile Leu Lys Asp Ala Tyr Asp Lys Val Ala
660 665 670
Val Val Lys Lys Gln Tyr Glu Asp Gly Ile Ile Thr Asp Gly Glu Arg
675 680 685
His Ser Lys Thr Ile Ser Ile Trp Thr Glu Val Ser Asp Leu Leu Ser
690 695 700
Asn Ala Leu Tyr Ser Glu Ile Lys Lys Gln Thr Asn Ser Lys His Asn
705 710 715 720
Pro Leu Phe Leu Met Ile Asp Ser Gly Ala Arg Gly Asn Lys Ser Gln
725 730 735
Leu Lys Gln Leu Gly Ala Leu Arg Gly Leu Met Ala Lys Pro Asn Gly
740 745 750
Ala Ile Ile Glu Ser Pro Ile Thr Ser Asn Phe Arg Glu Gly Leu Thr
755 760 765
Val Leu Glu Tyr Ser Ile Ser Ser His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asp Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Val Ile Ile Thr Glu Arg Asp Cys Gly Thr
805 810 815
Leu Asn His Ile Glu Val Ser Thr Ile Arg Gln Gly Ser Glu Glu Leu
820 825 830
Leu Pro Leu Lys Asp Arg Val Tyr Gly Arg Thr Val Ser Glu Asn Ile
835 840 845
Tyr Gln Pro Gly Asp Lys Ser Asn Val Leu Ala Tyr Ala Gly Asp Val
850 855 860
Leu Thr Ser Ala Gln Ala Glu Ala Ile Asp Asp Ala Gly Ile Glu Ser
865 870 875 880
Val Lys Ile Arg Ser Thr Leu Thr Cys Glu Ser Arg Arg Gly Val Cys
885 890 895
Ala Lys Cys Tyr Gly Leu Asn Leu Ala Asn Gly Arg Leu Ile Gly Leu
900 905 910
Gly Glu Ala Val Gly Ile Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly
915 920 925
Thr Gln Leu Thr Met Arg Thr Phe His Leu Gly Gly Ile Ala Ala Thr
930 935 940
Ser Ser Thr Pro Glu Ile Val Ala Glu Cys Asp Gly Ile Leu Val Tyr
945 950 955 960
Leu Asp Leu Arg Val Val Val Asp Gln Glu Gly Asn Asn Leu Val Leu
965 970 975
Asn Lys Met Gly Ala Leu His Leu Val Gln Asp Glu Gly Arg Ser Leu
980 985 990
Ser Glu Tyr Lys Lys Leu Leu Ser Thr Lys Ser Ile Glu Ser Leu Ala
995 1000 1005
Thr Phe Pro Val Glu Leu Gly Ala Lys Ile Leu Val Asn Asp Gly Ala
1010 1015 1020
Ala Val Ala Ala Gly Gln Arg Ile Ala Glu Val Glu Leu His Asn Ile
1025 1030 1035 1040
Pro Ile Ile Cys Asp Lys Pro Gly Phe Val His Tyr Glu Asp Leu Val
1045 1050 1055
Glu Gly Val Ser Thr Glu Lys Val Thr Asn Lys Asn Thr Gly Leu Val
1060 1065 1070
Glu Leu Ile Val Lys Gln His Arg Gly Glu Leu His Pro Gln Ile Ala
1075 1080 1085
Ile Tyr Ala Asp Ala Asn Met Lys Glu Leu Val Gly Thr Tyr Ala Ile
1090 1095 1100
Pro Ser Gly Ala Ile Ile Ser Val Glu Glu Gly Gln Arg Ile Ala Pro
1105 1110 1115 1120
Gly Met Leu Leu Ala Arg Leu Pro Arg Gly Ala Ile Lys Thr Lys Asp
1125 1130 1135
Ile Thr Gly Gly Leu Pro Arg Val Ala Glu Leu Val Glu Ala Arg Lys
1140 1145 1150
Pro Glu Asp Ala Ala Asp Ile Ala Lys Ile Asp Gly Val Val Asp Phe
1155 1160 1165
Lys Gly Ile Gln Lys Asn Lys Arg Ile Leu Val Val Arg Asp Glu Ile
1170 1175 1180
Thr Gly Met Glu Glu Glu His Leu Ile Ser Leu Thr Lys His Leu Ile
1185 1190 1195 1200
Val Gln Arg Gly Asp Ser Val Ile Lys Gly Gln Gln Leu Thr Asp Gly
1205 1210 1215
Leu Val Val Pro His Glu Ile Leu Glu Ile Cys Gly Val Arg Glu Leu
1220 1225 1230
Gln Lys Tyr Leu Val Asn Glu Val Gln Glu Val Tyr Arg Leu Gln Gly
1235 1240 1245
Val Asp Ile Asn Asp Lys His Val Glu Ile Ile Val Arg Gln Met Leu
1250 1255 1260
Gln Lys Val Arg Ile Thr Asp Pro Gly Asp Thr Thr Leu Leu Phe Gly
1265 1270 1275 1280
Glu Asp Val Asp Lys Lys Glu Phe Tyr Glu Glu Asn Arg Arg Thr Glu
1285 1290 1295
Glu Asp Gly Gly Lys Pro Ala Gln Ala Val Pro Val Leu Leu Gly Ile
1300 1305 1310
Thr Lys Ala Ser Leu Gly Thr Glu Ser Phe Ile Ser Ala Ala Ser Phe
1315 1320 1325
Gln Asp Thr Thr Arg Val Leu Thr Asp Ala Ala Cys Ser Ser Lys Thr
1330 1335 1340
Asp Tyr Leu Leu Gly Phe Lys Glu Asn Val Ile Met Gly His Met Ile
1345 1350 1355 1360
Pro Gly Gly Thr Gly Phe Asp Thr His Lys Arg Ile Lys Gln His Leu
1365 1370 1375
Glu Lys Glu Gln Glu Asp Leu Val Phe Asp Phe Asp Ser Glu Phe Glu
1380 1385 1390
Ser Val Ala Gly
1395
<210> 47
<211> 1178
<212> PRT
<213> Clostridium botulinum
<400> 47
Met Phe Glu Leu Asn Asn Phe Asp Ala Leu Gln Ile Gly Leu Ala Ser
1 5 10 15
Pro Glu Lys Ile Arg Glu Trp Ser Arg Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Arg Asp Gly Leu Phe Cys
35 40 45
Glu Arg Ile Phe Gly Pro Met Lys Asp Trp Glu Cys His Cys Gly Lys
50 55 60
Tyr Lys Arg Ile Arg Tyr Lys Gly Ile Val Cys Asp Arg Cys Gly Val
65 70 75 80
Glu Val Thr Lys Ala Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Ser His Ile Trp Tyr Phe Lys Gly Ile Pro Ser
100 105 110
Arg Met Gly Leu Ile Leu Asp Met Ser Pro Arg Ala Leu Glu Lys Val
115 120 125
Leu Tyr Phe Ala Ser Tyr Val Val Leu Asp Pro Lys Glu Thr Pro Leu
130 135 140
Leu Lys Lys Gln Leu Leu Asn Glu Lys Glu Tyr Arg Glu Ser Ile Asp
145 150 155 160
Lys Tyr Gly Asp Asp Ser Phe Val Ala Ala Met Gly Ala Glu Ala Val
165 170 175
Lys Thr Leu Leu Asp Glu Ile Asp Leu Glu Gln Ser Ser Ile Glu Leu
180 185 190
Lys Glu Glu Leu Lys Thr Ser Thr Gly Gln Lys Lys Ile Arg Ile Ile
195 200 205
Arg Arg Leu Glu Val Val Glu Ser Phe Arg Lys Ser Gly Asn Arg Pro
210 215 220
Asp Trp Met Val Ile Asp Val Ile Pro Val Ile Pro Pro Asp Leu Arg
225 230 235 240
Pro Met Val Gln Leu Asp Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn
245 250 255
Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn Arg Leu Lys Lys Leu
260 265 270
Leu Asp Leu Gly Ala Pro Asp Ile Ile Val Arg Asn Glu Lys Arg Met
275 280 285
Leu Gln Glu Ala Val Asp Ala Leu Ile Asp Asn Gly Arg Arg Gly Arg
290 295 300
Pro Val Thr Gly Pro Gly Asn Arg Pro Leu Lys Ser Leu Ser Asp Met
305 310 315 320
Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg
325 330 335
Val Asp Tyr Ser Gly Arg Ser Val Ile Val Val Gly Pro Glu Leu Lys
340 345 350
Met Tyr Gln Cys Gly Leu Pro Lys Glu Met Ala Leu Glu Leu Phe Lys
355 360 365
Pro Phe Val Met Lys Lys Leu Val Gln Asn Gly Leu Ala His Asn Ile
370 375 380
Lys Ser Ala Lys Arg Met Val Glu Arg Val Gln Pro Gln Val Trp Asp
385 390 395 400
Val Leu Glu Glu Val Ile Ser Asp His Pro Val Leu Leu Asn Arg Ala
405 410 415
Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe Gln Pro Val Leu Val
420 425 430
Glu Gly Arg Ala Ile Lys Leu His Pro Leu Val Cys Thr Ala Tyr Asn
435 440 445
Ala Asp Phe Asp Gly Asp Gln Met Ala Val His Val Pro Leu Ser Val
450 455 460
Glu Ala Gln Ala Glu Ala Arg Phe Leu Met Leu Ala Ala His Asn Ile
465 470 475 480
Leu Lys Pro Ser Asp Gly Lys Pro Val Ser Val Pro Thr Gln Asp Met
485 490 495
Val Leu Gly Ser Tyr Tyr Leu Thr Met Asp Lys Asp Gly Val Lys Gly
500 505 510
Glu Gly Lys Val Phe Ser Cys Pro Glu Glu Val Leu Met Ala Tyr Gln
515 520 525
Cys Lys Ala Val Asp Ile His Ala Lys Ile Lys Val Arg Leu Lys Lys
530 535 540
Val Ile Asp Gly Glu Thr Ile Glu Gly Ile Ile Glu Thr Thr Pro Gly
545 550 555 560
Lys Ile Ile Phe Asn Glu Ser Ile Pro Gln Asp Leu Gly Tyr Ile Asp
565 570 575
Arg Thr Val Pro Glu Asn Lys Leu Lys Leu Glu Val Asp Phe Leu Val
580 585 590
Ser Lys Lys Thr Leu Gly Gly Ile Ile Asn Arg Cys Tyr Met Lys His
595 600 605
Gly Ala Thr Lys Thr Ser Ile Met Leu Asp Lys Ile Lys Ala Lys Gly
610 615 620
Tyr His Tyr Ser Thr Ile Gly Ala Ile Thr Ile Ser Thr Ser Asp Met
625 630 635 640
Val Val Pro Glu Ala Lys Arg Glu Leu Leu Gln Asn Thr Glu Lys Gln
645 650 655
Val Glu Lys Ile Gln Lys Met Tyr Arg Arg Gly Phe Ile Ser Glu Glu
660 665 670
Glu Arg Tyr Glu Lys Val Ile Asp Leu Trp Thr Lys Thr Thr Glu Asp
675 680 685
Val Ala Asn Ala Leu Met Ala Ser Leu Asp Ser Phe Asn Pro Ile Tyr
690 695 700
Met Met Ala Asp Ser Gly Ala Arg Gly Ser Lys Ser Gln Ile Lys Gln
705 710 715 720
Leu Ala Gly Met Arg Gly Leu Met Ala Asn Pro Ser Gly Lys Ile Ile
725 730 735
Glu Leu Pro Ile Lys Ala Ser Phe Arg Glu Gly Leu Asp Val Leu Glu
740 745 750
Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Asn Ala Asp Thr Ala
755 760 765
Leu Lys Thr Ala Asp Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val
770 775 780
Ser Gln Asp Val Ile Val Arg Gln Glu Asp Cys Gly Thr Glu Glu Gly
785 790 795 800
Tyr Glu Val Ser Glu Ile Lys Glu Gly Asn Glu Val Ile Glu Pro Leu
805 810 815
Val Glu Arg Leu Ser Gly Arg Tyr Pro Ser Glu Asp Ile Ile Asn Pro
820 825 830
Thr Thr Gly Glu Val Ile Val Lys Arg Asn Thr Tyr Met Asn Glu Asp
835 840 845
Ile Ala Lys Lys Val Ser Asp Ala Gly Ile Lys Lys Val Lys Ile Arg
850 855 860
Ser Val Phe Thr Cys Lys Ser Lys His Gly Val Cys Ala Arg Cys Tyr
865 870 875 880
Gly Met Asn Met Ala Thr Ser Gln Lys Ile His Ile Gly Glu Ala Val
885 890 895
Gly Ile Val Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr Gln Leu Thr
900 905 910
Met Arg Thr Phe His Thr Gly Gly Val Ala Gly Ala Asp Ile Thr Gln
915 920 925
Gly Leu Pro Arg Val Glu Glu Leu Phe Glu Ala Arg Lys Pro Lys Gly
930 935 940
Leu Ala Ile Val Ser Glu Val Ser Gly Thr Val Lys Met Glu Glu Thr
945 950 955 960
Lys Lys Lys Arg Thr Ile Ile Val Val Thr Asp Asp Gly Glu Glu Val
965 970 975
Ser Tyr Asp Ile Pro Phe Gly Ser Arg Ile Lys Val Lys Asn Gly Asp
980 985 990
Ile Ile Ser Ala Gly Asp Glu Ile Thr Glu Gly Ser Ile Asn Pro His
995 1000 1005
Asp Ile Leu Arg Ile Lys Gly Val Asp Gly Val Lys Asn Tyr Leu Leu
1010 1015 1020
Ser Glu Val Gln Lys Val Tyr Arg Leu Gln Gly Val Asp Ile Asn Asp
1025 1030 1035 1040
Lys His Leu Glu Val Val Ile Arg Gln Met Thr Arg Lys Ile Lys Ile
1045 1050 1055
Glu Asp Ser Gly Asp Thr Glu Leu Leu Pro Gly Thr Met Ile Asp Val
1060 1065 1070
Phe Asp Phe Glu Glu Ala Asn Arg Glu Ile Leu Glu Lys Gly Gly Glu
1075 1080 1085
Pro Ala Val Gly Arg Ile Ala Leu Leu Gly Ile Thr Lys Ala Ala Leu
1090 1095 1100
Ala Thr Asp Ser Phe Leu Ser Ala Ala Ser Phe Gln Glu Thr Thr Arg
1105 1110 1115 1120
Val Leu Thr Asp Ala Ala Ile Lys Gly Lys Ile Asp Pro Leu Leu Gly
1125 1130 1135
Leu Lys Glu Asn Val Ile Ile Gly Lys Leu Ile Pro Ala Gly Thr Gly
1140 1145 1150
Met Thr Arg Tyr Arg Ser Ile Gln Ile Asn Thr Asp Asp Glu Asn Ile
1155 1160 1165
Glu Glu Asp Ser Met Asp Ser Ile Glu Val
1170 1175
<210> 48
<211> 1199
<212> PRT
<213> Bacillus subtilis
<400> 48
Met Leu Asp Val Asn Asn Phe Glu Tyr Met Asn Ile Gly Leu Ala Ser
1 5 10 15
Pro Asp Lys Ile Arg Ser Trp Ser Phe Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Cys
35 40 45
Glu Arg Ile Phe Gly Pro Thr Lys Asp Trp Glu Cys His Cys Gly Lys
50 55 60
Tyr Lys Arg Val Arg Tyr Lys Gly Val Val Cys Asp Arg Cys Gly Val
65 70 75 80
Glu Val Thr Arg Ala Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Ser His Ile Trp Tyr Phe Lys Gly Ile Pro Ser
100 105 110
Arg Met Gly Leu Val Leu Asp Met Ser Pro Arg Ala Leu Glu Glu Val
115 120 125
Ile Tyr Phe Ala Ser Tyr Val Val Thr Asp Pro Ala Asn Thr Pro Leu
130 135 140
Glu Lys Lys Gln Leu Leu Ser Glu Lys Glu Tyr Arg Ala Tyr Leu Asp
145 150 155 160
Lys Tyr Gly Asn Lys Phe Gln Ala Ser Met Gly Ala Glu Ala Ile His
165 170 175
Lys Leu Leu Gln Asp Ile Asp Leu Val Lys Glu Val Asp Met Leu Lys
180 185 190
Glu Glu Leu Lys Thr Ser Gln Gly Gln Arg Arg Thr Arg Ala Ile Lys
195 200 205
Arg Leu Glu Val Leu Glu Ala Phe Arg Asn Ser Gly Asn Lys Pro Ser
210 215 220
Trp Met Ile Leu Asp Val Leu Pro Val Ile Pro Pro Glu Leu Arg Pro
225 230 235 240
Met Val Gln Leu Asp Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp
245 250 255
Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu
260 265 270
Asp Leu Gly Ala Pro Ser Ile Ile Val Gln Asn Glu Lys Arg Met Leu
275 280 285
Gln Glu Ala Val Asp Ala Leu Ile Asp Asn Gly Arg Arg Gly Arg Pro
290 295 300
Val Thr Gly Pro Gly Asn Arg Pro Leu Lys Ser Leu Ser His Met Leu
305 310 315 320
Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val
325 330 335
Asp Tyr Ser Gly Arg Ser Val Ile Val Val Gly Pro His Leu Lys Met
340 345 350
Tyr Gln Cys Gly Leu Pro Lys Glu Met Ala Leu Glu Leu Phe Lys Pro
355 360 365
Phe Val Met Lys Glu Leu Val Glu Lys Gly Leu Ala His Asn Ile Lys
370 375 380
Ser Ala Lys Arg Lys Ile Glu Arg Val Gln Pro Glu Val Trp Asp Val
385 390 395 400
Leu Glu Ser Val Ile Lys Glu His Pro Val Leu Leu Asn Arg Ala Pro
405 410 415
Thr Leu His Arg Leu Gly Ile Gln Ala Phe Glu Pro Thr Leu Val Glu
420 425 430
Gly Arg Ala Ile Arg Leu His Pro Leu Val Cys Thr Ala Tyr Asn Ala
435 440 445
Asp Phe Asp Gly Asp Gln Met Ala Val His Val Pro Leu Ser Ala Glu
450 455 460
Ala Gln Ala Glu Ala Arg Ile Leu Met Leu Ala Ala Gln Asn Ile Leu
465 470 475 480
Asn Pro Lys Asp Gly Lys Pro Val Val Thr Pro Ser Gln Asp Met Val
485 490 495
Leu Gly Asn Tyr Tyr Leu Thr Leu Glu Arg Ala Gly Ala Val Gly Glu
500 505 510
Gly Met Val Phe Lys Asn Thr Asp Glu Ala Leu Leu Ala Tyr Gln Asn
515 520 525
Gly Tyr Val His Leu His Thr Arg Val Ala Val Ala Ala Asn Ser Leu
530 535 540
Lys Asn Val Thr Phe Thr Glu Glu Gln Arg Ser Lys Leu Leu Ile Thr
545 550 555 560
Thr Val Gly Lys Leu Val Phe Asn Glu Ile Leu Pro Glu Ser Phe Pro
565 570 575
Tyr Met Asn Glu Pro Thr Lys Ser Asn Ile Glu Glu Lys Thr Pro Asp
580 585 590
Arg Phe Phe Leu Glu Lys Gly Ala Asp Val Lys Ala Val Ile Ala Gln
595 600 605
Gln Pro Ile Asn Ala Pro Phe Lys Lys Gly Ile Leu Gly Lys Ile Ile
610 615 620
Ala Glu Ile Phe Lys Arg Phe His Ile Thr Glu Thr Ser Lys Met Leu
625 630 635 640
Asp Arg Met Lys Asn Leu Gly Phe Lys Tyr Ser Thr Lys Ala Gly Ile
645 650 655
Thr Val Gly Val Ser Asp Ile Val Val Leu Asp Asp Lys Gln Glu Ile
660 665 670
Leu Glu Glu Ala Gln Ser Lys Val Asp Asn Val Met Lys Gln Phe Arg
675 680 685
Arg Gly Leu Ile Thr Glu Glu Glu Arg Tyr Glu Arg Val Ile Ser Ile
690 695 700
Trp Ser Ala Ala Lys Asp Val Ile Gln Gly Lys Leu Met Lys Ser Leu
705 710 715 720
Asp Glu Leu Asn Pro Ile Tyr Met Met Ser Asp Ser Gly Ala Arg Gly
725 730 735
Asn Ala Ser Asn Phe Thr Gln Leu Ala Gly Met Arg Gly Leu Met Ala
740 745 750
Asn Pro Ala Gly Arg Ile Ile Glu Leu Pro Ile Lys Ser Ser Phe Arg
755 760 765
Glu Gly Leu Thr Val Leu Glu Tyr Phe Ile Ser Thr His Gly Ala Arg
770 775 780
Lys Gly Leu Ala Asp Thr Ala Leu Lys Thr Ala Asp Ser Gly Tyr Leu
785 790 795 800
Thr Arg Arg Leu Val Asp Val Ala Gln Asp Val Ile Ile Arg Glu Thr
805 810 815
Asp Cys Gly Thr Asp Arg Gly Ile Leu Ala Lys Pro Leu Lys Glu Gly
820 825 830
Thr Glu Thr Ile Glu Arg Leu Glu Glu Arg Leu Ile Gly Arg Phe Ala
835 840 845
Arg Lys Gln Val Lys His Pro Glu Thr Gly Glu Val Leu Val Asn Glu
850 855 860
Asn Glu Leu Ile Asp Glu Asp Lys Ala Leu Glu Ile Val Glu Ala Gly
865 870 875 880
Ile Glu Glu Val Trp Ile Arg Ser Ala Phe Thr Cys Asn Thr Pro His
885 890 895
Gly Val Cys Lys Arg Cys Tyr Gly Arg Asn Leu Ala Thr Gly Ser Asp
900 905 910
Val Glu Val Gly Glu Ala Val Gly Ile Ile Ala Ala Gln Ser Ile Gly
915 920 925
Glu Pro Gly Thr Gln Leu Thr Met Arg Thr Phe His Thr Gly Gly Val
930 935 940
Ala Gly Asp Asp Ile Thr Gln Gly Leu Pro Arg Ile Gln Glu Leu Phe
945 950 955 960
Glu Ala Arg Asn Pro Lys Gly Gln Ala Thr Ile Thr Glu Ile Asp Gly
965 970 975
Thr Val Val Glu Ile Asn Glu Val Arg Asp Lys Gln Gln Glu Ile Val
980 985 990
Val Gln Gly Ala Val Glu Thr Arg Ser Tyr Thr Ala Pro Tyr Asn Ser
995 1000 1005
Arg Leu Lys Val Ala Glu Gly Asp Lys Ile Thr Arg Gly Gln Val Leu
1010 1015 1020
Thr Glu Gly Ser Ile Asp Pro Lys Glu Leu Leu Lys Val Thr Asp Leu
1025 1030 1035 1040
Thr Thr Val Gln Glu Tyr Leu Leu His Glu Val Gln Lys Val Tyr Arg
1045 1050 1055
Met Gln Gly Val Glu Ile Gly Asp Lys His Val Glu Val Met Val Arg
1060 1065 1070
Gln Met Leu Arg Lys Val Arg Val Ile Asp Ala Gly Asp Thr Asp Val
1075 1080 1085
Leu Pro Gly Thr Leu Leu Asp Ile His Gln Phe Thr Glu Ala Asn Lys
1090 1095 1100
Lys Val Leu Leu Glu Gly Asn Arg Pro Ala Thr Gly Arg Pro Val Leu
1105 1110 1115 1120
Leu Gly Ile Thr Lys Ala Ser Leu Glu Thr Asp Ser Phe Leu Ser Ala
1125 1130 1135
Ala Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Asp Ala Ala Ile Lys
1140 1145 1150
Gly Lys Arg Asp Glu Leu Leu Gly Leu Lys Glu Asn Val Ile Ile Gly
1155 1160 1165
Lys Leu Val Pro Ala Gly Thr Gly Met Met Lys Tyr Arg Lys Val Lys
1170 1175 1180
Pro Val Ser Asn Val Gln Pro Thr Asp Asp Met Val Pro Val Glu
1185 1190 1195
<210> 49
<211> 1225
<212> PRT
<213> Streptococcus pneumoniae
<400> 49
Met Val Asp Val Asn Arg Phe Lys Ser Met Gln Ile Thr Leu Ala Ser
1 5 10 15
Pro Ser Lys Val Arg Ser Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Arg Glu Gly Leu Phe Asp
35 40 45
Glu Val Ile Phe Gly Pro Thr Lys Asp Trp Glu Cys Ala Cys Gly Lys
50 55 60
Tyr Lys Arg Ile Arg Tyr Arg Gly Ile Val Cys Asp Arg Cys Gly Val
65 70 75 80
Glu Val Thr Arg Thr Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Lys Ala Pro Val Ser His Ile Trp Tyr Phe Lys Gly Ile Pro Ser
100 105 110
Arg Met Gly Leu Thr Leu Asp Met Ser Pro Arg Ala Leu Glu Glu Val
115 120 125
Ile Tyr Phe Ala Ala Tyr Val Val Ile Asp Pro Lys Asp Thr Pro Leu
130 135 140
Glu His Lys Ser Ile Met Thr Glu Arg Glu Tyr Arg Glu Arg Leu Arg
145 150 155 160
Glu Tyr Gly Tyr Gly Ser Phe Val Ala Lys Met Gly Ala Glu Ala Ile
165 170 175
Gln Asp Leu Leu Lys Gln Val Asp Leu Glu Lys Glu Ile Ala Glu Leu
180 185 190
Lys Glu Glu Leu Lys Thr Ala Thr Gly Gln Lys Arg Val Lys Ala Ile
195 200 205
Arg Arg Leu Asp Val Leu Asp Ala Phe Tyr Lys Ser Gly Asn Lys Pro
210 215 220
Glu Trp Met Ile Leu Asn Ile Leu Pro Val Ile Pro Pro Asp Leu Arg
225 230 235 240
Pro Met Leu Gln Leu Asp Gly Gly Arg Phe Ala Ser Ser Asp Leu Asn
245 250 255
Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn Arg Leu Ala Arg Leu
260 265 270
Leu Glu Leu Asn Ala Pro Gly Ile Ile Val Gln Asn Glu Lys Arg Met
275 280 285
Leu Gln Glu Ala Val Asp Ala Leu Ile Asp Asn Gly Arg Arg Gly Arg
290 295 300
Pro Ile Thr Gly Pro Gly Ser Arg Pro Leu Lys Ser Leu Ser His Met
305 310 315 320
Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg
325 330 335
Val Asp Phe Ser Gly Arg Ser Val Ile Ala Val Gly Pro Thr Leu Lys
340 345 350
Met Tyr Gln Cys Gly Val Pro Arg Glu Met Ala Ile Glu Leu Phe Lys
355 360 365
Pro Phe Val Met Arg Glu Ile Val Ala Arg Asp Ile Val Gln Asn Val
370 375 380
Lys Ala Ala Lys Arg Leu Val Glu Arg Gly Asp Glu Arg Ile Trp Asp
385 390 395 400
Ile Leu Glu Glu Val Ile Lys Glu His Pro Val Leu Leu Asn Arg Ala
405 410 415
Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe Glu Pro Val Leu Ile
420 425 430
Asp Gly Lys Ala Leu Arg Leu His Pro Leu Val Cys Glu Ala Tyr Asn
435 440 445
Ala Asp Phe Asp Gly Asp Gln Met Ala Ile His Val Pro Leu Ser Glu
450 455 460
Glu Ala Gln Ala Glu Ala Arg Ile Leu Met Leu Ala Ala Glu His Ile
465 470 475 480
Leu Asn Pro Lys Asp Gly Lys Pro Val Val Thr Pro Ser Gln Asp Met
485 490 495
Val Leu Gly Asn Tyr Tyr Leu Thr Met Glu Glu Ala Gly Arg Glu Gly
500 505 510
Glu Gly Met Val Phe Lys Asp Arg Asp Glu Ala Val Met Ala Tyr Arg
515 520 525
Asn Gly Tyr Val His Leu His Ser Arg Val Gly Ile Ala Thr Asp Ser
530 535 540
Leu Asn Lys Pro Trp Thr Glu Glu Gln Arg His Lys Val Leu Leu Thr
545 550 555 560
Thr Val Gly Lys Ile Leu Phe Asn Asp Ile Met Pro Glu Gly Leu Pro
565 570 575
Tyr Leu Gln Glu Pro Asn Asn Ala Asn Leu Thr Glu Gly Val Pro Ala
580 585 590
Lys Tyr Phe Leu Pro Leu Gly Gly Asp Ile Lys Glu Ala Ile Ser Asn
595 600 605
Leu Glu Leu Asn Pro Pro Phe Lys Lys Lys Asn Leu Gly Asn Ile Ile
610 615 620
Ala Glu Ile Phe Lys Arg Phe Arg Thr Thr Glu Thr Ser Ala Leu Leu
625 630 635 640
Asp Arg Met Lys Asn Leu Gly Tyr His His Ser Thr Leu Ala Gly Leu
645 650 655
Thr Val Gly Ile Ala Asp Ile Pro Val Val Asp Asp Lys Ala Glu Ile
660 665 670
Ile Glu Glu Ser His Lys Arg Val Glu Gln Ile Thr Lys Gln Phe Arg
675 680 685
Arg Gly Met Ile Thr Asp Asp Glu Arg Tyr Asn Ala Val Thr Ala Glu
690 695 700
Trp Arg Ala Ala Arg Glu Lys Leu Glu Lys Arg Leu Ile Ala Asn Gln
705 710 715 720
Asp Pro Lys Asn Pro Ile Val Met Met Met Asp Ser Gly Ala Arg Gly
725 730 735
Asn Ile Ser Asn Phe Ser Gln Leu Ala Gly Met Arg Gly Leu Met Ala
740 745 750
Ala Pro Asn Gly Arg Ile Met Glu Leu Pro Ile Leu Ser Asn Phe Arg
755 760 765
Glu Gly Leu Ser Val Leu Glu Met Phe Phe Ser Thr His Gly Ala Arg
770 775 780
Lys Gly Met Thr Asp Thr Ala Leu Lys Thr Ala Asp Ser Gly Tyr Leu
785 790 795 800
Thr Arg Arg Leu Val Asp Val Ala Gln Asp Val Ile Ile Arg Glu Asp
805 810 815
Asp Cys Gly Thr Asp Arg Gly Leu Leu Ile Arg Ser Ile Ala Glu Gly
820 825 830
Lys Glu Met Ile Glu Ser Leu Glu Glu Arg Leu Asn Gly Arg Tyr Thr
835 840 845
Lys Lys Thr Val Lys His Pro Glu Thr Gly Ala Val Ile Ile Gly Pro
850 855 860
Asn Glu Leu Ile Thr Glu Asp Lys Ala Arg Glu Ile Val Asn Ala Gly
865 870 875 880
Val Glu Glu Val Thr Ile Arg Ser Val Phe Thr Cys Asn Thr Arg His
885 890 895
Gly Val Cys Arg His Cys Tyr Gly Ile Asn Leu Ala Thr Gly Asp Ala
900 905 910
Val Glu Val Gly Glu Ala Val Gly Thr Ile Ala Ala Gln Ser Ile Gly
915 920 925
Glu Pro Gly Thr Gln Leu Thr Met Arg Thr Phe His Thr Gly Gly Val
930 935 940
Ala Ser Asn Thr Asp Ile Thr Gln Gly Leu Pro Arg Val Gln Glu Ile
945 950 955 960
Phe Glu Ala Arg Asn Pro Lys Gly Glu Ala Val Ile Thr Glu Val Lys
965 970 975
Gly Gln Val Thr Ala Ile Glu Glu Asp Ala Ser Thr Arg Thr Lys Lys
980 985 990
Val Phe Val Lys Gly Glu Thr Gly Glu Gly Glu Tyr Val Val Pro Phe
995 1000 1005
Thr Ala Arg Met Arg Val Glu Val Gly Gly Gln Val Ala Arg Gly Ala
1010 1015 1020
Ala Leu Thr Glu Gly Ser Ile Gln Pro Lys Arg Leu Leu Ala Val Arg
1025 1030 1035 1040
Asp Val Leu Ser Val Glu Thr Tyr Leu Leu Gly Glu Val Gln Lys Val
1045 1050 1055
Tyr Arg Ser Gln Gly Val Glu Ile Gly Asp Lys His Ile Glu Val Met
1060 1065 1070
Val Arg Gln Met Ile Arg Lys Val Arg Val Met Asp Pro Gly Asp Thr
1075 1080 1085
Asp Leu Leu Met Gly Thr Leu Met Asp Ile Asn Asp Phe Thr Asp Ala
1090 1095 1100
Asn Lys Asp Val Leu Ile Ala Gly Gly Val Pro Ala Thr Gly Arg Pro
1105 1110 1115 1120
Val Leu Met Gly Ile Thr Lys Ala Ser Leu Glu Thr Asn Ser Phe Leu
1125 1130 1135
Ser Ala Ala Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Asp Ala Ala
1140 1145 1150
Ile Arg Gly Lys Lys Asp His Leu Leu Gly Leu Lys Glu Asn Val Ile
1155 1160 1165
Ile Gly Lys Ile Ile Pro Ala Gly Thr Gly Met Ala Arg Tyr Arg Asn
1170 1175 1180
Leu Glu Pro His Ala Val Asn Glu Glu Glu Tyr Leu Asn Pro Pro Val
1185 1190 1195 1200
Glu Glu Glu Gly Asn Glu Glu Thr Thr Glu Val Val Val Asp Thr Ala
1205 1210 1215
Val Glu Thr Val Glu Glu Thr Val Glu
1220 1225
<210> 50
<211> 1217
<212> PRT
<213> Enterococcus faecalis
<400> 50
Met Ile Asp Val Asn Lys Phe Glu Ser Met Gln Ile Gly Leu Ala Ser
1 5 10 15
Pro Glu Lys Ile Arg Ser Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Arg Glu Gly Leu Phe Cys
35 40 45
Glu Arg Ile Phe Gly Pro Thr Lys Asp Trp Glu Cys Ala Cys Gly Lys
50 55 60
Tyr Lys Arg Ile Arg Tyr Lys Gly Ile Val Cys Asp Arg Cys Gly Val
65 70 75 80
Glu Val Thr Arg Ser Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Ser His Ile Trp Tyr Phe Lys Gly Ile Pro Ser
100 105 110
Arg Met Gly Leu Val Leu Asp Met Ser Pro Arg Ala Leu Glu Glu Val
115 120 125
Ile Tyr Phe Ala Ser Tyr Val Val Ile Glu Pro Gly Asp Thr Thr Leu
130 135 140
Glu Lys Lys Gln Leu Leu Thr Glu Arg Glu Tyr Arg Glu Lys Arg Glu
145 150 155 160
Gln Tyr Gly Gln Ala Phe Lys Ala Ala Met Gly Ala Glu Ala Val Lys
165 170 175
Gln Leu Leu Asp Asn Val Asp Leu Asp Gly Glu Val Ala Gln Leu Lys
180 185 190
Glu Glu Leu Lys Thr Ala Ser Gly Gln Lys Arg Thr Arg Ala Ile Arg
195 200 205
Arg Leu Asp Ile Leu Glu Ala Phe Arg Ala Ser Gly Asn Gln Pro Ser
210 215 220
Trp Met Val Met Asp Val Ile Pro Val Ile Pro Pro Asp Leu Arg Pro
225 230 235 240
Met Val Gln Leu Glu Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp
245 250 255
Leu Tyr Arg Arg Val Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu
260 265 270
Asp Leu Asn Ala Pro Ser Ile Ile Val Gln Asn Glu Lys Arg Met Leu
275 280 285
Gln Glu Ala Val Asp Ala Leu Ile Asp Asn Gly Arg Arg Gly Arg Pro
290 295 300
Val Thr Gly Pro Gly Asn Arg Pro Leu Lys Ser Leu Ser His Met Leu
305 310 315 320
Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val
325 330 335
Asp Tyr Ser Gly Arg Ser Val Ile Val Val Gly Pro Phe Leu Lys Met
340 345 350
Tyr Gln Cys Gly Leu Pro Lys Glu Met Ala Ile Glu Leu Phe Lys Pro
355 360 365
Phe Val Met Arg Glu Leu Val Gln Arg Glu Ile Ala Thr Asn Ile Lys
370 375 380
Asn Ala Lys Arg Lys Ile Glu Arg Gly Glu Asp Glu Val Trp Asp Ile
385 390 395 400
Leu Glu Glu Val Ile Gln Glu His Pro Val Leu Leu Asn Arg Ala Pro
405 410 415
Thr Leu His Arg Leu Gly Ile Gln Ala Phe Glu Pro Val Leu Val Glu
420 425 430
Gly Arg Ala Ile Arg Leu His Pro Leu Val Cys Glu Ala Tyr Asn Ala
435 440 445
Asp Phe Asp Gly Asp Gln Met Ala Val His Val Pro Leu Asn Glu Glu
450 455 460
Ala Gln Ala Glu Ala Arg Met Leu Met Leu Ala Ala Gln Asn Ile Leu
465 470 475 480
Asn Pro Lys Asp Gly Lys Pro Val Val Thr Pro Ser Gln Asp Met Val
485 490 495
Leu Gly Asn Tyr Tyr Leu Thr Met Glu Glu Glu Gly Arg Glu Gly Glu
500 505 510
Gly Met Ile Phe Arg Asp Met Asn Glu Ala Val Leu Ala Trp Gln Asn
515 520 525
Gly Tyr Val His Leu His Ser Arg Ile Gly Val Gln Thr Thr Leu Leu
530 535 540
Gly Asp Lys Pro Phe Thr Asp Trp Gln Lys Glu Arg Ile Leu Ile Thr
545 550 555 560
Thr Val Gly Lys Ile Ile Phe Asn Glu Ile Met Pro Val Glu Phe Pro
565 570 575
Tyr Leu Asn Glu Pro Thr Asp Tyr Asn Leu Thr Val Gln Thr Pro Asp
580 585 590
Lys Tyr Phe Val Glu Ala Gly Thr Asp Ile Pro Ala His Ile Lys Glu
595 600 605
Gln Glu Leu Val Leu Pro Phe Lys Lys Lys Asn Leu Gly Asn Ile Ile
610 615 620
Ala Glu Val Phe Lys Arg Phe His Ile Thr Glu Thr Ser Lys Met Leu
625 630 635 640
Asp Arg Met Lys Asp Leu Gly Tyr Lys His Ser Thr Tyr Ala Gly Met
645 650 655
Thr Val Gly Ile Ala Asp Ile Met Val Leu His Glu Lys Gln Ala Ile
660 665 670
Ile Asp Ala Ala His Lys Gln Val Glu Thr Ile Thr Lys Gln Phe Arg
675 680 685
Arg Gly Leu Ile Thr Asp Asp Glu Arg Tyr Glu Arg Val Ile Gly Val
690 695 700
Trp Asn Gly Ala Lys Asp Glu Ile Gln Gln Lys Leu Ile Glu Ser Met
705 710 715 720
Glu Ala Arg Asn Pro Ile Phe Met Met Ser Asp Ser Gly Ala Arg Gly
725 730 735
Asn Ile Ser Asn Phe Thr Gln Leu Ala Gly Met Arg Gly Leu Met Ala
740 745 750
Ala Pro Asn Gly Arg Ile Met Glu Leu Pro Ile Ile Ser Asn Phe Arg
755 760 765
Glu Gly Leu Ser Val Leu Glu Met Phe Ile Ser Thr His Gly Ala Arg
770 775 780
Lys Gly Met Thr Asp Thr Ala Leu Lys Thr Ala Asp Ser Gly Tyr Leu
785 790 795 800
Thr Arg Arg Leu Val Asp Val Ala Gln Asp Val Ile Ile Arg Glu Asp
805 810 815
Asp Cys Gly Thr Asp Arg Gly Leu Glu Ile Glu Ala Ile Arg Glu Gly
820 825 830
Asn Glu Ile Ile Glu Pro Leu Asp Glu Arg Leu Leu Gly Arg Tyr Thr
835 840 845
Arg Lys Ser Val Val His Pro Glu Thr Gly Ala Ile Ile Ile Gly Ala
850 855 860
Asp Gln Leu Ile Thr Glu Asp Leu Ala Arg Glu Ile Val Asp Ala Gly
865 870 875 880
Ile Glu Lys Val Thr Ile Arg Ser Val Phe Thr Cys Asn Thr Lys His
885 890 895
Gly Val Cys Lys His Cys Tyr Gly Arg Asn Leu Ala Thr Gly Ser Asp
900 905 910
Val Glu Val Gly Glu Ala Val Gly Thr Ile Ala Ala Gln Ser Ile Gly
915 920 925
Glu Pro Gly Thr Gln Leu Thr Met Arg Thr Phe His Thr Gly Gly Val
930 935 940
Ala Gly Asp Asp Ile Thr Gln Gly Leu Pro Arg Ile Gln Glu Ile Phe
945 950 955 960
Glu Ala Arg Asn Pro Lys Gly Gln Ala Val Ile Thr Glu Val Thr Gly
965 970 975
Glu Val Ile Asp Ile Ser Glu Asp Pro Ala Thr Arg Gln Lys Glu Val
980 985 990
Thr Ile Lys Gly Lys Thr Asp Thr Arg Thr Tyr Thr Val Pro Tyr Thr
995 1000 1005
Ala Arg Met Lys Val Ala Glu Gly Asp Ile Ile His Arg Gly Ala Pro
1010 1015 1020
Leu Thr Glu Gly Ser Ile Asp Pro Lys Gln Leu Leu Gln Val Arg Asp
1025 1030 1035 1040
Val Leu Ser Val Glu Asn Tyr Leu Leu Arg Glu Val Gln Arg Val Tyr
1045 1050 1055
Arg Met Gln Gly Val Glu Ile Gly Asp Lys His Ile Glu Val Met Val
1060 1065 1070
Arg Gln Met Leu Arg Lys Ile Arg Val Met Asp Pro Gly Asp Thr Glu
1075 1080 1085
Ile Leu Pro Gly Thr Leu Met Asp Ile Ala Glu Phe Lys Asp Arg Asn
1090 1095 1100
Tyr Asp Thr Leu Val Ala Gly Gly Val Pro Ala Thr Ser Arg Pro Val
1105 1110 1115 1120
Leu Leu Gly Ile Thr Lys Ala Ser Leu Glu Thr Asn Ser Phe Leu Ser
1125 1130 1135
Ala Ala Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Asp Ala Ala Ile
1140 1145 1150
Arg Gly Lys Lys Asp Pro Leu Leu Gly Leu Lys Glu Asn Val Ile Ile
1155 1160 1165
Gly Lys Ile Ile Pro Ala Gly Thr Gly Met Ala Arg Tyr Arg Asn Met
1170 1175 1180
Glu Pro Lys Glu Val Gly Val Ala Ser Glu Asn Val Tyr Ser Ile Ser
1185 1190 1195 1200
Asp Ile Glu Ala Gln Met Ala Ala Glu Asp Ala Met Lys Asn Ile Asn
1205 1210 1215
Lys
<210> 51
<211> 1215
<212> PRT
<213> Lactobacillus brevis
<400> 51
Met Val Asp Val Asn Lys Phe Glu Ser Met Gln Ile Gly Leu Ala Ser
1 5 10 15
Pro Asp Lys Ile Arg Ser Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu
20 25 30
Thr Ile Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Asp
35 40 45
Glu Arg Ile Phe Gly Pro Thr Lys Asp Trp Glu Cys Ala Cys Gly Lys
50 55 60
Tyr Lys Arg Ile Arg Tyr Lys Gly Val Val Cys Asp Arg Cys Gly Val
65 70 75 80
Glu Val Thr Arg Ser Lys Val Arg Arg Glu Arg Met Gly His Ile Glu
85 90 95
Leu Ala Ala Pro Val Thr His Ile Trp Tyr Phe Lys Gly Ile Pro Ser
100 105 110
Arg Met Gly Leu Val Leu Asp Met Ser Pro Arg Ala Leu Glu Glu Ile
115 120 125
Ile Tyr Phe Ala Ser Tyr Val Val Leu Asp Pro Gly Asn Thr Pro Leu
130 135 140
Glu Lys Lys Gln Leu Leu Ser Glu Arg Asp Tyr Arg Asp Lys Leu Leu
145 150 155 160
Glu Tyr Gly Ser Asp Ala Phe Lys Ala Glu Met Gly Ala Glu Ala Ile
165 170 175
Lys Lys Leu Leu Met Ser Val Asp Leu Asp Lys Glu Val Thr Glu Leu
180 185 190
Lys Glu Glu Leu Lys Glu Ala Thr Gly Gln Lys Arg Thr Arg Ala Val
195 200 205
Arg Arg Leu Asp Ile Leu Glu Ala Phe Val Met Ser Gly Asn Arg Pro
210 215 220
Glu Trp Met Val Met Asp Ala Ile Pro Val Ile Pro Pro Asp Leu Arg
225 230 235 240
Pro Met Val Gln Leu Glu Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn
245 250 255
Asp Leu Tyr Arg Arg Val Ile Asn Arg Asn Ser Arg Leu Lys Arg Leu
260 265 270
Leu Asp Leu Asn Ala Pro Gly Ile Ile Val Gln Asn Glu Lys Arg Met
275 280 285
Leu Gln Glu Ala Val Asp Ala Leu Ile Asp Asn Gly Arg Arg Gly Arg
290 295 300
Pro Val Ala Gly Pro Gly Asn Arg Pro Leu Lys Ser Leu Ser His Met
305 310 315 320
Leu Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg
325 330 335
Val Asp Tyr Ser Gly Arg Ser Val Ile Asp Val Gly Pro Ser Leu Lys
340 345 350
Phe Asn Gln Met Gly Leu Pro Val Pro Met Ala Leu Glu Leu Phe Arg
355 360 365
Pro Phe Ile Met Lys Glu Leu Val Ala Arg Gly Leu Ala Ser Asn Ile
370 375 380
Lys Asn Ala Lys Arg Gln Ile Asp Arg Glu Asp Asp Asp Val Phe Asn
385 390 395 400
Val Leu Glu Asp Val Ile Lys Glu His Pro Val Leu Leu Asn Arg Ala
405 410 415
Pro Thr Leu His Arg Leu Gly Ile Gln Ala Phe Glu Pro Val Leu Val
420 425 430
Ser Gly Lys Ala Met Arg Leu His Pro Leu Ala Cys Glu Ala Tyr Asn
435 440 445
Ala Asp Phe Asp Gly Asp Gln Met Ala Ile His Val Pro Leu Ser Asp
450 455 460
Glu Ala Gln Ala Glu Ala Arg Leu Leu Met Leu Ala Ala His His Ile
465 470 475 480
Leu Ala Pro Ala Ser Gly Lys Pro Val Val Ala Pro Ser Gln Asp Met
485 490 495
Val Ile Gly Asn Tyr Tyr Leu Thr Met Glu Glu Ala Asn Arg Glu Gly
500 505 510
Glu Gly Met Ile Phe Thr Asp Leu Asp Glu Ala Thr Leu Ala Tyr Arg
515 520 525
Asn Gly Ile Val His Trp His Thr Arg Val Gly Val Gln Val Thr Ser
530 535 540
Met Pro Asp Lys Pro Phe Thr Asp Glu Gln Arg Ser Lys Ile Met Val
545 550 555 560
Thr Thr Val Gly Lys Leu Ile Phe Asn Asn Ile Leu Pro Lys Ser Phe
565 570 575
Pro Tyr Leu Asn Glu Pro Thr Ser Thr Asn Leu Asn Gly Tyr Val Pro
580 585 590
Asp Lys Tyr Phe Leu Glu Pro Gly Glu Asp Ile His Asp Tyr Leu Gln
595 600 605
Asn Ala Glu Ile Ile Pro Pro Phe Lys Lys Gly Phe Leu Ser Asp Ile
610 615 620
Ile Ala Ala Val Tyr Gln Gln Tyr Lys Val Thr Ala Thr Ser Glu Leu
625 630 635 640
Leu Asp Arg Ile Lys Asp Leu Gly Tyr Asn Glu Ser Thr Lys Ser Gly
645 650 655
Leu Thr Val Gly Met Val Asp Val Thr Asp Leu Lys Glu Lys Pro Glu
660 665 670
Ile Ile Ala Ala Ala His Lys Gln Val Ser Thr Val Thr Lys Gln Phe
675 680 685
Arg Arg Gly Leu Ile Thr Asp His Glu Arg Tyr Glu Arg Val Ile Gly
690 695 700
Ile Trp Asn Asp Ala Lys Asp Glu Ile Gln Asn Ala Leu Ile His Ser
705 710 715 720
Phe Asp Gln Gln Asn Pro Ile Phe Met Met Ser Asp Ser Gly Ala Arg
725 730 735
Gly Asn Ile Ser Asn Phe Thr Gln Leu Ala Gly Met Arg Gly Leu Met
740 745 750
Ala Ala Pro Ser Gly Asp Ile Met Glu Leu Pro Ile Thr Ser Asn Phe
755 760 765
Arg Glu Gly Leu Thr Val Met Glu Met Phe Ile Ser Thr His Gly Ala
770 775 780
Arg Lys Gly Met Thr Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr
785 790 795 800
Leu Thr Arg Arg Leu Val Asp Val Ala Gln Asp Val Ile Ile Arg Glu
805 810 815
Lys Asp Cys Gly Thr Asp Arg Gly Leu Lys Ile Arg Ala Ile Thr Asp
820 825 830
Gly Asn Glu Met Ile Glu Pro Leu Tyr Asp Arg Ile Leu Gly Arg Tyr
835 840 845
Thr Gln Lys Thr Val Tyr Asp Pro Gln Thr Gly Asp Val Ile Val Pro
850 855 860
Lys Asn Gln Met Ile Val Glu Asp Thr Ala Gln Gln Ile Val Asp Ala
865 870 875 880
Gly Val Glu Glu Val Thr Ile Arg Ser Ala Phe Thr Cys Asn Thr Glu
885 890 895
His Gly Val Cys Glu His Cys Tyr Gly Arg Asn Met Ala Thr Gly Asp
900 905 910
Glu Val Glu Val Gly Glu Ala Val Gly Thr Val Ala Ala Gln Ser Ile
915 920 925
Gly Glu Pro Gly Thr Gln Leu Thr Met Arg Asn Phe His Thr Gly Gly
930 935 940
Val Ala Gly Asn Glu Asp Ile Thr Gln Gly Leu Pro Arg Val Gln Glu
945 950 955 960
Leu Phe Glu Ser Arg Asn Pro Lys Gly Lys Ala Glu Ile Thr Glu Val
965 970 975
Thr Gly Thr Val Glu Ser Ile Glu Glu Asn Pro Ala Glu Arg Thr Lys
980 985 990
Glu Ile Thr Ile Lys Gly Glu Ala Asp Thr Arg Ser Tyr Thr Leu Pro
995 1000 1005
Ile Thr Ala Arg Met Arg Val Ser Glu Gly Asp Phe Ile His Arg Gly
1010 1015 1020
Gly Ala Leu Asn Tyr Gly Ser Val Asp Pro Lys Glu Leu Leu Arg Val
1025 1030 1035 1040
Arg Asp Val Leu Ser Thr Glu Thr Tyr Ile Leu Gly Glu Val Gln Arg
1045 1050 1055
Val Tyr Arg Met Gln Gly Val Ala Ile Ser Asp Lys His Val Glu Ile
1060 1065 1070
Met Val Arg Gln Met Leu Arg Lys Val Arg Ile Met Asp Pro Gly Asp
1075 1080 1085
Thr Asp Val Leu Pro Gly Thr Leu Met Asp Ile Gln Asp Phe Arg Arg
1090 1095 1100
Ala Asn Tyr Gln Thr Leu Ile Asp Gly Gly Ile Ala Ala Thr Ala Arg
1105 1110 1115 1120
Pro Val Ile Leu Gly Ile Thr Lys Ala Ala Leu Glu Thr Asn Ser Phe
1125 1130 1135
Leu Ser Ala Ala Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Asp Ala
1140 1145 1150
Ala Ile Arg Gly Lys Asn Asp Pro Leu Val Gly Leu Lys Glu Asn Val
1155 1160 1165
Ile Ile Gly Lys Ile Ile Pro Ala Gly Thr Gly Met Pro Asp Tyr Arg
1170 1175 1180
Gln Ile Lys Pro Lys Glu Val Gly Gly Thr Ser Thr Glu Gly Val Tyr
1185 1190 1195 1200
Ser Ile Ser Asp Leu Glu Lys Gln Met Gln Glu Asp Ser Gln Ala
1205 1210 1215
Claims (15)
- 변이형 rpoC RNA-폴리머라제 β' 서브유닛 단백질 암호화 서열(변이형 rpoC 암호화 서열)을 포함하는 핵산 분자로,
상기 변이형 rpoC 암호화 서열이 변이형 RpoC RNA-폴리머라제 β' 서브유닛 단백질(변이형 RpoC)을 암호화하고;
상기 변이형 RpoC가 서열번호 27에 적어도 90% 일치하는 아미노산 서열을 포함하고, R47C 치환을 포함하며, 이때 상기 R47C 치환의 넘버링은 서열번호 26의 아미노산 서열을 갖는 에스케리키아 콜라이(Escherichia coli)의 야생형 RpoC RNA-폴리머라제 β' 서브유닛 단백질(야생형 RpoC)에 기초하여 정의되는, 핵산 분자. - 제1항에 있어서,
변이형 RpoC의 발현이 서열번호 26을 포함하는 야생형 RpoC의 발현에 비해 플라스미드의 카피수를 감소시키는, 핵산 분자. - 제1항에 있어서,
변이형 RpoC가
(1) 서열번호 28을 포함하는 N-말단 도메인;
(2) 서열번호 29를 포함하는 센트럴 도메인; 및
(3) 서열번호 30을 포함하는 C-말단 도메인
을 포함하고, R47C 치환이 N-말단 도메인 내에 존재하는, 핵산 분자. - 제1항에 있어서,
변이형 RpoC가
(1) 서열번호 31을 포함하는 N-말단 도메인;
(2) 서열번호 32를 포함하는 센트럴 도메인; 및
(3) 서열번호 33을 포함하는 C-말단 도메인
을 포함하고, R47C 치환이 N-말단 도메인 내에 존재하는, 핵산 분자. - 삭제
- 제1항에 있어서,
변이형 RpoC가 서열번호 27을 포함하는, 핵산 분자. - 제1항 내지 제4항 및 제6항 중 어느 한 항의 핵산 분자를 포함하는 벡터.
- 제1항 내지 제4항 및 제6항 중 어느 한 항의 핵산 분자를 포함하는 재조합 미생물.
- 제8항에 있어서,
핵산 서열을 포함하는 rpoC 암호화 서열 벡터를 형질전환, 접합, 또는 형질도입 중 하나 이상에 의해 전구체 미생물 내에 도입시킴으로써 제조되거나; 또는
염색체를 포함하고, 변이형 rpoC 암호화 서열이 상기 변이형 rpoC 암호화 서열에 의한 내인성 rpoC 암호화 서열의 교체에 기초하여 상기 염색체 내에 존재하는
재조합 미생물. - 제8항에 있어서,
에스케리키아 속의 세균 또는 에스케리키아 콜라이 종의 세균 중 하나 이상으로부터 제조된 재조합 미생물. - 제8항의 재조합 미생물에서 대상 벡터의 카피수를 조절하는 방법으로, 상기 재조합 미생물을 상기 대상 벡터의 복제에 충분한 조건하에 배양 배지에서 배양하는 단계를 포함하는, 상기 대상 벡터의 카피수를 조절하는 방법.
- 제8항의 재조합 미생물의 사용에 의해 표적 생성물을 제조하는 방법으로, 상기 재조합 미생물이 표적 유전자 벡터를 포함하고, 상기 표적 유전자 벡터가 표적 생성물의 제조를 위한 표적 유전자를 포함하고,
(1) 상기 재조합 미생물을, 상기 재조합 미생물이 상기 표적 유전자를 발현하는 조건하에 배양 배지에서 배양하여, 상기 표적 생성물을 생성시키는 단계; 및
(2) 상기 재조합 미생물 및/또는 상기 배양 배지로부터 상기 표적 생성물을 회수하는 단계
를 포함하는 방법. - 제12항에 있어서,
표적 생성물이 (i) 표적 RNA, (ii) 표적 단백질, (iii) 표적 생물물질, (iv) 표적 중합체, 그의 전구체, 또는 그의 생성을 위한 효소, (v) 표적 감미제, 그의 전구체, 또는 그의 생성을 위한 효소, (vi) 표적 오일, 그의 전구체, 또는 그의 생성을 위한 효소, (vii) 표적 지방, 그의 전구체, 또는 그의 생성을 위한 효소, (viii) 표적 폴리사카라이드, 그의 전구체, 또는 그의 생성을 위한 효소, (ix) 표적 아미노산, 그의 전구체, 또는 그의 생성을 위한 효소, (x) 표적 뉴클레오티드, 그의 전구체, 또는 그의 생성을 위한 효소, (xi) 표적 백신, 그의 전구체, 또는 그의 생성을 위한 효소, 또는 (xii) 표적 약학 생성물, 그의 전구체, 또는 그의 생성을 위한 효소 중 하나 이상을 포함하는, 방법. - 제12항에 있어서,
표적 생성물이 (i) 표적 중합체, 그의 전구체, 또는 그의 생성을 위한 효소, 또는 (ii) 표적 생물중합체, 그의 전구체, 또는 그의 생성을 위한 효소 중 하나 이상을 포함하는 방법. - 변이형 rpoC 암호화 서열 및 유전자 교체 서열을 포함하는 유전자 교체 벡터로,
상기 변이형 rpoC 암호화 서열이 서열번호 28을 포함하는 변이형 RpoC N-말단 도메인을 암호화하고;
상기 유전자 교체 서열이 미생물의 염색체 중의 내인성 rpoC 암호화 서열을 상기 변이형 rpoC 암호화 서열로 교체하기 위한 것인
유전자 교체 벡터.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862715530P | 2018-08-07 | 2018-08-07 | |
US62/715,530 | 2018-08-07 | ||
PCT/KR2019/009909 WO2020032590A1 (en) | 2018-08-07 | 2019-08-07 | Nucleic acid molecules comprising a variant rpoc coding sequence |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20210030487A KR20210030487A (ko) | 2021-03-17 |
KR102624718B1 true KR102624718B1 (ko) | 2024-01-12 |
Family
ID=69407110
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020217006910A KR102624718B1 (ko) | 2018-08-07 | 2019-08-07 | 변이형 rpoc 암호화 서열을 포함하는 핵산 분자 |
Country Status (6)
Country | Link |
---|---|
US (1) | US11407983B2 (ko) |
EP (1) | EP3824089A4 (ko) |
JP (1) | JP7296449B2 (ko) |
KR (1) | KR102624718B1 (ko) |
CN (1) | CN112840028B (ko) |
WO (1) | WO2020032590A1 (ko) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020032595A1 (en) * | 2018-08-07 | 2020-02-13 | Cj Cheiljedang Corporation | Nucleic acid molecules comprising a variant inc coding strand |
KR20220163754A (ko) | 2021-06-03 | 2022-12-12 | 씨제이제일제당 (주) | 신규한 YhhS 변이체 및 이를 이용한 O-포스포세린, 시스테인 및 이의 유도체의 생산방법 |
KR102654301B1 (ko) | 2021-06-23 | 2024-04-04 | 씨제이제일제당 주식회사 | NADH:quinone 산화환원효소의 발현이 조절된 재조합 미생물 및 이를 이용한 O-포스포세린, 시스테인 및 이의 유도체의 생산방법 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1461426B1 (en) * | 2001-12-29 | 2008-01-09 | Novozymes A/S | Eubacterial rna-polymerase mutants for increasing heterologous gene expression |
US7291482B2 (en) | 2002-12-20 | 2007-11-06 | E.I. Du Pont De Nemours And Company | Mutations affecting plasmid copy number |
KR101594156B1 (ko) * | 2013-06-25 | 2016-02-15 | 씨제이제일제당 주식회사 | L-라이신 생산능이 향상된 미생물 및 그를 이용하여 l-라이신을 생산하는 방법 |
JP7025325B2 (ja) | 2015-10-30 | 2022-02-24 | ダニスコ・ユーエス・インク | タンパク質発現の増強およびその方法 |
WO2018091525A1 (en) * | 2016-11-15 | 2018-05-24 | Danmarks Tekniske Universitet | Bacterial cells with improved tolerance to diacids |
WO2020032595A1 (en) * | 2018-08-07 | 2020-02-13 | Cj Cheiljedang Corporation | Nucleic acid molecules comprising a variant inc coding strand |
-
2019
- 2019-08-07 US US16/534,130 patent/US11407983B2/en active Active
- 2019-08-07 KR KR1020217006910A patent/KR102624718B1/ko active IP Right Grant
- 2019-08-07 CN CN201980050327.0A patent/CN112840028B/zh active Active
- 2019-08-07 WO PCT/KR2019/009909 patent/WO2020032590A1/en unknown
- 2019-08-07 EP EP19846693.0A patent/EP3824089A4/en active Pending
- 2019-08-07 JP JP2021506482A patent/JP7296449B2/ja active Active
Non-Patent Citations (1)
Title |
---|
Mol Genet Genomics. 2002 Jul, 267(5):587-92.* |
Also Published As
Publication number | Publication date |
---|---|
WO2020032590A1 (en) | 2020-02-13 |
KR20210030487A (ko) | 2021-03-17 |
CN112840028B (zh) | 2024-01-16 |
EP3824089A4 (en) | 2021-12-15 |
US11407983B2 (en) | 2022-08-09 |
JP2021531817A (ja) | 2021-11-25 |
CN112840028A (zh) | 2021-05-25 |
US20200048619A1 (en) | 2020-02-13 |
JP7296449B2 (ja) | 2023-06-22 |
EP3824089A1 (en) | 2021-05-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10457919B2 (en) | Feedback-resistant acetohydroxy acid synthase variant and method for producing L-valine using the same | |
US11898185B2 (en) | Process for the production of fucosylated oligosaccharides | |
KR102094875B1 (ko) | 신규한 이소프로필말레이트 신타제 변이체 및 이를 이용한 l-류신의 생산 방법 | |
JP6648345B1 (ja) | 新規ポリペプチド及びこれを用いたimpの生産方法 | |
KR102147776B1 (ko) | L-라이신 생산능력이 향상된 코리네박테리움속 미생물 및 이를 이용한 라이신의 생산 방법 | |
KR102624718B1 (ko) | 변이형 rpoc 암호화 서열을 포함하는 핵산 분자 | |
JP2001149086A (ja) | 単離したポリヌクレオチド、複製可能なdna、ポリヌクレオチド配列、ベクター、宿主細胞として使用されるコリネフォルム細菌、およびl−アミノ酸の発酵による製造法 | |
KR102277407B1 (ko) | 신규한 글루타메이트 합성 효소 서브 유니트 알파 변이체 및 이를 이용한 l-글루탐산 생산 방법 | |
KR102078732B1 (ko) | 변형된 막 투과성 | |
KR102565817B1 (ko) | 변이형 inc 암호화 가닥을 포함하는 핵산 분자 | |
JP2011520462A (ja) | L−リシン生産の方法 | |
KR101768391B1 (ko) | L-라이신 생산능이 향상된 미생물 및 이를 이용한 l-라이신 생산방법 | |
US20230332116A1 (en) | Polypeptide with aspartate kinase activity and use thereof in production of amino acid | |
KR20220139085A (ko) | L-아르기닌을 생산하는 코리네박테리움 속 미생물 및 이를 이용한 l-아르기닌 생산방법 | |
KR101768390B1 (ko) | L-라이신 생산능이 향상된 미생물 및 이를 이용한 l-라이신 생산방법 | |
KR101760219B1 (ko) | L-라이신 생산능이 향상된 미생물 및 이를 이용한 l-라이신 생산방법 | |
KR20230002331A (ko) | L-라이신을 생산하는 재조합 균주 및 이의 구축 방법과 응용 | |
AU2010318324B2 (en) | Microorganisms having enhanced sucrose mutase activity | |
KR101755767B1 (ko) | L-라이신 생산능이 향상된 미생물 및 이를 이용한 l-라이신 생산방법 | |
KR102688631B1 (ko) | 클래스 I 타입의 BirA를 포함하는 미생물 및 이를 이용한 바이오틴 생산방법 | |
KR102665227B1 (ko) | 고농도 l-글루탐산을 생산하기 위한 균주 및 이를 이용한 l-글루탐산 생산방법 | |
CN116004501A (zh) | Nadp-铁氧还蛋白还原酶突变体及其在生产谷氨酸中的应用 | |
KR20230042953A (ko) | 고농도 l-글루탐산을 생산하기 위한 균주 및 이를 이용한 l-글루탐산 생산방법 | |
CN114269909A (zh) | 新型可溶性吡啶核苷酸转氢酶变体及使用其生产l-色氨酸的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant |