KR20060123291A - 신규한 비정형 폐렴-원인성 바이러스 - Google Patents
신규한 비정형 폐렴-원인성 바이러스 Download PDFInfo
- Publication number
- KR20060123291A KR20060123291A KR1020067011389A KR20067011389A KR20060123291A KR 20060123291 A KR20060123291 A KR 20060123291A KR 1020067011389 A KR1020067011389 A KR 1020067011389A KR 20067011389 A KR20067011389 A KR 20067011389A KR 20060123291 A KR20060123291 A KR 20060123291A
- Authority
- KR
- South Korea
- Prior art keywords
- val
- leu
- ser
- phe
- gly
- Prior art date
Links
- 241000700605 Viruses Species 0.000 title claims description 140
- 206010003757 Atypical pneumonia Diseases 0.000 title claims description 12
- 241001493065 dsRNA viruses Species 0.000 claims abstract description 9
- 150000007523 nucleic acids Chemical group 0.000 claims description 60
- 108020004707 nucleic acids Proteins 0.000 claims description 44
- 102000039446 nucleic acids Human genes 0.000 claims description 44
- 238000000034 method Methods 0.000 claims description 39
- 239000012634 fragment Substances 0.000 claims description 37
- 230000003612 virological effect Effects 0.000 claims description 34
- 108700026244 Open Reading Frames Proteins 0.000 claims description 24
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 22
- 241000711573 Coronaviridae Species 0.000 claims description 21
- 239000000427 antigen Substances 0.000 claims description 21
- 108091007433 antigens Proteins 0.000 claims description 21
- 102000036639 antigens Human genes 0.000 claims description 21
- 208000015181 infectious disease Diseases 0.000 claims description 14
- 239000008194 pharmaceutical composition Substances 0.000 claims description 14
- 238000011282 treatment Methods 0.000 claims description 13
- 230000009385 viral infection Effects 0.000 claims description 12
- 230000002265 prevention Effects 0.000 claims description 11
- 239000013598 vector Substances 0.000 claims description 11
- 241000711467 Human coronavirus 229E Species 0.000 claims description 10
- 241000124008 Mammalia Species 0.000 claims description 10
- 230000009897 systematic effect Effects 0.000 claims description 10
- 101710198474 Spike protein Proteins 0.000 claims description 9
- 108010067390 Viral Proteins Proteins 0.000 claims description 9
- 241000282898 Sus scrofa Species 0.000 claims description 8
- 108090000565 Capsid Proteins Proteins 0.000 claims description 7
- 102100023321 Ceruloplasmin Human genes 0.000 claims description 7
- 229940096437 Protein S Drugs 0.000 claims description 7
- 238000004519 manufacturing process Methods 0.000 claims description 7
- 206010012735 Diarrhoea Diseases 0.000 claims description 6
- 108060003393 Granulin Proteins 0.000 claims description 6
- 238000009007 Diagnostic Kit Methods 0.000 claims description 4
- 238000012360 testing method Methods 0.000 claims description 3
- 238000007476 Maximum Likelihood Methods 0.000 claims description 2
- 101710091045 Envelope protein Proteins 0.000 claims 1
- 101710188315 Protein X Proteins 0.000 claims 1
- 102100021696 Syncytin-1 Human genes 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 238000002856 computational phylogenetic analysis Methods 0.000 claims 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims 1
- 101710149951 Protein Tat Proteins 0.000 description 331
- 241000282326 Felis catus Species 0.000 description 143
- 241000880493 Leptailurus serval Species 0.000 description 42
- 108010034529 leucyl-lysine Proteins 0.000 description 34
- 239000000523 sample Substances 0.000 description 32
- 150000001413 amino acids Chemical class 0.000 description 31
- 108010047857 aspartylglycine Proteins 0.000 description 28
- 108010050848 glycylleucine Proteins 0.000 description 27
- 108010037850 glycylvaline Proteins 0.000 description 27
- 108010073969 valyllysine Proteins 0.000 description 27
- 239000002773 nucleotide Substances 0.000 description 23
- 125000003729 nucleotide group Chemical group 0.000 description 23
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 21
- 108010068265 aspartyltyrosine Proteins 0.000 description 20
- 238000001514 detection method Methods 0.000 description 20
- 238000009396 hybridization Methods 0.000 description 20
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 20
- 108010061238 threonyl-glycine Proteins 0.000 description 20
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 19
- 108010057821 leucylproline Proteins 0.000 description 19
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 18
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 18
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 18
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 18
- 238000003199 nucleic acid amplification method Methods 0.000 description 18
- 108090000623 proteins and genes Proteins 0.000 description 18
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 16
- 108010065920 Insulin Lispro Proteins 0.000 description 16
- 230000003321 amplification Effects 0.000 description 16
- 241000004176 Alphacoronavirus Species 0.000 description 15
- 108010038320 lysylphenylalanine Proteins 0.000 description 15
- 108010051242 phenylalanylserine Proteins 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 14
- 210000004027 cell Anatomy 0.000 description 14
- 230000000295 complement effect Effects 0.000 description 14
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 13
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 13
- 108010054813 diprotin B Proteins 0.000 description 13
- 108010064235 lysylglycine Proteins 0.000 description 13
- 108010017391 lysylvaline Proteins 0.000 description 13
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 12
- 108010047495 alanylglycine Proteins 0.000 description 12
- 108010069495 cysteinyltyrosine Proteins 0.000 description 12
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 12
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 12
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 12
- 108010081551 glycylphenylalanine Proteins 0.000 description 12
- 108010003137 tyrosyltyrosine Proteins 0.000 description 12
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 11
- 108010016616 cysteinylglycine Proteins 0.000 description 11
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 11
- 108010003700 lysyl aspartic acid Proteins 0.000 description 11
- 108010012581 phenylalanylglutamate Proteins 0.000 description 11
- 108010053725 prolylvaline Proteins 0.000 description 11
- 108010079364 N-glycylalanine Proteins 0.000 description 10
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 108010044940 alanylglutamine Proteins 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 108010004073 cysteinylcysteine Proteins 0.000 description 10
- 108010089804 glycyl-threonine Proteins 0.000 description 10
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 10
- 108010084572 phenylalanyl-valine Proteins 0.000 description 10
- 108010026333 seryl-proline Proteins 0.000 description 10
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 9
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 9
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 9
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 9
- 201000010099 disease Diseases 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 108010078144 glutaminyl-glycine Proteins 0.000 description 9
- 238000003018 immunoassay Methods 0.000 description 9
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 9
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 9
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 9
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 9
- 102000004169 proteins and genes Human genes 0.000 description 9
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 8
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 8
- 108091034117 Oligonucleotide Proteins 0.000 description 8
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 8
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 8
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 8
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 8
- 108010062796 arginyllysine Proteins 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- 230000027455 binding Effects 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- 108010027338 isoleucylcysteine Proteins 0.000 description 8
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 8
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 8
- 239000011159 matrix material Substances 0.000 description 8
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 8
- 108010048818 seryl-histidine Proteins 0.000 description 8
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 8
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 7
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 7
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 7
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 7
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 7
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 7
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 7
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 7
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 7
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 7
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 7
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 7
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 7
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 7
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 7
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 7
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 7
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 7
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 7
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 7
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 7
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 7
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 7
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 7
- 108010041407 alanylaspartic acid Proteins 0.000 description 7
- 108010087924 alanylproline Proteins 0.000 description 7
- 108010093581 aspartyl-proline Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 108010040030 histidinoalanine Proteins 0.000 description 7
- 108010025306 histidylleucine Proteins 0.000 description 7
- 108010092114 histidylphenylalanine Proteins 0.000 description 7
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 7
- 108010024607 phenylalanylalanine Proteins 0.000 description 7
- 108090000765 processed proteins & peptides Proteins 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 108010015796 prolylisoleucine Proteins 0.000 description 7
- 108010090894 prolylleucine Proteins 0.000 description 7
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 7
- 108010071207 serylmethionine Proteins 0.000 description 7
- 241000894007 species Species 0.000 description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 6
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 6
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 6
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 6
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 6
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 6
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 6
- 241000711506 Canine coronavirus Species 0.000 description 6
- 241000725579 Feline coronavirus Species 0.000 description 6
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 6
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 6
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 6
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 6
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 6
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 6
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 6
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 6
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 6
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 6
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 6
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 6
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 6
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 6
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 6
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 6
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 6
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 6
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 238000004113 cell culture Methods 0.000 description 6
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 6
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010087823 glycyltyrosine Proteins 0.000 description 6
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 6
- 108010078274 isoleucylvaline Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 6
- 108010068488 methionylphenylalanine Proteins 0.000 description 6
- 230000000069 prophylactic effect Effects 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 108700004896 tripeptide FEG Proteins 0.000 description 6
- 108010080629 tryptophan-leucine Proteins 0.000 description 6
- 108010051110 tyrosyl-lysine Proteins 0.000 description 6
- 239000013603 viral vector Substances 0.000 description 6
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 5
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 5
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 5
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 5
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 5
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 5
- 108091093088 Amplicon Proteins 0.000 description 5
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 5
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 5
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 5
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 5
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 5
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 5
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 5
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 5
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 5
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 5
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 5
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 5
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 5
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 5
- 241000282412 Homo Species 0.000 description 5
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 5
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 5
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 5
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 5
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 5
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 5
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 5
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 5
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 5
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 5
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 5
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 5
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 5
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 5
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 5
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 5
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 5
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 5
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 5
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 5
- 206010035664 Pneumonia Diseases 0.000 description 5
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 5
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 5
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 5
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 5
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 5
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 5
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 5
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 5
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 5
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 5
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 5
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 5
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 5
- 241000711484 Transmissible gastroenteritis virus Species 0.000 description 5
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 5
- 108010064997 VPY tripeptide Proteins 0.000 description 5
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 5
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 5
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 5
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 5
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 5
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 5
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 5
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 5
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 5
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 5
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 5
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 5
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 5
- 108010011559 alanylphenylalanine Proteins 0.000 description 5
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 238000003745 diagnosis Methods 0.000 description 5
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 5
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 5
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 5
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 5
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 5
- 108010077515 glycylproline Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 108010054155 lysyllysine Proteins 0.000 description 5
- 238000002844 melting Methods 0.000 description 5
- 230000008018 melting Effects 0.000 description 5
- 108010005942 methionylglycine Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 108010044292 tryptophyltyrosine Proteins 0.000 description 5
- 108010020532 tyrosyl-proline Proteins 0.000 description 5
- 238000005406 washing Methods 0.000 description 5
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 4
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 4
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 4
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 4
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 4
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 4
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 4
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 4
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 4
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 4
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 4
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 4
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 4
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 4
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 4
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 4
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 4
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 4
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 4
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 4
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 4
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 4
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 4
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 4
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 4
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 4
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 4
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 4
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 4
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 4
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 4
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 4
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 4
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 4
- 241000282472 Canis lupus familiaris Species 0.000 description 4
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 4
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 4
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 4
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 4
- 238000002965 ELISA Methods 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 4
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 4
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 4
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 4
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 4
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 4
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 4
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 4
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 4
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 4
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 4
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 4
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 4
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 4
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 4
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 4
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 4
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 4
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 4
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 4
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 4
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 4
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 4
- 244000309467 Human Coronavirus Species 0.000 description 4
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 4
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 4
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 4
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 4
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 4
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 4
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 4
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 4
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 4
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 4
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 4
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 4
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 4
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 4
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 4
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 4
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 4
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 4
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 4
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 4
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 4
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 4
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 4
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 4
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 4
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 4
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 4
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 4
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 4
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 4
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 4
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 4
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 4
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 4
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 4
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 4
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 4
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 4
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 4
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 4
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 4
- 108010066427 N-valyltryptophan Proteins 0.000 description 4
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 4
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 4
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 4
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 4
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 4
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 4
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 4
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 4
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 4
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 4
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 4
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 4
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 4
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 4
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 4
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 4
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 4
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 4
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 4
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 4
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 4
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 4
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 4
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 4
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 4
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 4
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 4
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 4
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 4
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 4
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 4
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 4
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 4
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 4
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 4
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 4
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 4
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 4
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 4
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 4
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 4
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 4
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 4
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 4
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 4
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 4
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 4
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 4
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 4
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 4
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 4
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 4
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 4
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 4
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 4
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 4
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 4
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 4
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 4
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 4
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 4
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 4
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 4
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 4
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 4
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 4
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 4
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 4
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 4
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 4
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 4
- 208000036142 Viral infection Diseases 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 4
- 239000003443 antiviral agent Substances 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 4
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 4
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 4
- 108010036533 arginylvaline Proteins 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 210000000234 capsid Anatomy 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 239000000975 dye Substances 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 4
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 230000002458 infectious effect Effects 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 108010018625 phenylalanylarginine Proteins 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- 238000013081 phylogenetic analysis Methods 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 230000002797 proteolythic effect Effects 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 4
- 108010045269 tryptophyltryptophan Proteins 0.000 description 4
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 3
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 3
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 3
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 3
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 3
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 3
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 3
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 3
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 3
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 3
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 3
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 3
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 3
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 3
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 3
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 3
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 3
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 3
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 3
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 3
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 3
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 3
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 3
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 3
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 3
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 3
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 3
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 3
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 3
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 3
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 3
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 3
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 3
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 3
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 3
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 3
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 3
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 3
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 3
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 3
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 3
- FGYUMGXLCZYNQG-UBHSHLNASA-N Asn-Cys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CS)NC(=O)[C@H](CC(N)=O)N)C(O)=O)=CNC2=C1 FGYUMGXLCZYNQG-UBHSHLNASA-N 0.000 description 3
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 3
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 3
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 3
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 3
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 3
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 3
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 3
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 3
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 3
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 3
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 3
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 3
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 3
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 3
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 3
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 3
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 3
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 3
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 3
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 3
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 3
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 3
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 3
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 3
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 3
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 3
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 3
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 3
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 3
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 3
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 3
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 3
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 3
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 3
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 3
- 102100031673 Corneodesmosin Human genes 0.000 description 3
- 229920000742 Cotton Polymers 0.000 description 3
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 3
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 3
- GGIHYKLJUIZYGH-ZLUOBGJFSA-N Cys-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O GGIHYKLJUIZYGH-ZLUOBGJFSA-N 0.000 description 3
- KCPOQGRVVXYLAC-KKUMJFAQSA-N Cys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KCPOQGRVVXYLAC-KKUMJFAQSA-N 0.000 description 3
- NLDWTJBJFVWBDQ-KKUMJFAQSA-N Cys-Lys-Phe Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NLDWTJBJFVWBDQ-KKUMJFAQSA-N 0.000 description 3
- IDZDFWJNPOOOHE-KKUMJFAQSA-N Cys-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N IDZDFWJNPOOOHE-KKUMJFAQSA-N 0.000 description 3
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 3
- IRKLTAKLAFUTLA-KATARQTJSA-N Cys-Thr-Lys Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCCCN)C(O)=O IRKLTAKLAFUTLA-KATARQTJSA-N 0.000 description 3
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- 208000005577 Gastroenteritis Diseases 0.000 description 3
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 3
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 3
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 3
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 3
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 3
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 3
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 3
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 3
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 3
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 3
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 3
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 3
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 3
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 3
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 3
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 3
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 3
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 3
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 3
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 3
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 3
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 3
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 3
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 3
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 3
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 3
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 3
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 3
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 3
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 3
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 3
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 3
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 3
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 3
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 3
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 3
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 3
- VIVSWEBJUHXCDS-DCAQKATOSA-N His-Asn-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O VIVSWEBJUHXCDS-DCAQKATOSA-N 0.000 description 3
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 3
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 3
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 3
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 3
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 3
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 3
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 3
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 3
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 3
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 3
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 3
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 3
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 3
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 3
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 3
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 3
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 3
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 3
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 3
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 3
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 3
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 3
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 3
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 3
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 3
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 3
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 3
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 3
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 3
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 3
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 3
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 3
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 3
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 3
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 3
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 3
- BCUVPZLLSRMPJL-XIRDDKMYSA-N Leu-Trp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N BCUVPZLLSRMPJL-XIRDDKMYSA-N 0.000 description 3
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 3
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 3
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 3
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 3
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 3
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 3
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 3
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 3
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 3
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 3
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 3
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 3
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 3
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 3
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 3
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 3
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 3
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 3
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 3
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 3
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 3
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 3
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 3
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 3
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 3
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 3
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 3
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 3
- PJWDQHNOJIBMRY-JYJNAYRXSA-N Met-Arg-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PJWDQHNOJIBMRY-JYJNAYRXSA-N 0.000 description 3
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- 108010061100 Nucleoproteins Proteins 0.000 description 3
- 102000011931 Nucleoproteins Human genes 0.000 description 3
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 3
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 3
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 3
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 3
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 3
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 3
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 3
- OWCLJDXHHZUNEL-IHRRRGAJSA-N Phe-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OWCLJDXHHZUNEL-IHRRRGAJSA-N 0.000 description 3
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 3
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 3
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 3
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 3
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 3
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 3
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 3
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 3
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 3
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 3
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 3
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 3
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 3
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 3
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 3
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 3
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 3
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 3
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 3
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 3
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 3
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 3
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 3
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 3
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 3
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 3
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 3
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 3
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 3
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 3
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 3
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 3
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 3
- NJSPTZXVPZDRCU-UBHSHLNASA-N Ser-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N NJSPTZXVPZDRCU-UBHSHLNASA-N 0.000 description 3
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 3
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 3
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 3
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 3
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 3
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 3
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 3
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 3
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 3
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 3
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 3
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 3
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 3
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 3
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 3
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 3
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 3
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 3
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 3
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 3
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 3
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 3
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 3
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 3
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 3
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 3
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 3
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 3
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 3
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 3
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 3
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 3
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 3
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 3
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 3
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 3
- DWJQKEZKLQCHKO-SRVKXCTJSA-N Tyr-Asn-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O DWJQKEZKLQCHKO-SRVKXCTJSA-N 0.000 description 3
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 3
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 3
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 3
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 3
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 3
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 3
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 3
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 3
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 3
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 3
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 3
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 3
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 3
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 3
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 3
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 3
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 3
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 3
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 3
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 3
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 3
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 3
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 3
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 3
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 3
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 3
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 3
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 3
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 3
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 3
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 3
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 3
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 3
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 3
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 3
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 3
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 3
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 3
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 3
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 229960002685 biotin Drugs 0.000 description 3
- 235000020958 biotin Nutrition 0.000 description 3
- 239000011616 biotin Substances 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 230000002950 deficient Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 3
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 208000023504 respiratory system disease Diseases 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 229940031626 subunit vaccine Drugs 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 108010029384 tryptophyl-histidine Proteins 0.000 description 3
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 3
- 108010078580 tyrosylleucine Proteins 0.000 description 3
- 229960005486 vaccine Drugs 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 2
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 2
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 2
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 2
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 2
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 2
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 2
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 2
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 2
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Natural products CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 2
- AHPWQERCDZTTNB-FXQIFTODSA-N Arg-Cys-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AHPWQERCDZTTNB-FXQIFTODSA-N 0.000 description 2
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 2
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- PSOPJDUQUVFSLS-GUBZILKMSA-N Arg-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PSOPJDUQUVFSLS-GUBZILKMSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 2
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 2
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 2
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 2
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 2
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 2
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 2
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 2
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 2
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 2
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 2
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 2
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 2
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 2
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 2
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 2
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 2
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 2
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 2
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 2
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 2
- LXKLDWVHXNZQGB-SRVKXCTJSA-N Asp-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O LXKLDWVHXNZQGB-SRVKXCTJSA-N 0.000 description 2
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 2
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 2
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 2
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 2
- LBFYTUPYYZENIR-GHCJXIJMSA-N Asp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N LBFYTUPYYZENIR-GHCJXIJMSA-N 0.000 description 2
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 2
- LOEKZJRUVGORIY-CAMMJAKZSA-N Asp-Phe-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 LOEKZJRUVGORIY-CAMMJAKZSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 2
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 2
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 101710139375 Corneodesmosin Proteins 0.000 description 2
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 2
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 2
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 2
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 2
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 2
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 2
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 2
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- DIUBVGXMXONJCF-KKUMJFAQSA-N Cys-His-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DIUBVGXMXONJCF-KKUMJFAQSA-N 0.000 description 2
- BBQIWFFTTQTNOC-AVGNSLFASA-N Cys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N BBQIWFFTTQTNOC-AVGNSLFASA-N 0.000 description 2
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 2
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 2
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 2
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 2
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 2
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 2
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 2
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 2
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 101710204837 Envelope small membrane protein Proteins 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 2
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 2
- IPHGBVYWRKCGKG-FXQIFTODSA-N Gln-Cys-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O IPHGBVYWRKCGKG-FXQIFTODSA-N 0.000 description 2
- LPJVZYMINRLCQA-AVGNSLFASA-N Gln-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N LPJVZYMINRLCQA-AVGNSLFASA-N 0.000 description 2
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 2
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 2
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 2
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 2
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 2
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 2
- DSRVQBZAMPGEKU-AVGNSLFASA-N Gln-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DSRVQBZAMPGEKU-AVGNSLFASA-N 0.000 description 2
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 2
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 2
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 2
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 2
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 2
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- ISXJHXGYMJKXOI-GUBZILKMSA-N Glu-Cys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O ISXJHXGYMJKXOI-GUBZILKMSA-N 0.000 description 2
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 2
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 2
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 2
- VLIJYPMATZSOLL-YUMQZZPRSA-N Gly-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VLIJYPMATZSOLL-YUMQZZPRSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 2
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 2
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 2
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 2
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 2
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 2
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 2
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 2
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 2
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 2
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 2
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 2
- 241001272567 Hominoidea Species 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 2
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 2
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 2
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 2
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 2
- UQXADIGYEYBJEI-DJFWLOJKSA-N Ile-His-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N UQXADIGYEYBJEI-DJFWLOJKSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 2
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 2
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 2
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 2
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- HZVRQFKRALAMQS-SLBDDTMCSA-N Ile-Trp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZVRQFKRALAMQS-SLBDDTMCSA-N 0.000 description 2
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 2
- WJBOZUVRPOIQNN-KJYZGMDISA-N Ile-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)C1=CN=CN1 WJBOZUVRPOIQNN-KJYZGMDISA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- 108700005091 Immunoglobulin Genes Proteins 0.000 description 2
- 241000711450 Infectious bronchitis virus Species 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 2
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 2
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 2
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 2
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 2
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 2
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 2
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 2
- VSJXPNCQYGOLFM-XIRDDKMYSA-N Lys-Cys-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VSJXPNCQYGOLFM-XIRDDKMYSA-N 0.000 description 2
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 2
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 2
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 2
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 2
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 2
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 2
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- 101710145006 Lysis protein Proteins 0.000 description 2
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 2
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 2
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 2
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 2
- YKWHHKDMBZBMLG-GUBZILKMSA-N Met-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N YKWHHKDMBZBMLG-GUBZILKMSA-N 0.000 description 2
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 2
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 2
- KBTQZYASLSUFJR-KKUMJFAQSA-N Met-Phe-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KBTQZYASLSUFJR-KKUMJFAQSA-N 0.000 description 2
- GRKPXCKLOOUDFG-UFYCRDLUSA-N Met-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 GRKPXCKLOOUDFG-UFYCRDLUSA-N 0.000 description 2
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 2
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 2
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 2
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 2
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 2
- 241000711466 Murine hepatitis virus Species 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 2
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 2
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 2
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 2
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 2
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- PSBJZLMFFTULDX-IXOXFDKPSA-N Phe-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N)O PSBJZLMFFTULDX-IXOXFDKPSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 2
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 2
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 description 2
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 2
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 2
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 2
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 2
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 2
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 2
- GLUYKHMBGKQBHE-JYJNAYRXSA-N Phe-Val-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 GLUYKHMBGKQBHE-JYJNAYRXSA-N 0.000 description 2
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- NUZHSNLQJDYSRW-BZSNNMDCSA-N Pro-Arg-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NUZHSNLQJDYSRW-BZSNNMDCSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 2
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 2
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 2
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- GBRUQFBAJOKCTF-DCAQKATOSA-N Pro-His-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O GBRUQFBAJOKCTF-DCAQKATOSA-N 0.000 description 2
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 2
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 2
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 241001428933 Rat coronavirus Species 0.000 description 2
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 2
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 2
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 2
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 2
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 2
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 2
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- 108010090804 Streptavidin Proteins 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 241000282887 Suidae Species 0.000 description 2
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 2
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 2
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 2
- LOHBIDZYHQQTDM-IXOXFDKPSA-N Thr-Cys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LOHBIDZYHQQTDM-IXOXFDKPSA-N 0.000 description 2
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 2
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 2
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 2
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 2
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 2
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 2
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 2
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 2
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 2
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 2
- VGNKUXWYFFDWDH-BEMMVCDISA-N Thr-Trp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N)O VGNKUXWYFFDWDH-BEMMVCDISA-N 0.000 description 2
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 2
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- QNMIVTOQXUSGLN-SZMVWBNQSA-N Trp-Arg-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QNMIVTOQXUSGLN-SZMVWBNQSA-N 0.000 description 2
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 2
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 2
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 2
- JGLXHHQUSIULAK-OYDLWJJNSA-N Trp-Pro-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]3CCCN3C(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(O)=O)=CNC2=C1 JGLXHHQUSIULAK-OYDLWJJNSA-N 0.000 description 2
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 2
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 2
- FBHHJGOJWXHGDO-TUSQITKMSA-N Trp-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 FBHHJGOJWXHGDO-TUSQITKMSA-N 0.000 description 2
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 2
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 2
- FFWCYWZIVFIUDM-OYDLWJJNSA-N Trp-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O FFWCYWZIVFIUDM-OYDLWJJNSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 2
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 2
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 2
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 2
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 2
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 2
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 2
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 2
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 2
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 2
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 2
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 2
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 2
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 2
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 2
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 2
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 2
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 2
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 2
- QMNWABHLJOHGDS-IHRRRGAJSA-N Tyr-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QMNWABHLJOHGDS-IHRRRGAJSA-N 0.000 description 2
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 2
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 2
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 2
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 2
- QFHRUCJIRVILCK-YJRXYDGGSA-N Tyr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O QFHRUCJIRVILCK-YJRXYDGGSA-N 0.000 description 2
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 2
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 2
- DJSYPCWZPNHQQE-FHWLQOOXSA-N Tyr-Tyr-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=C(O)C=C1 DJSYPCWZPNHQQE-FHWLQOOXSA-N 0.000 description 2
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 2
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 2
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- NMPXRFYMZDIBRF-ZOBUZTSGSA-N Val-Asn-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NMPXRFYMZDIBRF-ZOBUZTSGSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 2
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 2
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 2
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 2
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 2
- CJDZKZFMAXGUOJ-IHRRRGAJSA-N Val-Cys-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CJDZKZFMAXGUOJ-IHRRRGAJSA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 2
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 2
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 2
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 2
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 2
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 125000004122 cyclic group Chemical group 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 108010068404 exorphin B4 Proteins 0.000 description 2
- 210000003608 fece Anatomy 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 2
- 108010028403 hemagglutinin esterase Proteins 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 238000010166 immunofluorescence Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010026228 mRNA guanylyltransferase Proteins 0.000 description 2
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000006386 neutralization reaction Methods 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 229910052698 phosphorus Inorganic materials 0.000 description 2
- 239000011574 phosphorus Substances 0.000 description 2
- -1 polypropylene Polymers 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 230000000241 respiratory effect Effects 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 229910001415 sodium ion Inorganic materials 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- ZNAIHAPCDVUWRX-DUCUPYJCSA-N (4s,4as,5as,6s,12ar)-7-chloro-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide;4-amino-n-(4,6-dimethylpyrimidin-2-yl)benzenesulfonamide;(2s,5r,6r)-3,3-dimethyl-7-oxo-6-[(2-phenylacetyl)amino]-4-t Chemical compound CC1=CC(C)=NC(NS(=O)(=O)C=2C=CC(N)=CC=2)=N1.N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1.C1=CC(Cl)=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O ZNAIHAPCDVUWRX-DUCUPYJCSA-N 0.000 description 1
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 1
- YEJQWBFDKKTPNO-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylbutanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)C)C(O)=O YEJQWBFDKKTPNO-UHFFFAOYSA-N 0.000 description 1
- JUEUYDRZJNQZGR-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JUEUYDRZJNQZGR-UHFFFAOYSA-N 0.000 description 1
- 101150001666 2a gene Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 108010036211 5-HT-moduline Proteins 0.000 description 1
- WOVKYSAHUYNSMH-RRKCRQDMSA-N 5-bromodeoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-RRKCRQDMSA-N 0.000 description 1
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 1
- RCQRKPUXJAGEEC-ZLUOBGJFSA-N Ala-Cys-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RCQRKPUXJAGEEC-ZLUOBGJFSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- MAEQBGQTDWDSJQ-LSJOCFKGSA-N Ala-Met-His Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MAEQBGQTDWDSJQ-LSJOCFKGSA-N 0.000 description 1
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 1
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 1
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- DGFGDPVSDQPANQ-XGEHTFHBSA-N Arg-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)O DGFGDPVSDQPANQ-XGEHTFHBSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- BXLDDWZOTGGNOJ-SZMVWBNQSA-N Arg-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N BXLDDWZOTGGNOJ-SZMVWBNQSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- BFDDUDQCPJWQRQ-IHRRRGAJSA-N Arg-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O BFDDUDQCPJWQRQ-IHRRRGAJSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- JRVABKHPWDRUJF-UBHSHLNASA-N Asn-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N JRVABKHPWDRUJF-UBHSHLNASA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- RFLVTVBAESPKKR-ZLUOBGJFSA-N Asn-Cys-Cys Chemical compound N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O RFLVTVBAESPKKR-ZLUOBGJFSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 1
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- GYOHQKJEQQJBOY-QEJZJMRPSA-N Asn-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N GYOHQKJEQQJBOY-QEJZJMRPSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- GOKCTAJWRPSCHP-VHWLVUOQSA-N Asn-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N GOKCTAJWRPSCHP-VHWLVUOQSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- LANZYLJEHLBUPR-BPUTZDHNSA-N Asn-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N LANZYLJEHLBUPR-BPUTZDHNSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 1
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 1
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 1
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 1
- CRNKLABLTICXDV-GUBZILKMSA-N Asp-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N CRNKLABLTICXDV-GUBZILKMSA-N 0.000 description 1
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 1
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- IQCJOIHDVFJQFV-LKXGYXEUSA-N Asp-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O IQCJOIHDVFJQFV-LKXGYXEUSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- VXEORMGBKTUUCM-KWBADKCTSA-N Asp-Val-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O VXEORMGBKTUUCM-KWBADKCTSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 241000020089 Atacta Species 0.000 description 1
- 101000666833 Autographa californica nuclear polyhedrosis virus Uncharacterized 20.8 kDa protein in FGF-VUBI intergenic region Proteins 0.000 description 1
- IVRMZWNICZWHMI-UHFFFAOYSA-N Azide Chemical compound [N-]=[N+]=[N-] IVRMZWNICZWHMI-UHFFFAOYSA-N 0.000 description 1
- 101000977027 Azospirillum brasilense Uncharacterized protein in nodG 5'region Proteins 0.000 description 1
- 101000962005 Bacillus thuringiensis Uncharacterized 23.6 kDa protein Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 101000713368 Bovine immunodeficiency virus (strain R29) Protein Tat Proteins 0.000 description 1
- 101100177112 Caenorhabditis elegans his-70 gene Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241001466804 Carnivora Species 0.000 description 1
- 208000003322 Coinfection Diseases 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 208000001528 Coronaviridae Infections Diseases 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 1
- XMTDCXXLDZKAGI-ACZMJKKPSA-N Cys-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XMTDCXXLDZKAGI-ACZMJKKPSA-N 0.000 description 1
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 1
- FMDCYTBSPZMPQE-JBDRJPRFSA-N Cys-Ala-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMDCYTBSPZMPQE-JBDRJPRFSA-N 0.000 description 1
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 1
- KKZHXOOZHFABQQ-UWJYBYFXSA-N Cys-Ala-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKZHXOOZHFABQQ-UWJYBYFXSA-N 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 1
- BNRHLRWCERLRTQ-BPUTZDHNSA-N Cys-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N BNRHLRWCERLRTQ-BPUTZDHNSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 1
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 1
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 1
- OLIYIKRCOZBFCW-ZLUOBGJFSA-N Cys-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)O OLIYIKRCOZBFCW-ZLUOBGJFSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 1
- XRTISHJEPHMBJG-SRVKXCTJSA-N Cys-Asp-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XRTISHJEPHMBJG-SRVKXCTJSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- WYZLWZNAWQNLGQ-FXQIFTODSA-N Cys-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N WYZLWZNAWQNLGQ-FXQIFTODSA-N 0.000 description 1
- KOHBWQDSVCARMI-BWBBJGPYSA-N Cys-Cys-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KOHBWQDSVCARMI-BWBBJGPYSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- PFAQXUDMZVMADG-AVGNSLFASA-N Cys-Gln-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PFAQXUDMZVMADG-AVGNSLFASA-N 0.000 description 1
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- SBORMUFGKSCGEN-XHNCKOQMSA-N Cys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)C(=O)O SBORMUFGKSCGEN-XHNCKOQMSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 1
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- XGHYKIDVGYYHDC-JBDRJPRFSA-N Cys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N XGHYKIDVGYYHDC-JBDRJPRFSA-N 0.000 description 1
- KCSDYJSCUWLILX-BJDJZHNGSA-N Cys-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N KCSDYJSCUWLILX-BJDJZHNGSA-N 0.000 description 1
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 1
- PDRMRVHPAQKTLT-NAKRPEOUSA-N Cys-Ile-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O PDRMRVHPAQKTLT-NAKRPEOUSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- XXDATQFUGMAJRV-XIRDDKMYSA-N Cys-Leu-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XXDATQFUGMAJRV-XIRDDKMYSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- VXLXATVURDNDCG-CIUDSAMLSA-N Cys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N VXLXATVURDNDCG-CIUDSAMLSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 1
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 1
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 1
- GFAPBMCRSMSGDZ-XGEHTFHBSA-N Cys-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N)O GFAPBMCRSMSGDZ-XGEHTFHBSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- DQBRIEGWTLXALA-GQGQLFGLSA-N Cys-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N DQBRIEGWTLXALA-GQGQLFGLSA-N 0.000 description 1
- MSWBLPLBSLQVME-XIRDDKMYSA-N Cys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 MSWBLPLBSLQVME-XIRDDKMYSA-N 0.000 description 1
- RIONIAPMMKVUCX-IHPCNDPISA-N Cys-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CC=C(O)C=C1 RIONIAPMMKVUCX-IHPCNDPISA-N 0.000 description 1
- QUQHPUMRFGFINP-BPUTZDHNSA-N Cys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N QUQHPUMRFGFINP-BPUTZDHNSA-N 0.000 description 1
- KXHAPEPORGOXDT-UWJYBYFXSA-N Cys-Tyr-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O KXHAPEPORGOXDT-UWJYBYFXSA-N 0.000 description 1
- MJOYUXLETJMQGG-IHRRRGAJSA-N Cys-Tyr-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJOYUXLETJMQGG-IHRRRGAJSA-N 0.000 description 1
- JIZRUFJGHPIYPS-SRVKXCTJSA-N Cys-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O JIZRUFJGHPIYPS-SRVKXCTJSA-N 0.000 description 1
- OEDPLIBVQGRKGZ-AVGNSLFASA-N Cys-Tyr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O OEDPLIBVQGRKGZ-AVGNSLFASA-N 0.000 description 1
- UGPCUUWZXRMCIJ-KKUMJFAQSA-N Cys-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N UGPCUUWZXRMCIJ-KKUMJFAQSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- ZOMMHASZJQRLFS-IHRRRGAJSA-N Cys-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N ZOMMHASZJQRLFS-IHRRRGAJSA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- IOLWXFWVYYCVTJ-NRPADANISA-N Cys-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N IOLWXFWVYYCVTJ-NRPADANISA-N 0.000 description 1
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 1
- ZXGDAZLSOSYSBA-IHRRRGAJSA-N Cys-Val-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZXGDAZLSOSYSBA-IHRRRGAJSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 125000000824 D-ribofuranosyl group Chemical group [H]OC([H])([H])[C@@]1([H])OC([H])(*)[C@]([H])(O[H])[C@]1([H])O[H] 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 101000785191 Drosophila melanogaster Uncharacterized 50 kDa protein in type I retrotransposable element R1DM Proteins 0.000 description 1
- 101000747704 Enterobacteria phage N4 Uncharacterized protein Gp1 Proteins 0.000 description 1
- 101000861206 Enterococcus faecalis (strain ATCC 700802 / V583) Uncharacterized protein EF_A0048 Proteins 0.000 description 1
- 101000769180 Escherichia coli Uncharacterized 11.1 kDa protein Proteins 0.000 description 1
- QTANTQQOYSUMLC-UHFFFAOYSA-O Ethidium cation Chemical compound C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 QTANTQQOYSUMLC-UHFFFAOYSA-O 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- FVEMBYKESRUFBG-SZMVWBNQSA-N Gln-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FVEMBYKESRUFBG-SZMVWBNQSA-N 0.000 description 1
- WBBVTGIFQIZBHP-JBACZVJFSA-N Gln-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N WBBVTGIFQIZBHP-JBACZVJFSA-N 0.000 description 1
- YJCZUTXLPXBNIO-BHYGNILZSA-N Gln-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N)C(=O)O YJCZUTXLPXBNIO-BHYGNILZSA-N 0.000 description 1
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 1
- MXPBQDFWIMBACQ-ACZMJKKPSA-N Glu-Cys-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O MXPBQDFWIMBACQ-ACZMJKKPSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- MIQCYAJSDGNCNK-BPUTZDHNSA-N Glu-Gln-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MIQCYAJSDGNCNK-BPUTZDHNSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- YDJOULGWHQRPEV-SRVKXCTJSA-N Glu-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N YDJOULGWHQRPEV-SRVKXCTJSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- GTFYQOVVVJASOA-ACZMJKKPSA-N Glu-Ser-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N GTFYQOVVVJASOA-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- LEGMTEAZGRRIMY-ZKWXMUAHSA-N Gly-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN LEGMTEAZGRRIMY-ZKWXMUAHSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- GHHAMXVMWXMGSV-STQMWFEESA-N Gly-Cys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O)=CNC2=C1 GHHAMXVMWXMGSV-STQMWFEESA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- WRFOZIJRODPLIA-QWRGUYRKSA-N Gly-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O WRFOZIJRODPLIA-QWRGUYRKSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 101150029742 HE gene Proteins 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- YXBRCTXAEYSCHS-XVYDVKMFSA-N His-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YXBRCTXAEYSCHS-XVYDVKMFSA-N 0.000 description 1
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- ZNPRMNDAFQKATM-LKTVYLICSA-N His-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZNPRMNDAFQKATM-LKTVYLICSA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 1
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 1
- LBCAQRFTWMMWRR-CIUDSAMLSA-N His-Cys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O LBCAQRFTWMMWRR-CIUDSAMLSA-N 0.000 description 1
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 1
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 1
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 1
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- DPQIPEAHIYMUEJ-IHRRRGAJSA-N His-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N DPQIPEAHIYMUEJ-IHRRRGAJSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 1
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- YAJQKIBLYPFAET-NAZCDGGXSA-N His-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O YAJQKIBLYPFAET-NAZCDGGXSA-N 0.000 description 1
- PDLQNLSEJXOQNQ-IHPCNDPISA-N His-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CN=CN1 PDLQNLSEJXOQNQ-IHPCNDPISA-N 0.000 description 1
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 1
- YBDOQKVAGTWZMI-XIRDDKMYSA-N His-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YBDOQKVAGTWZMI-XIRDDKMYSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- WSWAUVHXQREQQG-JYJNAYRXSA-N His-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O WSWAUVHXQREQQG-JYJNAYRXSA-N 0.000 description 1
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- WSAILOWUJZEAGC-DCAQKATOSA-N His-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSAILOWUJZEAGC-DCAQKATOSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 241001428935 Human coronavirus OC43 Species 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- HZYHBDVRCBDJJV-HAFWLYHUSA-N Ile-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O HZYHBDVRCBDJJV-HAFWLYHUSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- SYVMEYAPXRRXAN-MXAVVETBSA-N Ile-Cys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SYVMEYAPXRRXAN-MXAVVETBSA-N 0.000 description 1
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- OONBGFHNQVSUBF-KBIXCLLPSA-N Ile-Gln-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O OONBGFHNQVSUBF-KBIXCLLPSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- JNDYZNJRRNFYIR-VGDYDELISA-N Ile-His-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N JNDYZNJRRNFYIR-VGDYDELISA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- VUPHVQCDULLACF-NAKRPEOUSA-N Ile-Met-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N VUPHVQCDULLACF-NAKRPEOUSA-N 0.000 description 1
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- KLJKJVXDHVUMMZ-KKPKCPPISA-N Ile-Phe-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N KLJKJVXDHVUMMZ-KKPKCPPISA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- PMAOIIWHZHAPBT-HJPIBITLSA-N Ile-Tyr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N PMAOIIWHZHAPBT-HJPIBITLSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 101000976301 Leptospira interrogans Uncharacterized 35 kDa protein in sph 3'region Proteins 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 1
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- AZLASBBHHSLQDB-GUBZILKMSA-N Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C AZLASBBHHSLQDB-GUBZILKMSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- HWMQRQIFVGEAPH-XIRDDKMYSA-N Leu-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 HWMQRQIFVGEAPH-XIRDDKMYSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- SNOUHRPNNCAOPI-SZMVWBNQSA-N Leu-Trp-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SNOUHRPNNCAOPI-SZMVWBNQSA-N 0.000 description 1
- LXGSOEPHQJONMG-PMVMPFDFSA-N Leu-Trp-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N LXGSOEPHQJONMG-PMVMPFDFSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- SEOXPEFQEOYURL-PMVMPFDFSA-N Leu-Tyr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SEOXPEFQEOYURL-PMVMPFDFSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- SFQPJNQDUUYCLA-BJDJZHNGSA-N Lys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N SFQPJNQDUUYCLA-BJDJZHNGSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- DAOSYIZXRCOKII-SRVKXCTJSA-N Lys-His-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O DAOSYIZXRCOKII-SRVKXCTJSA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- TYEJPFJNAHIKRT-DCAQKATOSA-N Lys-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N TYEJPFJNAHIKRT-DCAQKATOSA-N 0.000 description 1
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- BVRNWWHJYNPJDG-XIRDDKMYSA-N Lys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N BVRNWWHJYNPJDG-XIRDDKMYSA-N 0.000 description 1
- OKCJTECLRDARDZ-XIRDDKMYSA-N Lys-Trp-Cys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 OKCJTECLRDARDZ-XIRDDKMYSA-N 0.000 description 1
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 101710085938 Matrix protein Proteins 0.000 description 1
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 1
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 1
- 101710127721 Membrane protein Proteins 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- AHZNUGRZHMZGFL-GUBZILKMSA-N Met-Arg-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCNC(N)=N AHZNUGRZHMZGFL-GUBZILKMSA-N 0.000 description 1
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- AVTWKENDGGUWDC-BQBZGAKWSA-N Met-Cys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O AVTWKENDGGUWDC-BQBZGAKWSA-N 0.000 description 1
- PTYVBBNIAQWUFV-DCAQKATOSA-N Met-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N PTYVBBNIAQWUFV-DCAQKATOSA-N 0.000 description 1
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- PQPMMGQTRQFSDA-SRVKXCTJSA-N Met-Glu-His Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O PQPMMGQTRQFSDA-SRVKXCTJSA-N 0.000 description 1
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 1
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- MHQXIBRPDKXDGZ-ZFWWWQNUSA-N Met-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MHQXIBRPDKXDGZ-ZFWWWQNUSA-N 0.000 description 1
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 1
- DYTWOWJWJCBFLE-IHRRRGAJSA-N Met-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CNC=N1 DYTWOWJWJCBFLE-IHRRRGAJSA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- ODFBIJXEWPWSAN-CYDGBPFRSA-N Met-Ile-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O ODFBIJXEWPWSAN-CYDGBPFRSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 1
- SBFPAAPFKZPDCZ-JYJNAYRXSA-N Met-Pro-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SBFPAAPFKZPDCZ-JYJNAYRXSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- JZXKNNOWPBVZEV-XIRDDKMYSA-N Met-Trp-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JZXKNNOWPBVZEV-XIRDDKMYSA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 1
- ATBJCCFCJXCNGZ-UFYCRDLUSA-N Met-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 ATBJCCFCJXCNGZ-UFYCRDLUSA-N 0.000 description 1
- VYXIKLFLGRTANT-HRCADAONSA-N Met-Tyr-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N VYXIKLFLGRTANT-HRCADAONSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- MFDDVIJCQYOOES-GUBZILKMSA-N Met-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N MFDDVIJCQYOOES-GUBZILKMSA-N 0.000 description 1
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 208000005647 Mumps Diseases 0.000 description 1
- 241000428199 Mustelinae Species 0.000 description 1
- 101000658690 Neisseria meningitidis serogroup B Transposase for insertion sequence element IS1106 Proteins 0.000 description 1
- 102000008297 Nuclear Matrix-Associated Proteins Human genes 0.000 description 1
- 108010035916 Nuclear Matrix-Associated Proteins Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- ZBYHVSHBZYHQBW-SRVKXCTJSA-N Phe-Cys-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZBYHVSHBZYHQBW-SRVKXCTJSA-N 0.000 description 1
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 1
- VJEZWOSKRCLHRP-MELADBBJSA-N Phe-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O VJEZWOSKRCLHRP-MELADBBJSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- DHZOGDVYRQOGAC-BZSNNMDCSA-N Phe-Cys-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DHZOGDVYRQOGAC-BZSNNMDCSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- AKJAKCBHLJGRBU-JYJNAYRXSA-N Phe-Glu-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AKJAKCBHLJGRBU-JYJNAYRXSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- UXQFHEKRGHYJRA-STQMWFEESA-N Phe-Met-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O UXQFHEKRGHYJRA-STQMWFEESA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- PBWNICYZGJQKJV-BZSNNMDCSA-N Phe-Phe-Cys Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O PBWNICYZGJQKJV-BZSNNMDCSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 1
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- OLZVAVSJEUAOHI-UNQGMJICSA-N Phe-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O OLZVAVSJEUAOHI-UNQGMJICSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- OAAWNUBFRMVIQS-IHPCNDPISA-N Phe-Trp-Cys Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CS)CC1=CNC2=CC=CC=C12)CC1=CC=CC=C1 OAAWNUBFRMVIQS-IHPCNDPISA-N 0.000 description 1
- YCEWAVIRWNGGSS-NQCBNZPSSA-N Phe-Trp-Ile Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C1=CC=CC=C1 YCEWAVIRWNGGSS-NQCBNZPSSA-N 0.000 description 1
- YRHRGNUAXGUPTO-PMVMPFDFSA-N Phe-Trp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O)N YRHRGNUAXGUPTO-PMVMPFDFSA-N 0.000 description 1
- LKRUQZQZMXMKEQ-SFJXLCSZSA-N Phe-Trp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKRUQZQZMXMKEQ-SFJXLCSZSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 1
- 241000283216 Phocidae Species 0.000 description 1
- 108010089430 Phosphoproteins Proteins 0.000 description 1
- 102000007982 Phosphoproteins Human genes 0.000 description 1
- 206010035737 Pneumonia viral Diseases 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 241000711493 Porcine respiratory coronavirus Species 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- WFLWKEUBTSOFMP-FXQIFTODSA-N Pro-Cys-Cys Chemical compound OC(=O)[C@H](CS)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 WFLWKEUBTSOFMP-FXQIFTODSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 1
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- RSTWKJFWBKFOFC-JYJNAYRXSA-N Pro-Trp-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RSTWKJFWBKFOFC-JYJNAYRXSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 102000029301 Protein S Human genes 0.000 description 1
- 101710150114 Protein rep Proteins 0.000 description 1
- 101000748660 Pseudomonas savastanoi Uncharacterized 21 kDa protein in iaaL 5'region Proteins 0.000 description 1
- 108010066717 Q beta Replicase Proteins 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241001068295 Replication defective viruses Species 0.000 description 1
- 101710152114 Replication protein Proteins 0.000 description 1
- 101000584469 Rice tungro bacilliform virus (isolate Philippines) Protein P1 Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 241000282849 Ruminantia Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 1
- XERQKTRGJIKTRB-CIUDSAMLSA-N Ser-His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CN=CN1 XERQKTRGJIKTRB-CIUDSAMLSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- AXVNLRQLPLSIPQ-FXQIFTODSA-N Ser-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N AXVNLRQLPLSIPQ-FXQIFTODSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 101000818096 Spirochaeta aurantia Uncharacterized 15.5 kDa protein in trpE 3'region Proteins 0.000 description 1
- 229910000831 Steel Inorganic materials 0.000 description 1
- 101000766081 Streptomyces ambofaciens Uncharacterized HTH-type transcriptional regulator in unstable DNA locus Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 241000272534 Struthio camelus Species 0.000 description 1
- 101000804403 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HIT-like protein Synpcc7942_1390 Proteins 0.000 description 1
- 101000750910 Synechococcus elongatus (strain PCC 7942 / FACHB-805) Uncharacterized HTH-type transcriptional regulator Synpcc7942_2319 Proteins 0.000 description 1
- 101000644897 Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6) Uncharacterized protein SYNPCC7002_B0001 Proteins 0.000 description 1
- 108010008038 Synthetic Vaccines Proteins 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- VEWZSFGRQDUAJM-YJRXYDGGSA-N Thr-Cys-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O VEWZSFGRQDUAJM-YJRXYDGGSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- LUMXICQAOKVQOB-YWIQKCBGSA-N Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O LUMXICQAOKVQOB-YWIQKCBGSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- OWQKBXKXZFRRQL-XGEHTFHBSA-N Thr-Met-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N)O OWQKBXKXZFRRQL-XGEHTFHBSA-N 0.000 description 1
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 1
- UXUAZXWKIGPUCH-RCWTZXSCSA-N Thr-Met-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O UXUAZXWKIGPUCH-RCWTZXSCSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- UGFSAPWZBROURT-IXOXFDKPSA-N Thr-Phe-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N)O UGFSAPWZBROURT-IXOXFDKPSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- VFURAIPBOIWAKP-SZMVWBNQSA-N Trp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N VFURAIPBOIWAKP-SZMVWBNQSA-N 0.000 description 1
- AOAMKFFPFOPMLX-BVSLBCMMSA-N Trp-Arg-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AOAMKFFPFOPMLX-BVSLBCMMSA-N 0.000 description 1
- HQVKQINPFOCIIV-BVSLBCMMSA-N Trp-Arg-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 HQVKQINPFOCIIV-BVSLBCMMSA-N 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- GUWJWCHZNGDKBG-UBHSHLNASA-N Trp-Asn-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUWJWCHZNGDKBG-UBHSHLNASA-N 0.000 description 1
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 1
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 1
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 1
- WEAPHMIKOICYAU-QEJZJMRPSA-N Trp-Cys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WEAPHMIKOICYAU-QEJZJMRPSA-N 0.000 description 1
- HXMJXDNSFVNSEH-IHPCNDPISA-N Trp-Cys-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXMJXDNSFVNSEH-IHPCNDPISA-N 0.000 description 1
- IQLVYVFBJUWZNT-BPUTZDHNSA-N Trp-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N IQLVYVFBJUWZNT-BPUTZDHNSA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- LFMMXTLRXKBPMC-FDARSICLSA-N Trp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LFMMXTLRXKBPMC-FDARSICLSA-N 0.000 description 1
- RIOVOFZXVOWCCX-SBCJRHGPSA-N Trp-Ile-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)[C@@H](C)CC)C(O)=O)=CNC2=C1 RIOVOFZXVOWCCX-SBCJRHGPSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- OWSRIUBVJOQHNY-IHPCNDPISA-N Trp-Lys-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N OWSRIUBVJOQHNY-IHPCNDPISA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- GIAMKIPJSRZVJB-IHPCNDPISA-N Trp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GIAMKIPJSRZVJB-IHPCNDPISA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- JEYRCNVVYHTZMY-SZMVWBNQSA-N Trp-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JEYRCNVVYHTZMY-SZMVWBNQSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- STJXERBCEWQLKS-IHPCNDPISA-N Trp-Tyr-Cys Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 STJXERBCEWQLKS-IHPCNDPISA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 1
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 1
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 1
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 1
- SEFNTZYRPGBDCY-IHRRRGAJSA-N Tyr-Arg-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O SEFNTZYRPGBDCY-IHRRRGAJSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- YLHFIMLKNPJRGY-BVSLBCMMSA-N Tyr-Arg-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YLHFIMLKNPJRGY-BVSLBCMMSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 1
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 1
- CGDZGRLRXPNCOC-SRVKXCTJSA-N Tyr-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CGDZGRLRXPNCOC-SRVKXCTJSA-N 0.000 description 1
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 1
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 1
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- HNERGSKJJZQGEA-JYJNAYRXSA-N Tyr-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HNERGSKJJZQGEA-JYJNAYRXSA-N 0.000 description 1
- FWOVTJKVUCGVND-UFYCRDLUSA-N Tyr-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FWOVTJKVUCGVND-UFYCRDLUSA-N 0.000 description 1
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 1
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 1
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- FRUYSSRPJXNRRB-GUBZILKMSA-N Val-Cys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FRUYSSRPJXNRRB-GUBZILKMSA-N 0.000 description 1
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 1
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- MJXNDRCLGDSBBE-FHWLQOOXSA-N Val-His-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N MJXNDRCLGDSBBE-FHWLQOOXSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- PHZGFLFMGLXCFG-FHWLQOOXSA-N Val-Lys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PHZGFLFMGLXCFG-FHWLQOOXSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- HPOSMQWRPMRMFO-GUBZILKMSA-N Val-Pro-Cys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HPOSMQWRPMRMFO-GUBZILKMSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 108010031318 Vitronectin Proteins 0.000 description 1
- 101000916336 Xenopus laevis Transposon TX1 uncharacterized 82 kDa protein Proteins 0.000 description 1
- 101001000760 Zea mays Putative Pol polyprotein from transposon element Bs1 Proteins 0.000 description 1
- 101000678262 Zymomonas mobilis subsp. mobilis (strain ATCC 10988 / DSM 424 / LMG 404 / NCIMB 8938 / NRRL B-806 / ZM1) 65 kDa protein Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- DPKHZNPWBDQZCN-UHFFFAOYSA-N acridine orange free base Chemical compound C1=CC(N(C)C)=CC2=NC3=CC(N(C)C)=CC=C3C=C21 DPKHZNPWBDQZCN-UHFFFAOYSA-N 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010066119 arginyl-leucyl-aspartyl-serine Proteins 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- DZBUGLKDJFMEHC-UHFFFAOYSA-N benzoquinolinylidene Natural products C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 108010089894 bradykinin potentiating factors Proteins 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000034303 cell budding Effects 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 210000003850 cellular structure Anatomy 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000002967 competitive immunoassay Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 229940039227 diagnostic agent Drugs 0.000 description 1
- 239000000032 diagnostic agent Substances 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 238000013399 early diagnosis Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 201000002491 encephalomyelitis Diseases 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 238000012309 immunohistochemistry technique Methods 0.000 description 1
- 238000010324 immunological assay Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000009830 intercalation Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- SZVJSHCCFOBDDC-UHFFFAOYSA-N iron(II,III) oxide Inorganic materials O=[Fe]O[Fe]O[Fe]=O SZVJSHCCFOBDDC-UHFFFAOYSA-N 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- DVCSNHXRZUVYAM-BQBZGAKWSA-N leu-asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 238000007834 ligase chain reaction Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 208000010805 mumps infectious disease Diseases 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000036963 noncompetitive effect Effects 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 150000004713 phosphodiesters Chemical class 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 235000013594 poultry meat Nutrition 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 229940021993 prophylactic vaccine Drugs 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 239000003223 protective agent Substances 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 229940124551 recombinant vaccine Drugs 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 239000010959 steel Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- IIACRCGMVDHOTQ-UHFFFAOYSA-M sulfamate Chemical compound NS([O-])(=O)=O IIACRCGMVDHOTQ-UHFFFAOYSA-M 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 229940021747 therapeutic vaccine Drugs 0.000 description 1
- ANRHNWWPFJCPAZ-UHFFFAOYSA-M thionine Chemical compound [Cl-].C1=CC(N)=CC2=[S+]C3=CC(N)=CC=C3N=C21 ANRHNWWPFJCPAZ-UHFFFAOYSA-M 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 208000009421 viral pneumonia Diseases 0.000 description 1
- 210000000605 viral structure Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N15/00—Investigating characteristics of particles; Investigating permeability, pore-volume or surface-area of porous materials
- G01N15/10—Investigating individual particles
- G01N15/14—Optical investigation techniques, e.g. flow cytometry
- G01N15/1468—Optical investigation techniques, e.g. flow cytometry with spatial resolution of the texture or inner structure of the particle
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P11/00—Drugs for disorders of the respiratory system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56983—Viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20021—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/005—Assays involving biological materials from specific organisms or of a specific nature from viruses
- G01N2333/08—RNA viruses
- G01N2333/165—Coronaviridae, e.g. avian infectious bronchitis virus
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Virology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Hematology (AREA)
- Pathology (AREA)
- General Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Urology & Nephrology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Food Science & Technology (AREA)
- Pharmacology & Pharmacy (AREA)
- Cell Biology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Plant Pathology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Biophysics (AREA)
- Animal Behavior & Ethology (AREA)
- Dispersion Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Oncology (AREA)
- Communicable Diseases (AREA)
Abstract
본 발명은 바이러스학 분야에 관한 것이다. 본 발명은 코로나바이러스 및 그들의 성분의 군 내의 신규한 분리된 본질적으로 포유동물의 양성-센스 단일가닥 RNA 바이러스(EMCR-CoV)를 제공한다.
코로나바이러스, EMCR-CoV, 양성-센스 단일가닥 RNA 바이러스
Description
최근, SARS(중증 급성 호흡기 증후근) 또는 다른 알려진 바이러스 감염에 기인한다고 할 수 없는 호흡기 질병(비정형 폐렴)이 8개월된 환자에게서 진단되었다. 그 환자는 인플루엔자, 파라인플루엔자, 볼거리 및 RSV에 대해 음성으로 분석되었고 질환은 SARS와 매우 유사한 바이러스에 기인하는 것으로 동정되었다.
질병의 기원을 추적하고, 그 역학을 모니터하고 질병의 가능한 확산을 막기 위해, 초기 단계에서 폐렴의 바이러스성 원인을 인식할 수 있는 것이 매우 중요하다. 특히, 중증 질환이 바이러스에 기인한다는 것이 밝혀지면, 진단 수단 및 가능한 치료법의 개발을 위해, 가능한 한 빨리 바이러스의 신원을 검출할 필요가 있다. SARS 역학은 이 질병의 확산을 막기 위해 적시에 효과적인 격리조치 및 초기 검역예방을 취하기 위해 초기 진단을 얻는 것이 중요하다는 것을 나타낸다.
추가로, 질환을 일으키는 바이러스의 동정은 백신의 개발을 가능하게 하고, 이것은 감염될 수 있는 위험에 있는 사람들을 보호하기 위한 예방에 사용될 수 있다. 그리고, 마지막으로, 원인 바이러스에 대한 지식은 치료대책을 개발할 수 있게 한다.
그러므로, 일반적으로 바이러스성 폐렴 및 특별히는 신규한 질환-원인성 감 염성 약제에 대해, 특히 이 약제가 바이러스인 것으로 보일 때의 진단 수단과 치료법의 개발이 크게 요구된다.
본 발명은 분리된 본질적으로 코로나바이러스에 속하는 포유동물 양성-센스 단일가닥 RNA 바이러스의 뉴클레오티드 서열을 제공하고, 이것은 신규 질환에 대한 원인이 되는 인자이고, 이하에서 EMCR-CoV로 불린다. 바이러스(도 2a 및 2b)의 매트릭스 및 핵 캡시드 유전자 서열의 계통분석으로부터, 바이러스는 PEDV(돼지 유행성 설사 바이러스), HCoV-229E(인간 코로나바이러스 229E, PRCoV(돼지 호흡기 코로나바이러스), TGEV(전염성 위장염 바이러스), CaCoV(개 코로나바이러스) 및 FeCoV(고양이 코로나바이러스)에 의해 형성된 군의 독특한 맴버인 것으로 보인다. 아미노산 동정 매트릭스를 기초로, 인간 코로나바이러스 229E는 가장 근접한 관련물(PEDV에 약간 더 밀접한 것으로 보이는 매트릭스를 제외한 모든 ORFs의 경우-도 3 참조)인 것으로 보인다.
비록 계통분석이 바이러스의 동정에 편리한 방법을 제공하지만, 어느 정도 더 조악하지만 상기 바이러스 또는 바이러스 단백질 또는 상기 바이러스로부터의 핵산을 동정하는 몇몇 다른 가능한 직접적인 방법이 또한 제공된다. 경험적으로, EMCR-코로나바이러스는 본 명세서에서 서열에 의해 동정된 바이러스성 단백질 또는 핵산과 비교한, 동정되어질 바이러스, 단백질 또는 핵산의 상동률에 의해 동정될 수 있다. 일반적으로 바이러스종, 특히 RNA 바이러스 종은 종종 상기 바이러스의 클러스터가 그 맴버들 중에 이종성을 나타내는 유사 종을 구성한다는 것이 알려져 있다. 그러므로, 각각의 분리물은 여기서 제공된 분리물의 서열과 다소 다른 관련 백분률을 가질 수 있다는 것이 예상된다.
바이러스 분리물을 도 1에 기재된 바와 같은 서열과 비교하기 원하는 경우, 본 발명은, 상기 바이러스의 핵산서열을 결정하고 그리고 상기 핵산서열이 PEDV, 229E, PRCoV, TGEV, CaCoV 및 FeCoV와 비교하여 아래에서 확인되는 바와 같이 핵산에 대해 동정된 백분률 보다 높은, 기록된 서열에 대한 핵산 동일성 백분률을 갖는다는 것을 결정함에 의해, 코로나바이러스에 속하고 그것에 계통적으로 상응하는 것으로 동정가능한 분리된 본질적으로 포유동물의 양성-센스 단일가닥 RNA 바이러스(EMCR-CoV)를 제공한다. 한편, 상기 바이러스의 아미노산 서열을 결정하고 상기 아미노산 서열이 PEDV, 229E, PRCoV, TGEV, CaCoV 및 FeCoV와 비교하여 아래에서 제공된 백분률보다 높은, 서열에 대한 아미노산 동일성 백분률을 갖는다고 결정함에 의해, 코로나 바이러스에 속하고 그것에 계통적으로 상응하는 것으로 동정가능한, 분리된 본질적으로 포유동물의 양성-센스 단일가닥 RNA 바이러스(EMCR-CoV)가 제공된다.
이 EMCR-코로나바이러스(EMCR-CoV)의 서열정보를 공급함에 의해, 본 발명은 특히 포유동물, 더욱 특별히는 이 바이러스에 감염된 인간에게서 질병, 특히 호흡기 질병(비정형 폐렴)의 진단, 예방 및/또는 치료에 적용될 진단 수단 및 방법, 예방 수단 및 방법 및 치료수단 및 방법을 제공한다. 바이러스학에서, 특정 바이러스 감염의 진단, 예방 및/또는 치료는 상기 감염에 의한 상기 특정 바이러스에 가장 특이적인 시약으로 실시되는 것이 가장 바람직하다. 이 경우, 이것은 EMCR-CoV 바이러스의 상기 진단, 예방 및/또는 치료가 EMCR-CoV 바이러스에 대해 가장 특이적인 시약으로 실시되는 것이 바람직하다는 의미이다. 그러나, 이것은 덜 특이적인 것의 가능성을 배제하는 것은 아니고, 충분히 교차-반응성인 시약이, 예를 들면, 그들은 쉽게 구할 수 있고 비슷한 과제에 충분히 대처할 수 있기 때문에 대신 사용될 수 있다.
본 발명은 예를 들면, 상기 샘플과 본 발명에 따른 EMCR-CoV 특이 핵산 또는 항체의 반응에 의해, 상기 동물의 샘플에서의 바이러스 분리물 또는 그들 성분의 존재를 확인하는 것을 포함하는, 동물, 특히 포유동물, 더욱 특별히는 인간의 EMCR-CoV 감염을 바이러스학적으로 진단하는 방법, 및 상기 샘플과 EMCR-CoV 바이러스-특이 단백질 분해성 분자 또는 그것의 단편 또는 본 발명에 따른 항원의 반응에 의해, 상기 포유동물의 샘플에서 EMCR-CoV 바이러스 또는 그것의 성분에 대해 특이적으로 배향된 항체의 존재를 확인하는 것을 포함하는 포유동물의 EMCR-CoV 감염을 혈청학적으로 진단하는 방법을 제공한다.
본 발명은 또한, EMCR-CoV 바이러스, EMCR-CoV 바이러스-특이 핵산, 단백질분해성 분자 또는 그것의 단편, 본 발명에 따른 항원 및/또는 항체를 포함하는 EMCR-CoV 감염을 진단하기 위한 진단키트, 및 바람직하기는 상기 EMCR-CoV 바이러스, EMCR-CoV 바이러스-특이 핵산, 단백질 분해성 분자 또는 그것의 단편, 항원 및/또는 항체를 검출하기 위한 수단을 제공하고, 상기 수단은 예를 들면 본 분야(적절한 진단 키트 포멧의 예는 IF, ELISA, 중화분석, RT-PCR 분석)에서 사용되는 형광단 또는 효소 검출 시스템과 같은 여기성 기를 포함한다. 아직 동정되지 않은 바이러스 성분 또는 핵산, 단백질 분해성 분자 또는 그것의 단편과 같은 그것의 합성 동족체가 ECR-CoV-바이러스-특이적으로 동정될 수 있는 가를 결정하기 위해, 상기 성분의 핵산 또는 아미노산 서열을, 예를 들면, 여기에 제공된 계통분석을 사용하여 제공된 EMCR-CoV 바이러스 서열 및 공지의 비-EMCR-CoV 바이러스 서열(인간 코로나바이러스 299E가 사용되는 것이 바람직하다)과 비교한 서열 상동성에 의해 적어도 10, 바람직하기는 적어도 25, 더욱 바람직하기는 적어도 40 뉴클레오티드 또는 아미노산의 스트레치에 대해 분석하는 것으로 충분하다.
상기 EMCR-CoV 또는 비 EMCR-CoV 바이러스 서열과의 관계 정도에 따라, 성분 또는 합성 동족체가 동정될 수 있다.
그러므로, 본 발명은 신규한 병인성 약제, 코로나바이러스 과(family)에 속하는 분리된 필수적으로 포유동물 양성-센스 단일가닥 RNA 바이러스(이하 EMCR-CoV 바이러스라 명명), 및 EMCR-CoV 바이러스-특이 성분 또는 그들의 합성 동족체의 뉴클레오티드 서열을 제공한다.
코로나바이러스는 처음 1937년 닭으로부터 분리되었고, 처음 인간 코로나바이러스는 1965년 Tyrell과 Bonoe에 의해 인 비트로 증식되었다. 현재 이 과에는 약 13종이 있고, 이것은 소, 돼지, 설치류, 고양이, 개, 새 그리고 인간을 감염시킨다. 코로나바이러스 입자는 불규칙한 형태이고, 직경이 약 60~220nm이고, 외부 외피는 구별되는, '곤봉-형' 페플로머(peplomers; 길이 약 20nm 및 먼 말단에서의 폭 10nm)이다. '크라운-형' 외관은 과에 그 이름을 제공한다. 외피는 두개의 당단백질을 운반한다:S 세포 융합에 포함되고 주요 항원인 스파이크 당단백질 및 M, 막 당단백질, 이것은 출아와 외피 형성에 포함된다. 게놈은 염기성 N으로 명명된 인단백질에 결합되어 있다. 코로나바이러스의 게놈, 단일 가닥 양성-센스 RNA 가닥은 통상적으로 27~31Kb의 길이이고 5'메틸화 캡과 3' 폴리-A 테일을 함유하고, 그것에 의해 감염된 세포에서 mRNA로서 직접적으로 작용할 수 있다. 초기에, 5'ORF 1(약 20 Kb)는 바이러스성 폴리머라제를 생산하도록 번역되고, 이것은 이후 완전길이 음성 센스 가닥을 생산한다. 이것은 전사의 '네스티드 셋'과 같은 mRNA를 생산하기 위한 주형으로 사용되고, 72 뉴클레오티드의 동일한 5' 비-번역 리더서열 및 부합하는 3' 폴리아데닐화 말단을 갖는다. 이와 같이 생성된 각각의 mRNA는 단일시스트론성이고 5'말단에서 유전자는 가장 긴 mRNA 등으로부터 번역된다. 이들 독특한 세포질 구조물은 접합에 의해 생성되지 않고 전사 중의 폴리머라제에 의해 생산된다. 각 유전자 사이에 반복적인 유전자간 서열- AACUAAAC -이 있고, 이것은 각 ORF의 초기 위에 리더 서열을 접합하기 위한 전사효소 플러스 세포질 인자와 상호작용한다. 몇몇 코로나바이러스에서, 상기 단백질, 그러나 또한 적혈구응집소 에스테라제(HE) 및 여러 다른 비-구조적 단백질을 암호화하는 약 8개의 ORF가 있다.
새롭게 분리된 바이러스는 본 발명자들의 원형 EMCR-CoV 바이러스와 충분히 유사한 유전자 순서 및/또는 아미노산 서열 및/또는 핵산 서열을 포함할 때, 계통적으로 그리고 따라서 분류학상으로 EMCR-CoV에 상응하다. 지금까지 동일한 과의 다른 어느 바이러스와 EMCR-CoV 바이러스의 PRF 사이의 최고의 아미노산 서열 동일성은 인간 코로나바이러스 299E 또는 돼지 유행성 설사 바이러스이다(도 3 및 4 참조). 인간 코로나바이러스 229E와의 아미노산 동일성은 45%(핵단백질)에서 81%(RNA 합성효소 1b)의 범위이고; 흥미롭기는 RNA 합성효소 1a는 RNA 합성효소 1b의 81%와 반대로 단지 56%의 동일성을 갖는다. EMCR-CoV는 PEDV의 매트릭스 ORF에 약간 더 가까운 관계인 매트릭스를 제외하고, 모든 추정되는 ORF에서 지금까지 알려진 어느 다른 동일한 과의 바이러스보다 인간 코로나바이러스 229E와 더욱 밀접한 동일성을 갖는다. 각각 이들 언급된 최대값보다 더 높은 동질성을 갖는 개별적인 단백질 또는 전체 바이러스 분리물은 계통적으로 그리고 따라서 분류학상으로 EMCR-CoV 바이러스에 상응한다고 생각되며, 일반적으로 도 1에 나타낸 바와 같은 서열에 구조적으로 상응하는 핵산서열에 의해 암호화될 것이다. 이와 함께 본 발명은 도 1에 나타낸 서열의 분리된 바이러스에 계통적으로 상응하는 바이러스를 제공한다.
다른 바이러스들과 마찬가지로, 다른 근원에서 분리된 EMCR-CoV-바이러스들 간에 어느 정도의 변이가 발견될 수 있다는 것을 예상할 수 있음을 명심해야 한다.
또한, EMCR-CoV 바이러스의 바이러스 서열 또는 본 명세서에 제공된 분리된 EMCR-CoV 바이러스 유전자는 예를 들면, 유전자은행에서 발견되는 바와 같이(예를 들면, 수탁번호 af304460(HCoV-299E) 또는 af353511(PEDV)) 인간 코로나바이러스 299E 또는 돼지 유행성 설사 바이러스의 뉴클레오티드 또는 아미노산 서열과 비교하여, 예를 들면 95% 미만, 바람직하기는 90%미만, 더욱 바람직하기는 80% 미만, 더욱 바람직하기는 70% 미만 그리고 가장 바람직하기는 65% 미만의 뉴클레오티드 서열 동질성을 나타내거나 또는 95% 미만, 바람직하기는 90%미만, 더욱 바람직하기는 80% 미만, 더욱 바람직하기는 70% 미만 및 가장 바람직하기는 65% 미만의 아미노산 서열 동질성을 나타낸다.
대충 EMCR-CoV 균주의 서열 차이는 다른 코로나바이러스로 유추하여, 다소 높을 수 있다.
본 명세서에서 사용되는 용어 "뉴클레오티드 서열 동질성"은 두개의 (폴리)뉴클레오티드 사이의 동질성의 존재를 말하는 것이다. 폴리뉴클레오티드는, 두개의 서열에서 뉴클레오티드의 서열이 최대 일치를 위해 정렬될 때 동일하다면 "동질적인" 서열을 갖는다. 두개 이상의 폴리뉴클레오티드 간의 서열 비교는 일반적으로 서열 유사성의 국소적 영역을 동정하고 비교하기 위해 비교창에 걸쳐 두개의 서열의 일부를 비교함으로써 실행된다. 50, 60, 70, 80, 90, 95, 98, 99 또는 100% 서열 동질성과 같은, 폴리뉴클레오티드의 "서열 동질성 백분률"은 비교창에 걸쳐 두개의 최적으로 배열된 서열을 비교함으로써 결정되고, 여기서 비교창에서의 폴리뉴클레오티드 서열의 일부는 서열의 최적 배열을 위해 참조서열(추가 또는 결실을 포함하지 않는)과의 비교에 따라 추가 또는 결실(즉, 갭)을 포함할 수 있다. 백분률은:(a)매치된 위치의 수를 얻기 위해 두 서열에서 동일한 핵산 염기가 있는 위치의 수를 측정하고; (b) 매치된 위치의 수를 비교창에서의 위치의 총수로 나누고; (c) 결과에 100을 곱하여 서열 동질성의 백분률을 구하는 것으로 산출된다. 비교를 위한 서열의 최적 배열은 알려진 알고리즘의 컴퓨터화 처리에 의해 또는 검사에 의해 수행된다. 쉽게 이용가능한 서열 비교 및 복수개의 서열 배열 알고리즘은 각각 기본 국소배열 연구방법(Basic Local Alignment Search Tool)(Altschul, S.F. et al. 1990.J.Mol.Biol. 215:403; Altschul, S.F. et al.1997. Nucleic Acid Res. 25:3389-3402) 및 ClustalW 프로그램으로 둘 다 인터넷에서 이용가능하다. 다른 적절한 프로그램은 위스콘신 유전학 소프트웨어 패키지(Genetics Computer Group(GCG), Madison, WI, USA)의 GAP, BESTFIT 및 FASTA을 포함한다.
본 명세서에서 사용되는 바에 따르면, "실질적으로 상보적인"은 두개의 핵산서열이 서로에 대해 적어도 약 65%, 바람직하기는 약 70%, 더욱 바람직하기는 약 80%, 더욱 바람직하기는 90%, 가장 바람직하기는 약 98% 서열 상보성을 갖는 것을 의미한다. 이것은 프라이머와 프로브가 엄격한 조건하에 혼성화되기 위해 그들의 주형과 표적 핵산에 대해 각각 충분히 상보적이어야 한다는 의미이다. 그러므로, 본 명세서에 기재된 바와 같은 프라이머 서열은 주형 상의 결합 영역의 정확한 서열을 반영할 필요가 없고 변성 프라이머가 사용될 수 있다. 실질적으로 상보적인 프라이머 서열은 프라이머 결합 및 제2-가닥 합성을 갖도록 증폭 주형에 충분한 서열 상보성을 갖는 것이다.
용어 "혼성화"는 이중-가닥 핵산분자 또는 상보적 뉴클레오티드 사이의 수소결합에 의해 형성된 듀플렉스를 말한다. 용어 "교배하다" 또는 "어닐링"은 그 과정에 의해 핵산서열의 단일 가닥이 상보적 뉴클레오티드 사이의 수소결합을 통해 이중-나선 단편을 형성하는 과정을 말한다.
용어 "올리고뉴클레오티드"는 인 결합(예를 들면, 포스포디에스테르, 알킬 및 아릴-포스페이트, 포스포로티로에이트) 또는 비-인 결합(예를 들면, 펩티드, 술파메이트 및 기타)에 의해 연결된 뉴클레오티드 모노머들의 짧은 서열(보통 6~100 뉴클레오디드)을 언급한다. 올리고뉴클레오티드는 변형된 염기(예를 들면, 5-메틸 시토신) 및 변형된 당 기(예를 들면, 2'-O-메틸리보실, 2'-O-메톡시에틸 리보실, 2'-플루오로 리보실, 2'-아미노 리보실 등)를 갖는 변형된 뉴클레오티드를 포함할 수 있다. 올리고뉴클레오티드는 천연적으로 발생한 또는 합성의, 환형, 분지형 또는 선형의 및 임의로 안정한 2차 구조(예를 들면, 줄기-및-루프 그리고 루프-줄기-루프 구조)를 형성할 수 있는 도메인을 포함하는 단일-가닥 DNA 및 이중- 및 단일-가닥 RNA일 수 있다.
여기서 사용되는 용어 "프라이머"는, 핵산 가닥에 상보적인 프라이머 연장 생성물의 합성이 유도되는 환경에 놓이면, 즉 DNA 폴리머라제와 같은 폴리머반응을 위한 뉴클레오티드와 약제의 존재 및 적당한 온도 및 pH에서, DNA 폴리머가 접촉되도록 하고 그것에 의해 DNA 합성의 개시점으로 작용하는 증폭 표적에 어닐링할 수 있는 올리고뉴클레오티드를 말한다. (증폭) 프라이머는 증폭에서의 최대 효율을 위해 바람직하기는 단일 가닥이다. 바람직하기는, 프라이머는 올리고데옥시 리보뉴클레오티드이다. 프라이머는 중합반응을 위한 약제의 존재하에 연장 생성물의 합성을 준비하기에 충분히 길어야 한다. 프라이머의 정확한 길이는, 온도와 프라이머 근원을 포함하여, 여러 인자에 의존한다. 본 명세서에 사용되는 바와 같은 "이-방향성 프라이머의 쌍"은 PCR 증폭과 같은 DNA 증폭 분야에 일반적으로 사용되는 바와 같은, 하나는 정방향이고 다른 하나는 역방향인 프라이머를 말한다.
용어 "프로브"는 표적 핵산 서열 분석물 또는 그것의 cDNA 유도체에서 상보적 서열을 갖는 수소-결합된 듀플렉스를 인식하고 형성하는 단일-가닥 올리고뉴클레오티드 서열을 말한다.
용어 "스트린전트" 또는 "스트린전트 혼성화조건"은 예를 들면, 온도, 염농도, pH, 포름아미드 농도 등과 같은 혼성화의 안정성에 영향을 미치는 혼성화 조건을 말한다. 이들 조건은 경험적으로 프라이머 또는 프로브의 그것의 표적 핵산서열에 대한 특이결합을 최대화하고 비-특이 결합을 최소화하도록 최적화된다. 사용되는 바에 따라 용어는 프로브 또는 프라이머가 그것의 표적 서열에 다른 서열보다 검출가능하게(예를 들면, 적어도 기준보다 2배) 큰 정도로 혼성화되는 조건을 참조를 포함한다. 스트린전트 조건은 서열 의존이고 다른 환경에서 다를 것이다. 서열이 길수록 더 높은 온도에서 특이적으로 혼성화된다. 일반적으로, 스트린전트 조건은 정해진 이온강도 및 pH에서 특정 서열에 대한 열 용융점(Tm)보다 약 5℃ 낮도록 선택된다. Tm은 상보적 표적 서열의 50%가 바람직하기는 매치된 프로브 또는 프라이머에 혼성화되는 온도(정해진 이온강도 및 pH에서)이다.
통상적으로, 스트린전트 조건은 염 농도가 pH 7.0~8.3에서 Na+ 이온 약 1.0M 미만, 통상적으로 Na+ 이온 농도 (또는 다른 염) 약 0.01~1.0M이고 짧은 프로브 또는 프라이머(예를 들면 10~50 뉴클레오티드)의 경우 온도가 적어도 약 30℃이고 긴 프로브 또는 프라이머(예를 들면, 50 뉴클레오티드 초과)의 경우 온도가 적어도 약 60℃인 조건이다. 스트린전트 조건은 또한 포름아미드와 같은 탈안정화제의 첨가에 의해 달성될 수 있다. 예시적으로 낮은 스트린전트 조건 또는 "감소된 스트린전트 조건"은 37℃에서 30% 포름아미드, 1M NaCl, 1% SDS의 완충용액으로의 혼성화 및 40℃에서 2x SSC로의 세척을 포함한다. 예시적으로 높은 스트린전트 조건은 37℃에서 50% 포름아미드, 1M NaCl, 1% SDS에서의 혼성화 및 60℃에서 0.1x SSC로의 세척을 포함한다. 혼성화과정은 본 분야에 잘 알려져 있고 예를 들면, Ausubel et al, Current Protocols in Molecular Biology, John Wiley & Sons Inc., 1994에 기재되어 있다.
용어 "항체"는 항체의 항원 결합 형태에 대한 참조를 포함한다(예를 들면, Fab, F(ab)2). 용어 "항체"는 주로 면역글로불린 유전자 또는 면역글로불린 유전자들 또는 분석물에 특이적으로 결합하고 인식하는 그들의 단편에 의해 실질적으로 암호화되는 펩티드를 말한다. 그러나, 여러 다양한 항체 단편들이 불활성 항체의 소화의 의미로 정의되는 반면, 당업자는 이와 같은 단편들이 화학적으로 또는 재조합 DNA 방법론을 이용하여 합성될 수 있다는 것을 이해할 것이다. 그러므로, 여기서 사용되는 바에 따르면, 용어 항체는 또한 단일 사슬 Fv, 키메릭 항체(즉, 다양한 종의 일정 및 가변영역을 포함하는), 인간화된 항체(즉, 비-인간 근원으로부터 상보적 결정 영역(CDR)을 포함하는) 및 헤테로컨쥬게이트 항체(예를 들면, 이특이성 항체)와 같은 항체 단편을 포함한다.
간단히 말하면, 본 발명은 코로나바이러스에 속하고 계통적으로 거기에 상응한다는 것을, 상기 바이러스의 게놈의 적당한 단편의 핵산서열을 결정하고 계통트리 분석에서 그것을 시험하고 이것이 PEDV(돼지 유행성 설사 바이러스), HCoV-229E(인간 코로나바이러스 229E, PRCoV(돼지 호흡기 코로나바이러스), TGEV(전염성 위장염 바이러스), CaCoV(개 코로나바이러스) 및 FeCoV(고양이 코로나바이러스)의 바이러스 분리물에 상응하는 것보다 도 1에 나타낸 바와 같은 서열을 갖는 바이러스 분리물에 더욱 밀접히 계통적으로 상응한다는 것을 확인함에 의해 확인할 수 있는 분리된 본질적으로 포유동물의 양성 센스 단일가닥 RNA 바이러스(EMCR-CoV)를 제공하며, 여기서 최대 가능성 트리는 100 부트스트랩(bootstraps)과 3 점블(jumbles)을 이용하여 생성된다.
이와 같은 계통트리 분석에 알맞는 각각의 핵산 게놈 단편들은 예를 들면, 도 1에 나타낸 바와 같은 매트릭스 단백질 또는 핵 캡시드 단백질을 암호화하는 단편으로, 도2a 또는 도2b에 나타낸 계통트리 분석을 가져온다. 이와 같은 계통트리 분석에 유용한 다른 적절한 핵산 단편은 예를 들면, 리플리카제 1a 및 1b, 스파이크, orf 4a 및 4b, 그리고 E를 암호화하는 단편이다.
계통트리 분석에 유용한 적절한 개방형해독틀(ORF)은 바이러스 리플리카제를 암호화하는 ORF를 포함한다(ORF 1a). 도 1의 아미노산을 포함하는 서열을 갖는 리플리카제와, 분석된 리플리카제의 적어도 60%, 바람직하기는 적어도 70%, 더욱 바람직하기는 적어도 80%, 더욱 바람직하기는 적어도 90%, 가장 바람직하기는 적어도 95%의 전체 아미노산 동일성이 발견되면, 분석된 바이러스 분리물은 본 발명에 따른 EMCR-CoV 바이러스 분리물을 포함한다.
계통적 분석에 유용한 적절한 개방형해독틀(ORD)은 바이러스 리플리카제를 암호화하는 ORF를 포함한다(ORF 1b). 도 1의 아미노산을 포함하는 서열을 갖는 리플리카제와, 분석된 리플리카제의 적어도 82%, 바람직하기는 적어도 90%, 가장 바람직하기는 적어도 95%의 전체 아미노산 동일성이 발견되면, 분석된 바이러스 분리물은 본 발명에 따른 EMCR-CoV 바이러스 분리물을 포함한다.
계통적 분석에 유용한 또 다른 적절한 개방형해독틀(PRF)은 ORF 암호화 핵 캡시드 단백질을 포함한다. 도 1의 서열 F를 포함하는 서열에 의해 암호화된 핵 캡시드 단백질과, 분석된 핵 캡시드 단백질의 적어도 50%, 바람직하기는 적어도 60%, 더욱 바람직하기는 적어도 70%, 더욱 바람직하기는 적어도 80%, 더욱 바람직하기는 적어도 90%, 가장 바람직하기는 적어도 95%의 전체 아미노산 동일성이 발견되면, 분석된 바이러스 분리물은 본 발명에 따른 EMCR-CoV 바이러스 분리물을 포함한다.
계통 분석에 유용한 또 다른 적절한 개방형해독틀(ORF)은 매트릭스 단백질을 암호화하는 ORP를 포함한다. 도 1의 서열 F(의 일부)를 포함하는 서열에 의해 암호화된 매트릭스 단백질과, 분석된 매트릭스 단백질의 적어도 60%, 더욱 바람직하기는 적어도 70%, 더욱 바람직하기는 적어도 80%, 더욱 바람직하기는 적어도 90%, 가장 바람직하기는 적어도 95%의 전체 아미노산 동일성이 발견되면, 분석된 바이러스 분리물은 본 발명에 따른 EMCR-CoV 분리물을 포함한다.
계통 분석에 유용한 또 다른 적절한 개방형해독틀(ORF)은 스파이크 단백질 S를 암호화하는 ORF를 포함한다. 도 1에 나타낸 바와 같은 S 단백질의 E서열의 번역 2와 F 서열의 번역 1의 서열을 포함하는 서열에 의해 암호화된 분석된 S-단백질의 적어도 60%, 더욱 바람직하기는 적어도 70%, 더욱 바람직하기는 적어도 80%, 더욱 바람직하기는 적어도 90%, 가장 바람직하기는 적어도 95%의 전체 아미노산 동질성이 발견되면, 분석된 바이러스 분리물은 본 발명에 따른 EMCR-CoV 바이러스 분리물을 포함한다. EMCR-CoV 바이러스의 S ORF는(바이러스 복제물에 대해 암호화하는) PRF 1 주변에 위치하는 것으로 보이며, 이것은 S 단백질과 바이러스 폴리머라제 사이에 소위 2a유전자 및 HE-유전자를 갖는다.
본 발명은 다른 것 중에서 본 발명에 따른 바이러스로부터 얻을 수 있는 분리된 또는 재조합 핵산 또는 그들의 바이러스 특이 작용성 단편을 제공한다. 분리된 또는 재조합 핵산은 도 1에 주어진 바와 같은 서열 또는 스트린전트 조건하에서 이들과 혼성화될 수 있는 동족체의 서열을 포함한다. 특히, 본 발명은 EMCR-CoV 바이러스 핵산을 동정하기에 알맞는 프라이머 및/또는 프로브를 제공한다.
더우기, 본 발명은 본 발명에 따른 핵산을 포함하는 벡터를 제공한다. 무엇보다도 먼저, EMCR-CoV 바이러스의 게놈(의 일부)을 함유하는 플라즈미드 벡터와 같은 벡터, EMCR-CoV의 게놈(의 일부)을 함유하는 바이러스 벡터(예를 들면, 우두 바이러스, 레트로바이러스, 바쿨로바이러스, 그러나 이것으로 제한되는 것은 아님) 또는 다른 바이러스 또는 다른 병원체의 게놈(의 일부)을 함유하는 EMCR-CoV 바이러스가 제공된다.
또한, 본 발명은 본 발명에 따른 핵산 또는 벡터를 포함하는 숙주세포를 제공한다. EMCR-CoV 바이러스의 리플리카제 성분을 함유하는 플라즈미드 또는 바이러스 벡터는 관련 세포 타입(세균, 곤충세포, 진핵세포)에서 성분들의 발현을 위해 원핵세포에서 발생된다. EMCR-CoV 바이러스 게놈의 전장의 또는 부분적인 사본을 함유하는 플라즈마 또는 바이러스 벡터는 인비트로 또는 인비보 바이러스 핵산의 발현을 위해 원핵 세포에서 발생될 것이다. 후자의 벡터는 키메릭 바이러스 또는 키메릭 바이러스 단백질의 발생을 위한 다른 바이러스 서열을 함유할 수 있고, 복제 결함 바이러스의 발생을 위한 바이러스 게놈의 일부가 부족할 수 있고, 그리고 감독된 바이러스의 생성을 위한 돌연변이, 결실 또는 삽입물을 함유할 수 있다.
(야생형, 감독된, 복제-부족 또는 키메릭) EMCR-CoV 바이러스의 감염성 사본들이 상기의 최첨단 기술에 따른 폴리머라제 성분의 공동-발현 후 생성될 수 있다.
추가로, 진핵세포, 일시적으로 또는 안정하게 발현하는 하나 이상의 전장 또는 부분 EMCR-CoV 바이러스 단백질이 사용될 수 있다. 이와 같은 세포는 트란스펙션(단백질 또는 핵산 벡터), 감염(바이러스 벡터) 또는 형질도입(바이러스 벡터) 에 의해 제조될 수 있고 그리고 언급된 야생형, 감독된, 복제-부족 또는 키메릭 바이러스의 상보성에 유용할 것이다.
키메릭 바이러스는 2개 이상의 바이러스에 대해 보호하는 재조합 백신의 생산을 위해 특히 사용될 수 있다. 예를 들면, 인간 폐렴후 바이러스의 하나 이상의 단백질을 발현하는 EMCR-CoV 바이러스 벡터 또는 하나 이상의 EMCR-CoV 바이러스를 발현하는 인간 폐렴후 벡터는 두 바이러스 감염에 대한 이와 같은 벡터에 의해 백신화된 개개인을 보호할 것이다. 이와 같은 특이 키메릭 바이러스는 특히 본 발명에 유용한데, 이것이 예를 들면 인간 페렴후 바이러스의 공동-감염이 코로나바이러스 감염된 환자에게 흔히 일어나기 때문이다. 감독된 그리고 복제-부족 바이러스는 다른 바이러스에 대해 제안되어왔던 바와 같이 생백신으로 백신화 목적을 위해 사용될 수 있다.
바람직한 구현예에서, 본 발명은 단백질성 분자 또는 코로나바이러스-특이 바이러스 단백질 또는 본 발명에 따른 핵산으로 암호화된 그것의 기능적 단편을 제공한다. 유용한 단백질성 분자는 예를 들면, 본 발명에 따른 바이러스로부터 유래될 수 있는 유전자 또는 게놈의 단편의 어느 형태로부터 유래될 수 있다. 본 명세서에 제공된 바와 같이, 이와 같은 분자 또는 그들의 항원성 단편은 예를 들면, 진단방법 또는 키트 및 서브-유니트 백신 및 억제제 펩티드와 같은 약제학적 조성물에 유용하다. 특히 유용한 것은 바이러스 복제 단백질, 스파이크 단백질, 매트릭스 단백질, 핵 캡시드 또는 항원 또는 서브유니트 면역원성과 같은 그것의 항원성 단편이지만, 불활성화된 전체 바이러스가 또한 사용될 수 있다. 또한, 특히 유용한 것은 계통적 분석을 위해 동정된 재조합 핵산 단편에 의해 암호화된 단백질성 물질이고, 물론 바람직하기는 인비보(예를 들면 보호목적 또는 진단성 항제의 제공을 위해) 또는 인비트로(예를 들면, 파아지 디스플레이 기술 또는 합성 항체를 생성하는데 유용한 또 다른 기술)에서, EMCR-CoV 바이러스 특이 항체의 유도를 위한 계통적 분석에 유용한 ORF의 바람직한 범위 및 한계 내이다.
또한, 여기서 제공되는 것은, 천연 폴리클로날 또는 모노클로날인 항체 또는 본 발명에 따른 단백질성 분자 또는 EMCR-CoV 바이러스-특이 기능적 단편을 포함하는 항원과 특이적으로 반응하는 합성(예를 들면, (파아지)라이브러리-유래 결합분자)항체가 제공된다. 이와 같은 항체는 여기서 제공된 항체와 상기 바이러스 분리물 또는 그것의 단편의 반응을 포함하는 EMCR-CoV 바이러스와 같은 바이러스 분리물을 동정하는 방법에 유용하다. 이것은 정제된 또는 비-정제된 EMCR-CoV 바이러스 또는 ELISA, RIA, FACS 또는 항원 검출 분석의 유사한 포맷을 이용하는 그것의 일부(단백질, 펩티드)의 사용에 의해 달성될 수 있다. 선택적으로, 감염된 세포 또는 세포 배양물은 고전적인 면역형광법 또는 면역조직화학기술을 사용하여 바이러스 항체를 동정하는데 사용될 수 있다. 이와 관련하여 특히 유용한 것은 도 1에 기재된 서열의 하나 이상을 포함하는 뉴클레오티드 서열에 의해 암호화되는 EMCR-CoV 바이러스 단백질에 대해 발생하는 항체이다.
EMCR-CoV 바이러스로서 바이러스 분리물을 동정하기 위한 다른 방법은 상기 바이러스 분리물 또는 그것의 성분을 본 발명에 따른 바이러스 특이 핵산과 반응시키는 것을 포함한다.
이 방법으로, 본 발명은 독성학적으로 코로나바이러스과 내의 EMCR-CoV 바이러스 속에 속하는 것으로 동정가능한 양성-센스 단일 가닥 RNA 바이러스에 상응하는 포유동물 바이러스에 대하여 본 발명에 따른 방법으로 동정할 수 있는 바이러스 분리물을 제공한다.
본 방법은 포유동물의 EMCR-CoV 바이러스 감염을 바이러스학적으로 진단하는 방법에 유용하며, 상기 방법은 예를 들면 상기 포유동물의 샘플에서 바이러스 분리물 또는 그것의 분리물의 존재를 상기 샘플과 본 발명에 따른 핵산 또는 항체의 반응에 의해 결정하는 것을 포함한다.
본 발명의 방법은 폴리머라제 사슬반응(PCR; Mullis 1987, U.S. Pat.No. 4,683,195, 4,683,202 및 4,800,159)과 같은 어느 핵산 증폭방법에 의해 또는 리가제 사슬반응(LCR:Barany 1991, Proc. Natl. Acad.Sci.USA 88:189-193; EP Appl. No., 320,308), 자속성 서열복제(3SR; Guatlli et al., 1990, Proc.Natl. Acad.Sci.USA 87:1874-1878), 가닥 이동증폭(SDA; U.S.Pat Nos. 5,270,184, 및5,455,166), 전사증폭시스템(TAS; Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173-1177), Q-베타 리플리카제(Lizardi et al., 1988, Bio/Technology 6:1197), 회전순환증폭(RCQ; U.S.Pat No. 5,871,921), 핵산서열기재증폭(NASBA), 분열단편길이 다형성(U.S.Pat. No. 5,719,028), 핵산의 등온 및 키메릭 프라이머-개시 증폭(ICAN), 세분화-연장 증폭법(RAM;U.S.Pat Nos. 5,719,028 및 5,942,391) 또는 다른 적절한 핵산의 증폭법을 사용함에 의해 실시될 수 있다.
소수의 미스매치를 갖는 하나 이상의 증폭 프라이머로 핵산을 증폭하기 위해, 증폭반응은 감소된 스트린전트(38℃의 어닐링 온도를 사용한 PCR 증폭 또는 3.5mM MgCl2의 존재)조건하에서 실시될 수 있다. 당업자는 적절한 스트린전트 조건을 선택할 수 있을 것이다.
본 명세서의 프라이머는 증폭되어질 각 특정 서열의 다른 가닥상에 존재하는 그들의 표적 영역에 "실질적으로" 보체(즉, 적어도 65%, 더욱 바람직하기는 적어도 80% 정확하게 보체)가 되도록 선택된다. 이오시톨 잔기 또는 모호한 염기들을 함유하는 프라이머 서열 또는 심지어 표적서열과 비교했을 때 하나 이상의 미스매치를 함유하는 프라이머를 사용하는 것이 가능하다. 일반적으로 표적 DNA 또는 RNA 올리고뉴클레오티드 서열과 적어도 65%, 더욱 바람직하기는 적어도 80% 동질성을 나타내는 서열이 본 발명의 방법에 사용하기에 알맞다고 생각된다. 서열 미스매치는 낮은 스트린전트 혼성화 조건에 사용될 때도 중요하지는 않다.
증폭산물의 검출은 기본적으로 공지의 방법으로 수행될 수 있다. 검출 단면은 직접적으로 방사성 라벨, 항체, 발광염료, 형광염료 또는 효소시약으로 염색되거나 라벨화될 수 있다. 직접 DNA 균주는 예를 들면, 아크리딘 오렌지, 에티디움 모노아지드 또는 호에타스트(Hoechst) 염료와 같은 삽입 염료를 포함할 수 있다.
선택적으로, DNA 또는 RNA 단편은 라벨된 dNTP 염기의 합성 단편에의 병합에 의해 검출될 수 있다. 뉴클레오티드 염기와 결합될 수 있는 검출 라벨은 예를 들면, 플루오레세인, 시아닌 염료 또는 BrdUrd를 포함한다.
프로브-기재 검출시스템을 사용할 때, 본 발명에 사용하기에 적절한 검출방법은 예를 들면, 효소 면역조사(EIA) 포맷(Jacobs et al., 1997, J.Clin.Microbiol. 35, 791-795)을 포함한다. EIA 방법에 의해 검출을 시행하기 위해, 증폭반응에 사용되는 정 또는 역 프라이머 모두 표적 DNA-앰플리콘의 다음의 EIA 검출을 위한 예를 들면 스트렙타비딘 코팅된 미세역가 플레이트웰 상에 표적 PCR 앰플리콘의 면역화를 위한 비오틴기와 같은 포획기를 포함할 수 있다(이하 참조). 당업자는 EIA 포맷에서 표적 DNA PCR 앰플리콘의 고정화를 위해 다른 기들도 사용될 수 있다는 것을 이해할 것이다.
본 명세서에 기재된 바와 같이 표적 DNA의 검출에 유용한 프로브는 바람직하기는 DNA 증폭과정에 의해 증폭됨에 따라 DNA 서열 영역의 적어도 일부에만 결합한다. 당업자는 본 명세서에 설명된 바와 같이 불필요한 실험없이 표적 DNA의 뉴클레오티드 서열을 기재로 한 검출에 알맞는 프로브를 제조할 수 있다. 또한 상보적 뉴클레오티드 서열, DNA 또는 RNA 또는 표적 DNA의 화학적으로 합성된 동족체는, 이와 같은 상보적 가닥이 적용된 증폭반응에서 증폭되는 한, 본 발명의 방법에서 타입-특이 검출 프로브로서 사용하기에 알맞다.
여기서 사용하기 위한 알맞는 검출방법은 예를 들면 앰플리콘(amplicon)의 고정화 및 그것의 DNA 서열의 프로브, 예를 들면 서던 블롯팅을 포함한다. 다른 포맷들은 상기와 같은 EIA 포맷을 포함할 수 있다. 결합의 검출이 용이하도록, 특정 앰플리콘 검출 프로브는 형광단, 발색단, 효소 또는 방사선-라벨과 같은 라벨 모이어티를 포함할 수 있고, 따라서 증폭반응의 반응산물에 대한 프로브의 결합을 용이하게 모니터할 수 있다. 이와 같은 라벨은 당업자들에게 잘 알려져 있고, 예를 들면, 플루오레세인 이소티오시아네이트(FITC), β-갈락토시다제, 호스라디쉬 페옥시다제, 스트렙타비딘, 비오틴, 디그옥시게닌, 35S 또는 125I를 포함한다. 다른 예들은 당업자들에게 분명할 것이다.
검출은 또한 예를 들면 Van den Brule et al.(2002, J.Clin.Microbiol. 40, 779-787)에 의해 기재된 바와 같은 소위 역라인 블롯(RLB) 분석에 의해서도 실시될 수 있다. 이 목적을 위해, RLB 프로브는 바람직하기는 예를 들면 카르복실-코팅된 나일론 막 강의 이후의 고정화를 위해 5' 아미노기로 합성된다. RLB 포맷의 이점은 시스템의 용이성과 그 속도이며, 따라서 원료처리량 샘플 처리를 허용하는 것이다.
RNA 또는 DNA 단편의 검출을 위한 핵산 프로브의 사용은 본 분야에 잘 알려져 있다. 최근 이들 방법은 표적 핵산의 프로브와의 혼성화 및 후-혼성화 세척을 포함한다. 특이성은 통상적으로 후-혼성화 세척, 이온강도인 임계 인자 및 최종 세척용액의 온도의 함수이다. 핵산 혼성화의 경우, Tm은 Meinkoth 및 Wahl, Anal. Biochem., 138:267-284(1984)의 식으로부터 어림잡혀지고: Tm=81.5℃+16.6(log M)+0.41(%GC)-0.61(% form)-500/L; 여기서 M은 일가 양이온의 모이어티이고, %GC는 핵산 중 구아노신과 뉴클레오티드의 백분률이고 및 % form은 혼성용액 중 포름아미드의 백분률이고, 그리고 L은 염기쌍 중 혼성물의 길이이다. Tm은 각 1%의 미스매칭에 대해 약 1℃까지 감소된다; 그러므로, 혼성화 및/또는 세척 조건은 원하는 동일성의 서열에 대해 혼성화되도록 조절될 수 있다. 예를 들면, >90% 동일성을 갖는서열이 약 5℃가 되도록 선택된다면, Tm은 10℃ 감소될 수 있다. 일반적으로, 스트린전트 조건은 정해진 이온강도 및 pH에서 특정 서열과 그것의 보체에 대한 열적 녹는점(Tm)보다 약 5℃ 낮도록 선택된다. 그러나, 심각한 스트린전트 조건은 녹는점(Tm) 보다 1,2,3, 또는 4℃ 낮은 온도에서 혼성화 및/또는 세척을 이용할 수 있고; 중간의 스트린전트 조건은 녹는점(Tm)보다 약 6,7,8,9, 또는 10℃ 낮은 온도에서 혼성화 또는 세척을 이용할 수 있고; 낮은 스트린전트 조건은 녹는점(Tm)보다 약 11, 12, 13, 14, 15 또는 20℃ 낮은 온도에서 혼성화 또는 세척을 이용할 수 있다. 식, 혼성화 및 세척 조성물, 및 원하는 Tm을 이용하여, 당업자는 혼성화 및/또는 세척 조건의 스트린전트가 변화는 본질적으로 기재된다는 것을 이해할 것이다. 미스매칭의 원하는 정도가 45℃(수용액) 또는 32℃(포름아미드 용액)보다 낮은 Tm을 가져온다면, 더 높은 온도가 사용될 수 있도록 SSC 농도를 증가시키는 것이 바람직하다. 핵산의 혼성화에 대한 광범위한 가이드가 Tijssen, Laboratory Techniques in Biochemistm and Molecular Biology-Hybridization with Nucleic Acid Probes, Part I, chapter 2" Overview of principles of hybridization and the strategy of nucleaic acid probe assays" Elsevier. New York(1993); 및 Current Protocols in Milecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995)에서 발견된다.
다른 면에서, 본 발명은 표적 RNA 또는 DNA의 일반적인 검출을 위한 올리고뉴클레오티드 프로브를 제공한다. 여기서 검출 프로브는 본 발명의 증폭 반응에 의해 생성된 이중가닥 핵산 중 한 가닥에 "실질적으로" 상보적이도록 선택된다. 바람직하기는 프로브는 예를 들면 비오틴 라벨된 표적 RNA 또는 DNA로부터 생성된 앰플리콘의 안티센스 가닥의 고정화를 위한 실질적인 보체이다.
그들의 표적 서열에 대한 하나 이상의 미스매치를 함유하기 위한 본 발명의 프로브를 검출하는 것이 허용된다. 일반적으로, 표적 올리고뉴클레오티드 서열과 적어도 65%, 더욱 바람직하기는 적어도 80%의 동일성을 나타내는 서열이 본 발명의 방법에 사용하기에 알맞다고 간주된다.
항체, 모노클로날 및 폴리클로날 모두는 본 발명의 검출목적을 위해, 예를 들면, 액체상 중에 사용되거나 고체상 캐리어 상에 결합할 수 있는 면역조사에서 사용될 수 있다. 또한, 면역조사에서 모노클로날 항체는 여러 방법으로 검출가능하게 라벨될 수 있다. 면역조사 포맷의 다양성은 특정 단백질(또는 다른 분석체)와 특이적으로 반응하는 항체를 고르기 위해 사용될 수 있다. 예를 들면, 고체-상 ELISA 면역조사는 단백질과 특이적으로 면역반응하는 모노클로날 항체를 선택하는데 일상적으로 사용된다. 선택적 결합을 결정하는데 사용될 수 있는 면역조사 포맷과 조건의 기술에 관하여 Harlow and Lane, Antibodies, A Laboratiry Manual, Cold Spring Harbor Publications, New York(1988)을 참조하라. 본 발명의 항체를 이용할 수 있는 면역조사 타입의 예는 직접 또는 간접 포맷에서 경쟁 및 비-경쟁 면역조사이다. 이와 같은 면역조사의 예는 방사성면역조사(RIA) 및 샌드위치(면역매트릭)분석이다. 본 발명의 항체를 사용하는 항원의 검출은 생리적 샘플 상의 면역세포화학 분석을 포함하여, 정, 역 또는 동시 모드로 작동하는 면역조사를 이용하여 실시될 수 있다. 당업자들은 불필요한 실험없이 다른 면역조사를 알거나 또는 쉽게 인식할 수 있을 것이다.
항체는 여러 다양한 캐리어에 결합될 수 있고 표적 분자의 존재를 검출하는데 사용될 수 있다. 잘 알려진 캐리어의 예는 유리, 폴리스티렌, 폴리프로필렌, 폴리에틸렌, 덱스트린, 나일론, 아밀라제, 천연 및 변형 셀룰로스, 폴리아크릴아미드, 아가로스 및 마그네타이트를 포함한다. 캐리어의 특성은 본 발명의 목적을 위해 가용성 또는 불용성일 수 있다. 당업자는 모노클로날 항체에 결합하기 위한 다른 알맞는 캐리어를 알거나 또는 반복적인 실험을 사용하여 인식할 수 있을 것이다.
본 발명은 또한, 샘플과 단백질성 분자 또는 그것의 단편 또는 본 발명의 항원과의 반응에 의해 상기 포유동물의 샘플에서 EMCR-CoV 바이러스 또는 그것의 성분에 대해 특이적으로 지향된 항체의 존재를 검출하는 것을 포함하는 포유동물의 EMCR-CoV 바이러스 감염을 혈청학적으로 진단하는 방법을 제공한다.
여기에 제공된 방법 및 수단은 특히 바이러스학적 또는 혈청학적 진단에 의해, EMCR-CoV 바이러스 감염을 진단하기 위한 진단키트에 유용하다. 이와 같은 키트 또는 조사는 예를 들면, 본 발명에 따른 바이러스, 핵산, 단백질성 분자 또는 그것의 단편, 항원 및/또는 항체를 포함할 수 있다.
예를 들면, 특히 인간에게서, EMCR-CoV 바이러스 감염의 치료 또는 예방 및/또는 비정형 폐렴의 치료 또는 예방을 위한 약제학적 조성물의 생산을 위한 본 발명에 따른 바이러스, 핵산, 단백질성 분자 또는 그것의 단편, 항원 및/또는 항체의 용도가 또한 제공된다. 바람직하기는 도1의 관련 번역에 나타낸 바와 같이 스파이크 단백질의 아미노산 서열의 일부를 함유하는 펩티드가 치료 또는 예방 펩티드의 제조를 위해 사용된다. 또한, 바람직하기는 도1의 관련 번역에 기재된 바와 같이 스파이크 단백질의 아미노산 서열을 포함하는 단백질이 서브-유니트 백신의 제조에 사용된다. 더우기, 도1의 번역에 나타낸 바와 같이 코로나바이러스의 핵 캡시드는 코로나바이러스에 대한 세포-중재 면역을 이끌어내는데 특히 유용하며 서브-유니트 백신의 제조에 사용될 수 있다.
바이러스의 감독은 이 목적을 위해 개발된 확립된 방법에 의해 달성될 수 있고, 다른 종의 관련 바이러스, 실험실 동물 및/또는 배양물의 조직/세포를 통한 일련의 계대접종, 37℃ 이하의 온도(냉각-적응)에서 세포 배양물을 통한 일련의 계대접종, 분자 클론의 부위지향 돌연변이생성 및 관련 바이러스간의 유전자 또는 유전자 단편의 교환의 사용이 포함되지만 이것으로 제한되는 것은 아니다.
본 발명에 따른 바이러스, 핵산, 단백질성 분자 또는 그것의 단편, 항원 및/또는 항체를 포함하는 약제학적 조성물은 예를 들면, 본 발명에 따른 약제학적 조성물을 개인에게 제공하는 것을 포함하는 EMCR-CoV 바이러스 감염 및/또는 호흡기 질병의 치료 및 예방을 위한 방법에 사용될 수 있다. 이것은 상기 개인이 인간일 때 가장 유용하다. EMCR-CoV 바이러스 단백질에 대한, 특히 EMCR-CoV 바이러스의 스파이크 단백질에 대한, 바람직하기는 도 1의 번역에 나타낸 바와 같은 아미노산 서열에 대한 항체들은 수동 백신과 같이, 예방 또는 치료 목적을 위해서도 역시 유용하다. 다른 코로나바이러스로부터, 스파이크 단백질이 매우 강한 항원이고 스파이크 항원에 대한 항체가 예방 및 치료 백신에 사용될 수 있다는 것이 알려져 있다.
본 발명은 또한 본 발명에 따른 바이러스를 포함하는 세포 배양물 또는 실험 동물을 확립하고, 상기 배양물 또는 동물을 후보 항바이러스 약제로 치료하고, 그리고 상기 바이러스 또는 상기 배양물 또는 동물의 감염에 대한 상기 약제의 효과를 결정하는 것을 포함하는 비정형 폐렴의 치료에 유용한 방법을 제공한다. 이와 같은 항바이러스 약제의 예는 EMCR-CoV 바이러스-중화 항체, 또는 그것의 기능적 단편을 포함하지만, 다른 특성의 항바이러스 약제가 또한 얻어진다.
본 발명은 또한 약제학적 조성물의 제조를 위한, 특히 특별히는 EMCR-CoV 바이러스 감염에 의해 발생된 비정형 폐렴의 치료를 위한 약제학적 조성물의 제조를 위한 본 발명에 따른 항바이러스 약제의 용도를 제공하고, 그리고 EMCR-CoV 바이러스 감염 또는 비정형 페렴의 치료 및 예방을 위한 방법에 유용한, 본 발명에 따른 항바이러스 약제를 포함하는 약제학적 조성물을 제공하고, 상기 방법은 개개인에게 이와 같은 약제학적 조성물을 제공하는 것을 포함한다.
본 발명은 또한 예방 및/또는 치료적 방법 및/또는 제제의 시험에 유용한 동물 모델을 포함한다. 이것은 유인원이 EMCR-CoV 바이러스에 감염될 수 있고, 그것에 의해 임상적 증상, 더욱 중요하기는 EMCR-CoV 바이러스에 의한 비정형 폐렴을 앓는 인간에게 발견되는 것과 유사한 조직 형태학을 나타낼 수 있다고 가정한다. 바이러스로 감염되기 전 또는 감염 중 예방적 또는 치료적 처리를 받는 유인원은 인간 환자에서의 예방 또는 치료와 같은 적용에 양호하고 유용한 예상적 가치를 가질 것이다.
본 발명은 추가로 실시예에서 설명되지만, 이것으로 제한되는 것은 아니다.
도 1은 EMCR-CoV 바이러스의 일부의 뉴클레오티드 서열이다. 또한 폴리펩티드의 추정 아미노산 서열을 포함한다.
도 2는 분리된 EMCR-CoV의 뉴클레오티드 서열과 그것과 유전적으로 가장 밀접한 관련물과의 계통적 관계이다. 계통트리는 100 부트스트랩과 3 점블을 이용한 최대 가능성 분석에 의해 발생되었다. 핵산 교환의 수를 나타내는 단위를 각 계통트리에 나타내었다. 도 1a는 매트릭스 유전자 뉴클레오티드 서열의 최대 가능수(tree)이다. 트리에서의 숫자는 부트스트랩 값을 나타낸다. 스케일 바는 대략적으로 관련 서열들 간의 10% 뉴클레오티드 차이를 반영한다. 도 1b는 핵 캡시드 유전자 뉴클레오티드 서열의 최대 가능수이다. 트리에서의 숫자는 부트스트랩 값을 나타낸다. 스케일 바는 대략적으로 관련 서열들 간의 10% 뉴클레오티드 차이를 반영한다.
도 3은 추정된 리플리카제 1a 리플리카제 1b, 리플리카제 1ab, 스파이크, Orf E, 매트릭스 및 핵 캡시드 단백질(각각 3a-g)에 대한 그리고 EMCR-CoV 바이러스와 밀접히 관련된 코로나바이러스 사이의 추정 매트릭스 단백질과 뉴클레오단백질(각각 3h 및 3i)에 대한 아미노산 동정을 나타내는 유사성 매트릭스이다. 약자에 대해서는 내용을 참조.
도 4는 여러 코로나바이러스의 배열을 나타낸다. 5' 비번역 영역 게놈성 서열(a); 추정 orf 1a 아미노산 서열(b); 추정 orf 1b 아미노산 서열(c); 추정 orf 1ab 아미노산 서열(d); 추정 스파이크 아미노산 서열(e); 추정 orf 4a 아미노산 서열(f); 추정 orf 4ab 아미노산 서열(g); 추정 orf E 아미노산 서열(h); 추정 매트릭스 아미노산 서열(i); 추정 핵단백질 아미노산 서열(j); 추정 3' 비번역 게놈성 서열(k); 약자에 대해서는 내용을 참조.
실시예
시편 수집
코 솜을 사용하여 폐렴을 앓는 8월령 환자로부터 바이러스를 수집하였다.
바이러스 분리 및 배양
목 솜을 tMK 세포의 배양물에 깊이 담그고 4시간 동안 계대배양하였다. 바이러스는 그리고 나서 Vero-119 세포에 있었다. 세포 배양 상등물을 함유하는 바이러스 1역가를 수확하고, 바이러스를 초원심분리기에서 펠렛화하고 바이러스 펠렛을 PBS 1㎖중에 재현탁하였다.
RNA 분리
RNA를 감염된 세포 배양물의 상등액 또는 슈크로스 기울기 분획으로부터 제조자의 지침에 따라 고순도 RNA 분리 키트(Roche Diagnostics, Almere, the Netherlands)를 사용하여 분리하였다.
서열화
정제된 RNA를 서열화를 위해 BaseClear Hoding BV(Leiden, The netherlands)로 보냈다.
계통적 분석
BioEdit 버전 5.0.9.에서 작동하는 뉴클레오티드 서열을 Clustal W를 사용하여 배열하였다. 최대 가능수를 100 부트스트랩과 3 점블을 이용한 Phylip 5.6 Seqboot 와 DNA-Ml 패키지를 사용하여 생성하였다. 컨센서스 트리를 Phylip 5.6의 컨센서스 패키지를 사용하여 계산하였다. 이들 컨센서스 트리를 원 서열로부터 가지 길이를 재산출하기 위한 DAN-ML에서 사용자 트리로 사용하였다.
EMCR-CoV의 서열을 4개의 코로나바이러스 군에서 각 종들을 나타내는 참조 바이러스의 서열과 비교하였다. 이들은: 인간 코로나바이러스 229E(229E), af304460; 돼지 유행성 설사 바이러스(PEDV) af353511; 전염성 개스트로엔테리티스 바이러스(TGEV), aj271965; 송아지 코로나바이러스(BoCoV), af22029; 뮤린 간염 바이러스(MHV), af201929; 새 감염성 기관지염 바이러스(AIBV), m95169, 개 코로나바이러스(CaCoV), d13096; 고양이 코로나바이러스(FeCoV) ay204704; 돼지 호흡기 코로나바이러스(PRCoV), z24675; 인간 코로나바이러스 OC43(OC43), m76373, l14643, m933990; 돼지 적혈구응집 뇌척수염 바이러스(HEV), ay078417; 래트 코로나바이러스(RtCoV) af 207551). 바이러스들에 대한 참조는 NCBI 카탈로그의 넘버이다(http://www.ncbi.nlm.nih.gov/entrez/).
일반적으로, EMCR-CoV와 같은 코로나바이러스는 다음 프로토콜에 따라 분리 되고 동정될 수 있다.
시편 수집
바이러스 분리물을 찾기 위해 인간, 육식동물(개, 고양이, 족제비, 바다표범 등), 말, 반추동물(소, 양, 염소 등), 돼지, 토끼, 새(가금류, 타조 등)과 같은 포유동물로부터 비인두 흡입물, 목과 코의 솜, 기관지 폐포 세척, 혈청 및 플라즈마 샘플, 및 대변을 조사해야만 한다. 새 배설강으로부터 약솜과 배설물도 조사될 수 있다. 혈청은 ELISA과 같은 면역학적 분석, RT-PCR과 같은 분자-기초 분석 및 중화분석으로 수집되어야 한다.
수집된 바이러스 시편들은 5㎖ 둘베코 MEM 배지(BioWhittaker, Walkersville, MD)로 희석되고 1분 동안 와동 혼합기에서 완전히 혼합될 수 있다. 현택물은 10분 동안 840xg에서 원심분리된다. 침전물을 면역형광기술을 위해 멀티스팟 슬라이드 상에 펴바르고 상등액은 바이러스 분리를 위한 것이다.
바이러스 분리
바이러스 분리를 위해 Vero-118 세포 또는 tMK 세포(RIVM, Bilthoven, The Netherlands)를, 태아 소 혈청 10%로 보충된 하기의 배지(BioWhittaker, Vervier, Belgium)를 갖는 유리 슬라이드를 함유하는 24 웰 플레이트(Costar, Cambridge, UK)에서 배양하였다. 인큐베이션 전에 플레이트를 PBS로 세척하고 NaHCO3 0.52/ℓg, 0.025M Hepes(Biowhittaker), 2mM L-글루타민(BioWhittlker), 페니실린 200 유니트/ℓ, Hepes 0.025 M(Biowhittaker), 락트알부민 1g/ℓ(Sigma-Aldrich, Zwijndrecht, The Netherlands), D-글루코스 2.0 g/ℓ(Merck, Amsterdam, The Netherlands), 펩톤 10g/ℓ(Oxoid, Haarlem, The Netherlands) 및 트립신 0.02%(Life Tecnologies, Bethesda, MD)이 보충된 행크스염(ICN, Costa mesa, CA)를 갖는 이글스 MEM을 공급하였다. 플레이트를 환자 샘플의 상등액으로 웰당 0.2㎖ 로 삼중으로 접종하고, 이어서 1시간 동안 840x g에서 원심분리하였다. 인큐베이션 후, 플레이트를 37℃에서 1~7일 동안 배양하고, 배양물을 CPE에 대해 매일 확인하였다. 연장된 CPE는 일반적으로 5~10 내로 관찰되었고 단층의 세포 분리를 포함하였다.
바이러스 배양
상기와 같이 매질 중의 tMK 세포 또는 베로 클론 118 세포의 서브-합류 단일층을 CPE를 나타내는 샘플의 상등액 또는 환자로부터 취한 샘플로 배양하였다.
RNA 분리
RNA를 감염된 세포 배양물의 상등액 또는 슈크로스 기울기 분획으로부터 제조자의 지침에 따라 고순도 RNA 분리 키트(Roche Diagnostics, Almere, the Netherlands)를 사용하여 분리하였다. RNA는 또한 분야에 잘 알려진 다른 공정에 의해 분리될 수 있다(Current Protocols in Molecular Biology).
서열 분석
서열 분석을 BaseClear Hoding BV(Leiden, The Netherlands)에 의해 실시하였다.
SEQUENCE LISTING
<110> ViroNovative B.V.
<120> Novel atypical pneumonia-causing virus
<130> P67119KR00
<150> EP 03078613.1
<151> 2003-11-18
<150> EP 04808721.7
<151> 2004-11-18
<150> PCT/NL2004/000805
<151> 2004-11-18
<160> 97
<170> PatentIn version 3.3
<210> 1
<211> 27530
<212> DNA
<213> EMCR Coronavirus
<220>
<221> CDS
<222> (265)..(12432)
<223> Replicase 1a
<220>
<221> CDS
<222> (20435)..(24502)
<223> Spike protein
<220>
<221> CDS
<222> (25163)..(25396)
<223> E protein
<220>
<221> CDS
<222> (25405)..(26085)
<223> M protein
<220>
<221> CDS
<222> (26096)..(27229)
<223> E protein
<400> 1
agatagagaa ttttcttatt tagactttgt gtctactcct ctcaactaaa cgaaattttt 60
ctagtgctgt catttgttat ggcagtccta gtgtaattga aatttcgtca agtttgtaaa 120
ctggttaggc aagtgttgta ttttctgtgt ttaagcactg gtggttctgt ccactagtgc 180
acacattgat acttaagtgg tgttctgtca ctgcttattg tggaagcaac gttctgtcgt 240
tgtggaaacc aataactgct aacc atg ttt tac aat caa gtg aca ctt gct 291
Met Phe Tyr Asn Gln Val Thr Leu Ala
1 5
gtt gca agt gat tcg gaa att tca ggt ttt ggt ttt gcc att cct tct 339
Val Ala Ser Asp Ser Glu Ile Ser Gly Phe Gly Phe Ala Ile Pro Ser
10 15 20 25
gta gcc gtt cgc gct tat agc gaa gcc gct gca caa ggt ttt cag gca 387
Val Ala Val Arg Ala Tyr Ser Glu Ala Ala Ala Gln Gly Phe Gln Ala
30 35 40
tgc cgc ttt gtt gct ttt ggc tta cag gat tgt gta acc ggt att aat 435
Cys Arg Phe Val Ala Phe Gly Leu Gln Asp Cys Val Thr Gly Ile Asn
45 50 55
gat gac gat tat gtc att gca ttg act ggt act aat cag ctt tgt gcc 483
Asp Asp Asp Tyr Val Ile Ala Leu Thr Gly Thr Asn Gln Leu Cys Ala
60 65 70
aaa att tta ctt ttt tct gat aga cct ctt aat ttg cga ggt tgg ctc 531
Lys Ile Leu Leu Phe Ser Asp Arg Pro Leu Asn Leu Arg Gly Trp Leu
75 80 85
att ttt tct aac agc aat tat gtt ctt cag gac ttt gat gtt gtt ttt 579
Ile Phe Ser Asn Ser Asn Tyr Val Leu Gln Asp Phe Asp Val Val Phe
90 95 100 105
ggc cat ggt gca gga agt gtg gtt ttt gtg gat aag tat atg tgt ggt 627
Gly His Gly Ala Gly Ser Val Val Phe Val Asp Lys Tyr Met Cys Gly
110 115 120
ttt gat ggt aaa cct gtg tta cct aaa aac atg tgg gaa ttt aga gat 675
Phe Asp Gly Lys Pro Val Leu Pro Lys Asn Met Trp Glu Phe Arg Asp
125 130 135
tac ttt aat gat aat act gat agt att gtt att ggt ggt gtc act tat 723
Tyr Phe Asn Asp Asn Thr Asp Ser Ile Val Ile Gly Gly Val Thr Tyr
140 145 150
caa tta gca tgg gat gtt ata cgt aaa gac ctt tct tat gaa cag caa 771
Gln Leu Ala Trp Asp Val Ile Arg Lys Asp Leu Ser Tyr Glu Gln Gln
155 160 165
aat gtt tta gct att gag agc att cat tat ctt ggc act aca ggt cat 819
Asn Val Leu Ala Ile Glu Ser Ile His Tyr Leu Gly Thr Thr Gly His
170 175 180 185
act ttg aag tct ggt tgc aaa ctc att aat gcc aag ccg cct aaa tat 867
Thr Leu Lys Ser Gly Cys Lys Leu Ile Asn Ala Lys Pro Pro Lys Tyr
190 195 200
tct tct aag gtt gtt ttg agt ggt gaa tgg aat gct gtg tat aag gcg 915
Ser Ser Lys Val Val Leu Ser Gly Glu Trp Asn Ala Val Tyr Lys Ala
205 210 215
ttt ggt tca cca ttt att aca aat ggt ata tca ttg cta gat ata att 963
Phe Gly Ser Pro Phe Ile Thr Asn Gly Ile Ser Leu Leu Asp Ile Ile
220 225 230
gtt aaa cca gtt ttc ttt aat gct ttt gtt aaa tgc aat tgt ggt tct 1011
Val Lys Pro Val Phe Phe Asn Ala Phe Val Lys Cys Asn Cys Gly Ser
235 240 245
gag aat tgg agt gtt ggt gca tgg gat ggt tat cta tct tct tgt tgt 1059
Glu Asn Trp Ser Val Gly Ala Trp Asp Gly Tyr Leu Ser Ser Cys Cys
250 255 260 265
ggc aca cct gct aag aaa ctt tgt gtt gtt cct ggt aat gtt gtt cct 1107
Gly Thr Pro Ala Lys Lys Leu Cys Val Val Pro Gly Asn Val Val Pro
270 275 280
ggt gat gtg atc atc acc tca act gat gct ggt tgt ggt gtt aaa tac 1155
Gly Asp Val Ile Ile Thr Ser Thr Asp Ala Gly Cys Gly Val Lys Tyr
285 290 295
tat gct ggc tta gtt gtt aaa cat att act aac att act ggt gtg tct 1203
Tyr Ala Gly Leu Val Val Lys His Ile Thr Asn Ile Thr Gly Val Ser
300 305 310
tta tgg cgt gtt aca gct gtt cat tct gat gga atg ttt gtg gca aca 1251
Leu Trp Arg Val Thr Ala Val His Ser Asp Gly Met Phe Val Ala Thr
315 320 325
tct tct tat gat gca ctt ttg cat aga aat tca tta gac cct ttt tgc 1299
Ser Ser Tyr Asp Ala Leu Leu His Arg Asn Ser Leu Asp Pro Phe Cys
330 335 340 345
ttt gat gtt aac act tta ctt tct aat caa tta cgt cta gct ttt ctt 1347
Phe Asp Val Asn Thr Leu Leu Ser Asn Gln Leu Arg Leu Ala Phe Leu
350 355 360
ggt gct tct gtt aca gaa gat gtt aaa ttt gct gct agc act ggt gtt 1395
Gly Ala Ser Val Thr Glu Asp Val Lys Phe Ala Ala Ser Thr Gly Val
365 370 375
att gac att agt gct ggt atg ttt ggt ctt tac gat gac ata ttg aca 1443
Ile Asp Ile Ser Ala Gly Met Phe Gly Leu Tyr Asp Asp Ile Leu Thr
380 385 390
aac aat aaa cct tgg ttt gta cgc aaa gct tct ggg ctt ttt gat gca 1491
Asn Asn Lys Pro Trp Phe Val Arg Lys Ala Ser Gly Leu Phe Asp Ala
395 400 405
atc tgg gat gct ttt gtt gcc gct att aag ctt gtg cca act act act 1539
Ile Trp Asp Ala Phe Val Ala Ala Ile Lys Leu Val Pro Thr Thr Thr
410 415 420 425
ggt ggt ttg gtt agg ttt gtt aag tct atc gct tca act gtt tta act 1587
Gly Gly Leu Val Arg Phe Val Lys Ser Ile Ala Ser Thr Val Leu Thr
430 435 440
gtt tct aat ggt gtt att att atg tgt gca gat gtt cca gat gct ttt 1635
Val Ser Asn Gly Val Ile Ile Met Cys Ala Asp Val Pro Asp Ala Phe
445 450 455
caa cca gtt tac cgc aca ttt aca caa gct att tgt gct gca ttt gat 1683
Gln Pro Val Tyr Arg Thr Phe Thr Gln Ala Ile Cys Ala Ala Phe Asp
460 465 470
ttt tct tta gat gta ttt aaa att ggt gat gtt aaa ttt aaa cga ctt 1731
Phe Ser Leu Asp Val Phe Lys Ile Gly Asp Val Lys Phe Lys Arg Leu
475 480 485
ggt gat tat gtt ctt act gaa aat gct ctt gtt cgt ttg act act gaa 1779
Gly Asp Tyr Val Leu Thr Glu Asn Ala Leu Val Arg Leu Thr Thr Glu
490 495 500 505
gtt gtt cgt ggt gtt cgt gat gct cgc ata aag aaa gcc atg ttt act 1827
Val Val Arg Gly Val Arg Asp Ala Arg Ile Lys Lys Ala Met Phe Thr
510 515 520
aaa gta gtt gta ggt cct aca act gaa gtt aag ttt tct gtt att gaa 1875
Lys Val Val Val Gly Pro Thr Thr Glu Val Lys Phe Ser Val Ile Glu
525 530 535
ctt gcc act gtt aat ttg cgt ctt gtt gat tgt gca cct gta gtt tgc 1923
Leu Ala Thr Val Asn Leu Arg Leu Val Asp Cys Ala Pro Val Val Cys
540 545 550
cct aaa ggt aaa att gtt gtt att gct gga caa gct ttt ttc tat agt 1971
Pro Lys Gly Lys Ile Val Val Ile Ala Gly Gln Ala Phe Phe Tyr Ser
555 560 565
ggt ggt ttt tat cgt ttt atg gtt gat tct aca act gta tta aat gac 2019
Gly Gly Phe Tyr Arg Phe Met Val Asp Ser Thr Thr Val Leu Asn Asp
570 575 580 585
cct gtt ttt act ggt gag tta ttt tat act att aag ttt agt ggt ttt 2067
Pro Val Phe Thr Gly Glu Leu Phe Tyr Thr Ile Lys Phe Ser Gly Phe
590 595 600
aag ctt gat ggt ttt aac cat cag ttt gtt aat gct agt tct gct aca 2115
Lys Leu Asp Gly Phe Asn His Gln Phe Val Asn Ala Ser Ser Ala Thr
605 610 615
gat gcc att att gct gtt gag ctg ttg tta tcg gat ttt aaa act gca 2163
Asp Ala Ile Ile Ala Val Glu Leu Leu Leu Ser Asp Phe Lys Thr Ala
620 625 630
gtt ttt gtg tac aca tgt gtg gtt gat ggt tgt agt gtc att gtt aga 2211
Val Phe Val Tyr Thr Cys Val Val Asp Gly Cys Ser Val Ile Val Arg
635 640 645
cgt gat gct aca ttc gcc aca cat gtg tgt ttt aag gac tgt tat agt 2259
Arg Asp Ala Thr Phe Ala Thr His Val Cys Phe Lys Asp Cys Tyr Ser
650 655 660 665
att tgg gag caa ttc tgc att gat aat tgt ggt gag cca tgg ttt ttg 2307
Ile Trp Glu Gln Phe Cys Ile Asp Asn Cys Gly Glu Pro Trp Phe Leu
670 675 680
act gat tat aat gct atc ttg cag agt aat aac cct caa tgt gct att 2355
Thr Asp Tyr Asn Ala Ile Leu Gln Ser Asn Asn Pro Gln Cys Ala Ile
685 690 695
gtt caa gca tcg gag tct aaa gtt ttg ctt gag agg ttt tta cct aag 2403
Val Gln Ala Ser Glu Ser Lys Val Leu Leu Glu Arg Phe Leu Pro Lys
700 705 710
tgt cct gaa ata ctg ttg agt att gat gat ggc cat tta tgg aat ctt 2451
Cys Pro Glu Ile Leu Leu Ser Ile Asp Asp Gly His Leu Trp Asn Leu
715 720 725
ttt gtt gaa aag ttt aat ttt gtt aca gat tgg tta aaa act ctt aag 2499
Phe Val Glu Lys Phe Asn Phe Val Thr Asp Trp Leu Lys Thr Leu Lys
730 735 740 745
ctt aca ctt act tct aat ggt ctt tta ggt aat tgt gcc aaa cgt ttt 2547
Leu Thr Leu Thr Ser Asn Gly Leu Leu Gly Asn Cys Ala Lys Arg Phe
750 755 760
aga cgt gtt ttg gta aaa ttg ctt gat gtc tat aat ggt ttt ctt gaa 2595
Arg Arg Val Leu Val Lys Leu Leu Asp Val Tyr Asn Gly Phe Leu Glu
765 770 775
act gtc tgt agt gtc gta cac act gct ggt gtt tgc att aaa tat tat 2643
Thr Val Cys Ser Val Val His Thr Ala Gly Val Cys Ile Lys Tyr Tyr
780 785 790
gct gtt aat gtt cca tat gta gtt att agt ggt ttt gta agt cgt gta 2691
Ala Val Asn Val Pro Tyr Val Val Ile Ser Gly Phe Val Ser Arg Val
795 800 805
att cgt aga gaa agg tgt gac gtg act ttt cct tgt gtt agt tgt gtc 2739
Ile Arg Arg Glu Arg Cys Asp Val Thr Phe Pro Cys Val Ser Cys Val
810 815 820 825
act ttt ttc tat gaa ttt tta gac acg tgt ttt ggt gtt agt aaa cct 2787
Thr Phe Phe Tyr Glu Phe Leu Asp Thr Cys Phe Gly Val Ser Lys Pro
830 835 840
aat gcc att gat gtt gaa cat tta gag ctt aaa gaa act gtt ttt gtt 2835
Asn Ala Ile Asp Val Glu His Leu Glu Leu Lys Glu Thr Val Phe Val
845 850 855
gaa cct aag gat ggt ggt caa ttt ttt gtt tct gat gat tat ctt tgg 2883
Glu Pro Lys Asp Gly Gly Gln Phe Phe Val Ser Asp Asp Tyr Leu Trp
860 865 870
tat gtt gta gat gac att tat tat cca gct tca tgt aat ggt gta ttg 2931
Tyr Val Val Asp Asp Ile Tyr Tyr Pro Ala Ser Cys Asn Gly Val Leu
875 880 885
cca gtt gct ttt aca aaa ttg gca ggt ggt aaa ata tct ttt tct gat 2979
Pro Val Ala Phe Thr Lys Leu Ala Gly Gly Lys Ile Ser Phe Ser Asp
890 895 900 905
gat gtt ata gtt cat gat gtt gaa cct acc cat aaa gtc aag ctc ata 3027
Asp Val Ile Val His Asp Val Glu Pro Thr His Lys Val Lys Leu Ile
910 915 920
ttt gag ttt gaa gat gat gtt gtt acc agt ctt tgt aag aag agt ttt 3075
Phe Glu Phe Glu Asp Asp Val Val Thr Ser Leu Cys Lys Lys Ser Phe
925 930 935
ggt aag tct att att tat aca ggt gat tgg gaa ggt tta cat gaa gtt 3123
Gly Lys Ser Ile Ile Tyr Thr Gly Asp Trp Glu Gly Leu His Glu Val
940 945 950
ctt aca tct gca atg aat gtc att ggg caa cat att aag ttg cca caa 3171
Leu Thr Ser Ala Met Asn Val Ile Gly Gln His Ile Lys Leu Pro Gln
955 960 965
ttt tat att tat gat gaa gag ggt ggt tat gat gtt tct aaa cca gtt 3219
Phe Tyr Ile Tyr Asp Glu Glu Gly Gly Tyr Asp Val Ser Lys Pro Val
970 975 980 985
atg att tca caa tgg cct att agt gat gat agt gat ggt tgt gtt gtt 3267
Met Ile Ser Gln Trp Pro Ile Ser Asp Asp Ser Asp Gly Cys Val Val
990 995 1000
gaa gcg agc act gat ttt cat caa tta gaa tct gtt aga gaa gag 3312
Glu Ala Ser Thr Asp Phe His Gln Leu Glu Ser Val Arg Glu Glu
1005 1010 1015
gtt gat ata att gaa caa cct ttt ggg gaa gtt gaa cat gcg ctc 3357
Val Asp Ile Ile Glu Gln Pro Phe Gly Glu Val Glu His Ala Leu
1020 1025 1030
tca att aga caa cct ttt tct ttt tct ttt aga gat gaa ttg ggt 3402
Ser Ile Arg Gln Pro Phe Ser Phe Ser Phe Arg Asp Glu Leu Gly
1035 1040 1045
gtt cgt gtt tta gat caa tct gat aat aat tgt tgg att agt acc 3447
Val Arg Val Leu Asp Gln Ser Asp Asn Asn Cys Trp Ile Ser Thr
1050 1055 1060
aca ctt ata cag ttg caa ctt aca aag ctt ttg gat gat tct att 3492
Thr Leu Ile Gln Leu Gln Leu Thr Lys Leu Leu Asp Asp Ser Ile
1065 1070 1075
gag atg caa ttg ttt aaa gtt ggt aaa gtt gat tca att gtt caa 3537
Glu Met Gln Leu Phe Lys Val Gly Lys Val Asp Ser Ile Val Gln
1080 1085 1090
aag tgt tat gag ttg tct cat tta att agt ggt tca ctt ggt gat 3582
Lys Cys Tyr Glu Leu Ser His Leu Ile Ser Gly Ser Leu Gly Asp
1095 1100 1105
agt ggt aaa ctt ctt agt gaa ctt ctt aaa gat aaa tat aca tgt 3627
Ser Gly Lys Leu Leu Ser Glu Leu Leu Lys Asp Lys Tyr Thr Cys
1110 1115 1120
tct ata act ttt gag atg tct tgt gat tgt ggt aaa aag ttt gat 3672
Ser Ile Thr Phe Glu Met Ser Cys Asp Cys Gly Lys Lys Phe Asp
1125 1130 1135
gag caa gtt ggt tgt ttg ttt tgg att atg cct tac aca aaa ctt 3717
Glu Gln Val Gly Cys Leu Phe Trp Ile Met Pro Tyr Thr Lys Leu
1140 1145 1150
ttt caa aaa ggt gag tgt tgt att tgt cat aaa atg cag act tat 3762
Phe Gln Lys Gly Glu Cys Cys Ile Cys His Lys Met Gln Thr Tyr
1155 1160 1165
aag ctt gtt agt atg aaa ggt act ggt gtg ttt gta cag gat cca 3807
Lys Leu Val Ser Met Lys Gly Thr Gly Val Phe Val Gln Asp Pro
1170 1175 1180
gca cct att gac att gat gct ttc cct gtt aga cct ata tgt tca 3852
Ala Pro Ile Asp Ile Asp Ala Phe Pro Val Arg Pro Ile Cys Ser
1185 1190 1195
tct gta tat tta ggt gtt aag ggt tct ggt cat tat caa aca aat 3897
Ser Val Tyr Leu Gly Val Lys Gly Ser Gly His Tyr Gln Thr Asn
1200 1205 1210
tta tac agt ttt gac aaa gct att gat ggt ttt ggt gtc ttt gac 3942
Leu Tyr Ser Phe Asp Lys Ala Ile Asp Gly Phe Gly Val Phe Asp
1215 1220 1225
att aaa aat agt agt gtt aat act gtt tgt ttt gtt gat gtt gat 3987
Ile Lys Asn Ser Ser Val Asn Thr Val Cys Phe Val Asp Val Asp
1230 1235 1240
ttt cat agt gta gaa ata gaa gct ggt gaa gtt aaa cct ttt gct 4032
Phe His Ser Val Glu Ile Glu Ala Gly Glu Val Lys Pro Phe Ala
1245 1250 1255
gta tat aaa aat gtt aaa ttt tat tta ggt gat att tca cac ctt 4077
Val Tyr Lys Asn Val Lys Phe Tyr Leu Gly Asp Ile Ser His Leu
1260 1265 1270
gta aac tgt gtt tct ttt gac ttt gtt gtc aat gct gct aat gaa 4122
Val Asn Cys Val Ser Phe Asp Phe Val Val Asn Ala Ala Asn Glu
1275 1280 1285
aat ctc atg cat gga ggc ggt gtc gca cgt gct att gat att ttg 4167
Asn Leu Met His Gly Gly Gly Val Ala Arg Ala Ile Asp Ile Leu
1290 1295 1300
act gaa ggt caa ctt cag tca tta tct aaa gat tac att agt agt 4212
Thr Glu Gly Gln Leu Gln Ser Leu Ser Lys Asp Tyr Ile Ser Ser
1305 1310 1315
aat ggt cca ctt aag gtt gga gca ggt gtt atg ttg gag tgt gaa 4257
Asn Gly Pro Leu Lys Val Gly Ala Gly Val Met Leu Glu Cys Glu
1320 1325 1330
aaa ttc aat gta ttt aat gtt gtt ggt ccg cga act ggt aaa cat 4302
Lys Phe Asn Val Phe Asn Val Val Gly Pro Arg Thr Gly Lys His
1335 1340 1345
gag cat tca tta ctt gtt gaa gct tat aat tct att tta ttt gaa 4347
Glu His Ser Leu Leu Val Glu Ala Tyr Asn Ser Ile Leu Phe Glu
1350 1355 1360
aat ggt att cca ctt atg cct ctt ctt agt tgt ggt att ttt ggt 4392
Asn Gly Ile Pro Leu Met Pro Leu Leu Ser Cys Gly Ile Phe Gly
1365 1370 1375
gta agg att gaa aat tct ctt aaa gct ttg ttt agt tgt gac att 4437
Val Arg Ile Glu Asn Ser Leu Lys Ala Leu Phe Ser Cys Asp Ile
1380 1385 1390
aat aaa cca ttg caa gtt ttt gtt tat tct tca aat gaa gaa caa 4482
Asn Lys Pro Leu Gln Val Phe Val Tyr Ser Ser Asn Glu Glu Gln
1395 1400 1405
gct gtt ctt aag ttt tta gat ggt tta gat tta aca cca gtc att 4527
Ala Val Leu Lys Phe Leu Asp Gly Leu Asp Leu Thr Pro Val Ile
1410 1415 1420
gac gat gtt gat gtt gtt aaa cct ttt aga gtt gaa ggt aat ttt 4572
Asp Asp Val Asp Val Val Lys Pro Phe Arg Val Glu Gly Asn Phe
1425 1430 1435
tca ttc ttt gat tgt ggt gtc aat gcc ttg gat ggt gat att tac 4617
Ser Phe Phe Asp Cys Gly Val Asn Ala Leu Asp Gly Asp Ile Tyr
1440 1445 1450
tta tta ttt act aac tct att tta atg ttg gat aaa caa gga caa 4662
Leu Leu Phe Thr Asn Ser Ile Leu Met Leu Asp Lys Gln Gly Gln
1455 1460 1465
tta ttg gac aca aaa ctt aat ggt att ttg caa cag gca gtt ctt 4707
Leu Leu Asp Thr Lys Leu Asn Gly Ile Leu Gln Gln Ala Val Leu
1470 1475 1480
gat tat ctt gct aca gtt aaa act gta cca gct ggt aat ttg gtt 4752
Asp Tyr Leu Ala Thr Val Lys Thr Val Pro Ala Gly Asn Leu Val
1485 1490 1495
aaa ctt gtt gtt gag agt tgt acc att tat atg tgt gtt gta cca 4797
Lys Leu Val Val Glu Ser Cys Thr Ile Tyr Met Cys Val Val Pro
1500 1505 1510
tcg ata aat gat ctt tct ttt gat aaa aat ctt ggt cgt tgt gtg 4842
Ser Ile Asn Asp Leu Ser Phe Asp Lys Asn Leu Gly Arg Cys Val
1515 1520 1525
cgt aaa ctt aat aga ttg aaa act tgt gtt att gcc aat gtt cct 4887
Arg Lys Leu Asn Arg Leu Lys Thr Cys Val Ile Ala Asn Val Pro
1530 1535 1540
gct att gat gtt ttg aaa aag ctt ctt tca agt ttg act tta act 4932
Ala Ile Asp Val Leu Lys Lys Leu Leu Ser Ser Leu Thr Leu Thr
1545 1550 1555
gtt aaa ttt gtt gta gag agt aat gtt atg gat gtt aac gac tgt 4977
Val Lys Phe Val Val Glu Ser Asn Val Met Asp Val Asn Asp Cys
1560 1565 1570
ttt aag aat gat aat gta gtt ttg aaa att act gaa gat ggt att 5022
Phe Lys Asn Asp Asn Val Val Leu Lys Ile Thr Glu Asp Gly Ile
1575 1580 1585
aat gtt aaa gat gtt gtt gtt gag tct tct aag tca ctt ggt aaa 5067
Asn Val Lys Asp Val Val Val Glu Ser Ser Lys Ser Leu Gly Lys
1590 1595 1600
caa ttg ggt gtt gtg agt gat ggt gtt gac tct ttt gaa ggt gtt 5112
Gln Leu Gly Val Val Ser Asp Gly Val Asp Ser Phe Glu Gly Val
1605 1610 1615
tta cct att aat act gat act gtc tta tct gta gct cca gaa gtt 5157
Leu Pro Ile Asn Thr Asp Thr Val Leu Ser Val Ala Pro Glu Val
1620 1625 1630
gac tgg gtt gct ttt tac ggt ttt gaa aag gca gca ctt ttt gct 5202
Asp Trp Val Ala Phe Tyr Gly Phe Glu Lys Ala Ala Leu Phe Ala
1635 1640 1645
tct ttg gat gta aag cca tat ggt tac cct aat gat ttt gtt ggt 5247
Ser Leu Asp Val Lys Pro Tyr Gly Tyr Pro Asn Asp Phe Val Gly
1650 1655 1660
ggt ttt aga gtt ctt ggg acc acc gac aat aat tgt tgg gtt aat 5292
Gly Phe Arg Val Leu Gly Thr Thr Asp Asn Asn Cys Trp Val Asn
1665 1670 1675
gca act tgt ata att tta cag tat ctt aag cct act ttt aaa tct 5337
Ala Thr Cys Ile Ile Leu Gln Tyr Leu Lys Pro Thr Phe Lys Ser
1680 1685 1690
aag ggt tta aat gtt ctt tgg aac aaa ttt gtt aca ggt gat gtt 5382
Lys Gly Leu Asn Val Leu Trp Asn Lys Phe Val Thr Gly Asp Val
1695 1700 1705
gga cct ttt gtt agt ttt att tat ttt ata act atg tct tca aag 5427
Gly Pro Phe Val Ser Phe Ile Tyr Phe Ile Thr Met Ser Ser Lys
1710 1715 1720
ggt caa aag ggt gat gct gaa gag gca tta tct aaa ttg tca gag 5472
Gly Gln Lys Gly Asp Ala Glu Glu Ala Leu Ser Lys Leu Ser Glu
1725 1730 1735
tat ttg att agt gat tct att gtt act ctt gaa caa tat tca act 5517
Tyr Leu Ile Ser Asp Ser Ile Val Thr Leu Glu Gln Tyr Ser Thr
1740 1745 1750
tgt gac att tgt aaa agt act gta gtt gaa gtt aaa agt gct gtt 5562
Cys Asp Ile Cys Lys Ser Thr Val Val Glu Val Lys Ser Ala Val
1755 1760 1765
gtc tgt gct agt gtg ctt aaa gat ggt tgt gat gtt ggt ttt tgt 5607
Val Cys Ala Ser Val Leu Lys Asp Gly Cys Asp Val Gly Phe Cys
1770 1775 1780
cca cac aga cat aaa ttg cgt tca cgt gtt aag ttt gtt aat gga 5652
Pro His Arg His Lys Leu Arg Ser Arg Val Lys Phe Val Asn Gly
1785 1790 1795
cgt gtt gtt att acc aat gtt ggt gaa cct ata att tca caa cct 5697
Arg Val Val Ile Thr Asn Val Gly Glu Pro Ile Ile Ser Gln Pro
1800 1805 1810
tct aag ttg ctt aat ggt att gct tat aca aca ttt tca ggt tct 5742
Ser Lys Leu Leu Asn Gly Ile Ala Tyr Thr Thr Phe Ser Gly Ser
1815 1820 1825
ttt gat aac ggt cac tat gta gtt tat gat gct gct aat aat gct 5787
Phe Asp Asn Gly His Tyr Val Val Tyr Asp Ala Ala Asn Asn Ala
1830 1835 1840
gtc tat gat ggt gct cgt tta ttt gct tca gat ttg tct act tta 5832
Val Tyr Asp Gly Ala Arg Leu Phe Ala Ser Asp Leu Ser Thr Leu
1845 1850 1855
gct gtt aca gct att gtt gta gta ggt ggt tgt gta aca tct aat 5877
Ala Val Thr Ala Ile Val Val Val Gly Gly Cys Val Thr Ser Asn
1860 1865 1870
gtt cca cca att gtt agt gag aaa att tct gtt atg gat aaa ctt 5922
Val Pro Pro Ile Val Ser Glu Lys Ile Ser Val Met Asp Lys Leu
1875 1880 1885
gat act ggt gca caa aaa ttt ttc caa ttt ggt gat ttt gtt atg 5967
Asp Thr Gly Ala Gln Lys Phe Phe Gln Phe Gly Asp Phe Val Met
1890 1895 1900
aat aac att gtt ctg ttt tta act tgg ttg ctt agt atg ttt agt 6012
Asn Asn Ile Val Leu Phe Leu Thr Trp Leu Leu Ser Met Phe Ser
1905 1910 1915
ctt tta cgt act tct att atg aag cat gat att aaa gtt att gcc 6057
Leu Leu Arg Thr Ser Ile Met Lys His Asp Ile Lys Val Ile Ala
1920 1925 1930
aag gct cct aaa cgt aca ggt gtt att ttg aca cgt agt ttt aag 6102
Lys Ala Pro Lys Arg Thr Gly Val Ile Leu Thr Arg Ser Phe Lys
1935 1940 1945
tat aac att aga tct gct ttg ttt gtt gta aag cag aag tgg tgt 6147
Tyr Asn Ile Arg Ser Ala Leu Phe Val Val Lys Gln Lys Trp Cys
1950 1955 1960
gtt att gtt act ttg ttt aag ttc tta ttg tta tta tat gct att 6192
Val Ile Val Thr Leu Phe Lys Phe Leu Leu Leu Leu Tyr Ala Ile
1965 1970 1975
tat gca ctt gtt ttt atg att gtg caa ttt agt cct ttt aat agt 6237
Tyr Ala Leu Val Phe Met Ile Val Gln Phe Ser Pro Phe Asn Ser
1980 1985 1990
ctt tta tgt ggt gac att gta agt ggt tat gaa aaa tcc act ttt 6282
Leu Leu Cys Gly Asp Ile Val Ser Gly Tyr Glu Lys Ser Thr Phe
1995 2000 2005
aat aag gat att tat tgt ggt aat tct atg gtt tgt aag atg tgt 6327
Asn Lys Asp Ile Tyr Cys Gly Asn Ser Met Val Cys Lys Met Cys
2010 2015 2020
ttg ttt agt tat caa gag ttt aat gat ttg gat cat act agt ctt 6372
Leu Phe Ser Tyr Gln Glu Phe Asn Asp Leu Asp His Thr Ser Leu
2025 2030 2035
gtt tgg aag cac att cgt gat cct ata tta atc agt tta caa cca 6417
Val Trp Lys His Ile Arg Asp Pro Ile Leu Ile Ser Leu Gln Pro
2040 2045 2050
ttt gtt ata ctt gtt att ttg tta att ttt ggt aat atg tat ttg 6462
Phe Val Ile Leu Val Ile Leu Leu Ile Phe Gly Asn Met Tyr Leu
2055 2060 2065
cgt ttt gga ctt tta tat ttt gtt gca caa ttt att agt act ttt 6507
Arg Phe Gly Leu Leu Tyr Phe Val Ala Gln Phe Ile Ser Thr Phe
2070 2075 2080
ggt tct ttc tta ggc ttt cat cag aaa cag tgg ttt tta cat ttt 6552
Gly Ser Phe Leu Gly Phe His Gln Lys Gln Trp Phe Leu His Phe
2085 2090 2095
gtg ccg ttt gat gtt tta tgt aat gag ttt tta gct aca ttt att 6597
Val Pro Phe Asp Val Leu Cys Asn Glu Phe Leu Ala Thr Phe Ile
2100 2105 2110
gtc tgc aaa att gtt tta ttt gtt aga cat att att gtt ggc tgt 6642
Val Cys Lys Ile Val Leu Phe Val Arg His Ile Ile Val Gly Cys
2115 2120 2125
aat aat gct gac tgt gta gct tgt tct aaa agt gct aga ctt aaa 6687
Asn Asn Ala Asp Cys Val Ala Cys Ser Lys Ser Ala Arg Leu Lys
2130 2135 2140
cgt gta cca ctt caa act att att aat ggt atg cat aaa tca ttc 6732
Arg Val Pro Leu Gln Thr Ile Ile Asn Gly Met His Lys Ser Phe
2145 2150 2155
tat gtt aat gct aat ggt ggt act tgt ttc tgt aat aaa cat aac 6777
Tyr Val Asn Ala Asn Gly Gly Thr Cys Phe Cys Asn Lys His Asn
2160 2165 2170
ttc ttt tgt gtt aat tgt gat tct ttt ggg cct ggt aat act ttt 6822
Phe Phe Cys Val Asn Cys Asp Ser Phe Gly Pro Gly Asn Thr Phe
2175 2180 2185
att aat ggt gat att gca aga gag ctt ggt aat gtt gtt aaa aca 6867
Ile Asn Gly Asp Ile Ala Arg Glu Leu Gly Asn Val Val Lys Thr
2190 2195 2200
gct gtt caa ccc aca gct cct gca tat gtt att att gat aag gta 6912
Ala Val Gln Pro Thr Ala Pro Ala Tyr Val Ile Ile Asp Lys Val
2205 2210 2215
gat ttt gtt aat gga ttt tat cgt ctt tat agt ggt gac act ttt 6957
Asp Phe Val Asn Gly Phe Tyr Arg Leu Tyr Ser Gly Asp Thr Phe
2220 2225 2230
tgg cgg tat gac ttt gac att act gaa tct aag tat agt tgt aaa 7002
Trp Arg Tyr Asp Phe Asp Ile Thr Glu Ser Lys Tyr Ser Cys Lys
2235 2240 2245
gag gtt ctg aag aat tgt aat gtt tta gaa aat ttt att gtt tac 7047
Glu Val Leu Lys Asn Cys Asn Val Leu Glu Asn Phe Ile Val Tyr
2250 2255 2260
aat aat agt ggt agt aac att aca cag att aaa aat gct tgt gtt 7092
Asn Asn Ser Gly Ser Asn Ile Thr Gln Ile Lys Asn Ala Cys Val
2265 2270 2275
tat ttt tct caa ttg ttg tgt gaa cct ata aag ttg gta aat tca 7137
Tyr Phe Ser Gln Leu Leu Cys Glu Pro Ile Lys Leu Val Asn Ser
2280 2285 2290
gag ttg ttg tca act tta tca gtt gat ttt aat ggt gtt ttg cat 7182
Glu Leu Leu Ser Thr Leu Ser Val Asp Phe Asn Gly Val Leu His
2295 2300 2305
aag gca tat gtt gat gtt ttg tgt aat agt ttt ttt aag gag cta 7227
Lys Ala Tyr Val Asp Val Leu Cys Asn Ser Phe Phe Lys Glu Leu
2310 2315 2320
act gct aac atg tcc atg gct gaa tgt aaa gct aca ctt ggt ttg 7272
Thr Ala Asn Met Ser Met Ala Glu Cys Lys Ala Thr Leu Gly Leu
2325 2330 2335
act gtt tct gat gat gat ttt gtt tca gct gtt gcc aat gca cat 7317
Thr Val Ser Asp Asp Asp Phe Val Ser Ala Val Ala Asn Ala His
2340 2345 2350
agg tat gac gtt ttg ctt tca gat ttg tca ttt aat aat ttt ttt 7362
Arg Tyr Asp Val Leu Leu Ser Asp Leu Ser Phe Asn Asn Phe Phe
2355 2360 2365
att tct tat gct aaa cct gaa gat aag ttg tcc gtt tat gac att 7407
Ile Ser Tyr Ala Lys Pro Glu Asp Lys Leu Ser Val Tyr Asp Ile
2370 2375 2380
gct tgt tgt atg cgt gcc ggt tct aag gtt gtt aac cat aat gtt 7452
Ala Cys Cys Met Arg Ala Gly Ser Lys Val Val Asn His Asn Val
2385 2390 2395
tta atc aaa gag tca ata cct att gtt tgg ggt gtc aag gac ttt 7497
Leu Ile Lys Glu Ser Ile Pro Ile Val Trp Gly Val Lys Asp Phe
2400 2405 2410
aat act ctt tct caa gaa ggt aag aag tac ctt gtt aaa aca act 7542
Asn Thr Leu Ser Gln Glu Gly Lys Lys Tyr Leu Val Lys Thr Thr
2415 2420 2425
aaa gca aag ggt ttg act ttt tta tta act ttt aat gat aac caa 7587
Lys Ala Lys Gly Leu Thr Phe Leu Leu Thr Phe Asn Asp Asn Gln
2430 2435 2440
gca att aca caa gtt cct gct act agt ata gtt gca aaa cag ggt 7632
Ala Ile Thr Gln Val Pro Ala Thr Ser Ile Val Ala Lys Gln Gly
2445 2450 2455
gct ggt ttt aaa cgt act tat aat ttt ctg tgg tat gta tgt tta 7677
Ala Gly Phe Lys Arg Thr Tyr Asn Phe Leu Trp Tyr Val Cys Leu
2460 2465 2470
ttt gtt gtt gca ttg ttt att ggt gtc tca ttt att gat tat aca 7722
Phe Val Val Ala Leu Phe Ile Gly Val Ser Phe Ile Asp Tyr Thr
2475 2480 2485
acc act gta act agc ttt cat ggt tat gat ttt aag tac att gag 7767
Thr Thr Val Thr Ser Phe His Gly Tyr Asp Phe Lys Tyr Ile Glu
2490 2495 2500
aat ggt cag ttg aag gtg ttt gaa gca cct tta cac tgt gtt cgt 7812
Asn Gly Gln Leu Lys Val Phe Glu Ala Pro Leu His Cys Val Arg
2505 2510 2515
aat gtt ttt gat aat ttt aat caa tgg cat gag gct aag ttt ggt 7857
Asn Val Phe Asp Asn Phe Asn Gln Trp His Glu Ala Lys Phe Gly
2520 2525 2530
gtt gtt act act aat agt gat aaa tgt cct ata gtt gtt ggt gtt 7902
Val Val Thr Thr Asn Ser Asp Lys Cys Pro Ile Val Val Gly Val
2535 2540 2545
tca gag cgt att aat gtt gtt cct ggt gtt cca aca aat gta tat 7947
Ser Glu Arg Ile Asn Val Val Pro Gly Val Pro Thr Asn Val Tyr
2550 2555 2560
ttg gta gga aag act ctt gtt ttt aca tta cag gct gct ttt gga 7992
Leu Val Gly Lys Thr Leu Val Phe Thr Leu Gln Ala Ala Phe Gly
2565 2570 2575
aac aca ggt gtt tgt tat gac ttt gat ggt gtt acc act agt gat 8037
Asn Thr Gly Val Cys Tyr Asp Phe Asp Gly Val Thr Thr Ser Asp
2580 2585 2590
aag tgt att ttt aat tct gct tgt act agg ttg gaa ggt ttg ggt 8082
Lys Cys Ile Phe Asn Ser Ala Cys Thr Arg Leu Glu Gly Leu Gly
2595 2600 2605
ggt gac aat gtt tat tgt tac aac act gat ctt att gaa ggt tct 8127
Gly Asp Asn Val Tyr Cys Tyr Asn Thr Asp Leu Ile Glu Gly Ser
2610 2615 2620
aaa cct tat agt att tta cag ccc aat gct tat tat aag tat gat 8172
Lys Pro Tyr Ser Ile Leu Gln Pro Asn Ala Tyr Tyr Lys Tyr Asp
2625 2630 2635
gtt aaa aat tat gta cgt ttt cca gaa att tta gct aga ggt ttt 8217
Val Lys Asn Tyr Val Arg Phe Pro Glu Ile Leu Ala Arg Gly Phe
2640 2645 2650
ggc tta cgt act att aga act ttg gct aca cgt tat tgt aga gtt 8262
Gly Leu Arg Thr Ile Arg Thr Leu Ala Thr Arg Tyr Cys Arg Val
2655 2660 2665
ggt gaa tgc cgt gac tca cat aaa ggt gtt tgt ttt ggt ttt gat 8307
Gly Glu Cys Arg Asp Ser His Lys Gly Val Cys Phe Gly Phe Asp
2670 2675 2680
aaa tgg tat gtt aat gat gga cgt gtt gat gac ggt tac att tgt 8352
Lys Trp Tyr Val Asn Asp Gly Arg Val Asp Asp Gly Tyr Ile Cys
2685 2690 2695
ggt gat ggt ctt ata gac ctt ctt gtt aat gta ctc tca atc ttt 8397
Gly Asp Gly Leu Ile Asp Leu Leu Val Asn Val Leu Ser Ile Phe
2700 2705 2710
agt tca tct ttt agc gtt gtg gct atg tct gga cat atg ttg ttt 8442
Ser Ser Ser Phe Ser Val Val Ala Met Ser Gly His Met Leu Phe
2715 2720 2725
aat ttt ctt ttt gca gca ttt att aca ttt ttg tgc ttt tta gtt 8487
Asn Phe Leu Phe Ala Ala Phe Ile Thr Phe Leu Cys Phe Leu Val
2730 2735 2740
act aaa ttt aaa cgt gtt ttt ggt gat ctt tct tat ggt gtt ttt 8532
Thr Lys Phe Lys Arg Val Phe Gly Asp Leu Ser Tyr Gly Val Phe
2745 2750 2755
act gtt gtt tgt gca act ttg att aat aac att tct tat gtt gtt 8577
Thr Val Val Cys Ala Thr Leu Ile Asn Asn Ile Ser Tyr Val Val
2760 2765 2770
act caa aat tta ttt ttt atg ttg ctt tat gct att ttg tat ttt 8622
Thr Gln Asn Leu Phe Phe Met Leu Leu Tyr Ala Ile Leu Tyr Phe
2775 2780 2785
gtt ttt act agg aca gtg cgt tat gct tgg att tgg cat att gca 8667
Val Phe Thr Arg Thr Val Arg Tyr Ala Trp Ile Trp His Ile Ala
2790 2795 2800
tac att gtt gca tac ttc ttg tta ata cca tgg tgg ctt ctc aca 8712
Tyr Ile Val Ala Tyr Phe Leu Leu Ile Pro Trp Trp Leu Leu Thr
2805 2810 2815
tgg ttt agt ttt gct gca ttt tta gag ctt tta cct aat gtt ttt 8757
Trp Phe Ser Phe Ala Ala Phe Leu Glu Leu Leu Pro Asn Val Phe
2820 2825 2830
aag tta aaa atc tct act caa ttg ttt gaa ggt gat aag ttt ata 8802
Lys Leu Lys Ile Ser Thr Gln Leu Phe Glu Gly Asp Lys Phe Ile
2835 2840 2845
ggt act ttt gag agt gct gct gca ggt aca ttt gtt ctt gac atg 8847
Gly Thr Phe Glu Ser Ala Ala Ala Gly Thr Phe Val Leu Asp Met
2850 2855 2860
cgt tct tat gaa agg ctg ata aat act att tca cct gag aaa ctt 8892
Arg Ser Tyr Glu Arg Leu Ile Asn Thr Ile Ser Pro Glu Lys Leu
2865 2870 2875
aag aat tat gct gca agt tat aat aaa tat aaa tat tat agt ggt 8937
Lys Asn Tyr Ala Ala Ser Tyr Asn Lys Tyr Lys Tyr Tyr Ser Gly
2880 2885 2890
agt gct agt gag gct gat tat cgt tgt gct tgt tat gct cat tta 8982
Ser Ala Ser Glu Ala Asp Tyr Arg Cys Ala Cys Tyr Ala His Leu
2895 2900 2905
gcc aag gct atg tta gat tac gca aaa gat cat aat gac atg tta 9027
Ala Lys Ala Met Leu Asp Tyr Ala Lys Asp His Asn Asp Met Leu
2910 2915 2920
tat tct cca cct acc att agc tac aat tcc acc tta caa tct ggt 9072
Tyr Ser Pro Pro Thr Ile Ser Tyr Asn Ser Thr Leu Gln Ser Gly
2925 2930 2935
ctt aag aag atg gca caa cca tct ggt tgt gtt gag aga tgt gtg 9117
Leu Lys Lys Met Ala Gln Pro Ser Gly Cys Val Glu Arg Cys Val
2940 2945 2950
gtt cgc gtc tgt tat ggt agt act gtg ctt aat gga gtt tgg tta 9162
Val Arg Val Cys Tyr Gly Ser Thr Val Leu Asn Gly Val Trp Leu
2955 2960 2965
ggt gac act gtt act tgt cct aga cat gtc ata gca cca tca acc 9207
Gly Asp Thr Val Thr Cys Pro Arg His Val Ile Ala Pro Ser Thr
2970 2975 2980
act gtt ctt att gat tat gat cat gca tat agt act atg cgt ttg 9252
Thr Val Leu Ile Asp Tyr Asp His Ala Tyr Ser Thr Met Arg Leu
2985 2990 2995
cat aat ttt tca gtg tct cat aat ggt gtc ttc ttg gga gtt gtt 9297
His Asn Phe Ser Val Ser His Asn Gly Val Phe Leu Gly Val Val
3000 3005 3010
ggt gtt aca atg cat ggt tct gtg ttg cgt att aag gtt tca caa 9342
Gly Val Thr Met His Gly Ser Val Leu Arg Ile Lys Val Ser Gln
3015 3020 3025
tct aat gta cat aca cct aaa cat gtt ttt aaa acg ttg aaa cct 9387
Ser Asn Val His Thr Pro Lys His Val Phe Lys Thr Leu Lys Pro
3030 3035 3040
ggt gct tct ttt aat att tta gca tgt tat gaa ggt att gca tct 9432
Gly Ala Ser Phe Asn Ile Leu Ala Cys Tyr Glu Gly Ile Ala Ser
3045 3050 3055
ggt gtt ttt ggt gtt aat tta cgt aca aac ttt act att aaa ggt 9477
Gly Val Phe Gly Val Asn Leu Arg Thr Asn Phe Thr Ile Lys Gly
3060 3065 3070
tct ttt ata aat gga gct tgt ggt tct cct ggt tat aat gtt aga 9522
Ser Phe Ile Asn Gly Ala Cys Gly Ser Pro Gly Tyr Asn Val Arg
3075 3080 3085
aat gat ggt act gtt gag ttt tgt tat tta cac caa att gag tta 9567
Asn Asp Gly Thr Val Glu Phe Cys Tyr Leu His Gln Ile Glu Leu
3090 3095 3100
ggt agt ggt gct cat gtt ggt tct gat ttt act ggt agt gtt tat 9612
Gly Ser Gly Ala His Val Gly Ser Asp Phe Thr Gly Ser Val Tyr
3105 3110 3115
ggt aat ttt gat gac caa cct agt ttg caa gtt gag agt gcc aac 9657
Gly Asn Phe Asp Asp Gln Pro Ser Leu Gln Val Glu Ser Ala Asn
3120 3125 3130
ctt atg cta tca gat aat gtt gtt gcc ttt ttg tat gct gct ttg 9702
Leu Met Leu Ser Asp Asn Val Val Ala Phe Leu Tyr Ala Ala Leu
3135 3140 3145
ttg aat ggt tgt agg tgg tgg ttg cgt tca act aga gtt aat gtt 9747
Leu Asn Gly Cys Arg Trp Trp Leu Arg Ser Thr Arg Val Asn Val
3150 3155 3160
gat ggt ttt aat gaa tgg gct atg gct aat ggt tat aca att gtt 9792
Asp Gly Phe Asn Glu Trp Ala Met Ala Asn Gly Tyr Thr Ile Val
3165 3170 3175
tct agt gtt gag tgc tat tct att ttg gca gca aaa act ggt gtt 9837
Ser Ser Val Glu Cys Tyr Ser Ile Leu Ala Ala Lys Thr Gly Val
3180 3185 3190
agt gtt gaa caa ttg tta gct tcc att caa cat ctt cat gaa ggt 9882
Ser Val Glu Gln Leu Leu Ala Ser Ile Gln His Leu His Glu Gly
3195 3200 3205
ttt ggt ggt aaa aac ata ctt ggt tat tct agt tta tgt gat gag 9927
Phe Gly Gly Lys Asn Ile Leu Gly Tyr Ser Ser Leu Cys Asp Glu
3210 3215 3220
ttc aca cta gct gaa gtt gtg aag cag atg tat ggt gtt aac ttg 9972
Phe Thr Leu Ala Glu Val Val Lys Gln Met Tyr Gly Val Asn Leu
3225 3230 3235
caa agt ggt aag gtt att ttt ggt tta aaa aca atg ttt tta ttt 10017
Gln Ser Gly Lys Val Ile Phe Gly Leu Lys Thr Met Phe Leu Phe
3240 3245 3250
agc gtt ttc ttc aca atg ttt tgg gca gaa ctc ttt att tat aca 10062
Ser Val Phe Phe Thr Met Phe Trp Ala Glu Leu Phe Ile Tyr Thr
3255 3260 3265
aac act ata tgg ata aac cct gtt ata ctt aca cct ata ttt tgt 10107
Asn Thr Ile Trp Ile Asn Pro Val Ile Leu Thr Pro Ile Phe Cys
3270 3275 3280
tta ctt ttg ttt ttg tca tta gtt tta act atg ttt ctt aaa cat 10152
Leu Leu Leu Phe Leu Ser Leu Val Leu Thr Met Phe Leu Lys His
3285 3290 3295
aag ttt ttg ttt ttg caa gta ttt tta tta cct act gtt att gca 10197
Lys Phe Leu Phe Leu Gln Val Phe Leu Leu Pro Thr Val Ile Ala
3300 3305 3310
act gct tta tat aat tgt gtt ttg gat tat tac ata gta aaa ttt 10242
Thr Ala Leu Tyr Asn Cys Val Leu Asp Tyr Tyr Ile Val Lys Phe
3315 3320 3325
ttg gct gac cat ttt aac tat aat gtt tca gta tta caa atg gat 10287
Leu Ala Asp His Phe Asn Tyr Asn Val Ser Val Leu Gln Met Asp
3330 3335 3340
gtt cag ggt tta gtt aat gtt ttg gtc tgt tta ttt gtt gta ttt 10332
Val Gln Gly Leu Val Asn Val Leu Val Cys Leu Phe Val Val Phe
3345 3350 3355
tta cac aca tgg cgt ttt tct aaa gaa cgt ttc aca cat tgg ttt 10377
Leu His Thr Trp Arg Phe Ser Lys Glu Arg Phe Thr His Trp Phe
3360 3365 3370
aca tat gtg tgt tct ctt ata gca gtt gct tac act tat ttt tat 10422
Thr Tyr Val Cys Ser Leu Ile Ala Val Ala Tyr Thr Tyr Phe Tyr
3375 3380 3385
agt ggt gac ttt ttg agt ttg ctt gtt atg ttt tta tgt gct ata 10467
Ser Gly Asp Phe Leu Ser Leu Leu Val Met Phe Leu Cys Ala Ile
3390 3395 3400
tct agt gat tgg tac att ggt gcc att gtt ttt agg ttg tca cgt 10512
Ser Ser Asp Trp Tyr Ile Gly Ala Ile Val Phe Arg Leu Ser Arg
3405 3410 3415
ttg att ata ttt ttt tca cct gaa agt gta ttt agt gtt ttt ggt 10557
Leu Ile Ile Phe Phe Ser Pro Glu Ser Val Phe Ser Val Phe Gly
3420 3425 3430
gat gtg aaa ctc act tta gtt gtt tat tta att tgt ggt tat tta 10602
Asp Val Lys Leu Thr Leu Val Val Tyr Leu Ile Cys Gly Tyr Leu
3435 3440 3445
gtt tgt act tat tgg ggc att ttg tat tgg ttc aat agg ttt ttt 10647
Val Cys Thr Tyr Trp Gly Ile Leu Tyr Trp Phe Asn Arg Phe Phe
3450 3455 3460
aaa tgt act atg ggt gtt tat gat ttt aag gtg agt gct gct gaa 10692
Lys Cys Thr Met Gly Val Tyr Asp Phe Lys Val Ser Ala Ala Glu
3465 3470 3475
ttt aaa tac atg gtt gct aat gga ctt cat gca cca tat gga cct 10737
Phe Lys Tyr Met Val Ala Asn Gly Leu His Ala Pro Tyr Gly Pro
3480 3485 3490
ttt gat gca ctt tgg tta tca ttc aaa tta ctt ggt att ggt ggt 10782
Phe Asp Ala Leu Trp Leu Ser Phe Lys Leu Leu Gly Ile Gly Gly
3495 3500 3505
gac cgt tgt ata aaa att tca act gtc caa tcc aaa ctg act gat 10827
Asp Arg Cys Ile Lys Ile Ser Thr Val Gln Ser Lys Leu Thr Asp
3510 3515 3520
ttg aag tgt act aat gtt gtg tta ttg ggt tgt ttg tct agt atg 10872
Leu Lys Cys Thr Asn Val Val Leu Leu Gly Cys Leu Ser Ser Met
3525 3530 3535
aac att gca gct aat tct agt gaa tgg gct tat tgt gtt gat tta 10917
Asn Ile Ala Ala Asn Ser Ser Glu Trp Ala Tyr Cys Val Asp Leu
3540 3545 3550
cac aat aag att aat ctt tgt gat gac cca gaa aaa gct caa ggt 10962
His Asn Lys Ile Asn Leu Cys Asp Asp Pro Glu Lys Ala Gln Gly
3555 3560 3565
atg ttg tta gca ctc ctt gcg ttc ttt cta agt aaa cat agt gat 11007
Met Leu Leu Ala Leu Leu Ala Phe Phe Leu Ser Lys His Ser Asp
3570 3575 3580
ttt ggt ctt gat ggc ctt att gat tct tat ttt gat aat agt agc 11052
Phe Gly Leu Asp Gly Leu Ile Asp Ser Tyr Phe Asp Asn Ser Ser
3585 3590 3595
acc ctg cag agt gtt gct tca tca ttt gtt agt atg cca tca tat 11097
Thr Leu Gln Ser Val Ala Ser Ser Phe Val Ser Met Pro Ser Tyr
3600 3605 3610
att gct tat gaa aat gct aga caa gct tat gag gat gct att gct 11142
Ile Ala Tyr Glu Asn Ala Arg Gln Ala Tyr Glu Asp Ala Ile Ala
3615 3620 3625
aat gga tct tct tct caa ctt att aaa caa ttg aag cgt gcc atg 11187
Asn Gly Ser Ser Ser Gln Leu Ile Lys Gln Leu Lys Arg Ala Met
3630 3635 3640
aat atc gca aag tct gaa ttt gat cat gag ata tct gtt cag aag 11232
Asn Ile Ala Lys Ser Glu Phe Asp His Glu Ile Ser Val Gln Lys
3645 3650 3655
aaa att aat aga atg gct gaa caa gct gct act cag atg tat aaa 11277
Lys Ile Asn Arg Met Ala Glu Gln Ala Ala Thr Gln Met Tyr Lys
3660 3665 3670
gaa gca cgc tct gtt aat aga aaa tct aaa gtt att agt gct atg 11322
Glu Ala Arg Ser Val Asn Arg Lys Ser Lys Val Ile Ser Ala Met
3675 3680 3685
cac tct tta ctt ttt gga atg tta aga cgt ttg gat atg tct agt 11367
His Ser Leu Leu Phe Gly Met Leu Arg Arg Leu Asp Met Ser Ser
3690 3695 3700
gtt gaa act gtt ttg aat tta gca cgt gat ggt gtt gtg cca ttg 11412
Val Glu Thr Val Leu Asn Leu Ala Arg Asp Gly Val Val Pro Leu
3705 3710 3715
tca gtt ata cct gca act tca gct tcc aaa cta act att gtt agt 11457
Ser Val Ile Pro Ala Thr Ser Ala Ser Lys Leu Thr Ile Val Ser
3720 3725 3730
cca gat ctt gaa tct tat tct aag att gtt tgt gat ggt tct gtt 11502
Pro Asp Leu Glu Ser Tyr Ser Lys Ile Val Cys Asp Gly Ser Val
3735 3740 3745
cat tat gct gga gtt gtt tgg aca ctt aat gat gtt aaa gac aat 11547
His Tyr Ala Gly Val Val Trp Thr Leu Asn Asp Val Lys Asp Asn
3750 3755 3760
gat ggt aga cct gtt cat gtt aaa gag att aca agg gag aat gtt 11592
Asp Gly Arg Pro Val His Val Lys Glu Ile Thr Arg Glu Asn Val
3765 3770 3775
gaa act ttg aca tgg cct ctt atc ctt aat tgt gaa cgt gtt gtt 11637
Glu Thr Leu Thr Trp Pro Leu Ile Leu Asn Cys Glu Arg Val Val
3780 3785 3790
aaa ctt caa aat aat gaa att atg cct ggt aaa ctt aag caa aaa 11682
Lys Leu Gln Asn Asn Glu Ile Met Pro Gly Lys Leu Lys Gln Lys
3795 3800 3805
cct atg aaa gct gag ggt gat ggt ggt gtt tta ggt gat ggt aat 11727
Pro Met Lys Ala Glu Gly Asp Gly Gly Val Leu Gly Asp Gly Asn
3810 3815 3820
gct ttg tat aat act gag ggt ggt aaa act ttt atg tat gct tat 11772
Ala Leu Tyr Asn Thr Glu Gly Gly Lys Thr Phe Met Tyr Ala Tyr
3825 3830 3835
att tct aat aaa gct gac ctt aaa ttt gtt aag tgg gag tat gag 11817
Ile Ser Asn Lys Ala Asp Leu Lys Phe Val Lys Trp Glu Tyr Glu
3840 3845 3850
ggt ggt tgc aac aca atc gag tta gac tct cct tgt cga ttt atg 11862
Gly Gly Cys Asn Thr Ile Glu Leu Asp Ser Pro Cys Arg Phe Met
3855 3860 3865
gtc gaa aca cct aat ggt cct caa gtg aag tat ttg tat ttt gtt 11907
Val Glu Thr Pro Asn Gly Pro Gln Val Lys Tyr Leu Tyr Phe Val
3870 3875 3880
aaa aat tta aat acc tta cgt aga ggt gcc gtt ctt ggt ttt ata 11952
Lys Asn Leu Asn Thr Leu Arg Arg Gly Ala Val Leu Gly Phe Ile
3885 3890 3895
ggt gcc aca att cgt cta caa gct ggt aaa caa act gaa ttg gct 11997
Gly Ala Thr Ile Arg Leu Gln Ala Gly Lys Gln Thr Glu Leu Ala
3900 3905 3910
gtt aat tct gga ctt tta act gct tgt gct ttt tct gtt gat cca 12042
Val Asn Ser Gly Leu Leu Thr Ala Cys Ala Phe Ser Val Asp Pro
3915 3920 3925
gca acc act tac ttg gaa gct gtt aaa cat ggt gca aaa cct gta 12087
Ala Thr Thr Tyr Leu Glu Ala Val Lys His Gly Ala Lys Pro Val
3930 3935 3940
agt aat tgt att aag atg tta tct aat ggt gct ggt aat ggt caa 12132
Ser Asn Cys Ile Lys Met Leu Ser Asn Gly Ala Gly Asn Gly Gln
3945 3950 3955
gct ata aca act agt gta gat gct aac acc aat caa gat tct tat 12177
Ala Ile Thr Thr Ser Val Asp Ala Asn Thr Asn Gln Asp Ser Tyr
3960 3965 3970
ggt gga gcg tct att tgt ttg tat tgt cgg gcc cac gtt cct cac 12222
Gly Gly Ala Ser Ile Cys Leu Tyr Cys Arg Ala His Val Pro His
3975 3980 3985
cct agt atg gat ggt tac tgt aag ttt aag ggt aaa tgt gtt cag 12267
Pro Ser Met Asp Gly Tyr Cys Lys Phe Lys Gly Lys Cys Val Gln
3990 3995 4000
gtt cct att ggt tgt ttg gat cct att agg ttt tgt tta gaa aat 12312
Val Pro Ile Gly Cys Leu Asp Pro Ile Arg Phe Cys Leu Glu Asn
4005 4010 4015
aat gtg tgt aat gtt tgt ggt tgt tgg ttg gga cac ggg tgt gct 12357
Asn Val Cys Asn Val Cys Gly Cys Trp Leu Gly His Gly Cys Ala
4020 4025 4030
tgt gat cgt aca acc att caa agt gtt gac att tct tat tta aac 12402
Cys Asp Arg Thr Thr Ile Gln Ser Val Asp Ile Ser Tyr Leu Asn
4035 4040 4045
gag caa ggg gtt cta gtg cag ctc gac tag aaccctgtaa tggcacggac 12452
Glu Gln Gly Val Leu Val Gln Leu Asp
4050 4055
atcgataagt gtgttcgtgc ttttgacatt tataataaaa atgtttcatt cttgggtaag 12512
tgtttgaaga tgaactgtgt tcgttttaaa aatgctgatc ttaaggatgg ttattttgtt 12572
ataaagaggt gtactaagtc ggttatggaa cacgagcaat ccatgtataa cctacttaac 12632
ttttctggtg ctttggctga gcatgatttc tttacttgga aagatggcag agtcatttat 12692
ggtaatgtta gtagacataa tcttactaaa tatactatga tggacttggt ttatgctatg 12752
cgtaactttg atgaacaaaa ttgtgatgtt ctaaaagaag tattagtttt aactggttgt 12812
tgtgacaatt cttattttga tagtaagggt tggtatgacc cagttgaaaa tgaagatata 12872
catagagttt atgcatctct tggcaaaatt gtagctagag ctatgcttaa atgcgttgct 12932
ctatgtgatg cgatggttgc taaaggtgtt gttggtgttt taacattaga taaccaagat 12992
cttaatggta acttttatga ttttggtgat tttgttgtta gcttacctaa tatgggtgtt 13052
ccctgttgta catcatatta ttcttatatg atgcctatta tgggtttaac taattgttta 13112
gctagtgagt gttttgtcaa gagtgatatt tttggtagtg attttaaaac ttttgatttg 13172
cttaagtatg atttcactga acataaagaa aatttattca ataagtactt taagcattgg 13232
agttttgatt atcatcctaa ttgtagtgac tgttatgatg atatgtgtgt tatacattgt 13292
gctaatttta atacactatt tgccacaact ataccaggta ctgcttttgg tccactatgt 13352
cgtaaagttt ttatagatgg tgttccactt gttacaactg ctggttatca ttttaagcaa 13412
ttaggtttgg tttggaataa agatgttaac acacactcag ttaggttgac aatcactgaa 13472
cttttgcaat ttgttactga cccttccttg ataatagctt cttctccagc actcgttgat 13532
caacgcacta tttgtttttc tgttgcagca ttgagtactg gtttgacaaa tcaagttgtt 13592
aagccaggtc attttaatga agagttttat aactttcttc gtttaagagg tttctttgat 13652
gaaggttctg aacttacatt aaaacatttc ttcttcgcac agaatggtga tgctgctgtt 13712
aaagattttg acttttaccg ttataataag cctaccattt tagatatttg tcaagctaga 13772
gttacatata agatagtctc tcgttatttt gacatttatg aaggtggctg tattaaggca 13832
tgtgaagttg ttgtaacaaa tcttaataag agtgctggtt ggccattaaa taagtttggt 13892
aaagctagtt tgtattacga atctatatct tatgaagaac aggatgcttt gtttgctttg 13952
acaaagcgta atgtcctccc tactatgaca cagctgaatc ttaagtatgc tattagtggt 14012
aaagaacgtg ctagaactgt tggtggtgtt tctctgttgt ccacaatgac cacaagacaa 14072
taccatcaaa aacatcttaa atccattgtt aatacacgca atgccactgt tgttattggt 14132
actaccaaat tttatggtgg ttggaataat atgttgcgta ctttaattga tggtgttgaa 14192
aaccctatgc tcatgggttg ggattatccc aaatgtgata gagctttgcc taacatgata 14252
cgtatgattt cagccatggt gttgggttct aagcatgtta attgttgtac tgtaacagat 14312
aggttttata ggcttggtaa cgagttggca caagttttaa cagaagttgt ttattctaat 14372
ggtggttttt attttaagcc aggtggtacg acttctggtg acgctagtac agcttatgct 14432
aattctattt ttaacatttt tcaagccgtg agttctaaca ttaacaggtt gcttagtgtc 14492
ccatcagatt catgtaataa tgttaatgtt agggatctac aacgacgtct gtatgataat 14552
tgctataggt taactagtgt tgaagagtca ttcattgatg attattatgg ttatcttagg 14612
aaacattttt caatgatgat tctctctgat gacggtgttg tctgttataa caaggattat 14672
gctgagttag gttatatagc agacattagt gcttttaaag ccactttgta ttaccagaat 14732
aatgtcttta tgagtacttc taaatgttgg gttgaagaag atttaactaa gggaccacat 14792
gagttttgtt cccagcatac tatgcaaata gttgataaag atggtaccta ttatttgcct 14852
tacccagatc ctagtaggat cttgtcagct ggtgtttttg ttgatgatgt tgttaagaca 14912
gatgctgttg ttttgttaka acgttatgtg tctttagcta ttgatgcata ccctctttca 14972
aaacacccta attctgaata tcgtaaggtt ttttacgtat tacttgattg ggttaagcat 15032
cttaacaaaa atttgaatga gggtgttctt gaatcttttt ctgttacact tcttgataat 15092
caagaagata agttttggtg tgaagatttt tatgctagta tgtatgaaaa ttctacaata 15152
ttgcaagctg ctggcttatg tgttgtttgt ggttcacaaa ctgttcttcg ttgtggtgat 15212
tgtctgcgta agcctatgtt gtgcactaaa tgtgcatatg atcatgtatt tggtaccgac 15272
cacaagttta ttttggctat aacaccgtat gtatgtaatg catcaggttg tggtgttagt 15332
gatgttaaaa aattgtatct tggtggtttg aattactatt gtacaaatca taaaccacag 15392
ttgtcttttc cattatgttc tgctggtaat atatttggtt tatataaaaa ttcagcaact 15452
ggttccttag atgttgaagt ttttaatagg cttgcaacgt ctgattggac tgatgttagg 15512
gactataaac ttgctaatga tgttaaagat acacttagac tctttgcggc tgaaactatt 15572
aaagctaaag aagagagtgt taagtcttct tatgcttttg caactcttaa agaggttgtt 15632
ggacctaaag aattgcttct tagttgggaa agtggtaaag ttaaaccacc tttgaatcgt 15692
aattctgttt tcacctgttt tcaaataagt aaggactcaa aattccaaat aggtgagttc 15752
atctttgaaa aggttgaata tggttctgat actgttacgt ataagtctac tgtaaccact 15812
aagttagttc ctggtatgat ttttgtctta acatctcaca atgttcaacc tttacgtgca 15872
ccaactattg caaaccaaga gaagtattct agcatttata aattgcaccc tgcttttaat 15932
gtcagtgatg catatgctaa tttggttcca tattaccaac ttattggtaa acaaaagata 15992
actacaatac agggtcctcc tggtagtggt aagtcacatt gttccattgg acttggattg 16052
tactatccag gtgcgcgtat tgtttttgtt gcttgtgccc atgctgctgt tgattcctta 16112
tgtgcaaaag ctatgactgt ttatagcatt gataagtgta ctaggattat acctgcaaga 16172
gctcgggttg agtgttatag tggctttaaa ccaaataaca ctagtgcaca atacatattt 16232
agcactgtta acgcattacc tgagtgtaat gctgatattg ttgttgtaga tgaagtttca 16292
atgtgtacaa attatgacct ttctgttatt aatcagcgtt tatcatataa acatattgtt 16352
tatgttggtg atccacaaca acttcctgca cctagagtaa tgattactaa aggtgttatg 16412
gagcctgttg attataacgt tgttactcaa cgtatgtgtg ctataggccc tgatgttttt 16472
cttcataaat gttatagatg tcctgctgaa atagttaata cagtttctga acttgtttat 16532
gagaacaagt ttgtccctgt taaacctgct agtaaacagt gttttaaaat cttttttaag 16592
ggtaatgtac aggttgacaa tggctctagt attaacagaa agcagcttga aatagttaag 16652
ctgtttttag ttaaaaatcc aagttggagt aaggctgtgt ttatttctcc ttataatagt 16712
cagaattatg ttgctagtag atttttagga cttcaaattc aaactgttga ttcttctcaa 16772
ggtagtgagt atgattatgt aatctatgca caaacttctg acactgcaca tgcttgcaat 16832
gtaaaccgtt ttaatgttgc tataacacgt gctaagaagg gtatattttg tgtaatgtgt 16892
gataaaactt tgtttgattc acttaagttt tttgagatta aacatgcaga tttacactct 16952
agccaggttt gtggcttgtt taaaaattgt acacgcactc ctcttaattt accaccaact 17012
catgcacaca ctttcttgtc gttgtcagat cagtttaaga ctacaggtga tttagctgtt 17072
caaataggtt caaataatgt ttgtacttat gaacatgtta tatcatttat gggttttagg 17132
tttgatatta gtattcctgg tagtcatagt ttgttttgta cacgtgactt tgctattcgt 17192
aatgtgcgtg gttggttggg tatggatgtt gaaagtgctc atgtttgtgg cgataacata 17252
ggtactaatg ttcctttaca ggttggtttt tcaaatggtg ttaattttgt tgtgcaaact 17312
gaaggttgtg tgtctaccaa ttttggtgat gttattaaac ctgtttgtgc aaaatctcca 17372
ccaggtgaac aatttagaca ccttgttcct tttttacgta aaggacaacc ttggttaatt 17432
gttcgtagac gcattgtgca aatgatatct gattatttgt ccaatttgtc tgacattctt 17492
gtctttgttt tgtgggcagg tagtttggaa ttaactacaa tgcgttactt tgtaaaaata 17552
gggccaatta aatattgtta ttgtggtaat tctgccactt gttataattc agttagtaat 17612
gaatattgtt gttttaaaca tgcattgggt tgtgattatg tttacaatcc gtatgctttt 17672
gatatacaac agtggggtta tgttggttcc ttgagccaga accaccacac gttctgtaac 17732
attcatagaa acgagcatga tgcttctggt gatgctgtta tgacacgttg tttggcagta 17792
catgattgtt ttgtcaaaaa tgttgattgg actgtaacgt acccctttat tgcaaatgag 17852
aaatttatca atggctgtgg gcgtaatgtc cagggacatg ttgttcgcgc agccttgaaa 17912
ttgtataaac ctagtgttat tcatgatatt ggtaatccta aaggtgtacg ttgtgctgtt 17972
actgatgcca aatggtactg ttatgacaag caacctgtta atagtaatgt caagttgttg 18032
gattatgatt atgcaaccca tggtcaactt gatggtcttt gtttattctg gaattgtaat 18092
gttgatatgt atccagaatt ttcaattgtg tgtcgctttg acacacgtac tcgttctgtt 18152
tttaatttag aaggtgttaa tggtggttct ctttatgtta acaaacatgc gtttcataca 18212
ccagcatatg ataaacgtgc ttttgttaaa ttaaaaccta tgcccttttt ttactttgat 18272
gacagtgatt gtgatgttgt gcaagaacaa gttaattatg taccccttcg cgctagtagt 18332
tgtgttaccc gttgtaatat aggtggtgct gtttgttcaa aacatgcaaa tttgtatcaa 18392
aaatatgttg aggcatataa tacatttaca caggctggtt ttaacatttg ggtaccacat 18452
agttttgatg tttataattt gtggcaaatt tttattgaaa ctaatttaca aagtcttgaa 18512
aatatagcat ttaatgttgt aaaaaaaggg tgttttactg gtgttgatgg tgagttacct 18572
gttgcagttg ttaacgacaa agtttttgtt cgctatggcg atgttgacaa cttggttttt 18632
acaaataaaa caacattgcc tactaatgtt gcttttgaat tgtttgcaaa acgaaaaatg 18692
ggtttaacac caccattgtc tattctcaaa aatcttggtg ttgttgctac atataaattt 18752
gttttatggg attatgaagc tgaaagacct tttacctcat atactaagag tgtatgtaaa 18812
tacactgatt ttaatgagga tgtttgtgtt tgttttgaca atagtattca gggttcgtat 18872
gagcgtttta cgcttactac gaacgctgtt ttattttcta ctgttgtcat taaaaattta 18932
acacctataa agttgaattt tggtatgttg aatggtatgc cagtttcttc tattaagagt 18992
gataaaggtg ttgaaaaatt agttaattgg tacacatatg ttcgtaaaaa tggtcaattt 19052
caagatcatt atgatggttt ttacactcaa ggtaggaatt tatcagactt tacaccaaga 19112
agtgatatgg agtatgattt tcttaacatg gatatgggtg tttttattaa taaatatggt 19172
cttgaggatt ttaattttga acatgttgta tatggtgatg tttcaaaaac tacattagga 19232
ggtcttcatt tgttgatatc acagtttagg cttagtaaaa tgggtgtttt gaaagctgat 19292
gattttgtca ctgcttctga cacaactttg aggtgctgta ctgttactta tcttaatgaa 19352
cttagttcaa aagttgtttg tacttatatg gatttgttgt tggacgactt tgttactata 19412
ctaaagagtt tagatcttgg tgtaatatct aaagttcatg aagttattat agataataaa 19472
ccttataggt ggatgttgtg gtgtaaagat aaccacttgt ccacttttta tccacagttg 19532
cagtctgctg aatggaagtg tggttatgct atgccacaaa tttataagct tcaacgtatg 19592
tgtttggaac cttgtaattt atataattat ggtgctggta ttaagttgcc tagtggtata 19652
atgttaaatg ttgttaaata cactcagctt tgtcaatacc taaatagcac tacaatgtgc 19712
gtacctcata atatgcgtgt tttgcactat ggtgctggtt ctgacaaagg tgtggcacct 19772
ggtacaactg ttttaaaacg ttggctacca cctgatgcaa taatcattga taatgatatc 19832
aatgattatg ttagtgatgc agattttagc attacaggtg attgtgctac tgtttacctt 19892
gaagataagt ttgacttact tatttctgat atgtatgatg gtagaattaa attttgtgat 19952
ggtgaaaacg tctctaaaga tggttttttt acttatctta atggtgttat tagagaaaaa 20012
ttagctattg gtggtagtgt tgccattaag attacagaat atagttggaa taagtatctt 20072
tatgaattaa tacaaagatt tgctttttgg actttgttct gcacgtctgt taatacatcc 20132
tcttcagaag cttttcttat tggtattaat tatttaggtg actttattca aggtcctttt 20192
atagctggta acactgttca tgctaattat atattttggc gtaattctac tattatgtct 20252
ttgtcataca attcagtttt agatttaagt aagtttgaat gtaaacataa ggccactgtt 20312
gttgttacac ttaaagatag tgatgtaaat gatatggttt tgagtttgat taagagtggt 20372
aggttgttgt tacgtaatag tggccgtttt ggtggtttta gtaatcattt agtctcaact 20432
aa atg aaa ctt ttc ttg att ttg ctt att ttg ccc ctg gtt tct tgc 20479
Met Lys Leu Phe Leu Ile Leu Leu Ile Leu Pro Leu Val Ser Cys
4060 4065 4070
ttt tct aca tgt aac agt aat gct agt att tct atg tta caa tta 20524
Phe Ser Thr Cys Asn Ser Asn Ala Ser Ile Ser Met Leu Gln Leu
4075 4080 4085
ggt gtt cct gat aac tct tca act att gtc aca ggt ttg ttg cca 20569
Gly Val Pro Asp Asn Ser Ser Thr Ile Val Thr Gly Leu Leu Pro
4090 4095 4100
gtc cat tgg att tgt gct aat cag agt aca tct agt tac cca gcc 20614
Val His Trp Ile Cys Ala Asn Gln Ser Thr Ser Ser Tyr Pro Ala
4105 4110 4115
aac ggc ttt ttc tat att gat gtt ggt aaa cac cgt agt gcc ttt 20659
Asn Gly Phe Phe Tyr Ile Asp Val Gly Lys His Arg Ser Ala Phe
4120 4125 4130
gca ctc cat agt ggt tat tat gat gct aac cag tat tat att tat 20704
Ala Leu His Ser Gly Tyr Tyr Asp Ala Asn Gln Tyr Tyr Ile Tyr
4135 4140 4145
ctc act aat aaa ata cat tta aat gct cct gtc act ctg aag att 20749
Leu Thr Asn Lys Ile His Leu Asn Ala Pro Val Thr Leu Lys Ile
4150 4155 4160
tgt aag ttt gga aac act tct ttt gat ttt tta agt aat gtt tct 20794
Cys Lys Phe Gly Asn Thr Ser Phe Asp Phe Leu Ser Asn Val Ser
4165 4170 4175
act tct cat gat tgt ata gtt aat ttg tca ttc aca gaa cag tta 20839
Thr Ser His Asp Cys Ile Val Asn Leu Ser Phe Thr Glu Gln Leu
4180 4185 4190
ggt gtg cct ttg ggc ata act ata tcg ggt gaa act gta cgt ttg 20884
Gly Val Pro Leu Gly Ile Thr Ile Ser Gly Glu Thr Val Arg Leu
4195 4200 4205
cat tta tat aat gca act cgt act ttt tat gtg ccg gcc gct tat 20929
His Leu Tyr Asn Ala Thr Arg Thr Phe Tyr Val Pro Ala Ala Tyr
4210 4215 4220
aaa ctt act aaa ctt agt gtt aaa tgt tac ttt agt gaa tcc tgt 20974
Lys Leu Thr Lys Leu Ser Val Lys Cys Tyr Phe Ser Glu Ser Cys
4225 4230 4235
gtt ttt agt gtt gtc aat gcc acc att act gtt aat gtc acc aca 21019
Val Phe Ser Val Val Asn Ala Thr Ile Thr Val Asn Val Thr Thr
4240 4245 4250
ctt aat ggc cgt ata gtt aac tac act gtt tgt gat gat tgt aat 21064
Leu Asn Gly Arg Ile Val Asn Tyr Thr Val Cys Asp Asp Cys Asn
4255 4260 4265
ggt tat act gat aac ata ttt tct gtt caa cag gat ggc cgc att 21109
Gly Tyr Thr Asp Asn Ile Phe Ser Val Gln Gln Asp Gly Arg Ile
4270 4275 4280
cct aat ggt ttc cct ttt aat aat tgg ttt ttg tta act aat ggt 21154
Pro Asn Gly Phe Pro Phe Asn Asn Trp Phe Leu Leu Thr Asn Gly
4285 4290 4295
tcc aca tta gtg gac ggg gtc tct aga ctt tat caa cca ctc cgt 21199
Ser Thr Leu Val Asp Gly Val Ser Arg Leu Tyr Gln Pro Leu Arg
4300 4305 4310
tta act tgt tta tgg cct gta cct ggt ctt aaa tct tca act ggt 21244
Leu Thr Cys Leu Trp Pro Val Pro Gly Leu Lys Ser Ser Thr Gly
4315 4320 4325
ttt gtt tat ttt aat gcc act ggt tct gat gtt aat tgt aac ggc 21289
Phe Val Tyr Phe Asn Ala Thr Gly Ser Asp Val Asn Cys Asn Gly
4330 4335 4340
tat caa cat aat tct gtt gct gat gtt atg cgt tac aat ctt aac 21334
Tyr Gln His Asn Ser Val Ala Asp Val Met Arg Tyr Asn Leu Asn
4345 4350 4355
ctc agt gct aat tct gtg gac aat ctt aag agt ggt gtt ata gtt 21379
Leu Ser Ala Asn Ser Val Asp Asn Leu Lys Ser Gly Val Ile Val
4360 4365 4370
ttt aaa act tta cag tac gat gtt ttg ttt tat tgt agt aat tct 21424
Phe Lys Thr Leu Gln Tyr Asp Val Leu Phe Tyr Cys Ser Asn Ser
4375 4380 4385
tct tca ggt gtt ctt gac acc aca ata cct ttt ggc cct tcc tct 21469
Ser Ser Gly Val Leu Asp Thr Thr Ile Pro Phe Gly Pro Ser Ser
4390 4395 4400
caa cct tat tac tgt ttt ata aac agt act atc aac act act cat 21514
Gln Pro Tyr Tyr Cys Phe Ile Asn Ser Thr Ile Asn Thr Thr His
4405 4410 4415
gtt agc act ttt gtg ggt att tta cca ccc act gtg cgt gaa att 21559
Val Ser Thr Phe Val Gly Ile Leu Pro Pro Thr Val Arg Glu Ile
4420 4425 4430
gtt gtt gct aga act ggt cag ttt tat att aat ggt ttt aag tat 21604
Val Val Ala Arg Thr Gly Gln Phe Tyr Ile Asn Gly Phe Lys Tyr
4435 4440 4445
ttc gat ttg ggt ttc ata gaa gct gtc aat ttt aat gtc acg act 21649
Phe Asp Leu Gly Phe Ile Glu Ala Val Asn Phe Asn Val Thr Thr
4450 4455 4460
gct agt gcc aca gat ttt tgg acg gtt gca ttt gct act ttt gtt 21694
Ala Ser Ala Thr Asp Phe Trp Thr Val Ala Phe Ala Thr Phe Val
4465 4470 4475
gat gtt ttg gtt aat gtt agt gca act aac att caa aac tta ctt 21739
Asp Val Leu Val Asn Val Ser Ala Thr Asn Ile Gln Asn Leu Leu
4480 4485 4490
tat tgc gat tct cca ttt gaa aag ttg cag tgt gag cac ttg cag 21784
Tyr Cys Asp Ser Pro Phe Glu Lys Leu Gln Cys Glu His Leu Gln
4495 4500 4505
ttt gga ttg caa gat ggt ttt tat tct gca aat ttt ctt gat gat 21829
Phe Gly Leu Gln Asp Gly Phe Tyr Ser Ala Asn Phe Leu Asp Asp
4510 4515 4520
aat gtt ttg cct gag act tat gtt gca ctc ccc att tat tat caa 21874
Asn Val Leu Pro Glu Thr Tyr Val Ala Leu Pro Ile Tyr Tyr Gln
4525 4530 4535
cat acg gac ata aat ttt act gca act gca tct ttt ggt ggt tct 21919
His Thr Asp Ile Asn Phe Thr Ala Thr Ala Ser Phe Gly Gly Ser
4540 4545 4550
tgt tat gtt tgt aaa cca cgc cag gtt aat ata tct ctt aat ggt 21964
Cys Tyr Val Cys Lys Pro Arg Gln Val Asn Ile Ser Leu Asn Gly
4555 4560 4565
aac act tca gtg tgt gtt aga aca tct cat ttt tca att agg tat 22009
Asn Thr Ser Val Cys Val Arg Thr Ser His Phe Ser Ile Arg Tyr
4570 4575 4580
att tat aac cgc gtt aag agt ggt tca cca ggt gac tct tca tgg 22054
Ile Tyr Asn Arg Val Lys Ser Gly Ser Pro Gly Asp Ser Ser Trp
4585 4590 4595
cat att tat tta aag agt ggc act tgt cca ttt tct ttt tct aag 22099
His Ile Tyr Leu Lys Ser Gly Thr Cys Pro Phe Ser Phe Ser Lys
4600 4605 4610
tta aat aat ttt caa aag ttt aag act att tgt ttc tca acc gtc 22144
Leu Asn Asn Phe Gln Lys Phe Lys Thr Ile Cys Phe Ser Thr Val
4615 4620 4625
gaa gtg cct ggt agt tgt aat ttt cca ctt gaa gcc acc tgg cat 22189
Glu Val Pro Gly Ser Cys Asn Phe Pro Leu Glu Ala Thr Trp His
4630 4635 4640
tac act tct tat act att gtt ggt gct ttg tat gtt act tgg tct 22234
Tyr Thr Ser Tyr Thr Ile Val Gly Ala Leu Tyr Val Thr Trp Ser
4645 4650 4655
gaa ggt aat tcc att act ggt gta cct tat cct gtc tct ggt att 22279
Glu Gly Asn Ser Ile Thr Gly Val Pro Tyr Pro Val Ser Gly Ile
4660 4665 4670
cgt gag ttt agt aat tta gtt tta aat aat tgt acc aaa tat aat 22324
Arg Glu Phe Ser Asn Leu Val Leu Asn Asn Cys Thr Lys Tyr Asn
4675 4680 4685
att tat gat tat gtt ggt act gga att ata cgt tct tca aac cag 22369
Ile Tyr Asp Tyr Val Gly Thr Gly Ile Ile Arg Ser Ser Asn Gln
4690 4695 4700
tca ctt gct ggt ggt att aca tat gtt tct aac tct ggt aat tta 22414
Ser Leu Ala Gly Gly Ile Thr Tyr Val Ser Asn Ser Gly Asn Leu
4705 4710 4715
ctt ggt ttt aaa aat gtt tcc act ggt aac att ttt att gtg aca 22459
Leu Gly Phe Lys Asn Val Ser Thr Gly Asn Ile Phe Ile Val Thr
4720 4725 4730
cca tgt aac caa cca gat caa gta gct gtt tat caa caa agc att 22504
Pro Cys Asn Gln Pro Asp Gln Val Ala Val Tyr Gln Gln Ser Ile
4735 4740 4745
att ggt gcc atg acc gct gtt aat gag tct aga tat ggc ttg caa 22549
Ile Gly Ala Met Thr Ala Val Asn Glu Ser Arg Tyr Gly Leu Gln
4750 4755 4760
aac tta cta cag tta cct aac ttt tat tat gtt agt aat ggt ggt 22594
Asn Leu Leu Gln Leu Pro Asn Phe Tyr Tyr Val Ser Asn Gly Gly
4765 4770 4775
aac aat tgc act acg gct gtt atg att tat tct aat ttt ggt att 22639
Asn Asn Cys Thr Thr Ala Val Met Ile Tyr Ser Asn Phe Gly Ile
4780 4785 4790
tgt gct gat ggt tct tta att cct gtt cgt ccg cgt aat tct agt 22684
Cys Ala Asp Gly Ser Leu Ile Pro Val Arg Pro Arg Asn Ser Ser
4795 4800 4805
gat aat ggt att tca gcc ata atc act gct aat tta tcc att ccc 22729
Asp Asn Gly Ile Ser Ala Ile Ile Thr Ala Asn Leu Ser Ile Pro
4810 4815 4820
tct aac tgg act act tca gtt caa gtt gag tac ctc caa att act 22774
Ser Asn Trp Thr Thr Ser Val Gln Val Glu Tyr Leu Gln Ile Thr
4825 4830 4835
agt act cca ata gtt gtt gat tgt gct act tat gtg tgt aat ggt 22819
Ser Thr Pro Ile Val Val Asp Cys Ala Thr Tyr Val Cys Asn Gly
4840 4845 4850
aac cct cgt tgt aag aat cta ctt aag cag tat act tct gct tgt 22864
Asn Pro Arg Cys Lys Asn Leu Leu Lys Gln Tyr Thr Ser Ala Cys
4855 4860 4865
aaa act att gaa gat gcc tta cga ctt agt gct cat ttg gaa act 22909
Lys Thr Ile Glu Asp Ala Leu Arg Leu Ser Ala His Leu Glu Thr
4870 4875 4880
aat gat gtt agt agt atg cta act ttc gat agc aat gct ttt agt 22954
Asn Asp Val Ser Ser Met Leu Thr Phe Asp Ser Asn Ala Phe Ser
4885 4890 4895
ttg gct aat gtt act agt ttt gga gat tat aac ctt tct agt gtt 22999
Leu Ala Asn Val Thr Ser Phe Gly Asp Tyr Asn Leu Ser Ser Val
4900 4905 4910
tta cct cag aga aac att cat tca agc cgt ata gca gga cgt agt 23044
Leu Pro Gln Arg Asn Ile His Ser Ser Arg Ile Ala Gly Arg Ser
4915 4920 4925
gct ttg gaa gat ttg ttg ttt agc aaa gtt gtt aca tct ggt ttg 23089
Ala Leu Glu Asp Leu Leu Phe Ser Lys Val Val Thr Ser Gly Leu
4930 4935 4940
ggt act gtt gat gtt gac tat aag tct tgt act aaa ggt ctt tct 23134
Gly Thr Val Asp Val Asp Tyr Lys Ser Cys Thr Lys Gly Leu Ser
4945 4950 4955
att gct gac ctt gct tgt gct cag tac tac aat ggc ata atg gtt 23179
Ile Ala Asp Leu Ala Cys Ala Gln Tyr Tyr Asn Gly Ile Met Val
4960 4965 4970
ttg cca ggt gtt gct gat gct gaa cgt atg gcc atg tac aca ggt 23224
Leu Pro Gly Val Ala Asp Ala Glu Arg Met Ala Met Tyr Thr Gly
4975 4980 4985
tct ctt ata ggt ggc atg gtg ctc gga ggt ctt aca tca gca gcc 23269
Ser Leu Ile Gly Gly Met Val Leu Gly Gly Leu Thr Ser Ala Ala
4990 4995 5000
gcc ata cct ttt tct ttg gca ctg caa gca cga ctt aac tat gtt 23314
Ala Ile Pro Phe Ser Leu Ala Leu Gln Ala Arg Leu Asn Tyr Val
5005 5010 5015
gct tta caa act gat gtg ctt caa gaa aat cag aaa att ttg gct 23359
Ala Leu Gln Thr Asp Val Leu Gln Glu Asn Gln Lys Ile Leu Ala
5020 5025 5030
gca tca ttt aat aag gct att aat aat att gtt gct tct ttt agt 23404
Ala Ser Phe Asn Lys Ala Ile Asn Asn Ile Val Ala Ser Phe Ser
5035 5040 5045
agc gtt aat gat gct att aca cat act gca gag gct ata cat act 23449
Ser Val Asn Asp Ala Ile Thr His Thr Ala Glu Ala Ile His Thr
5050 5055 5060
gtt act att gca ctt aat aag att cag gat gtt gtt aat caa cag 23494
Val Thr Ile Ala Leu Asn Lys Ile Gln Asp Val Val Asn Gln Gln
5065 5070 5075
ggt agt gct ctt aac cat ctc act tca caa ttg aga cat aat ttt 23539
Gly Ser Ala Leu Asn His Leu Thr Ser Gln Leu Arg His Asn Phe
5080 5085 5090
cag gcc att tct aat tca att cat gct att tat gac cgg ctt gat 23584
Gln Ala Ile Ser Asn Ser Ile His Ala Ile Tyr Asp Arg Leu Asp
5095 5100 5105
tca att caa gcc gat caa caa gtt gac aga tta att act gga cgg 23629
Ser Ile Gln Ala Asp Gln Gln Val Asp Arg Leu Ile Thr Gly Arg
5110 5115 5120
ctt gca gct ttg aat gca ttt gtt tcc caa gtt ttg aat aaa tat 23674
Leu Ala Ala Leu Asn Ala Phe Val Ser Gln Val Leu Asn Lys Tyr
5125 5130 5135
act gaa gtt cgt ggt tcc aga cgc tta gca cag cag aag att aat 23719
Thr Glu Val Arg Gly Ser Arg Arg Leu Ala Gln Gln Lys Ile Asn
5140 5145 5150
gaa tgt gtc aag tca caa tct aat aga tat ggt ttt tgt ggc aat 23764
Glu Cys Val Lys Ser Gln Ser Asn Arg Tyr Gly Phe Cys Gly Asn
5155 5160 5165
ggc act cac atc ttt tca atc gtc aac tca gct cca gat ggt ttg 23809
Gly Thr His Ile Phe Ser Ile Val Asn Ser Ala Pro Asp Gly Leu
5170 5175 5180
ctt ttt ctt cat act gtt ttg ctg cca act gat tac aag aat gta 23854
Leu Phe Leu His Thr Val Leu Leu Pro Thr Asp Tyr Lys Asn Val
5185 5190 5195
aag gcg tgg tct ggt atc tgt gtt gat ggc att tat ggc tat gtt 23899
Lys Ala Trp Ser Gly Ile Cys Val Asp Gly Ile Tyr Gly Tyr Val
5200 5205 5210
ctg cgt caa cct aac ttg gtt ctt tat tct gat aat ggt gtc ttt 23944
Leu Arg Gln Pro Asn Leu Val Leu Tyr Ser Asp Asn Gly Val Phe
5215 5220 5225
cgt gta act tcc agg gtc atg ttt caa cct cgt tta cct gtt ttg 23989
Arg Val Thr Ser Arg Val Met Phe Gln Pro Arg Leu Pro Val Leu
5230 5235 5240
tct gat ttt gtg caa ata tat aat tgt aat gtt act ttt gtt aac 24034
Ser Asp Phe Val Gln Ile Tyr Asn Cys Asn Val Thr Phe Val Asn
5245 5250 5255
ata tct cgt gtc gag tta cat act gtc ata cct gac tac gtt gat 24079
Ile Ser Arg Val Glu Leu His Thr Val Ile Pro Asp Tyr Val Asp
5260 5265 5270
gtt aat aaa aca tta caa gag ttt gca caa aac tta cca aag tat 24124
Val Asn Lys Thr Leu Gln Glu Phe Ala Gln Asn Leu Pro Lys Tyr
5275 5280 5285
gtt aag cct aat ttt gac ttg act cct ttt aat tta aca tat ctt 24169
Val Lys Pro Asn Phe Asp Leu Thr Pro Phe Asn Leu Thr Tyr Leu
5290 5295 5300
aat ttg agt tct gag ttg aag caa ctc gaa gct aaa act gct agt 24214
Asn Leu Ser Ser Glu Leu Lys Gln Leu Glu Ala Lys Thr Ala Ser
5305 5310 5315
ctt ttc caa act act gtt gaa tta caa ggt ctt att gat cag att 24259
Leu Phe Gln Thr Thr Val Glu Leu Gln Gly Leu Ile Asp Gln Ile
5320 5325 5330
aac agt aca tat gtt gat ttg aag ttg ctt aat agg ttt gaa aat 24304
Asn Ser Thr Tyr Val Asp Leu Lys Leu Leu Asn Arg Phe Glu Asn
5335 5340 5345
tat atc aaa tgg cct tgg tgg gtt tgg ctc att att tct gtt gtt 24349
Tyr Ile Lys Trp Pro Trp Trp Val Trp Leu Ile Ile Ser Val Val
5350 5355 5360
ttt gtt gta ttg ttg agt ctt ctt gtg ttt tgt tgt ctt tct aca 24394
Phe Val Val Leu Leu Ser Leu Leu Val Phe Cys Cys Leu Ser Thr
5365 5370 5375
ggt tgt tgt ggt tgt tgc aat tgt tta act tca tca atg cga ggc 24439
Gly Cys Cys Gly Cys Cys Asn Cys Leu Thr Ser Ser Met Arg Gly
5380 5385 5390
tgt tgt gat tgt ggt tca act aaa ctt cct tat tat gaa ttt gaa 24484
Cys Cys Asp Cys Gly Ser Thr Lys Leu Pro Tyr Tyr Glu Phe Glu
5395 5400 5405
aag gtc cac gtt caa taa tgcctttcgg tggcctattt caacttactc 24532
Lys Val His Val Gln
5410
ttgaaagtac tattaataag agtgtggcta atctcaaatt accacctcat gatgttactg 24592
tcttgcgtga caatcttaaa cctgttacta cacttagtac tatcactgct tatttgttag 24652
ttagtttgtt tgtcacttat tttgctttat tcaaacctct tactgctaga ggtcgtgttg 24712
cttgttttgt tttaaaacta ttgacactat ctgtctatgt gcctttattg gttctttttg 24772
gtatgtatct tgacagtttt ataatttttt ttctacgctg ttgtttcgat tcatacatgt 24832
tggctattat gcctatctct aataaaaatt tttcatttgt tttgttcaat gttactaaac 24892
tatgcttcgt ttcaggcaag tgttggtatc ttgaacaatc attttatgaa aatcgttttg 24952
ctgctattta tggtggtgac cactatgtcg ttttaggtgg tgaaactatt acttttgttt 25012
cttttgatga cctttatgtt gctattagag gttcttgtga aaagaaccta caacttatgc 25072
gtaaggttga cttgtataat ggtgctgtca tttacatttt tgccgaagag cctgttgttg 25132
gtatagttta ctcctctcaa ctatacgaag atg ttc ctt cga tta att gat gac 25186
Met Phe Leu Arg Leu Ile Asp Asp
5415
aat ggc att gtc ctc aat tct att tta tgg ctc ctt gtt atg ata 25231
Asn Gly Ile Val Leu Asn Ser Ile Leu Trp Leu Leu Val Met Ile
5420 5425 5430
ttt ttc ttt gtg ttg gca atg acc ttt att aaa ctg att caa ttg 25276
Phe Phe Phe Val Leu Ala Met Thr Phe Ile Lys Leu Ile Gln Leu
5435 5440 5445
tgt ttt act tgt cat tat ttt ttt agt agg aca tta tat caa cca 25321
Cys Phe Thr Cys His Tyr Phe Phe Ser Arg Thr Leu Tyr Gln Pro
5450 5455 5460
gtt tat aaa att ttt ctt gct tac caa gat tat atg caa ata gca 25366
Val Tyr Lys Ile Phe Leu Ala Tyr Gln Asp Tyr Met Gln Ile Ala
5465 5470 5475
cct gtt cca gct gaa gta cta aat gtc taa actaaacg atg tct aat 25413
Pro Val Pro Ala Glu Val Leu Asn Val Met Ser Asn
5480 5485 5490
agt agt gtg cct ctt tca gag gtt tat gtc cat tta cgt aac tgg 25458
Ser Ser Val Pro Leu Ser Glu Val Tyr Val His Leu Arg Asn Trp
5495 5500 5505
aac ttt agt tgg aat tta att cta aca gtt ttt ata gtt gtg ttg 25503
Asn Phe Ser Trp Asn Leu Ile Leu Thr Val Phe Ile Val Val Leu
5510 5515 5520
cag tat ggg cat tat aag tat agc aga ctt ctt tat ggt tta aag 25548
Gln Tyr Gly His Tyr Lys Tyr Ser Arg Leu Leu Tyr Gly Leu Lys
5525 5530 5535
atg tct gtt tta tgg tgt tta tgg cca ctt gtt cta gct ttg tct 25593
Met Ser Val Leu Trp Cys Leu Trp Pro Leu Val Leu Ala Leu Ser
5540 5545 5550
att ttt gac tgt ttt gtc aat ttt aat gtg gac tgg gtc ttt ttt 25638
Ile Phe Asp Cys Phe Val Asn Phe Asn Val Asp Trp Val Phe Phe
5555 5560 5565
ggt ttt agt att ctt atg tct att att aca ctt tgt tta tgg gtt 25683
Gly Phe Ser Ile Leu Met Ser Ile Ile Thr Leu Cys Leu Trp Val
5570 5575 5580
atg tat ttt gtt aat agt ttc aga ctt tgg cgc cgt gtt aaa act 25728
Met Tyr Phe Val Asn Ser Phe Arg Leu Trp Arg Arg Val Lys Thr
5585 5590 5595
ttt tgg gct ttt aat cct gaa act aat gca atc atc tct ctc cag 25773
Phe Trp Ala Phe Asn Pro Glu Thr Asn Ala Ile Ile Ser Leu Gln
5600 5605 5610
gtt tat gga cat aat tat tac tta ccg gtg atg gct gca cct aca 25818
Val Tyr Gly His Asn Tyr Tyr Leu Pro Val Met Ala Ala Pro Thr
5615 5620 5625
ggt gtt aca tta aca ctt ctt agt ggt gta ctt ctt gtt gat ggc 25863
Gly Val Thr Leu Thr Leu Leu Ser Gly Val Leu Leu Val Asp Gly
5630 5635 5640
cat aag att gct act cgt gtt caa gtg ggt cag ttg cct aaa tat 25908
His Lys Ile Ala Thr Arg Val Gln Val Gly Gln Leu Pro Lys Tyr
5645 5650 5655
gta ata gtt gct aca cct agt acc aca att gtt tgt gac cgt gtt 25953
Val Ile Val Ala Thr Pro Ser Thr Thr Ile Val Cys Asp Arg Val
5660 5665 5670
ggt cgc tct gtt aat gaa aca agc cag act ggt tgg gca ttc tac 25998
Gly Arg Ser Val Asn Glu Thr Ser Gln Thr Gly Trp Ala Phe Tyr
5675 5680 5685
gtc cgt gct aaa cat ggt gat ttt tct ggt gtt gcc tct cag gag 26043
Val Arg Ala Lys His Gly Asp Phe Ser Gly Val Ala Ser Gln Glu
5690 5695 5700
ggt gtt ttg tca gaa aga gag aag ttg ctt cat tta atc taa 26085
Gly Val Leu Ser Glu Arg Glu Lys Leu Leu His Leu Ile
5705 5710
actaaacaaa atg gct agt gta aat tgg gcc gat gac aga gct gct agg 26134
Met Ala Ser Val Asn Trp Ala Asp Asp Arg Ala Ala Arg
5715 5720 5725
aag aaa ttt cct cct cct tca ttt tac atg cct ctt ttg gtt agt 26179
Lys Lys Phe Pro Pro Pro Ser Phe Tyr Met Pro Leu Leu Val Ser
5730 5735 5740
tct gat aag gca cca tat agg gtc att ccc agg aat ctt gtc cct 26224
Ser Asp Lys Ala Pro Tyr Arg Val Ile Pro Arg Asn Leu Val Pro
5745 5750 5755
att ggt aag ggt aat aaa gat gag cag att ggt tat tgg aat gtt 26269
Ile Gly Lys Gly Asn Lys Asp Glu Gln Ile Gly Tyr Trp Asn Val
5760 5765 5770
caa gag cgt tgg cgt atg cgc agg ggg caa cgt gtt gat ttg cct 26314
Gln Glu Arg Trp Arg Met Arg Arg Gly Gln Arg Val Asp Leu Pro
5775 5780 5785
cct aaa gtt cat ttt tat tac cta ggt act gga cct cat aag gac 26359
Pro Lys Val His Phe Tyr Tyr Leu Gly Thr Gly Pro His Lys Asp
5790 5795 5800
ctt aaa ttc aga caa cgt tct gat ggt gtt gtt tgg gtt gct aag 26404
Leu Lys Phe Arg Gln Arg Ser Asp Gly Val Val Trp Val Ala Lys
5805 5810 5815
gaa ggt gct aaa act gtt aat acc agt ctt ggt aat cgc aaa cgt 26449
Glu Gly Ala Lys Thr Val Asn Thr Ser Leu Gly Asn Arg Lys Arg
5820 5825 5830
aat cag aaa cct ttg gaa cca aag ttc tct att gct ttg cct cca 26494
Asn Gln Lys Pro Leu Glu Pro Lys Phe Ser Ile Ala Leu Pro Pro
5835 5840 5845
gag ctc tct gtt gtt gag ttt gag gat cgc tct aat aac tca tct 26539
Glu Leu Ser Val Val Glu Phe Glu Asp Arg Ser Asn Asn Ser Ser
5850 5855 5860
cgt gct agc agt cgt tct tca act cgt aac aac tca cga gac tct 26584
Arg Ala Ser Ser Arg Ser Ser Thr Arg Asn Asn Ser Arg Asp Ser
5865 5870 5875
tct cgt agt act tca aga caa cag tct cgc act cgt tct gat tct 26629
Ser Arg Ser Thr Ser Arg Gln Gln Ser Arg Thr Arg Ser Asp Ser
5880 5885 5890
aac cag tct tct tca gat ctt gtt gct gct gtt act ttg gct tta 26674
Asn Gln Ser Ser Ser Asp Leu Val Ala Ala Val Thr Leu Ala Leu
5895 5900 5905
aag aac tta ggt ttt gat aac cag tcg aag tca cct agt tct tct 26719
Lys Asn Leu Gly Phe Asp Asn Gln Ser Lys Ser Pro Ser Ser Ser
5910 5915 5920
ggt act tcc act cct aag aaa cct aat aag cct ctt tct caa ccc 26764
Gly Thr Ser Thr Pro Lys Lys Pro Asn Lys Pro Leu Ser Gln Pro
5925 5930 5935
agg gct gat aag cct tct cag ttg aag aaa cct cgt tgg aag cgt 26809
Arg Ala Asp Lys Pro Ser Gln Leu Lys Lys Pro Arg Trp Lys Arg
5940 5945 5950
gtt cct acc aga gag gaa aat gtt att cag tgc ttt ggt cct cgt 26854
Val Pro Thr Arg Glu Glu Asn Val Ile Gln Cys Phe Gly Pro Arg
5955 5960 5965
gat ttt aat cac aat atg ggg gat tca gat ctt gtt cag aat ggt 26899
Asp Phe Asn His Asn Met Gly Asp Ser Asp Leu Val Gln Asn Gly
5970 5975 5980
gtt gat gcc aag ggt ttt cca cag ctt gct gaa ttg att cct aat 26944
Val Asp Ala Lys Gly Phe Pro Gln Leu Ala Glu Leu Ile Pro Asn
5985 5990 5995
cag gct gcg tta ttc ttt gat agt gag gtt agc act gat gaa gtg 26989
Gln Ala Ala Leu Phe Phe Asp Ser Glu Val Ser Thr Asp Glu Val
6000 6005 6010
ggt gat aat gtt cag att acc tac acc tac aaa atg ctt gta gct 27034
Gly Asp Asn Val Gln Ile Thr Tyr Thr Tyr Lys Met Leu Val Ala
6015 6020 6025
aag gat aat aag aac ctt cct aag ttc att gag cag att agt gct 27079
Lys Asp Asn Lys Asn Leu Pro Lys Phe Ile Glu Gln Ile Ser Ala
6030 6035 6040
ttt act aaa ccc agt tct atc aaa gaa atg cag tca caa tca tct 27124
Phe Thr Lys Pro Ser Ser Ile Lys Glu Met Gln Ser Gln Ser Ser
6045 6050 6055
cat gtt gct cag aac aca gta ctt aat gct tct att cca gaa tct 27169
His Val Ala Gln Asn Thr Val Leu Asn Ala Ser Ile Pro Glu Ser
6060 6065 6070
aaa cca ttg gct gat gat gat tca gcc att ata gaa att gtc aac 27214
Lys Pro Leu Ala Asp Asp Asp Ser Ala Ile Ile Glu Ile Val Asn
6075 6080 6085
gag gtt ttg cat taa attgttttgt aattccagtt gaatgtttat tattattagt 27269
Glu Val Leu His
6090
tgcaacccca tgcgtttagc gcatgataag ggtttagtct tacacacaat ggtaggccag 27329
tgatagtaaa gtgtaagtaa tttgctatca tattaacatg tctagaggaa agtcagaact 27389
ttttctgttt gtgttgttgg agtacttaaa gatcgcatag gcgcgccaac aatggaagag 27449
ccaacaacat atctaaaaat gttttgtctg gtacttgtta atgatattgt ttttgatatg 27509
gatacacaaa aaaaaaaaaa a 27530
<210> 2
<211> 4055
<212> PRT
<213> EMCR Coronavirus
<400> 2
Met Phe Tyr Asn Gln Val Thr Leu Ala Val Ala Ser Asp Ser Glu Ile
1 5 10 15
Ser Gly Phe Gly Phe Ala Ile Pro Ser Val Ala Val Arg Ala Tyr Ser
20 25 30
Glu Ala Ala Ala Gln Gly Phe Gln Ala Cys Arg Phe Val Ala Phe Gly
35 40 45
Leu Gln Asp Cys Val Thr Gly Ile Asn Asp Asp Asp Tyr Val Ile Ala
50 55 60
Leu Thr Gly Thr Asn Gln Leu Cys Ala Lys Ile Leu Leu Phe Ser Asp
65 70 75 80
Arg Pro Leu Asn Leu Arg Gly Trp Leu Ile Phe Ser Asn Ser Asn Tyr
85 90 95
Val Leu Gln Asp Phe Asp Val Val Phe Gly His Gly Ala Gly Ser Val
100 105 110
Val Phe Val Asp Lys Tyr Met Cys Gly Phe Asp Gly Lys Pro Val Leu
115 120 125
Pro Lys Asn Met Trp Glu Phe Arg Asp Tyr Phe Asn Asp Asn Thr Asp
130 135 140
Ser Ile Val Ile Gly Gly Val Thr Tyr Gln Leu Ala Trp Asp Val Ile
145 150 155 160
Arg Lys Asp Leu Ser Tyr Glu Gln Gln Asn Val Leu Ala Ile Glu Ser
165 170 175
Ile His Tyr Leu Gly Thr Thr Gly His Thr Leu Lys Ser Gly Cys Lys
180 185 190
Leu Ile Asn Ala Lys Pro Pro Lys Tyr Ser Ser Lys Val Val Leu Ser
195 200 205
Gly Glu Trp Asn Ala Val Tyr Lys Ala Phe Gly Ser Pro Phe Ile Thr
210 215 220
Asn Gly Ile Ser Leu Leu Asp Ile Ile Val Lys Pro Val Phe Phe Asn
225 230 235 240
Ala Phe Val Lys Cys Asn Cys Gly Ser Glu Asn Trp Ser Val Gly Ala
245 250 255
Trp Asp Gly Tyr Leu Ser Ser Cys Cys Gly Thr Pro Ala Lys Lys Leu
260 265 270
Cys Val Val Pro Gly Asn Val Val Pro Gly Asp Val Ile Ile Thr Ser
275 280 285
Thr Asp Ala Gly Cys Gly Val Lys Tyr Tyr Ala Gly Leu Val Val Lys
290 295 300
His Ile Thr Asn Ile Thr Gly Val Ser Leu Trp Arg Val Thr Ala Val
305 310 315 320
His Ser Asp Gly Met Phe Val Ala Thr Ser Ser Tyr Asp Ala Leu Leu
325 330 335
His Arg Asn Ser Leu Asp Pro Phe Cys Phe Asp Val Asn Thr Leu Leu
340 345 350
Ser Asn Gln Leu Arg Leu Ala Phe Leu Gly Ala Ser Val Thr Glu Asp
355 360 365
Val Lys Phe Ala Ala Ser Thr Gly Val Ile Asp Ile Ser Ala Gly Met
370 375 380
Phe Gly Leu Tyr Asp Asp Ile Leu Thr Asn Asn Lys Pro Trp Phe Val
385 390 395 400
Arg Lys Ala Ser Gly Leu Phe Asp Ala Ile Trp Asp Ala Phe Val Ala
405 410 415
Ala Ile Lys Leu Val Pro Thr Thr Thr Gly Gly Leu Val Arg Phe Val
420 425 430
Lys Ser Ile Ala Ser Thr Val Leu Thr Val Ser Asn Gly Val Ile Ile
435 440 445
Met Cys Ala Asp Val Pro Asp Ala Phe Gln Pro Val Tyr Arg Thr Phe
450 455 460
Thr Gln Ala Ile Cys Ala Ala Phe Asp Phe Ser Leu Asp Val Phe Lys
465 470 475 480
Ile Gly Asp Val Lys Phe Lys Arg Leu Gly Asp Tyr Val Leu Thr Glu
485 490 495
Asn Ala Leu Val Arg Leu Thr Thr Glu Val Val Arg Gly Val Arg Asp
500 505 510
Ala Arg Ile Lys Lys Ala Met Phe Thr Lys Val Val Val Gly Pro Thr
515 520 525
Thr Glu Val Lys Phe Ser Val Ile Glu Leu Ala Thr Val Asn Leu Arg
530 535 540
Leu Val Asp Cys Ala Pro Val Val Cys Pro Lys Gly Lys Ile Val Val
545 550 555 560
Ile Ala Gly Gln Ala Phe Phe Tyr Ser Gly Gly Phe Tyr Arg Phe Met
565 570 575
Val Asp Ser Thr Thr Val Leu Asn Asp Pro Val Phe Thr Gly Glu Leu
580 585 590
Phe Tyr Thr Ile Lys Phe Ser Gly Phe Lys Leu Asp Gly Phe Asn His
595 600 605
Gln Phe Val Asn Ala Ser Ser Ala Thr Asp Ala Ile Ile Ala Val Glu
610 615 620
Leu Leu Leu Ser Asp Phe Lys Thr Ala Val Phe Val Tyr Thr Cys Val
625 630 635 640
Val Asp Gly Cys Ser Val Ile Val Arg Arg Asp Ala Thr Phe Ala Thr
645 650 655
His Val Cys Phe Lys Asp Cys Tyr Ser Ile Trp Glu Gln Phe Cys Ile
660 665 670
Asp Asn Cys Gly Glu Pro Trp Phe Leu Thr Asp Tyr Asn Ala Ile Leu
675 680 685
Gln Ser Asn Asn Pro Gln Cys Ala Ile Val Gln Ala Ser Glu Ser Lys
690 695 700
Val Leu Leu Glu Arg Phe Leu Pro Lys Cys Pro Glu Ile Leu Leu Ser
705 710 715 720
Ile Asp Asp Gly His Leu Trp Asn Leu Phe Val Glu Lys Phe Asn Phe
725 730 735
Val Thr Asp Trp Leu Lys Thr Leu Lys Leu Thr Leu Thr Ser Asn Gly
740 745 750
Leu Leu Gly Asn Cys Ala Lys Arg Phe Arg Arg Val Leu Val Lys Leu
755 760 765
Leu Asp Val Tyr Asn Gly Phe Leu Glu Thr Val Cys Ser Val Val His
770 775 780
Thr Ala Gly Val Cys Ile Lys Tyr Tyr Ala Val Asn Val Pro Tyr Val
785 790 795 800
Val Ile Ser Gly Phe Val Ser Arg Val Ile Arg Arg Glu Arg Cys Asp
805 810 815
Val Thr Phe Pro Cys Val Ser Cys Val Thr Phe Phe Tyr Glu Phe Leu
820 825 830
Asp Thr Cys Phe Gly Val Ser Lys Pro Asn Ala Ile Asp Val Glu His
835 840 845
Leu Glu Leu Lys Glu Thr Val Phe Val Glu Pro Lys Asp Gly Gly Gln
850 855 860
Phe Phe Val Ser Asp Asp Tyr Leu Trp Tyr Val Val Asp Asp Ile Tyr
865 870 875 880
Tyr Pro Ala Ser Cys Asn Gly Val Leu Pro Val Ala Phe Thr Lys Leu
885 890 895
Ala Gly Gly Lys Ile Ser Phe Ser Asp Asp Val Ile Val His Asp Val
900 905 910
Glu Pro Thr His Lys Val Lys Leu Ile Phe Glu Phe Glu Asp Asp Val
915 920 925
Val Thr Ser Leu Cys Lys Lys Ser Phe Gly Lys Ser Ile Ile Tyr Thr
930 935 940
Gly Asp Trp Glu Gly Leu His Glu Val Leu Thr Ser Ala Met Asn Val
945 950 955 960
Ile Gly Gln His Ile Lys Leu Pro Gln Phe Tyr Ile Tyr Asp Glu Glu
965 970 975
Gly Gly Tyr Asp Val Ser Lys Pro Val Met Ile Ser Gln Trp Pro Ile
980 985 990
Ser Asp Asp Ser Asp Gly Cys Val Val Glu Ala Ser Thr Asp Phe His
995 1000 1005
Gln Leu Glu Ser Val Arg Glu Glu Val Asp Ile Ile Glu Gln Pro
1010 1015 1020
Phe Gly Glu Val Glu His Ala Leu Ser Ile Arg Gln Pro Phe Ser
1025 1030 1035
Phe Ser Phe Arg Asp Glu Leu Gly Val Arg Val Leu Asp Gln Ser
1040 1045 1050
Asp Asn Asn Cys Trp Ile Ser Thr Thr Leu Ile Gln Leu Gln Leu
1055 1060 1065
Thr Lys Leu Leu Asp Asp Ser Ile Glu Met Gln Leu Phe Lys Val
1070 1075 1080
Gly Lys Val Asp Ser Ile Val Gln Lys Cys Tyr Glu Leu Ser His
1085 1090 1095
Leu Ile Ser Gly Ser Leu Gly Asp Ser Gly Lys Leu Leu Ser Glu
1100 1105 1110
Leu Leu Lys Asp Lys Tyr Thr Cys Ser Ile Thr Phe Glu Met Ser
1115 1120 1125
Cys Asp Cys Gly Lys Lys Phe Asp Glu Gln Val Gly Cys Leu Phe
1130 1135 1140
Trp Ile Met Pro Tyr Thr Lys Leu Phe Gln Lys Gly Glu Cys Cys
1145 1150 1155
Ile Cys His Lys Met Gln Thr Tyr Lys Leu Val Ser Met Lys Gly
1160 1165 1170
Thr Gly Val Phe Val Gln Asp Pro Ala Pro Ile Asp Ile Asp Ala
1175 1180 1185
Phe Pro Val Arg Pro Ile Cys Ser Ser Val Tyr Leu Gly Val Lys
1190 1195 1200
Gly Ser Gly His Tyr Gln Thr Asn Leu Tyr Ser Phe Asp Lys Ala
1205 1210 1215
Ile Asp Gly Phe Gly Val Phe Asp Ile Lys Asn Ser Ser Val Asn
1220 1225 1230
Thr Val Cys Phe Val Asp Val Asp Phe His Ser Val Glu Ile Glu
1235 1240 1245
Ala Gly Glu Val Lys Pro Phe Ala Val Tyr Lys Asn Val Lys Phe
1250 1255 1260
Tyr Leu Gly Asp Ile Ser His Leu Val Asn Cys Val Ser Phe Asp
1265 1270 1275
Phe Val Val Asn Ala Ala Asn Glu Asn Leu Met His Gly Gly Gly
1280 1285 1290
Val Ala Arg Ala Ile Asp Ile Leu Thr Glu Gly Gln Leu Gln Ser
1295 1300 1305
Leu Ser Lys Asp Tyr Ile Ser Ser Asn Gly Pro Leu Lys Val Gly
1310 1315 1320
Ala Gly Val Met Leu Glu Cys Glu Lys Phe Asn Val Phe Asn Val
1325 1330 1335
Val Gly Pro Arg Thr Gly Lys His Glu His Ser Leu Leu Val Glu
1340 1345 1350
Ala Tyr Asn Ser Ile Leu Phe Glu Asn Gly Ile Pro Leu Met Pro
1355 1360 1365
Leu Leu Ser Cys Gly Ile Phe Gly Val Arg Ile Glu Asn Ser Leu
1370 1375 1380
Lys Ala Leu Phe Ser Cys Asp Ile Asn Lys Pro Leu Gln Val Phe
1385 1390 1395
Val Tyr Ser Ser Asn Glu Glu Gln Ala Val Leu Lys Phe Leu Asp
1400 1405 1410
Gly Leu Asp Leu Thr Pro Val Ile Asp Asp Val Asp Val Val Lys
1415 1420 1425
Pro Phe Arg Val Glu Gly Asn Phe Ser Phe Phe Asp Cys Gly Val
1430 1435 1440
Asn Ala Leu Asp Gly Asp Ile Tyr Leu Leu Phe Thr Asn Ser Ile
1445 1450 1455
Leu Met Leu Asp Lys Gln Gly Gln Leu Leu Asp Thr Lys Leu Asn
1460 1465 1470
Gly Ile Leu Gln Gln Ala Val Leu Asp Tyr Leu Ala Thr Val Lys
1475 1480 1485
Thr Val Pro Ala Gly Asn Leu Val Lys Leu Val Val Glu Ser Cys
1490 1495 1500
Thr Ile Tyr Met Cys Val Val Pro Ser Ile Asn Asp Leu Ser Phe
1505 1510 1515
Asp Lys Asn Leu Gly Arg Cys Val Arg Lys Leu Asn Arg Leu Lys
1520 1525 1530
Thr Cys Val Ile Ala Asn Val Pro Ala Ile Asp Val Leu Lys Lys
1535 1540 1545
Leu Leu Ser Ser Leu Thr Leu Thr Val Lys Phe Val Val Glu Ser
1550 1555 1560
Asn Val Met Asp Val Asn Asp Cys Phe Lys Asn Asp Asn Val Val
1565 1570 1575
Leu Lys Ile Thr Glu Asp Gly Ile Asn Val Lys Asp Val Val Val
1580 1585 1590
Glu Ser Ser Lys Ser Leu Gly Lys Gln Leu Gly Val Val Ser Asp
1595 1600 1605
Gly Val Asp Ser Phe Glu Gly Val Leu Pro Ile Asn Thr Asp Thr
1610 1615 1620
Val Leu Ser Val Ala Pro Glu Val Asp Trp Val Ala Phe Tyr Gly
1625 1630 1635
Phe Glu Lys Ala Ala Leu Phe Ala Ser Leu Asp Val Lys Pro Tyr
1640 1645 1650
Gly Tyr Pro Asn Asp Phe Val Gly Gly Phe Arg Val Leu Gly Thr
1655 1660 1665
Thr Asp Asn Asn Cys Trp Val Asn Ala Thr Cys Ile Ile Leu Gln
1670 1675 1680
Tyr Leu Lys Pro Thr Phe Lys Ser Lys Gly Leu Asn Val Leu Trp
1685 1690 1695
Asn Lys Phe Val Thr Gly Asp Val Gly Pro Phe Val Ser Phe Ile
1700 1705 1710
Tyr Phe Ile Thr Met Ser Ser Lys Gly Gln Lys Gly Asp Ala Glu
1715 1720 1725
Glu Ala Leu Ser Lys Leu Ser Glu Tyr Leu Ile Ser Asp Ser Ile
1730 1735 1740
Val Thr Leu Glu Gln Tyr Ser Thr Cys Asp Ile Cys Lys Ser Thr
1745 1750 1755
Val Val Glu Val Lys Ser Ala Val Val Cys Ala Ser Val Leu Lys
1760 1765 1770
Asp Gly Cys Asp Val Gly Phe Cys Pro His Arg His Lys Leu Arg
1775 1780 1785
Ser Arg Val Lys Phe Val Asn Gly Arg Val Val Ile Thr Asn Val
1790 1795 1800
Gly Glu Pro Ile Ile Ser Gln Pro Ser Lys Leu Leu Asn Gly Ile
1805 1810 1815
Ala Tyr Thr Thr Phe Ser Gly Ser Phe Asp Asn Gly His Tyr Val
1820 1825 1830
Val Tyr Asp Ala Ala Asn Asn Ala Val Tyr Asp Gly Ala Arg Leu
1835 1840 1845
Phe Ala Ser Asp Leu Ser Thr Leu Ala Val Thr Ala Ile Val Val
1850 1855 1860
Val Gly Gly Cys Val Thr Ser Asn Val Pro Pro Ile Val Ser Glu
1865 1870 1875
Lys Ile Ser Val Met Asp Lys Leu Asp Thr Gly Ala Gln Lys Phe
1880 1885 1890
Phe Gln Phe Gly Asp Phe Val Met Asn Asn Ile Val Leu Phe Leu
1895 1900 1905
Thr Trp Leu Leu Ser Met Phe Ser Leu Leu Arg Thr Ser Ile Met
1910 1915 1920
Lys His Asp Ile Lys Val Ile Ala Lys Ala Pro Lys Arg Thr Gly
1925 1930 1935
Val Ile Leu Thr Arg Ser Phe Lys Tyr Asn Ile Arg Ser Ala Leu
1940 1945 1950
Phe Val Val Lys Gln Lys Trp Cys Val Ile Val Thr Leu Phe Lys
1955 1960 1965
Phe Leu Leu Leu Leu Tyr Ala Ile Tyr Ala Leu Val Phe Met Ile
1970 1975 1980
Val Gln Phe Ser Pro Phe Asn Ser Leu Leu Cys Gly Asp Ile Val
1985 1990 1995
Ser Gly Tyr Glu Lys Ser Thr Phe Asn Lys Asp Ile Tyr Cys Gly
2000 2005 2010
Asn Ser Met Val Cys Lys Met Cys Leu Phe Ser Tyr Gln Glu Phe
2015 2020 2025
Asn Asp Leu Asp His Thr Ser Leu Val Trp Lys His Ile Arg Asp
2030 2035 2040
Pro Ile Leu Ile Ser Leu Gln Pro Phe Val Ile Leu Val Ile Leu
2045 2050 2055
Leu Ile Phe Gly Asn Met Tyr Leu Arg Phe Gly Leu Leu Tyr Phe
2060 2065 2070
Val Ala Gln Phe Ile Ser Thr Phe Gly Ser Phe Leu Gly Phe His
2075 2080 2085
Gln Lys Gln Trp Phe Leu His Phe Val Pro Phe Asp Val Leu Cys
2090 2095 2100
Asn Glu Phe Leu Ala Thr Phe Ile Val Cys Lys Ile Val Leu Phe
2105 2110 2115
Val Arg His Ile Ile Val Gly Cys Asn Asn Ala Asp Cys Val Ala
2120 2125 2130
Cys Ser Lys Ser Ala Arg Leu Lys Arg Val Pro Leu Gln Thr Ile
2135 2140 2145
Ile Asn Gly Met His Lys Ser Phe Tyr Val Asn Ala Asn Gly Gly
2150 2155 2160
Thr Cys Phe Cys Asn Lys His Asn Phe Phe Cys Val Asn Cys Asp
2165 2170 2175
Ser Phe Gly Pro Gly Asn Thr Phe Ile Asn Gly Asp Ile Ala Arg
2180 2185 2190
Glu Leu Gly Asn Val Val Lys Thr Ala Val Gln Pro Thr Ala Pro
2195 2200 2205
Ala Tyr Val Ile Ile Asp Lys Val Asp Phe Val Asn Gly Phe Tyr
2210 2215 2220
Arg Leu Tyr Ser Gly Asp Thr Phe Trp Arg Tyr Asp Phe Asp Ile
2225 2230 2235
Thr Glu Ser Lys Tyr Ser Cys Lys Glu Val Leu Lys Asn Cys Asn
2240 2245 2250
Val Leu Glu Asn Phe Ile Val Tyr Asn Asn Ser Gly Ser Asn Ile
2255 2260 2265
Thr Gln Ile Lys Asn Ala Cys Val Tyr Phe Ser Gln Leu Leu Cys
2270 2275 2280
Glu Pro Ile Lys Leu Val Asn Ser Glu Leu Leu Ser Thr Leu Ser
2285 2290 2295
Val Asp Phe Asn Gly Val Leu His Lys Ala Tyr Val Asp Val Leu
2300 2305 2310
Cys Asn Ser Phe Phe Lys Glu Leu Thr Ala Asn Met Ser Met Ala
2315 2320 2325
Glu Cys Lys Ala Thr Leu Gly Leu Thr Val Ser Asp Asp Asp Phe
2330 2335 2340
Val Ser Ala Val Ala Asn Ala His Arg Tyr Asp Val Leu Leu Ser
2345 2350 2355
Asp Leu Ser Phe Asn Asn Phe Phe Ile Ser Tyr Ala Lys Pro Glu
2360 2365 2370
Asp Lys Leu Ser Val Tyr Asp Ile Ala Cys Cys Met Arg Ala Gly
2375 2380 2385
Ser Lys Val Val Asn His Asn Val Leu Ile Lys Glu Ser Ile Pro
2390 2395 2400
Ile Val Trp Gly Val Lys Asp Phe Asn Thr Leu Ser Gln Glu Gly
2405 2410 2415
Lys Lys Tyr Leu Val Lys Thr Thr Lys Ala Lys Gly Leu Thr Phe
2420 2425 2430
Leu Leu Thr Phe Asn Asp Asn Gln Ala Ile Thr Gln Val Pro Ala
2435 2440 2445
Thr Ser Ile Val Ala Lys Gln Gly Ala Gly Phe Lys Arg Thr Tyr
2450 2455 2460
Asn Phe Leu Trp Tyr Val Cys Leu Phe Val Val Ala Leu Phe Ile
2465 2470 2475
Gly Val Ser Phe Ile Asp Tyr Thr Thr Thr Val Thr Ser Phe His
2480 2485 2490
Gly Tyr Asp Phe Lys Tyr Ile Glu Asn Gly Gln Leu Lys Val Phe
2495 2500 2505
Glu Ala Pro Leu His Cys Val Arg Asn Val Phe Asp Asn Phe Asn
2510 2515 2520
Gln Trp His Glu Ala Lys Phe Gly Val Val Thr Thr Asn Ser Asp
2525 2530 2535
Lys Cys Pro Ile Val Val Gly Val Ser Glu Arg Ile Asn Val Val
2540 2545 2550
Pro Gly Val Pro Thr Asn Val Tyr Leu Val Gly Lys Thr Leu Val
2555 2560 2565
Phe Thr Leu Gln Ala Ala Phe Gly Asn Thr Gly Val Cys Tyr Asp
2570 2575 2580
Phe Asp Gly Val Thr Thr Ser Asp Lys Cys Ile Phe Asn Ser Ala
2585 2590 2595
Cys Thr Arg Leu Glu Gly Leu Gly Gly Asp Asn Val Tyr Cys Tyr
2600 2605 2610
Asn Thr Asp Leu Ile Glu Gly Ser Lys Pro Tyr Ser Ile Leu Gln
2615 2620 2625
Pro Asn Ala Tyr Tyr Lys Tyr Asp Val Lys Asn Tyr Val Arg Phe
2630 2635 2640
Pro Glu Ile Leu Ala Arg Gly Phe Gly Leu Arg Thr Ile Arg Thr
2645 2650 2655
Leu Ala Thr Arg Tyr Cys Arg Val Gly Glu Cys Arg Asp Ser His
2660 2665 2670
Lys Gly Val Cys Phe Gly Phe Asp Lys Trp Tyr Val Asn Asp Gly
2675 2680 2685
Arg Val Asp Asp Gly Tyr Ile Cys Gly Asp Gly Leu Ile Asp Leu
2690 2695 2700
Leu Val Asn Val Leu Ser Ile Phe Ser Ser Ser Phe Ser Val Val
2705 2710 2715
Ala Met Ser Gly His Met Leu Phe Asn Phe Leu Phe Ala Ala Phe
2720 2725 2730
Ile Thr Phe Leu Cys Phe Leu Val Thr Lys Phe Lys Arg Val Phe
2735 2740 2745
Gly Asp Leu Ser Tyr Gly Val Phe Thr Val Val Cys Ala Thr Leu
2750 2755 2760
Ile Asn Asn Ile Ser Tyr Val Val Thr Gln Asn Leu Phe Phe Met
2765 2770 2775
Leu Leu Tyr Ala Ile Leu Tyr Phe Val Phe Thr Arg Thr Val Arg
2780 2785 2790
Tyr Ala Trp Ile Trp His Ile Ala Tyr Ile Val Ala Tyr Phe Leu
2795 2800 2805
Leu Ile Pro Trp Trp Leu Leu Thr Trp Phe Ser Phe Ala Ala Phe
2810 2815 2820
Leu Glu Leu Leu Pro Asn Val Phe Lys Leu Lys Ile Ser Thr Gln
2825 2830 2835
Leu Phe Glu Gly Asp Lys Phe Ile Gly Thr Phe Glu Ser Ala Ala
2840 2845 2850
Ala Gly Thr Phe Val Leu Asp Met Arg Ser Tyr Glu Arg Leu Ile
2855 2860 2865
Asn Thr Ile Ser Pro Glu Lys Leu Lys Asn Tyr Ala Ala Ser Tyr
2870 2875 2880
Asn Lys Tyr Lys Tyr Tyr Ser Gly Ser Ala Ser Glu Ala Asp Tyr
2885 2890 2895
Arg Cys Ala Cys Tyr Ala His Leu Ala Lys Ala Met Leu Asp Tyr
2900 2905 2910
Ala Lys Asp His Asn Asp Met Leu Tyr Ser Pro Pro Thr Ile Ser
2915 2920 2925
Tyr Asn Ser Thr Leu Gln Ser Gly Leu Lys Lys Met Ala Gln Pro
2930 2935 2940
Ser Gly Cys Val Glu Arg Cys Val Val Arg Val Cys Tyr Gly Ser
2945 2950 2955
Thr Val Leu Asn Gly Val Trp Leu Gly Asp Thr Val Thr Cys Pro
2960 2965 2970
Arg His Val Ile Ala Pro Ser Thr Thr Val Leu Ile Asp Tyr Asp
2975 2980 2985
His Ala Tyr Ser Thr Met Arg Leu His Asn Phe Ser Val Ser His
2990 2995 3000
Asn Gly Val Phe Leu Gly Val Val Gly Val Thr Met His Gly Ser
3005 3010 3015
Val Leu Arg Ile Lys Val Ser Gln Ser Asn Val His Thr Pro Lys
3020 3025 3030
His Val Phe Lys Thr Leu Lys Pro Gly Ala Ser Phe Asn Ile Leu
3035 3040 3045
Ala Cys Tyr Glu Gly Ile Ala Ser Gly Val Phe Gly Val Asn Leu
3050 3055 3060
Arg Thr Asn Phe Thr Ile Lys Gly Ser Phe Ile Asn Gly Ala Cys
3065 3070 3075
Gly Ser Pro Gly Tyr Asn Val Arg Asn Asp Gly Thr Val Glu Phe
3080 3085 3090
Cys Tyr Leu His Gln Ile Glu Leu Gly Ser Gly Ala His Val Gly
3095 3100 3105
Ser Asp Phe Thr Gly Ser Val Tyr Gly Asn Phe Asp Asp Gln Pro
3110 3115 3120
Ser Leu Gln Val Glu Ser Ala Asn Leu Met Leu Ser Asp Asn Val
3125 3130 3135
Val Ala Phe Leu Tyr Ala Ala Leu Leu Asn Gly Cys Arg Trp Trp
3140 3145 3150
Leu Arg Ser Thr Arg Val Asn Val Asp Gly Phe Asn Glu Trp Ala
3155 3160 3165
Met Ala Asn Gly Tyr Thr Ile Val Ser Ser Val Glu Cys Tyr Ser
3170 3175 3180
Ile Leu Ala Ala Lys Thr Gly Val Ser Val Glu Gln Leu Leu Ala
3185 3190 3195
Ser Ile Gln His Leu His Glu Gly Phe Gly Gly Lys Asn Ile Leu
3200 3205 3210
Gly Tyr Ser Ser Leu Cys Asp Glu Phe Thr Leu Ala Glu Val Val
3215 3220 3225
Lys Gln Met Tyr Gly Val Asn Leu Gln Ser Gly Lys Val Ile Phe
3230 3235 3240
Gly Leu Lys Thr Met Phe Leu Phe Ser Val Phe Phe Thr Met Phe
3245 3250 3255
Trp Ala Glu Leu Phe Ile Tyr Thr Asn Thr Ile Trp Ile Asn Pro
3260 3265 3270
Val Ile Leu Thr Pro Ile Phe Cys Leu Leu Leu Phe Leu Ser Leu
3275 3280 3285
Val Leu Thr Met Phe Leu Lys His Lys Phe Leu Phe Leu Gln Val
3290 3295 3300
Phe Leu Leu Pro Thr Val Ile Ala Thr Ala Leu Tyr Asn Cys Val
3305 3310 3315
Leu Asp Tyr Tyr Ile Val Lys Phe Leu Ala Asp His Phe Asn Tyr
3320 3325 3330
Asn Val Ser Val Leu Gln Met Asp Val Gln Gly Leu Val Asn Val
3335 3340 3345
Leu Val Cys Leu Phe Val Val Phe Leu His Thr Trp Arg Phe Ser
3350 3355 3360
Lys Glu Arg Phe Thr His Trp Phe Thr Tyr Val Cys Ser Leu Ile
3365 3370 3375
Ala Val Ala Tyr Thr Tyr Phe Tyr Ser Gly Asp Phe Leu Ser Leu
3380 3385 3390
Leu Val Met Phe Leu Cys Ala Ile Ser Ser Asp Trp Tyr Ile Gly
3395 3400 3405
Ala Ile Val Phe Arg Leu Ser Arg Leu Ile Ile Phe Phe Ser Pro
3410 3415 3420
Glu Ser Val Phe Ser Val Phe Gly Asp Val Lys Leu Thr Leu Val
3425 3430 3435
Val Tyr Leu Ile Cys Gly Tyr Leu Val Cys Thr Tyr Trp Gly Ile
3440 3445 3450
Leu Tyr Trp Phe Asn Arg Phe Phe Lys Cys Thr Met Gly Val Tyr
3455 3460 3465
Asp Phe Lys Val Ser Ala Ala Glu Phe Lys Tyr Met Val Ala Asn
3470 3475 3480
Gly Leu His Ala Pro Tyr Gly Pro Phe Asp Ala Leu Trp Leu Ser
3485 3490 3495
Phe Lys Leu Leu Gly Ile Gly Gly Asp Arg Cys Ile Lys Ile Ser
3500 3505 3510
Thr Val Gln Ser Lys Leu Thr Asp Leu Lys Cys Thr Asn Val Val
3515 3520 3525
Leu Leu Gly Cys Leu Ser Ser Met Asn Ile Ala Ala Asn Ser Ser
3530 3535 3540
Glu Trp Ala Tyr Cys Val Asp Leu His Asn Lys Ile Asn Leu Cys
3545 3550 3555
Asp Asp Pro Glu Lys Ala Gln Gly Met Leu Leu Ala Leu Leu Ala
3560 3565 3570
Phe Phe Leu Ser Lys His Ser Asp Phe Gly Leu Asp Gly Leu Ile
3575 3580 3585
Asp Ser Tyr Phe Asp Asn Ser Ser Thr Leu Gln Ser Val Ala Ser
3590 3595 3600
Ser Phe Val Ser Met Pro Ser Tyr Ile Ala Tyr Glu Asn Ala Arg
3605 3610 3615
Gln Ala Tyr Glu Asp Ala Ile Ala Asn Gly Ser Ser Ser Gln Leu
3620 3625 3630
Ile Lys Gln Leu Lys Arg Ala Met Asn Ile Ala Lys Ser Glu Phe
3635 3640 3645
Asp His Glu Ile Ser Val Gln Lys Lys Ile Asn Arg Met Ala Glu
3650 3655 3660
Gln Ala Ala Thr Gln Met Tyr Lys Glu Ala Arg Ser Val Asn Arg
3665 3670 3675
Lys Ser Lys Val Ile Ser Ala Met His Ser Leu Leu Phe Gly Met
3680 3685 3690
Leu Arg Arg Leu Asp Met Ser Ser Val Glu Thr Val Leu Asn Leu
3695 3700 3705
Ala Arg Asp Gly Val Val Pro Leu Ser Val Ile Pro Ala Thr Ser
3710 3715 3720
Ala Ser Lys Leu Thr Ile Val Ser Pro Asp Leu Glu Ser Tyr Ser
3725 3730 3735
Lys Ile Val Cys Asp Gly Ser Val His Tyr Ala Gly Val Val Trp
3740 3745 3750
Thr Leu Asn Asp Val Lys Asp Asn Asp Gly Arg Pro Val His Val
3755 3760 3765
Lys Glu Ile Thr Arg Glu Asn Val Glu Thr Leu Thr Trp Pro Leu
3770 3775 3780
Ile Leu Asn Cys Glu Arg Val Val Lys Leu Gln Asn Asn Glu Ile
3785 3790 3795
Met Pro Gly Lys Leu Lys Gln Lys Pro Met Lys Ala Glu Gly Asp
3800 3805 3810
Gly Gly Val Leu Gly Asp Gly Asn Ala Leu Tyr Asn Thr Glu Gly
3815 3820 3825
Gly Lys Thr Phe Met Tyr Ala Tyr Ile Ser Asn Lys Ala Asp Leu
3830 3835 3840
Lys Phe Val Lys Trp Glu Tyr Glu Gly Gly Cys Asn Thr Ile Glu
3845 3850 3855
Leu Asp Ser Pro Cys Arg Phe Met Val Glu Thr Pro Asn Gly Pro
3860 3865 3870
Gln Val Lys Tyr Leu Tyr Phe Val Lys Asn Leu Asn Thr Leu Arg
3875 3880 3885
Arg Gly Ala Val Leu Gly Phe Ile Gly Ala Thr Ile Arg Leu Gln
3890 3895 3900
Ala Gly Lys Gln Thr Glu Leu Ala Val Asn Ser Gly Leu Leu Thr
3905 3910 3915
Ala Cys Ala Phe Ser Val Asp Pro Ala Thr Thr Tyr Leu Glu Ala
3920 3925 3930
Val Lys His Gly Ala Lys Pro Val Ser Asn Cys Ile Lys Met Leu
3935 3940 3945
Ser Asn Gly Ala Gly Asn Gly Gln Ala Ile Thr Thr Ser Val Asp
3950 3955 3960
Ala Asn Thr Asn Gln Asp Ser Tyr Gly Gly Ala Ser Ile Cys Leu
3965 3970 3975
Tyr Cys Arg Ala His Val Pro His Pro Ser Met Asp Gly Tyr Cys
3980 3985 3990
Lys Phe Lys Gly Lys Cys Val Gln Val Pro Ile Gly Cys Leu Asp
3995 4000 4005
Pro Ile Arg Phe Cys Leu Glu Asn Asn Val Cys Asn Val Cys Gly
4010 4015 4020
Cys Trp Leu Gly His Gly Cys Ala Cys Asp Arg Thr Thr Ile Gln
4025 4030 4035
Ser Val Asp Ile Ser Tyr Leu Asn Glu Gln Gly Val Leu Val Gln
4040 4045 4050
Leu Asp
4055
<210> 3
<211> 1355
<212> PRT
<213> EMCR Coronavirus
<400> 3
Met Lys Leu Phe Leu Ile Leu Leu Ile Leu Pro Leu Val Ser Cys Phe
1 5 10 15
Ser Thr Cys Asn Ser Asn Ala Ser Ile Ser Met Leu Gln Leu Gly Val
20 25 30
Pro Asp Asn Ser Ser Thr Ile Val Thr Gly Leu Leu Pro Val His Trp
35 40 45
Ile Cys Ala Asn Gln Ser Thr Ser Ser Tyr Pro Ala Asn Gly Phe Phe
50 55 60
Tyr Ile Asp Val Gly Lys His Arg Ser Ala Phe Ala Leu His Ser Gly
65 70 75 80
Tyr Tyr Asp Ala Asn Gln Tyr Tyr Ile Tyr Leu Thr Asn Lys Ile His
85 90 95
Leu Asn Ala Pro Val Thr Leu Lys Ile Cys Lys Phe Gly Asn Thr Ser
100 105 110
Phe Asp Phe Leu Ser Asn Val Ser Thr Ser His Asp Cys Ile Val Asn
115 120 125
Leu Ser Phe Thr Glu Gln Leu Gly Val Pro Leu Gly Ile Thr Ile Ser
130 135 140
Gly Glu Thr Val Arg Leu His Leu Tyr Asn Ala Thr Arg Thr Phe Tyr
145 150 155 160
Val Pro Ala Ala Tyr Lys Leu Thr Lys Leu Ser Val Lys Cys Tyr Phe
165 170 175
Ser Glu Ser Cys Val Phe Ser Val Val Asn Ala Thr Ile Thr Val Asn
180 185 190
Val Thr Thr Leu Asn Gly Arg Ile Val Asn Tyr Thr Val Cys Asp Asp
195 200 205
Cys Asn Gly Tyr Thr Asp Asn Ile Phe Ser Val Gln Gln Asp Gly Arg
210 215 220
Ile Pro Asn Gly Phe Pro Phe Asn Asn Trp Phe Leu Leu Thr Asn Gly
225 230 235 240
Ser Thr Leu Val Asp Gly Val Ser Arg Leu Tyr Gln Pro Leu Arg Leu
245 250 255
Thr Cys Leu Trp Pro Val Pro Gly Leu Lys Ser Ser Thr Gly Phe Val
260 265 270
Tyr Phe Asn Ala Thr Gly Ser Asp Val Asn Cys Asn Gly Tyr Gln His
275 280 285
Asn Ser Val Ala Asp Val Met Arg Tyr Asn Leu Asn Leu Ser Ala Asn
290 295 300
Ser Val Asp Asn Leu Lys Ser Gly Val Ile Val Phe Lys Thr Leu Gln
305 310 315 320
Tyr Asp Val Leu Phe Tyr Cys Ser Asn Ser Ser Ser Gly Val Leu Asp
325 330 335
Thr Thr Ile Pro Phe Gly Pro Ser Ser Gln Pro Tyr Tyr Cys Phe Ile
340 345 350
Asn Ser Thr Ile Asn Thr Thr His Val Ser Thr Phe Val Gly Ile Leu
355 360 365
Pro Pro Thr Val Arg Glu Ile Val Val Ala Arg Thr Gly Gln Phe Tyr
370 375 380
Ile Asn Gly Phe Lys Tyr Phe Asp Leu Gly Phe Ile Glu Ala Val Asn
385 390 395 400
Phe Asn Val Thr Thr Ala Ser Ala Thr Asp Phe Trp Thr Val Ala Phe
405 410 415
Ala Thr Phe Val Asp Val Leu Val Asn Val Ser Ala Thr Asn Ile Gln
420 425 430
Asn Leu Leu Tyr Cys Asp Ser Pro Phe Glu Lys Leu Gln Cys Glu His
435 440 445
Leu Gln Phe Gly Leu Gln Asp Gly Phe Tyr Ser Ala Asn Phe Leu Asp
450 455 460
Asp Asn Val Leu Pro Glu Thr Tyr Val Ala Leu Pro Ile Tyr Tyr Gln
465 470 475 480
His Thr Asp Ile Asn Phe Thr Ala Thr Ala Ser Phe Gly Gly Ser Cys
485 490 495
Tyr Val Cys Lys Pro Arg Gln Val Asn Ile Ser Leu Asn Gly Asn Thr
500 505 510
Ser Val Cys Val Arg Thr Ser His Phe Ser Ile Arg Tyr Ile Tyr Asn
515 520 525
Arg Val Lys Ser Gly Ser Pro Gly Asp Ser Ser Trp His Ile Tyr Leu
530 535 540
Lys Ser Gly Thr Cys Pro Phe Ser Phe Ser Lys Leu Asn Asn Phe Gln
545 550 555 560
Lys Phe Lys Thr Ile Cys Phe Ser Thr Val Glu Val Pro Gly Ser Cys
565 570 575
Asn Phe Pro Leu Glu Ala Thr Trp His Tyr Thr Ser Tyr Thr Ile Val
580 585 590
Gly Ala Leu Tyr Val Thr Trp Ser Glu Gly Asn Ser Ile Thr Gly Val
595 600 605
Pro Tyr Pro Val Ser Gly Ile Arg Glu Phe Ser Asn Leu Val Leu Asn
610 615 620
Asn Cys Thr Lys Tyr Asn Ile Tyr Asp Tyr Val Gly Thr Gly Ile Ile
625 630 635 640
Arg Ser Ser Asn Gln Ser Leu Ala Gly Gly Ile Thr Tyr Val Ser Asn
645 650 655
Ser Gly Asn Leu Leu Gly Phe Lys Asn Val Ser Thr Gly Asn Ile Phe
660 665 670
Ile Val Thr Pro Cys Asn Gln Pro Asp Gln Val Ala Val Tyr Gln Gln
675 680 685
Ser Ile Ile Gly Ala Met Thr Ala Val Asn Glu Ser Arg Tyr Gly Leu
690 695 700
Gln Asn Leu Leu Gln Leu Pro Asn Phe Tyr Tyr Val Ser Asn Gly Gly
705 710 715 720
Asn Asn Cys Thr Thr Ala Val Met Ile Tyr Ser Asn Phe Gly Ile Cys
725 730 735
Ala Asp Gly Ser Leu Ile Pro Val Arg Pro Arg Asn Ser Ser Asp Asn
740 745 750
Gly Ile Ser Ala Ile Ile Thr Ala Asn Leu Ser Ile Pro Ser Asn Trp
755 760 765
Thr Thr Ser Val Gln Val Glu Tyr Leu Gln Ile Thr Ser Thr Pro Ile
770 775 780
Val Val Asp Cys Ala Thr Tyr Val Cys Asn Gly Asn Pro Arg Cys Lys
785 790 795 800
Asn Leu Leu Lys Gln Tyr Thr Ser Ala Cys Lys Thr Ile Glu Asp Ala
805 810 815
Leu Arg Leu Ser Ala His Leu Glu Thr Asn Asp Val Ser Ser Met Leu
820 825 830
Thr Phe Asp Ser Asn Ala Phe Ser Leu Ala Asn Val Thr Ser Phe Gly
835 840 845
Asp Tyr Asn Leu Ser Ser Val Leu Pro Gln Arg Asn Ile His Ser Ser
850 855 860
Arg Ile Ala Gly Arg Ser Ala Leu Glu Asp Leu Leu Phe Ser Lys Val
865 870 875 880
Val Thr Ser Gly Leu Gly Thr Val Asp Val Asp Tyr Lys Ser Cys Thr
885 890 895
Lys Gly Leu Ser Ile Ala Asp Leu Ala Cys Ala Gln Tyr Tyr Asn Gly
900 905 910
Ile Met Val Leu Pro Gly Val Ala Asp Ala Glu Arg Met Ala Met Tyr
915 920 925
Thr Gly Ser Leu Ile Gly Gly Met Val Leu Gly Gly Leu Thr Ser Ala
930 935 940
Ala Ala Ile Pro Phe Ser Leu Ala Leu Gln Ala Arg Leu Asn Tyr Val
945 950 955 960
Ala Leu Gln Thr Asp Val Leu Gln Glu Asn Gln Lys Ile Leu Ala Ala
965 970 975
Ser Phe Asn Lys Ala Ile Asn Asn Ile Val Ala Ser Phe Ser Ser Val
980 985 990
Asn Asp Ala Ile Thr His Thr Ala Glu Ala Ile His Thr Val Thr Ile
995 1000 1005
Ala Leu Asn Lys Ile Gln Asp Val Val Asn Gln Gln Gly Ser Ala
1010 1015 1020
Leu Asn His Leu Thr Ser Gln Leu Arg His Asn Phe Gln Ala Ile
1025 1030 1035
Ser Asn Ser Ile His Ala Ile Tyr Asp Arg Leu Asp Ser Ile Gln
1040 1045 1050
Ala Asp Gln Gln Val Asp Arg Leu Ile Thr Gly Arg Leu Ala Ala
1055 1060 1065
Leu Asn Ala Phe Val Ser Gln Val Leu Asn Lys Tyr Thr Glu Val
1070 1075 1080
Arg Gly Ser Arg Arg Leu Ala Gln Gln Lys Ile Asn Glu Cys Val
1085 1090 1095
Lys Ser Gln Ser Asn Arg Tyr Gly Phe Cys Gly Asn Gly Thr His
1100 1105 1110
Ile Phe Ser Ile Val Asn Ser Ala Pro Asp Gly Leu Leu Phe Leu
1115 1120 1125
His Thr Val Leu Leu Pro Thr Asp Tyr Lys Asn Val Lys Ala Trp
1130 1135 1140
Ser Gly Ile Cys Val Asp Gly Ile Tyr Gly Tyr Val Leu Arg Gln
1145 1150 1155
Pro Asn Leu Val Leu Tyr Ser Asp Asn Gly Val Phe Arg Val Thr
1160 1165 1170
Ser Arg Val Met Phe Gln Pro Arg Leu Pro Val Leu Ser Asp Phe
1175 1180 1185
Val Gln Ile Tyr Asn Cys Asn Val Thr Phe Val Asn Ile Ser Arg
1190 1195 1200
Val Glu Leu His Thr Val Ile Pro Asp Tyr Val Asp Val Asn Lys
1205 1210 1215
Thr Leu Gln Glu Phe Ala Gln Asn Leu Pro Lys Tyr Val Lys Pro
1220 1225 1230
Asn Phe Asp Leu Thr Pro Phe Asn Leu Thr Tyr Leu Asn Leu Ser
1235 1240 1245
Ser Glu Leu Lys Gln Leu Glu Ala Lys Thr Ala Ser Leu Phe Gln
1250 1255 1260
Thr Thr Val Glu Leu Gln Gly Leu Ile Asp Gln Ile Asn Ser Thr
1265 1270 1275
Tyr Val Asp Leu Lys Leu Leu Asn Arg Phe Glu Asn Tyr Ile Lys
1280 1285 1290
Trp Pro Trp Trp Val Trp Leu Ile Ile Ser Val Val Phe Val Val
1295 1300 1305
Leu Leu Ser Leu Leu Val Phe Cys Cys Leu Ser Thr Gly Cys Cys
1310 1315 1320
Gly Cys Cys Asn Cys Leu Thr Ser Ser Met Arg Gly Cys Cys Asp
1325 1330 1335
Cys Gly Ser Thr Lys Leu Pro Tyr Tyr Glu Phe Glu Lys Val His
1340 1345 1350
Val Gln
1355
<210> 4
<211> 77
<212> PRT
<213> EMCR Coronavirus
<400> 4
Met Phe Leu Arg Leu Ile Asp Asp Asn Gly Ile Val Leu Asn Ser Ile
1 5 10 15
Leu Trp Leu Leu Val Met Ile Phe Phe Phe Val Leu Ala Met Thr Phe
20 25 30
Ile Lys Leu Ile Gln Leu Cys Phe Thr Cys His Tyr Phe Phe Ser Arg
35 40 45
Thr Leu Tyr Gln Pro Val Tyr Lys Ile Phe Leu Ala Tyr Gln Asp Tyr
50 55 60
Met Gln Ile Ala Pro Val Pro Ala Glu Val Leu Asn Val
65 70 75
<210> 5
<211> 226
<212> PRT
<213> EMCR Coronavirus
<400> 5
Met Ser Asn Ser Ser Val Pro Leu Ser Glu Val Tyr Val His Leu Arg
1 5 10 15
Asn Trp Asn Phe Ser Trp Asn Leu Ile Leu Thr Val Phe Ile Val Val
20 25 30
Leu Gln Tyr Gly His Tyr Lys Tyr Ser Arg Leu Leu Tyr Gly Leu Lys
35 40 45
Met Ser Val Leu Trp Cys Leu Trp Pro Leu Val Leu Ala Leu Ser Ile
50 55 60
Phe Asp Cys Phe Val Asn Phe Asn Val Asp Trp Val Phe Phe Gly Phe
65 70 75 80
Ser Ile Leu Met Ser Ile Ile Thr Leu Cys Leu Trp Val Met Tyr Phe
85 90 95
Val Asn Ser Phe Arg Leu Trp Arg Arg Val Lys Thr Phe Trp Ala Phe
100 105 110
Asn Pro Glu Thr Asn Ala Ile Ile Ser Leu Gln Val Tyr Gly His Asn
115 120 125
Tyr Tyr Leu Pro Val Met Ala Ala Pro Thr Gly Val Thr Leu Thr Leu
130 135 140
Leu Ser Gly Val Leu Leu Val Asp Gly His Lys Ile Ala Thr Arg Val
145 150 155 160
Gln Val Gly Gln Leu Pro Lys Tyr Val Ile Val Ala Thr Pro Ser Thr
165 170 175
Thr Ile Val Cys Asp Arg Val Gly Arg Ser Val Asn Glu Thr Ser Gln
180 185 190
Thr Gly Trp Ala Phe Tyr Val Arg Ala Lys His Gly Asp Phe Ser Gly
195 200 205
Val Ala Ser Gln Glu Gly Val Leu Ser Glu Arg Glu Lys Leu Leu His
210 215 220
Leu Ile
225
<210> 6
<211> 377
<212> PRT
<213> EMCR Coronavirus
<400> 6
Met Ala Ser Val Asn Trp Ala Asp Asp Arg Ala Ala Arg Lys Lys Phe
1 5 10 15
Pro Pro Pro Ser Phe Tyr Met Pro Leu Leu Val Ser Ser Asp Lys Ala
20 25 30
Pro Tyr Arg Val Ile Pro Arg Asn Leu Val Pro Ile Gly Lys Gly Asn
35 40 45
Lys Asp Glu Gln Ile Gly Tyr Trp Asn Val Gln Glu Arg Trp Arg Met
50 55 60
Arg Arg Gly Gln Arg Val Asp Leu Pro Pro Lys Val His Phe Tyr Tyr
65 70 75 80
Leu Gly Thr Gly Pro His Lys Asp Leu Lys Phe Arg Gln Arg Ser Asp
85 90 95
Gly Val Val Trp Val Ala Lys Glu Gly Ala Lys Thr Val Asn Thr Ser
100 105 110
Leu Gly Asn Arg Lys Arg Asn Gln Lys Pro Leu Glu Pro Lys Phe Ser
115 120 125
Ile Ala Leu Pro Pro Glu Leu Ser Val Val Glu Phe Glu Asp Arg Ser
130 135 140
Asn Asn Ser Ser Arg Ala Ser Ser Arg Ser Ser Thr Arg Asn Asn Ser
145 150 155 160
Arg Asp Ser Ser Arg Ser Thr Ser Arg Gln Gln Ser Arg Thr Arg Ser
165 170 175
Asp Ser Asn Gln Ser Ser Ser Asp Leu Val Ala Ala Val Thr Leu Ala
180 185 190
Leu Lys Asn Leu Gly Phe Asp Asn Gln Ser Lys Ser Pro Ser Ser Ser
195 200 205
Gly Thr Ser Thr Pro Lys Lys Pro Asn Lys Pro Leu Ser Gln Pro Arg
210 215 220
Ala Asp Lys Pro Ser Gln Leu Lys Lys Pro Arg Trp Lys Arg Val Pro
225 230 235 240
Thr Arg Glu Glu Asn Val Ile Gln Cys Phe Gly Pro Arg Asp Phe Asn
245 250 255
His Asn Met Gly Asp Ser Asp Leu Val Gln Asn Gly Val Asp Ala Lys
260 265 270
Gly Phe Pro Gln Leu Ala Glu Leu Ile Pro Asn Gln Ala Ala Leu Phe
275 280 285
Phe Asp Ser Glu Val Ser Thr Asp Glu Val Gly Asp Asn Val Gln Ile
290 295 300
Thr Tyr Thr Tyr Lys Met Leu Val Ala Lys Asp Asn Lys Asn Leu Pro
305 310 315 320
Lys Phe Ile Glu Gln Ile Ser Ala Phe Thr Lys Pro Ser Ser Ile Lys
325 330 335
Glu Met Gln Ser Gln Ser Ser His Val Ala Gln Asn Thr Val Leu Asn
340 345 350
Ala Ser Ile Pro Glu Ser Lys Pro Leu Ala Asp Asp Asp Ser Ala Ile
355 360 365
Ile Glu Ile Val Asn Glu Val Leu His
370 375
<210> 7
<211> 27530
<212> DNA
<213> EMCR Coronavirus
<220>
<221> CDS
<222> (12402)..(20438)
<223> Replicase 1b
<220>
<221> CDS
<222> (24502)..(25182)
<223> ORF 4ab
<400> 7
agatagagaa ttttcttatt tagactttgt gtctactcct ctcaactaaa cgaaattttt 60
ctagtgctgt catttgttat ggcagtccta gtgtaattga aatttcgtca agtttgtaaa 120
ctggttaggc aagtgttgta ttttctgtgt ttaagcactg gtggttctgt ccactagtgc 180
acacattgat acttaagtgg tgttctgtca ctgcttattg tggaagcaac gttctgtcgt 240
tgtggaaacc aataactgct aaccatgttt tacaatcaag tgacacttgc tgttgcaagt 300
gattcggaaa tttcaggttt tggttttgcc attccttctg tagccgttcg cgcttatagc 360
gaagccgctg cacaaggttt tcaggcatgc cgctttgttg cttttggctt acaggattgt 420
gtaaccggta ttaatgatga cgattatgtc attgcattga ctggtactaa tcagctttgt 480
gccaaaattt tacttttttc tgatagacct cttaatttgc gaggttggct cattttttct 540
aacagcaatt atgttcttca ggactttgat gttgtttttg gccatggtgc aggaagtgtg 600
gtttttgtgg ataagtatat gtgtggtttt gatggtaaac ctgtgttacc taaaaacatg 660
tgggaattta gagattactt taatgataat actgatagta ttgttattgg tggtgtcact 720
tatcaattag catgggatgt tatacgtaaa gacctttctt atgaacagca aaatgtttta 780
gctattgaga gcattcatta tcttggcact acaggtcata ctttgaagtc tggttgcaaa 840
ctcattaatg ccaagccgcc taaatattct tctaaggttg ttttgagtgg tgaatggaat 900
gctgtgtata aggcgtttgg ttcaccattt attacaaatg gtatatcatt gctagatata 960
attgttaaac cagttttctt taatgctttt gttaaatgca attgtggttc tgagaattgg 1020
agtgttggtg catgggatgg ttatctatct tcttgttgtg gcacacctgc taagaaactt 1080
tgtgttgttc ctggtaatgt tgttcctggt gatgtgatca tcacctcaac tgatgctggt 1140
tgtggtgtta aatactatgc tggcttagtt gttaaacata ttactaacat tactggtgtg 1200
tctttatggc gtgttacagc tgttcattct gatggaatgt ttgtggcaac atcttcttat 1260
gatgcacttt tgcatagaaa ttcattagac cctttttgct ttgatgttaa cactttactt 1320
tctaatcaat tacgtctagc ttttcttggt gcttctgtta cagaagatgt taaatttgct 1380
gctagcactg gtgttattga cattagtgct ggtatgtttg gtctttacga tgacatattg 1440
acaaacaata aaccttggtt tgtacgcaaa gcttctgggc tttttgatgc aatctgggat 1500
gcttttgttg ccgctattaa gcttgtgcca actactactg gtggtttggt taggtttgtt 1560
aagtctatcg cttcaactgt tttaactgtt tctaatggtg ttattattat gtgtgcagat 1620
gttccagatg cttttcaacc agtttaccgc acatttacac aagctatttg tgctgcattt 1680
gatttttctt tagatgtatt taaaattggt gatgttaaat ttaaacgact tggtgattat 1740
gttcttactg aaaatgctct tgttcgtttg actactgaag ttgttcgtgg tgttcgtgat 1800
gctcgcataa agaaagccat gtttactaaa gtagttgtag gtcctacaac tgaagttaag 1860
ttttctgtta ttgaacttgc cactgttaat ttgcgtcttg ttgattgtgc acctgtagtt 1920
tgccctaaag gtaaaattgt tgttattgct ggacaagctt ttttctatag tggtggtttt 1980
tatcgtttta tggttgattc tacaactgta ttaaatgacc ctgtttttac tggtgagtta 2040
ttttatacta ttaagtttag tggttttaag cttgatggtt ttaaccatca gtttgttaat 2100
gctagttctg ctacagatgc cattattgct gttgagctgt tgttatcgga ttttaaaact 2160
gcagtttttg tgtacacatg tgtggttgat ggttgtagtg tcattgttag acgtgatgct 2220
acattcgcca cacatgtgtg ttttaaggac tgttatagta tttgggagca attctgcatt 2280
gataattgtg gtgagccatg gtttttgact gattataatg ctatcttgca gagtaataac 2340
cctcaatgtg ctattgttca agcatcggag tctaaagttt tgcttgagag gtttttacct 2400
aagtgtcctg aaatactgtt gagtattgat gatggccatt tatggaatct ttttgttgaa 2460
aagtttaatt ttgttacaga ttggttaaaa actcttaagc ttacacttac ttctaatggt 2520
cttttaggta attgtgccaa acgttttaga cgtgttttgg taaaattgct tgatgtctat 2580
aatggttttc ttgaaactgt ctgtagtgtc gtacacactg ctggtgtttg cattaaatat 2640
tatgctgtta atgttccata tgtagttatt agtggttttg taagtcgtgt aattcgtaga 2700
gaaaggtgtg acgtgacttt tccttgtgtt agttgtgtca cttttttcta tgaattttta 2760
gacacgtgtt ttggtgttag taaacctaat gccattgatg ttgaacattt agagcttaaa 2820
gaaactgttt ttgttgaacc taaggatggt ggtcaatttt ttgtttctga tgattatctt 2880
tggtatgttg tagatgacat ttattatcca gcttcatgta atggtgtatt gccagttgct 2940
tttacaaaat tggcaggtgg taaaatatct ttttctgatg atgttatagt tcatgatgtt 3000
gaacctaccc ataaagtcaa gctcatattt gagtttgaag atgatgttgt taccagtctt 3060
tgtaagaaga gttttggtaa gtctattatt tatacaggtg attgggaagg tttacatgaa 3120
gttcttacat ctgcaatgaa tgtcattggg caacatatta agttgccaca attttatatt 3180
tatgatgaag agggtggtta tgatgtttct aaaccagtta tgatttcaca atggcctatt 3240
agtgatgata gtgatggttg tgttgttgaa gcgagcactg attttcatca attagaatct 3300
gttagagaag aggttgatat aattgaacaa ccttttgggg aagttgaaca tgcgctctca 3360
attagacaac ctttttcttt ttcttttaga gatgaattgg gtgttcgtgt tttagatcaa 3420
tctgataata attgttggat tagtaccaca cttatacagt tgcaacttac aaagcttttg 3480
gatgattcta ttgagatgca attgtttaaa gttggtaaag ttgattcaat tgttcaaaag 3540
tgttatgagt tgtctcattt aattagtggt tcacttggtg atagtggtaa acttcttagt 3600
gaacttctta aagataaata tacatgttct ataacttttg agatgtcttg tgattgtggt 3660
aaaaagtttg atgagcaagt tggttgtttg ttttggatta tgccttacac aaaacttttt 3720
caaaaaggtg agtgttgtat ttgtcataaa atgcagactt ataagcttgt tagtatgaaa 3780
ggtactggtg tgtttgtaca ggatccagca cctattgaca ttgatgcttt ccctgttaga 3840
cctatatgtt catctgtata tttaggtgtt aagggttctg gtcattatca aacaaattta 3900
tacagttttg acaaagctat tgatggtttt ggtgtctttg acattaaaaa tagtagtgtt 3960
aatactgttt gttttgttga tgttgatttt catagtgtag aaatagaagc tggtgaagtt 4020
aaaccttttg ctgtatataa aaatgttaaa ttttatttag gtgatatttc acaccttgta 4080
aactgtgttt cttttgactt tgttgtcaat gctgctaatg aaaatctcat gcatggaggc 4140
ggtgtcgcac gtgctattga tattttgact gaaggtcaac ttcagtcatt atctaaagat 4200
tacattagta gtaatggtcc acttaaggtt ggagcaggtg ttatgttgga gtgtgaaaaa 4260
ttcaatgtat ttaatgttgt tggtccgcga actggtaaac atgagcattc attacttgtt 4320
gaagcttata attctatttt atttgaaaat ggtattccac ttatgcctct tcttagttgt 4380
ggtatttttg gtgtaaggat tgaaaattct cttaaagctt tgtttagttg tgacattaat 4440
aaaccattgc aagtttttgt ttattcttca aatgaagaac aagctgttct taagttttta 4500
gatggtttag atttaacacc agtcattgac gatgttgatg ttgttaaacc ttttagagtt 4560
gaaggtaatt tttcattctt tgattgtggt gtcaatgcct tggatggtga tatttactta 4620
ttatttacta actctatttt aatgttggat aaacaaggac aattattgga cacaaaactt 4680
aatggtattt tgcaacaggc agttcttgat tatcttgcta cagttaaaac tgtaccagct 4740
ggtaatttgg ttaaacttgt tgttgagagt tgtaccattt atatgtgtgt tgtaccatcg 4800
ataaatgatc tttcttttga taaaaatctt ggtcgttgtg tgcgtaaact taatagattg 4860
aaaacttgtg ttattgccaa tgttcctgct attgatgttt tgaaaaagct tctttcaagt 4920
ttgactttaa ctgttaaatt tgttgtagag agtaatgtta tggatgttaa cgactgtttt 4980
aagaatgata atgtagtttt gaaaattact gaagatggta ttaatgttaa agatgttgtt 5040
gttgagtctt ctaagtcact tggtaaacaa ttgggtgttg tgagtgatgg tgttgactct 5100
tttgaaggtg ttttacctat taatactgat actgtcttat ctgtagctcc agaagttgac 5160
tgggttgctt tttacggttt tgaaaaggca gcactttttg cttctttgga tgtaaagcca 5220
tatggttacc ctaatgattt tgttggtggt tttagagttc ttgggaccac cgacaataat 5280
tgttgggtta atgcaacttg tataatttta cagtatctta agcctacttt taaatctaag 5340
ggtttaaatg ttctttggaa caaatttgtt acaggtgatg ttggaccttt tgttagtttt 5400
atttatttta taactatgtc ttcaaagggt caaaagggtg atgctgaaga ggcattatct 5460
aaattgtcag agtatttgat tagtgattct attgttactc ttgaacaata ttcaacttgt 5520
gacatttgta aaagtactgt agttgaagtt aaaagtgctg ttgtctgtgc tagtgtgctt 5580
aaagatggtt gtgatgttgg tttttgtcca cacagacata aattgcgttc acgtgttaag 5640
tttgttaatg gacgtgttgt tattaccaat gttggtgaac ctataatttc acaaccttct 5700
aagttgctta atggtattgc ttatacaaca ttttcaggtt cttttgataa cggtcactat 5760
gtagtttatg atgctgctaa taatgctgtc tatgatggtg ctcgtttatt tgcttcagat 5820
ttgtctactt tagctgttac agctattgtt gtagtaggtg gttgtgtaac atctaatgtt 5880
ccaccaattg ttagtgagaa aatttctgtt atggataaac ttgatactgg tgcacaaaaa 5940
tttttccaat ttggtgattt tgttatgaat aacattgttc tgtttttaac ttggttgctt 6000
agtatgttta gtcttttacg tacttctatt atgaagcatg atattaaagt tattgccaag 6060
gctcctaaac gtacaggtgt tattttgaca cgtagtttta agtataacat tagatctgct 6120
ttgtttgttg taaagcagaa gtggtgtgtt attgttactt tgtttaagtt cttattgtta 6180
ttatatgcta tttatgcact tgtttttatg attgtgcaat ttagtccttt taatagtctt 6240
ttatgtggtg acattgtaag tggttatgaa aaatccactt ttaataagga tatttattgt 6300
ggtaattcta tggtttgtaa gatgtgtttg tttagttatc aagagtttaa tgatttggat 6360
catactagtc ttgtttggaa gcacattcgt gatcctatat taatcagttt acaaccattt 6420
gttatacttg ttattttgtt aatttttggt aatatgtatt tgcgttttgg acttttatat 6480
tttgttgcac aatttattag tacttttggt tctttcttag gctttcatca gaaacagtgg 6540
tttttacatt ttgtgccgtt tgatgtttta tgtaatgagt ttttagctac atttattgtc 6600
tgcaaaattg ttttatttgt tagacatatt attgttggct gtaataatgc tgactgtgta 6660
gcttgttcta aaagtgctag acttaaacgt gtaccacttc aaactattat taatggtatg 6720
cataaatcat tctatgttaa tgctaatggt ggtacttgtt tctgtaataa acataacttc 6780
ttttgtgtta attgtgattc ttttgggcct ggtaatactt ttattaatgg tgatattgca 6840
agagagcttg gtaatgttgt taaaacagct gttcaaccca cagctcctgc atatgttatt 6900
attgataagg tagattttgt taatggattt tatcgtcttt atagtggtga cactttttgg 6960
cggtatgact ttgacattac tgaatctaag tatagttgta aagaggttct gaagaattgt 7020
aatgttttag aaaattttat tgtttacaat aatagtggta gtaacattac acagattaaa 7080
aatgcttgtg tttatttttc tcaattgttg tgtgaaccta taaagttggt aaattcagag 7140
ttgttgtcaa ctttatcagt tgattttaat ggtgttttgc ataaggcata tgttgatgtt 7200
ttgtgtaata gtttttttaa ggagctaact gctaacatgt ccatggctga atgtaaagct 7260
acacttggtt tgactgtttc tgatgatgat tttgtttcag ctgttgccaa tgcacatagg 7320
tatgacgttt tgctttcaga tttgtcattt aataattttt ttatttctta tgctaaacct 7380
gaagataagt tgtccgttta tgacattgct tgttgtatgc gtgccggttc taaggttgtt 7440
aaccataatg ttttaatcaa agagtcaata cctattgttt ggggtgtcaa ggactttaat 7500
actctttctc aagaaggtaa gaagtacctt gttaaaacaa ctaaagcaaa gggtttgact 7560
tttttattaa cttttaatga taaccaagca attacacaag ttcctgctac tagtatagtt 7620
gcaaaacagg gtgctggttt taaacgtact tataattttc tgtggtatgt atgtttattt 7680
gttgttgcat tgtttattgg tgtctcattt attgattata caaccactgt aactagcttt 7740
catggttatg attttaagta cattgagaat ggtcagttga aggtgtttga agcaccttta 7800
cactgtgttc gtaatgtttt tgataatttt aatcaatggc atgaggctaa gtttggtgtt 7860
gttactacta atagtgataa atgtcctata gttgttggtg tttcagagcg tattaatgtt 7920
gttcctggtg ttccaacaaa tgtatatttg gtaggaaaga ctcttgtttt tacattacag 7980
gctgcttttg gaaacacagg tgtttgttat gactttgatg gtgttaccac tagtgataag 8040
tgtattttta attctgcttg tactaggttg gaaggtttgg gtggtgacaa tgtttattgt 8100
tacaacactg atcttattga aggttctaaa ccttatagta ttttacagcc caatgcttat 8160
tataagtatg atgttaaaaa ttatgtacgt tttccagaaa ttttagctag aggttttggc 8220
ttacgtacta ttagaacttt ggctacacgt tattgtagag ttggtgaatg ccgtgactca 8280
cataaaggtg tttgttttgg ttttgataaa tggtatgtta atgatggacg tgttgatgac 8340
ggttacattt gtggtgatgg tcttatagac cttcttgtta atgtactctc aatctttagt 8400
tcatctttta gcgttgtggc tatgtctgga catatgttgt ttaattttct ttttgcagca 8460
tttattacat ttttgtgctt tttagttact aaatttaaac gtgtttttgg tgatctttct 8520
tatggtgttt ttactgttgt ttgtgcaact ttgattaata acatttctta tgttgttact 8580
caaaatttat tttttatgtt gctttatgct attttgtatt ttgtttttac taggacagtg 8640
cgttatgctt ggatttggca tattgcatac attgttgcat acttcttgtt aataccatgg 8700
tggcttctca catggtttag ttttgctgca tttttagagc ttttacctaa tgtttttaag 8760
ttaaaaatct ctactcaatt gtttgaaggt gataagttta taggtacttt tgagagtgct 8820
gctgcaggta catttgttct tgacatgcgt tcttatgaaa ggctgataaa tactatttca 8880
cctgagaaac ttaagaatta tgctgcaagt tataataaat ataaatatta tagtggtagt 8940
gctagtgagg ctgattatcg ttgtgcttgt tatgctcatt tagccaaggc tatgttagat 9000
tacgcaaaag atcataatga catgttatat tctccaccta ccattagcta caattccacc 9060
ttacaatctg gtcttaagaa gatggcacaa ccatctggtt gtgttgagag atgtgtggtt 9120
cgcgtctgtt atggtagtac tgtgcttaat ggagtttggt taggtgacac tgttacttgt 9180
cctagacatg tcatagcacc atcaaccact gttcttattg attatgatca tgcatatagt 9240
actatgcgtt tgcataattt ttcagtgtct cataatggtg tcttcttggg agttgttggt 9300
gttacaatgc atggttctgt gttgcgtatt aaggtttcac aatctaatgt acatacacct 9360
aaacatgttt ttaaaacgtt gaaacctggt gcttctttta atattttagc atgttatgaa 9420
ggtattgcat ctggtgtttt tggtgttaat ttacgtacaa actttactat taaaggttct 9480
tttataaatg gagcttgtgg ttctcctggt tataatgtta gaaatgatgg tactgttgag 9540
ttttgttatt tacaccaaat tgagttaggt agtggtgctc atgttggttc tgattttact 9600
ggtagtgttt atggtaattt tgatgaccaa cctagtttgc aagttgagag tgccaacctt 9660
atgctatcag ataatgttgt tgcctttttg tatgctgctt tgttgaatgg ttgtaggtgg 9720
tggttgcgtt caactagagt taatgttgat ggttttaatg aatgggctat ggctaatggt 9780
tatacaattg tttctagtgt tgagtgctat tctattttgg cagcaaaaac tggtgttagt 9840
gttgaacaat tgttagcttc cattcaacat cttcatgaag gttttggtgg taaaaacata 9900
cttggttatt ctagtttatg tgatgagttc acactagctg aagttgtgaa gcagatgtat 9960
ggtgttaact tgcaaagtgg taaggttatt tttggtttaa aaacaatgtt tttatttagc 10020
gttttcttca caatgttttg ggcagaactc tttatttata caaacactat atggataaac 10080
cctgttatac ttacacctat attttgttta cttttgtttt tgtcattagt tttaactatg 10140
tttcttaaac ataagttttt gtttttgcaa gtatttttat tacctactgt tattgcaact 10200
gctttatata attgtgtttt ggattattac atagtaaaat ttttggctga ccattttaac 10260
tataatgttt cagtattaca aatggatgtt cagggtttag ttaatgtttt ggtctgttta 10320
tttgttgtat ttttacacac atggcgtttt tctaaagaac gtttcacaca ttggtttaca 10380
tatgtgtgtt ctcttatagc agttgcttac acttattttt atagtggtga ctttttgagt 10440
ttgcttgtta tgtttttatg tgctatatct agtgattggt acattggtgc cattgttttt 10500
aggttgtcac gtttgattat atttttttca cctgaaagtg tatttagtgt ttttggtgat 10560
gtgaaactca ctttagttgt ttatttaatt tgtggttatt tagtttgtac ttattggggc 10620
attttgtatt ggttcaatag gttttttaaa tgtactatgg gtgtttatga ttttaaggtg 10680
agtgctgctg aatttaaata catggttgct aatggacttc atgcaccata tggacctttt 10740
gatgcacttt ggttatcatt caaattactt ggtattggtg gtgaccgttg tataaaaatt 10800
tcaactgtcc aatccaaact gactgatttg aagtgtacta atgttgtgtt attgggttgt 10860
ttgtctagta tgaacattgc agctaattct agtgaatggg cttattgtgt tgatttacac 10920
aataagatta atctttgtga tgacccagaa aaagctcaag gtatgttgtt agcactcctt 10980
gcgttctttc taagtaaaca tagtgatttt ggtcttgatg gccttattga ttcttatttt 11040
gataatagta gcaccctgca gagtgttgct tcatcatttg ttagtatgcc atcatatatt 11100
gcttatgaaa atgctagaca agcttatgag gatgctattg ctaatggatc ttcttctcaa 11160
cttattaaac aattgaagcg tgccatgaat atcgcaaagt ctgaatttga tcatgagata 11220
tctgttcaga agaaaattaa tagaatggct gaacaagctg ctactcagat gtataaagaa 11280
gcacgctctg ttaatagaaa atctaaagtt attagtgcta tgcactcttt actttttgga 11340
atgttaagac gtttggatat gtctagtgtt gaaactgttt tgaatttagc acgtgatggt 11400
gttgtgccat tgtcagttat acctgcaact tcagcttcca aactaactat tgttagtcca 11460
gatcttgaat cttattctaa gattgtttgt gatggttctg ttcattatgc tggagttgtt 11520
tggacactta atgatgttaa agacaatgat ggtagacctg ttcatgttaa agagattaca 11580
agggagaatg ttgaaacttt gacatggcct cttatcctta attgtgaacg tgttgttaaa 11640
cttcaaaata atgaaattat gcctggtaaa cttaagcaaa aacctatgaa agctgagggt 11700
gatggtggtg ttttaggtga tggtaatgct ttgtataata ctgagggtgg taaaactttt 11760
atgtatgctt atatttctaa taaagctgac cttaaatttg ttaagtggga gtatgagggt 11820
ggttgcaaca caatcgagtt agactctcct tgtcgattta tggtcgaaac acctaatggt 11880
cctcaagtga agtatttgta ttttgttaaa aatttaaata ccttacgtag aggtgccgtt 11940
cttggtttta taggtgccac aattcgtcta caagctggta aacaaactga attggctgtt 12000
aattctggac ttttaactgc ttgtgctttt tctgttgatc cagcaaccac ttacttggaa 12060
gctgttaaac atggtgcaaa acctgtaagt aattgtatta agatgttatc taatggtgct 12120
ggtaatggtc aagctataac aactagtgta gatgctaaca ccaatcaaga ttcttatggt 12180
ggagcgtcta tttgtttgta ttgtcgggcc cacgttcctc accctagtat ggatggttac 12240
tgtaagttta agggtaaatg tgttcaggtt cctattggtt gtttggatcc tattaggttt 12300
tgtttagaaa ataatgtgtg taatgtttgt ggttgttggt tgggacacgg gtgtgcttgt 12360
gatcgtacaa ccattcaaag tgttgacatt tcttatttaa a cga gca agg ggt tct 12416
Arg Ala Arg Gly Ser
1 5
agt gca gct cga cta gaa ccc tgt aat ggc acg gac atc gat aag tgt 12464
Ser Ala Ala Arg Leu Glu Pro Cys Asn Gly Thr Asp Ile Asp Lys Cys
10 15 20
gtt cgt gct ttt gac att tat aat aaa aat gtt tca ttc ttg ggt aag 12512
Val Arg Ala Phe Asp Ile Tyr Asn Lys Asn Val Ser Phe Leu Gly Lys
25 30 35
tgt ttg aag atg aac tgt gtt cgt ttt aaa aat gct gat ctt aag gat 12560
Cys Leu Lys Met Asn Cys Val Arg Phe Lys Asn Ala Asp Leu Lys Asp
40 45 50
ggt tat ttt gtt ata aag agg tgt act aag tcg gtt atg gaa cac gag 12608
Gly Tyr Phe Val Ile Lys Arg Cys Thr Lys Ser Val Met Glu His Glu
55 60 65
caa tcc atg tat aac cta ctt aac ttt tct ggt gct ttg gct gag cat 12656
Gln Ser Met Tyr Asn Leu Leu Asn Phe Ser Gly Ala Leu Ala Glu His
70 75 80 85
gat ttc ttt act tgg aaa gat ggc aga gtc att tat ggt aat gtt agt 12704
Asp Phe Phe Thr Trp Lys Asp Gly Arg Val Ile Tyr Gly Asn Val Ser
90 95 100
aga cat aat ctt act aaa tat act atg atg gac ttg gtt tat gct atg 12752
Arg His Asn Leu Thr Lys Tyr Thr Met Met Asp Leu Val Tyr Ala Met
105 110 115
cgt aac ttt gat gaa caa aat tgt gat gtt cta aaa gaa gta tta gtt 12800
Arg Asn Phe Asp Glu Gln Asn Cys Asp Val Leu Lys Glu Val Leu Val
120 125 130
tta act ggt tgt tgt gac aat tct tat ttt gat agt aag ggt tgg tat 12848
Leu Thr Gly Cys Cys Asp Asn Ser Tyr Phe Asp Ser Lys Gly Trp Tyr
135 140 145
gac cca gtt gaa aat gaa gat ata cat aga gtt tat gca tct ctt ggc 12896
Asp Pro Val Glu Asn Glu Asp Ile His Arg Val Tyr Ala Ser Leu Gly
150 155 160 165
aaa att gta gct aga gct atg ctt aaa tgc gtt gct cta tgt gat gcg 12944
Lys Ile Val Ala Arg Ala Met Leu Lys Cys Val Ala Leu Cys Asp Ala
170 175 180
atg gtt gct aaa ggt gtt gtt ggt gtt tta aca tta gat aac caa gat 12992
Met Val Ala Lys Gly Val Val Gly Val Leu Thr Leu Asp Asn Gln Asp
185 190 195
ctt aat ggt aac ttt tat gat ttt ggt gat ttt gtt gtt agc tta cct 13040
Leu Asn Gly Asn Phe Tyr Asp Phe Gly Asp Phe Val Val Ser Leu Pro
200 205 210
aat atg ggt gtt ccc tgt tgt aca tca tat tat tct tat atg atg cct 13088
Asn Met Gly Val Pro Cys Cys Thr Ser Tyr Tyr Ser Tyr Met Met Pro
215 220 225
att atg ggt tta act aat tgt tta gct agt gag tgt ttt gtc aag agt 13136
Ile Met Gly Leu Thr Asn Cys Leu Ala Ser Glu Cys Phe Val Lys Ser
230 235 240 245
gat att ttt ggt agt gat ttt aaa act ttt gat ttg ctt aag tat gat 13184
Asp Ile Phe Gly Ser Asp Phe Lys Thr Phe Asp Leu Leu Lys Tyr Asp
250 255 260
ttc act gaa cat aaa gaa aat tta ttc aat aag tac ttt aag cat tgg 13232
Phe Thr Glu His Lys Glu Asn Leu Phe Asn Lys Tyr Phe Lys His Trp
265 270 275
agt ttt gat tat cat cct aat tgt agt gac tgt tat gat gat atg tgt 13280
Ser Phe Asp Tyr His Pro Asn Cys Ser Asp Cys Tyr Asp Asp Met Cys
280 285 290
gtt ata cat tgt gct aat ttt aat aca cta ttt gcc aca act ata cca 13328
Val Ile His Cys Ala Asn Phe Asn Thr Leu Phe Ala Thr Thr Ile Pro
295 300 305
ggt act gct ttt ggt cca cta tgt cgt aaa gtt ttt ata gat ggt gtt 13376
Gly Thr Ala Phe Gly Pro Leu Cys Arg Lys Val Phe Ile Asp Gly Val
310 315 320 325
cca ctt gtt aca act gct ggt tat cat ttt aag caa tta ggt ttg gtt 13424
Pro Leu Val Thr Thr Ala Gly Tyr His Phe Lys Gln Leu Gly Leu Val
330 335 340
tgg aat aaa gat gtt aac aca cac tca gtt agg ttg aca atc act gaa 13472
Trp Asn Lys Asp Val Asn Thr His Ser Val Arg Leu Thr Ile Thr Glu
345 350 355
ctt ttg caa ttt gtt act gac cct tcc ttg ata ata gct tct tct cca 13520
Leu Leu Gln Phe Val Thr Asp Pro Ser Leu Ile Ile Ala Ser Ser Pro
360 365 370
gca ctc gtt gat caa cgc act att tgt ttt tct gtt gca gca ttg agt 13568
Ala Leu Val Asp Gln Arg Thr Ile Cys Phe Ser Val Ala Ala Leu Ser
375 380 385
act ggt ttg aca aat caa gtt gtt aag cca ggt cat ttt aat gaa gag 13616
Thr Gly Leu Thr Asn Gln Val Val Lys Pro Gly His Phe Asn Glu Glu
390 395 400 405
ttt tat aac ttt ctt cgt tta aga ggt ttc ttt gat gaa ggt tct gaa 13664
Phe Tyr Asn Phe Leu Arg Leu Arg Gly Phe Phe Asp Glu Gly Ser Glu
410 415 420
ctt aca tta aaa cat ttc ttc ttc gca cag aat ggt gat gct gct gtt 13712
Leu Thr Leu Lys His Phe Phe Phe Ala Gln Asn Gly Asp Ala Ala Val
425 430 435
aaa gat ttt gac ttt tac cgt tat aat aag cct acc att tta gat att 13760
Lys Asp Phe Asp Phe Tyr Arg Tyr Asn Lys Pro Thr Ile Leu Asp Ile
440 445 450
tgt caa gct aga gtt aca tat aag ata gtc tct cgt tat ttt gac att 13808
Cys Gln Ala Arg Val Thr Tyr Lys Ile Val Ser Arg Tyr Phe Asp Ile
455 460 465
tat gaa ggt ggc tgt att aag gca tgt gaa gtt gtt gta aca aat ctt 13856
Tyr Glu Gly Gly Cys Ile Lys Ala Cys Glu Val Val Val Thr Asn Leu
470 475 480 485
aat aag agt gct ggt tgg cca tta aat aag ttt ggt aaa gct agt ttg 13904
Asn Lys Ser Ala Gly Trp Pro Leu Asn Lys Phe Gly Lys Ala Ser Leu
490 495 500
tat tac gaa tct ata tct tat gaa gaa cag gat gct ttg ttt gct ttg 13952
Tyr Tyr Glu Ser Ile Ser Tyr Glu Glu Gln Asp Ala Leu Phe Ala Leu
505 510 515
aca aag cgt aat gtc ctc cct act atg aca cag ctg aat ctt aag tat 14000
Thr Lys Arg Asn Val Leu Pro Thr Met Thr Gln Leu Asn Leu Lys Tyr
520 525 530
gct att agt ggt aaa gaa cgt gct aga act gtt ggt ggt gtt tct ctg 14048
Ala Ile Ser Gly Lys Glu Arg Ala Arg Thr Val Gly Gly Val Ser Leu
535 540 545
ttg tcc aca atg acc aca aga caa tac cat caa aaa cat ctt aaa tcc 14096
Leu Ser Thr Met Thr Thr Arg Gln Tyr His Gln Lys His Leu Lys Ser
550 555 560 565
att gtt aat aca cgc aat gcc act gtt gtt att ggt act acc aaa ttt 14144
Ile Val Asn Thr Arg Asn Ala Thr Val Val Ile Gly Thr Thr Lys Phe
570 575 580
tat ggt ggt tgg aat aat atg ttg cgt act tta att gat ggt gtt gaa 14192
Tyr Gly Gly Trp Asn Asn Met Leu Arg Thr Leu Ile Asp Gly Val Glu
585 590 595
aac cct atg ctc atg ggt tgg gat tat ccc aaa tgt gat aga gct ttg 14240
Asn Pro Met Leu Met Gly Trp Asp Tyr Pro Lys Cys Asp Arg Ala Leu
600 605 610
cct aac atg ata cgt atg att tca gcc atg gtg ttg ggt tct aag cat 14288
Pro Asn Met Ile Arg Met Ile Ser Ala Met Val Leu Gly Ser Lys His
615 620 625
gtt aat tgt tgt act gta aca gat agg ttt tat agg ctt ggt aac gag 14336
Val Asn Cys Cys Thr Val Thr Asp Arg Phe Tyr Arg Leu Gly Asn Glu
630 635 640 645
ttg gca caa gtt tta aca gaa gtt gtt tat tct aat ggt ggt ttt tat 14384
Leu Ala Gln Val Leu Thr Glu Val Val Tyr Ser Asn Gly Gly Phe Tyr
650 655 660
ttt aag cca ggt ggt acg act tct ggt gac gct agt aca gct tat gct 14432
Phe Lys Pro Gly Gly Thr Thr Ser Gly Asp Ala Ser Thr Ala Tyr Ala
665 670 675
aat tct att ttt aac att ttt caa gcc gtg agt tct aac att aac agg 14480
Asn Ser Ile Phe Asn Ile Phe Gln Ala Val Ser Ser Asn Ile Asn Arg
680 685 690
ttg ctt agt gtc cca tca gat tca tgt aat aat gtt aat gtt agg gat 14528
Leu Leu Ser Val Pro Ser Asp Ser Cys Asn Asn Val Asn Val Arg Asp
695 700 705
cta caa cga cgt ctg tat gat aat tgc tat agg tta act agt gtt gaa 14576
Leu Gln Arg Arg Leu Tyr Asp Asn Cys Tyr Arg Leu Thr Ser Val Glu
710 715 720 725
gag tca ttc att gat gat tat tat ggt tat ctt agg aaa cat ttt tca 14624
Glu Ser Phe Ile Asp Asp Tyr Tyr Gly Tyr Leu Arg Lys His Phe Ser
730 735 740
atg atg att ctc tct gat gac ggt gtt gtc tgt tat aac aag gat tat 14672
Met Met Ile Leu Ser Asp Asp Gly Val Val Cys Tyr Asn Lys Asp Tyr
745 750 755
gct gag tta ggt tat ata gca gac att agt gct ttt aaa gcc act ttg 14720
Ala Glu Leu Gly Tyr Ile Ala Asp Ile Ser Ala Phe Lys Ala Thr Leu
760 765 770
tat tac cag aat aat gtc ttt atg agt act tct aaa tgt tgg gtt gaa 14768
Tyr Tyr Gln Asn Asn Val Phe Met Ser Thr Ser Lys Cys Trp Val Glu
775 780 785
gaa gat tta act aag gga cca cat gag ttt tgt tcc cag cat act atg 14816
Glu Asp Leu Thr Lys Gly Pro His Glu Phe Cys Ser Gln His Thr Met
790 795 800 805
caa ata gtt gat aaa gat ggt acc tat tat ttg cct tac cca gat cct 14864
Gln Ile Val Asp Lys Asp Gly Thr Tyr Tyr Leu Pro Tyr Pro Asp Pro
810 815 820
agt agg atc ttg tca gct ggt gtt ttt gtt gat gat gtt gtt aag aca 14912
Ser Arg Ile Leu Ser Ala Gly Val Phe Val Asp Asp Val Val Lys Thr
825 830 835
gat gct gtt gtt ttg tta kaa cgt tat gtg tct tta gct att gat gca 14960
Asp Ala Val Val Leu Leu Xaa Arg Tyr Val Ser Leu Ala Ile Asp Ala
840 845 850
tac cct ctt tca aaa cac cct aat tct gaa tat cgt aag gtt ttt tac 15008
Tyr Pro Leu Ser Lys His Pro Asn Ser Glu Tyr Arg Lys Val Phe Tyr
855 860 865
gta tta ctt gat tgg gtt aag cat ctt aac aaa aat ttg aat gag ggt 15056
Val Leu Leu Asp Trp Val Lys His Leu Asn Lys Asn Leu Asn Glu Gly
870 875 880 885
gtt ctt gaa tct ttt tct gtt aca ctt ctt gat aat caa gaa gat aag 15104
Val Leu Glu Ser Phe Ser Val Thr Leu Leu Asp Asn Gln Glu Asp Lys
890 895 900
ttt tgg tgt gaa gat ttt tat gct agt atg tat gaa aat tct aca ata 15152
Phe Trp Cys Glu Asp Phe Tyr Ala Ser Met Tyr Glu Asn Ser Thr Ile
905 910 915
ttg caa gct gct ggc tta tgt gtt gtt tgt ggt tca caa act gtt ctt 15200
Leu Gln Ala Ala Gly Leu Cys Val Val Cys Gly Ser Gln Thr Val Leu
920 925 930
cgt tgt ggt gat tgt ctg cgt aag cct atg ttg tgc act aaa tgt gca 15248
Arg Cys Gly Asp Cys Leu Arg Lys Pro Met Leu Cys Thr Lys Cys Ala
935 940 945
tat gat cat gta ttt ggt acc gac cac aag ttt att ttg gct ata aca 15296
Tyr Asp His Val Phe Gly Thr Asp His Lys Phe Ile Leu Ala Ile Thr
950 955 960 965
ccg tat gta tgt aat gca tca ggt tgt ggt gtt agt gat gtt aaa aaa 15344
Pro Tyr Val Cys Asn Ala Ser Gly Cys Gly Val Ser Asp Val Lys Lys
970 975 980
ttg tat ctt ggt ggt ttg aat tac tat tgt aca aat cat aaa cca cag 15392
Leu Tyr Leu Gly Gly Leu Asn Tyr Tyr Cys Thr Asn His Lys Pro Gln
985 990 995
ttg tct ttt cca tta tgt tct gct ggt aat ata ttt ggt tta tat 15437
Leu Ser Phe Pro Leu Cys Ser Ala Gly Asn Ile Phe Gly Leu Tyr
1000 1005 1010
aaa aat tca gca act ggt tcc tta gat gtt gaa gtt ttt aat agg 15482
Lys Asn Ser Ala Thr Gly Ser Leu Asp Val Glu Val Phe Asn Arg
1015 1020 1025
ctt gca acg tct gat tgg act gat gtt agg gac tat aaa ctt gct 15527
Leu Ala Thr Ser Asp Trp Thr Asp Val Arg Asp Tyr Lys Leu Ala
1030 1035 1040
aat gat gtt aaa gat aca ctt aga ctc ttt gcg gct gaa act att 15572
Asn Asp Val Lys Asp Thr Leu Arg Leu Phe Ala Ala Glu Thr Ile
1045 1050 1055
aaa gct aaa gaa gag agt gtt aag tct tct tat gct ttt gca act 15617
Lys Ala Lys Glu Glu Ser Val Lys Ser Ser Tyr Ala Phe Ala Thr
1060 1065 1070
ctt aaa gag gtt gtt gga cct aaa gaa ttg ctt ctt agt tgg gaa 15662
Leu Lys Glu Val Val Gly Pro Lys Glu Leu Leu Leu Ser Trp Glu
1075 1080 1085
agt ggt aaa gtt aaa cca cct ttg aat cgt aat tct gtt ttc acc 15707
Ser Gly Lys Val Lys Pro Pro Leu Asn Arg Asn Ser Val Phe Thr
1090 1095 1100
tgt ttt caa ata agt aag gac tca aaa ttc caa ata ggt gag ttc 15752
Cys Phe Gln Ile Ser Lys Asp Ser Lys Phe Gln Ile Gly Glu Phe
1105 1110 1115
atc ttt gaa aag gtt gaa tat ggt tct gat act gtt acg tat aag 15797
Ile Phe Glu Lys Val Glu Tyr Gly Ser Asp Thr Val Thr Tyr Lys
1120 1125 1130
tct act gta acc act aag tta gtt cct ggt atg att ttt gtc tta 15842
Ser Thr Val Thr Thr Lys Leu Val Pro Gly Met Ile Phe Val Leu
1135 1140 1145
aca tct cac aat gtt caa cct tta cgt gca cca act att gca aac 15887
Thr Ser His Asn Val Gln Pro Leu Arg Ala Pro Thr Ile Ala Asn
1150 1155 1160
caa gag aag tat tct agc att tat aaa ttg cac cct gct ttt aat 15932
Gln Glu Lys Tyr Ser Ser Ile Tyr Lys Leu His Pro Ala Phe Asn
1165 1170 1175
gtc agt gat gca tat gct aat ttg gtt cca tat tac caa ctt att 15977
Val Ser Asp Ala Tyr Ala Asn Leu Val Pro Tyr Tyr Gln Leu Ile
1180 1185 1190
ggt aaa caa aag ata act aca ata cag ggt cct cct ggt agt ggt 16022
Gly Lys Gln Lys Ile Thr Thr Ile Gln Gly Pro Pro Gly Ser Gly
1195 1200 1205
aag tca cat tgt tcc att gga ctt gga ttg tac tat cca ggt gcg 16067
Lys Ser His Cys Ser Ile Gly Leu Gly Leu Tyr Tyr Pro Gly Ala
1210 1215 1220
cgt att gtt ttt gtt gct tgt gcc cat gct gct gtt gat tcc tta 16112
Arg Ile Val Phe Val Ala Cys Ala His Ala Ala Val Asp Ser Leu
1225 1230 1235
tgt gca aaa gct atg act gtt tat agc att gat aag tgt act agg 16157
Cys Ala Lys Ala Met Thr Val Tyr Ser Ile Asp Lys Cys Thr Arg
1240 1245 1250
att ata cct gca aga gct cgg gtt gag tgt tat agt ggc ttt aaa 16202
Ile Ile Pro Ala Arg Ala Arg Val Glu Cys Tyr Ser Gly Phe Lys
1255 1260 1265
cca aat aac act agt gca caa tac ata ttt agc act gtt aac gca 16247
Pro Asn Asn Thr Ser Ala Gln Tyr Ile Phe Ser Thr Val Asn Ala
1270 1275 1280
tta cct gag tgt aat gct gat att gtt gtt gta gat gaa gtt tca 16292
Leu Pro Glu Cys Asn Ala Asp Ile Val Val Val Asp Glu Val Ser
1285 1290 1295
atg tgt aca aat tat gac ctt tct gtt att aat cag cgt tta tca 16337
Met Cys Thr Asn Tyr Asp Leu Ser Val Ile Asn Gln Arg Leu Ser
1300 1305 1310
tat aaa cat att gtt tat gtt ggt gat cca caa caa ctt cct gca 16382
Tyr Lys His Ile Val Tyr Val Gly Asp Pro Gln Gln Leu Pro Ala
1315 1320 1325
cct aga gta atg att act aaa ggt gtt atg gag cct gtt gat tat 16427
Pro Arg Val Met Ile Thr Lys Gly Val Met Glu Pro Val Asp Tyr
1330 1335 1340
aac gtt gtt act caa cgt atg tgt gct ata ggc cct gat gtt ttt 16472
Asn Val Val Thr Gln Arg Met Cys Ala Ile Gly Pro Asp Val Phe
1345 1350 1355
ctt cat aaa tgt tat aga tgt cct gct gaa ata gtt aat aca gtt 16517
Leu His Lys Cys Tyr Arg Cys Pro Ala Glu Ile Val Asn Thr Val
1360 1365 1370
tct gaa ctt gtt tat gag aac aag ttt gtc cct gtt aaa cct gct 16562
Ser Glu Leu Val Tyr Glu Asn Lys Phe Val Pro Val Lys Pro Ala
1375 1380 1385
agt aaa cag tgt ttt aaa atc ttt ttt aag ggt aat gta cag gtt 16607
Ser Lys Gln Cys Phe Lys Ile Phe Phe Lys Gly Asn Val Gln Val
1390 1395 1400
gac aat ggc tct agt att aac aga aag cag ctt gaa ata gtt aag 16652
Asp Asn Gly Ser Ser Ile Asn Arg Lys Gln Leu Glu Ile Val Lys
1405 1410 1415
ctg ttt tta gtt aaa aat cca agt tgg agt aag gct gtg ttt att 16697
Leu Phe Leu Val Lys Asn Pro Ser Trp Ser Lys Ala Val Phe Ile
1420 1425 1430
tct cct tat aat agt cag aat tat gtt gct agt aga ttt tta gga 16742
Ser Pro Tyr Asn Ser Gln Asn Tyr Val Ala Ser Arg Phe Leu Gly
1435 1440 1445
ctt caa att caa act gtt gat tct tct caa ggt agt gag tat gat 16787
Leu Gln Ile Gln Thr Val Asp Ser Ser Gln Gly Ser Glu Tyr Asp
1450 1455 1460
tat gta atc tat gca caa act tct gac act gca cat gct tgc aat 16832
Tyr Val Ile Tyr Ala Gln Thr Ser Asp Thr Ala His Ala Cys Asn
1465 1470 1475
gta aac cgt ttt aat gtt gct ata aca cgt gct aag aag ggt ata 16877
Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys Lys Gly Ile
1480 1485 1490
ttt tgt gta atg tgt gat aaa act ttg ttt gat tca ctt aag ttt 16922
Phe Cys Val Met Cys Asp Lys Thr Leu Phe Asp Ser Leu Lys Phe
1495 1500 1505
ttt gag att aaa cat gca gat tta cac tct agc cag gtt tgt ggc 16967
Phe Glu Ile Lys His Ala Asp Leu His Ser Ser Gln Val Cys Gly
1510 1515 1520
ttg ttt aaa aat tgt aca cgc act cct ctt aat tta cca cca act 17012
Leu Phe Lys Asn Cys Thr Arg Thr Pro Leu Asn Leu Pro Pro Thr
1525 1530 1535
cat gca cac act ttc ttg tcg ttg tca gat cag ttt aag act aca 17057
His Ala His Thr Phe Leu Ser Leu Ser Asp Gln Phe Lys Thr Thr
1540 1545 1550
ggt gat tta gct gtt caa ata ggt tca aat aat gtt tgt act tat 17102
Gly Asp Leu Ala Val Gln Ile Gly Ser Asn Asn Val Cys Thr Tyr
1555 1560 1565
gaa cat gtt ata tca ttt atg ggt ttt agg ttt gat att agt att 17147
Glu His Val Ile Ser Phe Met Gly Phe Arg Phe Asp Ile Ser Ile
1570 1575 1580
cct ggt agt cat agt ttg ttt tgt aca cgt gac ttt gct att cgt 17192
Pro Gly Ser His Ser Leu Phe Cys Thr Arg Asp Phe Ala Ile Arg
1585 1590 1595
aat gtg cgt ggt tgg ttg ggt atg gat gtt gaa agt gct cat gtt 17237
Asn Val Arg Gly Trp Leu Gly Met Asp Val Glu Ser Ala His Val
1600 1605 1610
tgt ggc gat aac ata ggt act aat gtt cct tta cag gtt ggt ttt 17282
Cys Gly Asp Asn Ile Gly Thr Asn Val Pro Leu Gln Val Gly Phe
1615 1620 1625
tca aat ggt gtt aat ttt gtt gtg caa act gaa ggt tgt gtg tct 17327
Ser Asn Gly Val Asn Phe Val Val Gln Thr Glu Gly Cys Val Ser
1630 1635 1640
acc aat ttt ggt gat gtt att aaa cct gtt tgt gca aaa tct cca 17372
Thr Asn Phe Gly Asp Val Ile Lys Pro Val Cys Ala Lys Ser Pro
1645 1650 1655
cca ggt gaa caa ttt aga cac ctt gtt cct ttt tta cgt aaa gga 17417
Pro Gly Glu Gln Phe Arg His Leu Val Pro Phe Leu Arg Lys Gly
1660 1665 1670
caa cct tgg tta att gtt cgt aga cgc att gtg caa atg ata tct 17462
Gln Pro Trp Leu Ile Val Arg Arg Arg Ile Val Gln Met Ile Ser
1675 1680 1685
gat tat ttg tcc aat ttg tct gac att ctt gtc ttt gtt ttg tgg 17507
Asp Tyr Leu Ser Asn Leu Ser Asp Ile Leu Val Phe Val Leu Trp
1690 1695 1700
gca ggt agt ttg gaa tta act aca atg cgt tac ttt gta aaa ata 17552
Ala Gly Ser Leu Glu Leu Thr Thr Met Arg Tyr Phe Val Lys Ile
1705 1710 1715
ggg cca att aaa tat tgt tat tgt ggt aat tct gcc act tgt tat 17597
Gly Pro Ile Lys Tyr Cys Tyr Cys Gly Asn Ser Ala Thr Cys Tyr
1720 1725 1730
aat tca gtt agt aat gaa tat tgt tgt ttt aaa cat gca ttg ggt 17642
Asn Ser Val Ser Asn Glu Tyr Cys Cys Phe Lys His Ala Leu Gly
1735 1740 1745
tgt gat tat gtt tac aat ccg tat gct ttt gat ata caa cag tgg 17687
Cys Asp Tyr Val Tyr Asn Pro Tyr Ala Phe Asp Ile Gln Gln Trp
1750 1755 1760
ggt tat gtt ggt tcc ttg agc cag aac cac cac acg ttc tgt aac 17732
Gly Tyr Val Gly Ser Leu Ser Gln Asn His His Thr Phe Cys Asn
1765 1770 1775
att cat aga aac gag cat gat gct tct ggt gat gct gtt atg aca 17777
Ile His Arg Asn Glu His Asp Ala Ser Gly Asp Ala Val Met Thr
1780 1785 1790
cgt tgt ttg gca gta cat gat tgt ttt gtc aaa aat gtt gat tgg 17822
Arg Cys Leu Ala Val His Asp Cys Phe Val Lys Asn Val Asp Trp
1795 1800 1805
act gta acg tac ccc ttt att gca aat gag aaa ttt atc aat ggc 17867
Thr Val Thr Tyr Pro Phe Ile Ala Asn Glu Lys Phe Ile Asn Gly
1810 1815 1820
tgt ggg cgt aat gtc cag gga cat gtt gtt cgc gca gcc ttg aaa 17912
Cys Gly Arg Asn Val Gln Gly His Val Val Arg Ala Ala Leu Lys
1825 1830 1835
ttg tat aaa cct agt gtt att cat gat att ggt aat cct aaa ggt 17957
Leu Tyr Lys Pro Ser Val Ile His Asp Ile Gly Asn Pro Lys Gly
1840 1845 1850
gta cgt tgt gct gtt act gat gcc aaa tgg tac tgt tat gac aag 18002
Val Arg Cys Ala Val Thr Asp Ala Lys Trp Tyr Cys Tyr Asp Lys
1855 1860 1865
caa cct gtt aat agt aat gtc aag ttg ttg gat tat gat tat gca 18047
Gln Pro Val Asn Ser Asn Val Lys Leu Leu Asp Tyr Asp Tyr Ala
1870 1875 1880
acc cat ggt caa ctt gat ggt ctt tgt tta ttc tgg aat tgt aat 18092
Thr His Gly Gln Leu Asp Gly Leu Cys Leu Phe Trp Asn Cys Asn
1885 1890 1895
gtt gat atg tat cca gaa ttt tca att gtg tgt cgc ttt gac aca 18137
Val Asp Met Tyr Pro Glu Phe Ser Ile Val Cys Arg Phe Asp Thr
1900 1905 1910
cgt act cgt tct gtt ttt aat tta gaa ggt gtt aat ggt ggt tct 18182
Arg Thr Arg Ser Val Phe Asn Leu Glu Gly Val Asn Gly Gly Ser
1915 1920 1925
ctt tat gtt aac aaa cat gcg ttt cat aca cca gca tat gat aaa 18227
Leu Tyr Val Asn Lys His Ala Phe His Thr Pro Ala Tyr Asp Lys
1930 1935 1940
cgt gct ttt gtt aaa tta aaa cct atg ccc ttt ttt tac ttt gat 18272
Arg Ala Phe Val Lys Leu Lys Pro Met Pro Phe Phe Tyr Phe Asp
1945 1950 1955
gac agt gat tgt gat gtt gtg caa gaa caa gtt aat tat gta ccc 18317
Asp Ser Asp Cys Asp Val Val Gln Glu Gln Val Asn Tyr Val Pro
1960 1965 1970
ctt cgc gct agt agt tgt gtt acc cgt tgt aat ata ggt ggt gct 18362
Leu Arg Ala Ser Ser Cys Val Thr Arg Cys Asn Ile Gly Gly Ala
1975 1980 1985
gtt tgt tca aaa cat gca aat ttg tat caa aaa tat gtt gag gca 18407
Val Cys Ser Lys His Ala Asn Leu Tyr Gln Lys Tyr Val Glu Ala
1990 1995 2000
tat aat aca ttt aca cag gct ggt ttt aac att tgg gta cca cat 18452
Tyr Asn Thr Phe Thr Gln Ala Gly Phe Asn Ile Trp Val Pro His
2005 2010 2015
agt ttt gat gtt tat aat ttg tgg caa att ttt att gaa act aat 18497
Ser Phe Asp Val Tyr Asn Leu Trp Gln Ile Phe Ile Glu Thr Asn
2020 2025 2030
tta caa agt ctt gaa aat ata gca ttt aat gtt gta aaa aaa ggg 18542
Leu Gln Ser Leu Glu Asn Ile Ala Phe Asn Val Val Lys Lys Gly
2035 2040 2045
tgt ttt act ggt gtt gat ggt gag tta cct gtt gca gtt gtt aac 18587
Cys Phe Thr Gly Val Asp Gly Glu Leu Pro Val Ala Val Val Asn
2050 2055 2060
gac aaa gtt ttt gtt cgc tat ggc gat gtt gac aac ttg gtt ttt 18632
Asp Lys Val Phe Val Arg Tyr Gly Asp Val Asp Asn Leu Val Phe
2065 2070 2075
aca aat aaa aca aca ttg cct act aat gtt gct ttt gaa ttg ttt 18677
Thr Asn Lys Thr Thr Leu Pro Thr Asn Val Ala Phe Glu Leu Phe
2080 2085 2090
gca aaa cga aaa atg ggt tta aca cca cca ttg tct att ctc aaa 18722
Ala Lys Arg Lys Met Gly Leu Thr Pro Pro Leu Ser Ile Leu Lys
2095 2100 2105
aat ctt ggt gtt gtt gct aca tat aaa ttt gtt tta tgg gat tat 18767
Asn Leu Gly Val Val Ala Thr Tyr Lys Phe Val Leu Trp Asp Tyr
2110 2115 2120
gaa gct gaa aga cct ttt acc tca tat act aag agt gta tgt aaa 18812
Glu Ala Glu Arg Pro Phe Thr Ser Tyr Thr Lys Ser Val Cys Lys
2125 2130 2135
tac act gat ttt aat gag gat gtt tgt gtt tgt ttt gac aat agt 18857
Tyr Thr Asp Phe Asn Glu Asp Val Cys Val Cys Phe Asp Asn Ser
2140 2145 2150
att cag ggt tcg tat gag cgt ttt acg ctt act acg aac gct gtt 18902
Ile Gln Gly Ser Tyr Glu Arg Phe Thr Leu Thr Thr Asn Ala Val
2155 2160 2165
tta ttt tct act gtt gtc att aaa aat tta aca cct ata aag ttg 18947
Leu Phe Ser Thr Val Val Ile Lys Asn Leu Thr Pro Ile Lys Leu
2170 2175 2180
aat ttt ggt atg ttg aat ggt atg cca gtt tct tct att aag agt 18992
Asn Phe Gly Met Leu Asn Gly Met Pro Val Ser Ser Ile Lys Ser
2185 2190 2195
gat aaa ggt gtt gaa aaa tta gtt aat tgg tac aca tat gtt cgt 19037
Asp Lys Gly Val Glu Lys Leu Val Asn Trp Tyr Thr Tyr Val Arg
2200 2205 2210
aaa aat ggt caa ttt caa gat cat tat gat ggt ttt tac act caa 19082
Lys Asn Gly Gln Phe Gln Asp His Tyr Asp Gly Phe Tyr Thr Gln
2215 2220 2225
ggt agg aat tta tca gac ttt aca cca aga agt gat atg gag tat 19127
Gly Arg Asn Leu Ser Asp Phe Thr Pro Arg Ser Asp Met Glu Tyr
2230 2235 2240
gat ttt ctt aac atg gat atg ggt gtt ttt att aat aaa tat ggt 19172
Asp Phe Leu Asn Met Asp Met Gly Val Phe Ile Asn Lys Tyr Gly
2245 2250 2255
ctt gag gat ttt aat ttt gaa cat gtt gta tat ggt gat gtt tca 19217
Leu Glu Asp Phe Asn Phe Glu His Val Val Tyr Gly Asp Val Ser
2260 2265 2270
aaa act aca tta gga ggt ctt cat ttg ttg ata tca cag ttt agg 19262
Lys Thr Thr Leu Gly Gly Leu His Leu Leu Ile Ser Gln Phe Arg
2275 2280 2285
ctt agt aaa atg ggt gtt ttg aaa gct gat gat ttt gtc act gct 19307
Leu Ser Lys Met Gly Val Leu Lys Ala Asp Asp Phe Val Thr Ala
2290 2295 2300
tct gac aca act ttg agg tgc tgt act gtt act tat ctt aat gaa 19352
Ser Asp Thr Thr Leu Arg Cys Cys Thr Val Thr Tyr Leu Asn Glu
2305 2310 2315
ctt agt tca aaa gtt gtt tgt act tat atg gat ttg ttg ttg gac 19397
Leu Ser Ser Lys Val Val Cys Thr Tyr Met Asp Leu Leu Leu Asp
2320 2325 2330
gac ttt gtt act ata cta aag agt tta gat ctt ggt gta ata tct 19442
Asp Phe Val Thr Ile Leu Lys Ser Leu Asp Leu Gly Val Ile Ser
2335 2340 2345
aaa gtt cat gaa gtt att ata gat aat aaa cct tat agg tgg atg 19487
Lys Val His Glu Val Ile Ile Asp Asn Lys Pro Tyr Arg Trp Met
2350 2355 2360
ttg tgg tgt aaa gat aac cac ttg tcc act ttt tat cca cag ttg 19532
Leu Trp Cys Lys Asp Asn His Leu Ser Thr Phe Tyr Pro Gln Leu
2365 2370 2375
cag tct gct gaa tgg aag tgt ggt tat gct atg cca caa att tat 19577
Gln Ser Ala Glu Trp Lys Cys Gly Tyr Ala Met Pro Gln Ile Tyr
2380 2385 2390
aag ctt caa cgt atg tgt ttg gaa cct tgt aat tta tat aat tat 19622
Lys Leu Gln Arg Met Cys Leu Glu Pro Cys Asn Leu Tyr Asn Tyr
2395 2400 2405
ggt gct ggt att aag ttg cct agt ggt ata atg tta aat gtt gtt 19667
Gly Ala Gly Ile Lys Leu Pro Ser Gly Ile Met Leu Asn Val Val
2410 2415 2420
aaa tac act cag ctt tgt caa tac cta aat agc act aca atg tgc 19712
Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Ser Thr Thr Met Cys
2425 2430 2435
gta cct cat aat atg cgt gtt ttg cac tat ggt gct ggt tct gac 19757
Val Pro His Asn Met Arg Val Leu His Tyr Gly Ala Gly Ser Asp
2440 2445 2450
aaa ggt gtg gca cct ggt aca act gtt tta aaa cgt tgg cta cca 19802
Lys Gly Val Ala Pro Gly Thr Thr Val Leu Lys Arg Trp Leu Pro
2455 2460 2465
cct gat gca ata atc att gat aat gat atc aat gat tat gtt agt 19847
Pro Asp Ala Ile Ile Ile Asp Asn Asp Ile Asn Asp Tyr Val Ser
2470 2475 2480
gat gca gat ttt agc att aca ggt gat tgt gct act gtt tac ctt 19892
Asp Ala Asp Phe Ser Ile Thr Gly Asp Cys Ala Thr Val Tyr Leu
2485 2490 2495
gaa gat aag ttt gac tta ctt att tct gat atg tat gat ggt aga 19937
Glu Asp Lys Phe Asp Leu Leu Ile Ser Asp Met Tyr Asp Gly Arg
2500 2505 2510
att aaa ttt tgt gat ggt gaa aac gtc tct aaa gat ggt ttt ttt 19982
Ile Lys Phe Cys Asp Gly Glu Asn Val Ser Lys Asp Gly Phe Phe
2515 2520 2525
act tat ctt aat ggt gtt att aga gaa aaa tta gct att ggt ggt 20027
Thr Tyr Leu Asn Gly Val Ile Arg Glu Lys Leu Ala Ile Gly Gly
2530 2535 2540
agt gtt gcc att aag att aca gaa tat agt tgg aat aag tat ctt 20072
Ser Val Ala Ile Lys Ile Thr Glu Tyr Ser Trp Asn Lys Tyr Leu
2545 2550 2555
tat gaa tta ata caa aga ttt gct ttt tgg act ttg ttc tgc acg 20117
Tyr Glu Leu Ile Gln Arg Phe Ala Phe Trp Thr Leu Phe Cys Thr
2560 2565 2570
tct gtt aat aca tcc tct tca gaa gct ttt ctt att ggt att aat 20162
Ser Val Asn Thr Ser Ser Ser Glu Ala Phe Leu Ile Gly Ile Asn
2575 2580 2585
tat tta ggt gac ttt att caa ggt cct ttt ata gct ggt aac act 20207
Tyr Leu Gly Asp Phe Ile Gln Gly Pro Phe Ile Ala Gly Asn Thr
2590 2595 2600
gtt cat gct aat tat ata ttt tgg cgt aat tct act att atg tct 20252
Val His Ala Asn Tyr Ile Phe Trp Arg Asn Ser Thr Ile Met Ser
2605 2610 2615
ttg tca tac aat tca gtt tta gat tta agt aag ttt gaa tgt aaa 20297
Leu Ser Tyr Asn Ser Val Leu Asp Leu Ser Lys Phe Glu Cys Lys
2620 2625 2630
cat aag gcc act gtt gtt gtt aca ctt aaa gat agt gat gta aat 20342
His Lys Ala Thr Val Val Val Thr Leu Lys Asp Ser Asp Val Asn
2635 2640 2645
gat atg gtt ttg agt ttg att aag agt ggt agg ttg ttg tta cgt 20387
Asp Met Val Leu Ser Leu Ile Lys Ser Gly Arg Leu Leu Leu Arg
2650 2655 2660
aat agt ggc cgt ttt ggt ggt ttt agt aat cat tta gtc tca act 20432
Asn Ser Gly Arg Phe Gly Gly Phe Ser Asn His Leu Val Ser Thr
2665 2670 2675
aaa tga aacttttctt gattttgctt attttgcccc tggtttcttg cttttctaca 20488
Lys
tgtaacagta atgctagtat ttctatgtta caattaggtg ttcctgataa ctcttcaact 20548
attgtcacag gtttgttgcc agtccattgg atttgtgcta atcagagtac atctagttac 20608
ccagccaacg gctttttcta tattgatgtt ggtaaacacc gtagtgcctt tgcactccat 20668
agtggttatt atgatgctaa ccagtattat atttatctca ctaataaaat acatttaaat 20728
gctcctgtca ctctgaagat ttgtaagttt ggaaacactt cttttgattt tttaagtaat 20788
gtttctactt ctcatgattg tatagttaat ttgtcattca cagaacagtt aggtgtgcct 20848
ttgggcataa ctatatcggg tgaaactgta cgtttgcatt tatataatgc aactcgtact 20908
ttttatgtgc cggccgctta taaacttact aaacttagtg ttaaatgtta ctttagtgaa 20968
tcctgtgttt ttagtgttgt caatgccacc attactgtta atgtcaccac acttaatggc 21028
cgtatagtta actacactgt ttgtgatgat tgtaatggtt atactgataa catattttct 21088
gttcaacagg atggccgcat tcctaatggt ttccctttta ataattggtt tttgttaact 21148
aatggttcca cattagtgga cggggtctct agactttatc aaccactccg tttaacttgt 21208
ttatggcctg tacctggtct taaatcttca actggttttg tttattttaa tgccactggt 21268
tctgatgtta attgtaacgg ctatcaacat aattctgttg ctgatgttat gcgttacaat 21328
cttaacctca gtgctaattc tgtggacaat cttaagagtg gtgttatagt ttttaaaact 21388
ttacagtacg atgttttgtt ttattgtagt aattcttctt caggtgttct tgacaccaca 21448
ataccttttg gcccttcctc tcaaccttat tactgtttta taaacagtac tatcaacact 21508
actcatgtta gcacttttgt gggtatttta ccacccactg tgcgtgaaat tgttgttgct 21568
agaactggtc agttttatat taatggtttt aagtatttcg atttgggttt catagaagct 21628
gtcaatttta atgtcacgac tgctagtgcc acagattttt ggacggttgc atttgctact 21688
tttgttgatg ttttggttaa tgttagtgca actaacattc aaaacttact ttattgcgat 21748
tctccatttg aaaagttgca gtgtgagcac ttgcagtttg gattgcaaga tggtttttat 21808
tctgcaaatt ttcttgatga taatgttttg cctgagactt atgttgcact ccccatttat 21868
tatcaacata cggacataaa ttttactgca actgcatctt ttggtggttc ttgttatgtt 21928
tgtaaaccac gccaggttaa tatatctctt aatggtaaca cttcagtgtg tgttagaaca 21988
tctcattttt caattaggta tatttataac cgcgttaaga gtggttcacc aggtgactct 22048
tcatggcata tttatttaaa gagtggcact tgtccatttt ctttttctaa gttaaataat 22108
tttcaaaagt ttaagactat ttgtttctca accgtcgaag tgcctggtag ttgtaatttt 22168
ccacttgaag ccacctggca ttacacttct tatactattg ttggtgcttt gtatgttact 22228
tggtctgaag gtaattccat tactggtgta ccttatcctg tctctggtat tcgtgagttt 22288
agtaatttag ttttaaataa ttgtaccaaa tataatattt atgattatgt tggtactgga 22348
attatacgtt cttcaaacca gtcacttgct ggtggtatta catatgtttc taactctggt 22408
aatttacttg gttttaaaaa tgtttccact ggtaacattt ttattgtgac accatgtaac 22468
caaccagatc aagtagctgt ttatcaacaa agcattattg gtgccatgac cgctgttaat 22528
gagtctagat atggcttgca aaacttacta cagttaccta acttttatta tgttagtaat 22588
ggtggtaaca attgcactac ggctgttatg atttattcta attttggtat ttgtgctgat 22648
ggttctttaa ttcctgttcg tccgcgtaat tctagtgata atggtatttc agccataatc 22708
actgctaatt tatccattcc ctctaactgg actacttcag ttcaagttga gtacctccaa 22768
attactagta ctccaatagt tgttgattgt gctacttatg tgtgtaatgg taaccctcgt 22828
tgtaagaatc tacttaagca gtatacttct gcttgtaaaa ctattgaaga tgccttacga 22888
cttagtgctc atttggaaac taatgatgtt agtagtatgc taactttcga tagcaatgct 22948
tttagtttgg ctaatgttac tagttttgga gattataacc tttctagtgt tttacctcag 23008
agaaacattc attcaagccg tatagcagga cgtagtgctt tggaagattt gttgtttagc 23068
aaagttgtta catctggttt gggtactgtt gatgttgact ataagtcttg tactaaaggt 23128
ctttctattg ctgaccttgc ttgtgctcag tactacaatg gcataatggt tttgccaggt 23188
gttgctgatg ctgaacgtat ggccatgtac acaggttctc ttataggtgg catggtgctc 23248
ggaggtctta catcagcagc cgccatacct ttttctttgg cactgcaagc acgacttaac 23308
tatgttgctt tacaaactga tgtgcttcaa gaaaatcaga aaattttggc tgcatcattt 23368
aataaggcta ttaataatat tgttgcttct tttagtagcg ttaatgatgc tattacacat 23428
actgcagagg ctatacatac tgttactatt gcacttaata agattcagga tgttgttaat 23488
caacagggta gtgctcttaa ccatctcact tcacaattga gacataattt tcaggccatt 23548
tctaattcaa ttcatgctat ttatgaccgg cttgattcaa ttcaagccga tcaacaagtt 23608
gacagattaa ttactggacg gcttgcagct ttgaatgcat ttgtttccca agttttgaat 23668
aaatatactg aagttcgtgg ttccagacgc ttagcacagc agaagattaa tgaatgtgtc 23728
aagtcacaat ctaatagata tggtttttgt ggcaatggca ctcacatctt ttcaatcgtc 23788
aactcagctc cagatggttt gctttttctt catactgttt tgctgccaac tgattacaag 23848
aatgtaaagg cgtggtctgg tatctgtgtt gatggcattt atggctatgt tctgcgtcaa 23908
cctaacttgg ttctttattc tgataatggt gtctttcgtg taacttccag ggtcatgttt 23968
caacctcgtt tacctgtttt gtctgatttt gtgcaaatat ataattgtaa tgttactttt 24028
gttaacatat ctcgtgtcga gttacatact gtcatacctg actacgttga tgttaataaa 24088
acattacaag agtttgcaca aaacttacca aagtatgtta agcctaattt tgacttgact 24148
ccttttaatt taacatatct taatttgagt tctgagttga agcaactcga agctaaaact 24208
gctagtcttt tccaaactac tgttgaatta caaggtctta ttgatcagat taacagtaca 24268
tatgttgatt tgaagttgct taataggttt gaaaattata tcaaatggcc ttggtgggtt 24328
tggctcatta tttctgttgt ttttgttgta ttgttgagtc ttcttgtgtt ttgttgtctt 24388
tctacaggtt gttgtggttg ttgcaattgt ttaacttcat caatgcgagg ctgttgtgat 24448
tgtggttcaa ctaaacttcc ttattatgaa tttgaaaagg tccacgttca ata atg 24504
Met
cct ttc ggt ggc cta ttt caa ctt act ctt gaa agt act att aat 24549
Pro Phe Gly Gly Leu Phe Gln Leu Thr Leu Glu Ser Thr Ile Asn
2680 2685 2690
aag agt gtg gct aat ctc aaa tta cca cct cat gat gtt act gtc 24594
Lys Ser Val Ala Asn Leu Lys Leu Pro Pro His Asp Val Thr Val
2695 2700 2705
ttg cgt gac aat ctt aaa cct gtt act aca ctt agt act atc act 24639
Leu Arg Asp Asn Leu Lys Pro Val Thr Thr Leu Ser Thr Ile Thr
2710 2715 2720
gct tat ttg tta gtt agt ttg ttt gtc act tat ttt gct tta ttc 24684
Ala Tyr Leu Leu Val Ser Leu Phe Val Thr Tyr Phe Ala Leu Phe
2725 2730 2735
aaa cct ctt act gct aga ggt cgt gtt gct tgt ttt gtt tta aaa 24729
Lys Pro Leu Thr Ala Arg Gly Arg Val Ala Cys Phe Val Leu Lys
2740 2745 2750
cta ttg aca cta tct gtc tat gtg cct tta ttg gtt ctt ttt ggt 24774
Leu Leu Thr Leu Ser Val Tyr Val Pro Leu Leu Val Leu Phe Gly
2755 2760 2765
atg tat ctt gac agt ttt ata att ttt ttt cta cgc tgt tgt ttc 24819
Met Tyr Leu Asp Ser Phe Ile Ile Phe Phe Leu Arg Cys Cys Phe
2770 2775 2780
gat tca tac atg ttg gct att atg cct atc tct aat aaa aat ttt 24864
Asp Ser Tyr Met Leu Ala Ile Met Pro Ile Ser Asn Lys Asn Phe
2785 2790 2795
tca ttt gtt ttg ttc aat gtt act aaa cta tgc ttc gtt tca ggc 24909
Ser Phe Val Leu Phe Asn Val Thr Lys Leu Cys Phe Val Ser Gly
2800 2805 2810
aag tgt tgg tat ctt gaa caa tca ttt tat gaa aat cgt ttt gct 24954
Lys Cys Trp Tyr Leu Glu Gln Ser Phe Tyr Glu Asn Arg Phe Ala
2815 2820 2825
gct att tat ggt ggt gac cac tat gtc gtt tta ggt ggt gaa act 24999
Ala Ile Tyr Gly Gly Asp His Tyr Val Val Leu Gly Gly Glu Thr
2830 2835 2840
att act ttt gtt tct ttt gat gac ctt tat gtt gct att aga ggt 25044
Ile Thr Phe Val Ser Phe Asp Asp Leu Tyr Val Ala Ile Arg Gly
2845 2850 2855
tct tgt gaa aag aac cta caa ctt atg cgt aag gtt gac ttg tat 25089
Ser Cys Glu Lys Asn Leu Gln Leu Met Arg Lys Val Asp Leu Tyr
2860 2865 2870
aat ggt gct gtc att tac att ttt gcc gaa gag cct gtt gtt ggt 25134
Asn Gly Ala Val Ile Tyr Ile Phe Ala Glu Glu Pro Val Val Gly
2875 2880 2885
ata gtt tac tcc tct caa cta tac gaa gat gtt cct tcg att aat 25179
Ile Val Tyr Ser Ser Gln Leu Tyr Glu Asp Val Pro Ser Ile Asn
2890 2895 2900
tga tgacaatggc attgtcctca attctatttt atggctcctt gttatgatat 25232
ttttctttgt gttggcaatg acctttatta aactgattca attgtgtttt acttgtcatt 25292
atttttttag taggacatta tatcaaccag tttataaaat ttttcttgct taccaagatt 25352
atatgcaaat agcacctgtt ccagctgaag tactaaatgt ctaaactaaa cgatgtctaa 25412
tagtagtgtg cctctttcag aggtttatgt ccatttacgt aactggaact ttagttggaa 25472
tttaattcta acagttttta tagttgtgtt gcagtatggg cattataagt atagcagact 25532
tctttatggt ttaaagatgt ctgttttatg gtgtttatgg ccacttgttc tagctttgtc 25592
tatttttgac tgttttgtca attttaatgt ggactgggtc ttttttggtt ttagtattct 25652
tatgtctatt attacacttt gtttatgggt tatgtatttt gttaatagtt tcagactttg 25712
gcgccgtgtt aaaacttttt gggcttttaa tcctgaaact aatgcaatca tctctctcca 25772
ggtttatgga cataattatt acttaccggt gatggctgca cctacaggtg ttacattaac 25832
acttcttagt ggtgtacttc ttgttgatgg ccataagatt gctactcgtg ttcaagtggg 25892
tcagttgcct aaatatgtaa tagttgctac acctagtacc acaattgttt gtgaccgtgt 25952
tggtcgctct gttaatgaaa caagccagac tggttgggca ttctacgtcc gtgctaaaca 26012
tggtgatttt tctggtgttg cctctcagga gggtgttttg tcagaaagag agaagttgct 26072
tcatttaatc taaactaaac aaaatggcta gtgtaaattg ggccgatgac agagctgcta 26132
ggaagaaatt tcctcctcct tcattttaca tgcctctttt ggttagttct gataaggcac 26192
catatagggt cattcccagg aatcttgtcc ctattggtaa gggtaataaa gatgagcaga 26252
ttggttattg gaatgttcaa gagcgttggc gtatgcgcag ggggcaacgt gttgatttgc 26312
ctcctaaagt tcatttttat tacctaggta ctggacctca taaggacctt aaattcagac 26372
aacgttctga tggtgttgtt tgggttgcta aggaaggtgc taaaactgtt aataccagtc 26432
ttggtaatcg caaacgtaat cagaaacctt tggaaccaaa gttctctatt gctttgcctc 26492
cagagctctc tgttgttgag tttgaggatc gctctaataa ctcatctcgt gctagcagtc 26552
gttcttcaac tcgtaacaac tcacgagact cttctcgtag tacttcaaga caacagtctc 26612
gcactcgttc tgattctaac cagtcttctt cagatcttgt tgctgctgtt actttggctt 26672
taaagaactt aggttttgat aaccagtcga agtcacctag ttcttctggt acttccactc 26732
ctaagaaacc taataagcct ctttctcaac ccagggctga taagccttct cagttgaaga 26792
aacctcgttg gaagcgtgtt cctaccagag aggaaaatgt tattcagtgc tttggtcctc 26852
gtgattttaa tcacaatatg ggggattcag atcttgttca gaatggtgtt gatgccaagg 26912
gttttccaca gcttgctgaa ttgattccta atcaggctgc gttattcttt gatagtgagg 26972
ttagcactga tgaagtgggt gataatgttc agattaccta cacctacaaa atgcttgtag 27032
ctaaggataa taagaacctt cctaagttca ttgagcagat tagtgctttt actaaaccca 27092
gttctatcaa agaaatgcag tcacaatcat ctcatgttgc tcagaacaca gtacttaatg 27152
cttctattcc agaatctaaa ccattggctg atgatgattc agccattata gaaattgtca 27212
acgaggtttt gcattaaatt gttttgtaat tccagttgaa tgtttattat tattagttgc 27272
aaccccatgc gtttagcgca tgataagggt ttagtcttac acacaatggt aggccagtga 27332
tagtaaagtg taagtaattt gctatcatat taacatgtct agaggaaagt cagaactttt 27392
tctgtttgtg ttgttggagt acttaaagat cgcataggcg cgccaacaat ggaagagcca 27452
acaacatatc taaaaatgtt ttgtctggta cttgttaatg atattgtttt tgatatggat 27512
acacaaaaaa aaaaaaaa 27530
<210> 8
<211> 2678
<212> PRT
<213> EMCR Coronavirus
<220>
<221> misc_feature
<222> (844)..(844)
<223> The 'Xaa' at location 844 stands for Glu.
<400> 8
Arg Ala Arg Gly Ser Ser Ala Ala Arg Leu Glu Pro Cys Asn Gly Thr
1 5 10 15
Asp Ile Asp Lys Cys Val Arg Ala Phe Asp Ile Tyr Asn Lys Asn Val
20 25 30
Ser Phe Leu Gly Lys Cys Leu Lys Met Asn Cys Val Arg Phe Lys Asn
35 40 45
Ala Asp Leu Lys Asp Gly Tyr Phe Val Ile Lys Arg Cys Thr Lys Ser
50 55 60
Val Met Glu His Glu Gln Ser Met Tyr Asn Leu Leu Asn Phe Ser Gly
65 70 75 80
Ala Leu Ala Glu His Asp Phe Phe Thr Trp Lys Asp Gly Arg Val Ile
85 90 95
Tyr Gly Asn Val Ser Arg His Asn Leu Thr Lys Tyr Thr Met Met Asp
100 105 110
Leu Val Tyr Ala Met Arg Asn Phe Asp Glu Gln Asn Cys Asp Val Leu
115 120 125
Lys Glu Val Leu Val Leu Thr Gly Cys Cys Asp Asn Ser Tyr Phe Asp
130 135 140
Ser Lys Gly Trp Tyr Asp Pro Val Glu Asn Glu Asp Ile His Arg Val
145 150 155 160
Tyr Ala Ser Leu Gly Lys Ile Val Ala Arg Ala Met Leu Lys Cys Val
165 170 175
Ala Leu Cys Asp Ala Met Val Ala Lys Gly Val Val Gly Val Leu Thr
180 185 190
Leu Asp Asn Gln Asp Leu Asn Gly Asn Phe Tyr Asp Phe Gly Asp Phe
195 200 205
Val Val Ser Leu Pro Asn Met Gly Val Pro Cys Cys Thr Ser Tyr Tyr
210 215 220
Ser Tyr Met Met Pro Ile Met Gly Leu Thr Asn Cys Leu Ala Ser Glu
225 230 235 240
Cys Phe Val Lys Ser Asp Ile Phe Gly Ser Asp Phe Lys Thr Phe Asp
245 250 255
Leu Leu Lys Tyr Asp Phe Thr Glu His Lys Glu Asn Leu Phe Asn Lys
260 265 270
Tyr Phe Lys His Trp Ser Phe Asp Tyr His Pro Asn Cys Ser Asp Cys
275 280 285
Tyr Asp Asp Met Cys Val Ile His Cys Ala Asn Phe Asn Thr Leu Phe
290 295 300
Ala Thr Thr Ile Pro Gly Thr Ala Phe Gly Pro Leu Cys Arg Lys Val
305 310 315 320
Phe Ile Asp Gly Val Pro Leu Val Thr Thr Ala Gly Tyr His Phe Lys
325 330 335
Gln Leu Gly Leu Val Trp Asn Lys Asp Val Asn Thr His Ser Val Arg
340 345 350
Leu Thr Ile Thr Glu Leu Leu Gln Phe Val Thr Asp Pro Ser Leu Ile
355 360 365
Ile Ala Ser Ser Pro Ala Leu Val Asp Gln Arg Thr Ile Cys Phe Ser
370 375 380
Val Ala Ala Leu Ser Thr Gly Leu Thr Asn Gln Val Val Lys Pro Gly
385 390 395 400
His Phe Asn Glu Glu Phe Tyr Asn Phe Leu Arg Leu Arg Gly Phe Phe
405 410 415
Asp Glu Gly Ser Glu Leu Thr Leu Lys His Phe Phe Phe Ala Gln Asn
420 425 430
Gly Asp Ala Ala Val Lys Asp Phe Asp Phe Tyr Arg Tyr Asn Lys Pro
435 440 445
Thr Ile Leu Asp Ile Cys Gln Ala Arg Val Thr Tyr Lys Ile Val Ser
450 455 460
Arg Tyr Phe Asp Ile Tyr Glu Gly Gly Cys Ile Lys Ala Cys Glu Val
465 470 475 480
Val Val Thr Asn Leu Asn Lys Ser Ala Gly Trp Pro Leu Asn Lys Phe
485 490 495
Gly Lys Ala Ser Leu Tyr Tyr Glu Ser Ile Ser Tyr Glu Glu Gln Asp
500 505 510
Ala Leu Phe Ala Leu Thr Lys Arg Asn Val Leu Pro Thr Met Thr Gln
515 520 525
Leu Asn Leu Lys Tyr Ala Ile Ser Gly Lys Glu Arg Ala Arg Thr Val
530 535 540
Gly Gly Val Ser Leu Leu Ser Thr Met Thr Thr Arg Gln Tyr His Gln
545 550 555 560
Lys His Leu Lys Ser Ile Val Asn Thr Arg Asn Ala Thr Val Val Ile
565 570 575
Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asn Asn Met Leu Arg Thr Leu
580 585 590
Ile Asp Gly Val Glu Asn Pro Met Leu Met Gly Trp Asp Tyr Pro Lys
595 600 605
Cys Asp Arg Ala Leu Pro Asn Met Ile Arg Met Ile Ser Ala Met Val
610 615 620
Leu Gly Ser Lys His Val Asn Cys Cys Thr Val Thr Asp Arg Phe Tyr
625 630 635 640
Arg Leu Gly Asn Glu Leu Ala Gln Val Leu Thr Glu Val Val Tyr Ser
645 650 655
Asn Gly Gly Phe Tyr Phe Lys Pro Gly Gly Thr Thr Ser Gly Asp Ala
660 665 670
Ser Thr Ala Tyr Ala Asn Ser Ile Phe Asn Ile Phe Gln Ala Val Ser
675 680 685
Ser Asn Ile Asn Arg Leu Leu Ser Val Pro Ser Asp Ser Cys Asn Asn
690 695 700
Val Asn Val Arg Asp Leu Gln Arg Arg Leu Tyr Asp Asn Cys Tyr Arg
705 710 715 720
Leu Thr Ser Val Glu Glu Ser Phe Ile Asp Asp Tyr Tyr Gly Tyr Leu
725 730 735
Arg Lys His Phe Ser Met Met Ile Leu Ser Asp Asp Gly Val Val Cys
740 745 750
Tyr Asn Lys Asp Tyr Ala Glu Leu Gly Tyr Ile Ala Asp Ile Ser Ala
755 760 765
Phe Lys Ala Thr Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Thr Ser
770 775 780
Lys Cys Trp Val Glu Glu Asp Leu Thr Lys Gly Pro His Glu Phe Cys
785 790 795 800
Ser Gln His Thr Met Gln Ile Val Asp Lys Asp Gly Thr Tyr Tyr Leu
805 810 815
Pro Tyr Pro Asp Pro Ser Arg Ile Leu Ser Ala Gly Val Phe Val Asp
820 825 830
Asp Val Val Lys Thr Asp Ala Val Val Leu Leu Xaa Arg Tyr Val Ser
835 840 845
Leu Ala Ile Asp Ala Tyr Pro Leu Ser Lys His Pro Asn Ser Glu Tyr
850 855 860
Arg Lys Val Phe Tyr Val Leu Leu Asp Trp Val Lys His Leu Asn Lys
865 870 875 880
Asn Leu Asn Glu Gly Val Leu Glu Ser Phe Ser Val Thr Leu Leu Asp
885 890 895
Asn Gln Glu Asp Lys Phe Trp Cys Glu Asp Phe Tyr Ala Ser Met Tyr
900 905 910
Glu Asn Ser Thr Ile Leu Gln Ala Ala Gly Leu Cys Val Val Cys Gly
915 920 925
Ser Gln Thr Val Leu Arg Cys Gly Asp Cys Leu Arg Lys Pro Met Leu
930 935 940
Cys Thr Lys Cys Ala Tyr Asp His Val Phe Gly Thr Asp His Lys Phe
945 950 955 960
Ile Leu Ala Ile Thr Pro Tyr Val Cys Asn Ala Ser Gly Cys Gly Val
965 970 975
Ser Asp Val Lys Lys Leu Tyr Leu Gly Gly Leu Asn Tyr Tyr Cys Thr
980 985 990
Asn His Lys Pro Gln Leu Ser Phe Pro Leu Cys Ser Ala Gly Asn Ile
995 1000 1005
Phe Gly Leu Tyr Lys Asn Ser Ala Thr Gly Ser Leu Asp Val Glu
1010 1015 1020
Val Phe Asn Arg Leu Ala Thr Ser Asp Trp Thr Asp Val Arg Asp
1025 1030 1035
Tyr Lys Leu Ala Asn Asp Val Lys Asp Thr Leu Arg Leu Phe Ala
1040 1045 1050
Ala Glu Thr Ile Lys Ala Lys Glu Glu Ser Val Lys Ser Ser Tyr
1055 1060 1065
Ala Phe Ala Thr Leu Lys Glu Val Val Gly Pro Lys Glu Leu Leu
1070 1075 1080
Leu Ser Trp Glu Ser Gly Lys Val Lys Pro Pro Leu Asn Arg Asn
1085 1090 1095
Ser Val Phe Thr Cys Phe Gln Ile Ser Lys Asp Ser Lys Phe Gln
1100 1105 1110
Ile Gly Glu Phe Ile Phe Glu Lys Val Glu Tyr Gly Ser Asp Thr
1115 1120 1125
Val Thr Tyr Lys Ser Thr Val Thr Thr Lys Leu Val Pro Gly Met
1130 1135 1140
Ile Phe Val Leu Thr Ser His Asn Val Gln Pro Leu Arg Ala Pro
1145 1150 1155
Thr Ile Ala Asn Gln Glu Lys Tyr Ser Ser Ile Tyr Lys Leu His
1160 1165 1170
Pro Ala Phe Asn Val Ser Asp Ala Tyr Ala Asn Leu Val Pro Tyr
1175 1180 1185
Tyr Gln Leu Ile Gly Lys Gln Lys Ile Thr Thr Ile Gln Gly Pro
1190 1195 1200
Pro Gly Ser Gly Lys Ser His Cys Ser Ile Gly Leu Gly Leu Tyr
1205 1210 1215
Tyr Pro Gly Ala Arg Ile Val Phe Val Ala Cys Ala His Ala Ala
1220 1225 1230
Val Asp Ser Leu Cys Ala Lys Ala Met Thr Val Tyr Ser Ile Asp
1235 1240 1245
Lys Cys Thr Arg Ile Ile Pro Ala Arg Ala Arg Val Glu Cys Tyr
1250 1255 1260
Ser Gly Phe Lys Pro Asn Asn Thr Ser Ala Gln Tyr Ile Phe Ser
1265 1270 1275
Thr Val Asn Ala Leu Pro Glu Cys Asn Ala Asp Ile Val Val Val
1280 1285 1290
Asp Glu Val Ser Met Cys Thr Asn Tyr Asp Leu Ser Val Ile Asn
1295 1300 1305
Gln Arg Leu Ser Tyr Lys His Ile Val Tyr Val Gly Asp Pro Gln
1310 1315 1320
Gln Leu Pro Ala Pro Arg Val Met Ile Thr Lys Gly Val Met Glu
1325 1330 1335
Pro Val Asp Tyr Asn Val Val Thr Gln Arg Met Cys Ala Ile Gly
1340 1345 1350
Pro Asp Val Phe Leu His Lys Cys Tyr Arg Cys Pro Ala Glu Ile
1355 1360 1365
Val Asn Thr Val Ser Glu Leu Val Tyr Glu Asn Lys Phe Val Pro
1370 1375 1380
Val Lys Pro Ala Ser Lys Gln Cys Phe Lys Ile Phe Phe Lys Gly
1385 1390 1395
Asn Val Gln Val Asp Asn Gly Ser Ser Ile Asn Arg Lys Gln Leu
1400 1405 1410
Glu Ile Val Lys Leu Phe Leu Val Lys Asn Pro Ser Trp Ser Lys
1415 1420 1425
Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn Tyr Val Ala Ser
1430 1435 1440
Arg Phe Leu Gly Leu Gln Ile Gln Thr Val Asp Ser Ser Gln Gly
1445 1450 1455
Ser Glu Tyr Asp Tyr Val Ile Tyr Ala Gln Thr Ser Asp Thr Ala
1460 1465 1470
His Ala Cys Asn Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala
1475 1480 1485
Lys Lys Gly Ile Phe Cys Val Met Cys Asp Lys Thr Leu Phe Asp
1490 1495 1500
Ser Leu Lys Phe Phe Glu Ile Lys His Ala Asp Leu His Ser Ser
1505 1510 1515
Gln Val Cys Gly Leu Phe Lys Asn Cys Thr Arg Thr Pro Leu Asn
1520 1525 1530
Leu Pro Pro Thr His Ala His Thr Phe Leu Ser Leu Ser Asp Gln
1535 1540 1545
Phe Lys Thr Thr Gly Asp Leu Ala Val Gln Ile Gly Ser Asn Asn
1550 1555 1560
Val Cys Thr Tyr Glu His Val Ile Ser Phe Met Gly Phe Arg Phe
1565 1570 1575
Asp Ile Ser Ile Pro Gly Ser His Ser Leu Phe Cys Thr Arg Asp
1580 1585 1590
Phe Ala Ile Arg Asn Val Arg Gly Trp Leu Gly Met Asp Val Glu
1595 1600 1605
Ser Ala His Val Cys Gly Asp Asn Ile Gly Thr Asn Val Pro Leu
1610 1615 1620
Gln Val Gly Phe Ser Asn Gly Val Asn Phe Val Val Gln Thr Glu
1625 1630 1635
Gly Cys Val Ser Thr Asn Phe Gly Asp Val Ile Lys Pro Val Cys
1640 1645 1650
Ala Lys Ser Pro Pro Gly Glu Gln Phe Arg His Leu Val Pro Phe
1655 1660 1665
Leu Arg Lys Gly Gln Pro Trp Leu Ile Val Arg Arg Arg Ile Val
1670 1675 1680
Gln Met Ile Ser Asp Tyr Leu Ser Asn Leu Ser Asp Ile Leu Val
1685 1690 1695
Phe Val Leu Trp Ala Gly Ser Leu Glu Leu Thr Thr Met Arg Tyr
1700 1705 1710
Phe Val Lys Ile Gly Pro Ile Lys Tyr Cys Tyr Cys Gly Asn Ser
1715 1720 1725
Ala Thr Cys Tyr Asn Ser Val Ser Asn Glu Tyr Cys Cys Phe Lys
1730 1735 1740
His Ala Leu Gly Cys Asp Tyr Val Tyr Asn Pro Tyr Ala Phe Asp
1745 1750 1755
Ile Gln Gln Trp Gly Tyr Val Gly Ser Leu Ser Gln Asn His His
1760 1765 1770
Thr Phe Cys Asn Ile His Arg Asn Glu His Asp Ala Ser Gly Asp
1775 1780 1785
Ala Val Met Thr Arg Cys Leu Ala Val His Asp Cys Phe Val Lys
1790 1795 1800
Asn Val Asp Trp Thr Val Thr Tyr Pro Phe Ile Ala Asn Glu Lys
1805 1810 1815
Phe Ile Asn Gly Cys Gly Arg Asn Val Gln Gly His Val Val Arg
1820 1825 1830
Ala Ala Leu Lys Leu Tyr Lys Pro Ser Val Ile His Asp Ile Gly
1835 1840 1845
Asn Pro Lys Gly Val Arg Cys Ala Val Thr Asp Ala Lys Trp Tyr
1850 1855 1860
Cys Tyr Asp Lys Gln Pro Val Asn Ser Asn Val Lys Leu Leu Asp
1865 1870 1875
Tyr Asp Tyr Ala Thr His Gly Gln Leu Asp Gly Leu Cys Leu Phe
1880 1885 1890
Trp Asn Cys Asn Val Asp Met Tyr Pro Glu Phe Ser Ile Val Cys
1895 1900 1905
Arg Phe Asp Thr Arg Thr Arg Ser Val Phe Asn Leu Glu Gly Val
1910 1915 1920
Asn Gly Gly Ser Leu Tyr Val Asn Lys His Ala Phe His Thr Pro
1925 1930 1935
Ala Tyr Asp Lys Arg Ala Phe Val Lys Leu Lys Pro Met Pro Phe
1940 1945 1950
Phe Tyr Phe Asp Asp Ser Asp Cys Asp Val Val Gln Glu Gln Val
1955 1960 1965
Asn Tyr Val Pro Leu Arg Ala Ser Ser Cys Val Thr Arg Cys Asn
1970 1975 1980
Ile Gly Gly Ala Val Cys Ser Lys His Ala Asn Leu Tyr Gln Lys
1985 1990 1995
Tyr Val Glu Ala Tyr Asn Thr Phe Thr Gln Ala Gly Phe Asn Ile
2000 2005 2010
Trp Val Pro His Ser Phe Asp Val Tyr Asn Leu Trp Gln Ile Phe
2015 2020 2025
Ile Glu Thr Asn Leu Gln Ser Leu Glu Asn Ile Ala Phe Asn Val
2030 2035 2040
Val Lys Lys Gly Cys Phe Thr Gly Val Asp Gly Glu Leu Pro Val
2045 2050 2055
Ala Val Val Asn Asp Lys Val Phe Val Arg Tyr Gly Asp Val Asp
2060 2065 2070
Asn Leu Val Phe Thr Asn Lys Thr Thr Leu Pro Thr Asn Val Ala
2075 2080 2085
Phe Glu Leu Phe Ala Lys Arg Lys Met Gly Leu Thr Pro Pro Leu
2090 2095 2100
Ser Ile Leu Lys Asn Leu Gly Val Val Ala Thr Tyr Lys Phe Val
2105 2110 2115
Leu Trp Asp Tyr Glu Ala Glu Arg Pro Phe Thr Ser Tyr Thr Lys
2120 2125 2130
Ser Val Cys Lys Tyr Thr Asp Phe Asn Glu Asp Val Cys Val Cys
2135 2140 2145
Phe Asp Asn Ser Ile Gln Gly Ser Tyr Glu Arg Phe Thr Leu Thr
2150 2155 2160
Thr Asn Ala Val Leu Phe Ser Thr Val Val Ile Lys Asn Leu Thr
2165 2170 2175
Pro Ile Lys Leu Asn Phe Gly Met Leu Asn Gly Met Pro Val Ser
2180 2185 2190
Ser Ile Lys Ser Asp Lys Gly Val Glu Lys Leu Val Asn Trp Tyr
2195 2200 2205
Thr Tyr Val Arg Lys Asn Gly Gln Phe Gln Asp His Tyr Asp Gly
2210 2215 2220
Phe Tyr Thr Gln Gly Arg Asn Leu Ser Asp Phe Thr Pro Arg Ser
2225 2230 2235
Asp Met Glu Tyr Asp Phe Leu Asn Met Asp Met Gly Val Phe Ile
2240 2245 2250
Asn Lys Tyr Gly Leu Glu Asp Phe Asn Phe Glu His Val Val Tyr
2255 2260 2265
Gly Asp Val Ser Lys Thr Thr Leu Gly Gly Leu His Leu Leu Ile
2270 2275 2280
Ser Gln Phe Arg Leu Ser Lys Met Gly Val Leu Lys Ala Asp Asp
2285 2290 2295
Phe Val Thr Ala Ser Asp Thr Thr Leu Arg Cys Cys Thr Val Thr
2300 2305 2310
Tyr Leu Asn Glu Leu Ser Ser Lys Val Val Cys Thr Tyr Met Asp
2315 2320 2325
Leu Leu Leu Asp Asp Phe Val Thr Ile Leu Lys Ser Leu Asp Leu
2330 2335 2340
Gly Val Ile Ser Lys Val His Glu Val Ile Ile Asp Asn Lys Pro
2345 2350 2355
Tyr Arg Trp Met Leu Trp Cys Lys Asp Asn His Leu Ser Thr Phe
2360 2365 2370
Tyr Pro Gln Leu Gln Ser Ala Glu Trp Lys Cys Gly Tyr Ala Met
2375 2380 2385
Pro Gln Ile Tyr Lys Leu Gln Arg Met Cys Leu Glu Pro Cys Asn
2390 2395 2400
Leu Tyr Asn Tyr Gly Ala Gly Ile Lys Leu Pro Ser Gly Ile Met
2405 2410 2415
Leu Asn Val Val Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Ser
2420 2425 2430
Thr Thr Met Cys Val Pro His Asn Met Arg Val Leu His Tyr Gly
2435 2440 2445
Ala Gly Ser Asp Lys Gly Val Ala Pro Gly Thr Thr Val Leu Lys
2450 2455 2460
Arg Trp Leu Pro Pro Asp Ala Ile Ile Ile Asp Asn Asp Ile Asn
2465 2470 2475
Asp Tyr Val Ser Asp Ala Asp Phe Ser Ile Thr Gly Asp Cys Ala
2480 2485 2490
Thr Val Tyr Leu Glu Asp Lys Phe Asp Leu Leu Ile Ser Asp Met
2495 2500 2505
Tyr Asp Gly Arg Ile Lys Phe Cys Asp Gly Glu Asn Val Ser Lys
2510 2515 2520
Asp Gly Phe Phe Thr Tyr Leu Asn Gly Val Ile Arg Glu Lys Leu
2525 2530 2535
Ala Ile Gly Gly Ser Val Ala Ile Lys Ile Thr Glu Tyr Ser Trp
2540 2545 2550
Asn Lys Tyr Leu Tyr Glu Leu Ile Gln Arg Phe Ala Phe Trp Thr
2555 2560 2565
Leu Phe Cys Thr Ser Val Asn Thr Ser Ser Ser Glu Ala Phe Leu
2570 2575 2580
Ile Gly Ile Asn Tyr Leu Gly Asp Phe Ile Gln Gly Pro Phe Ile
2585 2590 2595
Ala Gly Asn Thr Val His Ala Asn Tyr Ile Phe Trp Arg Asn Ser
2600 2605 2610
Thr Ile Met Ser Leu Ser Tyr Asn Ser Val Leu Asp Leu Ser Lys
2615 2620 2625
Phe Glu Cys Lys His Lys Ala Thr Val Val Val Thr Leu Lys Asp
2630 2635 2640
Ser Asp Val Asn Asp Met Val Leu Ser Leu Ile Lys Ser Gly Arg
2645 2650 2655
Leu Leu Leu Arg Asn Ser Gly Arg Phe Gly Gly Phe Ser Asn His
2660 2665 2670
Leu Val Ser Thr Lys
2675
<210> 9
<211> 226
<212> PRT
<213> EMCR Coronavirus
<400> 9
Met Pro Phe Gly Gly Leu Phe Gln Leu Thr Leu Glu Ser Thr Ile Asn
1 5 10 15
Lys Ser Val Ala Asn Leu Lys Leu Pro Pro His Asp Val Thr Val Leu
20 25 30
Arg Asp Asn Leu Lys Pro Val Thr Thr Leu Ser Thr Ile Thr Ala Tyr
35 40 45
Leu Leu Val Ser Leu Phe Val Thr Tyr Phe Ala Leu Phe Lys Pro Leu
50 55 60
Thr Ala Arg Gly Arg Val Ala Cys Phe Val Leu Lys Leu Leu Thr Leu
65 70 75 80
Ser Val Tyr Val Pro Leu Leu Val Leu Phe Gly Met Tyr Leu Asp Ser
85 90 95
Phe Ile Ile Phe Phe Leu Arg Cys Cys Phe Asp Ser Tyr Met Leu Ala
100 105 110
Ile Met Pro Ile Ser Asn Lys Asn Phe Ser Phe Val Leu Phe Asn Val
115 120 125
Thr Lys Leu Cys Phe Val Ser Gly Lys Cys Trp Tyr Leu Glu Gln Ser
130 135 140
Phe Tyr Glu Asn Arg Phe Ala Ala Ile Tyr Gly Gly Asp His Tyr Val
145 150 155 160
Val Leu Gly Gly Glu Thr Ile Thr Phe Val Ser Phe Asp Asp Leu Tyr
165 170 175
Val Ala Ile Arg Gly Ser Cys Glu Lys Asn Leu Gln Leu Met Arg Lys
180 185 190
Val Asp Leu Tyr Asn Gly Ala Val Ile Tyr Ile Phe Ala Glu Glu Pro
195 200 205
Val Val Gly Ile Val Tyr Ser Ser Gln Leu Tyr Glu Asp Val Pro Ser
210 215 220
Ile Asn
225
<210> 10
<211> 292
<212> DNA
<213> Coronavirus 229E
<220>
<221> 5'UTR
<222> (1)..(292)
<400> 10
acttaagtac cttatctatc tacagataga aaagttgctt tttagacttt gtgtctactt 60
ttctcaacta aacgaaattt ttgctatggc cggcatcttt gatgctggag tcgtagtgta 120
attgaaattt catttgggtt gcaacagttt ggaagcaagt gctgtgtgtc ctagtctaag 180
ggtttcgtgt tccgtcacga gattccattc tacaaacgcc ttactcgagg ttccgtctcg 240
tgtttgtgtg gaagcaaagt tctgtctttg tggaaaccag taactgttcc ta 292
<210> 11
<211> 3951
<212> PRT
<213> avian infectious bronchitis virus
<220>
<221> MISC_FEATURE
<223> ORF1A
<400> 11
Met Ala Ser Ser Leu Lys Gln Gly Val Ser Pro Lys Pro Arg Asp Val
1 5 10 15
Ile Leu Val Ser Lys Asp Ile Pro Glu Gln Leu Cys Asp Ala Leu Phe
20 25 30
Phe Tyr Thr Ser His Asn Pro Lys Asp Tyr Ala Asp Ala Phe Ala Val
35 40 45
Arg Gln Lys Phe Asp Arg Ser Leu Gln Thr Gly Lys Gln Phe Lys Phe
50 55 60
Glu Thr Val Cys Gly Leu Phe Leu Leu Lys Gly Val Asp Lys Ile Thr
65 70 75 80
Pro Gly Val Pro Ala Lys Val Leu Lys Ala Thr Ser Lys Leu Ala Asp
85 90 95
Leu Glu Asp Ile Phe Gly Val Ser Pro Leu Ala Arg Lys Tyr Arg Glu
100 105 110
Leu Leu Lys Thr Ala Cys Gln Trp Ser Leu Thr Val Glu Ala Leu Asp
115 120 125
Val Arg Ala Gln Thr Leu Asp Glu Ile Phe Asp Pro Thr Glu Ile Leu
130 135 140
Trp Leu Gln Val Ala Ala Lys Ile His Val Ser Ser Met Ala Met Arg
145 150 155 160
Arg Leu Val Gly Glu Val Thr Ala Lys Val Met Asp Ala Leu Gly Ser
165 170 175
Asn Leu Ser Ala Leu Phe Gln Ile Val Lys Gln Gln Ile Ala Arg Ile
180 185 190
Phe Gln Lys Ala Leu Ala Ile Phe Glu Asn Val Asn Glu Leu Pro Gln
195 200 205
Arg Ile Ala Ala Leu Lys Met Ala Phe Ala Lys Cys Ala Arg Ser Ile
210 215 220
Thr Val Val Val Val Glu Arg Thr Leu Val Val Lys Glu Phe Ala Gly
225 230 235 240
Thr Cys Leu Ala Ser Ile Asn Gly Ala Val Ala Lys Phe Phe Glu Glu
245 250 255
Leu Pro Asn Gly Phe Met Gly Ser Lys Ile Phe Thr Thr Leu Ala Phe
260 265 270
Phe Lys Glu Ala Ala Val Arg Val Val Glu Asn Ile Pro Asn Ala Pro
275 280 285
Arg Gly Thr Lys Gly Phe Glu Val Val Gly Asn Ala Lys Gly Thr Gln
290 295 300
Val Val Val Arg Gly Met Arg Asn Asp Leu Thr Leu Leu Asp Gln Lys
305 310 315 320
Ala Asp Ile Pro Val Glu Pro Glu Gly Trp Ser Ala Ile Leu Asp Gly
325 330 335
His Leu Cys Tyr Val Phe Arg Ser Gly Asp Arg Phe Tyr Ala Ala Pro
340 345 350
Leu Ser Gly Asn Phe Ala Leu Ser Asp Val His Cys Cys Glu Arg Val
355 360 365
Val Cys Leu Ser Asp Gly Val Thr Pro Glu Ile Asn Asp Gly Leu Ile
370 375 380
Leu Ala Ala Ile Tyr Ser Ser Phe Ser Val Ser Glu Leu Val Thr Ala
385 390 395 400
Leu Lys Lys Gly Glu Pro Phe Lys Phe Leu Gly His Lys Phe Val Tyr
405 410 415
Ala Lys Asp Ala Ala Val Ser Phe Thr Leu Ala Lys Ala Ala Thr Ile
420 425 430
Ala Asp Val Leu Arg Leu Phe Gln Ser Ala Arg Val Ile Ala Glu Asp
435 440 445
Val Trp Ser Ser Phe Thr Glu Lys Ser Phe Glu Phe Trp Lys Leu Ala
450 455 460
Tyr Gly Lys Val Arg Asn Leu Glu Glu Phe Val Lys Thr Tyr Val Cys
465 470 475 480
Lys Ala Gln Met Ser Ile Val Ile Leu Ala Ala Val Leu Gly Glu Asp
485 490 495
Ile Trp His Leu Val Ser Gln Val Ile Tyr Lys Leu Gly Val Leu Phe
500 505 510
Thr Lys Val Val Asp Phe Cys Asp Lys His Trp Lys Gly Phe Cys Val
515 520 525
Gln Leu Lys Arg Ala Lys Leu Ile Val Thr Glu Thr Phe Cys Val Leu
530 535 540
Lys Gly Val Ala Gln His Cys Phe Gln Leu Leu Leu Asp Ala Ile His
545 550 555 560
Ser Leu Tyr Lys Ser Phe Lys Lys Cys Ala Leu Gly Arg Ile His Gly
565 570 575
Asp Leu Leu Phe Trp Lys Gly Gly Val His Lys Ile Val Gln Asp Gly
580 585 590
Asp Glu Ile Trp Phe Asp Ala Ile Asp Ser Val Asp Val Glu Asp Leu
595 600 605
Gly Val Val Gln Glu Lys Ser Ile Asp Phe Glu Val Cys Asp Asp Val
610 615 620
Thr Leu Pro Glu Asn Gln Pro Gly His Met Val Gln Ile Glu Asp Asp
625 630 635 640
Gly Lys Asn Tyr Met Phe Phe Arg Phe Lys Lys Asp Glu Asn Ile Tyr
645 650 655
Tyr Thr Pro Met Ser Gln Leu Gly Ala Ile Asn Val Val Cys Lys Ala
660 665 670
Gly Gly Lys Thr Val Thr Phe Gly Glu Thr Thr Val Gln Glu Ile Pro
675 680 685
Pro Pro Asp Val Val Pro Ile Lys Val Ser Ile Glu Cys Cys Gly Glu
690 695 700
Pro Trp Asn Thr Ile Phe Lys Lys Ala Tyr Lys Glu Pro Ile Glu Val
705 710 715 720
Asp Thr Asp Leu Thr Val Glu Gln Leu Leu Ser Val Ile Tyr Glu Lys
725 730 735
Met Cys Asp Asp Leu Lys Leu Phe Pro Glu Ala Pro Glu Pro Pro Pro
740 745 750
Phe Glu Asn Val Ala Leu Val Asp Lys Asn Gly Lys Asp Leu Asp Cys
755 760 765
Ile Lys Ser Cys His Leu Ile Tyr Arg Asp Tyr Glu Ser Asp Asp Asp
770 775 780
Ile Glu Glu Glu Asp Ala Glu Glu Cys Asp Thr Asp Ser Gly Glu Ala
785 790 795 800
Glu Glu Cys Asp Thr Asn Ser Glu Cys Glu Glu Glu Asp Glu Asp Thr
805 810 815
Lys Val Leu Ala Leu Ile Gln Asp Pro Ala Ser Ile Lys Tyr Pro Leu
820 825 830
Pro Leu Asp Glu Asp Tyr Ser Val Tyr Asn Gly Cys Ile Val His Lys
835 840 845
Asp Ala Leu Asp Val Val Asn Leu Pro Ser Gly Glu Glu Thr Phe Val
850 855 860
Val Asn Asn Cys Phe Glu Gly Ala Val Lys Pro Leu Pro Gln Lys Val
865 870 875 880
Val Asp Val Leu Gly Asp Trp Gly Glu Ala Val Asp Ala Gln Glu Gln
885 890 895
Leu Cys Gln Gln Glu Pro Leu Gln His Thr Phe Glu Glu Pro Val Glu
900 905 910
Asn Ser Thr Gly Ser Ser Lys Thr Met Thr Glu Gln Val Val Val Glu
915 920 925
Asp Gln Glu Leu Pro Val Val Glu Gln Asp Gln Asp Val Val Val Tyr
930 935 940
Thr Pro Thr Asp Leu Glu Val Ala Lys Glu Thr Ala Glu Glu Val Asp
945 950 955 960
Glu Phe Ile Leu Ile Phe Ala Val Pro Lys Glu Glu Val Val Ser Gln
965 970 975
Lys Asp Gly Ala Gln Ile Lys Gln Glu Pro Ile Gln Val Val Lys Pro
980 985 990
Gln Arg Glu Lys Lys Ala Lys Lys Phe Lys Val Lys Pro Ala Thr Cys
995 1000 1005
Glu Lys Pro Lys Phe Leu Glu Tyr Lys Thr Cys Val Gly Asp Leu
1010 1015 1020
Thr Val Val Ile Ala Lys Ala Leu Asp Glu Phe Lys Glu Phe Cys
1025 1030 1035
Ile Val Asn Ala Ala Asn Glu His Met Thr His Gly Ser Gly Val
1040 1045 1050
Ala Lys Ala Ile Ala Asp Phe Cys Gly Leu Asp Phe Val Glu Tyr
1055 1060 1065
Cys Glu Asp Tyr Val Lys Lys His Gly Pro Gln Gln Arg Leu Val
1070 1075 1080
Thr Pro Ser Phe Val Lys Gly Ile Gln Cys Val Asn Asn Val Val
1085 1090 1095
Gly Pro Arg His Gly Asp Asn Asn Leu His Glu Lys Leu Val Ala
1100 1105 1110
Ala Tyr Lys Asn Val Leu Val Asp Gly Val Val Asn Tyr Val Val
1115 1120 1125
Pro Val Leu Ser Leu Gly Ile Phe Gly Val Asp Phe Lys Met Ser
1130 1135 1140
Ile Asp Ala Met Arg Glu Ala Phe Glu Gly Cys Thr Ile Arg Val
1145 1150 1155
Leu Leu Phe Ser Leu Ser Gln Glu His Ile Asp Tyr Phe Asp Val
1160 1165 1170
Thr Cys Lys Gln Lys Thr Ile Tyr Leu Thr Glu Asp Gly Val Lys
1175 1180 1185
Tyr Arg Ser Ile Val Leu Lys Pro Gly Asp Ser Leu Gly Gln Phe
1190 1195 1200
Gly Gln Val Tyr Ala Lys Asn Lys Ile Val Phe Thr Ala Asp Asp
1205 1210 1215
Val Glu Asp Lys Glu Ile Leu Tyr Val Pro Thr Thr Asp Lys Ser
1220 1225 1230
Ile Leu Glu Tyr Tyr Gly Leu Asp Ala Gln Lys Tyr Val Ile Tyr
1235 1240 1245
Leu Gln Thr Leu Ala Gln Lys Trp Asn Val Gln Tyr Arg Asp Asn
1250 1255 1260
Phe Leu Ile Leu Glu Trp Arg Asp Gly Asn Cys Trp Ile Ser Ser
1265 1270 1275
Ala Ile Val Leu Leu Gln Ala Ala Lys Ile Arg Phe Lys Gly Phe
1280 1285 1290
Leu Thr Glu Ala Trp Ala Lys Leu Leu Gly Gly Asp Pro Thr Asp
1295 1300 1305
Phe Val Ala Trp Cys Tyr Ala Ser Cys Thr Ala Lys Val Gly Asp
1310 1315 1320
Phe Ser Asp Ala Asn Trp Leu Leu Ala Asn Leu Ala Glu His Phe
1325 1330 1335
Asp Ala Asp Tyr Thr Asn Ala Phe Leu Lys Lys Arg Val Ser Cys
1340 1345 1350
Asn Cys Gly Ile Lys Ser Tyr Glu Leu Arg Gly Leu Glu Ala Cys
1355 1360 1365
Ile Gln Pro Val Arg Ala Thr Asn Leu Leu His Phe Lys Thr Gln
1370 1375 1380
Tyr Ser Asn Cys Pro Thr Cys Gly Ala Asn Asn Thr Asp Glu Val
1385 1390 1395
Ile Glu Ala Ser Leu Pro Tyr Leu Leu Leu Phe Ala Thr Asp Gly
1400 1405 1410
Pro Ala Thr Val Asp Cys Asp Glu Asp Ala Val Gly Thr Val Val
1415 1420 1425
Phe Val Gly Ser Thr Asn Ser Gly His Cys Tyr Thr Gln Ala Ala
1430 1435 1440
Gly Gln Ala Phe Asp Asn Leu Ala Lys Asp Arg Lys Phe Gly Lys
1445 1450 1455
Lys Ser Pro Tyr Ile Thr Ala Met Tyr Thr Arg Phe Ala Phe Lys
1460 1465 1470
Asn Glu Thr Ser Leu Pro Val Ala Lys Gln Ser Lys Gly Lys Ser
1475 1480 1485
Lys Ser Val Lys Glu Asp Val Ser Asn Leu Ala Thr Ser Ser Lys
1490 1495 1500
Ala Ser Phe Asp Asn Leu Thr Asp Phe Glu Gln Trp Tyr Asp Ser
1505 1510 1515
Asn Ile Tyr Glu Ser Leu Lys Val Gln Glu Ser Pro Asp Asn Phe
1520 1525 1530
Asp Lys Tyr Val Ser Phe Thr Thr Lys Glu Asp Ser Lys Leu Pro
1535 1540 1545
Leu Thr Leu Lys Val Arg Gly Ile Lys Ser Val Val Asp Phe Arg
1550 1555 1560
Ser Lys Asp Gly Phe Ile Tyr Lys Leu Thr Pro Asp Thr Asp Glu
1565 1570 1575
Asn Ser Lys Ala Pro Val Tyr Tyr Pro Val Leu Asp Ala Ile Ser
1580 1585 1590
Leu Lys Ala Ile Trp Val Glu Gly Asn Ala Asn Phe Val Val Gly
1595 1600 1605
His Pro Asn Tyr Tyr Ser Lys Ser Leu His Ile Pro Thr Phe Trp
1610 1615 1620
Glu Asn Ala Glu Asn Phe Val Lys Met Gly Asp Lys Ile Gly Gly
1625 1630 1635
Val Thr Met Gly Leu Trp Arg Ala Glu His Leu Asn Lys Pro Asn
1640 1645 1650
Leu Glu Arg Ile Phe Asn Ile Ala Lys Lys Ala Ile Val Gly Ser
1655 1660 1665
Ser Val Val Thr Thr Gln Cys Gly Lys Leu Ile Gly Lys Ala Ala
1670 1675 1680
Thr Phe Ile Ala Asp Lys Val Gly Gly Gly Val Val Arg Asn Ile
1685 1690 1695
Thr Asp Ser Ile Lys Gly Leu Cys Gly Ile Thr Arg Gly His Phe
1700 1705 1710
Glu Arg Lys Met Ser Pro Gln Phe Leu Lys Thr Leu Met Phe Phe
1715 1720 1725
Leu Phe Tyr Phe Leu Lys Ala Ser Val Lys Ser Val Val Ala Ser
1730 1735 1740
Tyr Lys Thr Val Leu Cys Lys Val Val Leu Ala Thr Leu Leu Ile
1745 1750 1755
Val Trp Phe Val Tyr Thr Ser Asn Pro Val Met Phe Thr Gly Ile
1760 1765 1770
Arg Val Leu Asp Phe Leu Phe Glu Gly Ser Leu Cys Gly Pro Tyr
1775 1780 1785
Lys Asp Tyr Gly Lys Asp Ser Phe Asp Val Leu Arg Tyr Cys Ala
1790 1795 1800
Asp Asp Phe Ile Cys Arg Val Cys Leu His Asp Lys Asp Ser Leu
1805 1810 1815
His Leu Tyr Lys His Ala Tyr Ser Val Glu Gln Val Tyr Lys Asp
1820 1825 1830
Ala Ala Ser Gly Phe Ile Phe Asn Trp Asn Trp Leu Tyr Leu Val
1835 1840 1845
Phe Leu Ile Leu Phe Val Lys Pro Val Ala Gly Phe Val Ile Ile
1850 1855 1860
Cys Tyr Cys Val Lys Tyr Leu Val Leu Asn Ser Thr Val Leu Gln
1865 1870 1875
Thr Gly Val Cys Phe Leu Asp Trp Phe Val Gln Thr Val Phe Ser
1880 1885 1890
His Phe Asn Phe Met Gly Ala Gly Phe Tyr Phe Trp Leu Phe Tyr
1895 1900 1905
Lys Ile Tyr Ile Gln Val His His Ile Leu Tyr Cys Lys Asp Val
1910 1915 1920
Thr Cys Glu Val Cys Lys Arg Val Ala Arg Ser Asn Arg Gln Glu
1925 1930 1935
Val Ser Val Val Val Gly Gly Arg Lys Gln Ile Val His Val Tyr
1940 1945 1950
Thr Asn Ser Gly Tyr Asn Phe Cys Lys Arg His Asn Trp Tyr Cys
1955 1960 1965
Arg Asn Cys Asp Asp Tyr Gly His Gln Asn Thr Phe Met Ser Pro
1970 1975 1980
Glu Val Ala Gly Glu Leu Ser Glu Lys Leu Lys Arg His Val Lys
1985 1990 1995
Pro Thr Ala Tyr Ala Tyr His Val Val Asp Glu Ala Cys Leu Val
2000 2005 2010
Asp Asp Phe Val Asn Leu Lys Tyr Lys Ala Ala Thr Pro Gly Lys
2015 2020 2025
Asp Ser Ala Ser Ser Ala Val Lys Cys Phe Ser Val Thr Asp Phe
2030 2035 2040
Leu Lys Lys Ala Val Phe Leu Lys Glu Ala Leu Lys Cys Glu Gln
2045 2050 2055
Ile Ser Asn Asp Gly Phe Ile Val Cys Asn Thr Gln Ser Ala His
2060 2065 2070
Ala Leu Glu Glu Ala Lys Asn Ala Ala Ile Tyr Tyr Ala Gln Tyr
2075 2080 2085
Leu Cys Lys Pro Ile Leu Ile Leu Asp Gln Ala Leu Tyr Glu Gln
2090 2095 2100
Leu Val Val Glu Pro Val Ser Lys Ser Val Ile Asp Lys Val Cys
2105 2110 2115
Ser Ile Leu Ser Ser Ile Ile Ser Val Asp Thr Ala Ala Leu Asn
2120 2125 2130
Tyr Lys Ala Gly Thr Leu Arg Asp Ala Leu Leu Ser Ile Thr Lys
2135 2140 2145
Asp Glu Glu Ala Val Asp Met Ala Ile Phe Cys His Asn His Asp
2150 2155 2160
Val Asp Tyr Thr Gly Asp Gly Phe Thr Asn Val Ile Pro Ser Tyr
2165 2170 2175
Gly Ile Asp Thr Gly Lys Leu Thr Pro Arg Asp Arg Gly Phe Leu
2180 2185 2190
Ile Asn Ala Asp Ala Ser Ile Ala Asn Leu Arg Val Lys Asn Ala
2195 2200 2205
Pro Pro Val Val Trp Lys Phe Ser Glu Leu Ile Lys Leu Ser Asp
2210 2215 2220
Ser Cys Leu Lys Tyr Leu Ile Ser Ala Thr Val Lys Ser Gly Val
2225 2230 2235
Arg Phe Phe Ile Thr Lys Ser Gly Ala Lys Gln Val Ile Ala Cys
2240 2245 2250
His Thr Gln Lys Leu Leu Val Glu Lys Lys Ala Gly Gly Ile Val
2255 2260 2265
Ser Gly Thr Phe Lys Cys Phe Lys Ser Tyr Phe Lys Trp Leu Leu
2270 2275 2280
Ile Phe Tyr Ile Leu Phe Thr Ala Cys Cys Ser Gly Tyr Tyr Tyr
2285 2290 2295
Met Glu Val Ser Lys Ser Phe Val His Pro Met Tyr Asp Val Asn
2300 2305 2310
Ser Thr Leu His Val Glu Gly Phe Lys Val Ile Asp Lys Gly Val
2315 2320 2325
Leu Arg Glu Ile Val Pro Glu Asp Thr Cys Phe Ser Asn Lys Phe
2330 2335 2340
Val Asn Phe Asp Ala Phe Trp Gly Arg Pro Tyr Asp Asn Ser Arg
2345 2350 2355
Asn Cys Pro Ile Val Thr Ala Val Ile Asp Gly Asp Gly Thr Val
2360 2365 2370
Ala Thr Gly Val Pro Gly Phe Val Ser Trp Val Met Asp Gly Val
2375 2380 2385
Met Phe Ile His Met Thr Gln Thr Glu Arg Lys Pro Trp Tyr Ile
2390 2395 2400
Pro Thr Trp Phe Asn Arg Glu Ile Val Gly Tyr Thr Gln Asp Ser
2405 2410 2415
Ile Ile Thr Glu Gly Ser Phe Tyr Thr Ser Ile Ala Leu Phe Ser
2420 2425 2430
Ala Arg Cys Leu Tyr Leu Thr Ala Ser Asn Thr Pro Gln Leu Tyr
2435 2440 2445
Cys Phe Asn Gly Asp Asn Asp Ala Pro Gly Ala Leu Pro Phe Gly
2450 2455 2460
Ser Ile Ile Pro His Arg Val Tyr Phe Gln Pro Asn Gly Val Arg
2465 2470 2475
Leu Ile Val Pro Gln Gln Ile Leu His Thr Pro Tyr Val Val Lys
2480 2485 2490
Phe Val Ser Asp Ser Tyr Cys Arg Gly Ser Val Cys Glu Tyr Thr
2495 2500 2505
Arg Pro Gly Tyr Cys Val Ser Leu Asn Pro Gln Trp Val Leu Phe
2510 2515 2520
Asn Asp Glu Tyr Thr Ser Lys Pro Gly Val Phe Cys Gly Ser Thr
2525 2530 2535
Val Arg Glu Leu Met Phe Ser Met Val Ser Thr Phe Phe Thr Gly
2540 2545 2550
Val Asn Pro Asn Ile Tyr Met Gln Leu Ala Thr Met Phe Leu Ile
2555 2560 2565
Leu Val Val Val Val Leu Ile Phe Ala Met Val Ile Lys Phe Gln
2570 2575 2580
Gly Val Phe Lys Ala Tyr Ala Thr Thr Val Phe Ile Thr Met Leu
2585 2590 2595
Val Trp Val Ile Asn Ala Phe Ile Leu Cys Val His Ser Tyr Asn
2600 2605 2610
Ser Val Leu Ala Val Ile Leu Leu Val Leu Tyr Cys Tyr Ala Ser
2615 2620 2625
Leu Val Thr Ser Arg Asn Thr Val Ile Ile Met His Cys Trp Leu
2630 2635 2640
Val Phe Thr Phe Gly Leu Ile Val Pro Thr Trp Leu Ala Cys Cys
2645 2650 2655
Tyr Leu Gly Phe Ile Ile Tyr Met Tyr Thr Pro Leu Phe Leu Trp
2660 2665 2670
Cys Tyr Gly Thr Thr Lys Asn Thr Arg Lys Leu Tyr Asp Gly Asn
2675 2680 2685
Glu Phe Val Gly Asn Tyr Asp Leu Ala Ala Lys Ser Thr Phe Val
2690 2695 2700
Ile Arg Gly Ser Glu Phe Val Lys Leu Thr Asn Glu Ile Gly Asp
2705 2710 2715
Lys Phe Glu Ala Tyr Leu Ser Ala Tyr Ala Arg Leu Lys Tyr Tyr
2720 2725 2730
Ser Gly Thr Gly Ser Glu Gln Asp Tyr Leu Gln Ala Cys Arg Ala
2735 2740 2745
Trp Leu Ala Tyr Ala Leu Asp Gln Tyr Arg Asn Ser Gly Val Glu
2750 2755 2760
Ile Val Tyr Thr Pro Pro Arg Tyr Ser Ile Gly Val Ser Arg Leu
2765 2770 2775
Gln Ser Gly Phe Lys Lys Leu Val Ser Pro Ser Ser Ala Val Glu
2780 2785 2790
Lys Cys Ile Val Ser Val Ser Tyr Arg Gly Asn Asn Leu Asn Gly
2795 2800 2805
Leu Trp Leu Gly Asp Thr Ile Tyr Cys Pro Arg His Val Leu Gly
2810 2815 2820
Lys Phe Ser Gly Asp Gln Trp Asn Asp Val Leu Asn Leu Ala Asn
2825 2830 2835
Asn His Glu Phe Glu Val Thr Thr Gln His Gly Val Thr Leu Asn
2840 2845 2850
Val Val Ser Arg Arg Leu Lys Gly Ala Val Leu Ile Leu Gln Thr
2855 2860 2865
Ala Val Ala Asn Ala Glu Thr Pro Lys Tyr Lys Phe Ile Lys Ala
2870 2875 2880
Asn Cys Gly Asp Ser Phe Thr Ile Ala Cys Ala Tyr Gly Gly Thr
2885 2890 2895
Val Val Gly Leu Tyr Pro Val Thr Met Arg Ser Asn Gly Thr Ile
2900 2905 2910
Arg Ala Ser Phe Leu Ala Gly Ala Cys Gly Ser Val Gly Phe Asn
2915 2920 2925
Ile Glu Lys Gly Val Val Asn Phe Phe Tyr Met His His Leu Glu
2930 2935 2940
Leu Pro Asn Ala Leu His Thr Gly Thr Asp Leu Met Gly Glu Phe
2945 2950 2955
Tyr Gly Gly Tyr Val Asp Glu Glu Val Ala Gln Arg Val Pro Pro
2960 2965 2970
Asp Asn Leu Val Thr Asn Asn Ile Val Ala Trp Leu Tyr Ala Ala
2975 2980 2985
Ile Ile Ser Val Lys Glu Ser Ser Phe Ser Leu Pro Lys Trp Leu
2990 2995 3000
Glu Ser Thr Thr Val Ser Val Asp Asp Tyr Asn Lys Trp Ala Gly
3005 3010 3015
Asp Asn Gly Phe Thr Pro Phe Ser Thr Ser Thr Ala Ile Thr Lys
3020 3025 3030
Leu Ser Ala Ile Thr Gly Val Asp Val Cys Lys Leu Leu Arg Thr
3035 3040 3045
Ile Met Val Lys Asn Ser Gln Trp Gly Gly Asp Pro Ile Leu Gly
3050 3055 3060
Gln Tyr Asn Phe Glu Asp Glu Leu Thr Pro Glu Ser Val Phe Asn
3065 3070 3075
Gln Ile Gly Gly Val Arg Leu Gln Ser Ser Phe Val Arg Lys Ala
3080 3085 3090
Thr Ser Trp Phe Trp Ser Arg Cys Val Leu Ala Cys Phe Leu Phe
3095 3100 3105
Val Leu Cys Ala Ile Val Leu Phe Thr Ala Val Pro Leu Lys Phe
3110 3115 3120
Tyr Val Tyr Ala Ala Val Ile Leu Leu Met Ala Val Leu Phe Ile
3125 3130 3135
Ser Phe Thr Val Lys His Val Met Ala Tyr Met Asp Thr Phe Leu
3140 3145 3150
Leu Pro Thr Leu Ile Thr Val Ile Ile Gly Val Cys Ala Glu Val
3155 3160 3165
Pro Phe Ile Tyr Asn Thr Leu Ile Ser Gln Val Val Ile Phe Leu
3170 3175 3180
Ser Gln Trp Tyr Asp Pro Val Val Phe Asp Thr Met Val Pro Trp
3185 3190 3195
Met Phe Leu Pro Leu Val Leu Tyr Thr Ala Phe Lys Cys Val Gln
3200 3205 3210
Gly Cys Tyr Met Asn Ser Phe Asn Thr Ser Leu Leu Met Leu Tyr
3215 3220 3225
Gln Phe Val Lys Leu Gly Phe Val Ile Tyr Thr Ser Ser Asn Thr
3230 3235 3240
Leu Thr Ala Tyr Thr Glu Gly Asn Trp Glu Leu Phe Phe Glu Leu
3245 3250 3255
Val His Thr Thr Val Leu Ala Asn Val Ser Ser Asn Ser Leu Ile
3260 3265 3270
Gly Leu Phe Val Phe Lys Cys Ala Lys Trp Met Leu Tyr Tyr Cys
3275 3280 3285
Asn Ala Thr Tyr Leu Asn Asn Tyr Val Leu Met Ala Val Met Val
3290 3295 3300
Asn Cys Ile Gly Trp Leu Cys Thr Cys Tyr Phe Gly Leu Tyr Trp
3305 3310 3315
Trp Val Asn Lys Val Phe Gly Leu Thr Leu Gly Lys Tyr Asn Phe
3320 3325 3330
Lys Val Ser Val Asp Gln Tyr Arg Tyr Met Cys Leu His Lys Ile
3335 3340 3345
Asn Pro Pro Lys Thr Val Trp Glu Val Phe Ser Thr Asn Ile Leu
3350 3355 3360
Ile Gln Gly Ile Gly Gly Asp Arg Val Leu Pro Ile Ala Thr Val
3365 3370 3375
Gln Ala Lys Leu Ser Asp Val Lys Cys Thr Thr Val Val Leu Met
3380 3385 3390
Gln Leu Leu Thr Lys Leu Asn Val Glu Ala Asn Ser Lys Met His
3395 3400 3405
Val Tyr Leu Val Glu Leu His Asn Lys Ile Leu Ala Ser Asp Asp
3410 3415 3420
Val Gly Glu Cys Met Asp Asn Leu Leu Gly Met Leu Ile Thr Leu
3425 3430 3435
Phe Cys Ile Asp Ser Thr Ile Asp Leu Ser Glu Tyr Cys Asp Asp
3440 3445 3450
Ile Leu Lys Arg Ser Thr Val Leu Gln Ser Val Thr Gln Glu Phe
3455 3460 3465
Ser His Ile Pro Ser Tyr Ala Glu Tyr Glu Arg Ala Lys Asn Leu
3470 3475 3480
Tyr Glu Lys Val Leu Val Asp Ser Lys Asn Gly Gly Val Thr Gln
3485 3490 3495
Gln Glu Leu Ala Ala Tyr Arg Lys Ala Ala Asn Ile Ala Lys Ser
3500 3505 3510
Val Phe Asp Arg Asp Leu Ala Val Gln Lys Lys Leu Asp Ser Met
3515 3520 3525
Ala Glu Arg Ala Met Thr Thr Met Tyr Lys Glu Ala Arg Val Thr
3530 3535 3540
Asp Arg Arg Ala Lys Leu Val Ser Ser Leu His Ala Leu Leu Phe
3545 3550 3555
Ser Met Leu Lys Lys Ile Asp Ser Glu Lys Leu Asn Val Leu Phe
3560 3565 3570
Asp Gln Ala Ser Ser Gly Val Val Pro Leu Ala Thr Val Pro Ile
3575 3580 3585
Val Cys Ser Asn Lys Leu Thr Leu Val Ile Pro Asp Pro Glu Thr
3590 3595 3600
Trp Val Lys Cys Val Glu Gly Val His Val Thr Tyr Ser Thr Val
3605 3610 3615
Val Trp Asn Ile Asp Thr Val Ile Asp Ala Asp Gly Thr Glu Leu
3620 3625 3630
His Pro Thr Ser Thr Gly Ser Gly Leu Thr Tyr Cys Ile Ser Gly
3635 3640 3645
Ala Asn Ile Ala Trp Pro Leu Lys Val Asn Leu Thr Arg Asn Gly
3650 3655 3660
His Asn Lys Val Asp Val Val Leu Gln Asn Asn Glu Leu Met Pro
3665 3670 3675
His Gly Val Lys Thr Lys Ala Cys Val Ala Gly Val Asp Gln Ala
3680 3685 3690
His Cys Ser Val Glu Ser Lys Cys Tyr Tyr Thr Asn Ile Ser Gly
3695 3700 3705
Asn Ser Val Val Ala Ala Ile Thr Ser Ser Asn Pro Asn Leu Lys
3710 3715 3720
Val Ala Ser Phe Leu Asn Glu Ala Gly Asn Gln Ile Tyr Val Asp
3725 3730 3735
Leu Asp Pro Pro Cys Lys Phe Gly Met Lys Val Gly Val Lys Val
3740 3745 3750
Glu Val Val Tyr Leu Tyr Phe Ile Lys Asn Thr Arg Ser Ile Val
3755 3760 3765
Arg Gly Met Val Leu Gly Ala Ile Ser Asn Val Val Val Leu Gln
3770 3775 3780
Ser Lys Gly His Glu Thr Glu Glu Val Asp Ala Val Gly Ile Leu
3785 3790 3795
Ser Leu Cys Ser Phe Ala Val Asp Pro Ala Asp Thr Tyr Cys Lys
3800 3805 3810
Tyr Val Ala Ala Gly Asn Gln Pro Leu Gly Asn Cys Val Lys Met
3815 3820 3825
Leu Thr Val His Asn Gly Ser Gly Phe Ala Ile Thr Ser Lys Pro
3830 3835 3840
Ser Pro Thr Pro Asp Gln Asp Ser Tyr Gly Gly Ala Ser Val Cys
3845 3850 3855
Leu Tyr Cys Arg Ala His Ile Ala His Pro Gly Ser Val Gly Asn
3860 3865 3870
Leu Asp Gly Arg Cys Gln Phe Lys Gly Ser Phe Val Gln Ile Pro
3875 3880 3885
Thr Thr Glu Lys Asp Pro Val Gly Phe Cys Leu Arg Asn Lys Val
3890 3895 3900
Cys Thr Val Cys Gln Cys Trp Ile Gly Tyr Gly Cys Gln Cys Asp
3905 3910 3915
Ser Leu Arg Gln Pro Lys Ser Ser Val Gln Ser Val Ala Gly Ala
3920 3925 3930
Ser Asp Phe Asp Lys Asn Tyr Leu Asn Gly Tyr Gly Val Ala Val
3935 3940 3945
Arg Leu Gly
3950
<210> 12
<211> 4085
<212> PRT
<213> human coronavirsu 229E
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 12
Met Ala Cys Asn Arg Val Thr Leu Ala Val Ala Ser Asp Ser Glu Ile
1 5 10 15
Ser Ala Asn Gly Cys Ser Thr Ile Ala Gln Ala Val Arg Arg Tyr Ser
20 25 30
Glu Ala Ala Ser Asn Gly Phe Arg Ala Cys Arg Phe Val Ser Leu Asp
35 40 45
Leu Gln Asp Cys Ile Val Gly Ile Ala Asp Asp Thr Tyr Val Met Gly
50 55 60
Leu His Gly Asn Gln Thr Leu Phe Cys Asn Ile Met Lys Phe Ser Asp
65 70 75 80
Arg Pro Phe Met Leu His Gly Trp Leu Val Phe Ser Asn Ser Asn Tyr
85 90 95
Leu Leu Glu Glu Phe Asp Val Val Phe Gly Lys Arg Gly Gly Gly Asn
100 105 110
Val Thr Tyr Thr Asp Gln Tyr Leu Cys Gly Ala Asp Gly Lys Pro Val
115 120 125
Met Ser Glu Asp Leu Trp Gln Phe Val Asp His Phe Gly Glu Asn Glu
130 135 140
Glu Ile Ile Ile Asn Gly His Thr Tyr Val Cys Ala Trp Leu Thr Lys
145 150 155 160
Arg Lys Pro Leu Asp Tyr Lys Arg Gln Asn Asn Leu Ala Ile Glu Glu
165 170 175
Ile Glu Tyr Val His Gly Asp Ala Leu His Thr Leu Arg Asn Gly Ser
180 185 190
Val Leu Glu Met Ala Lys Glu Val Lys Thr Ser Ser Lys Val Val Leu
195 200 205
Ser Asp Ala Leu Asp Lys Leu Tyr Lys Val Phe Gly Ser Pro Val Met
210 215 220
Thr Asn Gly Ser Asn Ile Leu Glu Ala Phe Thr Lys Pro Val Phe Ile
225 230 235 240
Ser Ala Leu Val Gln Cys Thr Cys Gly Thr Lys Ser Trp Ser Val Gly
245 250 255
Asp Trp Thr Gly Phe Lys Ser Ser Cys Cys Asn Val Ile Ser Asn Lys
260 265 270
Leu Cys Val Val Pro Gly Asn Val Lys Pro Gly Asp Ala Val Ile Thr
275 280 285
Thr Gln Gln Ala Gly Ala Gly Ile Lys Tyr Phe Cys Gly Met Thr Leu
290 295 300
Lys Phe Val Ala Asn Ile Glu Gly Val Ser Val Trp Arg Val Ile Ala
305 310 315 320
Leu Gln Ser Val Asp Cys Phe Val Ala Ser Ser Thr Phe Val Glu Glu
325 330 335
Glu His Val Asn Arg Met Asp Thr Phe Cys Phe Asn Val Arg Asn Ser
340 345 350
Val Thr Asp Glu Cys Arg Leu Ala Met Leu Gly Ala Glu Met Thr Ser
355 360 365
Asn Val Arg Arg Gln Val Ala Ser Gly Val Ile Asp Ile Ser Thr Gly
370 375 380
Trp Phe Asp Val Tyr Asp Asp Ile Phe Ala Glu Ser Lys Pro Trp Phe
385 390 395 400
Val Arg Lys Ala Glu Asp Ile Phe Gly Pro Cys Trp Ser Ala Leu Ala
405 410 415
Ser Ala Leu Lys Gln Leu Lys Val Thr Thr Gly Glu Leu Val Arg Phe
420 425 430
Val Lys Ser Ile Cys Asn Ser Ala Val Ala Val Val Gly Gly Thr Ile
435 440 445
Gln Ile Leu Ala Ser Val Pro Glu Lys Phe Leu Asn Ala Phe Asp Val
450 455 460
Phe Val Thr Ala Ile Gln Thr Val Phe Asp Cys Ala Val Glu Thr Cys
465 470 475 480
Thr Ile Ala Gly Lys Ala Phe Asp Lys Val Phe Asp Tyr Val Leu Leu
485 490 495
Asp Asn Ala Leu Val Lys Leu Val Thr Thr Lys Leu Lys Gly Val Arg
500 505 510
Glu Arg Gly Leu Asn Lys Val Lys Tyr Ala Thr Val Val Val Gly Ser
515 520 525
Thr Glu Glu Val Lys Ser Ser Arg Val Glu Arg Ser Thr Ala Val Leu
530 535 540
Thr Ile Ala Asn Asn Tyr Ser Lys Leu Phe Asp Glu Gly Tyr Thr Val
545 550 555 560
Val Ile Gly Asp Val Ala Tyr Phe Val Ser Asp Gly Tyr Phe Arg Leu
565 570 575
Met Ala Ser Pro Asn Ser Val Leu Thr Thr Ala Val Tyr Lys Pro Leu
580 585 590
Phe Ala Phe Asn Val Asn Val Met Gly Thr Arg Pro Glu Lys Phe Pro
595 600 605
Thr Thr Val Thr Cys Glu Asn Leu Glu Ser Ala Val Leu Phe Val Asn
610 615 620
Asp Lys Ile Thr Glu Phe Gln Leu Asp Tyr Ser Ile Asp Val Ile Asp
625 630 635 640
Asn Glu Ile Ile Val Lys Pro Asn Ile Ser Leu Cys Val Pro Leu Tyr
645 650 655
Val Arg Asp Tyr Val Asp Lys Trp Asp Asp Phe Cys Arg Gln Tyr Ser
660 665 670
Asn Glu Ser Trp Phe Glu Asp Asp Tyr Arg Ala Phe Ile Ser Val Leu
675 680 685
Asp Ile Thr Asp Ala Ala Val Lys Ala Ala Glu Ser Lys Ala Phe Val
690 695 700
Asp Thr Ile Val Pro Pro Cys Pro Ser Ile Leu Lys Val Ile Asp Gly
705 710 715 720
Gly Lys Ile Trp Asn Gly Val Ile Lys Asn Val Asn Ser Val Arg Asp
725 730 735
Trp Leu Lys Ser Leu Lys Leu Asn Leu Thr Gln Gln Gly Leu Leu Gly
740 745 750
Thr Cys Ala Lys Arg Phe Lys Arg Trp Leu Gly Ile Leu Leu Glu Ala
755 760 765
Tyr Asn Ala Phe Leu Asp Thr Val Val Ser Thr Val Lys Ile Gly Gly
770 775 780
Leu Thr Phe Lys Thr Tyr Ala Phe Asp Lys Pro Tyr Ile Val Ile Arg
785 790 795 800
Asp Ile Val Cys Lys Val Glu Asn Lys Thr Glu Ala Glu Trp Ile Glu
805 810 815
Leu Phe Pro His Asn Asp Arg Ile Lys Ser Phe Ser Thr Phe Glu Ser
820 825 830
Ala Tyr Met Pro Ile Ala Asp Pro Thr His Phe Asp Ile Glu Glu Val
835 840 845
Glu Leu Leu Asp Ala Glu Phe Val Glu Pro Gly Cys Gly Gly Ile Leu
850 855 860
Ala Val Ile Asp Glu His Val Phe Tyr Lys Lys Asp Gly Val Tyr Tyr
865 870 875 880
Pro Ser Asn Gly Thr Asn Ile Leu Pro Val Ala Phe Thr Lys Ala Ala
885 890 895
Gly Gly Lys Val Ser Phe Ser Asp Asp Val Glu Val Lys Asp Ile Glu
900 905 910
Pro Val Tyr Arg Val Lys Leu Cys Phe Glu Phe Glu Asp Glu Lys Leu
915 920 925
Val Asp Val Cys Glu Lys Ala Ile Gly Lys Lys Ile Lys His Glu Gly
930 935 940
Asp Trp Asp Ser Phe Cys Lys Thr Ile Gln Ser Ala Leu Ser Val Val
945 950 955 960
Ser Cys Tyr Val Asn Leu Pro Thr Tyr Tyr Ile Tyr Asp Glu Glu Gly
965 970 975
Gly Asn Asp Leu Ser Leu Pro Val Met Ile Ser Glu Trp Pro Leu Ser
980 985 990
Val Gln Gln Ala Gln Gln Glu Ala Thr Leu Pro Asp Ile Ala Glu Asp
995 1000 1005
Val Val Asp Gln Val Glu Glu Val Asn Ser Ile Phe Asp Ile Glu
1010 1015 1020
Thr Val Asp Val Lys His Asp Val Ser Pro Phe Glu Met Pro Phe
1025 1030 1035
Glu Glu Leu Asn Gly Leu Lys Ile Leu Lys Gln Leu Asp Asn Asn
1040 1045 1050
Cys Trp Val Asn Ser Val Met Leu Gln Ile Gln Leu Thr Gly Ile
1055 1060 1065
Leu Asp Gly Asp Tyr Ala Met Gln Phe Phe Lys Met Gly Arg Val
1070 1075 1080
Ala Lys Met Ile Glu Arg Cys Tyr Thr Ala Glu Gln Cys Ile Arg
1085 1090 1095
Gly Ala Met Gly Asp Val Gly Leu Cys Met Tyr Arg Leu Leu Lys
1100 1105 1110
Asp Leu His Thr Gly Phe Met Val Met Asp Tyr Lys Cys Ser Cys
1115 1120 1125
Thr Ser Gly Arg Leu Glu Glu Ser Gly Ala Val Leu Phe Cys Thr
1130 1135 1140
Pro Thr Lys Lys Ala Phe Pro Tyr Gly Thr Cys Leu Asn Cys Asn
1145 1150 1155
Ala Pro Arg Met Cys Thr Ile Arg Gln Leu Gln Gly Thr Ile Ile
1160 1165 1170
Phe Val Gln Gln Lys Pro Glu Pro Val Asn Pro Val Ser Phe Val
1175 1180 1185
Val Lys Pro Val Cys Ser Ser Ile Phe Arg Gly Ala Val Ser Cys
1190 1195 1200
Gly His Tyr Gln Thr Asn Ile Tyr Ser Gln Asn Leu Cys Val Asp
1205 1210 1215
Gly Phe Gly Val Asn Lys Ile Gln Pro Trp Thr Asn Asp Ala Leu
1220 1225 1230
Asn Thr Ile Cys Ile Lys Asp Ala Asp Tyr Asn Ala Lys Val Glu
1235 1240 1245
Ile Ser Val Thr Pro Ile Lys Asn Thr Val Asp Thr Thr Pro Lys
1250 1255 1260
Glu Glu Phe Val Val Lys Glu Lys Leu Asn Ala Phe Leu Val His
1265 1270 1275
Asp Asn Val Ala Phe Tyr Gln Gly Asp Val Asp Thr Val Val Asn
1280 1285 1290
Gly Val Asp Phe Asp Phe Ile Val Asn Ala Ala Asn Glu Asn Leu
1295 1300 1305
Ala His Gly Gly Gly Leu Ala Lys Ala Leu Asp Val Tyr Thr Lys
1310 1315 1320
Gly Lys Leu Gln Arg Leu Ser Lys Glu His Ile Gly Leu Ala Gly
1325 1330 1335
Lys Val Lys Val Gly Thr Gly Val Met Val Glu Cys Asp Ser Leu
1340 1345 1350
Arg Ile Phe Asn Val Val Gly Pro Arg Lys Gly Lys His Glu Arg
1355 1360 1365
Asp Leu Leu Ile Lys Ala Tyr Asn Thr Ile Asn Asn Glu Gln Gly
1370 1375 1380
Thr Pro Leu Thr Pro Ile Leu Ser Cys Gly Ile Phe Gly Ile Lys
1385 1390 1395
Leu Glu Thr Ser Leu Glu Val Leu Leu Asp Val Cys Asn Thr Lys
1400 1405 1410
Glu Val Lys Val Phe Val Tyr Thr Asp Thr Glu Val Cys Lys Val
1415 1420 1425
Lys Asp Phe Val Ser Gly Leu Val Asn Val Gln Lys Val Glu Gln
1430 1435 1440
Pro Lys Ile Glu Pro Lys Pro Val Ser Val Ile Lys Val Ala Pro
1445 1450 1455
Lys Pro Tyr Arg Val Asp Gly Lys Phe Ser Tyr Phe Thr Glu Asp
1460 1465 1470
Leu Leu Cys Val Ala Asp Asp Lys Pro Ile Val Leu Phe Thr Asp
1475 1480 1485
Ser Met Leu Thr Leu Asp Asp Arg Gly Leu Ala Leu Asp Asn Ala
1490 1495 1500
Leu Ser Gly Val Leu Ser Ala Ala Ile Lys Asp Cys Val Asp Ile
1505 1510 1515
Asn Lys Ala Ile Pro Ser Gly Asn Leu Ile Lys Phe Asp Ile Gly
1520 1525 1530
Ser Val Val Val Tyr Met Cys Val Val Pro Ser Glu Lys Asp Lys
1535 1540 1545
His Leu Asp Asn Asn Val Gln Arg Cys Thr Arg Lys Leu Asn Arg
1550 1555 1560
Leu Met Cys Asp Ile Val Cys Thr Ile Pro Ala Asp Tyr Ile Leu
1565 1570 1575
Pro Leu Val Leu Ser Ser Leu Thr Cys Asn Val Ser Phe Val Gly
1580 1585 1590
Glu Leu Lys Ala Ala Glu Ala Lys Val Ile Thr Ile Lys Val Thr
1595 1600 1605
Glu Asp Gly Val Asn Val His Asp Val Thr Val Thr Thr Asp Lys
1610 1615 1620
Ser Phe Glu Gln Gln Val Gly Val Ile Ala Asp Lys Asp Lys Asp
1625 1630 1635
Leu Ser Gly Ala Val Pro Ser Asp Leu Asn Thr Ser Glu Leu Leu
1640 1645 1650
Thr Lys Ala Ile Asp Val Asp Trp Val Glu Phe Tyr Gly Phe Lys
1655 1660 1665
Asp Ala Val Thr Phe Ala Thr Val Asp His Ser Ala Phe Ala Tyr
1670 1675 1680
Glu Ser Ala Val Val Asn Gly Ile Arg Val Leu Lys Thr Ser Asp
1685 1690 1695
Asn Asn Cys Trp Val Asn Ala Val Cys Ile Ala Leu Gln Tyr Ser
1700 1705 1710
Lys Pro His Phe Ile Ser Gln Gly Leu Asp Ala Ala Trp Asn Lys
1715 1720 1725
Phe Val Leu Gly Asp Val Glu Ile Phe Val Ala Phe Val Tyr Tyr
1730 1735 1740
Val Ala Arg Leu Met Lys Gly Asp Lys Gly Asp Ala Glu Asp Thr
1745 1750 1755
Leu Thr Lys Leu Ser Lys Tyr Leu Ala Asn Glu Ala Gln Val Gln
1760 1765 1770
Leu Glu His Tyr Ser Ser Cys Val Glu Cys Asp Ala Lys Phe Lys
1775 1780 1785
Asn Ser Val Ala Ser Ile Asn Ser Ala Ile Val Cys Ala Ser Val
1790 1795 1800
Lys Arg Asp Gly Val Gln Val Gly Tyr Cys Val His Gly Ile Lys
1805 1810 1815
Tyr Tyr Ser Arg Val Arg Ser Val Arg Gly Arg Ala Ile Ile Val
1820 1825 1830
Ser Val Glu Gln Leu Glu Pro Cys Ala Gln Ser Arg Leu Leu Ser
1835 1840 1845
Gly Val Ala Tyr Thr Ala Phe Ser Gly Pro Val Asp Lys Gly His
1850 1855 1860
Tyr Thr Val Tyr Asp Thr Ala Lys Lys Ser Met Tyr Asp Gly Asp
1865 1870 1875
Arg Phe Val Lys His Asp Leu Ser Leu Leu Ser Val Thr Ser Val
1880 1885 1890
Val Met Val Gly Gly Tyr Val Ala Pro Val Asn Thr Val Lys Pro
1895 1900 1905
Lys Pro Val Ile Asn Gln Leu Asp Glu Lys Ala Gln Lys Phe Phe
1910 1915 1920
Asp Phe Gly Asp Phe Leu Ile His Asn Phe Val Ile Phe Phe Thr
1925 1930 1935
Trp Leu Leu Ser Met Phe Thr Leu Cys Lys Thr Ala Val Thr Thr
1940 1945 1950
Gly Asp Val Lys Ile Met Ala Lys Ala Pro Gln Arg Thr Gly Val
1955 1960 1965
Val Leu Lys Arg Ser Leu Lys Tyr Asn Leu Lys Ala Ser Ala Ala
1970 1975 1980
Val Leu Lys Ser Lys Trp Trp Leu Leu Ala Lys Phe Thr Lys Leu
1985 1990 1995
Leu Leu Leu Ile Tyr Thr Leu Tyr Ser Val Val Leu Leu Cys Val
2000 2005 2010
Arg Phe Gly Pro Phe Asn Phe Cys Ser Glu Thr Val Asn Gly Tyr
2015 2020 2025
Ala Lys Ser Asn Phe Val Lys Asp Asp Tyr Cys Asp Gly Ser Leu
2030 2035 2040
Gly Cys Lys Met Cys Leu Phe Gly Tyr Gln Glu Leu Ser Gln Phe
2045 2050 2055
Ser His Leu Asp Val Val Trp Lys His Ile Thr Asp Pro Leu Phe
2060 2065 2070
Ser Asn Met Gln Pro Phe Ile Val Met Val Leu Leu Leu Ile Phe
2075 2080 2085
Gly Asp Asn Tyr Leu Arg Cys Phe Leu Leu Tyr Phe Val Ala Gln
2090 2095 2100
Met Ile Ser Thr Val Gly Val Phe Leu Gly Tyr Lys Glu Thr Asn
2105 2110 2115
Trp Phe Leu His Phe Ile Pro Phe Asp Val Ile Cys Asp Glu Leu
2120 2125 2130
Leu Val Thr Val Ile Val Ile Lys Val Ile Ser Phe Val Arg His
2135 2140 2145
Val Leu Phe Gly Cys Glu Asn Pro Asp Cys Ile Ala Cys Ser Lys
2150 2155 2160
Ser Ala Arg Leu Lys Arg Phe Pro Val Asn Thr Ile Val Asn Gly
2165 2170 2175
Val Gln Arg Ser Phe Tyr Val Asn Ala Asn Gly Gly Ser Lys Phe
2180 2185 2190
Cys Lys Lys His Arg Phe Phe Cys Val Asp Cys Asp Ser Tyr Gly
2195 2200 2205
Tyr Gly Ser Thr Phe Ile Thr Pro Glu Val Ser Arg Glu Leu Gly
2210 2215 2220
Asn Ile Thr Lys Thr Asn Val Gln Pro Thr Gly Pro Ala Tyr Val
2225 2230 2235
Met Ile Asp Lys Val Glu Phe Glu Asn Gly Phe Tyr Arg Leu Tyr
2240 2245 2250
Ser Cys Glu Thr Phe Trp Arg Tyr Asn Phe Asp Ile Thr Glu Ser
2255 2260 2265
Lys Tyr Ser Cys Lys Glu Val Phe Lys Asn Cys Asn Val Leu Asp
2270 2275 2280
Asp Phe Ile Val Phe Asn Asn Asn Gly Thr Asn Val Thr Gln Val
2285 2290 2295
Lys Asn Ala Ser Val Tyr Phe Ser Gln Leu Leu Cys Arg Pro Ile
2300 2305 2310
Lys Leu Val Asp Ser Glu Leu Leu Ser Thr Leu Ser Val Asp Phe
2315 2320 2325
Asn Gly Val Leu His Lys Ala Tyr Ile Asp Val Leu Arg Asn Ser
2330 2335 2340
Phe Gly Lys Asp Leu Asn Ala Asn Met Ser Leu Ala Glu Cys Lys
2345 2350 2355
Arg Ala Leu Gly Leu Ser Ile Ser Asp His Glu Phe Thr Ser Ala
2360 2365 2370
Ile Ser Asn Ala His Arg Cys Asp Val Leu Leu Ser Asp Leu Ser
2375 2380 2385
Phe Asn Asn Phe Val Ser Ser Tyr Ala Lys Pro Glu Glu Lys Leu
2390 2395 2400
Ser Ala Tyr Asp Leu Ala Cys Cys Met Arg Ala Gly Ala Lys Val
2405 2410 2415
Val Asn Ala Asn Val Leu Thr Lys Asp Gln Thr Pro Ile Val Trp
2420 2425 2430
His Ala Lys Asp Phe Asn Ser Leu Ser Ala Glu Gly Arg Lys Tyr
2435 2440 2445
Ile Val Lys Thr Ser Lys Ala Lys Gly Leu Thr Phe Leu Leu Thr
2450 2455 2460
Ile Asn Glu Asn Gln Ala Val Thr Gln Ile Pro Ala Thr Ser Ile
2465 2470 2475
Val Ala Lys Gln Gly Ala Gly Asp Ala Gly His Ser Leu Thr Trp
2480 2485 2490
Leu Trp Leu Leu Cys Gly Leu Val Cys Leu Ile Gln Phe Tyr Leu
2495 2500 2505
Cys Phe Phe Met Pro Tyr Phe Met Tyr Asp Ile Val Ser Ser Phe
2510 2515 2520
Glu Gly Tyr Asp Phe Lys Tyr Ile Glu Asn Gly Gln Leu Lys Asn
2525 2530 2535
Phe Glu Ala Pro Leu Lys Cys Val Arg Asn Val Phe Glu Asn Phe
2540 2545 2550
Glu Asp Trp His Tyr Ala Lys Phe Gly Phe Thr Pro Leu Asn Lys
2555 2560 2565
Gln Ser Cys Pro Ile Val Val Gly Val Ser Glu Ile Val Asn Thr
2570 2575 2580
Val Ala Gly Ile Pro Ser Asn Val Tyr Leu Val Gly Lys Thr Leu
2585 2590 2595
Ile Phe Thr Leu Gln Ala Ala Phe Gly Asn Ala Gly Val Cys Tyr
2600 2605 2610
Asp Ile Phe Gly Val Thr Thr Pro Glu Lys Cys Ile Phe Thr Ser
2615 2620 2625
Ala Cys Thr Arg Leu Glu Gly Leu Gly Gly Asn Asn Val Tyr Cys
2630 2635 2640
Tyr Asn Thr Ala Leu Met Glu Gly Ser Leu Pro Tyr Ser Ser Ile
2645 2650 2655
Gln Ala Asn Ala Tyr Tyr Lys Tyr Asp Asn Gly Asn Phe Ile Lys
2660 2665 2670
Leu Pro Glu Val Ile Ala Gln Gly Phe Gly Phe Arg Thr Val Arg
2675 2680 2685
Thr Ile Ala Thr Lys Tyr Cys Arg Val Gly Glu Cys Val Glu Ser
2690 2695 2700
Asn Ala Gly Val Cys Phe Gly Phe Asp Lys Trp Phe Val Asn Asp
2705 2710 2715
Gly Arg Val Ala Asn Gly Tyr Val Cys Gly Thr Gly Leu Trp Asn
2720 2725 2730
Leu Val Phe Asn Ile Leu Ser Met Phe Ser Ser Ser Phe Ser Val
2735 2740 2745
Ala Ala Met Ser Gly Gln Ile Leu Leu Asn Cys Ala Leu Gly Ala
2750 2755 2760
Phe Ala Ile Phe Cys Cys Phe Leu Val Thr Lys Phe Arg Arg Met
2765 2770 2775
Phe Gly Asp Leu Ser Val Gly Val Cys Thr Val Val Val Ala Val
2780 2785 2790
Leu Leu Asn Asn Val Ser Tyr Ile Val Thr Gln Asn Leu Val Thr
2795 2800 2805
Met Ile Ala Tyr Ala Ile Leu Tyr Phe Phe Ala Thr Arg Ser Leu
2810 2815 2820
Arg Tyr Ala Trp Ile Trp Cys Ala Ala Tyr Leu Ile Ala Tyr Ile
2825 2830 2835
Ser Phe Ala Pro Trp Trp Leu Cys Ala Trp Tyr Phe Leu Ala Met
2840 2845 2850
Leu Thr Gly Leu Leu Pro Ser Leu Leu Lys Leu Lys Val Ser Thr
2855 2860 2865
Asn Leu Phe Glu Gly Asp Lys Phe Val Gly Thr Phe Glu Ser Ala
2870 2875 2880
Ala Ala Gly Thr Phe Val Ile Asp Met Arg Ser Tyr Glu Lys Leu
2885 2890 2895
Ala Asn Ser Ile Ser Pro Glu Lys Leu Lys Ser Tyr Ala Ala Ser
2900 2905 2910
Tyr Asn Arg Tyr Lys Tyr Tyr Ser Gly Asn Ala Asn Glu Ala Asp
2915 2920 2925
Tyr Arg Cys Ala Cys Tyr Ala Tyr Leu Ala Lys Ala Met Leu Asp
2930 2935 2940
Phe Ser Arg Asp His Asn Asp Ile Leu Tyr Thr Pro Pro Thr Val
2945 2950 2955
Ser Tyr Gly Ser Thr Leu Gln Ala Gly Leu Arg Lys Met Ala Gln
2960 2965 2970
Pro Ser Gly Phe Val Glu Lys Cys Val Val Arg Val Cys Tyr Gly
2975 2980 2985
Asn Thr Val Leu Asn Gly Leu Trp Leu Gly Asp Ile Val Tyr Cys
2990 2995 3000
Pro Arg His Val Ile Ala Ser Asn Thr Thr Ser Ala Ile Asp Tyr
3005 3010 3015
Asp His Glu Tyr Ser Ile Met Arg Leu His Asn Phe Ser Ile Ile
3020 3025 3030
Ser Gly Thr Ala Phe Leu Gly Val Val Gly Ala Thr Met His Gly
3035 3040 3045
Val Thr Leu Lys Ile Lys Val Ser Gln Thr Asn Met His Thr Pro
3050 3055 3060
Arg His Ser Phe Arg Thr Leu Lys Ser Gly Glu Gly Phe Asn Ile
3065 3070 3075
Leu Ala Cys Tyr Asp Gly Cys Ala Gln Gly Val Phe Gly Val Asn
3080 3085 3090
Met Arg Thr Asn Trp Thr Ile Arg Gly Ser Phe Ile Asn Gly Ala
3095 3100 3105
Cys Gly Ser Pro Gly Tyr Asn Leu Lys Asn Gly Glu Val Glu Phe
3110 3115 3120
Val Tyr Met His Gln Ile Glu Leu Gly Ser Gly Ser His Val Gly
3125 3130 3135
Ser Ser Phe Asp Gly Val Met Tyr Gly Gly Phe Glu Asp Gln Pro
3140 3145 3150
Asn Leu Gln Val Glu Ser Ala Asn Gln Met Leu Thr Val Asn Val
3155 3160 3165
Val Ala Phe Leu Tyr Ala Ala Ile Leu Asn Gly Cys Thr Trp Trp
3170 3175 3180
Leu Lys Gly Glu Lys Leu Phe Val Glu His Tyr Asn Glu Trp Ala
3185 3190 3195
Gln Ala Asn Gly Phe Thr Ala Met Asn Gly Glu Asp Ala Phe Ser
3200 3205 3210
Ile Leu Ala Ala Lys Thr Gly Val Cys Val Glu Arg Leu Leu His
3215 3220 3225
Ala Ile Gln Val Leu Asn Asn Gly Phe Gly Gly Lys Gln Ile Leu
3230 3235 3240
Gly Tyr Ser Ser Leu Asn Asp Glu Phe Ser Ile Asn Glu Val Val
3245 3250 3255
Lys Gln Met Phe Gly Val Asn Leu Gln Ser Gly Lys Thr Thr Ser
3260 3265 3270
Met Phe Lys Ser Ile Ser Leu Phe Ala Gly Phe Phe Val Met Phe
3275 3280 3285
Trp Ala Glu Leu Phe Val Tyr Thr Thr Thr Ile Trp Val Asn Pro
3290 3295 3300
Gly Phe Leu Thr Pro Phe Met Ile Leu Leu Val Ala Leu Ser Leu
3305 3310 3315
Cys Leu Thr Phe Val Val Lys His Lys Val Leu Phe Leu Gln Val
3320 3325 3330
Phe Leu Leu Pro Ser Ile Ile Val Ala Ala Ile Gln Asn Cys Ala
3335 3340 3345
Trp Asp Tyr His Val Thr Lys Val Leu Ala Glu Lys Phe Asp Tyr
3350 3355 3360
Asn Val Ser Val Met Gln Met Asp Ile Gln Gly Phe Val Asn Ile
3365 3370 3375
Phe Ile Cys Leu Phe Val Ala Leu Leu His Thr Trp Arg Phe Ala
3380 3385 3390
Lys Glu Arg Cys Thr His Trp Cys Thr Tyr Leu Phe Ser Leu Ile
3395 3400 3405
Ala Val Leu Tyr Thr Ala Leu Tyr Ser Tyr Asp Tyr Val Ser Leu
3410 3415 3420
Leu Val Met Leu Leu Cys Ala Ile Ser Asn Glu Trp Tyr Ile Gly
3425 3430 3435
Ala Ile Ile Phe Arg Ile Cys Arg Phe Gly Val Ala Phe Leu Pro
3440 3445 3450
Val Glu Tyr Val Ser Tyr Phe Asp Gly Val Lys Thr Val Leu Leu
3455 3460 3465
Phe Tyr Met Leu Leu Gly Phe Val Ser Cys Met Tyr Tyr Gly Leu
3470 3475 3480
Leu Tyr Trp Ile Asn Arg Phe Cys Lys Cys Thr Leu Gly Val Tyr
3485 3490 3495
Asp Phe Cys Val Ser Pro Ala Glu Phe Lys Tyr Met Val Ala Asn
3500 3505 3510
Gly Leu Asn Ala Pro Asn Gly Pro Phe Asp Ala Leu Phe Leu Ser
3515 3520 3525
Phe Lys Leu Met Gly Ile Gly Gly Pro Arg Thr Ile Lys Val Ser
3530 3535 3540
Thr Val Gln Ser Lys Leu Thr Asp Leu Lys Cys Thr Asn Val Val
3545 3550 3555
Leu Met Gly Ile Leu Ser Asn Met Asn Ile Ala Ser Asn Ser Lys
3560 3565 3570
Glu Trp Ala Tyr Cys Val Glu Met His Asn Lys Ile Asn Leu Cys
3575 3580 3585
Asp Asp Pro Glu Thr Ala Gln Glu Leu Leu Leu Ala Leu Leu Ala
3590 3595 3600
Phe Phe Leu Ser Lys His Ser Asp Phe Gly Leu Gly Asp Leu Val
3605 3610 3615
Asp Ser Tyr Phe Glu Asn Asp Ser Ile Leu Gln Ser Val Ala Ser
3620 3625 3630
Ser Phe Val Gly Met Pro Ser Phe Val Ala Tyr Glu Thr Ala Arg
3635 3640 3645
Gln Glu Tyr Glu Asn Ala Val Ala Asn Gly Ser Ser Pro Gln Ile
3650 3655 3660
Ile Lys Gln Leu Lys Lys Ala Met Asn Val Ala Lys Ala Glu Phe
3665 3670 3675
Asp Arg Glu Ser Ser Val Gln Lys Lys Ile Asn Arg Met Ala Glu
3680 3685 3690
Gln Ala Ala Ala Ala Met Tyr Lys Glu Ala Arg Ala Val Asn Arg
3695 3700 3705
Lys Ser Lys Val Val Ser Ala Met His Ser Leu Leu Phe Gly Met
3710 3715 3720
Leu Arg Arg Leu Asp Met Ser Ser Val Asp Thr Ile Leu Asn Met
3725 3730 3735
Ala Arg Asn Gly Val Val Pro Leu Ser Val Ile Pro Ala Thr Ser
3740 3745 3750
Ala Ala Arg Leu Val Val Val Val Pro Asp His Asp Ser Phe Val
3755 3760 3765
Lys Met Met Val Asp Gly Phe Val His Tyr Ala Gly Val Val Trp
3770 3775 3780
Thr Leu Gln Glu Val Lys Asp Asn Asp Gly Lys Asn Val His Leu
3785 3790 3795
Lys Asp Val Thr Lys Glu Asn Gln Glu Ile Leu Val Trp Pro Leu
3800 3805 3810
Ile Leu Thr Cys Glu Arg Val Val Lys Leu Gln Asn Asn Glu Ile
3815 3820 3825
Met Pro Gly Lys Met Lys Val Lys Ala Thr Lys Gly Glu Gly Asp
3830 3835 3840
Gly Gly Ile Thr Ser Glu Gly Asn Ala Leu Tyr Asn Asn Glu Gly
3845 3850 3855
Gly Arg Ala Phe Met Tyr Ala Tyr Val Thr Thr Lys Pro Gly Met
3860 3865 3870
Lys Tyr Val Lys Trp Glu His Asp Ser Gly Val Val Thr Val Glu
3875 3880 3885
Leu Glu Pro Pro Cys Arg Phe Val Ile Asp Thr Pro Thr Gly Pro
3890 3895 3900
Gln Ile Lys Tyr Leu Tyr Phe Val Lys Asn Leu Asn Asn Leu Arg
3905 3910 3915
Arg Gly Ala Val Leu Gly Tyr Ile Gly Ala Thr Val Arg Leu Gln
3920 3925 3930
Ala Gly Lys Gln Thr Glu Phe Val Ser Asn Ser His Leu Leu Thr
3935 3940 3945
His Cys Ser Phe Ala Val Asp Pro Ala Ala Ala Tyr Leu Asp Ala
3950 3955 3960
Val Lys Gln Gly Ala Lys Pro Val Gly Asn Cys Val Lys Met Leu
3965 3970 3975
Thr Asn Gly Ser Gly Ser Gly Gln Ala Ile Thr Cys Thr Ile Asp
3980 3985 3990
Ser Asn Thr Thr Gln Asp Thr Tyr Gly Gly Ala Ser Val Cys Ile
3995 4000 4005
Tyr Cys Arg Ala His Val Ala His Pro Thr Met Asp Gly Phe Cys
4010 4015 4020
Gln Tyr Lys Gly Lys Trp Val Gln Val Pro Ile Gly Thr Asn Asp
4025 4030 4035
Pro Ile Arg Phe Cys Leu Glu Asn Thr Val Cys Lys Val Cys Gly
4040 4045 4050
Cys Trp Leu Asn His Gly Cys Thr Cys Asp Arg Thr Ala Ile Gln
4055 4060 4065
Ser Phe Asp Asn Ser Tyr Leu Asn Glu Ser Gly Ala Leu Val Pro
4070 4075 4080
Leu Asp
4085
<210> 13
<211> 4017
<212> PRT
<213> transmissible gastroeneteritis virus
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 13
Met Ser Ser Lys Gln Phe Lys Ile Leu Val Asn Glu Asp Tyr Gln Val
1 5 10 15
Asn Val Pro Ser Leu Pro Ile Arg Asp Val Leu Gln Glu Ile Lys Tyr
20 25 30
Cys Tyr Arg Asn Gly Phe Glu Gly Tyr Val Phe Val Pro Glu Tyr Cys
35 40 45
Arg Asp Leu Val Asp Cys Asp Arg Lys Asp His Tyr Val Ile Gly Val
50 55 60
Leu Gly Asn Gly Val Ser Asp Leu Lys Pro Val Leu Leu Thr Glu Pro
65 70 75 80
Ser Val Met Leu Gln Gly Phe Ile Val Arg Ala Asn Cys Asn Gly Val
85 90 95
Leu Glu Asp Phe Asp Leu Lys Ile Ala Arg Thr Gly Arg Gly Ala Ile
100 105 110
Tyr Val Asp Gln Tyr Met Cys Gly Ala Asp Gly Lys Pro Val Ile Glu
115 120 125
Gly Asp Phe Lys Asp Tyr Phe Gly Asp Glu Asp Ile Ile Glu Phe Glu
130 135 140
Gly Glu Glu Tyr His Cys Ala Trp Thr Thr Val Arg Asp Glu Lys Pro
145 150 155 160
Leu Asn Gln Gln Thr Leu Phe Thr Ile Gln Glu Ile Gln Tyr Asn Leu
165 170 175
Asp Ile Pro His Lys Leu Pro Asn Cys Ala Thr Arg His Val Ala Pro
180 185 190
Pro Val Lys Lys Asn Ser Lys Ile Val Leu Ser Glu Asp Tyr Lys Lys
195 200 205
Leu Tyr Asp Ile Phe Gly Ser Pro Phe Met Gly Asn Gly Asp Cys Leu
210 215 220
Ser Lys Cys Phe Asp Thr Leu His Phe Ile Ala Ala Thr Leu Arg Cys
225 230 235 240
Pro Cys Gly Ser Glu Ser Ser Gly Val Gly Asp Trp Thr Gly Phe Lys
245 250 255
Thr Ala Cys Cys Gly Leu Ser Gly Lys Val Lys Gly Val Thr Leu Gly
260 265 270
Asp Ile Lys Pro Gly Asp Ala Val Val Thr Ser Met Ser Ala Gly Lys
275 280 285
Gly Val Lys Phe Phe Ala Asn Cys Val Leu Gln Tyr Ala Gly Asp Val
290 295 300
Glu Gly Val Ser Ile Trp Lys Val Ile Lys Thr Phe Thr Val Asp Glu
305 310 315 320
Thr Val Cys Thr Pro Gly Phe Glu Gly Glu Leu Asn Asp Phe Ile Lys
325 330 335
Pro Glu Ser Lys Ser Leu Val Ala Cys Ser Val Lys Arg Ala Phe Ile
340 345 350
Thr Gly Asp Ile Asp Asp Ala Val His Asp Cys Ile Ile Thr Gly Lys
355 360 365
Leu Asp Leu Ser Thr Asn Leu Phe Gly Asn Val Gly Leu Leu Phe Lys
370 375 380
Lys Thr Pro Trp Phe Val Gln Lys Cys Gly Ala Leu Phe Val Asp Ala
385 390 395 400
Trp Lys Val Val Glu Glu Leu Cys Gly Ser Leu Thr Leu Thr Tyr Lys
405 410 415
Gln Ile Tyr Glu Val Val Ala Ser Leu Cys Thr Ser Ala Phe Thr Ile
420 425 430
Val Asn Tyr Lys Pro Thr Phe Val Val Pro Asp Asn Arg Val Lys Asp
435 440 445
Leu Val Asp Lys Cys Val Lys Val Leu Val Lys Ala Phe Asp Val Phe
450 455 460
Thr Gln Ile Ile Thr Ile Ala Gly Ile Glu Ala Lys Cys Phe Val Leu
465 470 475 480
Gly Ala Lys Tyr Leu Leu Phe Asn Asn Ala Leu Val Lys Leu Val Ser
485 490 495
Val Lys Ile Leu Gly Lys Lys Gln Lys Gly Leu Glu Cys Ala Phe Phe
500 505 510
Ala Thr Ser Leu Val Gly Ala Thr Val Asn Val Thr Pro Lys Arg Thr
515 520 525
Glu Thr Ala Thr Ile Ser Leu Asn Lys Val Asp Asp Val Val Ala Pro
530 535 540
Gly Glu Gly Tyr Ile Val Ile Val Gly Asp Met Ala Phe Tyr Lys Ser
545 550 555 560
Gly Glu Tyr Tyr Phe Met Met Ser Ser Pro Asn Phe Val Leu Thr Asn
565 570 575
Asn Val Phe Lys Ala Val Lys Val Pro Ser Tyr Asp Ile Val Tyr Asp
580 585 590
Val Asp Asn Asp Thr Lys Ser Lys Met Ile Ala Lys Leu Gly Ser Ser
595 600 605
Phe Glu Tyr Asp Gly Asp Ile Asp Ala Ala Ile Val Lys Val Asn Glu
610 615 620
Leu Leu Ile Glu Phe Arg Gln Gln Ser Leu Cys Phe Arg Ala Phe Lys
625 630 635 640
Asp Asp Lys Ser Ile Phe Val Glu Ala Tyr Phe Lys Lys Tyr Lys Met
645 650 655
Pro Ala Cys Leu Ala Lys His Ile Gly Leu Trp Asn Ile Ile Lys Lys
660 665 670
Asp Ser Cys Lys Arg Gly Phe Leu Asn Leu Phe Asn His Leu Asn Glu
675 680 685
Leu Glu Asp Ile Lys Glu Thr Asn Ile Gln Ala Ile Lys Asn Ile Leu
690 695 700
Cys Pro Asp Pro Leu Leu Asp Leu Asp Tyr Gly Ala Ile Trp Tyr Asn
705 710 715 720
Cys Met Pro Gly Cys Ser Asp Pro Ser Val Leu Gly Ser Val Gln Leu
725 730 735
Leu Ile Gly Asn Gly Val Lys Val Val Cys Asp Gly Cys Lys Gly Phe
740 745 750
Ala Asn Gln Leu Ser Lys Gly Tyr Asn Lys Leu Cys Asn Ala Ala Arg
755 760 765
Asn Asp Ile Glu Ile Gly Gly Ile Pro Phe Ser Thr Phe Lys Thr Pro
770 775 780
Thr Asn Thr Phe Ile Glu Met Thr Asp Ala Ile Tyr Ser Val Ile Glu
785 790 795 800
Gln Gly Lys Ala Leu Ser Phe Arg Asp Ala Asp Val Pro Val Val Asp
805 810 815
Asn Gly Thr Ile Ser Thr Ala Asp Trp Ser Glu Pro Ile Leu Leu Glu
820 825 830
Pro Ala Glu Tyr Val Lys Pro Lys Asn Asn Gly Asn Val Ile Val Ile
835 840 845
Ala Gly Tyr Thr Phe Tyr Lys Asp Glu Asp Glu His Phe Tyr Pro Tyr
850 855 860
Gly Phe Gly Lys Ile Val Gln Arg Met Tyr Asn Lys Met Gly Gly Gly
865 870 875 880
Asp Lys Thr Val Ser Phe Ser Glu Glu Val Asp Val Gln Glu Ile Ala
885 890 895
Pro Val Thr Arg Val Lys Leu Glu Phe Glu Phe Asp Asn Glu Ile Val
900 905 910
Thr Gly Val Leu Glu Arg Ala Ile Gly Thr Arg Tyr Lys Phe Thr Gly
915 920 925
Thr Thr Trp Glu Glu Phe Glu Glu Ser Ile Ser Glu Glu Leu Asp Ala
930 935 940
Ile Phe Asp Thr Leu Ala Asn Gln Gly Val Glu Leu Glu Gly Tyr Phe
945 950 955 960
Ile Tyr Asp Thr Cys Gly Gly Phe Asp Ile Lys Asn Pro Asp Gly Ile
965 970 975
Met Ile Ser Gln Tyr Asp Ile Asn Ile Thr Ala Asp Glu Lys Ser Glu
980 985 990
Val Ser Ala Ser Ser Glu Glu Glu Glu Val Glu Ser Val Glu Glu Asp
995 1000 1005
Pro Glu Asn Glu Ile Val Glu Ala Ser Glu Gly Ala Glu Gly Thr
1010 1015 1020
Ser Ser Gln Glu Glu Val Glu Thr Val Glu Val Ala Asp Ile Thr
1025 1030 1035
Ser Thr Glu Glu Asp Val Asp Ile Val Glu Val Ser Ala Lys Asp
1040 1045 1050
Asp Pro Trp Ala Ala Ala Val Asp Val Gln Glu Ala Glu Gln Phe
1055 1060 1065
Asn Pro Ser Leu Pro Pro Phe Lys Thr Thr Asn Leu Asn Gly Lys
1070 1075 1080
Ile Ile Leu Lys Gln Gly Asp Asn Asn Cys Trp Ile Asn Ala Cys
1085 1090 1095
Cys Tyr Gln Leu Gln Ala Phe Asp Phe Phe Asn Asn Glu Ala Trp
1100 1105 1110
Glu Lys Phe Lys Lys Gly Asp Val Met Asp Phe Val Asn Leu Cys
1115 1120 1125
Tyr Ala Ala Thr Thr Leu Ala Arg Gly His Ser Gly Asp Ala Glu
1130 1135 1140
Tyr Leu Leu Glu Leu Met Leu Asn Asp Tyr Ser Thr Ala Lys Ile
1145 1150 1155
Val Leu Ala Ala Lys Cys Gly Cys Gly Glu Lys Glu Ile Val Leu
1160 1165 1170
Glu Arg Ala Val Phe Lys Leu Thr Pro Leu Lys Glu Ser Phe Asn
1175 1180 1185
Tyr Gly Val Cys Gly Asp Cys Met Gln Val Asn Thr Cys Arg Phe
1190 1195 1200
Leu Ser Val Glu Gly Ser Gly Val Phe Val His Asp Ile Leu Ser
1205 1210 1215
Lys Gln Thr Pro Glu Ala Met Phe Val Val Lys Pro Val Met His
1220 1225 1230
Ala Val Tyr Thr Gly Thr Thr Gln Asn Gly His Tyr Met Val Asp
1235 1240 1245
Asp Ile Glu His Gly Tyr Cys Val Asp Gly Met Gly Ile Lys Pro
1250 1255 1260
Leu Lys Lys Arg Cys Tyr Thr Ser Thr Leu Phe Ile Asn Ala Asn
1265 1270 1275
Val Met Thr Arg Ala Glu Lys Pro Lys Gln Glu Phe Lys Val Glu
1280 1285 1290
Lys Val Glu Gln Gln Pro Ile Val Glu Glu Asn Lys Ser Ser Ile
1295 1300 1305
Glu Lys Glu Glu Ile Gln Ser Pro Lys Asn Asp Asp Leu Ile Leu
1310 1315 1320
Pro Phe Tyr Lys Ala Gly Lys Leu Ser Phe Tyr Gln Gly Ala Leu
1325 1330 1335
Asp Val Leu Ile Asn Phe Leu Glu Pro Asp Val Ile Val Asn Ala
1340 1345 1350
Ala Asn Gly Asp Leu Lys His Met Gly Gly Val Ala Arg Ala Ile
1355 1360 1365
Asp Val Phe Thr Gly Gly Lys Leu Thr Glu Arg Ser Lys Asp Tyr
1370 1375 1380
Leu Lys Lys Asn Lys Ser Ile Ala Pro Gly Asn Ala Val Phe Phe
1385 1390 1395
Glu Asn Val Ile Glu His Leu Ser Val Leu Asn Ala Val Gly Pro
1400 1405 1410
Arg Asn Gly Asp Ser Arg Val Glu Ala Lys Leu Cys Asn Val Tyr
1415 1420 1425
Lys Ala Ile Ala Lys Cys Glu Gly Lys Ile Leu Thr Pro Leu Ile
1430 1435 1440
Ser Val Gly Ile Phe Asn Val Arg Leu Glu Thr Ser Leu Gln Cys
1445 1450 1455
Leu Leu Lys Thr Val Asn Asp Arg Gly Leu Asn Val Phe Val Tyr
1460 1465 1470
Thr Asp Gln Glu Arg Gln Thr Ile Glu Asn Phe Phe Ser Cys Ser
1475 1480 1485
Ile Pro Val Asn Val Thr Glu Asp Asn Val Asn His Glu Arg Val
1490 1495 1500
Ser Val Ser Phe Asp Lys Thr Tyr Gly Glu Gln Leu Lys Gly Thr
1505 1510 1515
Val Val Ile Lys Asp Lys Asp Val Thr Asn Gln Leu Pro Ser Ala
1520 1525 1530
Phe Asp Val Gly Gln Lys Val Ile Lys Ala Ile Asp Ile Asp Trp
1535 1540 1545
Gln Ala His Tyr Gly Phe Arg Asp Ala Ala Ala Phe Ser Ala Ser
1550 1555 1560
Ser His Asp Ala Tyr Lys Phe Glu Val Val Thr His Ser Asn Phe
1565 1570 1575
Ile Val His Lys Gln Thr Asp Asn Asn Cys Trp Ile Asn Ala Ile
1580 1585 1590
Cys Leu Ala Leu Gln Arg Leu Lys Pro Gln Trp Lys Phe Pro Gly
1595 1600 1605
Val Arg Gly Leu Trp Asn Glu Phe Leu Glu Arg Lys Thr Gln Gly
1610 1615 1620
Phe Val His Met Leu Tyr His Ile Ser Gly Val Lys Lys Gly Glu
1625 1630 1635
Pro Gly Asp Ala Glu Leu Met Leu His Lys Leu Gly Asp Leu Met
1640 1645 1650
Asp Asn Asp Cys Glu Ile Ile Val Thr His Thr Thr Ala Cys Asp
1655 1660 1665
Lys Cys Ala Lys Val Glu Lys Phe Val Gly Pro Val Val Ala Ala
1670 1675 1680
Pro Leu Ala Ile His Gly Thr Asp Glu Thr Cys Val His Gly Val
1685 1690 1695
Ser Val Asn Val Lys Val Thr Gln Ile Lys Gly Thr Val Ala Ile
1700 1705 1710
Thr Ser Leu Ile Gly Pro Ile Ile Gly Glu Val Leu Glu Ala Thr
1715 1720 1725
Gly Tyr Ile Cys Tyr Ser Gly Ser Asn Arg Asn Gly His Tyr Thr
1730 1735 1740
Tyr Tyr Asp Asn Arg Asn Gly Leu Val Val Asp Ala Glu Lys Ala
1745 1750 1755
Tyr His Phe Asn Arg Asp Leu Leu Gln Val Thr Thr Ala Ile Ala
1760 1765 1770
Ser Asn Phe Val Val Lys Lys Pro Gln Ala Glu Glu Arg Pro Lys
1775 1780 1785
Asn Cys Ala Phe Asn Lys Val Ala Ala Ser Pro Lys Ile Val Gln
1790 1795 1800
Glu Gln Lys Leu Leu Ala Ile Glu Ser Gly Ala Asn Tyr Ala Leu
1805 1810 1815
Thr Glu Phe Gly Arg Tyr Ala Asp Met Phe Phe Met Ala Gly Asp
1820 1825 1830
Lys Ile Leu Arg Leu Leu Leu Glu Val Phe Lys Tyr Leu Leu Val
1835 1840 1845
Leu Phe Met Cys Leu Arg Ser Thr Lys Met Pro Lys Val Lys Val
1850 1855 1860
Lys Pro Pro Leu Ala Phe Lys Asp Phe Gly Ala Lys Val Arg Thr
1865 1870 1875
Leu Asn Tyr Met Arg Gln Leu Asn Lys Pro Ser Val Trp Arg Tyr
1880 1885 1890
Ala Lys Leu Val Leu Leu Leu Ile Ala Ile Tyr Asn Phe Phe Tyr
1895 1900 1905
Leu Phe Val Ser Ile Pro Val Val His Lys Leu Thr Cys Asn Gly
1910 1915 1920
Ala Val Gln Ala Tyr Lys Asn Ser Ser Phe Ile Lys Ser Ala Val
1925 1930 1935
Cys Gly Asn Ser Ile Leu Cys Lys Ala Cys Leu Ala Ser Tyr Asp
1940 1945 1950
Glu Leu Ala Asp Phe Gln His Leu Gln Val Thr Trp Asp Phe Lys
1955 1960 1965
Ser Asp Pro Leu Trp Asn Arg Leu Val Gln Leu Ser Tyr Phe Ala
1970 1975 1980
Phe Leu Ala Val Phe Gly Asn Asn Tyr Val Arg Cys Phe Leu Met
1985 1990 1995
Tyr Phe Val Ser Gln Tyr Leu Asn Leu Trp Leu Ser Tyr Phe Gly
2000 2005 2010
Tyr Val Glu Tyr Ser Trp Phe Leu His Val Val Asn Phe Glu Ser
2015 2020 2025
Ile Ser Ala Glu Phe Val Ile Val Val Ile Val Val Lys Ala Val
2030 2035 2040
Leu Ala Leu Lys His Ile Val Phe Ala Cys Ser Asn Pro Ser Cys
2045 2050 2055
Lys Thr Cys Ser Arg Thr Ala Arg Gln Thr Arg Ile Pro Ile Gln
2060 2065 2070
Val Val Val Asn Gly Ser Met Lys Thr Val Tyr Val His Ala Asn
2075 2080 2085
Gly Thr Gly Lys Phe Cys Lys Lys His Asn Phe Tyr Cys Lys Asn
2090 2095 2100
Cys Asp Ser Tyr Gly Phe Glu Asn Thr Phe Ile Cys Asp Glu Ile
2105 2110 2115
Val Arg Asp Leu Ser Asn Ser Val Lys Gln Thr Val Tyr Ala Thr
2120 2125 2130
Asp Arg Ser His Gln Glu Val Thr Lys Val Glu Cys Ser Asp Gly
2135 2140 2145
Phe Tyr Arg Phe Tyr Val Gly Asp Glu Phe Thr Ser Tyr Asp Tyr
2150 2155 2160
Asp Val Lys His Lys Lys Tyr Ser Ser Gln Glu Val Leu Lys Ser
2165 2170 2175
Met Leu Leu Leu Asp Asp Phe Ile Val Tyr Ser Pro Ser Gly Ser
2180 2185 2190
Ala Leu Ala Asn Val Arg Asn Ala Cys Val Tyr Phe Ser Gln Leu
2195 2200 2205
Ile Gly Lys Pro Ile Lys Ile Val Asn Ser Asp Leu Leu Glu Asp
2210 2215 2220
Leu Ser Val Asp Phe Lys Gly Ala Leu Phe Asn Ala Lys Lys Asn
2225 2230 2235
Val Ile Lys Asn Ser Phe Asn Val Asp Val Ser Glu Cys Lys Asn
2240 2245 2250
Leu Asp Glu Cys Tyr Arg Ala Cys Asn Leu Asn Val Ser Phe Ser
2255 2260 2265
Thr Phe Glu Met Ala Val Asn Asn Ala His Arg Phe Gly Ile Leu
2270 2275 2280
Ile Thr Asp Arg Ser Phe Asn Asn Phe Trp Pro Ser Lys Val Lys
2285 2290 2295
Pro Gly Ser Ser Gly Val Ser Ala Met Asp Ile Gly Lys Cys Met
2300 2305 2310
Thr Ser Asp Ala Lys Ile Val Asn Ala Lys Val Leu Thr Gln Arg
2315 2320 2325
Gly Lys Ser Val Val Trp Leu Ser Gln Asp Phe Ala Ala Leu Ser
2330 2335 2340
Ser Thr Ala Gln Lys Val Leu Val Lys Thr Phe Val Glu Glu Gly
2345 2350 2355
Val Asn Phe Ser Leu Thr Phe Asn Ala Val Gly Ser Asp Asp Asp
2360 2365 2370
Leu Pro Tyr Glu Arg Phe Thr Glu Ser Val Ser Pro Lys Ser Gly
2375 2380 2385
Ser Gly Phe Phe Asp Val Ile Thr Gln Leu Lys Gln Ile Val Ile
2390 2395 2400
Leu Val Phe Val Phe Ile Phe Ile Cys Gly Leu Cys Ser Val Tyr
2405 2410 2415
Ser Val Ala Thr Gln Ser Tyr Ile Glu Ser Ala Glu Gly Tyr Asp
2420 2425 2430
Tyr Met Val Ile Lys Asn Gly Ile Val Gln Pro Phe Asp Asp Thr
2435 2440 2445
Ile Ser Cys Val His Asn Thr Tyr Lys Gly Phe Gly Asp Trp Phe
2450 2455 2460
Lys Ala Lys Tyr Gly Phe Ile Pro Thr Phe Gly Lys Ser Cys Pro
2465 2470 2475
Ile Val Val Gly Thr Val Phe Asp Leu Glu Asn Met Arg Pro Ile
2480 2485 2490
Pro Asp Val Pro Ala Tyr Val Ser Ile Val Gly Arg Ser Leu Val
2495 2500 2505
Phe Ala Ile Asn Ala Ala Phe Gly Val Thr Asn Met Cys Tyr Asp
2510 2515 2520
His Thr Gly Asn Ala Val Ser Lys Asp Ser Tyr Phe Asp Thr Cys
2525 2530 2535
Val Phe Asn Thr Ala Cys Thr Thr Leu Thr Gly Leu Gly Gly Thr
2540 2545 2550
Ile Val Tyr Cys Ala Lys Gln Gly Leu Val Glu Gly Ala Lys Leu
2555 2560 2565
Tyr Ser Asp Leu Met Pro Asp Tyr Tyr Tyr Glu His Ala Ser Gly
2570 2575 2580
Asn Met Val Lys Leu Pro Ala Ile Ile Arg Gly Leu Gly Leu Arg
2585 2590 2595
Phe Val Lys Thr Gln Ala Thr Thr Tyr Cys Arg Val Gly Glu Cys
2600 2605 2610
Ile Asp Ser Lys Ala Gly Phe Cys Phe Gly Gly Asp Asn Trp Phe
2615 2620 2625
Val Tyr Asp Asn Glu Phe Gly Asn Gly Tyr Ile Cys Gly Asn Ser
2630 2635 2640
Val Leu Gly Phe Phe Lys Asn Val Phe Lys Leu Phe Asn Ser Asn
2645 2650 2655
Met Ser Val Val Ala Thr Ser Gly Ala Met Leu Val Asn Ile Ile
2660 2665 2670
Ile Ala Cys Leu Ala Ile Ala Met Cys Tyr Gly Val Leu Lys Phe
2675 2680 2685
Lys Lys Ile Phe Gly Asp Cys Thr Phe Leu Ile Val Met Ile Ile
2690 2695 2700
Val Thr Leu Val Val Asn Asn Val Ser Tyr Phe Val Thr Gln Asn
2705 2710 2715
Thr Phe Phe Met Ile Ile Tyr Ala Ile Val Tyr Tyr Phe Ile Thr
2720 2725 2730
Arg Lys Leu Ala Tyr Pro Gly Ile Leu Asp Ala Gly Phe Ile Ile
2735 2740 2745
Ala Tyr Ile Asn Met Ala Pro Trp Tyr Val Ile Thr Ala Tyr Ile
2750 2755 2760
Leu Val Phe Leu Tyr Asp Ser Leu Pro Ser Leu Phe Lys Leu Lys
2765 2770 2775
Val Ser Thr Asn Leu Phe Glu Gly Asp Lys Phe Val Gly Asn Phe
2780 2785 2790
Glu Ser Ala Ala Met Gly Thr Phe Val Ile Asp Met Arg Ser Tyr
2795 2800 2805
Glu Thr Ile Val Asn Ser Thr Ser Ile Ala Arg Ile Lys Ser Tyr
2810 2815 2820
Ala Asn Ser Phe Asn Lys Tyr Lys Tyr Tyr Thr Gly Ser Met Gly
2825 2830 2835
Glu Ala Asp Tyr Arg Met Ala Cys Tyr Ala His Leu Gly Lys Ala
2840 2845 2850
Leu Met Asp Tyr Ser Val Asn Arg Thr Asp Met Leu Tyr Thr Pro
2855 2860 2865
Pro Thr Val Ser Val Asn Ser Thr Leu Gln Ser Gly Leu Arg Lys
2870 2875 2880
Met Ala Gln Pro Ser Gly Leu Val Glu Pro Cys Ile Val Arg Val
2885 2890 2895
Ser Tyr Gly Asn Asn Val Leu Asn Gly Leu Trp Leu Gly Asp Glu
2900 2905 2910
Val Ile Cys Pro Arg His Val Ile Ala Ser Asp Thr Thr Arg Val
2915 2920 2925
Ile Asn Tyr Glu Asn Glu Met Ser Ser Val Arg Leu His Asn Phe
2930 2935 2940
Ser Val Ser Lys Asn Asn Val Phe Leu Gly Val Val Ser Ala Arg
2945 2950 2955
Tyr Lys Gly Val Asn Leu Val Leu Lys Val Asn Gln Val Asn Pro
2960 2965 2970
Asn Thr Pro Glu His Lys Phe Lys Ser Ile Lys Ala Gly Glu Ser
2975 2980 2985
Phe Asn Ile Leu Ala Cys Tyr Glu Gly Cys Pro Gly Ser Val Tyr
2990 2995 3000
Gly Val Asn Met Arg Ser Gln Gly Thr Ile Lys Gly Ser Phe Ile
3005 3010 3015
Ala Gly Thr Cys Gly Ser Val Gly Tyr Val Leu Glu Asn Gly Ile
3020 3025 3030
Leu Tyr Phe Val Tyr Met His His Leu Glu Leu Gly Asn Gly Ser
3035 3040 3045
His Val Gly Ser Asn Phe Glu Gly Glu Met Tyr Gly Gly Tyr Glu
3050 3055 3060
Asp Gln Pro Ser Met Gln Leu Glu Gly Thr Asn Val Met Ser Ser
3065 3070 3075
Asp Asn Val Val Ala Phe Leu Tyr Ala Ala Leu Ile Asn Gly Glu
3080 3085 3090
Arg Trp Phe Val Thr Asn Thr Ser Met Ser Leu Glu Ser Tyr Asn
3095 3100 3105
Thr Trp Ala Lys Thr Asn Ser Phe Thr Glu Leu Ser Ser Thr Asp
3110 3115 3120
Ala Phe Ser Met Leu Ala Ala Lys Thr Gly Gln Ser Val Glu Lys
3125 3130 3135
Leu Leu Asp Ser Ile Val Arg Leu Asn Lys Gly Phe Gly Gly Arg
3140 3145 3150
Thr Ile Leu Ser Tyr Gly Ser Leu Cys Asp Glu Phe Thr Pro Thr
3155 3160 3165
Glu Val Ile Arg Gln Met Tyr Gly Val Asn Leu Gln Ala Gly Lys
3170 3175 3180
Val Lys Ser Phe Phe Tyr Pro Ile Met Thr Ala Met Thr Ile Leu
3185 3190 3195
Phe Ala Phe Trp Leu Glu Phe Phe Met Tyr Thr Pro Phe Thr Trp
3200 3205 3210
Ile Asn Pro Thr Phe Val Ser Ile Val Leu Ala Val Thr Thr Leu
3215 3220 3225
Ile Ser Thr Val Phe Val Ser Gly Ile Lys His Lys Met Leu Phe
3230 3235 3240
Phe Met Ser Phe Val Leu Pro Ser Val Ile Leu Val Thr Ala His
3245 3250 3255
Asn Leu Phe Trp Asp Phe Ser Tyr Tyr Glu Ser Leu Gln Ser Ile
3260 3265 3270
Val Glu Asn Thr Asn Thr Met Phe Leu Pro Val Asp Met Gln Gly
3275 3280 3285
Val Met Leu Thr Val Phe Cys Phe Ile Val Phe Val Thr Tyr Ser
3290 3295 3300
Val Arg Phe Phe Thr Cys Lys Gln Ser Trp Phe Ser Leu Ala Val
3305 3310 3315
Thr Thr Ile Leu Val Ile Phe Asn Met Val Lys Ile Phe Gly Thr
3320 3325 3330
Ser Asp Glu Pro Trp Thr Glu Asn Gln Ile Ala Phe Cys Phe Val
3335 3340 3345
Asn Met Leu Thr Met Ile Val Ser Leu Thr Thr Lys Asp Trp Met
3350 3355 3360
Val Val Ile Ala Ser Tyr Arg Ile Ala Tyr Tyr Ile Val Val Cys
3365 3370 3375
Val Met Pro Ser Ala Phe Val Ser Asp Phe Gly Phe Met Lys Cys
3380 3385 3390
Ile Ser Ile Val Tyr Met Ala Cys Gly Tyr Leu Phe Cys Cys Tyr
3395 3400 3405
Tyr Gly Ile Leu Tyr Trp Val Asn Arg Phe Thr Cys Met Thr Cys
3410 3415 3420
Gly Val Tyr Gln Phe Thr Val Ser Ala Ala Glu Leu Lys Tyr Met
3425 3430 3435
Thr Ala Asn Asn Leu Ser Ala Pro Lys Asn Ala Tyr Asp Ala Met
3440 3445 3450
Ile Leu Ser Ala Lys Leu Ile Gly Val Gly Gly Lys Arg Asn Ile
3455 3460 3465
Lys Ile Ser Thr Val Gln Ser Lys Leu Thr Glu Met Lys Cys Thr
3470 3475 3480
Asn Val Val Leu Leu Gly Leu Leu Ser Lys Met His Val Glu Ser
3485 3490 3495
Asn Ser Lys Glu Trp Asn Tyr Cys Val Gly Leu His Asn Glu Ile
3500 3505 3510
Asn Leu Cys Asp Asp Pro Glu Ile Val Leu Glu Lys Leu Leu Ala
3515 3520 3525
Leu Ile Ala Phe Phe Leu Ser Lys His Asn Thr Cys Asp Leu Ser
3530 3535 3540
Glu Leu Ile Glu Ser Tyr Phe Glu Asn Thr Thr Ile Leu Gln Ser
3545 3550 3555
Val Ala Ser Ala Tyr Ala Ala Leu Pro Ser Trp Ile Ala Leu Glu
3560 3565 3570
Lys Ala Arg Ala Asp Leu Glu Glu Ala Lys Lys Asn Asp Val Ser
3575 3580 3585
Pro Gln Ile Leu Lys Gln Leu Thr Lys Ala Phe Asn Ile Ala Lys
3590 3595 3600
Ser Asp Phe Glu Arg Glu Ala Ser Val Gln Lys Lys Leu Asp Lys
3605 3610 3615
Met Ala Glu Gln Ala Ala Ala Ser Met Tyr Lys Glu Ala Arg Ala
3620 3625 3630
Val Asp Arg Lys Ser Lys Ile Val Ser Ala Met His Ser Leu Leu
3635 3640 3645
Phe Gly Met Leu Lys Lys Leu Asp Met Ser Ser Val Asn Thr Ile
3650 3655 3660
Ile Asp Gln Ala Arg Asn Gly Val Leu Pro Leu Ser Ile Ile Pro
3665 3670 3675
Ala Ala Ser Ala Thr Arg Leu Val Val Ile Thr Pro Ser Leu Glu
3680 3685 3690
Val Phe Ser Lys Ile Arg Gln Glu Asn Asn Val His Tyr Ala Gly
3695 3700 3705
Ala Ile Trp Thr Ile Val Glu Val Lys Asp Ala Asn Gly Ser His
3710 3715 3720
Val His Leu Lys Glu Val Thr Ala Ala Asn Glu Leu Asn Leu Thr
3725 3730 3735
Trp Pro Leu Ser Ile Thr Cys Glu Arg Thr Thr Lys Leu Gln Asn
3740 3745 3750
Asn Glu Ile Met Pro Gly Lys Leu Lys Glu Arg Ala Val Arg Ala
3755 3760 3765
Ser Ala Thr Leu Asp Gly Glu Ala Phe Gly Ser Gly Lys Ala Leu
3770 3775 3780
Met Ala Ser Glu Ser Gly Lys Ser Phe Met Tyr Ala Phe Ile Ala
3785 3790 3795
Ser Asp Asn Asn Leu Lys Tyr Val Lys Trp Glu Ser Asn Asn Asp
3800 3805 3810
Ile Ile Pro Ile Glu Leu Glu Ala Pro Leu Arg Phe Tyr Val Asp
3815 3820 3825
Gly Ala Asn Gly Pro Glu Val Lys Tyr Leu Tyr Phe Val Lys Asn
3830 3835 3840
Leu Asn Thr Leu Arg Arg Gly Ala Val Leu Gly Tyr Ile Gly Ala
3845 3850 3855
Thr Val Arg Leu Gln Ala Gly Lys Pro Thr Glu His Pro Ser Asn
3860 3865 3870
Ser Ser Leu Leu Thr Leu Cys Ala Phe Ser Pro Asp Pro Ala Lys
3875 3880 3885
Ala Tyr Val Asp Ala Val Lys Arg Gly Met Gln Pro Val Asn Asn
3890 3895 3900
Cys Val Lys Met Leu Ser Asn Gly Ala Gly Asn Gly Met Ala Val
3905 3910 3915
Thr Asn Gly Val Glu Ala Asn Thr Gln Gln Asp Ser Tyr Gly Gly
3920 3925 3930
Ala Ser Val Cys Ile Tyr Cys Arg Cys His Val Glu His Pro Ala
3935 3940 3945
Ile Asp Gly Leu Cys Arg Tyr Lys Gly Lys Phe Val Gln Ile Pro
3950 3955 3960
Thr Gly Thr Gln Asp Pro Ile Arg Phe Cys Ile Glu Asn Glu Val
3965 3970 3975
Cys Val Val Cys Gly Cys Trp Leu Asn Asn Gly Cys Met Cys Asp
3980 3985 3990
Arg Thr Ser Met Gln Ser Phe Thr Val Asp Gln Ser Tyr Leu Asn
3995 4000 4005
Glu Cys Gly Val Leu Val Gln Leu Asp
4010 4015
<210> 14
<211> 4055
<212> PRT
<213> EMCR Coronavirus
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 14
Met Phe Tyr Asn Gln Val Thr Leu Ala Val Ala Ser Asp Ser Glu Ile
1 5 10 15
Ser Gly Phe Gly Phe Ala Ile Pro Ser Val Ala Val Arg Ala Tyr Ser
20 25 30
Glu Ala Ala Ala Gln Gly Phe Gln Ala Cys Arg Phe Val Ala Phe Gly
35 40 45
Leu Gln Asp Cys Val Thr Gly Ile Asn Asp Asp Asp Tyr Val Ile Ala
50 55 60
Leu Thr Gly Thr Asn Gln Leu Cys Ala Lys Ile Leu Leu Phe Ser Asp
65 70 75 80
Arg Pro Leu Asn Leu Arg Gly Trp Leu Ile Phe Ser Asn Ser Asn Tyr
85 90 95
Val Leu Gln Asp Phe Asp Val Val Phe Gly His Gly Ala Gly Ser Val
100 105 110
Val Phe Val Asp Lys Tyr Met Cys Gly Phe Asp Gly Lys Pro Val Leu
115 120 125
Pro Lys Asn Met Trp Glu Phe Arg Asp Tyr Phe Asn Asp Asn Thr Asp
130 135 140
Ser Ile Val Ile Gly Gly Val Thr Tyr Gln Leu Ala Trp Asp Val Ile
145 150 155 160
Arg Lys Asp Leu Ser Tyr Glu Gln Gln Asn Val Leu Ala Ile Glu Ser
165 170 175
Ile His Tyr Leu Gly Thr Thr Gly His Thr Leu Lys Ser Gly Cys Lys
180 185 190
Leu Ile Asn Ala Lys Pro Pro Lys Tyr Ser Ser Lys Val Val Leu Ser
195 200 205
Gly Glu Trp Asn Ala Val Tyr Lys Ala Phe Gly Ser Pro Phe Ile Thr
210 215 220
Asn Gly Ile Ser Leu Leu Asp Ile Ile Val Lys Pro Val Phe Phe Asn
225 230 235 240
Ala Phe Val Lys Cys Asn Cys Gly Ser Glu Asn Trp Ser Val Gly Ala
245 250 255
Trp Asp Gly Tyr Leu Ser Ser Cys Cys Gly Thr Pro Ala Lys Lys Leu
260 265 270
Cys Val Val Pro Gly Asn Val Val Pro Gly Asp Val Ile Ile Thr Ser
275 280 285
Thr Asp Ala Gly Cys Gly Val Lys Tyr Tyr Ala Gly Leu Val Val Lys
290 295 300
His Ile Thr Asn Ile Thr Gly Val Ser Leu Trp Arg Val Thr Ala Val
305 310 315 320
His Ser Asp Gly Met Phe Val Ala Thr Ser Ser Tyr Asp Ala Leu Leu
325 330 335
His Arg Asn Ser Leu Asp Pro Phe Cys Phe Asp Val Asn Thr Leu Leu
340 345 350
Ser Asn Gln Leu Arg Leu Ala Phe Leu Gly Ala Ser Val Thr Glu Asp
355 360 365
Val Lys Phe Ala Ala Ser Thr Gly Val Ile Asp Ile Ser Ala Gly Met
370 375 380
Phe Gly Leu Tyr Asp Asp Ile Leu Thr Asn Asn Lys Pro Trp Phe Val
385 390 395 400
Arg Lys Ala Ser Gly Leu Phe Asp Ala Ile Trp Asp Ala Phe Val Ala
405 410 415
Ala Ile Lys Leu Val Pro Thr Thr Thr Gly Gly Leu Val Arg Phe Val
420 425 430
Lys Ser Ile Ala Ser Thr Val Leu Thr Val Ser Asn Gly Val Ile Ile
435 440 445
Met Cys Ala Asp Val Pro Asp Ala Phe Gln Pro Val Tyr Arg Thr Phe
450 455 460
Thr Gln Ala Ile Cys Ala Ala Phe Asp Phe Ser Leu Asp Val Phe Lys
465 470 475 480
Ile Gly Asp Val Lys Phe Lys Arg Leu Gly Asp Tyr Val Leu Thr Glu
485 490 495
Asn Ala Leu Val Arg Leu Thr Thr Glu Val Val Arg Gly Val Arg Asp
500 505 510
Ala Arg Ile Lys Lys Ala Met Phe Thr Lys Val Val Val Gly Pro Thr
515 520 525
Thr Glu Val Lys Phe Ser Val Ile Glu Leu Ala Thr Val Asn Leu Arg
530 535 540
Leu Val Asp Cys Ala Pro Val Val Cys Pro Lys Gly Lys Ile Val Val
545 550 555 560
Ile Ala Gly Gln Ala Phe Phe Tyr Ser Gly Gly Phe Tyr Arg Phe Met
565 570 575
Val Asp Ser Thr Thr Val Leu Asn Asp Pro Val Phe Thr Gly Glu Leu
580 585 590
Phe Tyr Thr Ile Lys Phe Ser Gly Phe Lys Leu Asp Gly Phe Asn His
595 600 605
Gln Phe Val Asn Ala Ser Ser Ala Thr Asp Ala Ile Ile Ala Val Glu
610 615 620
Leu Leu Leu Ser Asp Phe Lys Thr Ala Val Phe Val Tyr Thr Cys Val
625 630 635 640
Val Asp Gly Cys Ser Val Ile Val Arg Arg Asp Ala Thr Phe Ala Thr
645 650 655
His Val Cys Phe Lys Asp Cys Tyr Ser Ile Trp Glu Gln Phe Cys Ile
660 665 670
Asp Asn Cys Gly Glu Pro Trp Phe Leu Thr Asp Tyr Asn Ala Ile Leu
675 680 685
Gln Ser Asn Asn Pro Gln Cys Ala Ile Val Gln Ala Ser Glu Ser Lys
690 695 700
Val Leu Leu Glu Arg Phe Leu Pro Lys Cys Pro Glu Ile Leu Leu Ser
705 710 715 720
Ile Asp Asp Gly His Leu Trp Asn Leu Phe Val Glu Lys Phe Asn Phe
725 730 735
Val Thr Asp Trp Leu Lys Thr Leu Lys Leu Thr Leu Thr Ser Asn Gly
740 745 750
Leu Leu Gly Asn Cys Ala Lys Arg Phe Arg Arg Val Leu Val Lys Leu
755 760 765
Leu Asp Val Tyr Asn Gly Phe Leu Glu Thr Val Cys Ser Val Val His
770 775 780
Thr Ala Gly Val Cys Ile Lys Tyr Tyr Ala Val Asn Val Pro Tyr Val
785 790 795 800
Val Ile Ser Gly Phe Val Ser Arg Val Ile Arg Arg Glu Arg Cys Asp
805 810 815
Val Thr Phe Pro Cys Val Ser Cys Val Thr Phe Phe Tyr Glu Phe Leu
820 825 830
Asp Thr Cys Phe Gly Val Ser Lys Pro Asn Ala Ile Asp Val Glu His
835 840 845
Leu Glu Leu Lys Glu Thr Val Phe Val Glu Pro Lys Asp Gly Gly Gln
850 855 860
Phe Phe Val Ser Asp Asp Tyr Leu Trp Tyr Val Val Asp Asp Ile Tyr
865 870 875 880
Tyr Pro Ala Ser Cys Asn Gly Val Leu Pro Val Ala Phe Thr Lys Leu
885 890 895
Ala Gly Gly Lys Ile Ser Phe Ser Asp Asp Val Ile Val His Asp Val
900 905 910
Glu Pro Thr His Lys Val Lys Leu Ile Phe Glu Phe Glu Asp Asp Val
915 920 925
Val Thr Ser Leu Cys Lys Lys Ser Phe Gly Lys Ser Ile Ile Tyr Thr
930 935 940
Gly Asp Trp Glu Gly Leu His Glu Val Leu Thr Ser Ala Met Asn Val
945 950 955 960
Ile Gly Gln His Ile Lys Leu Pro Gln Phe Tyr Ile Tyr Asp Glu Glu
965 970 975
Gly Gly Tyr Asp Val Ser Lys Pro Val Met Ile Ser Gln Trp Pro Ile
980 985 990
Ser Asp Asp Ser Asp Gly Cys Val Val Glu Ala Ser Thr Asp Phe His
995 1000 1005
Gln Leu Glu Ser Val Arg Glu Glu Val Asp Ile Ile Glu Gln Pro
1010 1015 1020
Phe Gly Glu Val Glu His Ala Leu Ser Ile Arg Gln Pro Phe Ser
1025 1030 1035
Phe Ser Phe Arg Asp Glu Leu Gly Val Arg Val Leu Asp Gln Ser
1040 1045 1050
Asp Asn Asn Cys Trp Ile Ser Thr Thr Leu Ile Gln Leu Gln Leu
1055 1060 1065
Thr Lys Leu Leu Asp Asp Ser Ile Glu Met Gln Leu Phe Lys Val
1070 1075 1080
Gly Lys Val Asp Ser Ile Val Gln Lys Cys Tyr Glu Leu Ser His
1085 1090 1095
Leu Ile Ser Gly Ser Leu Gly Asp Ser Gly Lys Leu Leu Ser Glu
1100 1105 1110
Leu Leu Lys Asp Lys Tyr Thr Cys Ser Ile Thr Phe Glu Met Ser
1115 1120 1125
Cys Asp Cys Gly Lys Lys Phe Asp Glu Gln Val Gly Cys Leu Phe
1130 1135 1140
Trp Ile Met Pro Tyr Thr Lys Leu Phe Gln Lys Gly Glu Cys Cys
1145 1150 1155
Ile Cys His Lys Met Gln Thr Tyr Lys Leu Val Ser Met Lys Gly
1160 1165 1170
Thr Gly Val Phe Val Gln Asp Pro Ala Pro Ile Asp Ile Asp Ala
1175 1180 1185
Phe Pro Val Arg Pro Ile Cys Ser Ser Val Tyr Leu Gly Val Lys
1190 1195 1200
Gly Ser Gly His Tyr Gln Thr Asn Leu Tyr Ser Phe Asp Lys Ala
1205 1210 1215
Ile Asp Gly Phe Gly Val Phe Asp Ile Lys Asn Ser Ser Val Asn
1220 1225 1230
Thr Val Cys Phe Val Asp Val Asp Phe His Ser Val Glu Ile Glu
1235 1240 1245
Ala Gly Glu Val Lys Pro Phe Ala Val Tyr Lys Asn Val Lys Phe
1250 1255 1260
Tyr Leu Gly Asp Ile Ser His Leu Val Asn Cys Val Ser Phe Asp
1265 1270 1275
Phe Val Val Asn Ala Ala Asn Glu Asn Leu Met His Gly Gly Gly
1280 1285 1290
Val Ala Arg Ala Ile Asp Ile Leu Thr Glu Gly Gln Leu Gln Ser
1295 1300 1305
Leu Ser Lys Asp Tyr Ile Ser Ser Asn Gly Pro Leu Lys Val Gly
1310 1315 1320
Ala Gly Val Met Leu Glu Cys Glu Lys Phe Asn Val Phe Asn Val
1325 1330 1335
Val Gly Pro Arg Thr Gly Lys His Glu His Ser Leu Leu Val Glu
1340 1345 1350
Ala Tyr Asn Ser Ile Leu Phe Glu Asn Gly Ile Pro Leu Met Pro
1355 1360 1365
Leu Leu Ser Cys Gly Ile Phe Gly Val Arg Ile Glu Asn Ser Leu
1370 1375 1380
Lys Ala Leu Phe Ser Cys Asp Ile Asn Lys Pro Leu Gln Val Phe
1385 1390 1395
Val Tyr Ser Ser Asn Glu Glu Gln Ala Val Leu Lys Phe Leu Asp
1400 1405 1410
Gly Leu Asp Leu Thr Pro Val Ile Asp Asp Val Asp Val Val Lys
1415 1420 1425
Pro Phe Arg Val Glu Gly Asn Phe Ser Phe Phe Asp Cys Gly Val
1430 1435 1440
Asn Ala Leu Asp Gly Asp Ile Tyr Leu Leu Phe Thr Asn Ser Ile
1445 1450 1455
Leu Met Leu Asp Lys Gln Gly Gln Leu Leu Asp Thr Lys Leu Asn
1460 1465 1470
Gly Ile Leu Gln Gln Ala Val Leu Asp Tyr Leu Ala Thr Val Lys
1475 1480 1485
Thr Val Pro Ala Gly Asn Leu Val Lys Leu Val Val Glu Ser Cys
1490 1495 1500
Thr Ile Tyr Met Cys Val Val Pro Ser Ile Asn Asp Leu Ser Phe
1505 1510 1515
Asp Lys Asn Leu Gly Arg Cys Val Arg Lys Leu Asn Arg Leu Lys
1520 1525 1530
Thr Cys Val Ile Ala Asn Val Pro Ala Ile Asp Val Leu Lys Lys
1535 1540 1545
Leu Leu Ser Ser Leu Thr Leu Thr Val Lys Phe Val Val Glu Ser
1550 1555 1560
Asn Val Met Asp Val Asn Asp Cys Phe Lys Asn Asp Asn Val Val
1565 1570 1575
Leu Lys Ile Thr Glu Asp Gly Ile Asn Val Lys Asp Val Val Val
1580 1585 1590
Glu Ser Ser Lys Ser Leu Gly Lys Gln Leu Gly Val Val Ser Asp
1595 1600 1605
Gly Val Asp Ser Phe Glu Gly Val Leu Pro Ile Asn Thr Asp Thr
1610 1615 1620
Val Leu Ser Val Ala Pro Glu Val Asp Trp Val Ala Phe Tyr Gly
1625 1630 1635
Phe Glu Lys Ala Ala Leu Phe Ala Ser Leu Asp Val Lys Pro Tyr
1640 1645 1650
Gly Tyr Pro Asn Asp Phe Val Gly Gly Phe Arg Val Leu Gly Thr
1655 1660 1665
Thr Asp Asn Asn Cys Trp Val Asn Ala Thr Cys Ile Ile Leu Gln
1670 1675 1680
Tyr Leu Lys Pro Thr Phe Lys Ser Lys Gly Leu Asn Val Leu Trp
1685 1690 1695
Asn Lys Phe Val Thr Gly Asp Val Gly Pro Phe Val Ser Phe Ile
1700 1705 1710
Tyr Phe Ile Thr Met Ser Ser Lys Gly Gln Lys Gly Asp Ala Glu
1715 1720 1725
Glu Ala Leu Ser Lys Leu Ser Glu Tyr Leu Ile Ser Asp Ser Ile
1730 1735 1740
Val Thr Leu Glu Gln Tyr Ser Thr Cys Asp Ile Cys Lys Ser Thr
1745 1750 1755
Val Val Glu Val Lys Ser Ala Val Val Cys Ala Ser Val Leu Lys
1760 1765 1770
Asp Gly Cys Asp Val Gly Phe Cys Pro His Arg His Lys Leu Arg
1775 1780 1785
Ser Arg Val Lys Phe Val Asn Gly Arg Val Val Ile Thr Asn Val
1790 1795 1800
Gly Glu Pro Ile Ile Ser Gln Pro Ser Lys Leu Leu Asn Gly Ile
1805 1810 1815
Ala Tyr Thr Thr Phe Ser Gly Ser Phe Asp Asn Gly His Tyr Val
1820 1825 1830
Val Tyr Asp Ala Ala Asn Asn Ala Val Tyr Asp Gly Ala Arg Leu
1835 1840 1845
Phe Ala Ser Asp Leu Ser Thr Leu Ala Val Thr Ala Ile Val Val
1850 1855 1860
Val Gly Gly Cys Val Thr Ser Asn Val Pro Pro Ile Val Ser Glu
1865 1870 1875
Lys Ile Ser Val Met Asp Lys Leu Asp Thr Gly Ala Gln Lys Phe
1880 1885 1890
Phe Gln Phe Gly Asp Phe Val Met Asn Asn Ile Val Leu Phe Leu
1895 1900 1905
Thr Trp Leu Leu Ser Met Phe Ser Leu Leu Arg Thr Ser Ile Met
1910 1915 1920
Lys His Asp Ile Lys Val Ile Ala Lys Ala Pro Lys Arg Thr Gly
1925 1930 1935
Val Ile Leu Thr Arg Ser Phe Lys Tyr Asn Ile Arg Ser Ala Leu
1940 1945 1950
Phe Val Val Lys Gln Lys Trp Cys Val Ile Val Thr Leu Phe Lys
1955 1960 1965
Phe Leu Leu Leu Leu Tyr Ala Ile Tyr Ala Leu Val Phe Met Ile
1970 1975 1980
Val Gln Phe Ser Pro Phe Asn Ser Leu Leu Cys Gly Asp Ile Val
1985 1990 1995
Ser Gly Tyr Glu Lys Ser Thr Phe Asn Lys Asp Ile Tyr Cys Gly
2000 2005 2010
Asn Ser Met Val Cys Lys Met Cys Leu Phe Ser Tyr Gln Glu Phe
2015 2020 2025
Asn Asp Leu Asp His Thr Ser Leu Val Trp Lys His Ile Arg Asp
2030 2035 2040
Pro Ile Leu Ile Ser Leu Gln Pro Phe Val Ile Leu Val Ile Leu
2045 2050 2055
Leu Ile Phe Gly Asn Met Tyr Leu Arg Phe Gly Leu Leu Tyr Phe
2060 2065 2070
Val Ala Gln Phe Ile Ser Thr Phe Gly Ser Phe Leu Gly Phe His
2075 2080 2085
Gln Lys Gln Trp Phe Leu His Phe Val Pro Phe Asp Val Leu Cys
2090 2095 2100
Asn Glu Phe Leu Ala Thr Phe Ile Val Cys Lys Ile Val Leu Phe
2105 2110 2115
Val Arg His Ile Ile Val Gly Cys Asn Asn Ala Asp Cys Val Ala
2120 2125 2130
Cys Ser Lys Ser Ala Arg Leu Lys Arg Val Pro Leu Gln Thr Ile
2135 2140 2145
Ile Asn Gly Met His Lys Ser Phe Tyr Val Asn Ala Asn Gly Gly
2150 2155 2160
Thr Cys Phe Cys Asn Lys His Asn Phe Phe Cys Val Asn Cys Asp
2165 2170 2175
Ser Phe Gly Pro Gly Asn Thr Phe Ile Asn Gly Asp Ile Ala Arg
2180 2185 2190
Glu Leu Gly Asn Val Val Lys Thr Ala Val Gln Pro Thr Ala Pro
2195 2200 2205
Ala Tyr Val Ile Ile Asp Lys Val Asp Phe Val Asn Gly Phe Tyr
2210 2215 2220
Arg Leu Tyr Ser Gly Asp Thr Phe Trp Arg Tyr Asp Phe Asp Ile
2225 2230 2235
Thr Glu Ser Lys Tyr Ser Cys Lys Glu Val Leu Lys Asn Cys Asn
2240 2245 2250
Val Leu Glu Asn Phe Ile Val Tyr Asn Asn Ser Gly Ser Asn Ile
2255 2260 2265
Thr Gln Ile Lys Asn Ala Cys Val Tyr Phe Ser Gln Leu Leu Cys
2270 2275 2280
Glu Pro Ile Lys Leu Val Asn Ser Glu Leu Leu Ser Thr Leu Ser
2285 2290 2295
Val Asp Phe Asn Gly Val Leu His Lys Ala Tyr Val Asp Val Leu
2300 2305 2310
Cys Asn Ser Phe Phe Lys Glu Leu Thr Ala Asn Met Ser Met Ala
2315 2320 2325
Glu Cys Lys Ala Thr Leu Gly Leu Thr Val Ser Asp Asp Asp Phe
2330 2335 2340
Val Ser Ala Val Ala Asn Ala His Arg Tyr Asp Val Leu Leu Ser
2345 2350 2355
Asp Leu Ser Phe Asn Asn Phe Phe Ile Ser Tyr Ala Lys Pro Glu
2360 2365 2370
Asp Lys Leu Ser Val Tyr Asp Ile Ala Cys Cys Met Arg Ala Gly
2375 2380 2385
Ser Lys Val Val Asn His Asn Val Leu Ile Lys Glu Ser Ile Pro
2390 2395 2400
Ile Val Trp Gly Val Lys Asp Phe Asn Thr Leu Ser Gln Glu Gly
2405 2410 2415
Lys Lys Tyr Leu Val Lys Thr Thr Lys Ala Lys Gly Leu Thr Phe
2420 2425 2430
Leu Leu Thr Phe Asn Asp Asn Gln Ala Ile Thr Gln Val Pro Ala
2435 2440 2445
Thr Ser Ile Val Ala Lys Gln Gly Ala Gly Phe Lys Arg Thr Tyr
2450 2455 2460
Asn Phe Leu Trp Tyr Val Cys Leu Phe Val Val Ala Leu Phe Ile
2465 2470 2475
Gly Val Ser Phe Ile Asp Tyr Thr Thr Thr Val Thr Ser Phe His
2480 2485 2490
Gly Tyr Asp Phe Lys Tyr Ile Glu Asn Gly Gln Leu Lys Val Phe
2495 2500 2505
Glu Ala Pro Leu His Cys Val Arg Asn Val Phe Asp Asn Phe Asn
2510 2515 2520
Gln Trp His Glu Ala Lys Phe Gly Val Val Thr Thr Asn Ser Asp
2525 2530 2535
Lys Cys Pro Ile Val Val Gly Val Ser Glu Arg Ile Asn Val Val
2540 2545 2550
Pro Gly Val Pro Thr Asn Val Tyr Leu Val Gly Lys Thr Leu Val
2555 2560 2565
Phe Thr Leu Gln Ala Ala Phe Gly Asn Thr Gly Val Cys Tyr Asp
2570 2575 2580
Phe Asp Gly Val Thr Thr Ser Asp Lys Cys Ile Phe Asn Ser Ala
2585 2590 2595
Cys Thr Arg Leu Glu Gly Leu Gly Gly Asp Asn Val Tyr Cys Tyr
2600 2605 2610
Asn Thr Asp Leu Ile Glu Gly Ser Lys Pro Tyr Ser Ile Leu Gln
2615 2620 2625
Pro Asn Ala Tyr Tyr Lys Tyr Asp Val Lys Asn Tyr Val Arg Phe
2630 2635 2640
Pro Glu Ile Leu Ala Arg Gly Phe Gly Leu Arg Thr Ile Arg Thr
2645 2650 2655
Leu Ala Thr Arg Tyr Cys Arg Val Gly Glu Cys Arg Asp Ser His
2660 2665 2670
Lys Gly Val Cys Phe Gly Phe Asp Lys Trp Tyr Val Asn Asp Gly
2675 2680 2685
Arg Val Asp Asp Gly Tyr Ile Cys Gly Asp Gly Leu Ile Asp Leu
2690 2695 2700
Leu Val Asn Val Leu Ser Ile Phe Ser Ser Ser Phe Ser Val Val
2705 2710 2715
Ala Met Ser Gly His Met Leu Phe Asn Phe Leu Phe Ala Ala Phe
2720 2725 2730
Ile Thr Phe Leu Cys Phe Leu Val Thr Lys Phe Lys Arg Val Phe
2735 2740 2745
Gly Asp Leu Ser Tyr Gly Val Phe Thr Val Val Cys Ala Thr Leu
2750 2755 2760
Ile Asn Asn Ile Ser Tyr Val Val Thr Gln Asn Leu Phe Phe Met
2765 2770 2775
Leu Leu Tyr Ala Ile Leu Tyr Phe Val Phe Thr Arg Thr Val Arg
2780 2785 2790
Tyr Ala Trp Ile Trp His Ile Ala Tyr Ile Val Ala Tyr Phe Leu
2795 2800 2805
Leu Ile Pro Trp Trp Leu Leu Thr Trp Phe Ser Phe Ala Ala Phe
2810 2815 2820
Leu Glu Leu Leu Pro Asn Val Phe Lys Leu Lys Ile Ser Thr Gln
2825 2830 2835
Leu Phe Glu Gly Asp Lys Phe Ile Gly Thr Phe Glu Ser Ala Ala
2840 2845 2850
Ala Gly Thr Phe Val Leu Asp Met Arg Ser Tyr Glu Arg Leu Ile
2855 2860 2865
Asn Thr Ile Ser Pro Glu Lys Leu Lys Asn Tyr Ala Ala Ser Tyr
2870 2875 2880
Asn Lys Tyr Lys Tyr Tyr Ser Gly Ser Ala Ser Glu Ala Asp Tyr
2885 2890 2895
Arg Cys Ala Cys Tyr Ala His Leu Ala Lys Ala Met Leu Asp Tyr
2900 2905 2910
Ala Lys Asp His Asn Asp Met Leu Tyr Ser Pro Pro Thr Ile Ser
2915 2920 2925
Tyr Asn Ser Thr Leu Gln Ser Gly Leu Lys Lys Met Ala Gln Pro
2930 2935 2940
Ser Gly Cys Val Glu Arg Cys Val Val Arg Val Cys Tyr Gly Ser
2945 2950 2955
Thr Val Leu Asn Gly Val Trp Leu Gly Asp Thr Val Thr Cys Pro
2960 2965 2970
Arg His Val Ile Ala Pro Ser Thr Thr Val Leu Ile Asp Tyr Asp
2975 2980 2985
His Ala Tyr Ser Thr Met Arg Leu His Asn Phe Ser Val Ser His
2990 2995 3000
Asn Gly Val Phe Leu Gly Val Val Gly Val Thr Met His Gly Ser
3005 3010 3015
Val Leu Arg Ile Lys Val Ser Gln Ser Asn Val His Thr Pro Lys
3020 3025 3030
His Val Phe Lys Thr Leu Lys Pro Gly Ala Ser Phe Asn Ile Leu
3035 3040 3045
Ala Cys Tyr Glu Gly Ile Ala Ser Gly Val Phe Gly Val Asn Leu
3050 3055 3060
Arg Thr Asn Phe Thr Ile Lys Gly Ser Phe Ile Asn Gly Ala Cys
3065 3070 3075
Gly Ser Pro Gly Tyr Asn Val Arg Asn Asp Gly Thr Val Glu Phe
3080 3085 3090
Cys Tyr Leu His Gln Ile Glu Leu Gly Ser Gly Ala His Val Gly
3095 3100 3105
Ser Asp Phe Thr Gly Ser Val Tyr Gly Asn Phe Asp Asp Gln Pro
3110 3115 3120
Ser Leu Gln Val Glu Ser Ala Asn Leu Met Leu Ser Asp Asn Val
3125 3130 3135
Val Ala Phe Leu Tyr Ala Ala Leu Leu Asn Gly Cys Arg Trp Trp
3140 3145 3150
Leu Arg Ser Thr Arg Val Asn Val Asp Gly Phe Asn Glu Trp Ala
3155 3160 3165
Met Ala Asn Gly Tyr Thr Ile Val Ser Ser Val Glu Cys Tyr Ser
3170 3175 3180
Ile Leu Ala Ala Lys Thr Gly Val Ser Val Glu Gln Leu Leu Ala
3185 3190 3195
Ser Ile Gln His Leu His Glu Gly Phe Gly Gly Lys Asn Ile Leu
3200 3205 3210
Gly Tyr Ser Ser Leu Cys Asp Glu Phe Thr Leu Ala Glu Val Val
3215 3220 3225
Lys Gln Met Tyr Gly Val Asn Leu Gln Ser Gly Lys Val Ile Phe
3230 3235 3240
Gly Leu Lys Thr Met Phe Leu Phe Ser Val Phe Phe Thr Met Phe
3245 3250 3255
Trp Ala Glu Leu Phe Ile Tyr Thr Asn Thr Ile Trp Ile Asn Pro
3260 3265 3270
Val Ile Leu Thr Pro Ile Phe Cys Leu Leu Leu Phe Leu Ser Leu
3275 3280 3285
Val Leu Thr Met Phe Leu Lys His Lys Phe Leu Phe Leu Gln Val
3290 3295 3300
Phe Leu Leu Pro Thr Val Ile Ala Thr Ala Leu Tyr Asn Cys Val
3305 3310 3315
Leu Asp Tyr Tyr Ile Val Lys Phe Leu Ala Asp His Phe Asn Tyr
3320 3325 3330
Asn Val Ser Val Leu Gln Met Asp Val Gln Gly Leu Val Asn Val
3335 3340 3345
Leu Val Cys Leu Phe Val Val Phe Leu His Thr Trp Arg Phe Ser
3350 3355 3360
Lys Glu Arg Phe Thr His Trp Phe Thr Tyr Val Cys Ser Leu Ile
3365 3370 3375
Ala Val Ala Tyr Thr Tyr Phe Tyr Ser Gly Asp Phe Leu Ser Leu
3380 3385 3390
Leu Val Met Phe Leu Cys Ala Ile Ser Ser Asp Trp Tyr Ile Gly
3395 3400 3405
Ala Ile Val Phe Arg Leu Ser Arg Leu Ile Ile Phe Phe Ser Pro
3410 3415 3420
Glu Ser Val Phe Ser Val Phe Gly Asp Val Lys Leu Thr Leu Val
3425 3430 3435
Val Tyr Leu Ile Cys Gly Tyr Leu Val Cys Thr Tyr Trp Gly Ile
3440 3445 3450
Leu Tyr Trp Phe Asn Arg Phe Phe Lys Cys Thr Met Gly Val Tyr
3455 3460 3465
Asp Phe Lys Val Ser Ala Ala Glu Phe Lys Tyr Met Val Ala Asn
3470 3475 3480
Gly Leu His Ala Pro Tyr Gly Pro Phe Asp Ala Leu Trp Leu Ser
3485 3490 3495
Phe Lys Leu Leu Gly Ile Gly Gly Asp Arg Cys Ile Lys Ile Ser
3500 3505 3510
Thr Val Gln Ser Lys Leu Thr Asp Leu Lys Cys Thr Asn Val Val
3515 3520 3525
Leu Leu Gly Cys Leu Ser Ser Met Asn Ile Ala Ala Asn Ser Ser
3530 3535 3540
Glu Trp Ala Tyr Cys Val Asp Leu His Asn Lys Ile Asn Leu Cys
3545 3550 3555
Asp Asp Pro Glu Lys Ala Gln Gly Met Leu Leu Ala Leu Leu Ala
3560 3565 3570
Phe Phe Leu Ser Lys His Ser Asp Phe Gly Leu Asp Gly Leu Ile
3575 3580 3585
Asp Ser Tyr Phe Asp Asn Ser Ser Thr Leu Gln Ser Val Ala Ser
3590 3595 3600
Ser Phe Val Ser Met Pro Ser Tyr Ile Ala Tyr Glu Asn Ala Arg
3605 3610 3615
Gln Ala Tyr Glu Asp Ala Ile Ala Asn Gly Ser Ser Ser Gln Leu
3620 3625 3630
Ile Lys Gln Leu Lys Arg Ala Met Asn Ile Ala Lys Ser Glu Phe
3635 3640 3645
Asp His Glu Ile Ser Val Gln Lys Lys Ile Asn Arg Met Ala Glu
3650 3655 3660
Gln Ala Ala Thr Gln Met Tyr Lys Glu Ala Arg Ser Val Asn Arg
3665 3670 3675
Lys Ser Lys Val Ile Ser Ala Met His Ser Leu Leu Phe Gly Met
3680 3685 3690
Leu Arg Arg Leu Asp Met Ser Ser Val Glu Thr Val Leu Asn Leu
3695 3700 3705
Ala Arg Asp Gly Val Val Pro Leu Ser Val Ile Pro Ala Thr Ser
3710 3715 3720
Ala Ser Lys Leu Thr Ile Val Ser Pro Asp Leu Glu Ser Tyr Ser
3725 3730 3735
Lys Ile Val Cys Asp Gly Ser Val His Tyr Ala Gly Val Val Trp
3740 3745 3750
Thr Leu Asn Asp Val Lys Asp Asn Asp Gly Arg Pro Val His Val
3755 3760 3765
Lys Glu Ile Thr Arg Glu Asn Val Glu Thr Leu Thr Trp Pro Leu
3770 3775 3780
Ile Leu Asn Cys Glu Arg Val Val Lys Leu Gln Asn Asn Glu Ile
3785 3790 3795
Met Pro Gly Lys Leu Lys Gln Lys Pro Met Lys Ala Glu Gly Asp
3800 3805 3810
Gly Gly Val Leu Gly Asp Gly Asn Ala Leu Tyr Asn Thr Glu Gly
3815 3820 3825
Gly Lys Thr Phe Met Tyr Ala Tyr Ile Ser Asn Lys Ala Asp Leu
3830 3835 3840
Lys Phe Val Lys Trp Glu Tyr Glu Gly Gly Cys Asn Thr Ile Glu
3845 3850 3855
Leu Asp Ser Pro Cys Arg Phe Met Val Glu Thr Pro Asn Gly Pro
3860 3865 3870
Gln Val Lys Tyr Leu Tyr Phe Val Lys Asn Leu Asn Thr Leu Arg
3875 3880 3885
Arg Gly Ala Val Leu Gly Phe Ile Gly Ala Thr Ile Arg Leu Gln
3890 3895 3900
Ala Gly Lys Gln Thr Glu Leu Ala Val Asn Ser Gly Leu Leu Thr
3905 3910 3915
Ala Cys Ala Phe Ser Val Asp Pro Ala Thr Thr Tyr Leu Glu Ala
3920 3925 3930
Val Lys His Gly Ala Lys Pro Val Ser Asn Cys Ile Lys Met Leu
3935 3940 3945
Ser Asn Gly Ala Gly Asn Gly Gln Ala Ile Thr Thr Ser Val Asp
3950 3955 3960
Ala Asn Thr Asn Gln Asp Ser Tyr Gly Gly Ala Ser Ile Cys Leu
3965 3970 3975
Tyr Cys Arg Ala His Val Pro His Pro Ser Met Asp Gly Tyr Cys
3980 3985 3990
Lys Phe Lys Gly Lys Cys Val Gln Val Pro Ile Gly Cys Leu Asp
3995 4000 4005
Pro Ile Arg Phe Cys Leu Glu Asn Asn Val Cys Asn Val Cys Gly
4010 4015 4020
Cys Trp Leu Gly His Gly Cys Ala Cys Asp Arg Thr Thr Ile Gln
4025 4030 4035
Ser Val Asp Ile Ser Tyr Leu Asn Glu Gln Gly Val Leu Val Gln
4040 4045 4050
Leu Asp
4055
<210> 15
<211> 4416
<212> PRT
<213> murine hepatitis virus
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 15
Met Ala Lys Met Gly Lys Tyr Gly Leu Gly Phe Lys Trp Ala Pro Glu
1 5 10 15
Phe Pro Trp Met Leu Pro Asn Ala Ser Glu Lys Leu Gly Ser Pro Glu
20 25 30
Arg Ser Glu Glu Asp Gly Phe Cys Pro Ser Ala Ala Gln Glu Pro Lys
35 40 45
Thr Lys Gly Lys Thr Leu Ile Asn His Val Arg Val Asp Cys Ser Arg
50 55 60
Leu Pro Ala Leu Glu Cys Cys Val Gln Ser Ala Ile Ile Arg Asp Ile
65 70 75 80
Phe Val Asp Glu Asp Pro Leu Asn Val Glu Ala Ser Thr Met Met Ala
85 90 95
Leu Gln Phe Gly Ser Ala Val Leu Val Lys Pro Ser Lys Arg Leu Ser
100 105 110
Ile Gln Ala Trp Ala Lys Leu Gly Val Leu Pro Lys Thr Pro Ala Met
115 120 125
Gly Leu Phe Lys Arg Phe Cys Leu Cys Asn Thr Arg Glu Cys Val Cys
130 135 140
Asp Ala His Val Ala Phe Gln Leu Phe Thr Val Gln Pro Asp Gly Val
145 150 155 160
Cys Leu Gly Asn Gly Arg Phe Ile Gly Trp Phe Val Pro Val Thr Ala
165 170 175
Ile Pro Ala Tyr Ala Lys Gln Trp Leu Gln Pro Trp Ser Ile Leu Leu
180 185 190
Arg Lys Gly Gly Asn Lys Gly Ser Val Thr Ser Gly His Phe Arg Arg
195 200 205
Ala Val Thr Met Pro Val Tyr Asp Phe Asn Val Glu Asp Ala Cys Glu
210 215 220
Glu Val His Leu Asn Pro Lys Gly Lys Tyr Ser Arg Lys Ala Tyr Ala
225 230 235 240
Leu Leu Lys Gly Tyr Arg Gly Val Lys Ser Ile Leu Phe Leu Asp Gln
245 250 255
Tyr Gly Cys Asp Tyr Thr Gly Arg Leu Ala Lys Gly Leu Glu Asp Tyr
260 265 270
Gly Asp Cys Thr Leu Glu Glu Met Lys Glu Leu Phe Pro Val Trp Cys
275 280 285
Asp Ser Leu Asp Asn Glu Val Val Val Ala Trp His Val Asp Arg Asp
290 295 300
Pro Arg Ala Val Met Arg Leu Gln Thr Leu Ala Thr Ile Arg Ser Ile
305 310 315 320
Gly Tyr Val Gly Gln Pro Thr Glu Asp Leu Val Asp Gly Asp Val Val
325 330 335
Val Arg Glu Pro Ala His Leu Leu Ala Ala Asn Ala Ile Val Lys Arg
340 345 350
Leu Pro Arg Leu Val Glu Thr Met Leu Tyr Thr Asp Ser Ser Val Thr
355 360 365
Glu Phe Cys Tyr Lys Thr Lys Leu Cys Asp Cys Gly Phe Ile Thr Gln
370 375 380
Phe Gly Tyr Val Asp Cys Cys Gly Asp Ala Cys Asp Phe Arg Gly Trp
385 390 395 400
Val Pro Gly Asn Met Met Asp Gly Phe Leu Cys Pro Gly Cys Ser Lys
405 410 415
Ser Tyr Met Pro Trp Glu Leu Glu Ala Gln Ser Ser Gly Val Ile Pro
420 425 430
Lys Gly Gly Val Leu Phe Thr Gln Ser Thr Asp Thr Val Asn Arg Glu
435 440 445
Ser Phe Lys Leu Tyr Gly His Ala Val Val Pro Phe Gly Ser Ala Val
450 455 460
Tyr Trp Ser Pro Tyr Pro Gly Met Trp Leu Pro Val Ile Trp Ser Ser
465 470 475 480
Val Lys Ser Tyr Ala Asp Leu Thr Tyr Thr Gly Val Val Gly Cys Lys
485 490 495
Ala Ile Val Gln Glu Thr Asp Ala Ile Cys Arg Ser Leu Tyr Met Asp
500 505 510
Tyr Val Gln His Lys Cys Gly Asn Leu Glu Gln Arg Ala Ile Leu Gly
515 520 525
Leu Asp Asp Val Tyr His Arg Gln Leu Leu Val Asn Arg Gly Asp Tyr
530 535 540
Ser Leu Leu Leu Glu Asn Val Asp Leu Phe Val Lys Arg Arg Ala Glu
545 550 555 560
Phe Ala Cys Lys Phe Ala Thr Cys Gly Asp Gly Leu Val Pro Leu Leu
565 570 575
Leu Asp Gly Leu Val Pro Arg Ser Tyr Tyr Leu Ile Lys Ser Gly Gln
580 585 590
Ala Phe Thr Ser Met Met Val Asn Phe Ser His Glu Val Thr Asp Met
595 600 605
Cys Met Asp Met Ala Leu Leu Phe Met His Asp Val Lys Val Ala Thr
610 615 620
Lys Tyr Val Lys Lys Val Thr Gly Lys Leu Ala Val Arg Phe Lys Ala
625 630 635 640
Leu Gly Val Ala Val Val Arg Lys Ile Thr Glu Trp Phe Asp Leu Ala
645 650 655
Val Asp Thr Ala Ala Ser Ala Ala Gly Trp Leu Cys Tyr Gln Leu Val
660 665 670
Asn Gly Leu Phe Ala Val Ala Asn Gly Gly Ile Thr Phe Leu Ser Asp
675 680 685
Val Pro Glu Leu Val Lys Asn Phe Val Asp Lys Phe Lys Val Phe Phe
690 695 700
Lys Val Leu Ile Asp Ser Met Ser Val Ser Val Leu Ser Gly Leu Thr
705 710 715 720
Val Val Lys Thr Ala Ser Asn Arg Val Cys Leu Ala Gly Cys Lys Val
725 730 735
Tyr Glu Val Val Gln Lys Arg Leu Ser Ala Tyr Val Met Pro Val Gly
740 745 750
Cys Asn Glu Ala Thr Cys Leu Val Gly Glu Ile Glu Pro Ala Val Val
755 760 765
Glu Asp Asp Val Val Asp Val Val Lys Ala Pro Leu Thr Tyr Gln Gly
770 775 780
Cys Cys Lys Pro Pro Thr Ser Phe Glu Lys Ile Cys Val Val Asp Lys
785 790 795 800
Leu Tyr Met Ala Lys Cys Gly Asp Gln Phe Tyr Pro Val Val Val Asp
805 810 815
Asn Asp Thr Ile Gly Val Leu Asp Gln Cys Trp Arg Phe Pro Cys Ala
820 825 830
Gly Lys Lys Val Glu Phe Asn Asp Lys Pro Lys Val Lys Glu Ile Pro
835 840 845
Ser Thr Arg Lys Ile Lys Ile Asn Phe Ala Leu Asp Ala Thr Phe Asp
850 855 860
Ser Val Leu Ser Lys Ala Cys Ser Glu Phe Glu Val Asp Lys Asp Val
865 870 875 880
Thr Leu Asp Glu Leu Leu Asp Val Val Leu Asp Ala Val Glu Ser Thr
885 890 895
Leu Ser Pro Cys Lys Glu His Asp Val Ile Gly Thr Lys Val Cys Ala
900 905 910
Leu Leu Asn Arg Leu Ala Glu Asp Tyr Val Tyr Leu Phe Asp Glu Gly
915 920 925
Gly Glu Glu Val Ile Ala Pro Lys Met Tyr Cys Ser Phe Ser Ala Pro
930 935 940
Asp Asp Glu Asp Cys Val Ala Ala Asp Val Val Asp Ala Asp Glu Asn
945 950 955 960
Gln Gly Asp Asp Ala Asp Asp Ser Ala Ala Leu Val Thr Asp Thr Gln
965 970 975
Glu Glu Asp Gly Val Ala Lys Gly Gln Val Gly Val Ala Glu Ser Asp
980 985 990
Ala Arg Leu Asp Gln Val Glu Ala Phe Asp Ile Glu Lys Val Glu Asp
995 1000 1005
Pro Ile Leu Asn Glu Leu Ser Ala Glu Leu Asn Ala Pro Ala Asp
1010 1015 1020
Lys Thr Tyr Glu Asp Val Leu Ala Phe Asp Ala Ile Tyr Ser Glu
1025 1030 1035
Ala Leu Ser Ala Phe Tyr Ala Val Pro Gly Asp Glu Thr His Phe
1040 1045 1050
Lys Val Cys Gly Phe Tyr Ser Pro Ala Ile Glu Arg Thr Asn Cys
1055 1060 1065
Trp Leu Arg Ser Thr Leu Ile Val Met Gln Ser Leu Pro Leu Glu
1070 1075 1080
Phe Lys Asp Leu Glu Met Gln Lys Leu Trp Leu Ser Tyr Lys Ser
1085 1090 1095
Ser Tyr Asn Lys Glu Phe Val Asp Lys Leu Val Lys Ser Val Pro
1100 1105 1110
Lys Ser Ile Ile Leu Pro Gln Gly Gly Tyr Val Ala Asp Phe Ala
1115 1120 1125
Tyr Phe Phe Leu Ser Gln Cys Ser Phe Lys Ala Tyr Ala Asn Trp
1130 1135 1140
Arg Cys Leu Lys Cys Asp Met Asp Leu Lys Leu Gln Gly Leu Asp
1145 1150 1155
Ala Met Phe Phe Tyr Gly Asp Val Val Ser His Val Cys Lys Cys
1160 1165 1170
Gly Thr Gly Met Thr Leu Leu Ser Ala Asp Ile Pro Tyr Thr Leu
1175 1180 1185
His Phe Gly Leu Arg Asp Asp Lys Phe Cys Ala Phe Tyr Thr Pro
1190 1195 1200
Arg Lys Val Phe Arg Ala Ala Cys Val Val Asp Val Asn Asp Cys
1205 1210 1215
His Ser Met Ala Val Val Asp Gly Lys Gln Ile Asp Gly Lys Val
1220 1225 1230
Val Thr Lys Phe Asn Gly Asp Lys Tyr Asp Phe Met Val Gly His
1235 1240 1245
Gly Met Ala Phe Ser Met Ser Ala Phe Glu Ile Ala Gln Leu Tyr
1250 1255 1260
Gly Ser Cys Ile Thr Pro Asn Val Cys Phe Val Lys Gly Asp Val
1265 1270 1275
Ile Lys Val Leu Arg Arg Val Gly Ala Glu Val Ile Val Asn Pro
1280 1285 1290
Ala Asn Gly Arg Met Ala His Gly Ala Gly Val Ala Gly Ala Ile
1295 1300 1305
Ala Lys Ala Ala Gly Lys Ser Phe Ile Lys Glu Thr Ala Asp Met
1310 1315 1320
Val Lys Asn Gln Gly Val Cys Gln Val Gly Glu Cys Tyr Glu Ser
1325 1330 1335
Thr Gly Gly Asn Leu Cys Lys Thr Val Leu Asn Ile Val Gly Pro
1340 1345 1350
Asp Ala Arg Gly His Gly Lys Gln Cys Tyr Ser Phe Leu Glu Arg
1355 1360 1365
Ala Tyr Gln His Ile Asn Lys Cys Asp Asp Val Val Thr Thr Leu
1370 1375 1380
Ile Ser Ala Gly Ile Phe Ser Val Pro Thr Asp Val Ser Leu Thr
1385 1390 1395
Tyr Leu Ile Gly Val Val Thr Lys Asn Val Ile Leu Val Ser Asn
1400 1405 1410
Asn Lys Asp Asp Phe Asp Val Ile Glu Lys Cys Gln Val Thr Ser
1415 1420 1425
Ile Ala Gly Thr Lys Ala Leu Ser Leu Gln Leu Ala Lys Asn Leu
1430 1435 1440
Cys Arg Asp Val Lys Phe Glu Thr Asn Ala Cys Asp Ser Leu Phe
1445 1450 1455
Ser Asp Ser Cys Phe Val Ser Ser Tyr Asp Val Leu Gln Glu Val
1460 1465 1470
Glu Leu Leu Arg His Asp Ile Gln Leu Asp Asp Asp Ala Arg Val
1475 1480 1485
Phe Val Gln Ala His Met Asp Asn Leu Pro Ala Asp Trp Arg Leu
1490 1495 1500
Val Asn Lys Phe Asp Ser Val Asp Gly Val Arg Thr Val Lys Tyr
1505 1510 1515
Phe Glu Cys Pro Gly Glu Ile Phe Val Ser Ser Gln Gly Lys Lys
1520 1525 1530
Phe Gly Tyr Val Gln Asn Gly Ser Phe Lys Val Ala Ser Val Ser
1535 1540 1545
Gln Ile Arg Ala Leu Leu Ala Asn Lys Val Asp Val Leu Cys Thr
1550 1555 1560
Val Asp Gly Val Asn Phe Arg Ser Cys Cys Val Ala Glu Gly Glu
1565 1570 1575
Val Phe Gly Lys Thr Leu Gly Ser Val Phe Cys Asp Gly Ile Asn
1580 1585 1590
Val Thr Lys Val Arg Cys Ser Ala Ile His Lys Gly Lys Val Phe
1595 1600 1605
Phe Gln Tyr Ser Gly Leu Ser Ala Ala Asp Leu Val Ala Val Thr
1610 1615 1620
Asp Ala Phe Gly Phe Asp Glu Pro Gln Leu Leu Lys Tyr Tyr Asn
1625 1630 1635
Met Leu Gly Met Cys Lys Trp Pro Val Val Val Cys Gly Asn Tyr
1640 1645 1650
Phe Ala Phe Lys Gln Ser Asn Asn Asn Cys Tyr Ile Asn Val Ala
1655 1660 1665
Cys Leu Met Leu Gln His Leu Ser Leu Lys Phe His Lys Trp Gln
1670 1675 1680
Trp Gln Glu Ala Trp Asn Glu Phe Arg Ser Gly Lys Pro Leu Arg
1685 1690 1695
Phe Val Ser Leu Val Leu Ala Lys Gly Ser Phe Lys Phe Asn Glu
1700 1705 1710
Pro Ser Asp Ser Thr Asp Phe Met Arg Val Val Leu Arg Glu Ala
1715 1720 1725
Asp Leu Ser Gly Ala Thr Cys Asp Phe Glu Phe Val Cys Lys Cys
1730 1735 1740
Gly Val Lys Gln Glu Gln Arg Lys Gly Val Asp Ala Val Met His
1745 1750 1755
Phe Gly Thr Leu Asp Lys Gly Asp Leu Ala Lys Gly Tyr Thr Ile
1760 1765 1770
Ala Cys Thr Cys Gly Asn Lys Leu Val His Cys Thr Gln Leu Asn
1775 1780 1785
Val Pro Phe Leu Ile Cys Ser Asn Lys Pro Glu Gly Lys Lys Leu
1790 1795 1800
Pro Asp Asp Val Val Ala Ala Asn Ile Phe Thr Gly Gly Ser Leu
1805 1810 1815
Gly His Tyr Thr His Val Lys Cys Lys Pro Lys Tyr Gln Leu Tyr
1820 1825 1830
Asp Ala Cys Asn Val Ser Lys Val Ser Glu Ala Lys Gly Asn Phe
1835 1840 1845
Thr Asp Cys Leu Tyr Leu Lys Asn Leu Lys Gln Thr Phe Ser Ser
1850 1855 1860
Lys Leu Thr Thr Phe Tyr Leu Asp Asp Val Lys Cys Val Glu Tyr
1865 1870 1875
Asn Pro Asp Leu Ser Gln Tyr Tyr Cys Glu Ser Gly Lys Tyr Tyr
1880 1885 1890
Thr Lys Pro Ile Ile Lys Ala Gln Phe Arg Thr Phe Glu Lys Val
1895 1900 1905
Glu Gly Val Tyr Thr Asn Phe Lys Leu Val Gly His Ser Ile Ala
1910 1915 1920
Glu Lys Phe Asn Ala Lys Leu Gly Phe Asp Cys Asn Ser Pro Phe
1925 1930 1935
Thr Glu Tyr Lys Ile Thr Glu Trp Pro Thr Ala Thr Gly Asp Val
1940 1945 1950
Val Leu Ala Ser Asp Asp Leu Tyr Val Ser Arg Tyr Ser Gly Gly
1955 1960 1965
Cys Val Thr Phe Gly Lys Pro Val Ile Trp Leu Gly His Glu Glu
1970 1975 1980
Ala Ser Leu Lys Ser Leu Thr Tyr Phe Asn Arg Pro Ser Val Val
1985 1990 1995
Cys Glu Asn Lys Phe Asn Val Leu Pro Val Asp Val Ser Glu Pro
2000 2005 2010
Thr Asp Lys Gly Pro Val Pro Ala Ala Val Leu Val Thr Gly Ala
2015 2020 2025
Leu Ser Gly Ala Ala Thr Ala Pro Gly Thr Ala Lys Glu Gln Lys
2030 2035 2040
Val Cys Ala Ser Asp Ser Val Val Asp Gln Val Val Ser Gly Phe
2045 2050 2055
Leu Ser Asp Leu Ser Gly Ala Thr Val Asp Val Lys Glu Val Lys
2060 2065 2070
Leu Asn Gly Val Lys Lys Pro Ile Lys Val Glu Asp Ser Val Val
2075 2080 2085
Val Asn Asp Pro Thr Ser Glu Thr Lys Val Val Lys Ser Leu Ser
2090 2095 2100
Ile Val Asp Val Tyr Asp Met Phe Leu Thr Gly Cys Arg Tyr Val
2105 2110 2115
Val Trp Met Ala Asn Glu Leu Ser Arg Leu Val Asn Ser Pro Thr
2120 2125 2130
Val Arg Glu Tyr Val Lys Trp Gly Met Thr Lys Ile Val Ile Pro
2135 2140 2145
Ala Lys Leu Val Leu Leu Arg Asp Glu Lys Gln Glu Phe Val Ala
2150 2155 2160
Pro Lys Val Val Lys Ala Lys Val Ile Ala Cys Tyr Ser Ala Val
2165 2170 2175
Lys Trp Phe Phe Leu Tyr Cys Phe Ser Trp Ile Lys Phe Asn Thr
2180 2185 2190
Asp Asn Lys Val Ile Tyr Thr Thr Glu Val Ala Ser Lys Leu Thr
2195 2200 2205
Phe Asn Leu Cys Cys Leu Ala Phe Lys Asn Ala Leu Gln Thr Phe
2210 2215 2220
Asn Trp Asn Val Val Ser Arg Gly Phe Phe Leu Val Ala Thr Val
2225 2230 2235
Phe Leu Leu Trp Phe Asn Phe Leu Tyr Ala Asn Val Ile Leu Ser
2240 2245 2250
Asp Phe Tyr Leu Pro Asn Ile Gly Phe Phe Pro Thr Phe Val Gly
2255 2260 2265
Gln Ile Val Ala Trp Val Lys Thr Thr Phe Gly Ile Phe Thr Leu
2270 2275 2280
Cys Asp Leu Tyr Gln Val Ser Asp Val Gly Tyr Arg Ser Ser Phe
2285 2290 2295
Cys Asn Gly Ser Met Val Cys Glu Leu Cys Phe Ser Gly Phe Asp
2300 2305 2310
Met Leu Asp Asn Tyr Asp Ala Ile Asn Val Val Gln His Val Val
2315 2320 2325
Asp Arg Arg Val Ser Phe Asp Tyr Ile Ser Leu Phe Lys Leu Val
2330 2335 2340
Val Glu Leu Val Ile Gly Tyr Ser Leu Tyr Thr Val Cys Phe Tyr
2345 2350 2355
Pro Leu Phe Gly Leu Ile Gly Met Gln Leu Leu Thr Thr Trp Leu
2360 2365 2370
Pro Glu Phe Phe Met Leu Glu Thr Met His Trp Ser Ala Arg Phe
2375 2380 2385
Phe Val Phe Val Ala Asn Met Leu Pro Ala Phe Thr Leu Leu Arg
2390 2395 2400
Phe Tyr Ile Val Val Thr Ala Met Tyr Lys Ile Phe Cys Leu Cys
2405 2410 2415
Arg His Val Met Tyr Gly Cys Ser Arg Pro Gly Cys Leu Phe Cys
2420 2425 2430
Tyr Lys Arg Asn Arg Ser Val Arg Val Lys Cys Ser Thr Val Val
2435 2440 2445
Gly Gly Thr Leu Arg Tyr Tyr Asp Val Met Ala Asn Gly Gly Thr
2450 2455 2460
Gly Phe Cys Ala Lys His Gln Trp Asn Cys Leu Asn Cys Ser Ala
2465 2470 2475
Phe Gly Pro Gly Asn Thr Phe Ile Thr His Glu Ala Ala Ala Asp
2480 2485 2490
Leu Ser Lys Glu Leu Lys Arg Pro Val Asn Pro Thr Asp Ser Ala
2495 2500 2505
Tyr Tyr Leu Val Thr Glu Val Lys Gln Val Gly Cys Ser Met Arg
2510 2515 2520
Leu Phe Tyr Glu Arg Asp Gly Gln Arg Val Tyr Asp Asp Val Ser
2525 2530 2535
Ala Ser Leu Phe Val Asp Met Asn Gly Leu Leu His Ser Lys Val
2540 2545 2550
Lys Gly Val Pro Glu Thr His Val Val Val Val Glu Asn Glu Ala
2555 2560 2565
Asp Lys Ala Gly Phe Leu Asn Ala Ala Val Phe Tyr Ala Gln Ser
2570 2575 2580
Leu Tyr Arg Pro Met Leu Leu Val Glu Lys Lys Leu Ile Thr Thr
2585 2590 2595
Ala Asn Thr Gly Leu Ser Val Ser Gln Thr Met Phe Asp Leu Tyr
2600 2605 2610
Val Asp Ser Leu Leu Gly Val Leu Asp Val Asp Arg Lys Ser Leu
2615 2620 2625
Thr Ser Phe Val Asn Ala Ala His Asn Ser Leu Lys Glu Gly Val
2630 2635 2640
Gln Leu Glu Gln Val Met Asp Thr Phe Ile Gly Cys Ala Arg Arg
2645 2650 2655
Lys Cys Ala Ile Asp Ser Asp Val Glu Thr Lys Ser Ile Thr Lys
2660 2665 2670
Ser Ile Met Ser Ala Val Asn Ala Gly Val Asp Phe Thr Asp Glu
2675 2680 2685
Ser Cys Asn Asn Leu Val Pro Thr Tyr Val Lys Ser Asp Thr Ile
2690 2695 2700
Val Ala Ala Asp Leu Gly Val Leu Ile Gln Asn Asn Ala Lys His
2705 2710 2715
Val Gln Ala Asn Val Ala Lys Ala Ala Asn Val Ala Cys Ile Trp
2720 2725 2730
Ser Val Asp Ala Phe Asn Gln Leu Ser Ala Asp Leu Gln His Arg
2735 2740 2745
Leu Arg Lys Ala Cys Ser Lys Thr Gly Leu Lys Ile Lys Leu Thr
2750 2755 2760
Tyr Asn Lys Gln Glu Ala Asn Val Pro Ile Leu Thr Thr Pro Phe
2765 2770 2775
Ser Leu Lys Gly Gly Ala Val Phe Ser Lys Val Leu Gln Trp Leu
2780 2785 2790
Phe Val Val Asn Leu Ile Cys Phe Ile Val Leu Trp Ala Leu Met
2795 2800 2805
Pro Thr Tyr Ala Val His Lys Ser Asp Met Gln Leu Pro Leu Tyr
2810 2815 2820
Ala Ser Phe Lys Val Ile Asp Asn Gly Val Leu Arg Asp Val Thr
2825 2830 2835
Val Thr Asp Ala Cys Phe Ala Asn Lys Phe Ile Gln Phe Asp Gln
2840 2845 2850
Trp Tyr Glu Ser Thr Phe Gly Leu Val Tyr Tyr Arg Asn Ser Arg
2855 2860 2865
Ala Cys Pro Val Val Val Ala Val Ile Asp Gln Asp Ile Gly Tyr
2870 2875 2880
Thr Leu Phe Asn Val Pro Thr Lys Val Leu Arg Tyr Gly Phe His
2885 2890 2895
Val Leu His Phe Ile Thr His Ala Phe Ala Thr Asp Ser Val Gln
2900 2905 2910
Cys Tyr Thr Pro His Met Gln Ile Pro Tyr Asp Asn Phe Tyr Ala
2915 2920 2925
Ser Gly Cys Val Leu Ser Ser Leu Cys Thr Met Leu Ala His Ala
2930 2935 2940
Asp Gly Thr Pro His Pro Tyr Cys Tyr Thr Glu Gly Ile Met His
2945 2950 2955
Asn Ala Ser Leu Tyr Asp Ser Leu Ala Pro His Val Arg Tyr Asn
2960 2965 2970
Leu Ala Asn Ser Asn Gly Tyr Ile Arg Phe Pro Glu Val Val Ser
2975 2980 2985
Glu Gly Ile Val Arg Ile Val Arg Thr Arg Ser Met Thr Tyr Cys
2990 2995 3000
Arg Val Gly Leu Cys Glu Asp Ala Glu Glu Gly Val Cys Phe Asn
3005 3010 3015
Phe Asn Ser Ser Trp Val Leu Asn Asn Pro Tyr Tyr Arg Ala Met
3020 3025 3030
Pro Gly Thr Phe Cys Gly Arg Asn Ala Phe Asp Leu Ile His Gln
3035 3040 3045
Val Leu Gly Gly Leu Val Arg Pro Ile Asp Phe Phe Ala Leu Thr
3050 3055 3060
Ala Ser Ser Val Ala Gly Ala Ile Leu Ala Ile Ile Val Val Leu
3065 3070 3075
Ala Phe Tyr Tyr Leu Ile Lys Leu Lys Arg Ala Phe Gly Asp Tyr
3080 3085 3090
Thr Ser Val Val Val Ile Asn Val Ile Val Trp Cys Ile Asn Phe
3095 3100 3105
Leu Met Leu Phe Val Phe Gln Val Tyr Pro Thr Leu Ser Cys Leu
3110 3115 3120
Tyr Ala Cys Phe Tyr Phe Tyr Thr Thr Leu Tyr Phe Pro Ser Glu
3125 3130 3135
Ile Ser Val Val Met His Leu Gln Trp Leu Val Met Tyr Gly Ala
3140 3145 3150
Ile Met Pro Leu Trp Phe Cys Ile Ile Tyr Val Ala Val Val Val
3155 3160 3165
Ser Asn His Ala Leu Trp Leu Phe Ser Tyr Cys Arg Lys Leu Gly
3170 3175 3180
Thr Glu Val Arg Ser Asp Gly Thr Phe Glu Glu Met Ser Leu Thr
3185 3190 3195
Thr Phe Met Ile Thr Lys Glu Ser Tyr Cys Lys Leu Lys Asn Ser
3200 3205 3210
Val Ser Asp Val Ala Phe Asn Arg Tyr Leu Ser Leu Tyr Asn Lys
3215 3220 3225
Tyr Arg Tyr Phe Ser Gly Lys Met Asp Thr Ala Ala Tyr Arg Glu
3230 3235 3240
Ala Ala Cys Ser Gln Leu Ala Lys Ala Met Glu Thr Phe Asn His
3245 3250 3255
Asn Asn Gly Asn Asp Val Leu Tyr Gln Pro Pro Thr Ala Ser Val
3260 3265 3270
Thr Thr Ser Phe Leu Gln Ser Gly Ile Val Lys Met Val Phe Pro
3275 3280 3285
Thr Ser Lys Val Glu Pro Cys Val Val Ser Val Thr Tyr Gly Asn
3290 3295 3300
Met Thr Leu Asn Gly Leu Trp Leu Asp Asp Lys Val Tyr Cys Pro
3305 3310 3315
Arg His Val Ile Cys Ser Ser Ala Asp Met Thr Asp Pro Asp Tyr
3320 3325 3330
Ser Asn Leu Leu Cys Arg Val Ile Ser Ser Asp Phe Cys Val Met
3335 3340 3345
Ser Gly Arg Met Ser Leu Thr Val Met Ser Tyr Gln Met Gln Gly
3350 3355 3360
Ser Leu Leu Val Leu Thr Val Thr Leu Gln Asn Pro Asn Thr Pro
3365 3370 3375
Lys Tyr Ser Phe Gly Val Val Lys Pro Gly Glu Thr Phe Thr Val
3380 3385 3390
Leu Ala Ala Tyr Asn Gly Lys Ser Gln Gly Ala Phe His Val Thr
3395 3400 3405
Met Arg Ser Ser Tyr Thr Ile Lys Gly Ser Phe Leu Cys Gly Ser
3410 3415 3420
Cys Gly Ser Val Gly Tyr Val Leu Thr Gly Asp Ser Val Arg Phe
3425 3430 3435
Val Tyr Met His Gln Leu Glu Leu Ser Thr Gly Cys His Thr Gly
3440 3445 3450
Thr Asp Phe Ser Gly Asn Phe Tyr Gly Pro Tyr Arg Asp Ala Gln
3455 3460 3465
Val Val Gln Leu Pro Val Gln Asp Tyr Thr Gln Thr Val Asn Val
3470 3475 3480
Val Ala Trp Leu Tyr Ala Ala Ile Leu Asn Arg Cys Asn Trp Phe
3485 3490 3495
Val Gln Ser Asp Ser Cys Ser Leu Glu Glu Phe Asn Val Trp Ala
3500 3505 3510
Met Thr Asn Gly Phe Ser Ser Ile Lys Ala Asp Leu Val Leu Asp
3515 3520 3525
Ala Leu Ala Ser Met Thr Gly Val Thr Val Glu Gln Ile Leu Ala
3530 3535 3540
Ala Ile Lys Arg Leu Tyr Ser Gly Phe Gln Gly Lys Gln Ile Leu
3545 3550 3555
Gly Ser Cys Val Leu Glu Asp Glu Leu Thr Pro Ser Asp Val Tyr
3560 3565 3570
Gln Gln Leu Ala Gly Val Lys Leu Gln Ser Lys Arg Thr Arg Val
3575 3580 3585
Val Lys Gly Thr Cys Cys Trp Ile Leu Ala Ser Thr Leu Leu Phe
3590 3595 3600
Cys Ser Ile Ile Ser Ala Phe Val Lys Trp Thr Met Phe Met Tyr
3605 3610 3615
Val Thr Thr His Met Leu Gly Val Thr Leu Cys Ala Leu Cys Phe
3620 3625 3630
Val Ser Phe Ala Met Leu Leu Val Lys His Lys His Leu Tyr Leu
3635 3640 3645
Thr Met Phe Ile Met Pro Val Leu Cys Thr Leu Phe Tyr Thr Asn
3650 3655 3660
Tyr Leu Val Val Tyr Lys Gln Ser Phe Arg Gly Leu Ala Tyr Ala
3665 3670 3675
Trp Leu Ser His Phe Val Pro Ala Val Asp Tyr Thr Tyr Met Asp
3680 3685 3690
Glu Val Leu Tyr Gly Val Val Leu Leu Val Ala Met Val Phe Val
3695 3700 3705
Thr Met Arg Ser Ile Asn His Asp Val Phe Ser Val Met Phe Leu
3710 3715 3720
Val Gly Arg Leu Val Ser Leu Val Ser Met Trp Tyr Phe Gly Ala
3725 3730 3735
Asn Leu Glu Glu Glu Val Leu Leu Phe Leu Thr Ser Leu Phe Gly
3740 3745 3750
Thr Tyr Thr Trp Thr Thr Met Leu Ser Leu Ala Thr Ala Lys Val
3755 3760 3765
Ile Ala Lys Trp Leu Ala Val Asn Val Leu Tyr Phe Thr Asp Val
3770 3775 3780
Pro Gln Val Lys Leu Val Leu Leu Ser Tyr Leu Cys Ile Gly Tyr
3785 3790 3795
Val Cys Cys Cys Tyr Trp Gly Val Leu Ser Leu Leu Asn Ser Ile
3800 3805 3810
Phe Arg Met Pro Leu Gly Val Tyr Asn Tyr Lys Ile Ser Val Gln
3815 3820 3825
Glu Leu Arg Tyr Met Asn Ala Asn Gly Leu Arg Pro Pro Arg Asn
3830 3835 3840
Ser Phe Glu Ala Leu Val Leu Asn Phe Lys Leu Leu Gly Ile Gly
3845 3850 3855
Gly Val Pro Val Ile Glu Val Ser Gln Ile Gln Ser Arg Leu Thr
3860 3865 3870
Asp Val Lys Cys Val Asn Val Val Leu Leu Asn Cys Leu Gln His
3875 3880 3885
Leu His Ile Ala Ser Ser Ser Lys Leu Trp Gln Tyr Cys Ser Thr
3890 3895 3900
Leu His Asn Glu Ile Leu Ala Thr Ser Asp Leu Ser Val Ala Phe
3905 3910 3915
Asp Lys Leu Ala Gln Leu Leu Val Val Leu Phe Ala Asn Pro Ala
3920 3925 3930
Ala Val Asp Ser Lys Cys Leu Ala Ser Ile Glu Glu Val Ser Asp
3935 3940 3945
Asp Tyr Val Arg Asp Ser Thr Val Leu Gln Ala Leu Gln Ser Glu
3950 3955 3960
Phe Val Asn Met Ala Ser Phe Val Glu Tyr Glu Leu Ala Lys Lys
3965 3970 3975
Asn Leu Asp Glu Ala Lys Ala Ser Gly Ser Ala Asn Gln Gln Gln
3980 3985 3990
Ile Lys Gln Leu Glu Lys Ala Cys Asn Ile Ala Lys Ser Ala Tyr
3995 4000 4005
Glu Arg Asp Arg Ala Val Ala Arg Lys Leu Glu Arg Met Ala Asp
4010 4015 4020
Leu Ala Leu Thr Asn Met Tyr Lys Glu Ala Arg Ile Asn Asp Lys
4025 4030 4035
Lys Ser Lys Val Val Ser Ala Leu Gln Thr Met Leu Phe Ser Met
4040 4045 4050
Ile Arg Lys Leu Asp Asn Gln Ala Leu Asn Ser Ile Leu Asp Asn
4055 4060 4065
Ala Val Lys Gly Cys Val Pro Leu Asn Ala Ile Pro Ser Leu Thr
4070 4075 4080
Ser Asn Thr Leu Thr Ile Ile Val Pro Asp Lys Gln Val Phe Asp
4085 4090 4095
Gln Val Val Asp Asn Val Tyr Val Thr Tyr Ala Gly Asn Val Trp
4100 4105 4110
His Ile Gln Ser Ile Gln Asp Ala Asp Gly Ala Val Lys Gln Leu
4115 4120 4125
Asn Glu Ile Asp Val Asn Ile Thr Trp Pro Leu Val Ile Ala Ala
4130 4135 4140
Asn Arg His Asn Glu Val Ser Ser Val Val Leu Gln Asn Asn Glu
4145 4150 4155
Leu Met Pro Gln Lys Leu Arg Thr Gln Val Val Asn Ser Gly Ser
4160 4165 4170
Asp Met Asn Cys Asn Thr Pro Thr Gln Cys Tyr Tyr Asn Thr Thr
4175 4180 4185
Gly Met Gly Lys Ile Val Tyr Ala Ile Leu Ser Asp Cys Asp Gly
4190 4195 4200
Leu Lys Tyr Thr Lys Ile Val Lys Glu Asp Gly Asn Cys Val Val
4205 4210 4215
Leu Glu Leu Asp Pro Pro Cys Lys Phe Ser Val Gln Asp Val Lys
4220 4225 4230
Gly Leu Lys Ile Lys Tyr Leu Tyr Phe Val Lys Gly Cys Asn Thr
4235 4240 4245
Leu Ala Arg Gly Trp Val Val Gly Thr Leu Ser Ser Thr Val Arg
4250 4255 4260
Leu Gln Ala Gly Thr Ala Thr Glu Tyr Ala Ser Asn Ser Ala Ile
4265 4270 4275
Arg Ser Leu Cys Ala Phe Ser Val Asp Pro Lys Lys Thr Tyr Leu
4280 4285 4290
Asp Tyr Ile Gln Gln Gly Gly Ala Pro Val Thr Asn Cys Val Lys
4295 4300 4305
Met Leu Cys Asp His Ala Gly Thr Gly Met Ala Ile Thr Ile Lys
4310 4315 4320
Pro Glu Ala Thr Thr Asn Gln Asp Ser Tyr Gly Gly Ala Ser Val
4325 4330 4335
Cys Ile Tyr Cys Arg Ser Arg Val Glu His Pro Asp Val Asp Gly
4340 4345 4350
Leu Cys Lys Leu Arg Gly Lys Phe Val Gln Val Pro Leu Gly Ile
4355 4360 4365
Lys Asp Pro Val Ser Tyr Val Leu Thr His Asp Val Cys Gln Val
4370 4375 4380
Cys Gly Phe Trp Arg Asp Gly Ser Cys Ser Cys Val Gly Thr Gly
4385 4390 4395
Ser Gln Phe Gln Ser Lys Asp Thr Asn Phe Leu Asn Gly Phe Gly
4400 4405 4410
Val Gln Val
4415
<210> 16
<211> 4373
<212> PRT
<213> human coronavirus OC43
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 16
Met Ser Lys Ile Asn Lys Tyr Gly Leu Glu Leu His Trp Ala Pro Glu
1 5 10 15
Phe Pro Trp Met Phe Glu Asp Ala Glu Glu Lys Leu Asp Asn Pro Ser
20 25 30
Ser Ser Glu Val Asp Met Ile Cys Ser Thr Thr Ala Gln Lys Leu Glu
35 40 45
Thr Asp Gly Ile Cys Pro Glu Asn His Val Met Val Asp Cys Arg Arg
50 55 60
Leu Leu Lys Gln Glu Cys Cys Val Gln Ser Ser Leu Ile Arg Glu Ile
65 70 75 80
Val Met Asn Ala Ser Pro Tyr Asp Leu Glu Val Leu Leu Gln Asp Ala
85 90 95
Leu Gln Ser Arg Glu Ala Val Leu Val Thr Thr Pro Leu Gly Met Ser
100 105 110
Leu Glu Ala Cys Tyr Val Arg Gly Cys Asn Pro Lys Gly Trp Thr Met
115 120 125
Gly Leu Phe Arg Arg Arg Ser Val Cys Asn Thr Gly Arg Cys Thr Val
130 135 140
Asn Lys His Val Ala Tyr Gln Leu Tyr Met Ile Asp Pro Ala Gly Val
145 150 155 160
Cys Leu Gly Ala Gly Gln Phe Val Gly Trp Val Ile Pro Leu Ala Phe
165 170 175
Met Pro Val Gln Ser Arg Lys Phe Ile Val Pro Trp Val Met Tyr Leu
180 185 190
Arg Lys Arg Gly Glu Lys Gly Ala Tyr Asn Lys Asp His Gly Arg Gly
195 200 205
Gly Phe Gly His Val Tyr Asp Phe Lys Val Glu Asp Ala Tyr Asp Gln
210 215 220
Val His Asp Glu Pro Lys Gly Lys Phe Ser Lys Lys Ala Tyr Ala Leu
225 230 235 240
Ile Arg Gly Tyr Arg Gly Val Lys Pro Leu Leu Tyr Val Asp Gln Tyr
245 250 255
Gly Cys Asp Tyr Thr Gly Ser Leu Ala Asp Gly Leu Glu Ala Tyr Ala
260 265 270
Asp Lys Thr Leu Gln Glu Met Lys Ala Leu Phe Pro Thr Trp Ser Gln
275 280 285
Glu Leu Leu Phe Asp Val Ile Val Ala Trp His Val Val Arg Asp Pro
290 295 300
Arg Tyr Val Met Arg Leu Gln Ser Ala Ala Thr Ile Arg Ser Val Ala
305 310 315 320
Tyr Val Ala Asn Pro Thr Glu Asp Leu Cys Asp Gly Ser Val Val Ile
325 330 335
Lys Glu Pro Val His Val Tyr Ala Asp Asp Ser Ile Ile Leu Arg Gln
340 345 350
Tyr Asn Leu Val Asp Ile Met Ser His Phe Tyr Met Glu Ala Asp Thr
355 360 365
Val Val Asn Ala Phe Tyr Gly Val Ala Leu Lys Asp Cys Gly Phe Val
370 375 380
Met Gln Phe Gly Tyr Ile Asp Cys Glu Gln Asp Ser Cys Asp Phe Lys
385 390 395 400
Gly Trp Ile Pro Gly Asn Met Ile Asp Gly Phe Ala Cys Thr Thr Cys
405 410 415
Gly His Val Tyr Glu Val Gly Asp Leu Ile Ala Gln Ser Ser Gly Val
420 425 430
Leu Pro Val Asn Pro Val Leu His Thr Lys Ser Ala Ala Gly Tyr Gly
435 440 445
Gly Phe Gly Cys Lys Asp Ser Phe Thr Leu Tyr Gly Gln Thr Val Val
450 455 460
Tyr Phe Gly Gly Cys Val Tyr Trp Ser Pro Ala Arg Asn Ile Trp Ile
465 470 475 480
Pro Ile Leu Lys Ser Ser Val Lys Ser Tyr Asp Ser Leu Val Tyr Thr
485 490 495
Gly Val Leu Gly Cys Lys Ala Ile Val Lys Glu Thr Asn Leu Ile Cys
500 505 510
Lys Ala Leu Tyr Leu Asp Tyr Val Gln His Lys Cys Gly Asn Leu His
515 520 525
Gln Arg Glu Leu Leu Gly Val Ser Asp Val Trp His Lys Gln Leu Leu
530 535 540
Leu Asn Arg Gly Val Tyr Lys Pro Leu Leu Glu Asn Ile Asp Tyr Phe
545 550 555 560
Asn Met Arg Arg Ala Lys Phe Ser Leu Glu Thr Phe Thr Val Cys Ala
565 570 575
Asp Gly Phe Met Pro Phe Leu Leu Asp Asp Leu Val Pro Arg Ala Tyr
580 585 590
Tyr Leu Ala Val Ser Gly Gln Ala Phe Cys Asp Tyr Ala Asp Lys Leu
595 600 605
Cys His Ala Val Val Ser Lys Ser Lys Glu Leu Leu Asp Val Ser Leu
610 615 620
Asp Ser Leu Gly Ala Ala Ile His Tyr Leu Asn Ser Lys Ile Val Asp
625 630 635 640
Leu Ala Gln His Phe Ser Asp Phe Gly Thr Ser Phe Val Ser Lys Ile
645 650 655
Val His Phe Phe Lys Thr Phe Thr Thr Ser Thr Ala Leu Ala Phe Ala
660 665 670
Trp Val Leu Phe His Val Leu His Gly Ala Tyr Ile Val Val Glu Ser
675 680 685
Asp Ile Tyr Phe Val Lys Asn Ile Pro Arg Tyr Ala Ser Ala Val Ala
690 695 700
Gln Ala Phe Gln Ser Val Ala Lys Val Val Leu Asp Ser Leu Arg Val
705 710 715 720
Thr Phe Ile Asp Gly Leu Ser Cys Phe Lys Ile Gly Arg Arg Arg Ile
725 730 735
Cys Leu Ser Gly Arg Lys Ile Tyr Glu Val Glu Arg Gly Leu Leu His
740 745 750
Ser Ser Gln Leu Pro Leu Asp Val Tyr Asp Leu Thr Met Pro Ser Gln
755 760 765
Val Gln Lys Ala Lys Gln Lys Pro Ile Tyr Leu Lys Gly Ser Gly Ser
770 775 780
Asp Phe Ser Leu Ala Asp Ser Val Val Glu Val Val Thr Thr Ser Leu
785 790 795 800
Thr Pro Cys Gly Tyr Ser Glu Pro Pro Lys Val Ala Asp Lys Ile Cys
805 810 815
Ile Val Asp Asn Val Tyr Met Ala Lys Ala Gly Asp Lys Tyr Tyr Pro
820 825 830
Val Val Val Asp Asp His Val Gly Leu Leu Asp Gln Ala Trp Arg Val
835 840 845
Pro Cys Ala Gly Arg Arg Val Thr Phe Lys Glu Gln Pro Thr Val Lys
850 855 860
Glu Ile Ile Ser Met Pro Lys Ile Ile Lys Val Phe Tyr Glu Leu Asp
865 870 875 880
Asn Asp Phe Asn Thr Ile Leu Asn Thr Ala Cys Gly Val Phe Glu Val
885 890 895
Asp Asp Thr Val Asp Met Glu Glu Phe Tyr Ala Val Val Ile Asp Ala
900 905 910
Ile Glu Glu Lys Leu Ser Pro Cys Lys Glu Leu Glu Gly Val Gly Ala
915 920 925
Lys Val Ser Ala Phe Leu Gln Lys Leu Glu Asp Asn Pro Leu Phe Leu
930 935 940
Phe Asp Glu Ala Gly Glu Glu Val Leu Ala Pro Lys Leu Tyr Cys Ala
945 950 955 960
Phe Thr Ala Pro Glu Asp Asp Asp Phe Leu Glu Glu Ser Asp Val Glu
965 970 975
Glu Asp Asp Val Glu Gly Glu Glu Thr Asp Leu Thr Val Thr Ser Ala
980 985 990
Gly Gln Pro Cys Val Ala Ser Glu Gln Glu Glu Ser Ser Glu Val Leu
995 1000 1005
Glu Asp Thr Leu Asp Asp Gly Pro Ser Val Glu Thr Ser Asp Ser
1010 1015 1020
Gln Val Glu Glu Asp Val Glu Met Ser Asp Phe Val Asp Leu Glu
1025 1030 1035
Ser Val Ile Gln Asp Tyr Glu Asn Val Cys Phe Glu Phe Tyr Thr
1040 1045 1050
Thr Glu Pro Glu Phe Val Lys Val Leu Gly Leu Tyr Val Pro Lys
1055 1060 1065
Ala Thr Arg Asn Asn Cys Trp Leu Arg Ser Val Leu Ala Val Met
1070 1075 1080
Gln Lys Leu Pro Cys Gln Phe Lys Asp Lys Asn Leu Gln Asp Leu
1085 1090 1095
Trp Val Leu Tyr Lys Gln Gln Tyr Ser Gln Leu Phe Val Asp Thr
1100 1105 1110
Leu Val Asn Lys Ile Pro Ala Asn Ile Val Leu Pro Gln Gly Gly
1115 1120 1125
Tyr Val Ala Asp Phe Ala Tyr Trp Phe Leu Thr Leu Cys Asp Trp
1130 1135 1140
Gln Cys Val Ala Tyr Trp Lys Cys Ile Lys Cys Asp Leu Ala Leu
1145 1150 1155
Lys Leu Lys Gly Leu Asp Ala Met Phe Phe Tyr Gly Asp Val Val
1160 1165 1170
Ser His Ile Cys Lys Cys Gly Glu Ser Met Val Leu Ile Asp Val
1175 1180 1185
Asp Val Pro Phe Thr Ala His Phe Ala Leu Lys Asp Lys Leu Phe
1190 1195 1200
Cys Ala Phe Ile Thr Lys Arg Ile Val Tyr Lys Ala Ala Cys Val
1205 1210 1215
Val Asp Val Asn Asp Ser His Ser Met Ala Val Val Asp Gly Lys
1220 1225 1230
Gln Ile Asp Asp His Arg Ile Thr Ser Ile Thr Ser Asp Lys Phe
1235 1240 1245
Asp Phe Ile Ile Gly His Gly Met Ser Phe Ser Met Thr Thr Phe
1250 1255 1260
Glu Ile Ala Gln Leu Tyr Gly Ser Cys Ile Thr Pro Asn Val Cys
1265 1270 1275
Phe Val Lys Gly Asp Ile Ile Lys Val Ser Lys Leu Val Lys Ala
1280 1285 1290
Glu Val Val Val Asn Pro Ala Asn Gly His Met Ala His Gly Gly
1295 1300 1305
Gly Val Ala Lys Ala Ile Ala Val Ala Ala Gly Gln Gln Phe Val
1310 1315 1320
Lys Glu Thr Thr Asp Met Val Lys Ser Lys Gly Val Cys Ala Thr
1325 1330 1335
Gly Asp Cys Tyr Val Ser Thr Gly Gly Lys Leu Cys Lys Thr Val
1340 1345 1350
Leu Asn Val Val Gly Pro Asp Ala Arg Thr Gln Gly Lys Gln Ser
1355 1360 1365
Tyr Val Leu Leu Glu Arg Val Tyr Lys His Leu Asn Asn Tyr Asp
1370 1375 1380
Cys Val Val Thr Thr Leu Ile Ser Ala Gly Ile Phe Ser Val Pro
1385 1390 1395
Ser Asp Val Ser Leu Thr Tyr Leu Leu Gly Thr Ala Lys Lys Gln
1400 1405 1410
Val Val Leu Val Ser Asn Asn Gln Glu Asp Phe Asp Leu Ile Ser
1415 1420 1425
Lys Cys Gln Ile Thr Ala Val Glu Gly Thr Lys Lys Leu Ala Ala
1430 1435 1440
Arg Leu Ser Phe Asn Val Gly Arg Ser Ile Val Tyr Glu Thr Asp
1445 1450 1455
Ala Asn Lys Leu Ile Leu Ile Asn Asp Val Ala Phe Val Ser Thr
1460 1465 1470
Phe Asn Val Leu Gln Asp Val Leu Ser Leu Arg His Asp Ile Ala
1475 1480 1485
Leu Asp Asp Asp Ala Arg Thr Phe Val Gln Ser Asn Val Asp Val
1490 1495 1500
Val Pro Glu Gly Trp Arg Val Val Asn Lys Phe Tyr Gln Ile Asn
1505 1510 1515
Gly Val Arg Thr Val Lys Tyr Phe Glu Cys Thr Gly Gly Ile Asp
1520 1525 1530
Ile Cys Ser Gln Asp Lys Val Phe Gly Tyr Val Gln Gln Gly Ile
1535 1540 1545
Phe Asn Lys Ala Thr Val Ala Gln Ile Lys Ala Leu Phe Leu Asp
1550 1555 1560
Lys Val Asp Ile Leu Leu Thr Val Asp Gly Val Asn Phe Thr Asn
1565 1570 1575
Arg Phe Val Pro Val Gly Glu Ser Phe Gly Lys Ser Leu Gly Asn
1580 1585 1590
Val Phe Cys Asp Gly Val Asn Val Thr Lys His Lys Cys Asp Ile
1595 1600 1605
Asn Tyr Lys Gly Lys Val Phe Phe Gln Phe Asp Asn Leu Ser Ser
1610 1615 1620
Glu Asp Leu Lys Ala Val Arg Ser Ser Phe Asn Phe Asp Gln Lys
1625 1630 1635
Glu Leu Leu Ala Tyr Tyr Asn Met Leu Val Asn Cys Phe Lys Trp
1640 1645 1650
Gln Val Val Val Asn Gly Lys Tyr Phe Thr Phe Lys Gln Ala Asn
1655 1660 1665
Asn Asn Cys Phe Val Asn Val Ser Cys Leu Met Leu Gln Ser Leu
1670 1675 1680
His Leu Thr Phe Lys Ile Val Gln Trp Gln Glu Ala Trp Leu Glu
1685 1690 1695
Phe Arg Ser Gly Arg Pro Ala Arg Phe Val Ala Leu Val Leu Ala
1700 1705 1710
Lys Gly Gly Phe Lys Phe Gly Asp Pro Ala Asp Ser Arg Asp Phe
1715 1720 1725
Leu Arg Val Val Phe Ser Gln Val Asp Leu Thr Gly Ala Ile Cys
1730 1735 1740
Asp Phe Glu Ile Ala Cys Lys Cys Gly Val Lys Gln Glu Gln Arg
1745 1750 1755
Thr Gly Leu Asp Ala Val Met His Phe Gly Thr Leu Ser Arg Glu
1760 1765 1770
Asp Leu Glu Ile Gly Tyr Thr Val Asp Cys Ser Cys Gly Lys Lys
1775 1780 1785
Leu Ile His Cys Val Arg Phe Asp Val Pro Phe Leu Ile Cys Ser
1790 1795 1800
Asn Thr Pro Ala Ser Val Lys Leu Pro Lys Gly Val Gly Ser Ala
1805 1810 1815
Asn Ile Phe Ile Gly Asp Lys Val Gly His Tyr Val His Val Lys
1820 1825 1830
Cys Glu Gln Ser Tyr Gln Leu Tyr Asp Ala Ser Asn Val Lys Lys
1835 1840 1845
Val Thr Asp Val Thr Gly Lys Leu Ser Asp Cys Leu Tyr Leu Lys
1850 1855 1860
Asn Leu Lys Gln Thr Phe Lys Ser Val Leu Thr Thr Tyr Tyr Leu
1865 1870 1875
Asp Asp Val Lys Lys Ile Glu Tyr Lys Pro Asp Leu Ser Gln Tyr
1880 1885 1890
Tyr Cys Asp Gly Gly Lys Tyr Tyr Thr Gln Arg Ile Ile Lys Ala
1895 1900 1905
Gln Phe Lys Thr Phe Glu Lys Val Asp Gly Val Tyr Thr Asn Phe
1910 1915 1920
Lys Leu Ile Gly His Thr Val Cys Asp Ser Leu Asn Ala Lys Leu
1925 1930 1935
Gly Phe Asp Ser Ser Lys Glu Phe Val Glu Tyr Lys Ile Thr Glu
1940 1945 1950
Trp Pro Thr Ala Thr Gly Asp Val Val Leu Ala Thr Asp Asp Leu
1955 1960 1965
Tyr Val Lys Arg Tyr Glu Arg Gly Cys Ile Thr Phe Gly Lys Pro
1970 1975 1980
Val Ile Trp Leu Ser His Glu Lys Ala Ser Leu Asn Ser Leu Thr
1985 1990 1995
Tyr Phe Asn Arg Pro Ser Leu Val Asp Asp Asn Lys Phe Asp Val
2000 2005 2010
Leu Lys Val Asp Asp Val Asp Asp Gly Gly Asp Ser Ser Glu Ser
2015 2020 2025
Gly Ala Lys Glu Thr Lys Glu Ile Asn Ile Ile Lys Leu Ser Gly
2030 2035 2040
Val Lys Lys Pro Phe Lys Val Glu Asp Ser Val Ile Val Asn Asp
2045 2050 2055
Asp Thr Ser Glu Thr Lys Tyr Val Lys Ser Leu Ser Ile Val Asp
2060 2065 2070
Val Tyr Asp Met Trp Leu Thr Gly Cys Lys Tyr Val Val Arg Thr
2075 2080 2085
Ala Asn Ala Leu Ser Arg Ala Val Asn Val Pro Thr Ile Arg Lys
2090 2095 2100
Phe Ile Lys Phe Gly Met Thr Leu Val Ser Ile Pro Ile Asp Leu
2105 2110 2115
Leu Asn Leu Arg Glu Ile Lys Pro Ala Val Asn Val Val Lys Ala
2120 2125 2130
Val Arg Asn Lys Ile Ser Val Cys Phe Asn Phe Ile Lys Trp Leu
2135 2140 2145
Phe Val Leu Leu Phe Gly Trp Ile Lys Ile Ser Ala Asp Asn Lys
2150 2155 2160
Val Ile Tyr Thr Thr Glu Ile Ala Ser Lys Leu Thr Cys Lys Leu
2165 2170 2175
Val Ala Leu Ala Phe Lys Asn Ala Phe Leu Thr Phe Lys Trp Ser
2180 2185 2190
Met Val Ala Arg Gly Ala Cys Ile Ile Ala Thr Ile Phe Leu Leu
2195 2200 2205
Trp Phe Asn Phe Ile Tyr Ala Asn Val Ile Phe Ser Asp Phe Tyr
2210 2215 2220
Leu Pro Lys Ile Gly Phe Leu Pro Thr Phe Val Gly Lys Ile Ala
2225 2230 2235
Gln Trp Ile Lys Asn Thr Phe Ser Leu Val Thr Ile Cys Asp Leu
2240 2245 2250
Tyr Ser Met Gln Asp Val Gly Phe Lys Asn Gln Tyr Cys Asn Gly
2255 2260 2265
Ser Ile Ala Cys Gln Phe Cys Leu Ala Gly Phe Asp Met Leu Asp
2270 2275 2280
Asn Tyr Lys Ala Ile Asp Val Val Gln Tyr Glu Ala Asp Arg Arg
2285 2290 2295
Ala Phe Val Asp Tyr Thr Gly Val Leu Lys Ile Val Ile Glu Leu
2300 2305 2310
Ile Val Ser Tyr Ala Leu Tyr Thr Ala Trp Phe Tyr Pro Leu Phe
2315 2320 2325
Ala Leu Ile Ser Ile Gln Ile Leu Thr Thr Trp Leu Pro Glu Leu
2330 2335 2340
Phe Met Leu Ser Thr Leu His Trp Ser Phe Arg Leu Leu Val Ala
2345 2350 2355
Leu Ala Asn Met Leu Pro Ala His Val Phe Met Arg Phe Tyr Ile
2360 2365 2370
Ile Ile Ala Ser Phe Ile Lys Leu Phe Ser Leu Phe Arg His Val
2375 2380 2385
Ala Tyr Gly Cys Ser Lys Ser Gly Cys Leu Phe Cys Tyr Lys Arg
2390 2395 2400
Asn Arg Ser Leu Arg Val Lys Cys Ser Thr Ile Val Gly Gly Met
2405 2410 2415
Ile Arg Tyr Tyr Asp Val Met Ala Asn Gly Gly Thr Gly Phe Cys
2420 2425 2430
Ser Lys His Gln Trp Asn Cys Ile Asp Cys Asp Ser Tyr Lys Pro
2435 2440 2445
Gly Asn Thr Phe Ile Thr Val Glu Ala Ala Leu Asp Leu Ser Lys
2450 2455 2460
Glu Leu Lys Arg Pro Ile Gln Pro Thr Asp Val Ala Tyr His Thr
2465 2470 2475
Val Thr Asp Val Lys Gln Val Gly Cys Ser Met Arg Leu Phe Tyr
2480 2485 2490
Asp Arg Asp Gly Gln Arg Thr Tyr Asp Asp Val Asn Ala Ser Leu
2495 2500 2505
Phe Val Asp Tyr Ser Asn Leu Leu His Ser Lys Val Lys Ser Val
2510 2515 2520
Pro Asn Met His Val Val Val Val Glu Asn Asp Ala Asp Lys Ala
2525 2530 2535
Asn Phe Leu Asn Ala Ala Val Phe Tyr Ala Gln Ser Leu Phe Arg
2540 2545 2550
Pro Ile Leu Met Val Asp Lys Asn Leu Ile Thr Thr Ala Asn Thr
2555 2560 2565
Gly Thr Ser Val Thr Glu Thr Met Phe Asp Val Tyr Val Asp Thr
2570 2575 2580
Phe Leu Ser Met Phe Asp Val Asp Lys Lys Ser Leu Asn Ala Leu
2585 2590 2595
Ile Ala Thr Ala His Ser Ser Ile Lys Gln Gly Thr Gln Ile Tyr
2600 2605 2610
Lys Val Leu Asp Thr Phe Leu Ser Cys Ala Arg Lys Ser Cys Ser
2615 2620 2625
Ile Asp Ser Asp Val Asp Thr Lys Cys Leu Ala Asp Ser Val Met
2630 2635 2640
Ser Ala Val Ser Ala Gly Leu Glu Leu Thr Asp Glu Ser Cys Asn
2645 2650 2655
Asn Leu Val Pro Thr Tyr Leu Lys Ser Asp Asn Ile Val Ala Ala
2660 2665 2670
Asp Leu Gly Val Leu Ile Gln Asn Ser Ala Lys His Val Gln Gly
2675 2680 2685
Asn Val Ala Lys Ile Ala Gly Val Ser Cys Ile Trp Ser Val Asp
2690 2695 2700
Ala Phe Asn Gln Phe Ser Ser Asp Phe Gln His Lys Leu Lys Lys
2705 2710 2715
Ala Cys Cys Lys Thr Gly Leu Lys Leu Lys Leu Thr Tyr Asn Lys
2720 2725 2730
Gln Met Ala Asn Val Ser Val Leu Thr Thr Pro Phe Ser Leu Lys
2735 2740 2745
Gly Gly Ala Val Phe Ser Tyr Phe Val Tyr Val Cys Phe Val Leu
2750 2755 2760
Ser Leu Val Cys Phe Ile Gly Leu Trp Cys Leu Met Pro Thr Tyr
2765 2770 2775
Thr Val His Lys Ser Asp Phe Gln Leu Pro Val Tyr Ala Ser Tyr
2780 2785 2790
Lys Val Leu Asp Asn Gly Val Ile Arg Asp Val Ser Val Glu Asp
2795 2800 2805
Val Cys Phe Ala Asn Lys Phe Glu Gln Phe Asp Gln Trp Tyr Glu
2810 2815 2820
Ser Thr Phe Gly Leu Ser Tyr Tyr Ser Asn Ser Met Ala Cys Pro
2825 2830 2835
Ile Val Val Ala Val Ile Asp Gln Asp Phe Gly Ser Thr Val Phe
2840 2845 2850
Asn Val Pro Thr Lys Val Leu Arg Tyr Gly Tyr His Val Leu His
2855 2860 2865
Phe Ile Thr His Ala Leu Ser Ala Asp Gly Val Gln Cys Tyr Thr
2870 2875 2880
Pro His Ser Gln Ile Ser Tyr Ser Asn Phe Tyr Ala Ser Gly Cys
2885 2890 2895
Val Leu Ser Ser Ala Cys Thr Met Phe Thr Met Ala Asp Gly Ser
2900 2905 2910
Pro Gln Pro Tyr Cys Tyr Thr Glu Gly Leu Met Gln Asn Ala Ser
2915 2920 2925
Leu Tyr Ser Ser Leu Val Pro His Val Arg Tyr Asn Leu Ala Asn
2930 2935 2940
Ala Lys Gly Phe Ile Arg Phe Pro Glu Val Leu Arg Glu Gly Leu
2945 2950 2955
Val Arg Ile Val Arg Thr Arg Ser Met Ser Tyr Cys Arg Val Gly
2960 2965 2970
Leu Cys Glu Glu Ala Asp Glu Gly Ile Cys Phe Asn Phe Asn Gly
2975 2980 2985
Ser Trp Val Leu Asn Asn Asp Tyr Tyr Arg Ser Leu Pro Gly Thr
2990 2995 3000
Phe Cys Gly Arg Asp Val Phe Asp Leu Ile Tyr Gln Leu Phe Lys
3005 3010 3015
Gly Leu Ala Gln Pro Val Asp Phe Leu Ala Leu Thr Ala Ser Ser
3020 3025 3030
Ile Ala Gly Ala Ile Leu Ala Val Ile Val Val Leu Val Phe Tyr
3035 3040 3045
Tyr Leu Ile Lys Leu Lys Arg Ala Phe Gly Asp Tyr Thr Ser Val
3050 3055 3060
Val Phe Val Asn Val Ile Val Trp Cys Val Asn Phe Met Met Leu
3065 3070 3075
Phe Val Phe Gln Val Tyr Pro Ile Leu Ser Cys Val Tyr Ala Ile
3080 3085 3090
Cys Tyr Phe Tyr Ala Thr Leu Tyr Phe Pro Ser Glu Ile Ser Val
3095 3100 3105
Ile Met His Leu Gln Trp Leu Val Met Tyr Gly Thr Ile Met Pro
3110 3115 3120
Leu Trp Phe Cys Leu Leu Tyr Ile Ala Val Val Val Ser Asn His
3125 3130 3135
Ala Phe Trp Val Phe Ser Tyr Cys Arg Lys Leu Gly Thr Ser Val
3140 3145 3150
Arg Ser Asp Gly Thr Phe Glu Glu Met Ala Leu Thr Thr Phe Met
3155 3160 3165
Ile Thr Lys Asp Ser Tyr Cys Lys Leu Lys Asn Ser Leu Ser Asp
3170 3175 3180
Val Ala Phe Asn Arg Tyr Leu Ser Leu Tyr Asn Lys Tyr Arg Tyr
3185 3190 3195
Tyr Ser Gly Lys Met Asp Thr Ala Ala Tyr Arg Glu Ala Ala Cys
3200 3205 3210
Ser Gln Leu Ala Lys Ala Met Asp Thr Phe Thr Asn Asn Asn Gly
3215 3220 3225
Ser Asp Val Leu Tyr Gln Pro Pro Thr Ala Ser Val Ser Thr Ser
3230 3235 3240
Phe Leu Gln Ser Gly Ile Val Lys Met Val Asn Pro Thr Ser Lys
3245 3250 3255
Val Glu Pro Cys Val Val Ser Val Thr Tyr Gly Asn Met Thr Leu
3260 3265 3270
Asn Gly Leu Trp Leu Asp Asp Lys Val Tyr Cys Pro Arg His Val
3275 3280 3285
Ile Cys Ser Ala Ser Asp Met Thr Asn Pro Asp Tyr Thr Asn Leu
3290 3295 3300
Leu Cys Arg Val Thr Ser Ser Asp Phe Thr Val Leu Phe Asp Arg
3305 3310 3315
Leu Ser Leu Thr Val Met Ser Tyr Gln Met Arg Gly Cys Met Leu
3320 3325 3330
Val Leu Thr Val Thr Leu Gln Asn Ser Arg Thr Pro Lys Tyr Thr
3335 3340 3345
Phe Gly Val Val Lys Pro Gly Glu Thr Phe Thr Val Leu Ala Ala
3350 3355 3360
Tyr Asn Gly Lys Pro Gln Gly Ala Phe His Val Thr Met Arg Ser
3365 3370 3375
Ser Tyr Thr Ile Lys Gly Ser Phe Leu Cys Gly Ser Cys Gly Ser
3380 3385 3390
Val Gly Tyr Val Ile Met Gly Asp Cys Val Lys Phe Val Tyr Met
3395 3400 3405
His Gln Leu Glu Leu Ser Thr Gly Cys His Thr Gly Thr Asp Phe
3410 3415 3420
Asn Gly Asp Phe Tyr Gly Pro Tyr Lys Asp Ala Gln Val Val Gln
3425 3430 3435
Leu Leu Ile Gln Asp Tyr Ile Gln Ser Val Asn Phe Val Ala Trp
3440 3445 3450
Leu Tyr Ala Ala Ile Leu Asn Asn Cys Asn Trp Phe Val Gln Ser
3455 3460 3465
Asp Lys Cys Ser Val Glu Asp Phe Asn Val Trp Ala Leu Ser Asn
3470 3475 3480
Gly Phe Ser Gln Val Lys Ser Asp Leu Val Ile Asp Ala Leu Ala
3485 3490 3495
Ser Met Thr Gly Val Ser Leu Glu Thr Leu Leu Ala Ala Ile Lys
3500 3505 3510
Arg Leu Lys Asn Gly Phe Gln Gly Arg Gln Ile Met Gly Ser Cys
3515 3520 3525
Ser Phe Glu Asp Glu Leu Thr Pro Ser Asp Val Tyr Gln Gln Leu
3530 3535 3540
Ala Gly Ile Lys Leu Gln Ser Lys Arg Thr Arg Leu Phe Lys Gly
3545 3550 3555
Thr Val Cys Trp Ile Met Ala Ser Thr Phe Leu Phe Ser Cys Ile
3560 3565 3570
Ile Thr Ala Phe Val Lys Trp Thr Met Phe Met Tyr Val Thr Thr
3575 3580 3585
Asn Met Phe Ser Ile Thr Phe Cys Ala Leu Cys Val Ile Ser Leu
3590 3595 3600
Ala Met Leu Leu Val Lys His Lys His Leu Tyr Leu Thr Met Tyr
3605 3610 3615
Ile Thr Pro Val Leu Phe Thr Leu Leu Tyr Asn Asn Tyr Leu Val
3620 3625 3630
Val Tyr Lys His Thr Phe Arg Gly Tyr Val Tyr Ala Trp Leu Ser
3635 3640 3645
Tyr Tyr Val Pro Ser Val Glu Tyr Thr Tyr Thr Asp Glu Val Ile
3650 3655 3660
Tyr Gly Met Leu Leu Leu Val Gly Met Val Phe Val Thr Leu Arg
3665 3670 3675
Ser Ile Asn His Asp Leu Phe Ser Phe Ile Met Phe Val Gly Arg
3680 3685 3690
Leu Ile Ser Val Phe Ser Leu Trp Tyr Lys Gly Ser Asn Leu Glu
3695 3700 3705
Glu Glu Ile Leu Leu Met Leu Ala Ser Leu Phe Gly Thr Tyr Thr
3710 3715 3720
Trp Thr Thr Val Leu Ser Met Ala Val Ala Lys Val Ile Ala Lys
3725 3730 3735
Trp Val Ala Val Asn Val Leu Tyr Phe Thr Asp Ile Pro Gln Ile
3740 3745 3750
Lys Ile Val Leu Leu Cys Tyr Leu Phe Ile Gly Tyr Ile Ile Ser
3755 3760 3765
Cys Tyr Trp Gly Leu Phe Ser Leu Met Asn Ser Leu Phe Arg Met
3770 3775 3780
Pro Leu Gly Val Tyr Asn Tyr Lys Ile Ser Val Gln Glu Leu Arg
3785 3790 3795
Tyr Met Asn Ala Asn Gly Leu Arg Pro Pro Lys Asn Ser Phe Glu
3800 3805 3810
Ala Leu Met Leu Asn Phe Lys Leu Leu Gly Ile Gly Gly Val Pro
3815 3820 3825
Ile Ile Glu Val Ser Gln Phe Gln Ser Lys Leu Thr Asp Val Lys
3830 3835 3840
Cys Ala Asn Val Val Leu Leu Asn Cys Leu Gln His Leu His Val
3845 3850 3855
Ala Ser Asn Ser Lys Leu Trp His Tyr Cys Ser Thr Leu His Asn
3860 3865 3870
Glu Ile Leu Ala Thr Ser Asp Leu Ser Val Ala Phe Glu Lys Leu
3875 3880 3885
Ala Gln Leu Leu Ile Val Leu Phe Ala Asn Pro Ala Ala Val Asp
3890 3895 3900
Ser Lys Cys Leu Thr Ser Ile Glu Glu Val Cys Asp Asp Tyr Ala
3905 3910 3915
Lys Asp Asn Thr Val Leu Gln Ala Leu Gln Ser Glu Phe Val Asn
3920 3925 3930
Met Ala Ser Phe Val Glu Tyr Glu Val Ala Lys Lys Asn Leu Asp
3935 3940 3945
Glu Ala Arg Phe Ser Gly Ser Ala Asn Gln Gln Gln Leu Lys Gln
3950 3955 3960
Leu Glu Lys Ala Cys Asn Ile Ala Lys Ser Ala Tyr Glu Arg Asp
3965 3970 3975
Arg Ala Val Ala Lys Lys Leu Glu Arg Met Ala Asp Leu Ala Leu
3980 3985 3990
Thr Asn Met Tyr Lys Glu Ala Arg Ile Asn Asp Lys Lys Ser Lys
3995 4000 4005
Val Val Ser Ala Leu Gln Thr Met Leu Phe Ser Met Val Arg Lys
4010 4015 4020
Leu Asp Asn Gln Ala Leu Asn Ser Ile Leu Asp Asn Ala Val Lys
4025 4030 4035
Gly Cys Val Pro Leu Asn Ala Ile Pro Ser Leu Ala Ala Asn Thr
4040 4045 4050
Leu Asn Ile Ile Val Pro Asp Lys Ser Val Tyr Asp Gln Val Val
4055 4060 4065
Asp Asn Val Tyr Val Thr Tyr Ala Gly Asn Val Trp Gln Ile Gln
4070 4075 4080
Thr Ile Gln Asp Ser Asp Gly Thr Asn Lys Gln Leu Asn Glu Ile
4085 4090 4095
Ser Asp Asp Cys Asn Trp Pro Leu Val Ile Ile Ala Asn Arg Tyr
4100 4105 4110
Asn Glu Val Ser Ala Thr Val Leu Gln Asn Asn Glu Leu Met Pro
4115 4120 4125
Ala Lys Leu Lys Ile Gln Val Val Asn Ser Gly Pro Asp Gln Thr
4130 4135 4140
Cys Asn Thr Pro Thr Gln Cys Tyr Tyr Asn Asn Ser Asn Asn Gly
4145 4150 4155
Lys Ile Val Tyr Ala Ile Leu Ser Asp Val Asp Gly Leu Lys Tyr
4160 4165 4170
Thr Lys Ile Leu Lys Asp Asp Gly Asn Phe Val Val Leu Glu Leu
4175 4180 4185
Asp Pro Pro Cys Lys Phe Thr Val Gln Asp Ala Lys Gly Leu Lys
4190 4195 4200
Ile Lys Tyr Leu Tyr Phe Val Lys Gly Cys Asn Thr Leu Ala Arg
4205 4210 4215
Gly Trp Val Val Gly Thr Ile Ser Ser Thr Val Arg Leu Gln Ala
4220 4225 4230
Gly Thr Ala Thr Glu Tyr Ala Ser Asn Ser Ser Ile Leu Ser Leu
4235 4240 4245
Cys Ala Phe Ser Val Asp Pro Lys Lys Thr Tyr Leu Asp Phe Ile
4250 4255 4260
Gln Gln Gly Gly Thr Pro Ile Ala Asn Cys Val Lys Met Leu Cys
4265 4270 4275
Asp His Ala Gly Thr Gly Met Ala Ile Thr Val Lys Pro Asp Ala
4280 4285 4290
Thr Thr Ser Gln Asp Ser Tyr Gly Gly Ala Ser Val Cys Ile Tyr
4295 4300 4305
Cys Arg Ala Arg Val Glu His Pro Asp Val Asp Gly Leu Cys Lys
4310 4315 4320
Leu Arg Gly Lys Phe Val Gln Val Pro Val Gly Ile Lys Asp Pro
4325 4330 4335
Val Ser Tyr Val Leu Thr His Asp Val Cys Arg Val Cys Gly Phe
4340 4345 4350
Trp Arg Asp Gly Ser Cys Ser Cys Val Ser Thr Asp Thr Thr Val
4355 4360 4365
Gln Ser Lys Asp Thr
4370
<210> 17
<211> 4102
<212> PRT
<213> porcine epidemic diarrhea virus
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 17
Met Ala Ser Asn His Val Thr Leu Ala Phe Ala Asn Asp Ala Glu Ile
1 5 10 15
Ser Ala Phe Gly Phe Cys Thr Ala Ser Glu Ala Val Ser Tyr Tyr Ser
20 25 30
Glu Ala Ala Ala Ser Gly Phe Met Gln Cys Arg Phe Val Ser Leu Asp
35 40 45
Leu Ala Asp Thr Val Glu Gly Leu Leu Pro Glu Asp Tyr Val Met Val
50 55 60
Val Ile Gly Thr Thr Lys Leu Ser Ala Tyr Val Asp Thr Phe Gly Ser
65 70 75 80
Arg Pro Arg Asn Ile Cys Gly Trp Leu Leu Phe Ser Asn Cys Asn Tyr
85 90 95
Phe Leu Glu Glu Leu Glu Leu Thr Phe Gly Arg Arg Gly Gly Asn Ile
100 105 110
Val Pro Val Asp Gln Tyr Met Cys Gly Ala Asp Gly Lys Pro Val Leu
115 120 125
Gln Glu Ser Glu Trp Glu Tyr Thr Asp Phe Phe Ala Asp Ser Glu Asp
130 135 140
Gly Gln Leu Asn Ile Ala Gly Ile Thr Tyr Val Lys Ala Trp Ile Val
145 150 155 160
Glu Arg Ser Asp Val Ser Tyr Ala Ser Gln Asn Leu Thr Ser Ile Lys
165 170 175
Ser Ile Thr Tyr Cys Ser Thr Tyr Glu His Thr Phe Leu Asp Gly Thr
180 185 190
Ala Met Lys Val Ala Arg Thr Pro Lys Ile Lys Lys Asn Val Val Leu
195 200 205
Ser Glu Pro Leu Ala Thr Ile Tyr Arg Glu Ile Gly Ser Pro Phe Val
210 215 220
Asp Asn Gly Ser Asp Ala Arg Ser Ile Ile Arg Arg Pro Val Phe Leu
225 230 235 240
His Ala Phe Val Lys Cys Lys Cys Gly Ser Tyr His Trp Thr Val Gly
245 250 255
Asp Trp Thr Ser Tyr Val Ser Thr Cys Cys Gly Phe Lys Cys Lys Pro
260 265 270
Val Leu Val Ala Ser Cys Ser Ala Met Pro Gly Ser Val Val Val Thr
275 280 285
Arg Ala Gly Ala Gly Thr Gly Val Lys Tyr Tyr Asn Asn Met Phe Leu
290 295 300
Arg His Val Ala Asp Ile Asp Gly Leu Ala Phe Trp Arg Ile Leu Lys
305 310 315 320
Val Gln Ser Lys Asp Asp Leu Ala Cys Ser Gly Lys Phe Leu Glu His
325 330 335
His Glu Glu Gly Phe Thr Asp Pro Cys Tyr Phe Leu Asn Asp Ser Ser
340 345 350
Leu Ala Thr Lys Leu Lys Phe Asp Ile Leu Ser Gly Lys Phe Ser Asp
355 360 365
Glu Val Lys Gln Ala Ile Ile Ala Gly His Val Val Val Gly Ser Ala
370 375 380
Leu Val Asp Ile Val Asp Asp Ala Leu Gly Gln Pro Trp Phe Ile Arg
385 390 395 400
Lys Leu Gly Asp Leu Ala Ser Ala Pro Trp Glu Gln Leu Lys Ala Val
405 410 415
Val Arg Gly Leu Gly Leu Leu Ser Asp Glu Val Val Leu Phe Gly Lys
420 425 430
Arg Leu Ser Cys Ala Thr Leu Ser Ile Val Asn Gly Val Phe Glu Phe
435 440 445
Leu Ala Asp Val Pro Glu Lys Leu Ala Ala Ala Val Thr Val Phe Val
450 455 460
Asn Phe Leu Asn Glu Phe Phe Glu Ser Ala Cys Asp Cys Leu Lys Val
465 470 475 480
Gly Gly Lys Thr Phe Asn Lys Val Gly Ser Tyr Val Leu Phe Asp Asn
485 490 495
Ala Leu Val Lys Leu Val Lys Ala Lys Ala Arg Gly Pro Arg Gln Ala
500 505 510
Gly Ile Cys Glu Val Arg Tyr Thr Ser Leu Val Val Gly Ser Thr Thr
515 520 525
Lys Val Val Ser Lys Arg Val Glu Asn Ala Asn Val Asn Leu Val Val
530 535 540
Val Asp Glu Asp Val Thr Leu Asn Thr Thr Gly Arg Thr Val Val Val
545 550 555 560
Asp Gly Leu Ala Phe Phe Glu Ser Asp Gly Phe Tyr Arg His Leu Ala
565 570 575
Asp Ala Asp Val Val Ile Glu His Pro Val Tyr Lys Ser Ala Cys Glu
580 585 590
Leu Lys Pro Val Phe Glu Cys Asp Pro Ile Pro Asp Phe Pro Leu Pro
595 600 605
Val Ala Ala Ser Val Ala Glu Leu Cys Val Gln Thr Asp Leu Leu Leu
610 615 620
Lys Asn Tyr Asn Thr Pro Tyr Lys Thr Tyr Ser Cys Val Val Arg Gly
625 630 635 640
Asp Lys Cys Cys Ile Thr Cys Thr Leu Gln Phe Lys Ala Pro Ser Tyr
645 650 655
Val Glu Asp Ala Val Asn Phe Val Asp Leu Cys Thr Lys Asn Ile Gly
660 665 670
Thr Ala Gly Phe His Glu Phe Tyr Ile Thr Ala His Glu Gln Gln Asp
675 680 685
Leu Gln Gly Phe Leu Thr Thr Cys Cys Thr Met Ser Gly Phe Glu Cys
690 695 700
Phe Met Pro Thr Ile Pro Gln Cys Pro Ala Val Leu Glu Glu Ile Asp
705 710 715 720
Gly Gly Ser Ile Trp Arg Ser Phe Ile Thr Gly Leu Asn Thr Met Trp
725 730 735
Asp Phe Cys Lys Arg Leu Lys Val Ser Phe Gly Leu Asp Gly Ile Val
740 745 750
Val Thr Val Ala Arg Lys Phe Lys Arg Leu Gly Ala Leu Leu Ala Glu
755 760 765
Met Tyr Asn Thr Tyr Leu Ser Thr Val Val Glu Asn Leu Val Leu Ala
770 775 780
Gly Val Ser Phe Lys Tyr Tyr Ala Thr Ser Val Pro Lys Ile Val Leu
785 790 795 800
Gly Gly Cys Phe His Ser Val Lys Ser Val Phe Ala Ser Val Phe Gln
805 810 815
Ile Pro Val Gln Ala Gly Ile Glu Lys Phe Lys Val Phe Leu Asn Cys
820 825 830
Val His Pro Val Val Pro Arg Val Ile Glu Thr Ser Phe Val Glu Leu
835 840 845
Glu Glu Thr Thr Phe Lys Pro Pro Ala Leu Asn Gly Gly Ile Ala Ile
850 855 860
Val Asp Gly Phe Ala Phe Tyr Tyr Asp Gly Thr Leu Tyr Tyr Pro Thr
865 870 875 880
Asp Gly Asn Ser Val Val Pro Ile Cys Phe Lys Lys Lys Gly Gly Gly
885 890 895
Asp Val Lys Phe Ser Asp Glu Val Ser Val Lys Thr Ile Asp Pro Val
900 905 910
Tyr Lys Val Ser Leu Glu Phe Glu Phe Glu Ser Glu Thr Ile Met Ala
915 920 925
Val Leu Asn Lys Ala Val Gly Asn Arg Ile Lys Val Thr Gly Gly Trp
930 935 940
Asp Asp Val Val Glu Tyr Ile Asn Val Ala Ile Glu Val Leu Lys Asp
945 950 955 960
His Val Glu Val Pro Lys Tyr Tyr Ile Tyr Asp Glu Glu Gly Gly Thr
965 970 975
Asp Pro Asn Leu Pro Val Met Val Ser Gln Trp Pro Leu Asn Asp Asp
980 985 990
Thr Ile Ser Gln Asp Leu Leu Asp Val Glu Val Val Thr Asp Ala Pro
995 1000 1005
Ile Asp Ser Glu Gly Asp Glu Val Asp Ser Ser Ala Pro Glu Lys
1010 1015 1020
Val Ala Asp Val Ala Asn Ser Glu Pro Gly Asp Asp Gly Leu Pro
1025 1030 1035
Val Ala Pro Glu Thr Asn Val Glu Ser Glu Val Glu Glu Val Ala
1040 1045 1050
Ala Thr Leu Ser Phe Ile Lys Asp Thr Pro Ser Thr Val Thr Lys
1055 1060 1065
Asp Pro Phe Ala Phe Asp Phe Val Ser Tyr Gly Gly Leu Lys Val
1070 1075 1080
Leu Arg Gln Ser His Asn Asn Cys Trp Val Thr Ser Thr Leu Val
1085 1090 1095
Gln Leu Gln Leu Leu Gly Ile Val Asp Asp Pro Ala Met Glu Leu
1100 1105 1110
Phe Ser Ala Gly Arg Val Gly Pro Met Val Arg Lys Cys Tyr Glu
1115 1120 1125
Ser Gln Lys Ala Ile Leu Gly Ser Leu Gly Asp Val Ser Ala Cys
1130 1135 1140
Leu Glu Ser Leu Thr Lys Asp Leu His Thr Leu Lys Ile Thr Cys
1145 1150 1155
Ser Val Val Cys Gly Cys Gly Thr Gly Glu Arg Ile Tyr Glu Gly
1160 1165 1170
Cys Ala Phe Arg Met Thr Pro Thr Leu Glu Pro Phe Pro Tyr Gly
1175 1180 1185
Ala Cys Ala Gln Cys Ala Gln Val Leu Met His Thr Phe Lys Ser
1190 1195 1200
Ile Val Gly Thr Gly Ile Phe Cys Arg Asp Thr Thr Ala Leu Ser
1205 1210 1215
Leu Asp Ser Leu Val Val Lys Pro Leu Cys Ala Ala Ala Phe Ile
1220 1225 1230
Gly Lys Asp Ser Gly His Tyr Val Thr Asn Phe Tyr Asp Ala Ala
1235 1240 1245
Met Ala Ile Asp Gly Tyr Gly Arg His Gln Ile Lys Tyr Asp Thr
1250 1255 1260
Leu Asn Thr Ile Cys Val Lys Asp Val Asn Trp Thr Ala Pro Leu
1265 1270 1275
Val Pro Ala Val Asp Ser Val Val Glu Pro Val Val Lys Pro Phe
1280 1285 1290
Tyr Ser Tyr Lys Asn Val Asp Phe Tyr Gln Gly Asp Phe Ser Asp
1295 1300 1305
Leu Val Lys Leu Pro Cys Asp Phe Val Val Asn Ala Ala Asn Glu
1310 1315 1320
Lys Leu Ser His Gly Gly Gly Ile Ala Lys Ala Ile Asp Val Tyr
1325 1330 1335
Thr Lys Gly Met Leu Gln Lys Cys Ser Asn Asp Tyr Ile Lys Ala
1340 1345 1350
His Gly Pro Ile Lys Val Gly Arg Gly Val Met Leu Glu Ala Leu
1355 1360 1365
Gly Leu Lys Val Phe Asn Val Val Gly Pro Arg Lys Gly Lys His
1370 1375 1380
Ala Pro Glu Leu Leu Val Lys Ala Tyr Lys Ser Val Phe Ala Asn
1385 1390 1395
Ser Gly Val Ala Leu Thr Pro Leu Ile Ser Val Gly Ile Phe Ser
1400 1405 1410
Val Pro Leu Glu Glu Ser Leu Ser Ala Phe Leu Ala Cys Val Gly
1415 1420 1425
Asp Arg His Cys Lys Cys Phe Cys Tyr Gly Asp Lys Glu Arg Glu
1430 1435 1440
Ala Ile Ile Lys Tyr Met Asp Gly Leu Val Asp Ala Ile Phe Lys
1445 1450 1455
Glu Ala Leu Val Asp Thr Thr Pro Val Gln Glu Asp Val Gln Gln
1460 1465 1470
Val Ser Gln Lys Pro Val Leu Pro Asn Phe Glu Pro Phe Arg Ile
1475 1480 1485
Glu Gly Ala His Ala Phe Tyr Glu Cys Asn Pro Glu Gly Leu Met
1490 1495 1500
Ser Leu Gly Ala Asp Lys Leu Val Leu Phe Thr Asn Ser Asn Leu
1505 1510 1515
Asp Phe Cys Ser Val Gly Lys Cys Leu Asn Asp Val Thr Ser Gly
1520 1525 1530
Ala Leu Leu Glu Ala Ile Asn Val Phe Lys Lys Ser Asn Lys Thr
1535 1540 1545
Val Pro Ala Gly Asn Cys Val Thr Leu Asp Cys Ala Asn Met Ile
1550 1555 1560
Ser Ile Thr Met Val Val Leu Pro Phe Asp Gly Asp Ala Asn Tyr
1565 1570 1575
Asp Lys Asn Tyr Ala Arg Ala Val Val Lys Val Ser Lys Leu Lys
1580 1585 1590
Gly Lys Leu Val Leu Ala Val Asp Asp Ala Thr Leu Tyr Ser Lys
1595 1600 1605
Leu Ser His Leu Ser Val Leu Gly Phe Val Ser Thr Pro Asp Asp
1610 1615 1620
Val Glu Arg Phe Tyr Ala Asn Lys Ser Val Val Ile Lys Val Thr
1625 1630 1635
Glu Asp Thr Arg Ser Val Lys Ala Val Lys Val Glu Ser Thr Ala
1640 1645 1650
Thr Tyr Gly Gln Gln Ile Gly Pro Cys Leu Val Asn Asp Thr Val
1655 1660 1665
Val Thr Asp Asn Lys Pro Val Val Ala Asp Val Val Ala Lys Val
1670 1675 1680
Val Pro Asn Ala Asn Trp Asp Ser His Tyr Gly Phe Asp Lys Ala
1685 1690 1695
Gly Glu Phe His Met Leu Asp His Thr Gly Phe Thr Phe Pro Ser
1700 1705 1710
Glu Val Val Asn Gly Arg Arg Val Ile Lys Thr Thr Asp Asn Asn
1715 1720 1725
Cys Trp Val Asn Val Thr Cys Leu Gln Leu Gln Phe Ala Arg Phe
1730 1735 1740
Arg Phe Lys Ser Ala Gly Leu Gln Ala Met Trp Glu Ser Tyr Cys
1745 1750 1755
Thr Gly Asp Val Ala Met Phe Val His Trp Leu Tyr Trp Leu Thr
1760 1765 1770
Gly Val Asp Lys Gly Gln Pro Ser Asp Ser Glu Asn Ala Leu Asn
1775 1780 1785
Met Leu Ser Lys Tyr Ile Val Pro Ala Gly Ser Val Thr Ile Glu
1790 1795 1800
Arg Val Thr His Asp Gly Cys Cys Cys Ser Lys Arg Val Val Thr
1805 1810 1815
Ala Pro Val Val Asn Ala Ser Val Leu Lys Leu Gly Val Glu Asp
1820 1825 1830
Gly Leu Cys Pro His Gly Leu Asn Tyr Ile Gly Lys Val Val Val
1835 1840 1845
Val Lys Gly Thr Thr Ile Val Val Asn Val Gly Lys Pro Val Val
1850 1855 1860
Ala Pro Ser His Leu Phe Leu Lys Gly Val Ser Tyr Thr Thr Phe
1865 1870 1875
Leu Asp Asn Gly Asn Gly Val Val Gly His Tyr Thr Val Phe Asp
1880 1885 1890
His Gly Thr Gly Met Val His Asp Gly Asp Ala Phe Val Pro Gly
1895 1900 1905
Asp Leu Asn Val Ser Pro Val Thr Asn Val Val Val Ser Glu Gln
1910 1915 1920
Thr Ala Val Val Ile Lys Asp Pro Val Lys Lys Ala Glu Leu Asp
1925 1930 1935
Ala Thr Lys Leu Leu Asp Thr Met Asn Tyr Ala Ser Glu Arg Phe
1940 1945 1950
Phe Ser Phe Gly Asp Phe Met Ser Arg Asn Leu Ile Thr Val Phe
1955 1960 1965
Leu Tyr Ile Leu Ser Ile Leu Gly Leu Cys Phe Arg Ala Phe Arg
1970 1975 1980
Lys Arg Asp Val Lys Val Leu Ala Gly Val Pro Gln Arg Thr Gly
1985 1990 1995
Ile Ile Leu Arg Lys Ser Met Arg Tyr Asn Ala Lys Ala Leu Gly
2000 2005 2010
Val Phe Phe Lys Leu Lys Leu Tyr Trp Phe Lys Val Leu Gly Lys
2015 2020 2025
Phe Ser Leu Gly Ile Tyr Ala Leu Tyr Ala Leu Leu Phe Met Thr
2030 2035 2040
Ile Arg Phe Thr Pro Ile Gly Ser Pro Val Cys Asp Asp Val Val
2045 2050 2055
Ala Gly Tyr Ala Asn Ser Ser Phe Asp Lys Asn Glu Tyr Cys Asn
2060 2065 2070
Ser Val Ile Cys Lys Val Cys Leu Tyr Gly Tyr Gln Glu Leu Ser
2075 2080 2085
Asp Phe Ser His Thr Gln Val Val Trp Gln His Leu Arg Asp Pro
2090 2095 2100
Leu Ile Gly Asn Val Met Pro Phe Phe Tyr Leu Ala Phe Leu Ala
2105 2110 2115
Ile Phe Gly Gly Val Tyr Val Lys Ala Ile Thr Leu Tyr Phe Ile
2120 2125 2130
Phe Gln Tyr Leu Asn Ser Leu Gly Val Phe Leu Gly Leu Gln Gln
2135 2140 2145
Ser Ile Trp Phe Leu Gln Leu Val Pro Phe Asp Val Phe Gly Asp
2150 2155 2160
Glu Ile Val Val Phe Phe Ile Val Thr Arg Val Leu Met Phe Ile
2165 2170 2175
Lys His Val Cys Leu Gly Cys Asp Lys Ala Ser Cys Val Ala Cys
2180 2185 2190
Ser Lys Ser Ala Arg Leu Lys Arg Val Pro Val Gln Thr Ile Phe
2195 2200 2205
Gln Gly Thr Ser Lys Ser Phe Tyr Val His Ala Asn Gly Gly Ser
2210 2215 2220
Lys Phe Cys Lys Lys His Asn Phe Phe Cys Leu Asn Cys Asp Ser
2225 2230 2235
Tyr Gly Pro Gly Cys Thr Phe Ile Asn Asp Val Ile Ala Thr Glu
2240 2245 2250
Val Gly Asn Val Val Lys Leu Asn Val Gln Pro Thr Gly Pro Ala
2255 2260 2265
Thr Ile Leu Ile Asp Lys Val Glu Phe Ser Asn Gly Phe Tyr Tyr
2270 2275 2280
Leu Tyr Ser Gly Asp Thr Phe Trp Lys Tyr Asn Phe Asp Ile Thr
2285 2290 2295
Asp Ser Lys Tyr Thr Cys Lys Glu Ala Leu Lys Asn Cys Ser Ile
2300 2305 2310
Ile Thr Asp Phe Ile Val Phe Asn Asn Asn Gly Ser Asn Val Asn
2315 2320 2325
Gln Val Lys Asn Ala Cys Val Tyr Phe Ser Gln Met Leu Cys Lys
2330 2335 2340
Pro Val Lys Leu Val Asp Ser Ala Leu Leu Ala Ser Leu Ser Val
2345 2350 2355
Asp Phe Gly Ala Ser Leu His Ser Ala Phe Val Ser Val Leu Ser
2360 2365 2370
Asn Ser Phe Gly Lys Asp Leu Ser Ser Cys Asn Asp Met Gln Asp
2375 2380 2385
Cys Lys Ser Thr Leu Gly Phe Asp Asp Val Pro Leu Asp Thr Phe
2390 2395 2400
Asn Ala Ala Val Ala Glu Ala His Arg Tyr Asp Val Leu Leu Thr
2405 2410 2415
Asp Met Ser Phe Asn Asn Phe Thr Thr Ser Tyr Ala Lys Pro Glu
2420 2425 2430
Glu Lys Phe Pro Val His Asp Ile Ala Thr Cys Met Arg Val Gly
2435 2440 2445
Ala Lys Ile Val Asn His Asn Val Leu Val Lys Asp Ser Ile Pro
2450 2455 2460
Val Val Trp Leu Val Arg Asp Phe Ile Ala Leu Ser Glu Glu Thr
2465 2470 2475
Arg Lys Tyr Ile Ile Arg Thr Thr Lys Val Lys Gly Ile Thr Phe
2480 2485 2490
Met Leu Thr Phe Asn Asp Cys Arg Met His Thr Thr Ile Pro Thr
2495 2500 2505
Val Cys Ile Ala Asn Lys Lys Gly Ala Gly Leu Pro Ser Phe Ser
2510 2515 2520
Lys Val Lys Lys Phe Phe Trp Phe Leu Cys Leu Phe Ile Val Ala
2525 2530 2535
Ala Phe Phe Ala Leu Ser Phe Leu Asp Phe Ser Thr Gln Val Ser
2540 2545 2550
Ser Asp Ser Asp Tyr Asp Phe Lys Tyr Ile Glu Ser Gly Gln Leu
2555 2560 2565
Lys Thr Phe Asp Asn Pro Leu Ser Cys Val His Asn Val Phe Ile
2570 2575 2580
Asn Phe Asp Gln Trp His Asp Ala Lys Phe Gly Phe Thr Pro Val
2585 2590 2595
Asn Asn Pro Ser Cys Pro Ile Val Val Gly Val Ser Asp Glu Ala
2600 2605 2610
Arg Thr Val Pro Gly Ile Pro Ala Gly Val Tyr Leu Ala Gly Lys
2615 2620 2625
Thr Leu Val Phe Ala Ile Asn Thr Ile Phe Gly Thr Ser Gly Leu
2630 2635 2640
Cys Phe Asp Ala Ser Gly Val Ala Asp Lys Gly Ala Cys Ile Phe
2645 2650 2655
Asn Ser Ala Cys Thr Thr Leu Ser Gly Leu Gly Gly Thr Ala Val
2660 2665 2670
Tyr Cys Tyr Lys Asn Gly Leu Val Glu Gly Ala Lys Leu Tyr Ser
2675 2680 2685
Glu Leu Ala Pro His Ser Tyr Tyr Lys Met Val Asp Gly Asn Ala
2690 2695 2700
Val Ser Leu Pro Glu Ile Ile Ser Arg Gly Phe Gly Ile Arg Thr
2705 2710 2715
Ile Arg Thr Lys Ala Met Thr Tyr Cys Arg Val Gly Gln Cys Val
2720 2725 2730
Gln Ser Ala Glu Gly Val Cys Phe Gly Ala Asp Arg Phe Phe Val
2735 2740 2745
Tyr Asn Ala Glu Ser Gly Ser Asp Phe Val Cys Gly Thr Gly Leu
2750 2755 2760
Phe Thr Leu Leu Met Asn Val Ile Ser Val Phe Ser Lys Thr Val
2765 2770 2775
Pro Val Thr Val Leu Ser Gly Gln Ile Leu Phe Asn Cys Ile Ile
2780 2785 2790
Ala Phe Val Ala Val Ala Val Cys Phe Leu Phe Thr Lys Phe Lys
2795 2800 2805
Arg Met Phe Gly Asp Met Ser Val Gly Val Phe Thr Val Gly Ala
2810 2815 2820
Cys Thr Leu Leu Asn Asn Val Ser Tyr Ile Val Thr Gln Asn Thr
2825 2830 2835
Leu Gly Met Leu Gly Tyr Ala Thr Leu Tyr Phe Leu Cys Thr Lys
2840 2845 2850
Gly Val Arg Tyr Met Trp Ile Trp His Leu Gly Phe Leu Ile Ser
2855 2860 2865
Tyr Ile Leu Ile Ala Pro Trp Trp Val Leu Met Val Tyr Ala Phe
2870 2875 2880
Ser Ala Ile Phe Glu Phe Met Pro Asn Leu Phe Lys Leu Lys Val
2885 2890 2895
Ser Thr Gln Leu Phe Glu Gly Asp Lys Phe Val Gly Ser Phe Glu
2900 2905 2910
Asn Ala Ala Ala Gly Thr Phe Val Leu Asp Met His Ala Tyr Glu
2915 2920 2925
Arg Leu Ala Asn Ser Ile Ser Thr Glu Lys Leu Arg Gln Tyr Ala
2930 2935 2940
Ser Thr Tyr Asn Lys Tyr Lys Tyr Tyr Ser Gly Ser Ala Ser Glu
2945 2950 2955
Ala Asp Tyr Arg Leu Ala Cys Phe Ala His Leu Ala Lys Ala Met
2960 2965 2970
Met Asp Tyr Ala Ser Asn His Asn Asp Thr Leu Tyr Thr Pro Pro
2975 2980 2985
Thr Val Ser Tyr Asn Ser Thr Leu Gln Ala Gly Leu Arg Lys Met
2990 2995 3000
Ala Gln Pro Ser Gly Val Val Glu Lys Cys Ile Val Arg Val Cys
3005 3010 3015
Tyr Gly Asn Met Ala Leu Asn Gly Leu Trp Leu Gly Asp Ile Val
3020 3025 3030
Met Cys Pro Arg His Val Ile Ala Ser Ser Thr Thr Ser Thr Ile
3035 3040 3045
Asp Tyr Asp Tyr Ala Leu Ser Val Leu Arg Leu His Asn Phe Ser
3050 3055 3060
Ile Ser Ser Gly Asn Val Phe Leu Gly Val Val Ser Ala Thr Met
3065 3070 3075
Arg Gly Ala Leu Leu Gln Ile Lys Val Asn Gln Asn Asn Val His
3080 3085 3090
Thr Pro Lys Tyr Thr Tyr Arg Thr Val Arg Pro Gly Glu Ser Phe
3095 3100 3105
Asn Ile Leu Ala Cys Tyr Asp Gly Ala Ala Ala Gly Val Tyr Gly
3110 3115 3120
Val Asn Met Arg Ser Asn Tyr Thr Ile Arg Gly Ser Phe Ile Asn
3125 3130 3135
Gly Ala Cys Gly Ser Pro Gly Tyr Asn Ile Asn Asn Gly Thr Val
3140 3145 3150
Glu Phe Cys Tyr Leu His Gln Leu Glu Leu Gly Ser Gly Cys His
3155 3160 3165
Val Gly Ser Asp Leu Asp Gly Val Met Tyr Gly Gly Tyr Glu Asp
3170 3175 3180
Gln Pro Thr Leu Gln Val Glu Gly Ala Ser Ser Leu Phe Thr Glu
3185 3190 3195
Asn Val Leu Ala Phe Leu Tyr Ala Ala Leu Ile Asn Gly Ser Thr
3200 3205 3210
Trp Trp Leu Ser Ser Ser Arg Ile Ala Val Asp Arg Phe Asn Glu
3215 3220 3225
Trp Ala Val His Asn Gly Met Thr Thr Val Gly Asn Thr Asp Cys
3230 3235 3240
Phe Ser Ile Leu Ala Ala Lys Thr Gly Val Asp Val Gln Arg Leu
3245 3250 3255
Leu Ala Ser Ile Gln Ser Leu His Lys Asn Phe Gly Gly Lys Gln
3260 3265 3270
Ile Leu Gly His Thr Ser Leu Thr Asp Glu Phe Thr Thr Gly Glu
3275 3280 3285
Val Val Arg Gln Met Tyr Gly Val Asn Leu Gln Gly Gly Tyr Val
3290 3295 3300
Ser Arg Ala Cys Arg Asn Val Leu Leu Val Gly Ser Phe Leu Thr
3305 3310 3315
Phe Phe Trp Ser Glu Leu Val Ser Tyr Thr Lys Phe Phe Trp Val
3320 3325 3330
Asn Pro Gly Tyr Val Thr Pro Met Phe Ala Cys Leu Ser Leu Leu
3335 3340 3345
Ser Ser Leu Leu Met Phe Thr Leu Lys His Lys Thr Leu Phe Phe
3350 3355 3360
Gln Val Phe Leu Ile Pro Ala Leu Ile Val Thr Ser Cys Ile Asn
3365 3370 3375
Leu Ala Phe Asp Val Glu Val Tyr Asn Tyr Leu Ala Glu His Phe
3380 3385 3390
Asp Tyr His Val Ser Leu Met Gly Phe Asn Ala Gln Gly Leu Val
3395 3400 3405
Asn Ile Phe Val Cys Phe Val Val Thr Ile Leu His Gly Thr Tyr
3410 3415 3420
Thr Trp Arg Phe Phe Asn Thr Pro Ala Ser Ser Val Thr Tyr Val
3425 3430 3435
Val Ala Leu Leu Thr Ala Ala Tyr Asn Tyr Phe Tyr Ala Ser Asp
3440 3445 3450
Ile Leu Ser Cys Ala Met Thr Leu Phe Ala Ser Val Thr Gly Asn
3455 3460 3465
Trp Phe Val Gly Ala Val Cys Tyr Lys Val Ala Val Tyr Met Ala
3470 3475 3480
Leu Arg Phe Pro Thr Phe Val Ala Ile Phe Gly Asp Ile Lys Ser
3485 3490 3495
Val Met Phe Cys Tyr Leu Val Leu Gly Tyr Phe Thr Cys Cys Phe
3500 3505 3510
Tyr Gly Ile Leu Tyr Trp Phe Asn Arg Phe Phe Lys Val Ser Val
3515 3520 3525
Gly Val Tyr Asp Tyr Thr Val Ser Ala Ala Glu Phe Lys Tyr Met
3530 3535 3540
Val Ala Asn Gly Leu Arg Ala Pro Thr Gly Thr Leu Asp Ser Leu
3545 3550 3555
Leu Leu Ser Ala Lys Leu Ile Gly Ile Gly Gly Glu Arg Asn Ile
3560 3565 3570
Lys Ile Ser Ser Val Gln Ser Lys Leu Thr Asp Ile Lys Cys Ser
3575 3580 3585
Asn Val Val Leu Leu Gly Cys Leu Ser Ser Met Asn Val Ser Ala
3590 3595 3600
Asn Ser Thr Glu Trp Ala Tyr Cys Val Asp Leu His Asn Lys Ile
3605 3610 3615
Asn Leu Cys Asn Asp Pro Glu Lys Ala Gln Glu Met Leu Leu Ala
3620 3625 3630
Leu Leu Ala Phe Phe Leu Ser Lys Asn Ser Ala Phe Gly Leu Asp
3635 3640 3645
Asp Leu Leu Glu Ser Tyr Phe Asn Asp Asn Ser Met Leu Gln Ser
3650 3655 3660
Val Ala Ser Thr Tyr Val Gly Leu Pro Ser Tyr Val Ile Tyr Glu
3665 3670 3675
Asn Ala Arg Gln Gln Tyr Glu Asp Ala Val Asn Asn Gly Ser Pro
3680 3685 3690
Pro Gln Leu Val Lys Gln Leu Arg His Ala Met Asn Val Ala Lys
3695 3700 3705
Ser Glu Phe Asp Arg Glu Ala Ser Thr Gln Arg Lys Leu Asp Arg
3710 3715 3720
Met Ala Glu Gln Ala Ala Ala Gln Met Tyr Lys Glu Ala Arg Ala
3725 3730 3735
Val Asn Arg Lys Ser Lys Val Val Ser Ala Met His Ser Leu Leu
3740 3745 3750
Phe Gly Met Leu Arg Arg Leu Asp Met Ser Ser Val Asp Thr Ile
3755 3760 3765
Leu Asn Leu Ala Lys Asp Gly Val Val Pro Leu Ser Val Ile Pro
3770 3775 3780
Ala Val Ser Ala Thr Lys Leu Asn Ile Val Thr Ser Asp Ile Asp
3785 3790 3795
Ser Tyr Asn Arg Ile Gln Arg Glu Gly Cys Val His Tyr Ala Gly
3800 3805 3810
Thr Ile Trp Asn Ile Ile Asp Ile Lys Asp Asn Asp Gly Lys Val
3815 3820 3825
Val His Val Lys Glu Val Thr Ala Gln Asn Ala Glu Ser Leu Ser
3830 3835 3840
Trp Pro Leu Val Leu Gly Cys Glu Arg Ile Val Lys Leu Gln Asn
3845 3850 3855
Asn Glu Ile Ile Pro Gly Lys Leu Lys Gln Arg Ser Ile Lys Ala
3860 3865 3870
Glu Gly Asp Gly Ile Val Gly Glu Gly Lys Ala Leu Tyr Asn Asn
3875 3880 3885
Glu Gly Gly Arg Thr Phe Met Tyr Ala Phe Ile Ser Asp Lys Pro
3890 3895 3900
Asp Leu Arg Val Val Lys Trp Glu Phe Asp Gly Gly Cys Asn Thr
3905 3910 3915
Ile Glu Leu Glu Pro Pro Arg Lys Phe Leu Val Asp Ser Pro Asn
3920 3925 3930
Gly Ala Gln Ile Lys Tyr Leu Tyr Phe Val Arg Asn Leu Asn Thr
3935 3940 3945
Leu Arg Arg Gly Ala Val Leu Gly Tyr Ile Gly Ala Thr Val Arg
3950 3955 3960
Leu Gln Ala Gly Lys Gln Thr Glu Gln Ala Ile Asn Ser Ser Leu
3965 3970 3975
Leu Thr Leu Cys Ala Phe Ala Val Asp Pro Ala Lys Thr Tyr Ile
3980 3985 3990
Asp Ala Val Lys Ser Gly His Lys Pro Val Gly Asn Cys Val Lys
3995 4000 4005
Met Leu Ala Asn Gly Ser Gly Asn Gly Gln Ala Val Thr Asn Gly
4010 4015 4020
Val Glu Ala Ser Thr Asn Gln Asp Ser Tyr Gly Gly Ala Ser Val
4025 4030 4035
Cys Leu Tyr Cys Arg Ala His Val Glu His Pro Ser Met Asp Gly
4040 4045 4050
Phe Cys Arg Leu Lys Gly Lys Tyr Val Gln Val Pro Leu Gly Thr
4055 4060 4065
Val Asp Pro Ile Arg Phe Val Leu Glu Asn Asp Val Cys Lys Val
4070 4075 4080
Cys Gly Cys Trp Leu Ser Asn Gly Cys Thr Cys Asp Arg Ser Ile
4085 4090 4095
Met Gln Ser Thr
4100
<210> 18
<211> 4382
<212> PRT
<213> human SARS virus
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 18
Met Glu Ser Leu Val Leu Gly Val Asn Glu Lys Thr His Val Gln Leu
1 5 10 15
Ser Leu Pro Val Leu Gln Val Arg Asp Val Leu Val Arg Gly Phe Gly
20 25 30
Asp Ser Val Glu Glu Ala Leu Ser Glu Ala Arg Glu His Leu Lys Asn
35 40 45
Gly Thr Cys Gly Leu Val Glu Leu Glu Lys Gly Val Leu Pro Gln Leu
50 55 60
Glu Gln Pro Tyr Val Phe Ile Lys Arg Ser Asp Ala Leu Ser Thr Asn
65 70 75 80
His Gly His Lys Val Val Glu Leu Val Ala Glu Met Asp Gly Ile Gln
85 90 95
Tyr Gly Arg Ser Gly Ile Thr Leu Gly Val Leu Val Pro His Val Gly
100 105 110
Glu Thr Pro Ile Ala Tyr Arg Asn Val Leu Leu Arg Lys Asn Gly Asn
115 120 125
Lys Gly Ala Gly Gly His Ser Tyr Gly Ile Asp Leu Lys Ser Tyr Asp
130 135 140
Leu Gly Asp Glu Leu Gly Thr Asp Pro Ile Glu Asp Tyr Glu Gln Asn
145 150 155 160
Trp Asn Thr Lys His Gly Ser Gly Ala Leu Arg Glu Leu Thr Arg Glu
165 170 175
Leu Asn Gly Gly Ala Val Thr Arg Tyr Val Asp Asn Asn Phe Cys Gly
180 185 190
Pro Asp Gly Tyr Pro Leu Asp Cys Ile Lys Asp Phe Leu Ala Arg Ala
195 200 205
Gly Lys Ser Met Cys Thr Leu Ser Glu Gln Leu Asp Tyr Ile Glu Ser
210 215 220
Lys Arg Gly Val Tyr Cys Cys Arg Asp His Glu His Glu Ile Ala Trp
225 230 235 240
Phe Thr Glu Arg Ser Asp Lys Ser Tyr Glu His Gln Thr Pro Phe Glu
245 250 255
Ile Lys Ser Ala Lys Lys Phe Asp Thr Phe Lys Gly Glu Cys Pro Lys
260 265 270
Phe Val Phe Pro Leu Asn Ser Lys Val Lys Val Ile Gln Pro Arg Val
275 280 285
Glu Lys Lys Lys Thr Glu Gly Phe Met Gly Arg Ile Arg Ser Val Tyr
290 295 300
Pro Val Ala Ser Pro Gln Glu Cys Asn Asn Met His Leu Ser Thr Leu
305 310 315 320
Met Lys Cys Asn His Cys Asp Glu Val Ser Trp Gln Thr Cys Asp Phe
325 330 335
Leu Lys Ala Thr Cys Glu His Cys Gly Thr Glu Asn Leu Val Ile Glu
340 345 350
Gly Pro Thr Thr Cys Gly Tyr Leu Pro Thr Asn Ala Val Val Lys Met
355 360 365
Pro Cys Pro Ala Cys Gln Asp Pro Glu Ile Gly Pro Glu His Ser Val
370 375 380
Ala Asp Tyr His Asn His Ser Asn Ile Glu Thr Arg Leu Arg Lys Gly
385 390 395 400
Gly Arg Thr Arg Cys Phe Gly Gly Cys Val Phe Ala Tyr Val Gly Cys
405 410 415
Tyr Asn Lys Arg Ala Tyr Trp Val Pro Arg Ala Ser Ala Asp Ile Gly
420 425 430
Ser Gly His Thr Gly Ile Thr Gly Asp Asn Val Glu Thr Leu Asn Glu
435 440 445
Asp Leu Leu Glu Ile Leu Ser Arg Glu Arg Val Asn Ile Asn Ile Val
450 455 460
Gly Asp Phe His Leu Asn Glu Glu Val Ala Ile Ile Leu Ala Ser Phe
465 470 475 480
Ser Ala Ser Thr Ser Ala Phe Ile Asp Thr Ile Lys Ser Leu Asp Tyr
485 490 495
Lys Ser Phe Lys Thr Ile Val Glu Ser Cys Gly Asn Tyr Lys Val Thr
500 505 510
Lys Gly Lys Pro Val Lys Gly Ala Trp Asn Ile Gly Gln Gln Arg Ser
515 520 525
Val Leu Thr Pro Leu Cys Gly Phe Pro Ser Gln Ala Ala Gly Val Ile
530 535 540
Arg Ser Ile Phe Ala Arg Thr Leu Asp Ala Ala Asn His Ser Ile Pro
545 550 555 560
Asp Leu Gln Arg Ala Ala Val Thr Ile Leu Asp Gly Ile Ser Glu Gln
565 570 575
Ser Leu Arg Leu Val Asp Ala Met Val Tyr Thr Ser Asp Leu Leu Thr
580 585 590
Asn Ser Val Ile Ile Met Ala Tyr Val Thr Gly Gly Leu Val Gln Gln
595 600 605
Thr Ser Gln Trp Leu Ser Asn Leu Leu Gly Thr Thr Val Glu Lys Leu
610 615 620
Arg Pro Ile Phe Glu Trp Ile Glu Ala Lys Leu Ser Ala Gly Val Glu
625 630 635 640
Phe Leu Lys Asp Ala Trp Glu Ile Leu Lys Phe Leu Ile Thr Gly Val
645 650 655
Phe Asp Ile Val Lys Gly Gln Ile Gln Val Ala Ser Asp Asn Ile Lys
660 665 670
Asp Cys Val Lys Cys Phe Ile Asp Val Val Asn Lys Ala Leu Glu Met
675 680 685
Cys Ile Asp Gln Val Thr Ile Ala Gly Ala Lys Leu Arg Ser Leu Asn
690 695 700
Leu Gly Glu Val Phe Ile Ala Gln Ser Lys Gly Leu Tyr Arg Gln Cys
705 710 715 720
Ile Arg Gly Lys Glu Gln Leu Gln Leu Leu Met Pro Leu Lys Ala Pro
725 730 735
Lys Glu Val Thr Phe Leu Glu Gly Asp Ser His Asp Thr Val Leu Thr
740 745 750
Ser Glu Glu Val Val Leu Lys Asn Gly Glu Leu Glu Ala Leu Glu Thr
755 760 765
Pro Val Asp Ser Phe Thr Asn Gly Ala Ile Val Gly Thr Pro Val Cys
770 775 780
Val Asn Gly Leu Met Leu Leu Glu Ile Lys Asp Lys Glu Gln Tyr Cys
785 790 795 800
Ala Leu Ser Pro Gly Leu Leu Ala Thr Asn Asn Val Phe Arg Leu Lys
805 810 815
Gly Gly Ala Pro Ile Lys Gly Val Thr Phe Gly Glu Asp Thr Val Trp
820 825 830
Glu Val Gln Gly Tyr Lys Asn Val Arg Ile Thr Phe Glu Leu Asp Glu
835 840 845
Arg Val Asp Lys Val Leu Asn Glu Lys Cys Ser Val Tyr Thr Val Glu
850 855 860
Ser Gly Thr Glu Val Thr Glu Phe Ala Cys Val Val Ala Glu Ala Val
865 870 875 880
Val Lys Thr Leu Gln Pro Val Ser Asp Leu Leu Thr Asn Met Gly Ile
885 890 895
Asp Leu Asp Glu Trp Ser Val Ala Thr Phe Tyr Leu Phe Asp Asp Ala
900 905 910
Gly Glu Glu Asn Phe Ser Ser Arg Met Tyr Cys Ser Phe Tyr Pro Pro
915 920 925
Asp Glu Glu Glu Glu Asp Asp Ala Glu Cys Glu Glu Glu Glu Ile Asp
930 935 940
Glu Thr Cys Glu His Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Leu
945 950 955 960
Pro Leu Glu Phe Gly Ala Ser Ala Glu Thr Val Arg Val Glu Glu Glu
965 970 975
Glu Glu Glu Asp Trp Leu Asp Asp Thr Thr Glu Gln Ser Glu Ile Glu
980 985 990
Pro Glu Pro Glu Pro Thr Pro Glu Glu Pro Val Asn Gln Phe Thr Gly
995 1000 1005
Tyr Leu Lys Leu Thr Asp Asn Val Ala Ile Lys Cys Val Asp Ile
1010 1015 1020
Val Lys Glu Ala Gln Ser Ala Asn Pro Met Val Ile Val Asn Ala
1025 1030 1035
Ala Asn Ile His Leu Lys His Gly Gly Gly Val Ala Gly Ala Leu
1040 1045 1050
Asn Lys Ala Thr Asn Gly Ala Met Gln Lys Glu Ser Asp Asp Tyr
1055 1060 1065
Ile Lys Leu Asn Gly Pro Leu Thr Val Gly Gly Ser Cys Leu Leu
1070 1075 1080
Ser Gly His Asn Leu Ala Lys Lys Cys Leu His Val Val Gly Pro
1085 1090 1095
Asn Leu Asn Ala Gly Glu Asp Ile Gln Leu Leu Lys Ala Ala Tyr
1100 1105 1110
Glu Asn Phe Asn Ser Gln Asp Ile Leu Leu Ala Pro Leu Leu Ser
1115 1120 1125
Ala Gly Ile Phe Gly Ala Lys Pro Leu Gln Ser Leu Gln Val Cys
1130 1135 1140
Val Gln Thr Val Arg Thr Gln Val Tyr Ile Ala Val Asn Asp Lys
1145 1150 1155
Ala Leu Tyr Glu Gln Val Val Met Asp Tyr Leu Asp Asn Leu Lys
1160 1165 1170
Pro Arg Val Glu Ala Pro Lys Gln Glu Glu Pro Pro Asn Thr Glu
1175 1180 1185
Asp Ser Lys Thr Glu Glu Lys Ser Val Val Gln Lys Pro Val Asp
1190 1195 1200
Val Lys Pro Lys Ile Lys Ala Cys Ile Asp Glu Val Thr Thr Thr
1205 1210 1215
Leu Glu Glu Thr Lys Phe Leu Thr Asn Lys Leu Leu Leu Phe Ala
1220 1225 1230
Asp Ile Asn Gly Lys Leu Tyr His Asp Ser Gln Asn Met Leu Arg
1235 1240 1245
Gly Glu Asp Met Ser Phe Leu Glu Lys Asp Ala Pro Tyr Met Val
1250 1255 1260
Gly Asp Val Ile Thr Ser Gly Asp Ile Thr Cys Val Val Ile Pro
1265 1270 1275
Ser Lys Lys Ala Gly Gly Thr Thr Glu Met Leu Ser Arg Ala Leu
1280 1285 1290
Lys Lys Val Pro Val Asp Glu Tyr Ile Thr Thr Tyr Pro Gly Gln
1295 1300 1305
Gly Cys Ala Gly Tyr Thr Leu Glu Glu Ala Lys Thr Ala Leu Lys
1310 1315 1320
Lys Cys Lys Ser Ala Phe Tyr Val Leu Pro Ser Glu Ala Pro Asn
1325 1330 1335
Ala Lys Glu Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg Glu
1340 1345 1350
Met Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Ile Cys
1355 1360 1365
Met Asp Val Arg Ala Ile Met Ala Thr Ile Gln Arg Lys Tyr Lys
1370 1375 1380
Gly Ile Lys Ile Gln Glu Gly Ile Val Asp Tyr Gly Val Arg Phe
1385 1390 1395
Phe Phe Tyr Thr Ser Lys Glu Pro Val Ala Ser Ile Ile Thr Lys
1400 1405 1410
Leu Asn Ser Leu Asn Glu Pro Leu Val Thr Met Pro Ile Gly Tyr
1415 1420 1425
Val Thr His Gly Phe Asn Leu Glu Glu Ala Ala Arg Cys Met Arg
1430 1435 1440
Ser Leu Lys Ala Pro Ala Val Val Ser Val Ser Ser Pro Asp Ala
1445 1450 1455
Val Thr Thr Tyr Asn Gly Tyr Leu Thr Ser Ser Ser Lys Thr Ser
1460 1465 1470
Glu Glu His Phe Val Glu Thr Val Ser Leu Ala Gly Ser Tyr Arg
1475 1480 1485
Asp Trp Ser Tyr Ser Gly Gln Arg Thr Glu Leu Gly Val Glu Phe
1490 1495 1500
Leu Lys Arg Gly Asp Lys Ile Val Tyr His Thr Leu Glu Ser Pro
1505 1510 1515
Val Glu Phe His Leu Asp Gly Glu Val Leu Ser Leu Asp Lys Leu
1520 1525 1530
Lys Ser Leu Leu Ser Leu Arg Glu Val Lys Thr Ile Lys Val Phe
1535 1540 1545
Thr Thr Val Asp Asn Thr Asn Leu His Thr Gln Leu Val Asp Met
1550 1555 1560
Ser Met Thr Tyr Gly Gln Gln Phe Gly Pro Thr Tyr Leu Asp Gly
1565 1570 1575
Ala Asp Val Thr Lys Ile Lys Pro His Val Asn His Glu Gly Lys
1580 1585 1590
Thr Phe Phe Val Leu Pro Ser Asp Asp Thr Leu Arg Ser Glu Ala
1595 1600 1605
Phe Glu Tyr Tyr His Thr Leu Asp Glu Ser Phe Leu Gly Arg Tyr
1610 1615 1620
Met Ser Ala Leu Asn His Thr Lys Lys Trp Lys Phe Pro Gln Val
1625 1630 1635
Gly Gly Leu Thr Ser Ile Lys Trp Ala Asp Asn Asn Cys Tyr Leu
1640 1645 1650
Ser Ser Val Leu Leu Ala Leu Gln Gln Leu Glu Val Lys Phe Asn
1655 1660 1665
Ala Pro Ala Leu Gln Glu Ala Tyr Tyr Arg Ala Arg Ala Gly Asp
1670 1675 1680
Ala Ala Asn Phe Cys Ala Leu Ile Leu Ala Tyr Ser Asn Lys Thr
1685 1690 1695
Val Gly Glu Leu Gly Asp Val Arg Glu Thr Met Thr His Leu Leu
1700 1705 1710
Gln His Ala Asn Leu Glu Ser Ala Lys Arg Val Leu Asn Val Val
1715 1720 1725
Cys Lys His Cys Gly Gln Lys Thr Thr Thr Leu Thr Gly Val Glu
1730 1735 1740
Ala Val Met Tyr Met Gly Thr Leu Ser Tyr Asp Asn Leu Lys Thr
1745 1750 1755
Gly Val Ser Ile Pro Cys Val Cys Gly Arg Asp Ala Thr Gln Tyr
1760 1765 1770
Leu Val Gln Gln Glu Ser Ser Phe Val Met Met Ser Ala Pro Pro
1775 1780 1785
Ala Glu Tyr Lys Leu Gln Gln Gly Thr Phe Leu Cys Ala Asn Glu
1790 1795 1800
Tyr Thr Gly Asn Tyr Gln Cys Gly His Tyr Thr His Ile Thr Ala
1805 1810 1815
Lys Glu Thr Leu Tyr Arg Ile Asp Gly Ala His Leu Thr Lys Met
1820 1825 1830
Ser Glu Tyr Lys Gly Pro Val Thr Asp Val Phe Tyr Lys Glu Thr
1835 1840 1845
Ser Tyr Thr Thr Thr Ile Lys Pro Val Ser Tyr Lys Leu Asp Gly
1850 1855 1860
Val Thr Tyr Thr Glu Ile Glu Pro Lys Leu Asp Gly Tyr Tyr Lys
1865 1870 1875
Lys Asp Asn Ala Tyr Tyr Thr Glu Gln Pro Ile Asp Leu Val Pro
1880 1885 1890
Thr Gln Pro Leu Pro Asn Ala Ser Phe Asp Asn Phe Lys Leu Thr
1895 1900 1905
Cys Ser Asn Thr Lys Phe Ala Asp Asp Leu Asn Gln Met Thr Gly
1910 1915 1920
Phe Thr Lys Pro Ala Ser Arg Glu Leu Ser Val Thr Phe Phe Pro
1925 1930 1935
Asp Leu Asn Gly Asp Val Val Ala Ile Asp Tyr Arg His Tyr Ser
1940 1945 1950
Ala Ser Phe Lys Lys Gly Ala Lys Leu Leu His Lys Pro Ile Val
1955 1960 1965
Trp His Ile Asn Gln Ala Thr Thr Lys Thr Thr Phe Lys Pro Asn
1970 1975 1980
Thr Trp Cys Leu Arg Cys Leu Trp Ser Thr Lys Pro Val Asp Thr
1985 1990 1995
Ser Asn Ser Phe Glu Val Leu Ala Val Glu Asp Thr Gln Gly Met
2000 2005 2010
Asp Asn Leu Ala Cys Glu Ser Gln Gln Pro Thr Ser Glu Glu Val
2015 2020 2025
Val Glu Asn Pro Thr Ile Gln Lys Glu Val Ile Glu Cys Asp Val
2030 2035 2040
Lys Thr Thr Glu Val Val Gly Asn Val Ile Leu Lys Pro Ser Asp
2045 2050 2055
Glu Gly Val Lys Val Thr Gln Glu Leu Gly His Glu Asp Leu Met
2060 2065 2070
Ala Ala Tyr Val Glu Asn Thr Ser Ile Thr Ile Lys Lys Pro Asn
2075 2080 2085
Glu Leu Ser Leu Ala Leu Gly Leu Lys Thr Ile Ala Thr His Gly
2090 2095 2100
Ile Ala Ala Ile Asn Ser Val Pro Trp Ser Lys Ile Leu Ala Tyr
2105 2110 2115
Val Lys Pro Phe Leu Gly Gln Ala Ala Ile Thr Thr Ser Asn Cys
2120 2125 2130
Ala Lys Arg Leu Ala Gln Arg Val Phe Asn Asn Tyr Met Pro Tyr
2135 2140 2145
Val Phe Thr Leu Leu Phe Gln Leu Cys Thr Phe Thr Lys Ser Thr
2150 2155 2160
Asn Ser Arg Ile Arg Ala Ser Leu Pro Thr Thr Ile Ala Lys Asn
2165 2170 2175
Ser Val Lys Ser Val Ala Lys Leu Cys Leu Asp Ala Gly Ile Asn
2180 2185 2190
Tyr Val Lys Ser Pro Lys Phe Ser Lys Leu Phe Thr Ile Ala Met
2195 2200 2205
Trp Leu Leu Leu Leu Ser Ile Cys Leu Gly Ser Leu Ile Cys Val
2210 2215 2220
Thr Ala Ala Phe Gly Val Leu Leu Ser Asn Phe Gly Ala Pro Ser
2225 2230 2235
Tyr Cys Asn Gly Val Arg Glu Leu Tyr Leu Asn Ser Ser Asn Val
2240 2245 2250
Thr Thr Met Asp Phe Cys Glu Gly Ser Phe Pro Cys Ser Ile Cys
2255 2260 2265
Leu Ser Gly Leu Asp Ser Leu Asp Ser Tyr Pro Ala Leu Glu Thr
2270 2275 2280
Ile Gln Val Thr Ile Ser Ser Tyr Lys Leu Asp Leu Thr Ile Leu
2285 2290 2295
Gly Leu Ala Ala Glu Trp Val Leu Ala Tyr Met Leu Phe Thr Lys
2300 2305 2310
Phe Phe Tyr Leu Leu Gly Leu Ser Ala Ile Met Gln Val Phe Phe
2315 2320 2325
Gly Tyr Phe Ala Ser His Phe Ile Ser Asn Ser Trp Leu Met Trp
2330 2335 2340
Phe Ile Ile Ser Ile Val Gln Met Ala Pro Val Ser Ala Met Val
2345 2350 2355
Arg Met Tyr Ile Phe Phe Ala Ser Phe Tyr Tyr Ile Trp Lys Ser
2360 2365 2370
Tyr Val His Ile Met Asp Gly Cys Thr Ser Ser Thr Cys Met Met
2375 2380 2385
Cys Tyr Lys Arg Asn Arg Ala Thr Arg Val Glu Cys Thr Thr Ile
2390 2395 2400
Val Asn Gly Met Lys Arg Ser Phe Tyr Val Tyr Ala Asn Gly Gly
2405 2410 2415
Arg Gly Phe Cys Lys Thr His Asn Trp Asn Cys Leu Asn Cys Asp
2420 2425 2430
Thr Phe Cys Thr Gly Ser Thr Phe Ile Ser Asp Glu Val Ala Arg
2435 2440 2445
Asp Leu Ser Leu Gln Phe Lys Arg Pro Ile Asn Pro Thr Asp Gln
2450 2455 2460
Ser Ser Tyr Ile Val Asp Ser Val Ala Val Lys Asn Gly Ala Leu
2465 2470 2475
His Leu Tyr Phe Asp Lys Ala Gly Gln Lys Thr Tyr Glu Arg His
2480 2485 2490
Pro Leu Ser His Phe Val Asn Leu Asp Asn Leu Arg Ala Asn Asn
2495 2500 2505
Thr Lys Gly Ser Leu Pro Ile Asn Val Ile Val Phe Asp Gly Lys
2510 2515 2520
Ser Lys Cys Asp Glu Ser Ala Ser Lys Ser Ala Ser Val Tyr Tyr
2525 2530 2535
Ser Gln Leu Met Cys Gln Pro Ile Leu Leu Leu Asp Gln Val Leu
2540 2545 2550
Val Ser Asp Val Gly Asp Ser Thr Glu Val Ser Val Lys Met Phe
2555 2560 2565
Asp Ala Tyr Val Asp Thr Phe Ser Ala Thr Phe Ser Val Pro Met
2570 2575 2580
Glu Lys Leu Lys Ala Leu Val Ala Thr Ala His Ser Glu Leu Ala
2585 2590 2595
Lys Gly Val Ala Leu Asp Gly Val Leu Ser Thr Phe Val Ser Ala
2600 2605 2610
Ala Arg Gln Gly Val Val Asp Thr Asp Val Asp Thr Lys Asp Val
2615 2620 2625
Ile Glu Cys Leu Lys Leu Ser His His Ser Asp Leu Glu Val Thr
2630 2635 2640
Gly Asp Ser Cys Asn Asn Phe Met Leu Thr Tyr Asn Lys Val Glu
2645 2650 2655
Asn Met Thr Pro Arg Asp Leu Gly Ala Cys Ile Asp Cys Asn Ala
2660 2665 2670
Arg His Ile Asn Ala Gln Val Ala Lys Ser His Asn Val Ser Leu
2675 2680 2685
Ile Trp Asn Val Lys Asp Tyr Met Ser Leu Ser Glu Gln Leu Arg
2690 2695 2700
Lys Gln Ile Arg Ser Ala Ala Lys Lys Asn Asn Ile Pro Phe Arg
2705 2710 2715
Leu Thr Cys Ala Thr Thr Arg Gln Val Val Asn Val Ile Thr Thr
2720 2725 2730
Lys Ile Ser Leu Lys Gly Gly Lys Ile Val Ser Thr Cys Phe Lys
2735 2740 2745
Leu Met Leu Lys Ala Thr Leu Leu Cys Val Leu Ala Ala Leu Val
2750 2755 2760
Cys Tyr Ile Val Met Pro Val His Thr Leu Ser Ile His Asp Gly
2765 2770 2775
Tyr Thr Asn Glu Ile Ile Gly Tyr Lys Ala Ile Gln Asp Gly Val
2780 2785 2790
Thr Arg Asp Ile Ile Ser Thr Asp Asp Cys Phe Ala Asn Lys His
2795 2800 2805
Ala Gly Phe Asp Ala Trp Phe Ser Gln Arg Gly Gly Ser Tyr Lys
2810 2815 2820
Asn Asp Lys Ser Cys Pro Val Val Ala Ala Ile Ile Thr Arg Glu
2825 2830 2835
Ile Gly Phe Ile Val Pro Gly Leu Pro Gly Thr Val Leu Arg Ala
2840 2845 2850
Ile Asn Gly Asp Phe Leu His Phe Leu Pro Arg Val Phe Ser Ala
2855 2860 2865
Val Gly Asn Ile Cys Tyr Thr Pro Ser Lys Leu Ile Glu Tyr Ser
2870 2875 2880
Asp Phe Ala Thr Ser Ala Cys Val Leu Ala Ala Glu Cys Thr Ile
2885 2890 2895
Phe Lys Asp Ala Met Gly Lys Pro Val Pro Tyr Cys Tyr Asp Thr
2900 2905 2910
Asn Leu Leu Glu Gly Ser Ile Ser Tyr Ser Glu Leu Arg Pro Asp
2915 2920 2925
Thr Arg Tyr Val Leu Met Asp Gly Ser Ile Ile Gln Phe Pro Asn
2930 2935 2940
Thr Tyr Leu Glu Gly Ser Val Arg Val Val Thr Thr Phe Asp Ala
2945 2950 2955
Glu Tyr Cys Arg His Gly Thr Cys Glu Arg Ser Glu Val Gly Ile
2960 2965 2970
Cys Leu Ser Thr Ser Gly Arg Trp Val Leu Asn Asn Glu His Tyr
2975 2980 2985
Arg Ala Leu Ser Gly Val Phe Cys Gly Val Asp Ala Met Asn Leu
2990 2995 3000
Ile Ala Asn Ile Phe Thr Pro Leu Val Gln Pro Val Gly Ala Leu
3005 3010 3015
Asp Val Ser Ala Ser Val Val Ala Gly Gly Ile Ile Ala Ile Leu
3020 3025 3030
Val Thr Cys Ala Ala Tyr Tyr Phe Met Lys Phe Arg Arg Val Phe
3035 3040 3045
Gly Glu Tyr Asn His Val Val Ala Ala Asn Ala Leu Leu Phe Leu
3050 3055 3060
Met Ser Phe Thr Ile Leu Cys Leu Val Pro Ala Tyr Ser Phe Leu
3065 3070 3075
Pro Gly Val Tyr Ser Val Phe Tyr Leu Tyr Leu Thr Phe Tyr Phe
3080 3085 3090
Thr Asn Asp Val Ser Phe Leu Ala His Leu Gln Trp Phe Ala Met
3095 3100 3105
Phe Ser Pro Ile Val Pro Phe Trp Ile Thr Ala Ile Tyr Val Phe
3110 3115 3120
Cys Ile Ser Leu Lys His Cys His Trp Phe Phe Asn Asn Tyr Leu
3125 3130 3135
Arg Lys Arg Val Met Phe Asn Gly Val Thr Phe Ser Thr Phe Glu
3140 3145 3150
Glu Ala Ala Leu Cys Thr Phe Leu Leu Asn Lys Glu Met Tyr Leu
3155 3160 3165
Lys Leu Arg Ser Glu Thr Leu Leu Pro Leu Thr Gln Tyr Asn Arg
3170 3175 3180
Tyr Leu Ala Leu Tyr Asn Lys Tyr Lys Tyr Phe Ser Gly Ala Leu
3185 3190 3195
Asp Thr Thr Ser Tyr Arg Glu Ala Ala Cys Cys His Leu Ala Lys
3200 3205 3210
Ala Leu Asn Asp Phe Ser Asn Ser Gly Ala Asp Val Leu Tyr Gln
3215 3220 3225
Pro Pro Gln Thr Ser Ile Thr Ser Ala Val Leu Gln Ser Gly Phe
3230 3235 3240
Arg Lys Met Ala Phe Pro Ser Gly Lys Val Glu Gly Cys Met Val
3245 3250 3255
Gln Val Thr Cys Gly Thr Thr Thr Leu Asn Gly Leu Trp Leu Asp
3260 3265 3270
Asp Thr Val Tyr Cys Pro Arg His Val Ile Cys Thr Ala Glu Asp
3275 3280 3285
Met Leu Asn Pro Asn Tyr Glu Asp Leu Leu Ile Arg Lys Ser Asn
3290 3295 3300
His Ser Phe Leu Val Gln Ala Gly Asn Val Gln Leu Arg Val Ile
3305 3310 3315
Gly His Ser Met Gln Asn Cys Leu Leu Arg Leu Lys Val Asp Thr
3320 3325 3330
Ser Asn Pro Lys Thr Pro Lys Tyr Lys Phe Val Arg Ile Gln Pro
3335 3340 3345
Gly Gln Thr Phe Ser Val Leu Ala Cys Tyr Asn Gly Ser Pro Ser
3350 3355 3360
Gly Val Tyr Gln Cys Ala Met Arg Pro Asn His Thr Ile Lys Gly
3365 3370 3375
Ser Phe Leu Asn Gly Ser Cys Gly Ser Val Gly Phe Asn Ile Asp
3380 3385 3390
Tyr Asp Cys Val Ser Phe Cys Tyr Met His His Met Glu Leu Pro
3395 3400 3405
Thr Gly Val His Ala Gly Thr Asp Leu Glu Gly Lys Phe Tyr Gly
3410 3415 3420
Pro Phe Val Asp Arg Gln Thr Ala Gln Ala Ala Gly Thr Asp Thr
3425 3430 3435
Thr Ile Thr Leu Asn Val Leu Ala Trp Leu Tyr Ala Ala Val Ile
3440 3445 3450
Asn Gly Asp Arg Trp Phe Leu Asn Arg Phe Thr Thr Thr Leu Asn
3455 3460 3465
Asp Phe Asn Leu Val Ala Met Lys Tyr Asn Tyr Glu Pro Leu Thr
3470 3475 3480
Gln Asp His Val Asp Ile Leu Gly Pro Leu Ser Ala Gln Thr Gly
3485 3490 3495
Ile Ala Val Leu Asp Met Cys Ala Ala Leu Lys Glu Leu Leu Gln
3500 3505 3510
Asn Gly Met Asn Gly Arg Thr Ile Leu Gly Ser Thr Ile Leu Glu
3515 3520 3525
Asp Glu Phe Thr Pro Phe Asp Val Val Arg Gln Cys Ser Gly Val
3530 3535 3540
Thr Phe Gln Gly Lys Phe Lys Lys Ile Val Lys Gly Thr His His
3545 3550 3555
Trp Met Leu Leu Thr Phe Leu Thr Ser Leu Leu Ile Leu Val Gln
3560 3565 3570
Ser Thr Gln Trp Ser Leu Phe Phe Phe Val Tyr Glu Asn Ala Phe
3575 3580 3585
Leu Pro Phe Thr Leu Gly Ile Met Ala Ile Ala Ala Cys Ala Met
3590 3595 3600
Leu Leu Val Lys His Lys His Ala Phe Leu Cys Leu Phe Leu Leu
3605 3610 3615
Pro Ser Leu Ala Thr Val Ala Tyr Phe Asn Met Val Tyr Met Pro
3620 3625 3630
Ala Ser Trp Val Met Arg Ile Met Thr Trp Leu Glu Leu Ala Asp
3635 3640 3645
Thr Ser Leu Ser Gly Tyr Arg Leu Lys Asp Cys Val Met Tyr Ala
3650 3655 3660
Ser Ala Leu Val Leu Leu Ile Leu Met Thr Ala Arg Thr Val Tyr
3665 3670 3675
Asp Asp Ala Ala Arg Arg Val Trp Thr Leu Met Asn Val Ile Thr
3680 3685 3690
Leu Val Tyr Lys Val Tyr Tyr Gly Asn Ala Leu Asp Gln Ala Ile
3695 3700 3705
Ser Met Trp Ala Leu Val Ile Ser Val Thr Ser Asn Tyr Ser Gly
3710 3715 3720
Val Val Thr Thr Ile Met Phe Leu Ala Arg Ala Ile Val Phe Val
3725 3730 3735
Cys Val Glu Tyr Tyr Pro Leu Leu Phe Ile Thr Gly Asn Thr Leu
3740 3745 3750
Gln Cys Ile Met Leu Val Tyr Cys Phe Leu Gly Tyr Cys Cys Cys
3755 3760 3765
Cys Tyr Phe Gly Leu Phe Cys Leu Leu Asn Arg Tyr Phe Arg Leu
3770 3775 3780
Thr Leu Gly Val Tyr Asp Tyr Leu Val Ser Thr Gln Glu Phe Arg
3785 3790 3795
Tyr Met Asn Ser Gln Gly Leu Leu Pro Pro Lys Ser Ser Ile Asp
3800 3805 3810
Ala Phe Lys Leu Asn Ile Lys Leu Leu Gly Ile Gly Gly Lys Pro
3815 3820 3825
Cys Ile Lys Val Ala Thr Val Gln Ser Lys Met Ser Asp Val Lys
3830 3835 3840
Cys Thr Ser Val Val Leu Leu Ser Val Leu Gln Gln Leu Arg Val
3845 3850 3855
Glu Ser Ser Ser Lys Leu Trp Ala Gln Cys Val Gln Leu His Asn
3860 3865 3870
Asp Ile Leu Leu Ala Lys Asp Thr Thr Glu Ala Phe Glu Lys Met
3875 3880 3885
Val Ser Leu Leu Ser Val Leu Leu Ser Met Gln Gly Ala Val Asp
3890 3895 3900
Ile Asn Arg Leu Cys Glu Glu Met Leu Asp Asn Arg Ala Thr Leu
3905 3910 3915
Gln Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro Ser Tyr Ala Ala
3920 3925 3930
Tyr Ala Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val Ala Asn Gly
3935 3940 3945
Asp Ser Glu Val Val Leu Lys Lys Leu Lys Lys Ser Leu Asn Val
3950 3955 3960
Ala Lys Ser Glu Phe Asp Arg Asp Ala Ala Met Gln Arg Lys Leu
3965 3970 3975
Glu Lys Met Ala Asp Gln Ala Met Thr Gln Met Tyr Lys Gln Ala
3980 3985 3990
Arg Ser Glu Asp Lys Arg Ala Lys Val Thr Ser Ala Met Gln Thr
3995 4000 4005
Met Leu Phe Thr Met Leu Arg Lys Leu Asp Asn Asp Ala Leu Asn
4010 4015 4020
Asn Ile Ile Asn Asn Ala Arg Asp Gly Cys Val Pro Leu Asn Ile
4025 4030 4035
Ile Pro Leu Thr Thr Ala Ala Lys Leu Met Val Val Val Pro Asp
4040 4045 4050
Tyr Gly Thr Tyr Lys Asn Thr Cys Asp Gly Asn Thr Phe Thr Tyr
4055 4060 4065
Ala Ser Ala Leu Trp Glu Ile Gln Gln Val Val Asp Ala Asp Ser
4070 4075 4080
Lys Ile Val Gln Leu Ser Glu Ile Asn Met Asp Asn Ser Pro Asn
4085 4090 4095
Leu Ala Trp Pro Leu Ile Val Thr Ala Leu Arg Ala Asn Ser Ala
4100 4105 4110
Val Lys Leu Gln Asn Asn Glu Leu Ser Pro Val Ala Leu Arg Gln
4115 4120 4125
Met Ser Cys Ala Ala Gly Thr Thr Gln Thr Ala Cys Thr Asp Asp
4130 4135 4140
Asn Ala Leu Ala Tyr Tyr Asn Asn Ser Lys Gly Gly Arg Phe Val
4145 4150 4155
Leu Ala Leu Leu Ser Asp His Gln Asp Leu Lys Trp Ala Arg Phe
4160 4165 4170
Pro Lys Ser Asp Gly Thr Gly Thr Ile Tyr Thr Glu Leu Glu Pro
4175 4180 4185
Pro Cys Arg Phe Val Thr Asp Thr Pro Lys Gly Pro Lys Val Lys
4190 4195 4200
Tyr Leu Tyr Phe Ile Lys Gly Leu Asn Asn Leu Asn Arg Gly Met
4205 4210 4215
Val Leu Gly Ser Leu Ala Ala Thr Val Arg Leu Gln Ala Gly Asn
4220 4225 4230
Ala Thr Glu Val Pro Ala Asn Ser Thr Val Leu Ser Phe Cys Ala
4235 4240 4245
Phe Ala Val Asp Pro Ala Lys Ala Tyr Lys Asp Tyr Leu Ala Ser
4250 4255 4260
Gly Gly Gln Pro Ile Thr Asn Cys Val Lys Met Leu Cys Thr His
4265 4270 4275
Thr Gly Thr Gly Gln Ala Ile Thr Val Thr Pro Glu Ala Asn Met
4280 4285 4290
Asp Gln Glu Ser Phe Gly Gly Ala Ser Cys Cys Leu Tyr Cys Arg
4295 4300 4305
Cys His Ile Asp His Pro Asn Pro Lys Gly Phe Cys Asp Leu Lys
4310 4315 4320
Gly Lys Tyr Val Gln Ile Pro Thr Thr Cys Ala Asn Asp Pro Val
4325 4330 4335
Gly Phe Thr Leu Arg Asn Thr Val Cys Thr Val Cys Gly Met Trp
4340 4345 4350
Lys Gly Tyr Gly Cys Ser Cys Asp Gln Leu Arg Glu Pro Leu Met
4355 4360 4365
Gln Ser Ala Asp Ala Ser Thr Phe Leu Asn Gly Phe Ala Val
4370 4375 4380
<210> 19
<211> 4383
<212> PRT
<213> Bovine coronavirus
<220>
<221> MISC_FEATURE
<223> ORF 1A
<400> 19
Met Ser Lys Ile Asn Lys Tyr Gly Leu Glu Leu His Trp Ala Pro Glu
1 5 10 15
Phe Pro Trp Met Phe Glu Asp Ala Glu Glu Lys Leu Asp Asn Pro Ser
20 25 30
Ser Ser Glu Val Asp Ile Val Cys Ser Thr Thr Ala Gln Lys Leu Glu
35 40 45
Thr Gly Gly Ile Cys Pro Glu Asn His Val Met Val Asp Cys Arg Arg
50 55 60
Leu Leu Lys Gln Glu Cys Cys Val Gln Ser Ser Leu Ile Arg Glu Ile
65 70 75 80
Val Met Asn Thr Arg Pro Tyr Asp Leu Glu Val Leu Leu Gln Asp Ala
85 90 95
Leu Gln Ser Arg Glu Ala Val Leu Val Thr Pro Pro Leu Gly Met Ser
100 105 110
Leu Glu Ala Cys Tyr Val Arg Gly Cys Asn Pro Asn Gly Trp Thr Met
115 120 125
Gly Leu Phe Arg Arg Arg Ser Val Cys Asn Thr Gly Arg Cys Ala Val
130 135 140
Asn Lys His Val Ala Tyr Gln Leu Tyr Met Ile Asp Pro Ala Gly Val
145 150 155 160
Cys Phe Gly Ala Gly Gln Phe Val Gly Trp Val Ile Pro Leu Ala Phe
165 170 175
Met Pro Val Gln Ser Arg Lys Phe Ile Ala Pro Trp Val Met Tyr Leu
180 185 190
Arg Lys Cys Gly Glu Lys Gly Ala Tyr Ile Lys Asp Tyr Lys Arg Gly
195 200 205
Gly Phe Glu His Val Tyr Asn Phe Lys Val Glu Asp Ala Tyr Asp Leu
210 215 220
Val His Asp Glu Pro Lys Gly Lys Phe Ser Lys Lys Ala Tyr Ala Leu
225 230 235 240
Ile Arg Gly Tyr Arg Gly Val Lys Pro Leu Leu Tyr Val Asp Gln Tyr
245 250 255
Gly Cys Asp Tyr Thr Gly Gly Leu Ala Asp Gly Leu Glu Ala Tyr Ala
260 265 270
Asp Lys Thr Leu Gln Glu Met Lys Ala Leu Phe Pro Ile Trp Ser Gln
275 280 285
Glu Leu Pro Phe Asp Val Thr Val Ala Trp His Val Val Arg Asp Pro
290 295 300
Arg Tyr Val Met Arg Leu Gln Ser Ala Ser Thr Ile Arg Ser Val Ala
305 310 315 320
Tyr Val Ala Asn Pro Thr Glu Asp Leu Cys Asp Gly Ser Val Val Ile
325 330 335
Lys Glu Pro Val His Val Tyr Ala Asp Asp Ser Ile Ile Leu Arg Gln
340 345 350
His Asn Leu Val Asp Ile Met Ser Cys Phe Tyr Met Glu Ala Asp Ala
355 360 365
Val Val Asn Ala Phe Tyr Gly Val Asp Leu Lys Asp Cys Gly Phe Val
370 375 380
Met Gln Phe Gly Tyr Ile Asp Cys Glu Gln Asp Leu Cys Asp Phe Lys
385 390 395 400
Gly Trp Val Pro Gly Asn Met Ile Asp Gly Phe Ala Cys Thr Thr Cys
405 410 415
Gly His Val Tyr Glu Thr Gly Asp Leu Leu Ala Gln Ser Ser Gly Val
420 425 430
Leu Pro Val Asn Pro Val Leu His Thr Lys Ser Ala Ala Gly Tyr Gly
435 440 445
Gly Phe Gly Cys Lys Asp Ser Phe Thr Leu Tyr Gly Gln Thr Val Val
450 455 460
Tyr Phe Gly Gly Cys Val Tyr Trp Ser Pro Ala Arg Asn Ile Trp Ile
465 470 475 480
Pro Ile Leu Lys Ser Ser Val Lys Ser Tyr Asp Gly Leu Val Tyr Thr
485 490 495
Gly Val Val Gly Cys Lys Ala Ile Val Lys Glu Thr Asn Leu Ile Cys
500 505 510
Lys Ala Leu Tyr Leu Asp Tyr Val Gln His Lys Cys Gly Asn Leu His
515 520 525
Gln Arg Glu Leu Leu Gly Val Ser Asp Val Trp His Lys Gln Leu Leu
530 535 540
Leu Asn Arg Gly Val Tyr Lys Pro Leu Leu Glu Asn Ile Asp Tyr Phe
545 550 555 560
Asn Met Arg Arg Ala Lys Phe Ser Leu Glu Thr Phe Thr Val Cys Ala
565 570 575
Asp Gly Phe Met Pro Phe Leu Leu Asp Asp Leu Val Pro Arg Ala Tyr
580 585 590
Tyr Leu Ala Val Ser Gly Gln Ala Phe Cys Asp Tyr Ala Gly Lys Ile
595 600 605
Cys His Ala Val Val Ser Lys Ser Lys Glu Leu Leu Asp Val Ser Val
610 615 620
Asp Ser Leu Gly Ala Ala Ile His Tyr Leu Asn Ser Lys Ile Val Asp
625 630 635 640
Leu Ala Gln His Phe Ser Asp Phe Gly Thr Ser Phe Val Ser Lys Ile
645 650 655
Val His Phe Phe Lys Thr Phe Thr Thr Ser Thr Ala Leu Ala Phe Ala
660 665 670
Trp Val Leu Phe His Val Leu His Gly Ala Tyr Ile Val Val Glu Ser
675 680 685
Asp Ile Tyr Phe Gly Lys Asn Ile Pro Arg Tyr Ala Ser Ala Val Ala
690 695 700
Gln Ala Phe Arg Ser Gly Ala Lys Val Gly Leu Asp Ser Leu Arg Val
705 710 715 720
Thr Phe Ile Asp Gly Leu Ser Cys Phe Lys Ile Gly Arg Arg Arg Ile
725 730 735
Cys Leu Ser Gly Ser Lys Ile Tyr Glu Val Glu Arg Gly Leu Leu His
740 745 750
Ser Ser Gln Leu Pro Leu Asp Val Tyr Asp Leu Thr Met Pro Ser Gln
755 760 765
Val Gln Lys Thr Lys Gln Lys Gly Ile Tyr Leu Lys Gly Ser Gly Ser
770 775 780
Asp Phe Ser Leu Ala Asp Ser Val Val Glu Val Val Thr Thr Ser Leu
785 790 795 800
Thr Pro Cys Gly Tyr Ser Glu Pro Pro Lys Val Ala Asp Lys Ile Cys
805 810 815
Ile Val Asp Asn Val Tyr Met Ala Lys Ala Gly Asp Lys Tyr Tyr Pro
820 825 830
Val Val Val Asp Gly His Val Gly Leu Leu Asp Gln Ala Trp Arg Val
835 840 845
Pro Cys Ala Gly Arg Cys Val Thr Phe Lys Glu Gln Pro Thr Val Asn
850 855 860
Glu Ile Ala Ser Thr Pro Lys Thr Ile Lys Val Phe Tyr Glu Leu Asp
865 870 875 880
Lys Asp Phe Asn Thr Ile Leu Asn Thr Ala Cys Gly Glu Phe Glu Val
885 890 895
Asp Asp Thr Val Asp Met Glu Glu Phe Tyr Ala Val Val Ile Asp Ala
900 905 910
Ile Glu Glu Lys Leu Ser Pro Cys Lys Glu Leu Glu Gly Val Gly Ala
915 920 925
Lys Val Ser Ala Phe Leu Gln Lys Leu Glu Asp Asn Ser Leu Phe Leu
930 935 940
Phe Asp Glu Ala Gly Glu Glu Val Leu Ala Pro Lys Leu Tyr Cys Ala
945 950 955 960
Phe Thr Ala Pro Glu Asp Asp Asp Phe Leu Glu Glu Ser Gly Val Glu
965 970 975
Glu Asp Asp Val Glu Gly Glu Glu Thr Asp Leu Thr Val Thr Ser Ala
980 985 990
Gly Glu Pro Cys Val Ala Ser Glu Gln Glu Glu Ser Ser Glu Ile Leu
995 1000 1005
Glu Asp Thr Leu Asp Asp Gly Pro Cys Val Glu Thr Ser Asp Ser
1010 1015 1020
Gln Val Glu Glu Asp Val Gln Met Ser Asp Phe Gly Asp Leu Glu
1025 1030 1035
Ser Val Ile Gln Asp Tyr Glu Asn Val Cys Phe Glu Phe Tyr Thr
1040 1045 1050
Thr Glu Pro Glu Phe Val Lys Val Leu Asp Leu Tyr Val Pro Lys
1055 1060 1065
Ala Thr Arg Asn Asn Cys Trp Leu Arg Ser Val Leu Ala Val Met
1070 1075 1080
Gln Lys Leu Pro Cys Gln Phe Lys Asp Lys Asn Leu Gln Asp Leu
1085 1090 1095
Trp Val Leu Tyr Lys Gln Gln Tyr Ser Gln Leu Phe Val Asp Thr
1100 1105 1110
Leu Val Asn Lys Ile Pro Ala Asn Ile Val Val Pro Gln Gly Gly
1115 1120 1125
Tyr Val Ala Asp Phe Ala Tyr Trp Phe Leu Thr Leu Cys Asp Trp
1130 1135 1140
Gln Cys Val Ala Tyr Trp Lys Cys Ile Lys Cys Asp Leu Ala Leu
1145 1150 1155
Lys Leu Lys Gly Leu Asp Ala Met Phe Phe Tyr Gly Asp Val Val
1160 1165 1170
Ser His Val Cys Lys Cys Gly Glu Ser Met Val Leu Ile Asp Val
1175 1180 1185
Asp Val Pro Phe Thr Ala His Phe Ala Leu Lys Asp Lys Leu Phe
1190 1195 1200
Cys Ala Phe Ile Thr Lys Arg Ser Val Tyr Lys Ala Ala Cys Val
1205 1210 1215
Val Asp Val Asn Asp Ser His Ser Met Ala Val Val Asp Gly Lys
1220 1225 1230
Gln Ile Asp Asp His Arg Ile Thr Ser Ile Thr Ser Asp Lys Phe
1235 1240 1245
Asp Phe Ile Ile Gly His Gly Thr Ser Phe Ser Met Thr Thr Phe
1250 1255 1260
Glu Ile Ala Gln Leu Tyr Gly Ser Cys Ile Thr Pro Asn Val Cys
1265 1270 1275
Phe Val Lys Gly Asp Ile Ile Lys Val Ser Lys Arg Val Lys Ala
1280 1285 1290
Glu Val Val Val Asn Pro Ala Asn Gly His Met Ala His Gly Gly
1295 1300 1305
Gly Val Ala Lys Ala Ile Ala Val Ala Ala Gly Gln Gln Phe Val
1310 1315 1320
Lys Glu Thr Thr Asp Met Val Lys Ser Lys Gly Val Cys Ala Thr
1325 1330 1335
Gly Asp Cys Tyr Val Ser Thr Gly Gly Lys Leu Cys Lys Thr Val
1340 1345 1350
Leu Asn Val Val Gly Pro Asp Ala Arg Thr Gln Gly Lys Gln Ser
1355 1360 1365
Tyr Ala Leu Leu Glu Arg Val Tyr Lys His Leu Asn Lys Tyr Asp
1370 1375 1380
Cys Val Val Thr Thr Leu Ile Ser Ala Gly Ile Phe Ser Val Pro
1385 1390 1395
Ser Asp Val Ser Leu Thr Tyr Leu Leu Gly Thr Ala Lys Lys Gln
1400 1405 1410
Val Val Leu Val Ser Asn Asn Gln Glu Asp Phe Asp Leu Ile Ser
1415 1420 1425
Lys Cys Gln Ile Thr Ala Val Glu Gly Thr Lys Lys Leu Ala Glu
1430 1435 1440
Arg Leu Ser Phe Asn Val Gly Arg Ser Ile Val Tyr Glu Thr Asp
1445 1450 1455
Ala Asn Lys Leu Ile Leu Ser Asn Asp Val Ala Phe Val Ser Thr
1460 1465 1470
Phe Asn Val Leu Gln Asp Val Leu Ser Leu Arg His Asp Ile Ala
1475 1480 1485
Leu Asp Asp Asp Ala Arg Thr Phe Val Gln Ser Asn Val Asp Val
1490 1495 1500
Val Pro Glu Gly Trp Arg Val Val Asn Lys Phe Tyr Gln Ile Asn
1505 1510 1515
Gly Val Arg Pro Val Lys Tyr Phe Glu Cys Pro Gly Gly Ile Asp
1520 1525 1530
Ile Cys Ser Gln Asp Lys Val Phe Gly Tyr Val Gln Gln Gly Ser
1535 1540 1545
Phe Asn Lys Ala Thr Val Ala Gln Ile Lys Ala Leu Phe Leu Asp
1550 1555 1560
Lys Val Asp Ile Leu Leu Thr Val Asp Gly Val Asn Phe Thr Asn
1565 1570 1575
Arg Phe Val Pro Val Gly Glu Ser Phe Gly Lys Ser Leu Gly Asn
1580 1585 1590
Val Phe Cys Asp Gly Val Asn Val Thr Lys His Lys Cys Asp Ile
1595 1600 1605
Asn Tyr Lys Gly Lys Val Phe Phe Gln Phe Asp Asn Leu Ser Ser
1610 1615 1620
Glu Asp Leu Lys Ala Val Arg Ser Ser Phe Asn Phe Asp Gln Lys
1625 1630 1635
Glu Leu Leu Ala Tyr Tyr Asn Met Leu Val Asn Cys Ser Lys Trp
1640 1645 1650
Gln Val Val Phe Asn Gly Lys Tyr Phe Thr Phe Lys Gln Ala Asn
1655 1660 1665
Asn Asn Cys Phe Val Asn Val Ser Cys Leu Met Leu Gln Ser Leu
1670 1675 1680
Asn Leu Lys Phe Lys Ile Val Gln Trp Gln Glu Ala Trp Leu Glu
1685 1690 1695
Phe Arg Ser Gly Arg Pro Ala Arg Phe Val Ser Leu Val Leu Ala
1700 1705 1710
Lys Gly Gly Phe Lys Phe Gly Asp Pro Ala Asp Ser Arg Asp Phe
1715 1720 1725
Leu Arg Val Val Phe Ser Gln Val Asp Leu Thr Gly Ala Ile Cys
1730 1735 1740
Asp Phe Glu Ile Ala Cys Lys Cys Gly Val Lys Gln Glu Gln Arg
1745 1750 1755
Thr Gly Val Asp Ala Val Met His Phe Gly Thr Leu Ser Arg Glu
1760 1765 1770
Asp Leu Glu Ile Gly Tyr Thr Val Asp Cys Ser Cys Gly Lys Lys
1775 1780 1785
Leu Ile His Cys Val Arg Phe Asp Val Pro Phe Leu Ile Cys Ser
1790 1795 1800
Asn Thr Pro Ala Ser Val Lys Leu Pro Lys Gly Val Gly Ser Ala
1805 1810 1815
Asn Ile Phe Lys Gly Asp Lys Val Gly His Tyr Val His Val Lys
1820 1825 1830
Cys Glu Gln Ser Tyr Gln Leu Tyr Asp Ala Ser Asn Val Lys Lys
1835 1840 1845
Val Thr Asp Val Thr Gly Asn Leu Ser Asp Cys Leu Tyr Leu Lys
1850 1855 1860
Asn Leu Lys Gln Thr Phe Lys Ser Val Leu Thr Thr Tyr Tyr Leu
1865 1870 1875
Asp Asp Val Lys Lys Ile Glu Tyr Lys Pro Asp Leu Ser Gln Tyr
1880 1885 1890
Tyr Cys Asp Gly Gly Lys Tyr Tyr Thr Gln Arg Ile Ile Lys Ala
1895 1900 1905
Gln Phe Lys Thr Phe Glu Lys Val Asp Gly Val Tyr Thr Asn Phe
1910 1915 1920
Lys Leu Ile Gly His Thr Val Cys Asp Ile Leu Asn Ala Lys Leu
1925 1930 1935
Gly Phe Asp Ser Ser Lys Glu Phe Val Glu Tyr Lys Val Thr Glu
1940 1945 1950
Trp Pro Thr Ala Thr Gly Asp Val Val Leu Ala Thr Asp Asp Leu
1955 1960 1965
Tyr Val Lys Arg Tyr Glu Arg Gly Cys Ile Thr Phe Gly Lys Pro
1970 1975 1980
Val Ile Trp Leu Ser His Glu Gln Ala Ser Leu Asn Ser Leu Thr
1985 1990 1995
Tyr Phe Asn Arg Pro Leu Leu Val Asp Glu Asn Lys Phe Asp Val
2000 2005 2010
Leu Lys Val Asp Asp Val Asp Asp Gly Gly Asp Ile Ser Glu Ser
2015 2020 2025
Asp Ala Lys Glu Pro Lys Glu Ile Asn Ile Ile Lys Leu Ser Gly
2030 2035 2040
Val Lys Lys Pro Phe Lys Val Glu Asp Ser Val Ile Val Asn Asp
2045 2050 2055
Asp Thr Ser Glu Ile Lys Tyr Val Lys Ser Leu Ser Ile Val Asp
2060 2065 2070
Val Tyr Asp Met Trp Leu Thr Gly Cys Arg Cys Val Val Arg Thr
2075 2080 2085
Ala Asn Ala Leu Ser Arg Ala Val Asn Val Pro Thr Ile Arg Lys
2090 2095 2100
Phe Ile Lys Phe Gly Met Thr Leu Val Ser Ile Pro Ile Asp Leu
2105 2110 2115
Leu Asn Leu Arg Glu Ile Lys Pro Val Phe Asn Val Val Lys Ala
2120 2125 2130
Val Arg Asn Lys Ile Ser Ala Cys Phe Asn Phe Ile Lys Trp Leu
2135 2140 2145
Phe Val Leu Leu Phe Gly Trp Ile Lys Ile Ser Ala Asp Asn Lys
2150 2155 2160
Val Ile Tyr Thr Thr Glu Val Ala Ser Lys Leu Thr Cys Lys Leu
2165 2170 2175
Val Ala Leu Ala Phe Lys Asn Ala Phe Leu Thr Phe Lys Trp Ser
2180 2185 2190
Val Val Ala Arg Gly Ala Cys Ile Ile Ala Thr Ile Phe Leu Leu
2195 2200 2205
Trp Phe Asn Phe Ile Tyr Ala Asn Val Ile Phe Ser Asp Phe Tyr
2210 2215 2220
Leu Pro Lys Ile Gly Phe Leu Pro Thr Phe Val Gly Lys Ile Val
2225 2230 2235
Gln Trp Ile Lys Asn Thr Phe Ser Leu Val Thr Ile Cys Asp Leu
2240 2245 2250
Tyr Ser Ile Gln Asp Val Gly Phe Lys Asn Gln Tyr Cys Asn Gly
2255 2260 2265
Ser Ile Ala Cys Gln Phe Cys Leu Ala Gly Phe Asp Met Leu Asp
2270 2275 2280
Asn Tyr Lys Ala Ile Asp Val Val Gln Tyr Glu Ala Asp Arg Arg
2285 2290 2295
Ala Phe Val Asp Tyr Thr Gly Val Leu Lys Ile Val Ile Glu Leu
2300 2305 2310
Ile Val Ser Tyr Ala Leu Tyr Thr Ala Trp Phe Tyr Pro Leu Phe
2315 2320 2325
Ala Leu Ile Ser Ile Gln Ile Leu Thr Thr Trp Leu Pro Glu Leu
2330 2335 2340
Leu Met Leu Ser Thr Leu His Trp Ser Val Arg Leu Leu Val Ser
2345 2350 2355
Leu Ala Asn Met Leu Pro Ala His Val Phe Met Arg Phe Tyr Ile
2360 2365 2370
Ile Ile Ala Ser Phe Ile Lys Leu Phe Ser Leu Phe Arg His Val
2375 2380 2385
Ala Tyr Gly Cys Ser Lys Ser Gly Cys Leu Phe Cys Tyr Lys Arg
2390 2395 2400
Asn Arg Ser Leu Arg Val Lys Cys Ser Thr Ile Val Gly Gly Met
2405 2410 2415
Ile Arg Tyr Tyr Asp Val Met Ala Asn Gly Gly Thr Gly Phe Cys
2420 2425 2430
Ser Lys His Gln Trp Asn Cys Ile Asp Cys Asp Ser Tyr Lys Pro
2435 2440 2445
Gly Asn Thr Phe Ile Thr Val Glu Ala Ala Leu Asp Leu Ser Lys
2450 2455 2460
Glu Leu Lys Arg Pro Ile Gln Pro Thr Asp Val Ala Tyr His Thr
2465 2470 2475
Val Thr Asp Val Lys Gln Val Gly Cys Tyr Met Arg Leu Phe Tyr
2480 2485 2490
Asp Arg Asp Gly Gln Arg Thr Tyr Asp Asp Val Asn Ala Ser Leu
2495 2500 2505
Phe Val Asp Tyr Ser Asn Leu Leu His Ser Lys Val Lys Ser Val
2510 2515 2520
Pro Asn Met His Val Val Val Val Glu Asn Asp Ala Asp Lys Ala
2525 2530 2535
Asn Phe Leu Asn Ala Ala Val Phe Tyr Ala Gln Ser Leu Phe Arg
2540 2545 2550
Pro Ile Leu Met Val Asp Lys Ile Leu Ile Thr Thr Ala Asn Thr
2555 2560 2565
Gly Thr Ser Val Thr Glu Thr Met Phe Asp Val Tyr Val Asp Thr
2570 2575 2580
Phe Leu Ser Met Phe Asp Val Asp Lys Lys Ser Leu Asn Ala Leu
2585 2590 2595
Ile Ala Thr Ala His Ser Ser Ile Lys Gln Gly Thr Gln Ile Cys
2600 2605 2610
Lys Val Leu Asp Thr Phe Leu Ser Cys Ala Arg Lys Ser Cys Ser
2615 2620 2625
Ile Asp Ser Asp Val Asp Thr Lys Cys Leu Ala Asp Ser Val Met
2630 2635 2640
Ser Ala Val Ser Ala Gly Leu Glu Leu Thr Asp Glu Ser Cys Asn
2645 2650 2655
Asn Leu Val Pro Thr Tyr Leu Lys Gly Asp Asn Ile Val Ala Ala
2660 2665 2670
Asp Leu Gly Val Leu Ile Gln Asn Ser Ala Lys His Val Gln Gly
2675 2680 2685
Asn Val Ala Lys Ile Ala Gly Val Ser Cys Ile Trp Ser Val Asp
2690 2695 2700
Ala Phe Asn Gln Leu Ser Ser Asp Phe Gln His Lys Leu Lys Lys
2705 2710 2715
Ala Cys Cys Lys Thr Gly Leu Lys Leu Glu Leu Thr Tyr Asn Lys
2720 2725 2730
Gln Met Ala Asn Val Ser Val Leu Thr Thr Pro Phe Ser Leu Lys
2735 2740 2745
Gly Gly Ala Val Phe Ser Tyr Phe Val Tyr Val Cys Phe Val Leu
2750 2755 2760
Ser Leu Val Cys Phe Ile Gly Leu Trp Cys Leu Met Pro Thr Tyr
2765 2770 2775
Thr Val His Lys Ser Asp Phe Gln Leu Pro Val Tyr Ala Ser Tyr
2780 2785 2790
Lys Val Leu Asp Asn Gly Val Ile Arg Asp Val Ser Val Glu Asp
2795 2800 2805
Val Cys Phe Ala Asn Lys Phe Glu Gln Phe Asp Gln Trp Tyr Glu
2810 2815 2820
Ser Thr Phe Gly Leu Ser Tyr Tyr Ser Asn Ser Met Ala Cys Pro
2825 2830 2835
Ile Val Val Ala Val Val Asp Gln Asp Phe Gly Ser Thr Val Phe
2840 2845 2850
Asn Val Pro Thr Lys Val Leu Arg Tyr Gly Tyr His Val Leu His
2855 2860 2865
Phe Ile Thr His Ala Leu Ser Ala Asp Gly Val Gln Cys Tyr Thr
2870 2875 2880
Pro His Ser Gln Ile Ser Tyr Ser Asn Phe Tyr Ala Ser Gly Cys
2885 2890 2895
Val Leu Ser Ser Ala Cys Thr Met Phe Ala Met Ala Asp Gly Ser
2900 2905 2910
Pro Gln Pro Tyr Cys Tyr Thr Asp Gly Leu Met Gln Asn Ala Ser
2915 2920 2925
Leu Tyr Ser Ser Leu Val Pro His Val Arg Tyr Asn Leu Ala Asn
2930 2935 2940
Ala Lys Gly Phe Ile Arg Leu Pro Glu Val Leu Arg Glu Gly Leu
2945 2950 2955
Val Arg Ile Val Arg Thr Arg Ser Met Ser Tyr Cys Arg Val Gly
2960 2965 2970
Leu Cys Glu Glu Ala Asp Glu Gly Ile Cys Phe Asn Phe Asn Gly
2975 2980 2985
Ser Trp Val Leu Asn Asn Asp Tyr Tyr Arg Ser Leu Pro Gly Thr
2990 2995 3000
Phe Cys Gly Arg Asp Val Phe Asp Leu Ile Tyr Gln Leu Phe Lys
3005 3010 3015
Gly Leu Ala Gln Pro Val Asp Phe Leu Ala Leu Thr Ala Ser Ser
3020 3025 3030
Ile Ala Gly Ala Ile Leu Ala Val Ile Val Val Leu Gly Phe Tyr
3035 3040 3045
Tyr Leu Ile Lys Leu Lys Arg Ala Phe Gly Asp Tyr Thr Ser Ile
3050 3055 3060
Val Phe Val Asn Val Ile Val Trp Cys Val Asn Phe Met Met Leu
3065 3070 3075
Phe Val Phe Gln Val Tyr Pro Thr Leu Ser Cys Val Tyr Ala Ile
3080 3085 3090
Cys Tyr Phe Tyr Ala Thr Leu Tyr Phe Pro Ser Glu Ile Ser Val
3095 3100 3105
Ile Met His Leu Gln Trp Leu Val Met Tyr Gly Thr Ile Met Pro
3110 3115 3120
Leu Trp Phe Cys Leu Leu Tyr Ile Ser Val Val Val Ser Asn His
3125 3130 3135
Ala Phe Trp Val Phe Ser Tyr Cys Arg Gln Leu Gly Thr Ser Val
3140 3145 3150
Arg Ser Asp Gly Thr Phe Glu Glu Met Ala Leu Thr Thr Phe Met
3155 3160 3165
Ile Thr Lys Asp Ser Tyr Cys Lys Leu Lys Asn Ser Leu Ser Asp
3170 3175 3180
Val Ala Phe Asn Arg Tyr Leu Ser Leu Tyr Asn Lys Tyr Arg Tyr
3185 3190 3195
Tyr Ser Gly Lys Met Asp Thr Ala Ala Tyr Arg Glu Ala Ala Cys
3200 3205 3210
Ser Gln Leu Ala Lys Ala Met Asp Thr Phe Thr Asn Asn Asn Gly
3215 3220 3225
Ser Asp Val Leu Tyr Gln Pro Pro Thr Ala Ser Val Ser Thr Ser
3230 3235 3240
Phe Leu Gln Ser Gly Ile Val Lys Met Val Asn Pro Thr Ser Lys
3245 3250 3255
Val Glu Pro Cys Ile Val Ser Val Thr Tyr Gly Asn Met Thr Leu
3260 3265 3270
Asn Gly Leu Trp Leu Asp Asp Lys Val Tyr Cys Pro Arg His Val
3275 3280 3285
Ile Cys Ser Ala Ser Asp Met Thr Asn Pro Asp Tyr Thr Asn Leu
3290 3295 3300
Leu Cys Arg Val Thr Ser Ser Asp Phe Thr Val Leu Phe Asp Arg
3305 3310 3315
Leu Ser Leu Thr Val Met Ser Tyr Gln Met Gln Gly Cys Met Leu
3320 3325 3330
Val Leu Thr Val Thr Leu Gln Asn Ser Arg Thr Pro Lys Tyr Thr
3335 3340 3345
Phe Gly Val Val Lys Pro Gly Glu Thr Phe Thr Val Leu Ala Ala
3350 3355 3360
Tyr Asn Gly Lys Pro Gln Gly Ala Phe His Val Thr Met Arg Ser
3365 3370 3375
Ser Tyr Thr Ile Lys Gly Ser Phe Leu Cys Gly Ser Cys Gly Ser
3380 3385 3390
Val Gly Tyr Val Ile Met Gly Asp Cys Val Lys Phe Val Tyr Met
3395 3400 3405
His Gln Leu Glu Leu Ser Thr Gly Cys His Thr Gly Thr Asp Phe
3410 3415 3420
Asn Gly Asp Phe Tyr Gly Pro Tyr Lys Asp Ala Gln Val Val Gln
3425 3430 3435
Leu Pro Val Gln Asp Tyr Ile Gln Ser Val Asn Phe Val Ala Trp
3440 3445 3450
Leu Tyr Ala Ala Ile Leu Asn Asn Cys Asn Trp Phe Val Gln Ser
3455 3460 3465
Asp Lys Cys Ser Val Glu Asp Phe Asn Val Trp Ala Leu Ser Asn
3470 3475 3480
Gly Phe Ser Gln Val Lys Ser Asp Leu Val Ile Asp Ala Leu Ala
3485 3490 3495
Ser Met Thr Gly Val Ser Leu Glu Thr Leu Leu Ala Ala Ile Lys
3500 3505 3510
Arg Leu Lys Asn Gly Phe Gln Gly Arg Gln Ile Met Gly Ser Cys
3515 3520 3525
Ser Phe Glu Asp Glu Leu Thr Pro Ser Asp Val Tyr Gln Gln Leu
3530 3535 3540
Ala Gly Ile Lys Leu Gln Ser Lys Arg Thr Arg Leu Val Lys Gly
3545 3550 3555
Ile Val Cys Trp Ile Met Ala Ser Thr Phe Leu Phe Ser Cys Ile
3560 3565 3570
Ile Thr Ala Phe Val Lys Trp Thr Met Phe Met Tyr Val Thr Thr
3575 3580 3585
Asn Met Leu Ser Ile Thr Phe Cys Ala Leu Cys Val Ile Ser Leu
3590 3595 3600
Ala Met Leu Leu Val Lys His Lys His Leu Tyr Leu Thr Met Tyr
3605 3610 3615
Ile Ile Pro Val Leu Phe Thr Leu Leu Tyr Asn Asn Tyr Leu Val
3620 3625 3630
Val Tyr Lys Gln Thr Phe Arg Gly Tyr Val Tyr Ala Trp Leu Ser
3635 3640 3645
Tyr Tyr Val Pro Ser Val Glu Tyr Thr Tyr Thr Asp Glu Val Ile
3650 3655 3660
Tyr Gly Met Leu Leu Leu Ile Gly Met Val Phe Val Thr Leu Arg
3665 3670 3675
Ser Ile Asn His Asp Leu Phe Ser Phe Ile Met Phe Val Gly Arg
3680 3685 3690
Val Ile Ser Val Val Ser Leu Trp Tyr Met Gly Ser Asn Leu Glu
3695 3700 3705
Glu Glu Ile Leu Leu Met Leu Ala Ser Leu Phe Gly Thr Tyr Thr
3710 3715 3720
Trp Thr Thr Ala Leu Ser Met Ala Ala Ala Lys Val Ile Ala Lys
3725 3730 3735
Trp Val Ala Val Asn Val Leu Tyr Phe Thr Asp Ile Pro Gln Ile
3740 3745 3750
Lys Ile Val Leu Val Cys Tyr Leu Phe Ile Gly Tyr Ile Ile Ser
3755 3760 3765
Cys Tyr Trp Gly Leu Phe Ser Leu Met Asn Ser Leu Phe Arg Met
3770 3775 3780
Pro Leu Gly Val Tyr Asn Tyr Lys Ile Ser Val Gln Glu Leu Arg
3785 3790 3795
Tyr Met Asn Ala Asn Gly Leu Arg Pro Pro Lys Asn Ser Phe Glu
3800 3805 3810
Ala Leu Met Leu Asn Phe Lys Leu Leu Gly Ile Gly Gly Val Pro
3815 3820 3825
Ile Ile Glu Val Ser Gln Phe Gln Ser Lys Leu Thr Asp Val Lys
3830 3835 3840
Cys Ala Asn Gly Gly Leu Leu Asn Cys Leu Gln His Leu His Val
3845 3850 3855
Ala Ser Asn Ser Lys Leu Trp Gln Tyr Cys Ser Thr Leu His Asn
3860 3865 3870
Glu Ile Leu Ala Thr Ser Asp Leu Gly Val Ala Phe Glu Lys Leu
3875 3880 3885
Ala Gln Leu Leu Ile Val Leu Phe Ala Asn Pro Ala Ala Val Asp
3890 3895 3900
Ser Lys Cys Leu Thr Ser Ile Glu Glu Val Cys Asp Asp Tyr Ala
3905 3910 3915
Lys Asp Asn Thr Val Leu Gln Ala Leu Gln Ser Glu Phe Val Asn
3920 3925 3930
Met Ala Ser Phe Val Glu Tyr Glu Val Ala Lys Lys Asn Leu Asp
3935 3940 3945
Glu Ala Cys Ser Ser Gly Ser Ala Asn Gln Gln Gln Leu Lys Gln
3950 3955 3960
Leu Glu Lys Ala Cys Asn Ile Ala Lys Ser Ala Tyr Glu Arg Asp
3965 3970 3975
Arg Ala Val Ala Arg Lys Leu Glu Arg Met Ala Asp Leu Ala Leu
3980 3985 3990
Thr Asn Met Tyr Lys Glu Ala Arg Ile Asn Asp Lys Lys Ser Lys
3995 4000 4005
Val Val Ser Ala Leu Gln Thr Met Leu Phe Ser Met Val Arg Lys
4010 4015 4020
Leu Asp Asn Gln Ala Leu Asn Ser Ile Leu Asp Asn Ala Val Lys
4025 4030 4035
Gly Cys Val Pro Leu Asn Ala Ile Pro Ser Leu Ala Ala Asn Thr
4040 4045 4050
Leu Thr Ile Ile Val Pro Asp Lys Ser Val Tyr Asp Gln Val Val
4055 4060 4065
Asp Asn Val Tyr Val Thr Tyr Ala Gly Asn Val Trp Gln Ile Gln
4070 4075 4080
Thr Ile Gln Asp Ser Asp Gly Thr Asn Lys Gln Leu His Glu Ile
4085 4090 4095
Ser Asp Asp Cys Asn Trp Pro Leu Val Ile Ile Ala Asn Arg His
4100 4105 4110
Asn Glu Val Ser Ala Thr Val Leu Gln Asn Asn Glu Leu Met Pro
4115 4120 4125
Ala Lys Leu Lys Thr Gln Val Val Asn Ser Gly Pro Asp Gln Thr
4130 4135 4140
Cys Asn Thr Pro Thr Gln Cys Tyr Tyr Asn Asn Ser Tyr Asn Gly
4145 4150 4155
Lys Ile Val Tyr Ala Ile Leu Ser Asp Val Asp Gly Leu Lys Tyr
4160 4165 4170
Thr Lys Ile Leu Lys Asp Asp Gly Asn Phe Val Val Leu Glu Leu
4175 4180 4185
Asp Pro Pro Cys Lys Phe Thr Val Gln Asp Val Lys Gly Leu Lys
4190 4195 4200
Ile Lys Tyr Leu Tyr Phe Val Lys Gly Cys Asn Thr Leu Ala Arg
4205 4210 4215
Gly Trp Val Val Gly Thr Ile Ser Ser Thr Val Arg Leu Gln Ala
4220 4225 4230
Gly Thr Ala Thr Glu Tyr Ala Ser Asn Ser Ser Ile Leu Ser Leu
4235 4240 4245
Cys Ala Phe Ser Val Asp Pro Lys Lys Thr Tyr Leu Asp Phe Ile
4250 4255 4260
Gln Gln Gly Gly Thr Pro Ile Ala Asn Cys Val Lys Met Leu Cys
4265 4270 4275
Asp His Ala Gly Thr Gly Met Ala Ile Thr Val Lys Pro Asp Ala
4280 4285 4290
Thr Thr Ser Gln Asp Ser Tyr Gly Gly Ala Ser Val Cys Ile Tyr
4295 4300 4305
Cys Arg Ala Arg Val Glu His Pro Asp Val Asp Gly Leu Cys Lys
4310 4315 4320
Leu Arg Gly Lys Phe Val Gln Val Pro Val Gly Ile Lys Asp Pro
4325 4330 4335
Val Ser Tyr Val Leu Thr His Asp Val Cys Gln Val Cys Gly Phe
4340 4345 4350
Trp Arg Asp Gly Ser Cys Ser Cys Val Ser Thr Asp Thr Thr Val
4355 4360 4365
Gln Ser Lys Asp Thr Asn Phe Leu Asn Gly Phe Gly Val Arg Val
4370 4375 4380
<210> 20
<211> 6685
<212> PRT
<213> transmissable gastroenteritus virus
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<400> 20
Met Ser Ser Lys Gln Phe Lys Ile Leu Val Asn Glu Asp Tyr Gln Val
1 5 10 15
Asn Val Pro Ser Leu Pro Ile Arg Asp Val Leu Gln Glu Ile Lys Tyr
20 25 30
Cys Tyr Arg Asn Gly Phe Glu Gly Tyr Val Phe Val Pro Glu Tyr Cys
35 40 45
Arg Asp Leu Val Asp Cys Asp Arg Lys Asp His Tyr Val Ile Gly Val
50 55 60
Leu Gly Asn Gly Val Ser Asp Leu Lys Pro Val Leu Leu Thr Glu Pro
65 70 75 80
Ser Val Met Leu Gln Gly Phe Ile Val Arg Ala Asn Cys Asn Gly Val
85 90 95
Leu Glu Asp Phe Asp Leu Lys Ile Ala Arg Thr Gly Arg Gly Ala Ile
100 105 110
Tyr Val Asp Gln Tyr Met Cys Gly Ala Asp Gly Lys Pro Val Ile Glu
115 120 125
Gly Asp Phe Lys Asp Tyr Phe Gly Asp Glu Asp Ile Ile Glu Phe Glu
130 135 140
Gly Glu Glu Tyr His Cys Ala Trp Thr Thr Val Arg Asp Glu Lys Pro
145 150 155 160
Leu Asn Gln Gln Thr Leu Phe Thr Ile Gln Glu Ile Gln Tyr Asn Leu
165 170 175
Asp Ile Pro His Lys Leu Pro Asn Cys Ala Thr Arg His Val Ala Pro
180 185 190
Pro Val Lys Lys Asn Ser Lys Ile Val Leu Ser Glu Asp Tyr Lys Lys
195 200 205
Leu Tyr Asp Ile Phe Gly Ser Pro Phe Met Gly Asn Gly Asp Cys Leu
210 215 220
Ser Lys Cys Phe Asp Thr Leu His Phe Ile Ala Ala Thr Leu Arg Cys
225 230 235 240
Pro Cys Gly Ser Glu Ser Ser Gly Val Gly Asp Trp Thr Gly Phe Lys
245 250 255
Thr Ala Cys Cys Gly Leu Ser Gly Lys Val Lys Gly Val Thr Leu Gly
260 265 270
Asp Ile Lys Pro Gly Asp Ala Val Val Thr Ser Met Ser Ala Gly Lys
275 280 285
Gly Val Lys Phe Phe Ala Asn Cys Val Leu Gln Tyr Ala Gly Asp Val
290 295 300
Glu Gly Val Ser Ile Trp Lys Val Ile Lys Thr Phe Thr Val Asp Glu
305 310 315 320
Thr Val Cys Thr Pro Gly Phe Glu Gly Glu Leu Asn Asp Phe Ile Lys
325 330 335
Pro Glu Ser Lys Ser Leu Val Ala Cys Ser Val Lys Arg Ala Phe Ile
340 345 350
Thr Gly Asp Ile Asp Asp Ala Val His Asp Cys Ile Ile Thr Gly Lys
355 360 365
Leu Asp Leu Ser Thr Asn Leu Phe Gly Asn Val Gly Leu Leu Phe Lys
370 375 380
Lys Thr Pro Trp Phe Val Gln Lys Cys Gly Ala Leu Phe Val Asp Ala
385 390 395 400
Trp Lys Val Val Glu Glu Leu Cys Gly Ser Leu Thr Leu Thr Tyr Lys
405 410 415
Gln Ile Tyr Glu Val Val Ala Ser Leu Cys Thr Ser Ala Phe Thr Ile
420 425 430
Val Asn Tyr Lys Pro Thr Phe Val Val Pro Asp Asn Arg Val Lys Asp
435 440 445
Leu Val Asp Lys Cys Val Lys Val Leu Val Lys Ala Phe Asp Val Phe
450 455 460
Thr Gln Ile Ile Thr Ile Ala Gly Ile Glu Ala Lys Cys Phe Val Leu
465 470 475 480
Gly Ala Lys Tyr Leu Leu Phe Asn Asn Ala Leu Val Lys Leu Val Ser
485 490 495
Val Lys Ile Leu Gly Lys Lys Gln Lys Gly Leu Glu Cys Ala Phe Phe
500 505 510
Ala Thr Ser Leu Val Gly Ala Thr Val Asn Val Thr Pro Lys Arg Thr
515 520 525
Glu Thr Ala Thr Ile Ser Leu Asn Lys Val Asp Asp Val Val Ala Pro
530 535 540
Gly Glu Gly Tyr Ile Val Ile Val Gly Asp Met Ala Phe Tyr Lys Ser
545 550 555 560
Gly Glu Tyr Tyr Phe Met Met Ser Ser Pro Asn Phe Val Leu Thr Asn
565 570 575
Asn Val Phe Lys Ala Val Lys Val Pro Ser Tyr Asp Ile Val Tyr Asp
580 585 590
Val Asp Asn Asp Thr Lys Ser Lys Met Ile Ala Lys Leu Gly Ser Ser
595 600 605
Phe Glu Tyr Asp Gly Asp Ile Asp Ala Ala Ile Val Lys Val Asn Glu
610 615 620
Leu Leu Ile Glu Phe Arg Gln Gln Ser Leu Cys Phe Arg Ala Phe Lys
625 630 635 640
Asp Asp Lys Ser Ile Phe Val Glu Ala Tyr Phe Lys Lys Tyr Lys Met
645 650 655
Pro Ala Cys Leu Ala Lys His Ile Gly Leu Trp Asn Ile Ile Lys Lys
660 665 670
Asp Ser Cys Lys Arg Gly Phe Leu Asn Leu Phe Asn His Leu Asn Glu
675 680 685
Leu Glu Asp Ile Lys Glu Thr Asn Ile Gln Ala Ile Lys Asn Ile Leu
690 695 700
Cys Pro Asp Pro Leu Leu Asp Leu Asp Tyr Gly Ala Ile Trp Tyr Asn
705 710 715 720
Cys Met Pro Gly Cys Ser Asp Pro Ser Val Leu Gly Ser Val Gln Leu
725 730 735
Leu Ile Gly Asn Gly Val Lys Val Val Cys Asp Gly Cys Lys Gly Phe
740 745 750
Ala Asn Gln Leu Ser Lys Gly Tyr Asn Lys Leu Cys Asn Ala Ala Arg
755 760 765
Asn Asp Ile Glu Ile Gly Gly Ile Pro Phe Ser Thr Phe Lys Thr Pro
770 775 780
Thr Asn Thr Phe Ile Glu Met Thr Asp Ala Ile Tyr Ser Val Ile Glu
785 790 795 800
Gln Gly Lys Ala Leu Ser Phe Arg Asp Ala Asp Val Pro Val Val Asp
805 810 815
Asn Gly Thr Ile Ser Thr Ala Asp Trp Ser Glu Pro Ile Leu Leu Glu
820 825 830
Pro Ala Glu Tyr Val Lys Pro Lys Asn Asn Gly Asn Val Ile Val Ile
835 840 845
Ala Gly Tyr Thr Phe Tyr Lys Asp Glu Asp Glu His Phe Tyr Pro Tyr
850 855 860
Gly Phe Gly Lys Ile Val Gln Arg Met Tyr Asn Lys Met Gly Gly Gly
865 870 875 880
Asp Lys Thr Val Ser Phe Ser Glu Glu Val Asp Val Gln Glu Ile Ala
885 890 895
Pro Val Thr Arg Val Lys Leu Glu Phe Glu Phe Asp Asn Glu Ile Val
900 905 910
Thr Gly Val Leu Glu Arg Ala Ile Gly Thr Arg Tyr Lys Phe Thr Gly
915 920 925
Thr Thr Trp Glu Glu Phe Glu Glu Ser Ile Ser Glu Glu Leu Asp Ala
930 935 940
Ile Phe Asp Thr Leu Ala Asn Gln Gly Val Glu Leu Glu Gly Tyr Phe
945 950 955 960
Ile Tyr Asp Thr Cys Gly Gly Phe Asp Ile Lys Asn Pro Asp Gly Ile
965 970 975
Met Ile Ser Gln Tyr Asp Ile Asn Ile Thr Ala Asp Glu Lys Ser Glu
980 985 990
Val Ser Ala Ser Ser Glu Glu Glu Glu Val Glu Ser Val Glu Glu Asp
995 1000 1005
Pro Glu Asn Glu Ile Val Glu Ala Ser Glu Gly Ala Glu Gly Thr
1010 1015 1020
Ser Ser Gln Glu Glu Val Glu Thr Val Glu Val Ala Asp Ile Thr
1025 1030 1035
Ser Thr Glu Glu Asp Val Asp Ile Val Glu Val Ser Ala Lys Asp
1040 1045 1050
Asp Pro Trp Ala Ala Ala Val Asp Val Gln Glu Ala Glu Gln Phe
1055 1060 1065
Asn Pro Ser Leu Pro Pro Phe Lys Thr Thr Asn Leu Asn Gly Lys
1070 1075 1080
Ile Ile Leu Lys Gln Gly Asp Asn Asn Cys Trp Ile Asn Ala Cys
1085 1090 1095
Cys Tyr Gln Leu Gln Ala Phe Asp Phe Phe Asn Asn Glu Ala Trp
1100 1105 1110
Glu Lys Phe Lys Lys Gly Asp Val Met Asp Phe Val Asn Leu Cys
1115 1120 1125
Tyr Ala Ala Thr Thr Leu Ala Arg Gly His Ser Gly Asp Ala Glu
1130 1135 1140
Tyr Leu Leu Glu Leu Met Leu Asn Asp Tyr Ser Thr Ala Lys Ile
1145 1150 1155
Val Leu Ala Ala Lys Cys Gly Cys Gly Glu Lys Glu Ile Val Leu
1160 1165 1170
Glu Arg Ala Val Phe Lys Leu Thr Pro Leu Lys Glu Ser Phe Asn
1175 1180 1185
Tyr Gly Val Cys Gly Asp Cys Met Gln Val Asn Thr Cys Arg Phe
1190 1195 1200
Leu Ser Val Glu Gly Ser Gly Val Phe Val His Asp Ile Leu Ser
1205 1210 1215
Lys Gln Thr Pro Glu Ala Met Phe Val Val Lys Pro Val Met His
1220 1225 1230
Ala Val Tyr Thr Gly Thr Thr Gln Asn Gly His Tyr Met Val Asp
1235 1240 1245
Asp Ile Glu His Gly Tyr Cys Val Asp Gly Met Gly Ile Lys Pro
1250 1255 1260
Leu Lys Lys Arg Cys Tyr Thr Ser Thr Leu Phe Ile Asn Ala Asn
1265 1270 1275
Val Met Thr Arg Ala Glu Lys Pro Lys Gln Glu Phe Lys Val Glu
1280 1285 1290
Lys Val Glu Gln Gln Pro Ile Val Glu Glu Asn Lys Ser Ser Ile
1295 1300 1305
Glu Lys Glu Glu Ile Gln Ser Pro Lys Asn Asp Asp Leu Ile Leu
1310 1315 1320
Pro Phe Tyr Lys Ala Gly Lys Leu Ser Phe Tyr Gln Gly Ala Leu
1325 1330 1335
Asp Val Leu Ile Asn Phe Leu Glu Pro Asp Val Ile Val Asn Ala
1340 1345 1350
Ala Asn Gly Asp Leu Lys His Met Gly Gly Val Ala Arg Ala Ile
1355 1360 1365
Asp Val Phe Thr Gly Gly Lys Leu Thr Glu Arg Ser Lys Asp Tyr
1370 1375 1380
Leu Lys Lys Asn Lys Ser Ile Ala Pro Gly Asn Ala Val Phe Phe
1385 1390 1395
Glu Asn Val Ile Glu His Leu Ser Val Leu Asn Ala Val Gly Pro
1400 1405 1410
Arg Asn Gly Asp Ser Arg Val Glu Ala Lys Leu Cys Asn Val Tyr
1415 1420 1425
Lys Ala Ile Ala Lys Cys Glu Gly Lys Ile Leu Thr Pro Leu Ile
1430 1435 1440
Ser Val Gly Ile Phe Asn Val Arg Leu Glu Thr Ser Leu Gln Cys
1445 1450 1455
Leu Leu Lys Thr Val Asn Asp Arg Gly Leu Asn Val Phe Val Tyr
1460 1465 1470
Thr Asp Gln Glu Arg Gln Thr Ile Glu Asn Phe Phe Ser Cys Ser
1475 1480 1485
Ile Pro Val Asn Val Thr Glu Asp Asn Val Asn His Glu Arg Val
1490 1495 1500
Ser Val Ser Phe Asp Lys Thr Tyr Gly Glu Gln Leu Lys Gly Thr
1505 1510 1515
Val Val Ile Lys Asp Lys Asp Val Thr Asn Gln Leu Pro Ser Ala
1520 1525 1530
Phe Asp Val Gly Gln Lys Val Ile Lys Ala Ile Asp Ile Asp Trp
1535 1540 1545
Gln Ala His Tyr Gly Phe Arg Asp Ala Ala Ala Phe Ser Ala Ser
1550 1555 1560
Ser His Asp Ala Tyr Lys Phe Glu Val Val Thr His Ser Asn Phe
1565 1570 1575
Ile Val His Lys Gln Thr Asp Asn Asn Cys Trp Ile Asn Ala Ile
1580 1585 1590
Cys Leu Ala Leu Gln Arg Leu Lys Pro Gln Trp Lys Phe Pro Gly
1595 1600 1605
Val Arg Gly Leu Trp Asn Glu Phe Leu Glu Arg Lys Thr Gln Gly
1610 1615 1620
Phe Val His Met Leu Tyr His Ile Ser Gly Val Lys Lys Gly Glu
1625 1630 1635
Pro Gly Asp Ala Glu Leu Met Leu His Lys Leu Gly Asp Leu Met
1640 1645 1650
Asp Asn Asp Cys Glu Ile Ile Val Thr His Thr Thr Ala Cys Asp
1655 1660 1665
Lys Cys Ala Lys Val Glu Lys Phe Val Gly Pro Val Val Ala Ala
1670 1675 1680
Pro Leu Ala Ile His Gly Thr Asp Glu Thr Cys Val His Gly Val
1685 1690 1695
Ser Val Asn Val Lys Val Thr Gln Ile Lys Gly Thr Val Ala Ile
1700 1705 1710
Thr Ser Leu Ile Gly Pro Ile Ile Gly Glu Val Leu Glu Ala Thr
1715 1720 1725
Gly Tyr Ile Cys Tyr Ser Gly Ser Asn Arg Asn Gly His Tyr Thr
1730 1735 1740
Tyr Tyr Asp Asn Arg Asn Gly Leu Val Val Asp Ala Glu Lys Ala
1745 1750 1755
Tyr His Phe Asn Arg Asp Leu Leu Gln Val Thr Thr Ala Ile Ala
1760 1765 1770
Ser Asn Phe Val Val Lys Lys Pro Gln Ala Glu Glu Arg Pro Lys
1775 1780 1785
Asn Cys Ala Phe Asn Lys Val Ala Ala Ser Pro Lys Ile Val Gln
1790 1795 1800
Glu Gln Lys Leu Leu Ala Ile Glu Ser Gly Ala Asn Tyr Ala Leu
1805 1810 1815
Thr Glu Phe Gly Arg Tyr Ala Asp Met Phe Phe Met Ala Gly Asp
1820 1825 1830
Lys Ile Leu Arg Leu Leu Leu Glu Val Phe Lys Tyr Leu Leu Val
1835 1840 1845
Leu Phe Met Cys Leu Arg Ser Thr Lys Met Pro Lys Val Lys Val
1850 1855 1860
Lys Pro Pro Leu Ala Phe Lys Asp Phe Gly Ala Lys Val Arg Thr
1865 1870 1875
Leu Asn Tyr Met Arg Gln Leu Asn Lys Pro Ser Val Trp Arg Tyr
1880 1885 1890
Ala Lys Leu Val Leu Leu Leu Ile Ala Ile Tyr Asn Phe Phe Tyr
1895 1900 1905
Leu Phe Val Ser Ile Pro Val Val His Lys Leu Thr Cys Asn Gly
1910 1915 1920
Ala Val Gln Ala Tyr Lys Asn Ser Ser Phe Ile Lys Ser Ala Val
1925 1930 1935
Cys Gly Asn Ser Ile Leu Cys Lys Ala Cys Leu Ala Ser Tyr Asp
1940 1945 1950
Glu Leu Ala Asp Phe Gln His Leu Gln Val Thr Trp Asp Phe Lys
1955 1960 1965
Ser Asp Pro Leu Trp Asn Arg Leu Val Gln Leu Ser Tyr Phe Ala
1970 1975 1980
Phe Leu Ala Val Phe Gly Asn Asn Tyr Val Arg Cys Phe Leu Met
1985 1990 1995
Tyr Phe Val Ser Gln Tyr Leu Asn Leu Trp Leu Ser Tyr Phe Gly
2000 2005 2010
Tyr Val Glu Tyr Ser Trp Phe Leu His Val Val Asn Phe Glu Ser
2015 2020 2025
Ile Ser Ala Glu Phe Val Ile Val Val Ile Val Val Lys Ala Val
2030 2035 2040
Leu Ala Leu Lys His Ile Val Phe Ala Cys Ser Asn Pro Ser Cys
2045 2050 2055
Lys Thr Cys Ser Arg Thr Ala Arg Gln Thr Arg Ile Pro Ile Gln
2060 2065 2070
Val Val Val Asn Gly Ser Met Lys Thr Val Tyr Val His Ala Asn
2075 2080 2085
Gly Thr Gly Lys Phe Cys Lys Lys His Asn Phe Tyr Cys Lys Asn
2090 2095 2100
Cys Asp Ser Tyr Gly Phe Glu Asn Thr Phe Ile Cys Asp Glu Ile
2105 2110 2115
Val Arg Asp Leu Ser Asn Ser Val Lys Gln Thr Val Tyr Ala Thr
2120 2125 2130
Asp Arg Ser His Gln Glu Val Thr Lys Val Glu Cys Ser Asp Gly
2135 2140 2145
Phe Tyr Arg Phe Tyr Val Gly Asp Glu Phe Thr Ser Tyr Asp Tyr
2150 2155 2160
Asp Val Lys His Lys Lys Tyr Ser Ser Gln Glu Val Leu Lys Ser
2165 2170 2175
Met Leu Leu Leu Asp Asp Phe Ile Val Tyr Ser Pro Ser Gly Ser
2180 2185 2190
Ala Leu Ala Asn Val Arg Asn Ala Cys Val Tyr Phe Ser Gln Leu
2195 2200 2205
Ile Gly Lys Pro Ile Lys Ile Val Asn Ser Asp Leu Leu Glu Asp
2210 2215 2220
Leu Ser Val Asp Phe Lys Gly Ala Leu Phe Asn Ala Lys Lys Asn
2225 2230 2235
Val Ile Lys Asn Ser Phe Asn Val Asp Val Ser Glu Cys Lys Asn
2240 2245 2250
Leu Asp Glu Cys Tyr Arg Ala Cys Asn Leu Asn Val Ser Phe Ser
2255 2260 2265
Thr Phe Glu Met Ala Val Asn Asn Ala His Arg Phe Gly Ile Leu
2270 2275 2280
Ile Thr Asp Arg Ser Phe Asn Asn Phe Trp Pro Ser Lys Val Lys
2285 2290 2295
Pro Gly Ser Ser Gly Val Ser Ala Met Asp Ile Gly Lys Cys Met
2300 2305 2310
Thr Ser Asp Ala Lys Ile Val Asn Ala Lys Val Leu Thr Gln Arg
2315 2320 2325
Gly Lys Ser Val Val Trp Leu Ser Gln Asp Phe Ala Ala Leu Ser
2330 2335 2340
Ser Thr Ala Gln Lys Val Leu Val Lys Thr Phe Val Glu Glu Gly
2345 2350 2355
Val Asn Phe Ser Leu Thr Phe Asn Ala Val Gly Ser Asp Asp Asp
2360 2365 2370
Leu Pro Tyr Glu Arg Phe Thr Glu Ser Val Ser Pro Lys Ser Gly
2375 2380 2385
Ser Gly Phe Phe Asp Val Ile Thr Gln Leu Lys Gln Ile Val Ile
2390 2395 2400
Leu Val Phe Val Phe Ile Phe Ile Cys Gly Leu Cys Ser Val Tyr
2405 2410 2415
Ser Val Ala Thr Gln Ser Tyr Ile Glu Ser Ala Glu Gly Tyr Asp
2420 2425 2430
Tyr Met Val Ile Lys Asn Gly Ile Val Gln Pro Phe Asp Asp Thr
2435 2440 2445
Ile Ser Cys Val His Asn Thr Tyr Lys Gly Phe Gly Asp Trp Phe
2450 2455 2460
Lys Ala Lys Tyr Gly Phe Ile Pro Thr Phe Gly Lys Ser Cys Pro
2465 2470 2475
Ile Val Val Gly Thr Val Phe Asp Leu Glu Asn Met Arg Pro Ile
2480 2485 2490
Pro Asp Val Pro Ala Tyr Val Ser Ile Val Gly Arg Ser Leu Val
2495 2500 2505
Phe Ala Ile Asn Ala Ala Phe Gly Val Thr Asn Met Cys Tyr Asp
2510 2515 2520
His Thr Gly Asn Ala Val Ser Lys Asp Ser Tyr Phe Asp Thr Cys
2525 2530 2535
Val Phe Asn Thr Ala Cys Thr Thr Leu Thr Gly Leu Gly Gly Thr
2540 2545 2550
Ile Val Tyr Cys Ala Lys Gln Gly Leu Val Glu Gly Ala Lys Leu
2555 2560 2565
Tyr Ser Asp Leu Met Pro Asp Tyr Tyr Tyr Glu His Ala Ser Gly
2570 2575 2580
Asn Met Val Lys Leu Pro Ala Ile Ile Arg Gly Leu Gly Leu Arg
2585 2590 2595
Phe Val Lys Thr Gln Ala Thr Thr Tyr Cys Arg Val Gly Glu Cys
2600 2605 2610
Ile Asp Ser Lys Ala Gly Phe Cys Phe Gly Gly Asp Asn Trp Phe
2615 2620 2625
Val Tyr Asp Asn Glu Phe Gly Asn Gly Tyr Ile Cys Gly Asn Ser
2630 2635 2640
Val Leu Gly Phe Phe Lys Asn Val Phe Lys Leu Phe Asn Ser Asn
2645 2650 2655
Met Ser Val Val Ala Thr Ser Gly Ala Met Leu Val Asn Ile Ile
2660 2665 2670
Ile Ala Cys Leu Ala Ile Ala Met Cys Tyr Gly Val Leu Lys Phe
2675 2680 2685
Lys Lys Ile Phe Gly Asp Cys Thr Phe Leu Ile Val Met Ile Ile
2690 2695 2700
Val Thr Leu Val Val Asn Asn Val Ser Tyr Phe Val Thr Gln Asn
2705 2710 2715
Thr Phe Phe Met Ile Ile Tyr Ala Ile Val Tyr Tyr Phe Ile Thr
2720 2725 2730
Arg Lys Leu Ala Tyr Pro Gly Ile Leu Asp Ala Gly Phe Ile Ile
2735 2740 2745
Ala Tyr Ile Asn Met Ala Pro Trp Tyr Val Ile Thr Ala Tyr Ile
2750 2755 2760
Leu Val Phe Leu Tyr Asp Ser Leu Pro Ser Leu Phe Lys Leu Lys
2765 2770 2775
Val Ser Thr Asn Leu Phe Glu Gly Asp Lys Phe Val Gly Asn Phe
2780 2785 2790
Glu Ser Ala Ala Met Gly Thr Phe Val Ile Asp Met Arg Ser Tyr
2795 2800 2805
Glu Thr Ile Val Asn Ser Thr Ser Ile Ala Arg Ile Lys Ser Tyr
2810 2815 2820
Ala Asn Ser Phe Asn Lys Tyr Lys Tyr Tyr Thr Gly Ser Met Gly
2825 2830 2835
Glu Ala Asp Tyr Arg Met Ala Cys Tyr Ala His Leu Gly Lys Ala
2840 2845 2850
Leu Met Asp Tyr Ser Val Asn Arg Thr Asp Met Leu Tyr Thr Pro
2855 2860 2865
Pro Thr Val Ser Val Asn Ser Thr Leu Gln Ser Gly Leu Arg Lys
2870 2875 2880
Met Ala Gln Pro Ser Gly Leu Val Glu Pro Cys Ile Val Arg Val
2885 2890 2895
Ser Tyr Gly Asn Asn Val Leu Asn Gly Leu Trp Leu Gly Asp Glu
2900 2905 2910
Val Ile Cys Pro Arg His Val Ile Ala Ser Asp Thr Thr Arg Val
2915 2920 2925
Ile Asn Tyr Glu Asn Glu Met Ser Ser Val Arg Leu His Asn Phe
2930 2935 2940
Ser Val Ser Lys Asn Asn Val Phe Leu Gly Val Val Ser Ala Arg
2945 2950 2955
Tyr Lys Gly Val Asn Leu Val Leu Lys Val Asn Gln Val Asn Pro
2960 2965 2970
Asn Thr Pro Glu His Lys Phe Lys Ser Ile Lys Ala Gly Glu Ser
2975 2980 2985
Phe Asn Ile Leu Ala Cys Tyr Glu Gly Cys Pro Gly Ser Val Tyr
2990 2995 3000
Gly Val Asn Met Arg Ser Gln Gly Thr Ile Lys Gly Ser Phe Ile
3005 3010 3015
Ala Gly Thr Cys Gly Ser Val Gly Tyr Val Leu Glu Asn Gly Ile
3020 3025 3030
Leu Tyr Phe Val Tyr Met His His Leu Glu Leu Gly Asn Gly Ser
3035 3040 3045
His Val Gly Ser Asn Phe Glu Gly Glu Met Tyr Gly Gly Tyr Glu
3050 3055 3060
Asp Gln Pro Ser Met Gln Leu Glu Gly Thr Asn Val Met Ser Ser
3065 3070 3075
Asp Asn Val Val Ala Phe Leu Tyr Ala Ala Leu Ile Asn Gly Glu
3080 3085 3090
Arg Trp Phe Val Thr Asn Thr Ser Met Ser Leu Glu Ser Tyr Asn
3095 3100 3105
Thr Trp Ala Lys Thr Asn Ser Phe Thr Glu Leu Ser Ser Thr Asp
3110 3115 3120
Ala Phe Ser Met Leu Ala Ala Lys Thr Gly Gln Ser Val Glu Lys
3125 3130 3135
Leu Leu Asp Ser Ile Val Arg Leu Asn Lys Gly Phe Gly Gly Arg
3140 3145 3150
Thr Ile Leu Ser Tyr Gly Ser Leu Cys Asp Glu Phe Thr Pro Thr
3155 3160 3165
Glu Val Ile Arg Gln Met Tyr Gly Val Asn Leu Gln Ala Gly Lys
3170 3175 3180
Val Lys Ser Phe Phe Tyr Pro Ile Met Thr Ala Met Thr Ile Leu
3185 3190 3195
Phe Ala Phe Trp Leu Glu Phe Phe Met Tyr Thr Pro Phe Thr Trp
3200 3205 3210
Ile Asn Pro Thr Phe Val Ser Ile Val Leu Ala Val Thr Thr Leu
3215 3220 3225
Ile Ser Thr Val Phe Val Ser Gly Ile Lys His Lys Met Leu Phe
3230 3235 3240
Phe Met Ser Phe Val Leu Pro Ser Val Ile Leu Val Thr Ala His
3245 3250 3255
Asn Leu Phe Trp Asp Phe Ser Tyr Tyr Glu Ser Leu Gln Ser Ile
3260 3265 3270
Val Glu Asn Thr Asn Thr Met Phe Leu Pro Val Asp Met Gln Gly
3275 3280 3285
Val Met Leu Thr Val Phe Cys Phe Ile Val Phe Val Thr Tyr Ser
3290 3295 3300
Val Arg Phe Phe Thr Cys Lys Gln Ser Trp Phe Ser Leu Ala Val
3305 3310 3315
Thr Thr Ile Leu Val Ile Phe Asn Met Val Lys Ile Phe Gly Thr
3320 3325 3330
Ser Asp Glu Pro Trp Thr Glu Asn Gln Ile Ala Phe Cys Phe Val
3335 3340 3345
Asn Met Leu Thr Met Ile Val Ser Leu Thr Thr Lys Asp Trp Met
3350 3355 3360
Val Val Ile Ala Ser Tyr Arg Ile Ala Tyr Tyr Ile Val Val Cys
3365 3370 3375
Val Met Pro Ser Ala Phe Val Ser Asp Phe Gly Phe Met Lys Cys
3380 3385 3390
Ile Ser Ile Val Tyr Met Ala Cys Gly Tyr Leu Phe Cys Cys Tyr
3395 3400 3405
Tyr Gly Ile Leu Tyr Trp Val Asn Arg Phe Thr Cys Met Thr Cys
3410 3415 3420
Gly Val Tyr Gln Phe Thr Val Ser Ala Ala Glu Leu Lys Tyr Met
3425 3430 3435
Thr Ala Asn Asn Leu Ser Ala Pro Lys Asn Ala Tyr Asp Ala Met
3440 3445 3450
Ile Leu Ser Ala Lys Leu Ile Gly Val Gly Gly Lys Arg Asn Ile
3455 3460 3465
Lys Ile Ser Thr Val Gln Ser Lys Leu Thr Glu Met Lys Cys Thr
3470 3475 3480
Asn Val Val Leu Leu Gly Leu Leu Ser Lys Met His Val Glu Ser
3485 3490 3495
Asn Ser Lys Glu Trp Asn Tyr Cys Val Gly Leu His Asn Glu Ile
3500 3505 3510
Asn Leu Cys Asp Asp Pro Glu Ile Val Leu Glu Lys Leu Leu Ala
3515 3520 3525
Leu Ile Ala Phe Phe Leu Ser Lys His Asn Thr Cys Asp Leu Ser
3530 3535 3540
Glu Leu Ile Glu Ser Tyr Phe Glu Asn Thr Thr Ile Leu Gln Ser
3545 3550 3555
Val Ala Ser Ala Tyr Ala Ala Leu Pro Ser Trp Ile Ala Leu Glu
3560 3565 3570
Lys Ala Arg Ala Asp Leu Glu Glu Ala Lys Lys Asn Asp Val Ser
3575 3580 3585
Pro Gln Ile Leu Lys Gln Leu Thr Lys Ala Phe Asn Ile Ala Lys
3590 3595 3600
Ser Asp Phe Glu Arg Glu Ala Ser Val Gln Lys Lys Leu Asp Lys
3605 3610 3615
Met Ala Glu Gln Ala Ala Ala Ser Met Tyr Lys Glu Ala Arg Ala
3620 3625 3630
Val Asp Arg Lys Ser Lys Ile Val Ser Ala Met His Ser Leu Leu
3635 3640 3645
Phe Gly Met Leu Lys Lys Leu Asp Met Ser Ser Val Asn Thr Ile
3650 3655 3660
Ile Asp Gln Ala Arg Asn Gly Val Leu Pro Leu Ser Ile Ile Pro
3665 3670 3675
Ala Ala Ser Ala Thr Arg Leu Val Val Ile Thr Pro Ser Leu Glu
3680 3685 3690
Val Phe Ser Lys Ile Arg Gln Glu Asn Asn Val His Tyr Ala Gly
3695 3700 3705
Ala Ile Trp Thr Ile Val Glu Val Lys Asp Ala Asn Gly Ser His
3710 3715 3720
Val His Leu Lys Glu Val Thr Ala Ala Asn Glu Leu Asn Leu Thr
3725 3730 3735
Trp Pro Leu Ser Ile Thr Cys Glu Arg Thr Thr Lys Leu Gln Asn
3740 3745 3750
Asn Glu Ile Met Pro Gly Lys Leu Lys Glu Arg Ala Val Arg Ala
3755 3760 3765
Ser Ala Thr Leu Asp Gly Glu Ala Phe Gly Ser Gly Lys Ala Leu
3770 3775 3780
Met Ala Ser Glu Ser Gly Lys Ser Phe Met Tyr Ala Phe Ile Ala
3785 3790 3795
Ser Asp Asn Asn Leu Lys Tyr Val Lys Trp Glu Ser Asn Asn Asp
3800 3805 3810
Ile Ile Pro Ile Glu Leu Glu Ala Pro Leu Arg Phe Tyr Val Asp
3815 3820 3825
Gly Ala Asn Gly Pro Glu Val Lys Tyr Leu Tyr Phe Val Lys Asn
3830 3835 3840
Leu Asn Thr Leu Arg Arg Gly Ala Val Leu Gly Tyr Ile Gly Ala
3845 3850 3855
Thr Val Arg Leu Gln Ala Gly Lys Pro Thr Glu His Pro Ser Asn
3860 3865 3870
Ser Ser Leu Leu Thr Leu Cys Ala Phe Ser Pro Asp Pro Ala Lys
3875 3880 3885
Ala Tyr Val Asp Ala Val Lys Arg Gly Met Gln Pro Val Asn Asn
3890 3895 3900
Cys Val Lys Met Leu Ser Asn Gly Ala Gly Asn Gly Met Ala Val
3905 3910 3915
Thr Asn Gly Val Glu Ala Asn Thr Gln Gln Asp Ser Tyr Gly Gly
3920 3925 3930
Ala Ser Val Cys Ile Tyr Cys Arg Cys His Val Glu His Pro Ala
3935 3940 3945
Ile Asp Gly Leu Cys Arg Tyr Lys Gly Lys Phe Val Gln Ile Pro
3950 3955 3960
Thr Gly Thr Gln Asp Pro Ile Arg Phe Cys Ile Glu Asn Glu Val
3965 3970 3975
Cys Val Val Cys Gly Cys Trp Leu Asn Asn Gly Cys Met Cys Asp
3980 3985 3990
Arg Thr Ser Met Gln Ser Phe Thr Val Asp Gln Ser Tyr Leu Phe
3995 4000 4005
Lys Arg Val Arg Gly Ser Ser Ala Ala Arg Leu Glu Pro Cys Asn
4010 4015 4020
Gly Thr Asp Pro Asp His Val Ser Arg Ala Phe Asp Ile Tyr Asn
4025 4030 4035
Lys Asp Val Ala Cys Ile Gly Lys Phe Leu Lys Thr Asn Cys Ser
4040 4045 4050
Arg Phe Arg Asn Leu Asp Lys His Asp Ala Tyr Tyr Ile Val Lys
4055 4060 4065
Arg Cys Thr Lys Thr Val Met Asp His Glu Gln Val Cys Tyr Asn
4070 4075 4080
Asp Leu Lys Asp Ser Gly Ala Val Ala Glu His Asp Phe Phe Thr
4085 4090 4095
Tyr Lys Glu Gly Arg Cys Glu Phe Gly Asn Val Ala Arg Arg Asn
4100 4105 4110
Leu Thr Lys Tyr Thr Met Met Asp Leu Cys Tyr Ala Ile Arg Asn
4115 4120 4125
Phe Asp Glu Lys Asn Cys Glu Val Leu Lys Glu Ile Leu Val Thr
4130 4135 4140
Val Gly Ala Cys Thr Glu Glu Phe Phe Glu Asn Lys Asp Trp Phe
4145 4150 4155
Asp Pro Val Glu Asn Glu Ala Ile His Glu Val Tyr Ala Lys Leu
4160 4165 4170
Gly Pro Ile Val Ala Asn Ala Met Leu Lys Cys Val Ala Phe Cys
4175 4180 4185
Asp Ala Ile Val Glu Lys Gly Tyr Ile Gly Val Ile Thr Leu Asp
4190 4195 4200
Asn Gln Asp Leu Asn Gly Asn Phe Tyr Asp Phe Gly Asp Phe Val
4205 4210 4215
Lys Thr Ala Pro Gly Phe Gly Cys Ala Cys Val Thr Ser Tyr Tyr
4220 4225 4230
Ser Tyr Met Met Pro Leu Met Gly Met Thr Ser Cys Leu Glu Ser
4235 4240 4245
Glu Asn Phe Val Lys Ser Asp Ile Tyr Gly Ser Asp Tyr Lys Gln
4250 4255 4260
Tyr Asp Leu Leu Ala Tyr Asp Phe Thr Glu His Lys Glu Tyr Leu
4265 4270 4275
Phe Gln Lys Tyr Phe Lys Tyr Trp Asp Arg Thr Tyr His Pro Asn
4280 4285 4290
Cys Ser Asp Cys Thr Ser Asp Glu Cys Ile Ile His Cys Ala Asn
4295 4300 4305
Phe Asn Thr Leu Phe Ser Met Thr Ile Pro Met Thr Ala Phe Gly
4310 4315 4320
Pro Leu Val Arg Lys Val His Ile Asp Gly Val Pro Val Val Val
4325 4330 4335
Thr Ala Gly Tyr His Phe Lys Gln Leu Gly Ile Val Trp Asn Leu
4340 4345 4350
Asp Val Lys Leu Asp Thr Met Lys Leu Ser Met Thr Asp Leu Leu
4355 4360 4365
Arg Phe Val Thr Asp Pro Thr Leu Leu Val Ala Ser Ser Pro Ala
4370 4375 4380
Leu Leu Asp Gln Arg Thr Val Cys Phe Ser Ile Ala Ala Leu Ser
4385 4390 4395
Thr Gly Ile Thr Tyr Gln Thr Val Lys Pro Gly His Phe Asn Lys
4400 4405 4410
Asp Phe Tyr Asp Phe Ile Thr Glu Arg Gly Phe Phe Glu Glu Gly
4415 4420 4425
Ser Glu Leu Thr Leu Lys His Phe Phe Phe Ala Gln Gly Gly Glu
4430 4435 4440
Ala Ala Met Thr Asp Phe Asn Tyr Tyr Arg Tyr Asn Arg Val Thr
4445 4450 4455
Val Leu Asp Ile Cys Gln Ala Gln Phe Val Tyr Lys Ile Val Gly
4460 4465 4470
Lys Tyr Phe Glu Cys Tyr Asp Gly Gly Cys Ile Asn Ala Arg Glu
4475 4480 4485
Val Val Val Thr Asn Tyr Asp Lys Ser Ala Gly Tyr Pro Leu Asn
4490 4495 4500
Lys Phe Gly Lys Ala Arg Leu Tyr Tyr Glu Thr Leu Ser Tyr Glu
4505 4510 4515
Glu Gln Asp Ala Leu Phe Ala Leu Thr Lys Arg Asn Val Leu Pro
4520 4525 4530
Thr Met Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Gly Lys Ala
4535 4540 4545
Arg Ala Arg Thr Val Gly Gly Val Ser Leu Leu Ser Thr Met Thr
4550 4555 4560
Thr Arg Gln Tyr His Gln Lys His Leu Lys Ser Ile Ala Ala Thr
4565 4570 4575
Arg Asn Ala Thr Val Val Ile Gly Ser Thr Lys Phe Tyr Gly Gly
4580 4585 4590
Trp Asp Asn Met Leu Lys Asn Leu Met Arg Asp Val Asp Asn Gly
4595 4600 4605
Cys Leu Met Gly Trp Asp Tyr Pro Lys Cys Asp Arg Ala Leu Pro
4610 4615 4620
Asn Met Ile Arg Met Ala Ser Ala Met Ile Leu Gly Ser Lys His
4625 4630 4635
Val Gly Cys Cys Thr His Asn Asp Arg Phe Tyr Arg Leu Ser Asn
4640 4645 4650
Glu Leu Ala Gln Val Leu Thr Glu Val Val His Cys Thr Gly Gly
4655 4660 4665
Phe Tyr Phe Lys Pro Gly Gly Thr Thr Ser Gly Asp Gly Thr Thr
4670 4675 4680
Ala Tyr Ala Asn Ser Ala Phe Asn Ile Phe Gln Ala Val Ser Ala
4685 4690 4695
Asn Val Asn Lys Leu Leu Gly Val Asp Ser Asn Ala Cys Asn Asn
4700 4705 4710
Val Thr Val Lys Ser Ile Gln Arg Lys Ile Tyr Asp Asn Cys Tyr
4715 4720 4725
Arg Ser Ser Ser Ile Asp Glu Glu Phe Val Val Glu Tyr Phe Ser
4730 4735 4740
Tyr Leu Arg Lys His Phe Ser Met Met Ile Leu Ser Asp Asp Gly
4745 4750 4755
Val Val Cys Tyr Asn Lys Asp Tyr Ala Asp Leu Gly Tyr Val Ala
4760 4765 4770
Asp Ile Asn Ala Phe Lys Ala Thr Leu Tyr Tyr Gln Asn Asn Val
4775 4780 4785
Phe Met Ser Thr Ser Lys Cys Trp Val Glu Pro Asp Leu Ser Val
4790 4795 4800
Gly Pro His Glu Phe Cys Ser Gln His Thr Leu Gln Ile Val Gly
4805 4810 4815
Pro Asp Gly Asp Tyr Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile
4820 4825 4830
Leu Ser Ala Gly Val Phe Val Asp Asp Ile Val Lys Thr Asp Asn
4835 4840 4845
Val Ile Met Leu Glu Arg Tyr Val Ser Leu Ala Ile Asp Ala Tyr
4850 4855 4860
Pro Leu Thr Lys His Pro Lys Pro Ala Tyr Gln Lys Val Phe Tyr
4865 4870 4875
Thr Leu Leu Asp Trp Val Lys His Leu Gln Lys Asn Leu Asn Ala
4880 4885 4890
Gly Val Leu Asp Ser Phe Ser Val Thr Met Leu Glu Glu Gly Gln
4895 4900 4905
Asp Lys Phe Trp Ser Glu Glu Phe Tyr Ala Ser Leu Tyr Glu Lys
4910 4915 4920
Ser Thr Val Leu Gln Ala Ala Gly Met Cys Val Val Cys Gly Ser
4925 4930 4935
Gln Thr Val Leu Arg Cys Gly Asp Cys Leu Arg Arg Pro Leu Leu
4940 4945 4950
Cys Thr Lys Cys Ala Tyr Asp His Val Met Gly Thr Lys His Lys
4955 4960 4965
Phe Ile Met Ser Ile Thr Pro Tyr Val Cys Ser Phe Asn Gly Cys
4970 4975 4980
Asn Val Asn Asp Val Thr Lys Leu Phe Leu Gly Gly Leu Ser Tyr
4985 4990 4995
Tyr Cys Met Asn His Lys Pro Gln Leu Ser Phe Pro Leu Cys Ala
5000 5005 5010
Asn Gly Asn Val Phe Gly Leu Tyr Lys Ser Ser Ala Val Gly Ser
5015 5020 5025
Glu Ala Val Glu Asp Phe Asn Lys Leu Ala Val Ser Asp Trp Thr
5030 5035 5040
Asn Val Glu Asp Tyr Lys Leu Ala Asn Asn Val Lys Glu Ser Leu
5045 5050 5055
Lys Ile Phe Ala Ala Glu Thr Val Lys Ala Lys Glu Glu Ser Val
5060 5065 5070
Lys Ser Glu Tyr Ala Tyr Ala Val Leu Lys Glu Val Ile Gly Pro
5075 5080 5085
Lys Glu Ile Val Leu Gln Trp Glu Ala Ser Lys Thr Lys Pro Pro
5090 5095 5100
Leu Asn Arg Asn Ser Val Phe Thr Cys Phe Gln Ile Ser Lys Asp
5105 5110 5115
Thr Lys Ile Gln Leu Gly Glu Phe Val Phe Glu Gln Ser Glu Tyr
5120 5125 5130
Gly Ser Asp Ser Val Tyr Tyr Lys Ser Thr Ser Thr Tyr Lys Leu
5135 5140 5145
Thr Pro Gly Met Ile Phe Val Leu Thr Ser His Asn Val Ser Pro
5150 5155 5160
Leu Lys Ala Pro Ile Leu Val Asn Gln Glu Lys Tyr Asn Thr Ile
5165 5170 5175
Ser Lys Leu Tyr Pro Val Phe Asn Ile Ala Glu Ala Tyr Asn Thr
5180 5185 5190
Leu Val Pro Tyr Tyr Gln Met Ile Gly Lys Gln Lys Phe Thr Thr
5195 5200 5205
Ile Gln Gly Pro Pro Gly Ser Gly Lys Ser His Cys Val Ile Gly
5210 5215 5220
Leu Gly Leu Tyr Tyr Pro Gln Ala Arg Ile Val Tyr Thr Ala Cys
5225 5230 5235
Ser His Ala Ala Val Asp Ala Leu Cys Glu Lys Ala Ala Lys Asn
5240 5245 5250
Phe Asn Val Asp Arg Cys Ser Arg Ile Ile Pro Gln Arg Ile Arg
5255 5260 5265
Val Asp Cys Tyr Thr Gly Phe Lys Pro Asn Asn Thr Asn Ala Gln
5270 5275 5280
Tyr Leu Phe Cys Thr Val Asn Ala Leu Pro Glu Ala Ser Cys Asp
5285 5290 5295
Ile Val Val Val Asp Glu Val Ser Met Cys Thr Asn Tyr Asp Leu
5300 5305 5310
Ser Val Ile Asn Ser Arg Leu Ser Tyr Lys His Ile Val Tyr Val
5315 5320 5325
Gly Asp Pro Gln Gln Leu Pro Ala Pro Arg Thr Leu Ile Asn Lys
5330 5335 5340
Gly Val Leu Gln Pro Gln Asp Tyr Asn Val Val Thr Lys Arg Met
5345 5350 5355
Cys Thr Leu Gly Pro Asp Val Phe Leu His Lys Cys Tyr Arg Cys
5360 5365 5370
Pro Ala Glu Ile Val Lys Thr Val Ser Ala Leu Val Tyr Glu Asn
5375 5380 5385
Lys Phe Val Pro Val Asn Pro Glu Ser Lys Gln Cys Phe Lys Met
5390 5395 5400
Phe Val Lys Gly Gln Val Gln Ile Glu Ser Asn Ser Ser Ile Asn
5405 5410 5415
Asn Lys Gln Leu Glu Val Val Lys Ala Phe Leu Ala His Asn Pro
5420 5425 5430
Lys Trp Arg Lys Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn
5435 5440 5445
Tyr Val Ala Arg Arg Leu Leu Gly Leu Gln Thr Gln Thr Val Asp
5450 5455 5460
Ser Ala Gln Gly Ser Glu Tyr Asp Tyr Val Ile Tyr Thr Gln Thr
5465 5470 5475
Ser Asp Thr Gln His Ala Thr Asn Val Asn Arg Phe Asn Val Ala
5480 5485 5490
Ile Thr Arg Ala Lys Val Gly Ile Leu Cys Ile Met Cys Asp Arg
5495 5500 5505
Thr Met Tyr Glu Asn Leu Asp Phe Tyr Glu Leu Lys Asp Ser Lys
5510 5515 5520
Ile Gly Leu Gln Ala Lys Pro Glu Thr Cys Gly Leu Phe Lys Asp
5525 5530 5535
Cys Ser Lys Ser Glu Gln Tyr Ile Pro Pro Ala Tyr Ala Thr Thr
5540 5545 5550
Tyr Met Ser Leu Ser Asp Asn Phe Lys Thr Ser Asp Gly Leu Ala
5555 5560 5565
Val Asn Ile Gly Thr Lys Asp Val Lys Tyr Ala Asn Val Ile Ser
5570 5575 5580
Tyr Met Gly Phe Arg Phe Glu Ala Asn Ile Pro Gly Tyr His Thr
5585 5590 5595
Leu Phe Cys Thr Arg Asp Phe Ala Met Arg Asn Val Arg Ala Trp
5600 5605 5610
Leu Gly Phe Asp Val Glu Gly Ala His Val Cys Gly Asp Asn Val
5615 5620 5625
Gly Thr Asn Val Pro Leu Gln Leu Gly Phe Ser Asn Gly Val Asp
5630 5635 5640
Phe Val Val Gln Thr Glu Gly Cys Val Ile Thr Glu Lys Gly Asn
5645 5650 5655
Ser Ile Glu Val Val Lys Ala Arg Ala Pro Pro Gly Glu Gln Phe
5660 5665 5670
Ala His Leu Ile Pro Leu Met Arg Lys Gly Gln Pro Trp His Ile
5675 5680 5685
Val Arg Arg Arg Ile Val Gln Met Val Cys Asp Tyr Phe Asp Gly
5690 5695 5700
Leu Ser Asp Ile Leu Ile Phe Val Leu Trp Ala Gly Gly Leu Glu
5705 5710 5715
Leu Thr Thr Met Arg Tyr Phe Val Lys Ile Gly Arg Pro Gln Lys
5720 5725 5730
Cys Glu Cys Gly Lys Ser Ala Thr Cys Tyr Ser Ser Ser Gln Ser
5735 5740 5745
Val Tyr Ala Cys Phe Lys His Ala Leu Gly Cys Asp Tyr Leu Tyr
5750 5755 5760
Asn Pro Tyr Cys Ile Asp Ile Gln Gln Trp Gly Tyr Thr Gly Ser
5765 5770 5775
Leu Ser Met Asn His His Glu Val Cys Asn Ile His Arg Asn Glu
5780 5785 5790
His Val Ala Ser Gly Asp Ala Ile Met Thr Arg Cys Leu Ala Ile
5795 5800 5805
His Asp Cys Phe Val Lys Arg Val Asp Trp Ser Ile Val Tyr Pro
5810 5815 5820
Phe Ile Asp Asn Glu Glu Lys Ile Asn Lys Ala Gly Arg Ile Val
5825 5830 5835
Gln Ser His Val Met Lys Ala Ala Leu Lys Ile Phe Asn Pro Ala
5840 5845 5850
Ala Ile His Asp Val Gly Asn Pro Lys Gly Ile Arg Cys Ala Thr
5855 5860 5865
Thr Pro Ile Pro Trp Phe Cys Tyr Asp Arg Asp Pro Ile Asn Asn
5870 5875 5880
Asn Val Arg Cys Leu Asp Tyr Asp Tyr Met Val His Gly Gln Met
5885 5890 5895
Asn Gly Leu Met Leu Phe Trp Asn Cys Asn Val Asp Met Tyr Pro
5900 5905 5910
Glu Phe Ser Ile Val Cys Arg Phe Asp Thr Arg Thr Arg Ser Lys
5915 5920 5925
Leu Ser Leu Glu Gly Cys Asn Gly Gly Ala Leu Tyr Val Asn Asn
5930 5935 5940
His Ala Phe His Thr Pro Ala Tyr Asp Arg Arg Ala Phe Ala Lys
5945 5950 5955
Leu Lys Pro Met Pro Phe Phe Tyr Tyr Asp Asp Ser Asn Cys Glu
5960 5965 5970
Leu Val Asp Gly Gln Pro Asn Tyr Val Pro Leu Lys Ser Asn Val
5975 5980 5985
Cys Ile Thr Lys Cys Asn Ile Gly Gly Ala Val Cys Lys Lys His
5990 5995 6000
Ala Ala Leu Tyr Arg Ala Tyr Val Glu Asp Tyr Asn Ile Phe Met
6005 6010 6015
Gln Ala Gly Phe Thr Ile Trp Cys Pro Gln Asn Phe Asp Thr Tyr
6020 6025 6030
Met Leu Trp His Gly Phe Val Asn Ser Lys Ala Leu Gln Ser Leu
6035 6040 6045
Glu Asn Val Ala Phe Asn Val Val Lys Lys Gly Ala Phe Thr Gly
6050 6055 6060
Leu Lys Gly Asp Leu Pro Thr Ala Val Ile Ala Asp Lys Ile Met
6065 6070 6075
Val Arg Asp Gly Pro Thr Asp Lys Cys Ile Phe Thr Asn Lys Thr
6080 6085 6090
Ser Leu Pro Thr Asn Val Ala Phe Glu Leu Tyr Ala Lys Arg Lys
6095 6100 6105
Leu Gly Leu Thr Pro Pro Leu Thr Ile Leu Arg Asn Leu Gly Val
6110 6115 6120
Val Ala Thr Tyr Lys Phe Val Leu Trp Asp Tyr Glu Ala Glu Arg
6125 6130 6135
Pro Phe Ser Asn Phe Thr Lys Gln Val Cys Ser Tyr Thr Asp Leu
6140 6145 6150
Asp Ser Glu Val Val Thr Cys Phe Asp Asn Ser Ile Ala Gly Ser
6155 6160 6165
Phe Glu Arg Phe Thr Thr Thr Arg Asp Ala Val Leu Ile Ser Asn
6170 6175 6180
Asn Ala Val Lys Gly Leu Ser Ala Ile Lys Leu Gln Tyr Gly Leu
6185 6190 6195
Leu Asn Asp Leu Pro Val Ser Thr Val Gly Asn Lys Pro Val Thr
6200 6205 6210
Trp Tyr Ile Tyr Val Arg Lys Asn Gly Glu Tyr Val Glu Gln Ile
6215 6220 6225
Asp Ser Tyr Tyr Thr Gln Gly Arg Thr Phe Glu Thr Phe Lys Pro
6230 6235 6240
Arg Ser Thr Met Glu Glu Asp Phe Leu Ser Met Asp Thr Thr Leu
6245 6250 6255
Phe Ile Gln Lys Tyr Gly Leu Glu Asp Tyr Gly Phe Glu His Val
6260 6265 6270
Val Phe Gly Asp Val Ser Lys Thr Thr Ile Gly Gly Met His Leu
6275 6280 6285
Leu Ile Ser Gln Val Arg Leu Ala Lys Met Gly Leu Phe Ser Val
6290 6295 6300
Gln Glu Phe Met Asn Asn Ser Asp Ser Thr Leu Lys Ser Cys Cys
6305 6310 6315
Ile Thr Tyr Ala Asp Asp Pro Ser Ser Lys Asn Val Cys Thr Tyr
6320 6325 6330
Met Asp Ile Leu Leu Asp Asp Phe Val Thr Ile Ile Lys Ser Leu
6335 6340 6345
Asp Leu Asn Val Val Ser Lys Val Val Asp Val Ile Val Asp Cys
6350 6355 6360
Lys Ala Trp Arg Trp Met Leu Trp Cys Glu Asn Ser His Ile Lys
6365 6370 6375
Thr Phe Tyr Pro Gln Leu Gln Ser Ala Glu Trp Asn Pro Gly Tyr
6380 6385 6390
Ser Met Pro Thr Leu Tyr Lys Ile Gln Arg Met Cys Leu Glu Arg
6395 6400 6405
Cys Asn Leu Tyr Asn Tyr Gly Ala Gln Val Lys Leu Pro Asp Gly
6410 6415 6420
Ile Thr Thr Asn Val Val Lys Tyr Thr Gln Leu Cys Gln Tyr Leu
6425 6430 6435
Asn Thr Thr Thr Leu Cys Val Pro His Lys Met Arg Val Leu His
6440 6445 6450
Leu Gly Ala Ala Gly Ala Ser Gly Val Ala Pro Gly Ser Thr Val
6455 6460 6465
Leu Arg Arg Trp Leu Pro Asp Asp Ala Ile Leu Val Asp Asn Asp
6470 6475 6480
Leu Arg Asp Tyr Val Ser Asp Ala Asp Phe Ser Val Thr Gly Asp
6485 6490 6495
Cys Thr Ser Leu Tyr Ile Glu Asp Lys Phe Asp Leu Leu Val Ser
6500 6505 6510
Asp Leu Tyr Asp Gly Ser Thr Lys Ser Ile Asp Gly Glu Asn Thr
6515 6520 6525
Ser Lys Asp Gly Phe Phe Thr Tyr Ile Asn Gly Phe Ile Lys Glu
6530 6535 6540
Lys Leu Ser Leu Gly Gly Ser Val Ala Ile Lys Ile Thr Glu Phe
6545 6550 6555
Ser Trp Asn Lys Asp Leu Tyr Glu Leu Ile Gln Arg Phe Glu Tyr
6560 6565 6570
Trp Thr Val Phe Cys Thr Ser Val Asn Thr Ser Ser Ser Glu Gly
6575 6580 6585
Phe Leu Ile Gly Ile Asn Tyr Leu Gly Pro Tyr Cys Asp Lys Ala
6590 6595 6600
Ile Val Asp Gly Asn Ile Met His Ala Asn Tyr Ile Phe Trp Arg
6605 6610 6615
Asn Ser Thr Ile Met Ala Leu Ser His Asn Ser Val Leu Asp Thr
6620 6625 6630
Pro Lys Phe Lys Cys Arg Cys Asn Asn Ala Leu Ile Val Asn Leu
6635 6640 6645
Lys Glu Lys Glu Leu Asn Glu Met Val Ile Gly Leu Leu Arg Lys
6650 6655 6660
Gly Lys Leu Leu Ile Arg Asn Asn Gly Lys Leu Leu Asn Phe Gly
6665 6670 6675
Asn His Phe Val Asn Thr Pro
6680 6685
<210> 21
<211> 6603
<212> PRT
<213> avian infectious bronchitis virus
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<400> 21
Met Ala Ser Ser Leu Lys Gln Gly Val Ser Pro Lys Pro Arg Asp Val
1 5 10 15
Ile Leu Val Ser Lys Asp Ile Pro Glu Gln Leu Cys Asp Ala Leu Phe
20 25 30
Phe Tyr Thr Ser His Asn Pro Lys Asp Tyr Ala Asp Ala Phe Ala Val
35 40 45
Arg Gln Lys Phe Asp Arg Ser Leu Gln Thr Gly Lys Gln Phe Lys Phe
50 55 60
Glu Thr Val Cys Gly Leu Phe Leu Leu Lys Gly Val Asp Lys Ile Thr
65 70 75 80
Pro Gly Val Pro Ala Lys Val Leu Lys Ala Thr Ser Lys Leu Ala Asp
85 90 95
Leu Glu Asp Ile Phe Gly Val Ser Pro Leu Ala Arg Lys Tyr Arg Glu
100 105 110
Leu Leu Lys Thr Ala Cys Gln Trp Ser Leu Thr Val Glu Ala Leu Asp
115 120 125
Val Arg Ala Gln Thr Leu Asp Glu Ile Phe Asp Pro Thr Glu Ile Leu
130 135 140
Trp Leu Gln Val Ala Ala Lys Ile His Val Ser Ser Met Ala Met Arg
145 150 155 160
Arg Leu Val Gly Glu Val Thr Ala Lys Val Met Asp Ala Leu Gly Ser
165 170 175
Asn Leu Ser Ala Leu Phe Gln Ile Val Lys Gln Gln Ile Ala Arg Ile
180 185 190
Phe Gln Lys Ala Leu Ala Ile Phe Glu Asn Val Asn Glu Leu Pro Gln
195 200 205
Arg Ile Ala Ala Leu Lys Met Ala Phe Ala Lys Cys Ala Arg Ser Ile
210 215 220
Thr Val Val Val Val Glu Arg Thr Leu Val Val Lys Glu Phe Ala Gly
225 230 235 240
Thr Cys Leu Ala Ser Ile Asn Gly Ala Val Ala Lys Phe Phe Glu Glu
245 250 255
Leu Pro Asn Gly Phe Met Gly Ser Lys Ile Phe Thr Thr Leu Ala Phe
260 265 270
Phe Lys Glu Ala Ala Val Arg Val Val Glu Asn Ile Pro Asn Ala Pro
275 280 285
Arg Gly Thr Lys Gly Phe Glu Val Val Gly Asn Ala Lys Gly Thr Gln
290 295 300
Val Val Val Arg Gly Met Arg Asn Asp Leu Thr Leu Leu Asp Gln Lys
305 310 315 320
Ala Asp Ile Pro Val Glu Pro Glu Gly Trp Ser Ala Ile Leu Asp Gly
325 330 335
His Leu Cys Tyr Val Phe Arg Ser Gly Asp Arg Phe Tyr Ala Ala Pro
340 345 350
Leu Ser Gly Asn Phe Ala Leu Ser Asp Val His Cys Cys Glu Arg Val
355 360 365
Val Cys Leu Ser Asp Gly Val Thr Pro Glu Ile Asn Asp Gly Leu Ile
370 375 380
Leu Ala Ala Ile Tyr Ser Ser Phe Ser Val Ser Glu Leu Val Thr Ala
385 390 395 400
Leu Lys Lys Gly Glu Pro Phe Lys Phe Leu Gly His Lys Phe Val Tyr
405 410 415
Ala Lys Asp Ala Ala Val Ser Phe Thr Leu Ala Lys Ala Ala Thr Ile
420 425 430
Ala Asp Val Leu Arg Leu Phe Gln Ser Ala Arg Val Ile Ala Glu Asp
435 440 445
Val Trp Ser Ser Phe Thr Glu Lys Ser Phe Glu Phe Trp Lys Leu Ala
450 455 460
Tyr Gly Lys Val Arg Asn Leu Glu Glu Phe Val Lys Thr Tyr Val Cys
465 470 475 480
Lys Ala Gln Met Ser Ile Val Ile Leu Ala Ala Val Leu Gly Glu Asp
485 490 495
Ile Trp His Leu Val Ser Gln Val Ile Tyr Lys Leu Gly Val Leu Phe
500 505 510
Thr Lys Val Val Asp Phe Cys Asp Lys His Trp Lys Gly Phe Cys Val
515 520 525
Gln Leu Lys Arg Ala Lys Leu Ile Val Thr Glu Thr Phe Cys Val Leu
530 535 540
Lys Gly Val Ala Gln His Cys Phe Gln Leu Leu Leu Asp Ala Ile His
545 550 555 560
Ser Leu Tyr Lys Ser Phe Lys Lys Cys Ala Leu Gly Arg Ile His Gly
565 570 575
Asp Leu Leu Phe Trp Lys Gly Gly Val His Lys Ile Val Gln Asp Gly
580 585 590
Asp Glu Ile Trp Phe Asp Ala Ile Asp Ser Val Asp Val Glu Asp Leu
595 600 605
Gly Val Val Gln Glu Lys Ser Ile Asp Phe Glu Val Cys Asp Asp Val
610 615 620
Thr Leu Pro Glu Asn Gln Pro Gly His Met Val Gln Ile Glu Asp Asp
625 630 635 640
Gly Lys Asn Tyr Met Phe Phe Arg Phe Lys Lys Asp Glu Asn Ile Tyr
645 650 655
Tyr Thr Pro Met Ser Gln Leu Gly Ala Ile Asn Val Val Cys Lys Ala
660 665 670
Gly Gly Lys Thr Val Thr Phe Gly Glu Thr Thr Val Gln Glu Ile Pro
675 680 685
Pro Pro Asp Val Val Pro Ile Lys Val Ser Ile Glu Cys Cys Gly Glu
690 695 700
Pro Trp Asn Thr Ile Phe Lys Lys Ala Tyr Lys Glu Pro Ile Glu Val
705 710 715 720
Asp Thr Asp Leu Thr Val Glu Gln Leu Leu Ser Val Ile Tyr Glu Lys
725 730 735
Met Cys Asp Asp Leu Lys Leu Phe Pro Glu Ala Pro Glu Pro Pro Pro
740 745 750
Phe Glu Asn Val Ala Leu Val Asp Lys Asn Gly Lys Asp Leu Asp Cys
755 760 765
Ile Lys Ser Cys His Leu Ile Tyr Arg Asp Tyr Glu Ser Asp Asp Asp
770 775 780
Ile Glu Glu Glu Asp Ala Glu Glu Cys Asp Thr Asp Ser Gly Glu Ala
785 790 795 800
Glu Glu Cys Asp Thr Asn Ser Glu Cys Glu Glu Glu Asp Glu Asp Thr
805 810 815
Lys Val Leu Ala Leu Ile Gln Asp Pro Ala Ser Ile Lys Tyr Pro Leu
820 825 830
Pro Leu Asp Glu Asp Tyr Ser Val Tyr Asn Gly Cys Ile Val His Lys
835 840 845
Asp Ala Leu Asp Val Val Asn Leu Pro Ser Gly Glu Glu Thr Phe Val
850 855 860
Val Asn Asn Cys Phe Glu Gly Ala Val Lys Pro Leu Pro Gln Lys Val
865 870 875 880
Val Asp Val Leu Gly Asp Trp Gly Glu Ala Val Asp Ala Gln Glu Gln
885 890 895
Leu Cys Gln Gln Glu Pro Leu Gln His Thr Phe Glu Glu Pro Val Glu
900 905 910
Asn Ser Thr Gly Ser Ser Lys Thr Met Thr Glu Gln Val Val Val Glu
915 920 925
Asp Gln Glu Leu Pro Val Val Glu Gln Asp Gln Asp Val Val Val Tyr
930 935 940
Thr Pro Thr Asp Leu Glu Val Ala Lys Glu Thr Ala Glu Glu Val Asp
945 950 955 960
Glu Phe Ile Leu Ile Phe Ala Val Pro Lys Glu Glu Val Val Ser Gln
965 970 975
Lys Asp Gly Ala Gln Ile Lys Gln Glu Pro Ile Gln Val Val Lys Pro
980 985 990
Gln Arg Glu Lys Lys Ala Lys Lys Phe Lys Val Lys Pro Ala Thr Cys
995 1000 1005
Glu Lys Pro Lys Phe Leu Glu Tyr Lys Thr Cys Val Gly Asp Leu
1010 1015 1020
Thr Val Val Ile Ala Lys Ala Leu Asp Glu Phe Lys Glu Phe Cys
1025 1030 1035
Ile Val Asn Ala Ala Asn Glu His Met Thr His Gly Ser Gly Val
1040 1045 1050
Ala Lys Ala Ile Ala Asp Phe Cys Gly Leu Asp Phe Val Glu Tyr
1055 1060 1065
Cys Glu Asp Tyr Val Lys Lys His Gly Pro Gln Gln Arg Leu Val
1070 1075 1080
Thr Pro Ser Phe Val Lys Gly Ile Gln Cys Val Asn Asn Val Val
1085 1090 1095
Gly Pro Arg His Gly Asp Asn Asn Leu His Glu Lys Leu Val Ala
1100 1105 1110
Ala Tyr Lys Asn Val Leu Val Asp Gly Val Val Asn Tyr Val Val
1115 1120 1125
Pro Val Leu Ser Leu Gly Ile Phe Gly Val Asp Phe Lys Met Ser
1130 1135 1140
Ile Asp Ala Met Arg Glu Ala Phe Glu Gly Cys Thr Ile Arg Val
1145 1150 1155
Leu Leu Phe Ser Leu Ser Gln Glu His Ile Asp Tyr Phe Asp Val
1160 1165 1170
Thr Cys Lys Gln Lys Thr Ile Tyr Leu Thr Glu Asp Gly Val Lys
1175 1180 1185
Tyr Arg Ser Ile Val Leu Lys Pro Gly Asp Ser Leu Gly Gln Phe
1190 1195 1200
Gly Gln Val Tyr Ala Lys Asn Lys Ile Val Phe Thr Ala Asp Asp
1205 1210 1215
Val Glu Asp Lys Glu Ile Leu Tyr Val Pro Thr Thr Asp Lys Ser
1220 1225 1230
Ile Leu Glu Tyr Tyr Gly Leu Asp Ala Gln Lys Tyr Val Ile Tyr
1235 1240 1245
Leu Gln Thr Leu Ala Gln Lys Trp Asn Val Gln Tyr Arg Asp Asn
1250 1255 1260
Phe Leu Ile Leu Glu Trp Arg Asp Gly Asn Cys Trp Ile Ser Ser
1265 1270 1275
Ala Ile Val Leu Leu Gln Ala Ala Lys Ile Arg Phe Lys Gly Phe
1280 1285 1290
Leu Thr Glu Ala Trp Ala Lys Leu Leu Gly Gly Asp Pro Thr Asp
1295 1300 1305
Phe Val Ala Trp Cys Tyr Ala Ser Cys Thr Ala Lys Val Gly Asp
1310 1315 1320
Phe Ser Asp Ala Asn Trp Leu Leu Ala Asn Leu Ala Glu His Phe
1325 1330 1335
Asp Ala Asp Tyr Thr Asn Ala Phe Leu Lys Lys Arg Val Ser Cys
1340 1345 1350
Asn Cys Gly Ile Lys Ser Tyr Glu Leu Arg Gly Leu Glu Ala Cys
1355 1360 1365
Ile Gln Pro Val Arg Ala Thr Asn Leu Leu His Phe Lys Thr Gln
1370 1375 1380
Tyr Ser Asn Cys Pro Thr Cys Gly Ala Asn Asn Thr Asp Glu Val
1385 1390 1395
Ile Glu Ala Ser Leu Pro Tyr Leu Leu Leu Phe Ala Thr Asp Gly
1400 1405 1410
Pro Ala Thr Val Asp Cys Asp Glu Asp Ala Val Gly Thr Val Val
1415 1420 1425
Phe Val Gly Ser Thr Asn Ser Gly His Cys Tyr Thr Gln Ala Ala
1430 1435 1440
Gly Gln Ala Phe Asp Asn Leu Ala Lys Asp Arg Lys Phe Gly Lys
1445 1450 1455
Lys Ser Pro Tyr Ile Thr Ala Met Tyr Thr Arg Phe Ala Phe Lys
1460 1465 1470
Asn Glu Thr Ser Leu Pro Val Ala Lys Gln Ser Lys Gly Lys Ser
1475 1480 1485
Lys Ser Val Lys Glu Asp Val Ser Asn Leu Ala Thr Ser Ser Lys
1490 1495 1500
Ala Ser Phe Asp Asn Leu Thr Asp Phe Glu Gln Trp Tyr Asp Ser
1505 1510 1515
Asn Ile Tyr Glu Ser Leu Lys Val Gln Glu Ser Pro Asp Asn Phe
1520 1525 1530
Asp Lys Tyr Val Ser Phe Thr Thr Lys Glu Asp Ser Lys Leu Pro
1535 1540 1545
Leu Thr Leu Lys Val Arg Gly Ile Lys Ser Val Val Asp Phe Arg
1550 1555 1560
Ser Lys Asp Gly Phe Ile Tyr Lys Leu Thr Pro Asp Thr Asp Glu
1565 1570 1575
Asn Ser Lys Ala Pro Val Tyr Tyr Pro Val Leu Asp Ala Ile Ser
1580 1585 1590
Leu Lys Ala Ile Trp Val Glu Gly Asn Ala Asn Phe Val Val Gly
1595 1600 1605
His Pro Asn Tyr Tyr Ser Lys Ser Leu His Ile Pro Thr Phe Trp
1610 1615 1620
Glu Asn Ala Glu Asn Phe Val Lys Met Gly Asp Lys Ile Gly Gly
1625 1630 1635
Val Thr Met Gly Leu Trp Arg Ala Glu His Leu Asn Lys Pro Asn
1640 1645 1650
Leu Glu Arg Ile Phe Asn Ile Ala Lys Lys Ala Ile Val Gly Ser
1655 1660 1665
Ser Val Val Thr Thr Gln Cys Gly Lys Leu Ile Gly Lys Ala Ala
1670 1675 1680
Thr Phe Ile Ala Asp Lys Val Gly Gly Gly Val Val Arg Asn Ile
1685 1690 1695
Thr Asp Ser Ile Lys Gly Leu Cys Gly Ile Thr Arg Gly His Phe
1700 1705 1710
Glu Arg Lys Met Ser Pro Gln Phe Leu Lys Thr Leu Met Phe Phe
1715 1720 1725
Leu Phe Tyr Phe Leu Lys Ala Ser Val Lys Ser Val Val Ala Ser
1730 1735 1740
Tyr Lys Thr Val Leu Cys Lys Val Val Leu Ala Thr Leu Leu Ile
1745 1750 1755
Val Trp Phe Val Tyr Thr Ser Asn Pro Val Met Phe Thr Gly Ile
1760 1765 1770
Arg Val Leu Asp Phe Leu Phe Glu Gly Ser Leu Cys Gly Pro Tyr
1775 1780 1785
Lys Asp Tyr Gly Lys Asp Ser Phe Asp Val Leu Arg Tyr Cys Ala
1790 1795 1800
Asp Asp Phe Ile Cys Arg Val Cys Leu His Asp Lys Asp Ser Leu
1805 1810 1815
His Leu Tyr Lys His Ala Tyr Ser Val Glu Gln Val Tyr Lys Asp
1820 1825 1830
Ala Ala Ser Gly Phe Ile Phe Asn Trp Asn Trp Leu Tyr Leu Val
1835 1840 1845
Phe Leu Ile Leu Phe Val Lys Pro Val Ala Gly Phe Val Ile Ile
1850 1855 1860
Cys Tyr Cys Val Lys Tyr Leu Val Leu Asn Ser Thr Val Leu Gln
1865 1870 1875
Thr Gly Val Cys Phe Leu Asp Trp Phe Val Gln Thr Val Phe Ser
1880 1885 1890
His Phe Asn Phe Met Gly Ala Gly Phe Tyr Phe Trp Leu Phe Tyr
1895 1900 1905
Lys Ile Tyr Ile Gln Val His His Ile Leu Tyr Cys Lys Asp Val
1910 1915 1920
Thr Cys Glu Val Cys Lys Arg Val Ala Arg Ser Asn Arg Gln Glu
1925 1930 1935
Val Ser Val Val Val Gly Gly Arg Lys Gln Ile Val His Val Tyr
1940 1945 1950
Thr Asn Ser Gly Tyr Asn Phe Cys Lys Arg His Asn Trp Tyr Cys
1955 1960 1965
Arg Asn Cys Asp Asp Tyr Gly His Gln Asn Thr Phe Met Ser Pro
1970 1975 1980
Glu Val Ala Gly Glu Leu Ser Glu Lys Leu Lys Arg His Val Lys
1985 1990 1995
Pro Thr Ala Tyr Ala Tyr His Val Val Asp Glu Ala Cys Leu Val
2000 2005 2010
Asp Asp Phe Val Asn Leu Lys Tyr Lys Ala Ala Thr Pro Gly Lys
2015 2020 2025
Asp Ser Ala Ser Ser Ala Val Lys Cys Phe Ser Val Thr Asp Phe
2030 2035 2040
Leu Lys Lys Ala Val Phe Leu Lys Glu Ala Leu Lys Cys Glu Gln
2045 2050 2055
Ile Ser Asn Asp Gly Phe Ile Val Cys Asn Thr Gln Ser Ala His
2060 2065 2070
Ala Leu Glu Glu Ala Lys Asn Ala Ala Ile Tyr Tyr Ala Gln Tyr
2075 2080 2085
Leu Cys Lys Pro Ile Leu Ile Leu Asp Gln Ala Leu Tyr Glu Gln
2090 2095 2100
Leu Val Val Glu Pro Val Ser Lys Ser Val Ile Asp Lys Val Cys
2105 2110 2115
Ser Ile Leu Ser Ser Ile Ile Ser Val Asp Thr Ala Ala Leu Asn
2120 2125 2130
Tyr Lys Ala Gly Thr Leu Arg Asp Ala Leu Leu Ser Ile Thr Lys
2135 2140 2145
Asp Glu Glu Ala Val Asp Met Ala Ile Phe Cys His Asn His Asp
2150 2155 2160
Val Asp Tyr Thr Gly Asp Gly Phe Thr Asn Val Ile Pro Ser Tyr
2165 2170 2175
Gly Ile Asp Thr Gly Lys Leu Thr Pro Arg Asp Arg Gly Phe Leu
2180 2185 2190
Ile Asn Ala Asp Ala Ser Ile Ala Asn Leu Arg Val Lys Asn Ala
2195 2200 2205
Pro Pro Val Val Trp Lys Phe Ser Glu Leu Ile Lys Leu Ser Asp
2210 2215 2220
Ser Cys Leu Lys Tyr Leu Ile Ser Ala Thr Val Lys Ser Gly Val
2225 2230 2235
Arg Phe Phe Ile Thr Lys Ser Gly Ala Lys Gln Val Ile Ala Cys
2240 2245 2250
His Thr Gln Lys Leu Leu Val Glu Lys Lys Ala Gly Gly Ile Val
2255 2260 2265
Ser Gly Thr Phe Lys Cys Phe Lys Ser Tyr Phe Lys Trp Leu Leu
2270 2275 2280
Ile Phe Tyr Ile Leu Phe Thr Ala Cys Cys Ser Gly Tyr Tyr Tyr
2285 2290 2295
Met Glu Val Ser Lys Ser Phe Val His Pro Met Tyr Asp Val Asn
2300 2305 2310
Ser Thr Leu His Val Glu Gly Phe Lys Val Ile Asp Lys Gly Val
2315 2320 2325
Leu Arg Glu Ile Val Pro Glu Asp Thr Cys Phe Ser Asn Lys Phe
2330 2335 2340
Val Asn Phe Asp Ala Phe Trp Gly Arg Pro Tyr Asp Asn Ser Arg
2345 2350 2355
Asn Cys Pro Ile Val Thr Ala Val Ile Asp Gly Asp Gly Thr Val
2360 2365 2370
Ala Thr Gly Val Pro Gly Phe Val Ser Trp Val Met Asp Gly Val
2375 2380 2385
Met Phe Ile His Met Thr Gln Thr Glu Arg Lys Pro Trp Tyr Ile
2390 2395 2400
Pro Thr Trp Phe Asn Arg Glu Ile Val Gly Tyr Thr Gln Asp Ser
2405 2410 2415
Ile Ile Thr Glu Gly Ser Phe Tyr Thr Ser Ile Ala Leu Phe Ser
2420 2425 2430
Ala Arg Cys Leu Tyr Leu Thr Ala Ser Asn Thr Pro Gln Leu Tyr
2435 2440 2445
Cys Phe Asn Gly Asp Asn Asp Ala Pro Gly Ala Leu Pro Phe Gly
2450 2455 2460
Ser Ile Ile Pro His Arg Val Tyr Phe Gln Pro Asn Gly Val Arg
2465 2470 2475
Leu Ile Val Pro Gln Gln Ile Leu His Thr Pro Tyr Val Val Lys
2480 2485 2490
Phe Val Ser Asp Ser Tyr Cys Arg Gly Ser Val Cys Glu Tyr Thr
2495 2500 2505
Arg Pro Gly Tyr Cys Val Ser Leu Asn Pro Gln Trp Val Leu Phe
2510 2515 2520
Asn Asp Glu Tyr Thr Ser Lys Pro Gly Val Phe Cys Gly Ser Thr
2525 2530 2535
Val Arg Glu Leu Met Phe Ser Met Val Ser Thr Phe Phe Thr Gly
2540 2545 2550
Val Asn Pro Asn Ile Tyr Met Gln Leu Ala Thr Met Phe Leu Ile
2555 2560 2565
Leu Val Val Val Val Leu Ile Phe Ala Met Val Ile Lys Phe Gln
2570 2575 2580
Gly Val Phe Lys Ala Tyr Ala Thr Thr Val Phe Ile Thr Met Leu
2585 2590 2595
Val Trp Val Ile Asn Ala Phe Ile Leu Cys Val His Ser Tyr Asn
2600 2605 2610
Ser Val Leu Ala Val Ile Leu Leu Val Leu Tyr Cys Tyr Ala Ser
2615 2620 2625
Leu Val Thr Ser Arg Asn Thr Val Ile Ile Met His Cys Trp Leu
2630 2635 2640
Val Phe Thr Phe Gly Leu Ile Val Pro Thr Trp Leu Ala Cys Cys
2645 2650 2655
Tyr Leu Gly Phe Ile Ile Tyr Met Tyr Thr Pro Leu Phe Leu Trp
2660 2665 2670
Cys Tyr Gly Thr Thr Lys Asn Thr Arg Lys Leu Tyr Asp Gly Asn
2675 2680 2685
Glu Phe Val Gly Asn Tyr Asp Leu Ala Ala Lys Ser Thr Phe Val
2690 2695 2700
Ile Arg Gly Ser Glu Phe Val Lys Leu Thr Asn Glu Ile Gly Asp
2705 2710 2715
Lys Phe Glu Ala Tyr Leu Ser Ala Tyr Ala Arg Leu Lys Tyr Tyr
2720 2725 2730
Ser Gly Thr Gly Ser Glu Gln Asp Tyr Leu Gln Ala Cys Arg Ala
2735 2740 2745
Trp Leu Ala Tyr Ala Leu Asp Gln Tyr Arg Asn Ser Gly Val Glu
2750 2755 2760
Ile Val Tyr Thr Pro Pro Arg Tyr Ser Ile Gly Val Ser Arg Leu
2765 2770 2775
Gln Ser Gly Phe Lys Lys Leu Val Ser Pro Ser Ser Ala Val Glu
2780 2785 2790
Lys Cys Ile Val Ser Val Ser Tyr Arg Gly Asn Asn Leu Asn Gly
2795 2800 2805
Leu Trp Leu Gly Asp Thr Ile Tyr Cys Pro Arg His Val Leu Gly
2810 2815 2820
Lys Phe Ser Gly Asp Gln Trp Asn Asp Val Leu Asn Leu Ala Asn
2825 2830 2835
Asn His Glu Phe Glu Val Thr Thr Gln His Gly Val Thr Leu Asn
2840 2845 2850
Val Val Ser Arg Arg Leu Lys Gly Ala Val Leu Ile Leu Gln Thr
2855 2860 2865
Ala Val Ala Asn Ala Glu Thr Pro Lys Tyr Lys Phe Ile Lys Ala
2870 2875 2880
Asn Cys Gly Asp Ser Phe Thr Ile Ala Cys Ala Tyr Gly Gly Thr
2885 2890 2895
Val Val Gly Leu Tyr Pro Val Thr Met Arg Ser Asn Gly Thr Ile
2900 2905 2910
Arg Ala Ser Phe Leu Ala Gly Ala Cys Gly Ser Val Gly Phe Asn
2915 2920 2925
Ile Glu Lys Gly Val Val Asn Phe Phe Tyr Met His His Leu Glu
2930 2935 2940
Leu Pro Asn Ala Leu His Thr Gly Thr Asp Leu Met Gly Glu Phe
2945 2950 2955
Tyr Gly Gly Tyr Val Asp Glu Glu Val Ala Gln Arg Val Pro Pro
2960 2965 2970
Asp Asn Leu Val Thr Asn Asn Ile Val Ala Trp Leu Tyr Ala Ala
2975 2980 2985
Ile Ile Ser Val Lys Glu Ser Ser Phe Ser Leu Pro Lys Trp Leu
2990 2995 3000
Glu Ser Thr Thr Val Ser Val Asp Asp Tyr Asn Lys Trp Ala Gly
3005 3010 3015
Asp Asn Gly Phe Thr Pro Phe Ser Thr Ser Thr Ala Ile Thr Lys
3020 3025 3030
Leu Ser Ala Ile Thr Gly Val Asp Val Cys Lys Leu Leu Arg Thr
3035 3040 3045
Ile Met Val Lys Asn Ser Gln Trp Gly Gly Asp Pro Ile Leu Gly
3050 3055 3060
Gln Tyr Asn Phe Glu Asp Glu Leu Thr Pro Glu Ser Val Phe Asn
3065 3070 3075
Gln Ile Gly Gly Val Arg Leu Gln Ser Ser Phe Val Arg Lys Ala
3080 3085 3090
Thr Ser Trp Phe Trp Ser Arg Cys Val Leu Ala Cys Phe Leu Phe
3095 3100 3105
Val Leu Cys Ala Ile Val Leu Phe Thr Ala Val Pro Leu Lys Phe
3110 3115 3120
Tyr Val Tyr Ala Ala Val Ile Leu Leu Met Ala Val Leu Phe Ile
3125 3130 3135
Ser Phe Thr Val Lys His Val Met Ala Tyr Met Asp Thr Phe Leu
3140 3145 3150
Leu Pro Thr Leu Ile Thr Val Ile Ile Gly Val Cys Ala Glu Val
3155 3160 3165
Pro Phe Ile Tyr Asn Thr Leu Ile Ser Gln Val Val Ile Phe Leu
3170 3175 3180
Ser Gln Trp Tyr Asp Pro Val Val Phe Asp Thr Met Val Pro Trp
3185 3190 3195
Met Phe Leu Pro Leu Val Leu Tyr Thr Ala Phe Lys Cys Val Gln
3200 3205 3210
Gly Cys Tyr Met Asn Ser Phe Asn Thr Ser Leu Leu Met Leu Tyr
3215 3220 3225
Gln Phe Val Lys Leu Gly Phe Val Ile Tyr Thr Ser Ser Asn Thr
3230 3235 3240
Leu Thr Ala Tyr Thr Glu Gly Asn Trp Glu Leu Phe Phe Glu Leu
3245 3250 3255
Val His Thr Thr Val Leu Ala Asn Val Ser Ser Asn Ser Leu Ile
3260 3265 3270
Gly Leu Phe Val Phe Lys Cys Ala Lys Trp Met Leu Tyr Tyr Cys
3275 3280 3285
Asn Ala Thr Tyr Leu Asn Asn Tyr Val Leu Met Ala Val Met Val
3290 3295 3300
Asn Cys Ile Gly Trp Leu Cys Thr Cys Tyr Phe Gly Leu Tyr Trp
3305 3310 3315
Trp Val Asn Lys Val Phe Gly Leu Thr Leu Gly Lys Tyr Asn Phe
3320 3325 3330
Lys Val Ser Val Asp Gln Tyr Arg Tyr Met Cys Leu His Lys Ile
3335 3340 3345
Asn Pro Pro Lys Thr Val Trp Glu Val Phe Ser Thr Asn Ile Leu
3350 3355 3360
Ile Gln Gly Ile Gly Gly Asp Arg Val Leu Pro Ile Ala Thr Val
3365 3370 3375
Gln Ala Lys Leu Ser Asp Val Lys Cys Thr Thr Val Val Leu Met
3380 3385 3390
Gln Leu Leu Thr Lys Leu Asn Val Glu Ala Asn Ser Lys Met His
3395 3400 3405
Val Tyr Leu Val Glu Leu His Asn Lys Ile Leu Ala Ser Asp Asp
3410 3415 3420
Val Gly Glu Cys Met Asp Asn Leu Leu Gly Met Leu Ile Thr Leu
3425 3430 3435
Phe Cys Ile Asp Ser Thr Ile Asp Leu Ser Glu Tyr Cys Asp Asp
3440 3445 3450
Ile Leu Lys Arg Ser Thr Val Leu Gln Ser Val Thr Gln Glu Phe
3455 3460 3465
Ser His Ile Pro Ser Tyr Ala Glu Tyr Glu Arg Ala Lys Asn Leu
3470 3475 3480
Tyr Glu Lys Val Leu Val Asp Ser Lys Asn Gly Gly Val Thr Gln
3485 3490 3495
Gln Glu Leu Ala Ala Tyr Arg Lys Ala Ala Asn Ile Ala Lys Ser
3500 3505 3510
Val Phe Asp Arg Asp Leu Ala Val Gln Lys Lys Leu Asp Ser Met
3515 3520 3525
Ala Glu Arg Ala Met Thr Thr Met Tyr Lys Glu Ala Arg Val Thr
3530 3535 3540
Asp Arg Arg Ala Lys Leu Val Ser Ser Leu His Ala Leu Leu Phe
3545 3550 3555
Ser Met Leu Lys Lys Ile Asp Ser Glu Lys Leu Asn Val Leu Phe
3560 3565 3570
Asp Gln Ala Ser Ser Gly Val Val Pro Leu Ala Thr Val Pro Ile
3575 3580 3585
Val Cys Ser Asn Lys Leu Thr Leu Val Ile Pro Asp Pro Glu Thr
3590 3595 3600
Trp Val Lys Cys Val Glu Gly Val His Val Thr Tyr Ser Thr Val
3605 3610 3615
Val Trp Asn Ile Asp Thr Val Ile Asp Ala Asp Gly Thr Glu Leu
3620 3625 3630
His Pro Thr Ser Thr Gly Ser Gly Leu Thr Tyr Cys Ile Ser Gly
3635 3640 3645
Ala Asn Ile Ala Trp Pro Leu Lys Val Asn Leu Thr Arg Asn Gly
3650 3655 3660
His Asn Lys Val Asp Val Val Leu Gln Asn Asn Glu Leu Met Pro
3665 3670 3675
His Gly Val Lys Thr Lys Ala Cys Val Ala Gly Val Asp Gln Ala
3680 3685 3690
His Cys Ser Val Glu Ser Lys Cys Tyr Tyr Thr Asn Ile Ser Gly
3695 3700 3705
Asn Ser Val Val Ala Ala Ile Thr Ser Ser Asn Pro Asn Leu Lys
3710 3715 3720
Val Ala Ser Phe Leu Asn Glu Ala Gly Asn Gln Ile Tyr Val Asp
3725 3730 3735
Leu Asp Pro Pro Cys Lys Phe Gly Met Lys Val Gly Val Lys Val
3740 3745 3750
Glu Val Val Tyr Leu Tyr Phe Ile Lys Asn Thr Arg Ser Ile Val
3755 3760 3765
Arg Gly Met Val Leu Gly Ala Ile Ser Asn Val Val Val Leu Gln
3770 3775 3780
Ser Lys Gly His Glu Thr Glu Glu Val Asp Ala Val Gly Ile Leu
3785 3790 3795
Ser Leu Cys Ser Phe Ala Val Asp Pro Ala Asp Thr Tyr Cys Lys
3800 3805 3810
Tyr Val Ala Ala Gly Asn Gln Pro Leu Gly Asn Cys Val Lys Met
3815 3820 3825
Leu Thr Val His Asn Gly Ser Gly Phe Ala Ile Thr Ser Lys Pro
3830 3835 3840
Ser Pro Thr Pro Asp Gln Asp Ser Tyr Gly Gly Ala Ser Val Cys
3845 3850 3855
Leu Tyr Cys Arg Ala His Ile Ala His Pro Gly Ser Val Gly Asn
3860 3865 3870
Leu Asp Gly Arg Cys Gln Phe Lys Gly Ser Phe Val Gln Ile Pro
3875 3880 3885
Thr Thr Glu Lys Asp Pro Val Gly Phe Cys Leu Arg Asn Lys Val
3890 3895 3900
Cys Thr Val Cys Gln Cys Trp Ile Gly Tyr Gly Cys Gln Cys Asp
3905 3910 3915
Ser Leu Arg Gln Pro Lys Ser Ser Val Gln Ser Val Ala Gly Ala
3920 3925 3930
Ser Asp Phe Asp Lys Asn Tyr Leu Asn Gly Tyr Gly Val Ala Val
3935 3940 3945
Arg Leu Gly Met Phe Gln Asn Leu Lys Arg Asn Cys Ala Arg Phe
3950 3955 3960
Gln Glu Leu Arg Asp Thr Glu Asp Gly Asn Leu Glu Tyr Leu Asp
3965 3970 3975
Ser Tyr Phe Val Val Lys Gln Thr Thr Pro Ser Asn Tyr Glu His
3980 3985 3990
Glu Lys Ser Cys Tyr Glu Asp Leu Lys Ser Glu Val Thr Ala Asp
3995 4000 4005
His Asp Phe Phe Val Phe Asn Lys Asn Ile Tyr Asn Ile Ser Arg
4010 4015 4020
Gln Arg Leu Thr Lys Tyr Thr Met Met Asp Phe Cys Tyr Ala Leu
4025 4030 4035
Arg His Phe Asp Pro Lys Asp Cys Glu Val Leu Lys Glu Ile Leu
4040 4045 4050
Val Thr Tyr Gly Cys Ile Glu Asp Tyr His Pro Lys Trp Phe Glu
4055 4060 4065
Glu Asn Lys Asp Trp Tyr Asp Pro Ile Glu Asn Ser Lys Tyr Tyr
4070 4075 4080
Val Met Leu Ala Lys Met Gly Pro Ile Val Arg Arg Ala Leu Leu
4085 4090 4095
Asn Ala Ile Glu Phe Gly Asn Leu Met Val Glu Lys Gly Tyr Val
4100 4105 4110
Gly Val Ile Thr Leu Asp Asn Gln Asp Leu Asn Gly Lys Phe Tyr
4115 4120 4125
Asp Phe Gly Asp Phe Gln Lys Thr Ala Pro Gly Ala Gly Val Pro
4130 4135 4140
Val Phe Asp Thr Tyr Tyr Ser Tyr Met Met Pro Ile Ile Ala Met
4145 4150 4155
Thr Asp Ala Leu Ala Pro Glu Arg Tyr Phe Glu Tyr Asp Val His
4160 4165 4170
Lys Gly Tyr Lys Ser Tyr Asp Leu Leu Lys Tyr Asp Tyr Thr Glu
4175 4180 4185
Glu Lys Gln Glu Leu Phe Gln Lys Tyr Phe Lys Tyr Trp Asp Gln
4190 4195 4200
Glu Tyr His Pro Asn Cys Arg Asp Cys Ser Asp Asp Arg Cys Leu
4205 4210 4215
Ile His Cys Ala Asn Phe Asn Ile Leu Phe Ser Thr Leu Ile Pro
4220 4225 4230
Gln Thr Ser Phe Gly Asn Leu Cys Arg Lys Val Phe Val Asp Gly
4235 4240 4245
Val Pro Phe Ile Ala Thr Cys Gly Tyr His Ser Lys Glu Leu Gly
4250 4255 4260
Val Ile Met Asn Gln Asp Asn Thr Met Ser Phe Ser Lys Met Gly
4265 4270 4275
Leu Ser Gln Leu Met Gln Phe Val Gly Asp Pro Ala Leu Leu Val
4280 4285 4290
Gly Thr Ser Asn Asn Leu Val Asp Leu Arg Thr Ser Cys Phe Ser
4295 4300 4305
Val Cys Ala Leu Thr Ser Gly Ile Thr His Gln Thr Val Lys Pro
4310 4315 4320
Gly His Phe Asn Lys Asp Phe Tyr Asp Phe Ala Glu Lys Ala Gly
4325 4330 4335
Met Phe Lys Glu Gly Ser Ser Ile Pro Leu Lys His Phe Phe Tyr
4340 4345 4350
Pro Gln Thr Gly Asn Ala Ala Ile Asn Asp Tyr Asp Tyr Tyr Arg
4355 4360 4365
Tyr Asn Arg Pro Thr Met Phe Asp Ile Cys Gln Leu Leu Phe Cys
4370 4375 4380
Leu Glu Val Thr Ser Lys Tyr Phe Glu Cys Tyr Glu Gly Gly Cys
4385 4390 4395
Ile Pro Ala Ser Gln Val Val Val Asn Asn Leu Asp Lys Ser Ala
4400 4405 4410
Gly Tyr Pro Phe Asn Lys Phe Gly Lys Ala Arg Leu Tyr Tyr Glu
4415 4420 4425
Met Ser Leu Glu Glu Gln Asp Gln Leu Phe Glu Ile Thr Lys Lys
4430 4435 4440
Asn Val Leu Pro Thr Ile Thr Gln Met Asn Leu Lys Tyr Ala Ile
4445 4450 4455
Ser Ala Lys Asn Arg Ala Arg Thr Val Ala Gly Val Ser Ile Leu
4460 4465 4470
Ser Thr Met Thr Asn Arg Gln Phe His Gln Lys Ile Leu Lys Ser
4475 4480 4485
Ile Val Asn Thr Arg Asn Ala Ser Val Val Ile Gly Thr Thr Lys
4490 4495 4500
Phe Tyr Gly Gly Trp Asp Asn Met Leu Arg Asn Leu Ile Gln Gly
4505 4510 4515
Val Glu Asp Pro Ile Leu Met Gly Trp Asp Tyr Pro Lys Cys Asp
4520 4525 4530
Arg Ala Met Pro Asn Leu Leu Arg Ile Ala Ala Ser Leu Val Leu
4535 4540 4545
Ala Arg Lys His Thr Asn Cys Cys Ser Trp Ser Glu Arg Ile Tyr
4550 4555 4560
Arg Leu Tyr Asn Glu Cys Ala Gln Val Leu Ser Glu Thr Val Leu
4565 4570 4575
Ala Thr Gly Gly Ile Tyr Val Lys Pro Gly Gly Thr Ser Ser Gly
4580 4585 4590
Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val Phe Asn Ile Ile Gln
4595 4600 4605
Ala Thr Ser Ala Asn Val Ala Arg Leu Leu Ser Val Ile Thr Arg
4610 4615 4620
Asp Ile Val Tyr Asp Asn Ile Lys Ser Leu Gln Tyr Glu Leu Tyr
4625 4630 4635
Gln Gln Val Tyr Arg Arg Val Asn Phe Asp Pro Ala Phe Val Glu
4640 4645 4650
Lys Phe Tyr Ser Tyr Leu Cys Lys Asn Phe Ser Leu Met Ile Leu
4655 4660 4665
Ser Asp Asp Gly Val Val Cys Tyr Asn Asn Thr Leu Ala Lys Gln
4670 4675 4680
Gly Leu Val Ala Asp Ile Ser Gly Phe Arg Glu Val Leu Tyr Tyr
4685 4690 4695
Gln Asn Asn Val Phe Met Ala Asp Ser Lys Cys Trp Val Glu Pro
4700 4705 4710
Asp Leu Glu Lys Gly Pro His Glu Phe Cys Ser Gln His Thr Met
4715 4720 4725
Leu Val Glu Val Asp Gly Glu Pro Lys Tyr Leu Pro Tyr Pro Asp
4730 4735 4740
Pro Ser Arg Ile Leu Gly Ala Cys Val Phe Val Asp Asp Val Asp
4745 4750 4755
Lys Thr Glu Pro Val Ala Val Met Glu Arg Tyr Ile Ala Leu Ala
4760 4765 4770
Ile Asp Ala Tyr Pro Leu Val His His Glu Asn Glu Glu Tyr Lys
4775 4780 4785
Lys Val Phe Phe Val Leu Leu Ala Tyr Ile Arg Lys Leu Tyr Gln
4790 4795 4800
Glu Leu Ser Gln Asn Met Leu Met Asp Tyr Ser Phe Val Met Asp
4805 4810 4815
Ile Asp Lys Gly Ser Lys Phe Trp Glu Gln Glu Phe Tyr Glu Asn
4820 4825 4830
Met Tyr Arg Ala Pro Thr Thr Leu Gln Ser Cys Gly Val Cys Val
4835 4840 4845
Val Cys Asn Ser Gln Thr Ile Leu Arg Cys Gly Asn Cys Ile Arg
4850 4855 4860
Lys Pro Phe Leu Cys Cys Lys Cys Cys Tyr Asp His Val Met His
4865 4870 4875
Thr Asp His Lys Asn Val Leu Ser Ile Asn Pro Tyr Ile Cys Ser
4880 4885 4890
Gln Leu Gly Cys Gly Glu Ala Asp Val Thr Lys Leu Tyr Leu Gly
4895 4900 4905
Gly Met Ser Tyr Phe Cys Gly Asn His Lys Pro Lys Leu Ser Ile
4910 4915 4920
Pro Leu Val Ser Asn Gly Thr Val Phe Gly Ile Tyr Arg Ala Asn
4925 4930 4935
Cys Ala Gly Ser Glu Asn Val Asp Asp Phe Asn Gln Leu Ala Thr
4940 4945 4950
Thr Asn Trp Ser Ile Val Glu Pro Tyr Ile Leu Ala Asn Arg Cys
4955 4960 4965
Ser Asp Ser Leu Arg Arg Phe Ala Ala Glu Thr Val Lys Ala Thr
4970 4975 4980
Glu Glu Leu His Lys Gln Gln Phe Ala Ser Ala Glu Val Arg Glu
4985 4990 4995
Val Phe Ser Asp Arg Glu Leu Ile Leu Ser Trp Glu Pro Gly Lys
5000 5005 5010
Thr Arg Pro Pro Leu Asn Arg Asn Tyr Val Phe Thr Gly Tyr His
5015 5020 5025
Phe Thr Arg Thr Ser Lys Val Gln Leu Gly Asp Phe Thr Phe Glu
5030 5035 5040
Lys Gly Glu Gly Lys Asp Val Val Tyr Tyr Lys Ala Thr Ser Thr
5045 5050 5055
Ala Lys Leu Ser Val Gly Asp Ile Phe Val Leu Thr Ser His Asn
5060 5065 5070
Val Val Ser Leu Val Ala Pro Thr Leu Cys Pro Gln Gln Thr Phe
5075 5080 5085
Ser Arg Phe Val Asn Leu Arg Pro Asn Val Met Val Pro Glu Cys
5090 5095 5100
Phe Val Asn Asn Ile Pro Leu Tyr His Leu Val Gly Lys Gln Lys
5105 5110 5115
Arg Thr Thr Val Gln Gly Pro Pro Gly Ser Gly Lys Ser His Phe
5120 5125 5130
Ala Ile Gly Leu Ala Val Tyr Phe Ser Ser Ala Arg Val Val Phe
5135 5140 5145
Thr Ala Cys Ser His Ala Ala Val Asp Ala Leu Cys Glu Lys Ala
5150 5155 5160
Phe Lys Phe Leu Lys Val Asp Asp Cys Thr Arg Ile Val Pro Gln
5165 5170 5175
Arg Thr Thr Val Asp Cys Phe Ser Lys Phe Lys Ala Asn Asp Thr
5180 5185 5190
Gly Lys Lys Tyr Ile Phe Ser Thr Ile Asn Ala Leu Pro Glu Val
5195 5200 5205
Ser Cys Asp Ile Leu Leu Val Asp Glu Val Ser Met Leu Thr Asn
5210 5215 5220
Tyr Glu Leu Ser Phe Ile Asn Gly Lys Ile Asn Tyr Gln Tyr Val
5225 5230 5235
Val Tyr Val Gly Asp Pro Ala Gln Leu Pro Ala Pro Arg Thr Leu
5240 5245 5250
Leu Asn Gly Ser Leu Ser Pro Lys Asp Tyr Asn Val Val Thr Asn
5255 5260 5265
Leu Met Val Cys Val Lys Pro Asp Ile Phe Leu Ala Lys Cys Tyr
5270 5275 5280
Arg Cys Pro Lys Glu Ile Val Asp Thr Val Ser Thr Leu Val Tyr
5285 5290 5295
Asp Gly Lys Phe Ile Ala Asn Asn Pro Glu Ser Arg Glu Cys Phe
5300 5305 5310
Lys Val Ile Val Asn Asn Gly Asn Ser Asp Val Gly His Glu Ser
5315 5320 5325
Gly Ser Ala Tyr Asn Thr Thr Gln Leu Glu Phe Val Lys Asp Phe
5330 5335 5340
Val Cys Arg Asn Lys Gln Trp Arg Glu Ala Ile Phe Ile Ser Pro
5345 5350 5355
Tyr Asn Ala Met Asn Gln Arg Ala Tyr Arg Met Leu Gly Leu Asn
5360 5365 5370
Val Gln Thr Val Asp Ser Ser Gln Gly Ser Glu Tyr Asp Tyr Val
5375 5380 5385
Ile Phe Cys Val Thr Ala Asp Ser Gln His Ala Leu Asn Ile Asn
5390 5395 5400
Arg Phe Asn Val Ala Leu Thr Arg Ala Lys Arg Gly Ile Leu Val
5405 5410 5415
Val Met Arg Gln Arg Asp Glu Leu Tyr Ser Ala Leu Lys Phe Thr
5420 5425 5430
Glu Leu Asp Ser Glu Thr Ser Leu Gln Gly Thr Gly Leu Phe Lys
5435 5440 5445
Ile Cys Asn Lys Glu Phe Ser Gly Val His Pro Ala Tyr Ala Val
5450 5455 5460
Thr Thr Lys Ala Leu Ala Ala Thr Tyr Lys Val Asn Asp Glu Leu
5465 5470 5475
Ala Ala Leu Val Asn Val Glu Ala Gly Ser Glu Ile Thr Tyr Lys
5480 5485 5490
His Leu Ile Ser Leu Leu Gly Phe Lys Met Ser Val Asn Val Glu
5495 5500 5505
Gly Cys His Asn Met Phe Ile Thr Arg Asp Glu Ala Ile Arg Asn
5510 5515 5520
Val Arg Gly Trp Val Gly Phe Asp Val Glu Ala Thr His Ala Cys
5525 5530 5535
Gly Thr Asn Ile Gly Thr Asn Leu Pro Phe Gln Val Gly Phe Ser
5540 5545 5550
Thr Gly Ala Asp Phe Val Val Thr Pro Glu Gly Leu Val Asp Thr
5555 5560 5565
Ser Ile Gly Asn Asn Phe Glu Pro Val Asn Ser Lys Ala Pro Pro
5570 5575 5580
Gly Glu Gln Phe Asn His Leu Arg Val Leu Phe Lys Ser Ala Lys
5585 5590 5595
Pro Trp His Val Ile Arg Pro Arg Ile Val Gln Met Leu Ala Asp
5600 5605 5610
Asn Leu Cys Asn Val Ser Asp Cys Val Val Phe Val Thr Trp Cys
5615 5620 5625
His Gly Leu Glu Leu Thr Thr Leu Arg Tyr Phe Val Lys Ile Gly
5630 5635 5640
Lys Glu Gln Val Cys Ser Cys Gly Ser Arg Ala Thr Thr Phe Asn
5645 5650 5655
Ser His Thr Gln Ala Tyr Ala Cys Trp Lys His Cys Leu Gly Phe
5660 5665 5670
Asp Phe Val Tyr Asn Pro Leu Leu Val Asp Ile Gln Gln Trp Gly
5675 5680 5685
Tyr Ser Gly Asn Leu Gln Phe Asn His Asp Leu His Cys Asn Val
5690 5695 5700
His Gly His Ala His Val Ala Ser Val Asp Ala Ile Met Thr Arg
5705 5710 5715
Cys Leu Ala Ile Asn Asn Ala Phe Cys Gln Asp Val Asn Trp Asp
5720 5725 5730
Leu Thr Tyr Pro His Ile Ala Asn Glu Asp Glu Val Asn Ser Ser
5735 5740 5745
Cys Arg Tyr Leu Gln Arg Met Tyr Leu Asn Ala Cys Val Asp Ala
5750 5755 5760
Leu Lys Val Asn Val Val Tyr Asp Ile Gly Asn Pro Lys Gly Ile
5765 5770 5775
Lys Cys Val Arg Arg Gly Asp Val Asn Phe Arg Phe Tyr Asp Lys
5780 5785 5790
Asn Pro Ile Val Arg Asn Val Lys Gln Phe Glu Tyr Asp Tyr Asn
5795 5800 5805
Gln His Lys Asp Lys Phe Ala Asp Gly Leu Cys Met Phe Trp Asn
5810 5815 5820
Cys Asn Val Asp Cys Tyr Pro Asp Asn Ser Leu Val Cys Arg Tyr
5825 5830 5835
Asp Thr Arg Asn Leu Ser Val Phe Asn Leu Pro Gly Cys Asn Gly
5840 5845 5850
Gly Ser Leu Tyr Val Asn Lys His Ala Phe Tyr Thr Pro Lys Phe
5855 5860 5865
Asp Arg Ile Ser Phe Arg Asn Leu Lys Ala Met Pro Phe Phe Phe
5870 5875 5880
Tyr Asp Ser Ser Pro Cys Glu Thr Ile Gln Val Asp Gly Val Ala
5885 5890 5895
Gln Asp Leu Val Ser Leu Ala Thr Lys Asp Cys Ile Thr Lys Cys
5900 5905 5910
Asn Ile Gly Gly Ala Val Cys Lys Lys His Ala Gln Met Tyr Ala
5915 5920 5925
Glu Phe Val Thr Ser Tyr Asn Ala Ala Val Thr Ala Gly Phe Thr
5930 5935 5940
Phe Trp Val Thr Asn Lys Leu Asn Pro Tyr Asn Leu Trp Lys Ser
5945 5950 5955
Phe Ser Ala Leu Gln Ser Ile Asp Asn Ile Ala Tyr Asn Met Tyr
5960 5965 5970
Lys Gly Gly His Tyr Asp Ala Ile Ala Gly Glu Met Pro Thr Val
5975 5980 5985
Ile Thr Gly Asp Lys Val Phe Val Ile Asp Gln Gly Val Glu Lys
5990 5995 6000
Ala Val Phe Val Asn Gln Thr Thr Leu Pro Thr Ser Val Ala Phe
6005 6010 6015
Glu Leu Tyr Ala Lys Arg Asn Ile Arg Thr Leu Pro Asn Asn Arg
6020 6025 6030
Ile Leu Lys Gly Leu Gly Val Asp Val Thr Asn Gly Phe Val Ile
6035 6040 6045
Trp Asp Tyr Ala Asn Gln Thr Pro Leu Tyr Arg Asn Thr Val Lys
6050 6055 6060
Val Cys Ala Tyr Thr Asp Ile Glu Pro Asn Gly Leu Val Val Leu
6065 6070 6075
Tyr Asp Asp Arg Tyr Gly Asp Tyr Gln Ser Phe Leu Ala Ala Asp
6080 6085 6090
Asn Ala Val Leu Val Ser Thr Gln Cys Tyr Lys Arg Tyr Ser Tyr
6095 6100 6105
Val Glu Ile Pro Ser Asn Leu Leu Val Gln Asn Gly Met Pro Leu
6110 6115 6120
Lys Asp Gly Ala Asn Leu Tyr Val Tyr Lys Arg Val Asn Gly Ala
6125 6130 6135
Phe Val Thr Leu Pro Asn Thr Ile Asn Thr Gln Gly Arg Ser Tyr
6140 6145 6150
Glu Thr Phe Glu Pro Arg Ser Asp Ile Glu Arg Asp Phe Leu Ala
6155 6160 6165
Met Ser Glu Glu Ser Phe Val Glu Arg Tyr Gly Lys Asp Leu Gly
6170 6175 6180
Leu Gln His Ile Leu Tyr Gly Glu Val Asp Lys Pro Gln Leu Gly
6185 6190 6195
Gly Leu His Thr Val Ile Gly Met Tyr Arg Leu Leu Arg Ala Asn
6200 6205 6210
Lys Leu Asn Ala Lys Ser Val Thr Asn Ser Asp Ser Asp Val Met
6215 6220 6225
Gln Asn Tyr Phe Val Leu Ser Asp Asn Gly Ser Tyr Lys Gln Val
6230 6235 6240
Cys Thr Val Val Asp Leu Leu Leu Asp Asp Phe Leu Glu Leu Leu
6245 6250 6255
Arg Asn Ile Leu Lys Glu Tyr Gly Thr Asn Lys Ser Lys Val Val
6260 6265 6270
Thr Val Ser Ile Asp Tyr His Ser Ile Asn Phe Met Thr Trp Phe
6275 6280 6285
Glu Asp Gly Ser Ile Lys Thr Cys Tyr Pro Gln Leu Gln Ser Ala
6290 6295 6300
Trp Thr Cys Gly Tyr Asn Met Pro Glu Leu Tyr Lys Val Gln Asn
6305 6310 6315
Cys Val Met Glu Pro Cys Asn Ile Pro Asn Tyr Gly Val Gly Ile
6320 6325 6330
Thr Leu Pro Ser Gly Ile Leu Met Asn Val Ala Lys Tyr Thr Gln
6335 6340 6345
Leu Cys Gln Tyr Leu Ser Lys Thr Thr Ile Cys Val Pro His Asn
6350 6355 6360
Met Arg Val Met His Phe Gly Ala Gly Ser Asp Lys Gly Val Ala
6365 6370 6375
Pro Gly Ser Thr Val Leu Lys Gln Trp Leu Pro Glu Gly Thr Leu
6380 6385 6390
Leu Val Asp Asn Asp Ile Val Asp Tyr Val Ser Asp Ala His Val
6395 6400 6405
Ser Val Leu Ser Asp Cys Asn Lys Tyr Asn Thr Glu His Lys Phe
6410 6415 6420
Asp Leu Val Ile Ser Asp Met Tyr Thr Asp Asn Asp Ser Lys Arg
6425 6430 6435
Lys His Glu Gly Val Ile Ala Asn Asn Gly Asn Asp Asp Val Phe
6440 6445 6450
Ile Tyr Leu Ser Ser Phe Leu Arg Asn Asn Leu Ala Leu Gly Gly
6455 6460 6465
Ser Phe Ala Val Lys Val Thr Glu Thr Ser Trp His Glu Val Leu
6470 6475 6480
Tyr Asp Ile Ala Gln Asp Cys Ala Trp Trp Thr Met Phe Cys Thr
6485 6490 6495
Ala Val Asn Ala Ser Ser Ser Glu Ala Phe Leu Ile Gly Val Asn
6500 6505 6510
Tyr Leu Gly Ala Ser Glu Lys Val Lys Val Ser Gly Lys Thr Leu
6515 6520 6525
His Ala Asn Tyr Ile Phe Trp Arg Asn Cys Asn Tyr Leu Gln Thr
6530 6535 6540
Ser Ala Tyr Ser Ile Phe Asp Val Ala Lys Phe Asp Leu Arg Leu
6545 6550 6555
Lys Ala Thr Pro Val Val Asn Leu Lys Thr Glu Gln Lys Thr Asp
6560 6565 6570
Leu Val Phe Asn Leu Ile Lys Cys Gly Lys Leu Leu Val Arg Asp
6575 6580 6585
Val Gly Asn Thr Ser Phe Thr Ser Asp Ser Phe Val Cys Thr Met
6590 6595 6600
<210> 22
<211> 7058
<212> PRT
<213> Bovine Coronavirus
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<400> 22
Met Ser Lys Ile Asn Lys Tyr Gly Leu Glu Leu His Trp Ala Pro Glu
1 5 10 15
Phe Pro Trp Met Phe Glu Asp Ala Glu Glu Lys Leu Asp Asn Pro Ser
20 25 30
Ser Ser Glu Val Asp Ile Val Cys Ser Thr Thr Ala Gln Lys Leu Glu
35 40 45
Thr Gly Gly Ile Cys Pro Glu Asn His Val Met Val Asp Cys Arg Arg
50 55 60
Leu Leu Lys Gln Glu Cys Cys Val Gln Ser Ser Leu Ile Arg Glu Ile
65 70 75 80
Val Met Asn Thr Arg Pro Tyr Asp Leu Glu Val Leu Leu Gln Asp Ala
85 90 95
Leu Gln Ser Arg Glu Ala Val Leu Val Thr Pro Pro Leu Gly Met Ser
100 105 110
Leu Glu Ala Cys Tyr Val Arg Gly Cys Asn Pro Asn Gly Trp Thr Met
115 120 125
Gly Leu Phe Arg Arg Arg Ser Val Cys Asn Thr Gly Arg Cys Ala Val
130 135 140
Asn Lys His Val Ala Tyr Gln Leu Tyr Met Ile Asp Pro Ala Gly Val
145 150 155 160
Cys Phe Gly Ala Gly Gln Phe Val Gly Trp Val Ile Pro Leu Ala Phe
165 170 175
Met Pro Val Gln Ser Arg Lys Phe Ile Ala Pro Trp Val Met Tyr Leu
180 185 190
Arg Lys Cys Gly Glu Lys Gly Ala Tyr Ile Lys Asp Tyr Lys Arg Gly
195 200 205
Gly Phe Glu His Val Tyr Asn Phe Lys Val Glu Asp Ala Tyr Asp Leu
210 215 220
Val His Asp Glu Pro Lys Gly Lys Phe Ser Lys Lys Ala Tyr Ala Leu
225 230 235 240
Ile Arg Gly Tyr Arg Gly Val Lys Pro Leu Leu Tyr Val Asp Gln Tyr
245 250 255
Gly Cys Asp Tyr Thr Gly Gly Leu Ala Asp Gly Leu Glu Ala Tyr Ala
260 265 270
Asp Lys Thr Leu Gln Glu Met Lys Ala Leu Phe Pro Ile Trp Ser Gln
275 280 285
Glu Leu Pro Phe Asp Val Thr Val Ala Trp His Val Val Arg Asp Pro
290 295 300
Arg Tyr Val Met Arg Leu Gln Ser Ala Ser Thr Ile Arg Ser Val Ala
305 310 315 320
Tyr Val Ala Asn Pro Thr Glu Asp Leu Cys Asp Gly Ser Val Val Ile
325 330 335
Lys Glu Pro Val His Val Tyr Ala Asp Asp Ser Ile Ile Leu Arg Gln
340 345 350
His Asn Leu Val Asp Ile Met Ser Cys Phe Tyr Met Glu Ala Asp Ala
355 360 365
Val Val Asn Ala Phe Tyr Gly Val Asp Leu Lys Asp Cys Gly Phe Val
370 375 380
Met Gln Phe Gly Tyr Ile Asp Cys Glu Gln Asp Leu Cys Asp Phe Lys
385 390 395 400
Gly Trp Val Pro Gly Asn Met Ile Asp Gly Phe Ala Cys Thr Thr Cys
405 410 415
Gly His Val Tyr Glu Thr Gly Asp Leu Leu Ala Gln Ser Ser Gly Val
420 425 430
Leu Pro Val Asn Pro Val Leu His Thr Lys Ser Ala Ala Gly Tyr Gly
435 440 445
Gly Phe Gly Cys Lys Asp Ser Phe Thr Leu Tyr Gly Gln Thr Val Val
450 455 460
Tyr Phe Gly Gly Cys Val Tyr Trp Ser Pro Ala Arg Asn Ile Trp Ile
465 470 475 480
Pro Ile Leu Lys Ser Ser Val Lys Ser Tyr Asp Gly Leu Val Tyr Thr
485 490 495
Gly Val Val Gly Cys Lys Ala Ile Val Lys Glu Thr Asn Leu Ile Cys
500 505 510
Lys Ala Leu Tyr Leu Asp Tyr Val Gln His Lys Cys Gly Asn Leu His
515 520 525
Gln Arg Glu Leu Leu Gly Val Ser Asp Val Trp His Lys Gln Leu Leu
530 535 540
Leu Asn Arg Gly Val Tyr Lys Pro Leu Leu Glu Asn Ile Asp Tyr Phe
545 550 555 560
Asn Met Arg Arg Ala Lys Phe Ser Leu Glu Thr Phe Thr Val Cys Ala
565 570 575
Asp Gly Phe Met Pro Phe Leu Leu Asp Asp Leu Val Pro Arg Ala Tyr
580 585 590
Tyr Leu Ala Val Ser Gly Gln Ala Phe Cys Asp Tyr Ala Gly Lys Ile
595 600 605
Cys His Ala Val Val Ser Lys Ser Lys Glu Leu Leu Asp Val Ser Val
610 615 620
Asp Ser Leu Gly Ala Ala Ile His Tyr Leu Asn Ser Lys Ile Val Asp
625 630 635 640
Leu Ala Gln His Phe Ser Asp Phe Gly Thr Ser Phe Val Ser Lys Ile
645 650 655
Val His Phe Phe Lys Thr Phe Thr Thr Ser Thr Ala Leu Ala Phe Ala
660 665 670
Trp Val Leu Phe His Val Leu His Gly Ala Tyr Ile Val Val Glu Ser
675 680 685
Asp Ile Tyr Phe Gly Lys Asn Ile Pro Arg Tyr Ala Ser Ala Val Ala
690 695 700
Gln Ala Phe Arg Ser Gly Ala Lys Val Gly Leu Asp Ser Leu Arg Val
705 710 715 720
Thr Phe Ile Asp Gly Leu Ser Cys Phe Lys Ile Gly Arg Arg Arg Ile
725 730 735
Cys Leu Ser Gly Ser Lys Ile Tyr Glu Val Glu Arg Gly Leu Leu His
740 745 750
Ser Ser Gln Leu Pro Leu Asp Val Tyr Asp Leu Thr Met Pro Ser Gln
755 760 765
Val Gln Lys Thr Lys Gln Lys Gly Ile Tyr Leu Lys Gly Ser Gly Ser
770 775 780
Asp Phe Ser Leu Ala Asp Ser Val Val Glu Val Val Thr Thr Ser Leu
785 790 795 800
Thr Pro Cys Gly Tyr Ser Glu Pro Pro Lys Val Ala Asp Lys Ile Cys
805 810 815
Ile Val Asp Asn Val Tyr Met Ala Lys Ala Gly Asp Lys Tyr Tyr Pro
820 825 830
Val Val Val Asp Gly His Val Gly Leu Leu Asp Gln Ala Trp Arg Val
835 840 845
Pro Cys Ala Gly Arg Cys Val Thr Phe Lys Glu Gln Pro Thr Val Asn
850 855 860
Glu Ile Ala Ser Thr Pro Lys Thr Ile Lys Val Phe Tyr Glu Leu Asp
865 870 875 880
Lys Asp Phe Asn Thr Ile Leu Asn Thr Ala Cys Gly Glu Phe Glu Val
885 890 895
Asp Asp Thr Val Asp Met Glu Glu Phe Tyr Ala Val Val Ile Asp Ala
900 905 910
Ile Glu Glu Lys Leu Ser Pro Cys Lys Glu Leu Glu Gly Val Gly Ala
915 920 925
Lys Val Ser Ala Phe Leu Gln Lys Leu Glu Asp Asn Ser Leu Phe Leu
930 935 940
Phe Asp Glu Ala Gly Glu Glu Val Leu Ala Pro Lys Leu Tyr Cys Ala
945 950 955 960
Phe Thr Ala Pro Glu Asp Asp Asp Phe Leu Glu Glu Ser Gly Val Glu
965 970 975
Glu Asp Asp Val Glu Gly Glu Glu Thr Asp Leu Thr Val Thr Ser Ala
980 985 990
Gly Glu Pro Cys Val Ala Ser Glu Gln Glu Glu Ser Ser Glu Ile Leu
995 1000 1005
Glu Asp Thr Leu Asp Asp Gly Pro Cys Val Glu Thr Ser Asp Ser
1010 1015 1020
Gln Val Glu Glu Asp Val Gln Met Ser Asp Phe Gly Asp Leu Glu
1025 1030 1035
Ser Val Ile Gln Asp Tyr Glu Asn Val Cys Phe Glu Phe Tyr Thr
1040 1045 1050
Thr Glu Pro Glu Phe Val Lys Val Leu Asp Leu Tyr Val Pro Lys
1055 1060 1065
Ala Thr Arg Asn Asn Cys Trp Leu Arg Ser Val Leu Ala Val Met
1070 1075 1080
Gln Lys Leu Pro Cys Gln Phe Lys Asp Lys Asn Leu Gln Asp Leu
1085 1090 1095
Trp Val Leu Tyr Lys Gln Gln Tyr Ser Gln Leu Phe Val Asp Thr
1100 1105 1110
Leu Val Asn Lys Ile Pro Ala Asn Ile Val Val Pro Gln Gly Gly
1115 1120 1125
Tyr Val Ala Asp Phe Ala Tyr Trp Phe Leu Thr Leu Cys Asp Trp
1130 1135 1140
Gln Cys Val Ala Tyr Trp Lys Cys Ile Lys Cys Asp Leu Ala Leu
1145 1150 1155
Lys Leu Lys Gly Leu Asp Ala Met Phe Phe Tyr Gly Asp Val Val
1160 1165 1170
Ser His Val Cys Lys Cys Gly Glu Ser Met Val Leu Ile Asp Val
1175 1180 1185
Asp Val Pro Phe Thr Ala His Phe Ala Leu Lys Asp Lys Leu Phe
1190 1195 1200
Cys Ala Phe Ile Thr Lys Arg Ser Val Tyr Lys Ala Ala Cys Val
1205 1210 1215
Val Asp Val Asn Asp Ser His Ser Met Ala Val Val Asp Gly Lys
1220 1225 1230
Gln Ile Asp Asp His Arg Ile Thr Ser Ile Thr Ser Asp Lys Phe
1235 1240 1245
Asp Phe Ile Ile Gly His Gly Thr Ser Phe Ser Met Thr Thr Phe
1250 1255 1260
Glu Ile Ala Gln Leu Tyr Gly Ser Cys Ile Thr Pro Asn Val Cys
1265 1270 1275
Phe Val Lys Gly Asp Ile Ile Lys Val Ser Lys Arg Val Lys Ala
1280 1285 1290
Glu Val Val Val Asn Pro Ala Asn Gly His Met Ala His Gly Gly
1295 1300 1305
Gly Val Ala Lys Ala Ile Ala Val Ala Ala Gly Gln Gln Phe Val
1310 1315 1320
Lys Glu Thr Thr Asp Met Val Lys Ser Lys Gly Val Cys Ala Thr
1325 1330 1335
Gly Asp Cys Tyr Val Ser Thr Gly Gly Lys Leu Cys Lys Thr Val
1340 1345 1350
Leu Asn Val Val Gly Pro Asp Ala Arg Thr Gln Gly Lys Gln Ser
1355 1360 1365
Tyr Ala Leu Leu Glu Arg Val Tyr Lys His Leu Asn Lys Tyr Asp
1370 1375 1380
Cys Val Val Thr Thr Leu Ile Ser Ala Gly Ile Phe Ser Val Pro
1385 1390 1395
Ser Asp Val Ser Leu Thr Tyr Leu Leu Gly Thr Ala Lys Lys Gln
1400 1405 1410
Val Val Leu Val Ser Asn Asn Gln Glu Asp Phe Asp Leu Ile Ser
1415 1420 1425
Lys Cys Gln Ile Thr Ala Val Glu Gly Thr Lys Lys Leu Ala Glu
1430 1435 1440
Arg Leu Ser Phe Asn Val Gly Arg Ser Ile Val Tyr Glu Thr Asp
1445 1450 1455
Ala Asn Lys Leu Ile Leu Ser Asn Asp Val Ala Phe Val Ser Thr
1460 1465 1470
Phe Asn Val Leu Gln Asp Val Leu Ser Leu Arg His Asp Ile Ala
1475 1480 1485
Leu Asp Asp Asp Ala Arg Thr Phe Val Gln Ser Asn Val Asp Val
1490 1495 1500
Val Pro Glu Gly Trp Arg Val Val Asn Lys Phe Tyr Gln Ile Asn
1505 1510 1515
Gly Val Arg Pro Val Lys Tyr Phe Glu Cys Pro Gly Gly Ile Asp
1520 1525 1530
Ile Cys Ser Gln Asp Lys Val Phe Gly Tyr Val Gln Gln Gly Ser
1535 1540 1545
Phe Asn Lys Ala Thr Val Ala Gln Ile Lys Ala Leu Phe Leu Asp
1550 1555 1560
Lys Val Asp Ile Leu Leu Thr Val Asp Gly Val Asn Phe Thr Asn
1565 1570 1575
Arg Phe Val Pro Val Gly Glu Ser Phe Gly Lys Ser Leu Gly Asn
1580 1585 1590
Val Phe Cys Asp Gly Val Asn Val Thr Lys His Lys Cys Asp Ile
1595 1600 1605
Asn Tyr Lys Gly Lys Val Phe Phe Gln Phe Asp Asn Leu Ser Ser
1610 1615 1620
Glu Asp Leu Lys Ala Val Arg Ser Ser Phe Asn Phe Asp Gln Lys
1625 1630 1635
Glu Leu Leu Ala Tyr Tyr Asn Met Leu Val Asn Cys Ser Lys Trp
1640 1645 1650
Gln Val Val Phe Asn Gly Lys Tyr Phe Thr Phe Lys Gln Ala Asn
1655 1660 1665
Asn Asn Cys Phe Val Asn Val Ser Cys Leu Met Leu Gln Ser Leu
1670 1675 1680
Asn Leu Lys Phe Lys Ile Val Gln Trp Gln Glu Ala Trp Leu Glu
1685 1690 1695
Phe Arg Ser Gly Arg Pro Ala Arg Phe Val Ser Leu Val Leu Ala
1700 1705 1710
Lys Gly Gly Phe Lys Phe Gly Asp Pro Ala Asp Ser Arg Asp Phe
1715 1720 1725
Leu Arg Val Val Phe Ser Gln Val Asp Leu Thr Gly Ala Ile Cys
1730 1735 1740
Asp Phe Glu Ile Ala Cys Lys Cys Gly Val Lys Gln Glu Gln Arg
1745 1750 1755
Thr Gly Val Asp Ala Val Met His Phe Gly Thr Leu Ser Arg Glu
1760 1765 1770
Asp Leu Glu Ile Gly Tyr Thr Val Asp Cys Ser Cys Gly Lys Lys
1775 1780 1785
Leu Ile His Cys Val Arg Phe Asp Val Pro Phe Leu Ile Cys Ser
1790 1795 1800
Asn Thr Pro Ala Ser Val Lys Leu Pro Lys Gly Val Gly Ser Ala
1805 1810 1815
Asn Ile Phe Lys Gly Asp Lys Val Gly His Tyr Val His Val Lys
1820 1825 1830
Cys Glu Gln Ser Tyr Gln Leu Tyr Asp Ala Ser Asn Val Lys Lys
1835 1840 1845
Val Thr Asp Val Thr Gly Asn Leu Ser Asp Cys Leu Tyr Leu Lys
1850 1855 1860
Asn Leu Lys Gln Thr Phe Lys Ser Val Leu Thr Thr Tyr Tyr Leu
1865 1870 1875
Asp Asp Val Lys Lys Ile Glu Tyr Lys Pro Asp Leu Ser Gln Tyr
1880 1885 1890
Tyr Cys Asp Gly Gly Lys Tyr Tyr Thr Gln Arg Ile Ile Lys Ala
1895 1900 1905
Gln Phe Lys Thr Phe Glu Lys Val Asp Gly Val Tyr Thr Asn Phe
1910 1915 1920
Lys Leu Ile Gly His Thr Val Cys Asp Ile Leu Asn Ala Lys Leu
1925 1930 1935
Gly Phe Asp Ser Ser Lys Glu Phe Val Glu Tyr Lys Val Thr Glu
1940 1945 1950
Trp Pro Thr Ala Thr Gly Asp Val Val Leu Ala Thr Asp Asp Leu
1955 1960 1965
Tyr Val Lys Arg Tyr Glu Arg Gly Cys Ile Thr Phe Gly Lys Pro
1970 1975 1980
Val Ile Trp Leu Ser His Glu Gln Ala Ser Leu Asn Ser Leu Thr
1985 1990 1995
Tyr Phe Asn Arg Pro Leu Leu Val Asp Glu Asn Lys Phe Asp Val
2000 2005 2010
Leu Lys Val Asp Asp Val Asp Asp Gly Gly Asp Ile Ser Glu Ser
2015 2020 2025
Asp Ala Lys Glu Pro Lys Glu Ile Asn Ile Ile Lys Leu Ser Gly
2030 2035 2040
Val Lys Lys Pro Phe Lys Val Glu Asp Ser Val Ile Val Asn Asp
2045 2050 2055
Asp Thr Ser Glu Ile Lys Tyr Val Lys Ser Leu Ser Ile Val Asp
2060 2065 2070
Val Tyr Asp Met Trp Leu Thr Gly Cys Arg Cys Val Val Arg Thr
2075 2080 2085
Ala Asn Ala Leu Ser Arg Ala Val Asn Val Pro Thr Ile Arg Lys
2090 2095 2100
Phe Ile Lys Phe Gly Met Thr Leu Val Ser Ile Pro Ile Asp Leu
2105 2110 2115
Leu Asn Leu Arg Glu Ile Lys Pro Val Phe Asn Val Val Lys Ala
2120 2125 2130
Val Arg Asn Lys Ile Ser Ala Cys Phe Asn Phe Ile Lys Trp Leu
2135 2140 2145
Phe Val Leu Leu Phe Gly Trp Ile Lys Ile Ser Ala Asp Asn Lys
2150 2155 2160
Val Ile Tyr Thr Thr Glu Val Ala Ser Lys Leu Thr Cys Lys Leu
2165 2170 2175
Val Ala Leu Ala Phe Lys Asn Ala Phe Leu Thr Phe Lys Trp Ser
2180 2185 2190
Val Val Ala Arg Gly Ala Cys Ile Ile Ala Thr Ile Phe Leu Leu
2195 2200 2205
Trp Phe Asn Phe Ile Tyr Ala Asn Val Ile Phe Ser Asp Phe Tyr
2210 2215 2220
Leu Pro Lys Ile Gly Phe Leu Pro Thr Phe Val Gly Lys Ile Val
2225 2230 2235
Gln Trp Ile Lys Asn Thr Phe Ser Leu Val Thr Ile Cys Asp Leu
2240 2245 2250
Tyr Ser Ile Gln Asp Val Gly Phe Lys Asn Gln Tyr Cys Asn Gly
2255 2260 2265
Ser Ile Ala Cys Gln Phe Cys Leu Ala Gly Phe Asp Met Leu Asp
2270 2275 2280
Asn Tyr Lys Ala Ile Asp Val Val Gln Tyr Glu Ala Asp Arg Arg
2285 2290 2295
Ala Phe Val Asp Tyr Thr Gly Val Leu Lys Ile Val Ile Glu Leu
2300 2305 2310
Ile Val Ser Tyr Ala Leu Tyr Thr Ala Trp Phe Tyr Pro Leu Phe
2315 2320 2325
Ala Leu Ile Ser Ile Gln Ile Leu Thr Thr Trp Leu Pro Glu Leu
2330 2335 2340
Leu Met Leu Ser Thr Leu His Trp Ser Val Arg Leu Leu Val Ser
2345 2350 2355
Leu Ala Asn Met Leu Pro Ala His Val Phe Met Arg Phe Tyr Ile
2360 2365 2370
Ile Ile Ala Ser Phe Ile Lys Leu Phe Ser Leu Phe Arg His Val
2375 2380 2385
Ala Tyr Gly Cys Ser Lys Ser Gly Cys Leu Phe Cys Tyr Lys Arg
2390 2395 2400
Asn Arg Ser Leu Arg Val Lys Cys Ser Thr Ile Val Gly Gly Met
2405 2410 2415
Ile Arg Tyr Tyr Asp Val Met Ala Asn Gly Gly Thr Gly Phe Cys
2420 2425 2430
Ser Lys His Gln Trp Asn Cys Ile Asp Cys Asp Ser Tyr Lys Pro
2435 2440 2445
Gly Asn Thr Phe Ile Thr Val Glu Ala Ala Leu Asp Leu Ser Lys
2450 2455 2460
Glu Leu Lys Arg Pro Ile Gln Pro Thr Asp Val Ala Tyr His Thr
2465 2470 2475
Val Thr Asp Val Lys Gln Val Gly Cys Tyr Met Arg Leu Phe Tyr
2480 2485 2490
Asp Arg Asp Gly Gln Arg Thr Tyr Asp Asp Val Asn Ala Ser Leu
2495 2500 2505
Phe Val Asp Tyr Ser Asn Leu Leu His Ser Lys Val Lys Ser Val
2510 2515 2520
Pro Asn Met His Val Val Val Val Glu Asn Asp Ala Asp Lys Ala
2525 2530 2535
Asn Phe Leu Asn Ala Ala Val Phe Tyr Ala Gln Ser Leu Phe Arg
2540 2545 2550
Pro Ile Leu Met Val Asp Lys Ile Leu Ile Thr Thr Ala Asn Thr
2555 2560 2565
Gly Thr Ser Val Thr Glu Thr Met Phe Asp Val Tyr Val Asp Thr
2570 2575 2580
Phe Leu Ser Met Phe Asp Val Asp Lys Lys Ser Leu Asn Ala Leu
2585 2590 2595
Ile Ala Thr Ala His Ser Ser Ile Lys Gln Gly Thr Gln Ile Cys
2600 2605 2610
Lys Val Leu Asp Thr Phe Leu Ser Cys Ala Arg Lys Ser Cys Ser
2615 2620 2625
Ile Asp Ser Asp Val Asp Thr Lys Cys Leu Ala Asp Ser Val Met
2630 2635 2640
Ser Ala Val Ser Ala Gly Leu Glu Leu Thr Asp Glu Ser Cys Asn
2645 2650 2655
Asn Leu Val Pro Thr Tyr Leu Lys Gly Asp Asn Ile Val Ala Ala
2660 2665 2670
Asp Leu Gly Val Leu Ile Gln Asn Ser Ala Lys His Val Gln Gly
2675 2680 2685
Asn Val Ala Lys Ile Ala Gly Val Ser Cys Ile Trp Ser Val Asp
2690 2695 2700
Ala Phe Asn Gln Leu Ser Ser Asp Phe Gln His Lys Leu Lys Lys
2705 2710 2715
Ala Cys Cys Lys Thr Gly Leu Lys Leu Glu Leu Thr Tyr Asn Lys
2720 2725 2730
Gln Met Ala Asn Val Ser Val Leu Thr Thr Pro Phe Ser Leu Lys
2735 2740 2745
Gly Gly Ala Val Phe Ser Tyr Phe Val Tyr Val Cys Phe Val Leu
2750 2755 2760
Ser Leu Val Cys Phe Ile Gly Leu Trp Cys Leu Met Pro Thr Tyr
2765 2770 2775
Thr Val His Lys Ser Asp Phe Gln Leu Pro Val Tyr Ala Ser Tyr
2780 2785 2790
Lys Val Leu Asp Asn Gly Val Ile Arg Asp Val Ser Val Glu Asp
2795 2800 2805
Val Cys Phe Ala Asn Lys Phe Glu Gln Phe Asp Gln Trp Tyr Glu
2810 2815 2820
Ser Thr Phe Gly Leu Ser Tyr Tyr Ser Asn Ser Met Ala Cys Pro
2825 2830 2835
Ile Val Val Ala Val Val Asp Gln Asp Phe Gly Ser Thr Val Phe
2840 2845 2850
Asn Val Pro Thr Lys Val Leu Arg Tyr Gly Tyr His Val Leu His
2855 2860 2865
Phe Ile Thr His Ala Leu Ser Ala Asp Gly Val Gln Cys Tyr Thr
2870 2875 2880
Pro His Ser Gln Ile Ser Tyr Ser Asn Phe Tyr Ala Ser Gly Cys
2885 2890 2895
Val Leu Ser Ser Ala Cys Thr Met Phe Ala Met Ala Asp Gly Ser
2900 2905 2910
Pro Gln Pro Tyr Cys Tyr Thr Asp Gly Leu Met Gln Asn Ala Ser
2915 2920 2925
Leu Tyr Ser Ser Leu Val Pro His Val Arg Tyr Asn Leu Ala Asn
2930 2935 2940
Ala Lys Gly Phe Ile Arg Leu Pro Glu Val Leu Arg Glu Gly Leu
2945 2950 2955
Val Arg Ile Val Arg Thr Arg Ser Met Ser Tyr Cys Arg Val Gly
2960 2965 2970
Leu Cys Glu Glu Ala Asp Glu Gly Ile Cys Phe Asn Phe Asn Gly
2975 2980 2985
Ser Trp Val Leu Asn Asn Asp Tyr Tyr Arg Ser Leu Pro Gly Thr
2990 2995 3000
Phe Cys Gly Arg Asp Val Phe Asp Leu Ile Tyr Gln Leu Phe Lys
3005 3010 3015
Gly Leu Ala Gln Pro Val Asp Phe Leu Ala Leu Thr Ala Ser Ser
3020 3025 3030
Ile Ala Gly Ala Ile Leu Ala Val Ile Val Val Leu Gly Phe Tyr
3035 3040 3045
Tyr Leu Ile Lys Leu Lys Arg Ala Phe Gly Asp Tyr Thr Ser Ile
3050 3055 3060
Val Phe Val Asn Val Ile Val Trp Cys Val Asn Phe Met Met Leu
3065 3070 3075
Phe Val Phe Gln Val Tyr Pro Thr Leu Ser Cys Val Tyr Ala Ile
3080 3085 3090
Cys Tyr Phe Tyr Ala Thr Leu Tyr Phe Pro Ser Glu Ile Ser Val
3095 3100 3105
Ile Met His Leu Gln Trp Leu Val Met Tyr Gly Thr Ile Met Pro
3110 3115 3120
Leu Trp Phe Cys Leu Leu Tyr Ile Ser Val Val Val Ser Asn His
3125 3130 3135
Ala Phe Trp Val Phe Ser Tyr Cys Arg Gln Leu Gly Thr Ser Val
3140 3145 3150
Arg Ser Asp Gly Thr Phe Glu Glu Met Ala Leu Thr Thr Phe Met
3155 3160 3165
Ile Thr Lys Asp Ser Tyr Cys Lys Leu Lys Asn Ser Leu Ser Asp
3170 3175 3180
Val Ala Phe Asn Arg Tyr Leu Ser Leu Tyr Asn Lys Tyr Arg Tyr
3185 3190 3195
Tyr Ser Gly Lys Met Asp Thr Ala Ala Tyr Arg Glu Ala Ala Cys
3200 3205 3210
Ser Gln Leu Ala Lys Ala Met Asp Thr Phe Thr Asn Asn Asn Gly
3215 3220 3225
Ser Asp Val Leu Tyr Gln Pro Pro Thr Ala Ser Val Ser Thr Ser
3230 3235 3240
Phe Leu Gln Ser Gly Ile Val Lys Met Val Asn Pro Thr Ser Lys
3245 3250 3255
Val Glu Pro Cys Ile Val Ser Val Thr Tyr Gly Asn Met Thr Leu
3260 3265 3270
Asn Gly Leu Trp Leu Asp Asp Lys Val Tyr Cys Pro Arg His Val
3275 3280 3285
Ile Cys Ser Ala Ser Asp Met Thr Asn Pro Asp Tyr Thr Asn Leu
3290 3295 3300
Leu Cys Arg Val Thr Ser Ser Asp Phe Thr Val Leu Phe Asp Arg
3305 3310 3315
Leu Ser Leu Thr Val Met Ser Tyr Gln Met Gln Gly Cys Met Leu
3320 3325 3330
Val Leu Thr Val Thr Leu Gln Asn Ser Arg Thr Pro Lys Tyr Thr
3335 3340 3345
Phe Gly Val Val Lys Pro Gly Glu Thr Phe Thr Val Leu Ala Ala
3350 3355 3360
Tyr Asn Gly Lys Pro Gln Gly Ala Phe His Val Thr Met Arg Ser
3365 3370 3375
Ser Tyr Thr Ile Lys Gly Ser Phe Leu Cys Gly Ser Cys Gly Ser
3380 3385 3390
Val Gly Tyr Val Ile Met Gly Asp Cys Val Lys Phe Val Tyr Met
3395 3400 3405
His Gln Leu Glu Leu Ser Thr Gly Cys His Thr Gly Thr Asp Phe
3410 3415 3420
Asn Gly Asp Phe Tyr Gly Pro Tyr Lys Asp Ala Gln Val Val Gln
3425 3430 3435
Leu Pro Val Gln Asp Tyr Ile Gln Ser Val Asn Phe Val Ala Trp
3440 3445 3450
Leu Tyr Ala Ala Ile Leu Asn Asn Cys Asn Trp Phe Val Gln Ser
3455 3460 3465
Asp Lys Cys Ser Val Glu Asp Phe Asn Val Trp Ala Leu Ser Asn
3470 3475 3480
Gly Phe Ser Gln Val Lys Ser Asp Leu Val Ile Asp Ala Leu Ala
3485 3490 3495
Ser Met Thr Gly Val Ser Leu Glu Thr Leu Leu Ala Ala Ile Lys
3500 3505 3510
Arg Leu Lys Asn Gly Phe Gln Gly Arg Gln Ile Met Gly Ser Cys
3515 3520 3525
Ser Phe Glu Asp Glu Leu Thr Pro Ser Asp Val Tyr Gln Gln Leu
3530 3535 3540
Ala Gly Ile Lys Leu Gln Ser Lys Arg Thr Arg Leu Val Lys Gly
3545 3550 3555
Ile Val Cys Trp Ile Met Ala Ser Thr Phe Leu Phe Ser Cys Ile
3560 3565 3570
Ile Thr Ala Phe Val Lys Trp Thr Met Phe Met Tyr Val Thr Thr
3575 3580 3585
Asn Met Leu Ser Ile Thr Phe Cys Ala Leu Cys Val Ile Ser Leu
3590 3595 3600
Ala Met Leu Leu Val Lys His Lys His Leu Tyr Leu Thr Met Tyr
3605 3610 3615
Ile Ile Pro Val Leu Phe Thr Leu Leu Tyr Asn Asn Tyr Leu Val
3620 3625 3630
Val Tyr Lys Gln Thr Phe Arg Gly Tyr Val Tyr Ala Trp Leu Ser
3635 3640 3645
Tyr Tyr Val Pro Ser Val Glu Tyr Thr Tyr Thr Asp Glu Val Ile
3650 3655 3660
Tyr Gly Met Leu Leu Leu Ile Gly Met Val Phe Val Thr Leu Arg
3665 3670 3675
Ser Ile Asn His Asp Leu Phe Ser Phe Ile Met Phe Val Gly Arg
3680 3685 3690
Val Ile Ser Val Val Ser Leu Trp Tyr Met Gly Ser Asn Leu Glu
3695 3700 3705
Glu Glu Ile Leu Leu Met Leu Ala Ser Leu Phe Gly Thr Tyr Thr
3710 3715 3720
Trp Thr Thr Ala Leu Ser Met Ala Ala Ala Lys Val Ile Ala Lys
3725 3730 3735
Trp Val Ala Val Asn Val Leu Tyr Phe Thr Asp Ile Pro Gln Ile
3740 3745 3750
Lys Ile Val Leu Val Cys Tyr Leu Phe Ile Gly Tyr Ile Ile Ser
3755 3760 3765
Cys Tyr Trp Gly Leu Phe Ser Leu Met Asn Ser Leu Phe Arg Met
3770 3775 3780
Pro Leu Gly Val Tyr Asn Tyr Lys Ile Ser Val Gln Glu Leu Arg
3785 3790 3795
Tyr Met Asn Ala Asn Gly Leu Arg Pro Pro Lys Asn Ser Phe Glu
3800 3805 3810
Ala Leu Met Leu Asn Phe Lys Leu Leu Gly Ile Gly Gly Val Pro
3815 3820 3825
Ile Ile Glu Val Ser Gln Phe Gln Ser Lys Leu Thr Asp Val Lys
3830 3835 3840
Cys Ala Asn Gly Gly Leu Leu Asn Cys Leu Gln His Leu His Val
3845 3850 3855
Ala Ser Asn Ser Lys Leu Trp Gln Tyr Cys Ser Thr Leu His Asn
3860 3865 3870
Glu Ile Leu Ala Thr Ser Asp Leu Gly Val Ala Phe Glu Lys Leu
3875 3880 3885
Ala Gln Leu Leu Ile Val Leu Phe Ala Asn Pro Ala Ala Val Asp
3890 3895 3900
Ser Lys Cys Leu Thr Ser Ile Glu Glu Val Cys Asp Asp Tyr Ala
3905 3910 3915
Lys Asp Asn Thr Val Leu Gln Ala Leu Gln Ser Glu Phe Val Asn
3920 3925 3930
Met Ala Ser Phe Val Glu Tyr Glu Val Ala Lys Lys Asn Leu Asp
3935 3940 3945
Glu Ala Cys Ser Ser Gly Ser Ala Asn Gln Gln Gln Leu Lys Gln
3950 3955 3960
Leu Glu Lys Ala Cys Asn Ile Ala Lys Ser Ala Tyr Glu Arg Asp
3965 3970 3975
Arg Ala Val Ala Arg Lys Leu Glu Arg Met Ala Asp Leu Ala Leu
3980 3985 3990
Thr Asn Met Tyr Lys Glu Ala Arg Ile Asn Asp Lys Lys Ser Lys
3995 4000 4005
Val Val Ser Ala Leu Gln Thr Met Leu Phe Ser Met Val Arg Lys
4010 4015 4020
Leu Asp Asn Gln Ala Leu Asn Ser Ile Leu Asp Asn Ala Val Lys
4025 4030 4035
Gly Cys Val Pro Leu Asn Ala Ile Pro Ser Leu Ala Ala Asn Thr
4040 4045 4050
Leu Thr Ile Ile Val Pro Asp Lys Ser Val Tyr Asp Gln Val Val
4055 4060 4065
Asp Asn Val Tyr Val Thr Tyr Ala Gly Asn Val Trp Gln Ile Gln
4070 4075 4080
Thr Ile Gln Asp Ser Asp Gly Thr Asn Lys Gln Leu His Glu Ile
4085 4090 4095
Ser Asp Asp Cys Asn Trp Pro Leu Val Ile Ile Ala Asn Arg His
4100 4105 4110
Asn Glu Val Ser Ala Thr Val Leu Gln Asn Asn Glu Leu Met Pro
4115 4120 4125
Ala Lys Leu Lys Thr Gln Val Val Asn Ser Gly Pro Asp Gln Thr
4130 4135 4140
Cys Asn Thr Pro Thr Gln Cys Tyr Tyr Asn Asn Ser Tyr Asn Gly
4145 4150 4155
Lys Ile Val Tyr Ala Ile Leu Ser Asp Val Asp Gly Leu Lys Tyr
4160 4165 4170
Thr Lys Ile Leu Lys Asp Asp Gly Asn Phe Val Val Leu Glu Leu
4175 4180 4185
Asp Pro Pro Cys Lys Phe Thr Val Gln Asp Val Lys Gly Leu Lys
4190 4195 4200
Ile Lys Tyr Leu Tyr Phe Val Lys Gly Cys Asn Thr Leu Ala Arg
4205 4210 4215
Gly Trp Val Val Gly Thr Ile Ser Ser Thr Val Arg Leu Gln Ala
4220 4225 4230
Gly Thr Ala Thr Glu Tyr Ala Ser Asn Ser Ser Ile Leu Ser Leu
4235 4240 4245
Cys Ala Phe Ser Val Asp Pro Lys Lys Thr Tyr Leu Asp Phe Ile
4250 4255 4260
Gln Gln Gly Gly Thr Pro Ile Ala Asn Cys Val Lys Met Leu Cys
4265 4270 4275
Asp His Ala Gly Thr Gly Met Ala Ile Thr Val Lys Pro Asp Ala
4280 4285 4290
Thr Thr Ser Gln Asp Ser Tyr Gly Gly Ala Ser Val Cys Ile Tyr
4295 4300 4305
Cys Arg Ala Arg Val Glu His Pro Asp Val Asp Gly Leu Cys Lys
4310 4315 4320
Leu Arg Gly Lys Phe Val Gln Val Pro Val Gly Ile Lys Asp Pro
4325 4330 4335
Val Ser Tyr Val Leu Thr His Asp Val Cys Gln Val Cys Gly Phe
4340 4345 4350
Trp Arg Asp Gly Ser Cys Ser Cys Val Ser Thr Asp Thr Thr Val
4355 4360 4365
Gln Ser Lys Asp Thr Phe Phe Lys Arg Val Arg Gly Thr Ser Val
4370 4375 4380
Asp Ala Arg Leu Val Pro Cys Ala Ser Gly Leu Ser Thr Asp Val
4385 4390 4395
Gln Leu Arg Ala Phe Asp Ile Cys Asn Ala Ser Val Ala Gly Ile
4400 4405 4410
Gly Leu His Leu Lys Val Asn Cys Cys Arg Phe Gln Arg Val Asp
4415 4420 4425
Glu Asn Gly Asp Lys Leu Asp Gln Phe Phe Val Val Lys Arg Thr
4430 4435 4440
Asp Leu Thr Ile Tyr Asn Arg Glu Met Glu Cys Tyr Glu Arg Val
4445 4450 4455
Lys Asp Cys Lys Phe Val Ala Glu His Asp Phe Phe Thr Phe Asp
4460 4465 4470
Val Glu Gly Ser Arg Val Pro His Ile Val Arg Lys Asp Leu Thr
4475 4480 4485
Lys Tyr Thr Met Leu Asp Leu Cys Tyr Ala Leu Arg His Phe Asp
4490 4495 4500
Arg Asn Asp Cys Met Leu Leu Cys Asp Ile Leu Ser Ile Tyr Ala
4505 4510 4515
Gly Cys Glu Gln Ser Tyr Phe Thr Lys Lys Asp Trp Tyr Asp Phe
4520 4525 4530
Val Glu Asn Pro Asp Ile Ile Asn Val Tyr Lys Lys Leu Gly Pro
4535 4540 4545
Ile Phe Asn Arg Ala Leu Val Ser Ala Thr Glu Phe Ala Asp Lys
4550 4555 4560
Leu Val Glu Val Gly Leu Val Gly Ile Leu Thr Leu Asp Asn Gln
4565 4570 4575
Asp Leu Asn Gly Lys Trp Tyr Asp Phe Gly Asp Tyr Val Ile Ala
4580 4585 4590
Ala Pro Gly Cys Gly Val Ala Ile Ala Asp Ser Tyr Tyr Ser Tyr
4595 4600 4605
Met Met Pro Met Leu Thr Met Cys His Ala Leu Asp Cys Glu Leu
4610 4615 4620
Tyr Val Asn Asn Ala Tyr Arg Leu Phe Asp Leu Val Gln Tyr Asp
4625 4630 4635
Phe Thr Asp Tyr Lys Leu Glu Leu Phe Asn Lys Tyr Phe Lys His
4640 4645 4650
Trp Ser Met Pro Tyr His Pro Asn Thr Val Asp Cys Gln Asp Asp
4655 4660 4665
Arg Cys Ile Ile His Cys Ala Asn Phe Asn Ile Leu Phe Ser Met
4670 4675 4680
Val Leu Pro Asn Thr Cys Phe Gly Pro Leu Val Arg Gln Ile Phe
4685 4690 4695
Val Asp Gly Val Pro Phe Val Val Ser Ile Gly Tyr His Tyr Lys
4700 4705 4710
Glu Leu Gly Ile Val Met Asn Met Asp Val Asp Thr His Arg Tyr
4715 4720 4725
Arg Leu Ser Leu Lys Asp Leu Leu Leu Tyr Ala Ala Asp Pro Ala
4730 4735 4740
Leu His Val Ala Ser Ala Ser Ala Leu Tyr Asp Leu Arg Thr Cys
4745 4750 4755
Cys Phe Ser Val Ala Ala Ile Thr Ser Gly Val Lys Phe Gln Thr
4760 4765 4770
Val Lys Pro Gly Asn Phe Asn Gln Asp Phe Tyr Asp Phe Ile Leu
4775 4780 4785
Ser Lys Gly Leu Leu Lys Glu Gly Ser Ser Val Asp Leu Lys His
4790 4795 4800
Phe Phe Phe Thr Gln Asp Gly Asn Ala Ala Ile Thr Asp Tyr Asn
4805 4810 4815
Tyr Tyr Lys Tyr Asn Leu Pro Thr Met Val Asp Ile Lys Gln Leu
4820 4825 4830
Leu Phe Val Leu Glu Val Val Tyr Lys Tyr Phe Glu Ile Tyr Asp
4835 4840 4845
Gly Gly Cys Ile Pro Ala Ala Gln Val Ile Val Asn Asn Tyr Asp
4850 4855 4860
Lys Ser Ala Gly Tyr Pro Phe Asn Lys Phe Gly Lys Ala Arg Leu
4865 4870 4875
Tyr Tyr Glu Ala Leu Ser Phe Glu Glu Gln Asp Glu Ile Tyr Ala
4880 4885 4890
Tyr Thr Lys Arg Asn Val Leu Pro Thr Leu Thr Gln Met Asn Leu
4895 4900 4905
Lys Tyr Ala Ile Ser Ala Lys Asn Arg Ala Arg Thr Val Ala Gly
4910 4915 4920
Val Ser Ile Leu Ser Thr Met Thr Gly Arg Met Phe His Gln Lys
4925 4930 4935
Cys Leu Lys Ser Ile Ala Ala Thr Arg Gly Val Pro Val Val Ile
4940 4945 4950
Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asp Met Leu Arg Arg
4955 4960 4965
Leu Ile Lys Asp Val Asp Asn Pro Val Leu Met Gly Trp Asp Tyr
4970 4975 4980
Pro Lys Cys Asp Arg Ala Met Pro Asn Ile Leu Arg Ile Val Ser
4985 4990 4995
Ser Leu Val Leu Ala Arg Lys His Glu Ala Cys Cys Ser Gln Ser
5000 5005 5010
Asp Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu Ser
5015 5020 5025
Glu Ile Val Met Cys Gly Gly Cys Tyr Tyr Val Lys Pro Gly Gly
5030 5035 5040
Thr Ser Ser Gly Asp Ala Thr Thr Ala Phe Ala Asn Ser Val Phe
5045 5050 5055
Asn Ile Cys Gln Ala Val Ser Ala Asn Val Cys Ala Leu Met Ser
5060 5065 5070
Cys Asn Gly Asn Lys Ile Glu Asp Leu Ser Ile Arg Ala Leu Gln
5075 5080 5085
Lys Arg Leu Tyr Ser His Val Tyr Arg Ser Asp Met Val Asp Ser
5090 5095 5100
Thr Phe Val Thr Glu Tyr Tyr Glu Phe Leu Asn Lys His Phe Ser
5105 5110 5115
Met Met Ile Leu Ser Asp Asp Gly Val Val Cys Tyr Asn Ser Asp
5120 5125 5130
Tyr Ala Ser Lys Gly Tyr Ile Ala Asn Ile Ser Ala Phe Gln Gln
5135 5140 5145
Val Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Glu Ser Lys Cys
5150 5155 5160
Trp Val Glu Asn Asp Ile Asn Asn Gly Pro His Glu Phe Cys Ser
5165 5170 5175
Gln His Thr Met Leu Val Lys Met Asp Gly Asp Asp Val Tyr Leu
5180 5185 5190
Pro Tyr Pro Val Pro Ser Arg Ile Leu Gly Ala Gly Cys Phe Val
5195 5200 5205
Asp Asp Leu Leu Lys Thr Asp Ser Val Leu Leu Ile Glu Arg Phe
5210 5215 5220
Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Val Tyr His Glu Asn
5225 5230 5235
Glu Glu Tyr Gln Lys Val Phe Arg Val Tyr Leu Glu Tyr Ile Lys
5240 5245 5250
Lys Leu Tyr Asn Glu Leu Gly Asn Gln Ile Leu Asp Ser Tyr Ser
5255 5260 5265
Val Ile Leu Ser Thr Cys Asp Gly Gln Lys Phe Thr Asp Glu Ser
5270 5275 5280
Phe Tyr Lys Asn Met Tyr Leu Arg Ser Ala Val Met Gln Ser Val
5285 5290 5295
Gly Ala Cys Val Val Cys Ser Ser Gln Thr Ser Leu Arg Cys Gly
5300 5305 5310
Ser Cys Ile Arg Lys Pro Leu Leu Cys Cys Lys Cys Cys Tyr Asp
5315 5320 5325
His Val Met Ala Thr Asp His Lys Tyr Val Leu Ser Val Ser Pro
5330 5335 5340
Tyr Val Cys Asn Ala Pro Gly Cys Asp Val Asn Asp Val Thr Lys
5345 5350 5355
Leu Tyr Leu Gly Gly Met Ser Tyr Tyr Cys Glu Asp His Lys Pro
5360 5365 5370
Gln Tyr Ser Phe Lys Leu Val Met Asn Gly Met Val Phe Gly Leu
5375 5380 5385
Tyr Lys Gln Ser Cys Thr Gly Ser Pro Tyr Ile Asp Asp Phe Asn
5390 5395 5400
Arg Ile Ala Ser Cys Lys Trp Thr Asp Val Asp Asp Tyr Ile Leu
5405 5410 5415
Ala Asn Glu Cys Thr Glu Arg Leu Lys Leu Phe Ala Ala Glu Thr
5420 5425 5430
Gln Lys Ala Thr Glu Glu Ala Phe Lys Gln Ser Tyr Ala Ser Ala
5435 5440 5445
Thr Ile Gln Glu Ile Val Ser Glu Arg Glu Leu Ile Leu Ser Trp
5450 5455 5460
Glu Ile Gly Lys Val Lys Pro Pro Leu Asn Lys Asn Tyr Val Phe
5465 5470 5475
Thr Gly Tyr His Phe Thr Lys Asn Gly Lys Thr Val Leu Gly Glu
5480 5485 5490
Tyr Val Phe Asp Lys Ser Glu Leu Thr Asn Gly Val Tyr Tyr Arg
5495 5500 5505
Ala Thr Thr Thr Tyr Lys Leu Ser Val Gly Asp Val Phe Val Leu
5510 5515 5520
Thr Ser His Ser Val Ala Asn Leu Ser Ala Pro Thr Leu Val Pro
5525 5530 5535
Gln Glu Asn Tyr Ser Ser Ile Arg Phe Ala Ser Val Tyr Ser Val
5540 5545 5550
Leu Glu Thr Phe Gln Asn Asn Val Val Asn Tyr Gln His Ile Gly
5555 5560 5565
Met Lys Arg Tyr Cys Thr Val Gln Gly Pro Pro Gly Thr Gly Lys
5570 5575 5580
Ser His Leu Ala Ile Gly Leu Ala Val Tyr Tyr Cys Thr Ala Arg
5585 5590 5595
Val Val Tyr Thr Ala Ala Ser His Ala Ala Val Asp Ala Leu Cys
5600 5605 5610
Glu Lys Ala Tyr Lys Phe Leu Asn Ile Asn Asp Cys Thr Arg Ile
5615 5620 5625
Val Pro Ala Lys Val Arg Val Glu Cys Tyr Asp Lys Phe Lys Ile
5630 5635 5640
Asn Asp Thr Thr Arg Lys Tyr Val Phe Thr Thr Ile Asn Ala Leu
5645 5650 5655
Pro Glu Met Val Thr Asp Ile Val Val Val Asp Glu Val Ser Met
5660 5665 5670
Leu Thr Asn Tyr Glu Leu Ser Val Ile Asn Ala Arg Ile Arg Ala
5675 5680 5685
Lys His Tyr Val Tyr Ile Gly Asp Pro Ala Gln Leu Pro Ala Pro
5690 5695 5700
Arg Val Leu Leu Ser Lys Gly Thr Leu Glu Pro Lys Tyr Phe Asn
5705 5710 5715
Thr Val Thr Lys Leu Met Cys Cys Leu Gly Pro Asp Ile Phe Leu
5720 5725 5730
Gly Thr Cys Tyr Arg Cys Pro Lys Glu Ile Val Asp Thr Val Ser
5735 5740 5745
Ala Leu Val Tyr Glu Asn Lys Leu Lys Ala Lys Asn Glu Ser Ser
5750 5755 5760
Ser Leu Cys Phe Lys Val Tyr Tyr Lys Gly Val Thr Thr His Glu
5765 5770 5775
Ser Ser Ser Ala Val Asn Met Gln Gln Ile Tyr Leu Ile Asn Lys
5780 5785 5790
Phe Leu Lys Ala Asn Pro Leu Trp His Lys Ala Val Phe Ile Ser
5795 5800 5805
Pro Tyr Asn Ser Gln Asn Phe Ala Ala Lys Arg Val Leu Gly Leu
5810 5815 5820
Gln Thr Gln Thr Val Asp Ser Ala Gln Gly Ser Glu Tyr Asp Tyr
5825 5830 5835
Val Ile Tyr Ser Gln Thr Ala Glu Thr Ala His Ser Val Asn Val
5840 5845 5850
Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys Lys Gly Ile Leu
5855 5860 5865
Cys Val Met Ser Asn Met Gln Leu Phe Glu Ala Leu Gln Phe Thr
5870 5875 5880
Thr Leu Thr Val Asp Lys Val Pro Gln Ala Val Glu Thr Arg Val
5885 5890 5895
Gln Cys Ser Thr Asn Leu Phe Lys Asp Cys Ser Lys Ser Tyr Ser
5900 5905 5910
Gly Tyr His Pro Ala His Ala Pro Ser Phe Leu Ala Val Asp Asp
5915 5920 5925
Lys Tyr Lys Ala Thr Gly Asp Leu Ala Val Cys Leu Gly Ile Gly
5930 5935 5940
Asp Ser Ala Val Thr Tyr Ser Arg Leu Ile Ser Leu Met Gly Phe
5945 5950 5955
Lys Leu Asp Val Thr Leu Asp Gly Tyr Cys Lys Leu Phe Ile Thr
5960 5965 5970
Lys Glu Glu Ala Val Lys Arg Val Arg Ala Trp Val Gly Phe Asp
5975 5980 5985
Ala Glu Gly Ala His Ala Thr Arg Asp Ser Ile Gly Thr Asn Phe
5990 5995 6000
Pro Leu Gln Leu Gly Phe Ser Thr Gly Ile Asp Phe Val Val Glu
6005 6010 6015
Ala Thr Gly Leu Phe Ala Asp Arg Asp Gly Tyr Ser Phe Lys Lys
6020 6025 6030
Ala Val Ala Lys Ala Pro Pro Gly Glu Gln Phe Lys His Leu Ile
6035 6040 6045
Pro Leu Met Thr Arg Gly Gln Arg Trp Asp Val Val Arg Pro Arg
6050 6055 6060
Ile Val Gln Met Phe Ala Asp His Leu Ile Asp Leu Ser Asp Cys
6065 6070 6075
Val Val Leu Val Thr Trp Ala Ala Asn Phe Glu Leu Thr Cys Leu
6080 6085 6090
Arg Tyr Phe Ala Lys Val Gly Arg Glu Ile Ser Cys Asn Val Ser
6095 6100 6105
Thr Lys Arg Ala Thr Ala Tyr Asn Ser Arg Thr Gly Tyr Tyr Gly
6110 6115 6120
Cys Trp Arg His Ser Val Thr Cys Asp Tyr Leu Tyr Asn Pro Leu
6125 6130 6135
Ile Val Asp Ile Gln Gln Trp Gly Tyr Ile Gly Ser Leu Ser Ser
6140 6145 6150
Asn His Asp Leu Tyr Cys Ser Val His Lys Gly Ala His Val Ala
6155 6160 6165
Ser Ser Asp Ala Ile Met Thr Arg Cys Leu Ala Val Tyr Asp Cys
6170 6175 6180
Phe Cys Asn Asn Ile Asn Trp Asn Val Glu Tyr Pro Ile Ile Ser
6185 6190 6195
Asn Glu Leu Ser Ile Asn Thr Ser Cys Arg Val Leu Gln Arg Val
6200 6205 6210
Met Leu Lys Ala Ala Met Leu Cys Asn Arg Tyr Thr Leu Cys Tyr
6215 6220 6225
Asp Ile Gly Asn Pro Lys Ala Ile Ala Cys Val Lys Asp Phe Asp
6230 6235 6240
Phe Lys Phe Tyr Asp Ala Gln Pro Ile Val Lys Ser Val Lys Thr
6245 6250 6255
Leu Leu Tyr Phe Phe Glu Ala His Lys Asp Ser Phe Lys Asp Gly
6260 6265 6270
Leu Cys Met Phe Trp Asn Cys Asn Val Asp Lys Tyr Pro Pro Asn
6275 6280 6285
Ala Val Val Cys Arg Phe Asp Thr Arg Val Leu Asn Asn Leu Asn
6290 6295 6300
Leu Pro Gly Cys Asn Gly Gly Ser Leu Tyr Val Asn Lys His Ala
6305 6310 6315
Phe His Thr Lys Pro Phe Ser Arg Ala Ala Phe Glu His Leu Lys
6320 6325 6330
Pro Met Pro Phe Phe Tyr Tyr Ser Asp Thr Pro Cys Val Tyr Met
6335 6340 6345
Asp Gly Met Asp Ala Lys Gln Val Asp Tyr Val Pro Leu Lys Ser
6350 6355 6360
Ala Thr Cys Ile Thr Arg Cys Asn Leu Gly Gly Ala Val Cys Leu
6365 6370 6375
Lys His Ala Glu Glu Tyr Arg Glu Tyr Leu Glu Ser Tyr Asn Thr
6380 6385 6390
Ala Thr Thr Ala Gly Phe Thr Phe Trp Val Tyr Lys Thr Phe Asp
6395 6400 6405
Phe Tyr Asn Leu Trp Asn Thr Phe Thr Lys Leu Gln Ser Leu Glu
6410 6415 6420
Asn Val Val Tyr Asn Leu Val Lys Thr Gly His Tyr Thr Gly Gln
6425 6430 6435
Ala Gly Glu Met Pro Cys Ala Ile Ile Asn Asp Lys Val Val Ala
6440 6445 6450
Lys Ile Asp Lys Glu Asp Val Val Ile Phe Ile Asn Asn Thr Thr
6455 6460 6465
Tyr Pro Thr Asn Val Ala Val Glu Leu Phe Ala Lys Arg Ser Ile
6470 6475 6480
Arg His His Pro Glu Leu Lys Leu Phe Arg Asn Leu Asn Ile Asp
6485 6490 6495
Val Cys Trp Lys His Val Ile Trp Asp Tyr Ala Arg Glu Ser Ile
6500 6505 6510
Phe Cys Ser Asn Thr Tyr Gly Val Cys Met Tyr Thr Asp Leu Lys
6515 6520 6525
Leu Ile Asp Lys Leu Asn Val Leu Phe Asp Gly Arg Asp Asn Gly
6530 6535 6540
Ala Leu Glu Ala Phe Lys Arg Ser Asn Asn Gly Val Tyr Ile Ser
6545 6550 6555
Thr Thr Lys Val Lys Ser Leu Ser Met Ile Arg Gly Pro Pro Arg
6560 6565 6570
Ala Glu Leu Asn Gly Val Val Val Asp Lys Val Gly Asp Thr Asp
6575 6580 6585
Cys Val Phe Tyr Phe Ala Val Arg Lys Glu Gly Gln Asp Val Ile
6590 6595 6600
Phe Ser Gln Phe Asp Ser Leu Arg Val Ser Ser Asn Gln Ser Pro
6605 6610 6615
Gln Gly Asn Leu Gly Ser Asn Glu Pro Gly Asn Val Gly Gly Asn
6620 6625 6630
Asp Ala Leu Ala Thr Ser Thr Ile Phe Thr Gln Ser Arg Val Ile
6635 6640 6645
Ser Ser Phe Thr Cys Arg Thr Asp Met Glu Lys Asp Phe Ile Ala
6650 6655 6660
Leu Asp Gln Asp Val Phe Ile Gln Lys Tyr Gly Leu Glu Asp Tyr
6665 6670 6675
Ala Phe Glu His Ile Val Tyr Gly Asn Phe Asn Gln Lys Ile Ile
6680 6685 6690
Gly Gly Leu His Leu Leu Ile Gly Leu Tyr Arg Arg Gln Gln Thr
6695 6700 6705
Ser Asn Leu Val Ile Gln Glu Phe Val Ser Tyr Asp Ser Ser Ile
6710 6715 6720
His Ser Tyr Phe Ile Thr Asp Glu Lys Ser Gly Gly Ser Lys Ser
6725 6730 6735
Val Cys Thr Val Ile Asp Ile Leu Leu Asp Asp Phe Val Ala Leu
6740 6745 6750
Val Lys Ser Leu Asn Leu Asn Cys Val Ser Lys Val Val Asn Val
6755 6760 6765
Asn Val Asp Phe Lys Asp Phe Gln Phe Met Leu Trp Cys Asn Asp
6770 6775 6780
Glu Lys Val Met Thr Phe Tyr Pro Arg Leu Gln Ala Ala Ser Asp
6785 6790 6795
Trp Lys Pro Gly Tyr Ser Met Pro Val Leu Tyr Lys Tyr Leu Asn
6800 6805 6810
Ser Pro Met Glu Arg Val Ser Leu Trp Asn Tyr Gly Lys Pro Val
6815 6820 6825
Thr Leu Pro Thr Gly Cys Met Met Asn Val Ala Lys Tyr Thr Gln
6830 6835 6840
Leu Cys Gln Tyr Leu Asn Thr Thr Thr Leu Ala Val Pro Val Asn
6845 6850 6855
Thr Arg Val Leu His Leu Gly Ala Gly Ser Glu Lys Gly Val Ala
6860 6865 6870
Pro Gly Ser Ala Val Leu Arg Gln Trp Leu Pro Ala Gly Thr Ile
6875 6880 6885
Leu Arg Gln Trp Leu Pro Ala Gly Thr Ile Leu Val His Asn Asp
6890 6895 6900
Leu Tyr Pro Phe Val Ser Asp Ser Val Ala Thr Tyr Phe Gly Asp
6905 6910 6915
Cys Ile Thr Leu Pro Phe Asp Cys Gln Trp Asp Leu Ile Ile Ser
6920 6925 6930
Asp Met Tyr Asp Leu Leu Leu Asp Ile Gly Val His Val Val Arg
6935 6940 6945
Cys Ser Tyr Ile His Cys His Met Ile Arg Asp Lys Leu Ala Leu
6950 6955 6960
Gly Gly Ser Val Ala Ile Lys Ile Thr Glu Phe Ser Trp Asn Ala
6965 6970 6975
Glu Leu Tyr Lys Leu Met Gly Tyr Phe Ala Phe Trp Thr Val Phe
6980 6985 6990
Cys Thr Asn Ala Asn Ala Ser Ser Ser Glu Gly Phe Leu Ile Gly
6995 7000 7005
Ile Asn Tyr Leu Gly Lys Pro Lys Val Glu Ile Asp Gly Asn Val
7010 7015 7020
Met His Ala Ile Ile Cys Phe Gly Glu Ile Pro Gln Phe Gly Thr
7025 7030 7035
Gly Val Leu Ile Ala Cys Leu Ile Trp Leu Asn Ser Arg Leu Ser
7040 7045 7050
Trp Leu Val Met Pro
7055
<210> 23
<211> 6724
<212> PRT
<213> EMCR Coronavirus
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<220>
<221> MISC_FEATURE
<222> (4890)..(4890)
<223> Unknown amino acid
<400> 23
Met Phe Tyr Asn Gln Val Thr Leu Ala Val Ala Ser Asp Ser Glu Ile
1 5 10 15
Ser Gly Phe Gly Phe Ala Ile Pro Ser Val Ala Val Arg Ala Tyr Ser
20 25 30
Glu Ala Ala Ala Gln Gly Phe Gln Ala Cys Arg Phe Val Ala Phe Gly
35 40 45
Leu Gln Asp Cys Val Thr Gly Ile Asn Asp Asp Asp Tyr Val Ile Ala
50 55 60
Leu Thr Gly Thr Asn Gln Leu Cys Ala Lys Ile Leu Leu Phe Ser Asp
65 70 75 80
Arg Pro Leu Asn Leu Arg Gly Trp Leu Ile Phe Ser Asn Ser Asn Tyr
85 90 95
Val Leu Gln Asp Phe Asp Val Val Phe Gly His Gly Ala Gly Ser Val
100 105 110
Val Phe Val Asp Lys Tyr Met Cys Gly Phe Asp Gly Lys Pro Val Leu
115 120 125
Pro Lys Asn Met Trp Glu Phe Arg Asp Tyr Phe Asn Asp Asn Thr Asp
130 135 140
Ser Ile Val Ile Gly Gly Val Thr Tyr Gln Leu Ala Trp Asp Val Ile
145 150 155 160
Arg Lys Asp Leu Ser Tyr Glu Gln Gln Asn Val Leu Ala Ile Glu Ser
165 170 175
Ile His Tyr Leu Gly Thr Thr Gly His Thr Leu Lys Ser Gly Cys Lys
180 185 190
Leu Ile Asn Ala Lys Pro Pro Lys Tyr Ser Ser Lys Val Val Leu Ser
195 200 205
Gly Glu Trp Asn Ala Val Tyr Lys Ala Phe Gly Ser Pro Phe Ile Thr
210 215 220
Asn Gly Ile Ser Leu Leu Asp Ile Ile Val Lys Pro Val Phe Phe Asn
225 230 235 240
Ala Phe Val Lys Cys Asn Cys Gly Ser Glu Asn Trp Ser Val Gly Ala
245 250 255
Trp Asp Gly Tyr Leu Ser Ser Cys Cys Gly Thr Pro Ala Lys Lys Leu
260 265 270
Cys Val Val Pro Gly Asn Val Val Pro Gly Asp Val Ile Ile Thr Ser
275 280 285
Thr Asp Ala Gly Cys Gly Val Lys Tyr Tyr Ala Gly Leu Val Val Lys
290 295 300
His Ile Thr Asn Ile Thr Gly Val Ser Leu Trp Arg Val Thr Ala Val
305 310 315 320
His Ser Asp Gly Met Phe Val Ala Thr Ser Ser Tyr Asp Ala Leu Leu
325 330 335
His Arg Asn Ser Leu Asp Pro Phe Cys Phe Asp Val Asn Thr Leu Leu
340 345 350
Ser Asn Gln Leu Arg Leu Ala Phe Leu Gly Ala Ser Val Thr Glu Asp
355 360 365
Val Lys Phe Ala Ala Ser Thr Gly Val Ile Asp Ile Ser Ala Gly Met
370 375 380
Phe Gly Leu Tyr Asp Asp Ile Leu Thr Asn Asn Lys Pro Trp Phe Val
385 390 395 400
Arg Lys Ala Ser Gly Leu Phe Asp Ala Ile Trp Asp Ala Phe Val Ala
405 410 415
Ala Ile Lys Leu Val Pro Thr Thr Thr Gly Gly Leu Val Arg Phe Val
420 425 430
Lys Ser Ile Ala Ser Thr Val Leu Thr Val Ser Asn Gly Val Ile Ile
435 440 445
Met Cys Ala Asp Val Pro Asp Ala Phe Gln Pro Val Tyr Arg Thr Phe
450 455 460
Thr Gln Ala Ile Cys Ala Ala Phe Asp Phe Ser Leu Asp Val Phe Lys
465 470 475 480
Ile Gly Asp Val Lys Phe Lys Arg Leu Gly Asp Tyr Val Leu Thr Glu
485 490 495
Asn Ala Leu Val Arg Leu Thr Thr Glu Val Val Arg Gly Val Arg Asp
500 505 510
Ala Arg Ile Lys Lys Ala Met Phe Thr Lys Val Val Val Gly Pro Thr
515 520 525
Thr Glu Val Lys Phe Ser Val Ile Glu Leu Ala Thr Val Asn Leu Arg
530 535 540
Leu Val Asp Cys Ala Pro Val Val Cys Pro Lys Gly Lys Ile Val Val
545 550 555 560
Ile Ala Gly Gln Ala Phe Phe Tyr Ser Gly Gly Phe Tyr Arg Phe Met
565 570 575
Val Asp Ser Thr Thr Val Leu Asn Asp Pro Val Phe Thr Gly Glu Leu
580 585 590
Phe Tyr Thr Ile Lys Phe Ser Gly Phe Lys Leu Asp Gly Phe Asn His
595 600 605
Gln Phe Val Asn Ala Ser Ser Ala Thr Asp Ala Ile Ile Ala Val Glu
610 615 620
Leu Leu Leu Ser Asp Phe Lys Thr Ala Val Phe Val Tyr Thr Cys Val
625 630 635 640
Val Asp Gly Cys Ser Val Ile Val Arg Arg Asp Ala Thr Phe Ala Thr
645 650 655
His Val Cys Phe Lys Asp Cys Tyr Ser Ile Trp Glu Gln Phe Cys Ile
660 665 670
Asp Asn Cys Gly Glu Pro Trp Phe Leu Thr Asp Tyr Asn Ala Ile Leu
675 680 685
Gln Ser Asn Asn Pro Gln Cys Ala Ile Val Gln Ala Ser Glu Ser Lys
690 695 700
Val Leu Leu Glu Arg Phe Leu Pro Lys Cys Pro Glu Ile Leu Leu Ser
705 710 715 720
Ile Asp Asp Gly His Leu Trp Asn Leu Phe Val Glu Lys Phe Asn Phe
725 730 735
Val Thr Asp Trp Leu Lys Thr Leu Lys Leu Thr Leu Thr Ser Asn Gly
740 745 750
Leu Leu Gly Asn Cys Ala Lys Arg Phe Arg Arg Val Leu Val Lys Leu
755 760 765
Leu Asp Val Tyr Asn Gly Phe Leu Glu Thr Val Cys Ser Val Val His
770 775 780
Thr Ala Gly Val Cys Ile Lys Tyr Tyr Ala Val Asn Val Pro Tyr Val
785 790 795 800
Val Ile Ser Gly Phe Val Ser Arg Val Ile Arg Arg Glu Arg Cys Asp
805 810 815
Val Thr Phe Pro Cys Val Ser Cys Val Thr Phe Phe Tyr Glu Phe Leu
820 825 830
Asp Thr Cys Phe Gly Val Ser Lys Pro Asn Ala Ile Asp Val Glu His
835 840 845
Leu Glu Leu Lys Glu Thr Val Phe Val Glu Pro Lys Asp Gly Gly Gln
850 855 860
Phe Phe Val Ser Asp Asp Tyr Leu Trp Tyr Val Val Asp Asp Ile Tyr
865 870 875 880
Tyr Pro Ala Ser Cys Asn Gly Val Leu Pro Val Ala Phe Thr Lys Leu
885 890 895
Ala Gly Gly Lys Ile Ser Phe Ser Asp Asp Val Ile Val His Asp Val
900 905 910
Glu Pro Thr His Lys Val Lys Leu Ile Phe Glu Phe Glu Asp Asp Val
915 920 925
Val Thr Ser Leu Cys Lys Lys Ser Phe Gly Lys Ser Ile Ile Tyr Thr
930 935 940
Gly Asp Trp Glu Gly Leu His Glu Val Leu Thr Ser Ala Met Asn Val
945 950 955 960
Ile Gly Gln His Ile Lys Leu Pro Gln Phe Tyr Ile Tyr Asp Glu Glu
965 970 975
Gly Gly Tyr Asp Val Ser Lys Pro Val Met Ile Ser Gln Trp Pro Ile
980 985 990
Ser Asp Asp Ser Asp Gly Cys Val Val Glu Ala Ser Thr Asp Phe His
995 1000 1005
Gln Leu Glu Ser Val Arg Glu Glu Val Asp Ile Ile Glu Gln Pro
1010 1015 1020
Phe Gly Glu Val Glu His Ala Leu Ser Ile Arg Gln Pro Phe Ser
1025 1030 1035
Phe Ser Phe Arg Asp Glu Leu Gly Val Arg Val Leu Asp Gln Ser
1040 1045 1050
Asp Asn Asn Cys Trp Ile Ser Thr Thr Leu Ile Gln Leu Gln Leu
1055 1060 1065
Thr Lys Leu Leu Asp Asp Ser Ile Glu Met Gln Leu Phe Lys Val
1070 1075 1080
Gly Lys Val Asp Ser Ile Val Gln Lys Cys Tyr Glu Leu Ser His
1085 1090 1095
Leu Ile Ser Gly Ser Leu Gly Asp Ser Gly Lys Leu Leu Ser Glu
1100 1105 1110
Leu Leu Lys Asp Lys Tyr Thr Cys Ser Ile Thr Phe Glu Met Ser
1115 1120 1125
Cys Asp Cys Gly Lys Lys Phe Asp Glu Gln Val Gly Cys Leu Phe
1130 1135 1140
Trp Ile Met Pro Tyr Thr Lys Leu Phe Gln Lys Gly Glu Cys Cys
1145 1150 1155
Ile Cys His Lys Met Gln Thr Tyr Lys Leu Val Ser Met Lys Gly
1160 1165 1170
Thr Gly Val Phe Val Gln Asp Pro Ala Pro Ile Asp Ile Asp Ala
1175 1180 1185
Phe Pro Val Arg Pro Ile Cys Ser Ser Val Tyr Leu Gly Val Lys
1190 1195 1200
Gly Ser Gly His Tyr Gln Thr Asn Leu Tyr Ser Phe Asp Lys Ala
1205 1210 1215
Ile Asp Gly Phe Gly Val Phe Asp Ile Lys Asn Ser Ser Val Asn
1220 1225 1230
Thr Val Cys Phe Val Asp Val Asp Phe His Ser Val Glu Ile Glu
1235 1240 1245
Ala Gly Glu Val Lys Pro Phe Ala Val Tyr Lys Asn Val Lys Phe
1250 1255 1260
Tyr Leu Gly Asp Ile Ser His Leu Val Asn Cys Val Ser Phe Asp
1265 1270 1275
Phe Val Val Asn Ala Ala Asn Glu Asn Leu Met His Gly Gly Gly
1280 1285 1290
Val Ala Arg Ala Ile Asp Ile Leu Thr Glu Gly Gln Leu Gln Ser
1295 1300 1305
Leu Ser Lys Asp Tyr Ile Ser Ser Asn Gly Pro Leu Lys Val Gly
1310 1315 1320
Ala Gly Val Met Leu Glu Cys Glu Lys Phe Asn Val Phe Asn Val
1325 1330 1335
Val Gly Pro Arg Thr Gly Lys His Glu His Ser Leu Leu Val Glu
1340 1345 1350
Ala Tyr Asn Ser Ile Leu Phe Glu Asn Gly Ile Pro Leu Met Pro
1355 1360 1365
Leu Leu Ser Cys Gly Ile Phe Gly Val Arg Ile Glu Asn Ser Leu
1370 1375 1380
Lys Ala Leu Phe Ser Cys Asp Ile Asn Lys Pro Leu Gln Val Phe
1385 1390 1395
Val Tyr Ser Ser Asn Glu Glu Gln Ala Val Leu Lys Phe Leu Asp
1400 1405 1410
Gly Leu Asp Leu Thr Pro Val Ile Asp Asp Val Asp Val Val Lys
1415 1420 1425
Pro Phe Arg Val Glu Gly Asn Phe Ser Phe Phe Asp Cys Gly Val
1430 1435 1440
Asn Ala Leu Asp Gly Asp Ile Tyr Leu Leu Phe Thr Asn Ser Ile
1445 1450 1455
Leu Met Leu Asp Lys Gln Gly Gln Leu Leu Asp Thr Lys Leu Asn
1460 1465 1470
Gly Ile Leu Gln Gln Ala Val Leu Asp Tyr Leu Ala Thr Val Lys
1475 1480 1485
Thr Val Pro Ala Gly Asn Leu Val Lys Leu Val Val Glu Ser Cys
1490 1495 1500
Thr Ile Tyr Met Cys Val Val Pro Ser Ile Asn Asp Leu Ser Phe
1505 1510 1515
Asp Lys Asn Leu Gly Arg Cys Val Arg Lys Leu Asn Arg Leu Lys
1520 1525 1530
Thr Cys Val Ile Ala Asn Val Pro Ala Ile Asp Val Leu Lys Lys
1535 1540 1545
Leu Leu Ser Ser Leu Thr Leu Thr Val Lys Phe Val Val Glu Ser
1550 1555 1560
Asn Val Met Asp Val Asn Asp Cys Phe Lys Asn Asp Asn Val Val
1565 1570 1575
Leu Lys Ile Thr Glu Asp Gly Ile Asn Val Lys Asp Val Val Val
1580 1585 1590
Glu Ser Ser Lys Ser Leu Gly Lys Gln Leu Gly Val Val Ser Asp
1595 1600 1605
Gly Val Asp Ser Phe Glu Gly Val Leu Pro Ile Asn Thr Asp Thr
1610 1615 1620
Val Leu Ser Val Ala Pro Glu Val Asp Trp Val Ala Phe Tyr Gly
1625 1630 1635
Phe Glu Lys Ala Ala Leu Phe Ala Ser Leu Asp Val Lys Pro Tyr
1640 1645 1650
Gly Tyr Pro Asn Asp Phe Val Gly Gly Phe Arg Val Leu Gly Thr
1655 1660 1665
Thr Asp Asn Asn Cys Trp Val Asn Ala Thr Cys Ile Ile Leu Gln
1670 1675 1680
Tyr Leu Lys Pro Thr Phe Lys Ser Lys Gly Leu Asn Val Leu Trp
1685 1690 1695
Asn Lys Phe Val Thr Gly Asp Val Gly Pro Phe Val Ser Phe Ile
1700 1705 1710
Tyr Phe Ile Thr Met Ser Ser Lys Gly Gln Lys Gly Asp Ala Glu
1715 1720 1725
Glu Ala Leu Ser Lys Leu Ser Glu Tyr Leu Ile Ser Asp Ser Ile
1730 1735 1740
Val Thr Leu Glu Gln Tyr Ser Thr Cys Asp Ile Cys Lys Ser Thr
1745 1750 1755
Val Val Glu Val Lys Ser Ala Val Val Cys Ala Ser Val Leu Lys
1760 1765 1770
Asp Gly Cys Asp Val Gly Phe Cys Pro His Arg His Lys Leu Arg
1775 1780 1785
Ser Arg Val Lys Phe Val Asn Gly Arg Val Val Ile Thr Asn Val
1790 1795 1800
Gly Glu Pro Ile Ile Ser Gln Pro Ser Lys Leu Leu Asn Gly Ile
1805 1810 1815
Ala Tyr Thr Thr Phe Ser Gly Ser Phe Asp Asn Gly His Tyr Val
1820 1825 1830
Val Tyr Asp Ala Ala Asn Asn Ala Val Tyr Asp Gly Ala Arg Leu
1835 1840 1845
Phe Ala Ser Asp Leu Ser Thr Leu Ala Val Thr Ala Ile Val Val
1850 1855 1860
Val Gly Gly Cys Val Thr Ser Asn Val Pro Pro Ile Val Ser Glu
1865 1870 1875
Lys Ile Ser Val Met Asp Lys Leu Asp Thr Gly Ala Gln Lys Phe
1880 1885 1890
Phe Gln Phe Gly Asp Phe Val Met Asn Asn Ile Val Leu Phe Leu
1895 1900 1905
Thr Trp Leu Leu Ser Met Phe Ser Leu Leu Arg Thr Ser Ile Met
1910 1915 1920
Lys His Asp Ile Lys Val Ile Ala Lys Ala Pro Lys Arg Thr Gly
1925 1930 1935
Val Ile Leu Thr Arg Ser Phe Lys Tyr Asn Ile Arg Ser Ala Leu
1940 1945 1950
Phe Val Val Lys Gln Lys Trp Cys Val Ile Val Thr Leu Phe Lys
1955 1960 1965
Phe Leu Leu Leu Leu Tyr Ala Ile Tyr Ala Leu Val Phe Met Ile
1970 1975 1980
Val Gln Phe Ser Pro Phe Asn Ser Leu Leu Cys Gly Asp Ile Val
1985 1990 1995
Ser Gly Tyr Glu Lys Ser Thr Phe Asn Lys Asp Ile Tyr Cys Gly
2000 2005 2010
Asn Ser Met Val Cys Lys Met Cys Leu Phe Ser Tyr Gln Glu Phe
2015 2020 2025
Asn Asp Leu Asp His Thr Ser Leu Val Trp Lys His Ile Arg Asp
2030 2035 2040
Pro Ile Leu Ile Ser Leu Gln Pro Phe Val Ile Leu Val Ile Leu
2045 2050 2055
Leu Ile Phe Gly Asn Met Tyr Leu Arg Phe Gly Leu Leu Tyr Phe
2060 2065 2070
Val Ala Gln Phe Ile Ser Thr Phe Gly Ser Phe Leu Gly Phe His
2075 2080 2085
Gln Lys Gln Trp Phe Leu His Phe Val Pro Phe Asp Val Leu Cys
2090 2095 2100
Asn Glu Phe Leu Ala Thr Phe Ile Val Cys Lys Ile Val Leu Phe
2105 2110 2115
Val Arg His Ile Ile Val Gly Cys Asn Asn Ala Asp Cys Val Ala
2120 2125 2130
Cys Ser Lys Ser Ala Arg Leu Lys Arg Val Pro Leu Gln Thr Ile
2135 2140 2145
Ile Asn Gly Met His Lys Ser Phe Tyr Val Asn Ala Asn Gly Gly
2150 2155 2160
Thr Cys Phe Cys Asn Lys His Asn Phe Phe Cys Val Asn Cys Asp
2165 2170 2175
Ser Phe Gly Pro Gly Asn Thr Phe Ile Asn Gly Asp Ile Ala Arg
2180 2185 2190
Glu Leu Gly Asn Val Val Lys Thr Ala Val Gln Pro Thr Ala Pro
2195 2200 2205
Ala Tyr Val Ile Ile Asp Lys Val Asp Phe Val Asn Gly Phe Tyr
2210 2215 2220
Arg Leu Tyr Ser Gly Asp Thr Phe Trp Arg Tyr Asp Phe Asp Ile
2225 2230 2235
Thr Glu Ser Lys Tyr Ser Cys Lys Glu Val Leu Lys Asn Cys Asn
2240 2245 2250
Val Leu Glu Asn Phe Ile Val Tyr Asn Asn Ser Gly Ser Asn Ile
2255 2260 2265
Thr Gln Ile Lys Asn Ala Cys Val Tyr Phe Ser Gln Leu Leu Cys
2270 2275 2280
Glu Pro Ile Lys Leu Val Asn Ser Glu Leu Leu Ser Thr Leu Ser
2285 2290 2295
Val Asp Phe Asn Gly Val Leu His Lys Ala Tyr Val Asp Val Leu
2300 2305 2310
Cys Asn Ser Phe Phe Lys Glu Leu Thr Ala Asn Met Ser Met Ala
2315 2320 2325
Glu Cys Lys Ala Thr Leu Gly Leu Thr Val Ser Asp Asp Asp Phe
2330 2335 2340
Val Ser Ala Val Ala Asn Ala His Arg Tyr Asp Val Leu Leu Ser
2345 2350 2355
Asp Leu Ser Phe Asn Asn Phe Phe Ile Ser Tyr Ala Lys Pro Glu
2360 2365 2370
Asp Lys Leu Ser Val Tyr Asp Ile Ala Cys Cys Met Arg Ala Gly
2375 2380 2385
Ser Lys Val Val Asn His Asn Val Leu Ile Lys Glu Ser Ile Pro
2390 2395 2400
Ile Val Trp Gly Val Lys Asp Phe Asn Thr Leu Ser Gln Glu Gly
2405 2410 2415
Lys Lys Tyr Leu Val Lys Thr Thr Lys Ala Lys Gly Leu Thr Phe
2420 2425 2430
Leu Leu Thr Phe Asn Asp Asn Gln Ala Ile Thr Gln Val Pro Ala
2435 2440 2445
Thr Ser Ile Val Ala Lys Gln Gly Ala Gly Phe Lys Arg Thr Tyr
2450 2455 2460
Asn Phe Leu Trp Tyr Val Cys Leu Phe Val Val Ala Leu Phe Ile
2465 2470 2475
Gly Val Ser Phe Ile Asp Tyr Thr Thr Thr Val Thr Ser Phe His
2480 2485 2490
Gly Tyr Asp Phe Lys Tyr Ile Glu Asn Gly Gln Leu Lys Val Phe
2495 2500 2505
Glu Ala Pro Leu His Cys Val Arg Asn Val Phe Asp Asn Phe Asn
2510 2515 2520
Gln Trp His Glu Ala Lys Phe Gly Val Val Thr Thr Asn Ser Asp
2525 2530 2535
Lys Cys Pro Ile Val Val Gly Val Ser Glu Arg Ile Asn Val Val
2540 2545 2550
Pro Gly Val Pro Thr Asn Val Tyr Leu Val Gly Lys Thr Leu Val
2555 2560 2565
Phe Thr Leu Gln Ala Ala Phe Gly Asn Thr Gly Val Cys Tyr Asp
2570 2575 2580
Phe Asp Gly Val Thr Thr Ser Asp Lys Cys Ile Phe Asn Ser Ala
2585 2590 2595
Cys Thr Arg Leu Glu Gly Leu Gly Gly Asp Asn Val Tyr Cys Tyr
2600 2605 2610
Asn Thr Asp Leu Ile Glu Gly Ser Lys Pro Tyr Ser Ile Leu Gln
2615 2620 2625
Pro Asn Ala Tyr Tyr Lys Tyr Asp Val Lys Asn Tyr Val Arg Phe
2630 2635 2640
Pro Glu Ile Leu Ala Arg Gly Phe Gly Leu Arg Thr Ile Arg Thr
2645 2650 2655
Leu Ala Thr Arg Tyr Cys Arg Val Gly Glu Cys Arg Asp Ser His
2660 2665 2670
Lys Gly Val Cys Phe Gly Phe Asp Lys Trp Tyr Val Asn Asp Gly
2675 2680 2685
Arg Val Asp Asp Gly Tyr Ile Cys Gly Asp Gly Leu Ile Asp Leu
2690 2695 2700
Leu Val Asn Val Leu Ser Ile Phe Ser Ser Ser Phe Ser Val Val
2705 2710 2715
Ala Met Ser Gly His Met Leu Phe Asn Phe Leu Phe Ala Ala Phe
2720 2725 2730
Ile Thr Phe Leu Cys Phe Leu Val Thr Lys Phe Lys Arg Val Phe
2735 2740 2745
Gly Asp Leu Ser Tyr Gly Val Phe Thr Val Val Cys Ala Thr Leu
2750 2755 2760
Ile Asn Asn Ile Ser Tyr Val Val Thr Gln Asn Leu Phe Phe Met
2765 2770 2775
Leu Leu Tyr Ala Ile Leu Tyr Phe Val Phe Thr Arg Thr Val Arg
2780 2785 2790
Tyr Ala Trp Ile Trp His Ile Ala Tyr Ile Val Ala Tyr Phe Leu
2795 2800 2805
Leu Ile Pro Trp Trp Leu Leu Thr Trp Phe Ser Phe Ala Ala Phe
2810 2815 2820
Leu Glu Leu Leu Pro Asn Val Phe Lys Leu Lys Ile Ser Thr Gln
2825 2830 2835
Leu Phe Glu Gly Asp Lys Phe Ile Gly Thr Phe Glu Ser Ala Ala
2840 2845 2850
Ala Gly Thr Phe Val Leu Asp Met Arg Ser Tyr Glu Arg Leu Ile
2855 2860 2865
Asn Thr Ile Ser Pro Glu Lys Leu Lys Asn Tyr Ala Ala Ser Tyr
2870 2875 2880
Asn Lys Tyr Lys Tyr Tyr Ser Gly Ser Ala Ser Glu Ala Asp Tyr
2885 2890 2895
Arg Cys Ala Cys Tyr Ala His Leu Ala Lys Ala Met Leu Asp Tyr
2900 2905 2910
Ala Lys Asp His Asn Asp Met Leu Tyr Ser Pro Pro Thr Ile Ser
2915 2920 2925
Tyr Asn Ser Thr Leu Gln Ser Gly Leu Lys Lys Met Ala Gln Pro
2930 2935 2940
Ser Gly Cys Val Glu Arg Cys Val Val Arg Val Cys Tyr Gly Ser
2945 2950 2955
Thr Val Leu Asn Gly Val Trp Leu Gly Asp Thr Val Thr Cys Pro
2960 2965 2970
Arg His Val Ile Ala Pro Ser Thr Thr Val Leu Ile Asp Tyr Asp
2975 2980 2985
His Ala Tyr Ser Thr Met Arg Leu His Asn Phe Ser Val Ser His
2990 2995 3000
Asn Gly Val Phe Leu Gly Val Val Gly Val Thr Met His Gly Ser
3005 3010 3015
Val Leu Arg Ile Lys Val Ser Gln Ser Asn Val His Thr Pro Lys
3020 3025 3030
His Val Phe Lys Thr Leu Lys Pro Gly Ala Ser Phe Asn Ile Leu
3035 3040 3045
Ala Cys Tyr Glu Gly Ile Ala Ser Gly Val Phe Gly Val Asn Leu
3050 3055 3060
Arg Thr Asn Phe Thr Ile Lys Gly Ser Phe Ile Asn Gly Ala Cys
3065 3070 3075
Gly Ser Pro Gly Tyr Asn Val Arg Asn Asp Gly Thr Val Glu Phe
3080 3085 3090
Cys Tyr Leu His Gln Ile Glu Leu Gly Ser Gly Ala His Val Gly
3095 3100 3105
Ser Asp Phe Thr Gly Ser Val Tyr Gly Asn Phe Asp Asp Gln Pro
3110 3115 3120
Ser Leu Gln Val Glu Ser Ala Asn Leu Met Leu Ser Asp Asn Val
3125 3130 3135
Val Ala Phe Leu Tyr Ala Ala Leu Leu Asn Gly Cys Arg Trp Trp
3140 3145 3150
Leu Arg Ser Thr Arg Val Asn Val Asp Gly Phe Asn Glu Trp Ala
3155 3160 3165
Met Ala Asn Gly Tyr Thr Ile Val Ser Ser Val Glu Cys Tyr Ser
3170 3175 3180
Ile Leu Ala Ala Lys Thr Gly Val Ser Val Glu Gln Leu Leu Ala
3185 3190 3195
Ser Ile Gln His Leu His Glu Gly Phe Gly Gly Lys Asn Ile Leu
3200 3205 3210
Gly Tyr Ser Ser Leu Cys Asp Glu Phe Thr Leu Ala Glu Val Val
3215 3220 3225
Lys Gln Met Tyr Gly Val Asn Leu Gln Ser Gly Lys Val Ile Phe
3230 3235 3240
Gly Leu Lys Thr Met Phe Leu Phe Ser Val Phe Phe Thr Met Phe
3245 3250 3255
Trp Ala Glu Leu Phe Ile Tyr Thr Asn Thr Ile Trp Ile Asn Pro
3260 3265 3270
Val Ile Leu Thr Pro Ile Phe Cys Leu Leu Leu Phe Leu Ser Leu
3275 3280 3285
Val Leu Thr Met Phe Leu Lys His Lys Phe Leu Phe Leu Gln Val
3290 3295 3300
Phe Leu Leu Pro Thr Val Ile Ala Thr Ala Leu Tyr Asn Cys Val
3305 3310 3315
Leu Asp Tyr Tyr Ile Val Lys Phe Leu Ala Asp His Phe Asn Tyr
3320 3325 3330
Asn Val Ser Val Leu Gln Met Asp Val Gln Gly Leu Val Asn Val
3335 3340 3345
Leu Val Cys Leu Phe Val Val Phe Leu His Thr Trp Arg Phe Ser
3350 3355 3360
Lys Glu Arg Phe Thr His Trp Phe Thr Tyr Val Cys Ser Leu Ile
3365 3370 3375
Ala Val Ala Tyr Thr Tyr Phe Tyr Ser Gly Asp Phe Leu Ser Leu
3380 3385 3390
Leu Val Met Phe Leu Cys Ala Ile Ser Ser Asp Trp Tyr Ile Gly
3395 3400 3405
Ala Ile Val Phe Arg Leu Ser Arg Leu Ile Ile Phe Phe Ser Pro
3410 3415 3420
Glu Ser Val Phe Ser Val Phe Gly Asp Val Lys Leu Thr Leu Val
3425 3430 3435
Val Tyr Leu Ile Cys Gly Tyr Leu Val Cys Thr Tyr Trp Gly Ile
3440 3445 3450
Leu Tyr Trp Phe Asn Arg Phe Phe Lys Cys Thr Met Gly Val Tyr
3455 3460 3465
Asp Phe Lys Val Ser Ala Ala Glu Phe Lys Tyr Met Val Ala Asn
3470 3475 3480
Gly Leu His Ala Pro Tyr Gly Pro Phe Asp Ala Leu Trp Leu Ser
3485 3490 3495
Phe Lys Leu Leu Gly Ile Gly Gly Asp Arg Cys Ile Lys Ile Ser
3500 3505 3510
Thr Val Gln Ser Lys Leu Thr Asp Leu Lys Cys Thr Asn Val Val
3515 3520 3525
Leu Leu Gly Cys Leu Ser Ser Met Asn Ile Ala Ala Asn Ser Ser
3530 3535 3540
Glu Trp Ala Tyr Cys Val Asp Leu His Asn Lys Ile Asn Leu Cys
3545 3550 3555
Asp Asp Pro Glu Lys Ala Gln Gly Met Leu Leu Ala Leu Leu Ala
3560 3565 3570
Phe Phe Leu Ser Lys His Ser Asp Phe Gly Leu Asp Gly Leu Ile
3575 3580 3585
Asp Ser Tyr Phe Asp Asn Ser Ser Thr Leu Gln Ser Val Ala Ser
3590 3595 3600
Ser Phe Val Ser Met Pro Ser Tyr Ile Ala Tyr Glu Asn Ala Arg
3605 3610 3615
Gln Ala Tyr Glu Asp Ala Ile Ala Asn Gly Ser Ser Ser Gln Leu
3620 3625 3630
Ile Lys Gln Leu Lys Arg Ala Met Asn Ile Ala Lys Ser Glu Phe
3635 3640 3645
Asp His Glu Ile Ser Val Gln Lys Lys Ile Asn Arg Met Ala Glu
3650 3655 3660
Gln Ala Ala Thr Gln Met Tyr Lys Glu Ala Arg Ser Val Asn Arg
3665 3670 3675
Lys Ser Lys Val Ile Ser Ala Met His Ser Leu Leu Phe Gly Met
3680 3685 3690
Leu Arg Arg Leu Asp Met Ser Ser Val Glu Thr Val Leu Asn Leu
3695 3700 3705
Ala Arg Asp Gly Val Val Pro Leu Ser Val Ile Pro Ala Thr Ser
3710 3715 3720
Ala Ser Lys Leu Thr Ile Val Ser Pro Asp Leu Glu Ser Tyr Ser
3725 3730 3735
Lys Ile Val Cys Asp Gly Ser Val His Tyr Ala Gly Val Val Trp
3740 3745 3750
Thr Leu Asn Asp Val Lys Asp Asn Asp Gly Arg Pro Val His Val
3755 3760 3765
Lys Glu Ile Thr Arg Glu Asn Val Glu Thr Leu Thr Trp Pro Leu
3770 3775 3780
Ile Leu Asn Cys Glu Arg Val Val Lys Leu Gln Asn Asn Glu Ile
3785 3790 3795
Met Pro Gly Lys Leu Lys Gln Lys Pro Met Lys Ala Glu Gly Asp
3800 3805 3810
Gly Gly Val Leu Gly Asp Gly Asn Ala Leu Tyr Asn Thr Glu Gly
3815 3820 3825
Gly Lys Thr Phe Met Tyr Ala Tyr Ile Ser Asn Lys Ala Asp Leu
3830 3835 3840
Lys Phe Val Lys Trp Glu Tyr Glu Gly Gly Cys Asn Thr Ile Glu
3845 3850 3855
Leu Asp Ser Pro Cys Arg Phe Met Val Glu Thr Pro Asn Gly Pro
3860 3865 3870
Gln Val Lys Tyr Leu Tyr Phe Val Lys Asn Leu Asn Thr Leu Arg
3875 3880 3885
Arg Gly Ala Val Leu Gly Phe Ile Gly Ala Thr Ile Arg Leu Gln
3890 3895 3900
Ala Gly Lys Gln Thr Glu Leu Ala Val Asn Ser Gly Leu Leu Thr
3905 3910 3915
Ala Cys Ala Phe Ser Val Asp Pro Ala Thr Thr Tyr Leu Glu Ala
3920 3925 3930
Val Lys His Gly Ala Lys Pro Val Ser Asn Cys Ile Lys Met Leu
3935 3940 3945
Ser Asn Gly Ala Gly Asn Gly Gln Ala Ile Thr Thr Ser Val Asp
3950 3955 3960
Ala Asn Thr Asn Gln Asp Ser Tyr Gly Gly Ala Ser Ile Cys Leu
3965 3970 3975
Tyr Cys Arg Ala His Val Pro His Pro Ser Met Asp Gly Tyr Cys
3980 3985 3990
Lys Phe Lys Gly Lys Cys Val Gln Val Pro Ile Gly Cys Leu Asp
3995 4000 4005
Pro Ile Arg Phe Cys Leu Glu Asn Asn Val Cys Asn Val Cys Gly
4010 4015 4020
Cys Trp Leu Gly His Gly Cys Ala Cys Asp Arg Thr Thr Ile Gln
4025 4030 4035
Ser Val Asp Ile Ser Tyr Leu Asn Arg Ala Arg Gly Ser Ser Ala
4040 4045 4050
Ala Arg Leu Glu Pro Cys Asn Gly Thr Asp Ile Asp Lys Cys Val
4055 4060 4065
Arg Ala Phe Asp Ile Tyr Asn Lys Asn Val Ser Phe Leu Gly Lys
4070 4075 4080
Cys Leu Lys Met Asn Cys Val Arg Phe Lys Asn Ala Asp Leu Lys
4085 4090 4095
Asp Gly Tyr Phe Val Ile Lys Arg Cys Thr Lys Ser Val Met Glu
4100 4105 4110
His Glu Gln Ser Met Tyr Asn Leu Leu Asn Phe Ser Gly Ala Leu
4115 4120 4125
Ala Glu His Asp Phe Phe Thr Trp Lys Asp Gly Arg Val Ile Tyr
4130 4135 4140
Gly Asn Val Ser Arg His Asn Leu Thr Lys Tyr Thr Met Met Asp
4145 4150 4155
Leu Val Tyr Ala Met Arg Asn Phe Asp Glu Gln Asn Cys Asp Val
4160 4165 4170
Leu Lys Glu Val Leu Val Leu Thr Gly Cys Cys Asp Asn Ser Tyr
4175 4180 4185
Phe Asp Ser Lys Gly Trp Tyr Asp Pro Val Glu Asn Glu Asp Ile
4190 4195 4200
His Arg Val Tyr Ala Ser Leu Gly Lys Ile Val Ala Arg Ala Met
4205 4210 4215
Leu Lys Cys Val Ala Leu Cys Asp Ala Met Val Ala Lys Gly Val
4220 4225 4230
Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Asn Gly Asn Phe
4235 4240 4245
Tyr Asp Phe Gly Asp Phe Val Val Ser Leu Pro Asn Met Gly Val
4250 4255 4260
Pro Cys Cys Thr Ser Tyr Tyr Ser Tyr Met Met Pro Ile Met Gly
4265 4270 4275
Leu Thr Asn Cys Leu Ala Ser Glu Cys Phe Val Lys Ser Asp Ile
4280 4285 4290
Phe Gly Ser Asp Phe Lys Thr Phe Asp Leu Leu Lys Tyr Asp Phe
4295 4300 4305
Thr Glu His Lys Glu Asn Leu Phe Asn Lys Tyr Phe Lys His Trp
4310 4315 4320
Ser Phe Asp Tyr His Pro Asn Cys Ser Asp Cys Tyr Asp Asp Met
4325 4330 4335
Cys Val Ile His Cys Ala Asn Phe Asn Thr Leu Phe Ala Thr Thr
4340 4345 4350
Ile Pro Gly Thr Ala Phe Gly Pro Leu Cys Arg Lys Val Phe Ile
4355 4360 4365
Asp Gly Val Pro Leu Val Thr Thr Ala Gly Tyr His Phe Lys Gln
4370 4375 4380
Leu Gly Leu Val Trp Asn Lys Asp Val Asn Thr His Ser Val Arg
4385 4390 4395
Leu Thr Ile Thr Glu Leu Leu Gln Phe Val Thr Asp Pro Ser Leu
4400 4405 4410
Ile Ile Ala Ser Ser Pro Ala Leu Val Asp Gln Arg Thr Ile Cys
4415 4420 4425
Phe Ser Val Ala Ala Leu Ser Thr Gly Leu Thr Asn Gln Val Val
4430 4435 4440
Lys Pro Gly His Phe Asn Glu Glu Phe Tyr Asn Phe Leu Arg Leu
4445 4450 4455
Arg Gly Phe Phe Asp Glu Gly Ser Glu Leu Thr Leu Lys His Phe
4460 4465 4470
Phe Phe Ala Gln Asn Gly Asp Ala Ala Val Lys Asp Phe Asp Phe
4475 4480 4485
Tyr Arg Tyr Asn Lys Pro Thr Ile Leu Asp Ile Cys Gln Ala Arg
4490 4495 4500
Val Thr Tyr Lys Ile Val Ser Arg Tyr Phe Asp Ile Tyr Glu Gly
4505 4510 4515
Gly Cys Ile Lys Ala Cys Glu Val Val Val Thr Asn Leu Asn Lys
4520 4525 4530
Ser Ala Gly Trp Pro Leu Asn Lys Phe Gly Lys Ala Ser Leu Tyr
4535 4540 4545
Tyr Glu Ser Ile Ser Tyr Glu Glu Gln Asp Ala Leu Phe Ala Leu
4550 4555 4560
Thr Lys Arg Asn Val Leu Pro Thr Met Thr Gln Leu Asn Leu Lys
4565 4570 4575
Tyr Ala Ile Ser Gly Lys Glu Arg Ala Arg Thr Val Gly Gly Val
4580 4585 4590
Ser Leu Leu Ser Thr Met Thr Thr Arg Gln Tyr His Gln Lys His
4595 4600 4605
Leu Lys Ser Ile Val Asn Thr Arg Asn Ala Thr Val Val Ile Gly
4610 4615 4620
Thr Thr Lys Phe Tyr Gly Gly Trp Asn Asn Met Leu Arg Thr Leu
4625 4630 4635
Ile Asp Gly Val Glu Asn Pro Met Leu Met Gly Trp Asp Tyr Pro
4640 4645 4650
Lys Cys Asp Arg Ala Leu Pro Asn Met Ile Arg Met Ile Ser Ala
4655 4660 4665
Met Val Leu Gly Ser Lys His Val Asn Cys Cys Thr Val Thr Asp
4670 4675 4680
Arg Phe Tyr Arg Leu Gly Asn Glu Leu Ala Gln Val Leu Thr Glu
4685 4690 4695
Val Val Tyr Ser Asn Gly Gly Phe Tyr Phe Lys Pro Gly Gly Thr
4700 4705 4710
Thr Ser Gly Asp Ala Ser Thr Ala Tyr Ala Asn Ser Ile Phe Asn
4715 4720 4725
Ile Phe Gln Ala Val Ser Ser Asn Ile Asn Arg Leu Leu Ser Val
4730 4735 4740
Pro Ser Asp Ser Cys Asn Asn Val Asn Val Arg Asp Leu Gln Arg
4745 4750 4755
Arg Leu Tyr Asp Asn Cys Tyr Arg Leu Thr Ser Val Glu Glu Ser
4760 4765 4770
Phe Ile Asp Asp Tyr Tyr Gly Tyr Leu Arg Lys His Phe Ser Met
4775 4780 4785
Met Ile Leu Ser Asp Asp Gly Val Val Cys Tyr Asn Lys Asp Tyr
4790 4795 4800
Ala Glu Leu Gly Tyr Ile Ala Asp Ile Ser Ala Phe Lys Ala Thr
4805 4810 4815
Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Thr Ser Lys Cys Trp
4820 4825 4830
Val Glu Glu Asp Leu Thr Lys Gly Pro His Glu Phe Cys Ser Gln
4835 4840 4845
His Thr Met Gln Ile Val Asp Lys Asp Gly Thr Tyr Tyr Leu Pro
4850 4855 4860
Tyr Pro Asp Pro Ser Arg Ile Leu Ser Ala Gly Val Phe Val Asp
4865 4870 4875
Asp Val Val Lys Thr Asp Ala Val Val Leu Leu Xaa Arg Tyr Val
4880 4885 4890
Ser Leu Ala Ile Asp Ala Tyr Pro Leu Ser Lys His Pro Asn Ser
4895 4900 4905
Glu Tyr Arg Lys Val Phe Tyr Val Leu Leu Asp Trp Val Lys His
4910 4915 4920
Leu Asn Lys Asn Leu Asn Glu Gly Val Leu Glu Ser Phe Ser Val
4925 4930 4935
Thr Leu Leu Asp Asn Gln Glu Asp Lys Phe Trp Cys Glu Asp Phe
4940 4945 4950
Tyr Ala Ser Met Tyr Glu Asn Ser Thr Ile Leu Gln Ala Ala Gly
4955 4960 4965
Leu Cys Val Val Cys Gly Ser Gln Thr Val Leu Arg Cys Gly Asp
4970 4975 4980
Cys Leu Arg Lys Pro Met Leu Cys Thr Lys Cys Ala Tyr Asp His
4985 4990 4995
Val Phe Gly Thr Asp His Lys Phe Ile Leu Ala Ile Thr Pro Tyr
5000 5005 5010
Val Cys Asn Ala Ser Gly Cys Gly Val Ser Asp Val Lys Lys Leu
5015 5020 5025
Tyr Leu Gly Gly Leu Asn Tyr Tyr Cys Thr Asn His Lys Pro Gln
5030 5035 5040
Leu Ser Phe Pro Leu Cys Ser Ala Gly Asn Ile Phe Gly Leu Tyr
5045 5050 5055
Lys Asn Ser Ala Thr Gly Ser Leu Asp Val Glu Val Phe Asn Arg
5060 5065 5070
Leu Ala Thr Ser Asp Trp Thr Asp Val Arg Asp Tyr Lys Leu Ala
5075 5080 5085
Asn Asp Val Lys Asp Thr Leu Arg Leu Phe Ala Ala Glu Thr Ile
5090 5095 5100
Lys Ala Lys Glu Glu Ser Val Lys Ser Ser Tyr Ala Phe Ala Thr
5105 5110 5115
Leu Lys Glu Val Val Gly Pro Lys Glu Leu Leu Leu Ser Trp Glu
5120 5125 5130
Ser Gly Lys Val Lys Pro Pro Leu Asn Arg Asn Ser Val Phe Thr
5135 5140 5145
Cys Phe Gln Ile Ser Lys Asp Ser Lys Phe Gln Ile Gly Glu Phe
5150 5155 5160
Ile Phe Glu Lys Val Glu Tyr Gly Ser Asp Thr Val Thr Tyr Lys
5165 5170 5175
Ser Thr Val Thr Thr Lys Leu Val Pro Gly Met Ile Phe Val Leu
5180 5185 5190
Thr Ser His Asn Val Gln Pro Leu Arg Ala Pro Thr Ile Ala Asn
5195 5200 5205
Gln Glu Lys Tyr Ser Ser Ile Tyr Lys Leu His Pro Ala Phe Asn
5210 5215 5220
Val Ser Asp Ala Tyr Ala Asn Leu Val Pro Tyr Tyr Gln Leu Ile
5225 5230 5235
Gly Lys Gln Lys Ile Thr Thr Ile Gln Gly Pro Pro Gly Ser Gly
5240 5245 5250
Lys Ser His Cys Ser Ile Gly Leu Gly Leu Tyr Tyr Pro Gly Ala
5255 5260 5265
Arg Ile Val Phe Val Ala Cys Ala His Ala Ala Val Asp Ser Leu
5270 5275 5280
Cys Ala Lys Ala Met Thr Val Tyr Ser Ile Asp Lys Cys Thr Arg
5285 5290 5295
Ile Ile Pro Ala Arg Ala Arg Val Glu Cys Tyr Ser Gly Phe Lys
5300 5305 5310
Pro Asn Asn Thr Ser Ala Gln Tyr Ile Phe Ser Thr Val Asn Ala
5315 5320 5325
Leu Pro Glu Cys Asn Ala Asp Ile Val Val Val Asp Glu Val Ser
5330 5335 5340
Met Cys Thr Asn Tyr Asp Leu Ser Val Ile Asn Gln Arg Leu Ser
5345 5350 5355
Tyr Lys His Ile Val Tyr Val Gly Asp Pro Gln Gln Leu Pro Ala
5360 5365 5370
Pro Arg Val Met Ile Thr Lys Gly Val Met Glu Pro Val Asp Tyr
5375 5380 5385
Asn Val Val Thr Gln Arg Met Cys Ala Ile Gly Pro Asp Val Phe
5390 5395 5400
Leu His Lys Cys Tyr Arg Cys Pro Ala Glu Ile Val Asn Thr Val
5405 5410 5415
Ser Glu Leu Val Tyr Glu Asn Lys Phe Val Pro Val Lys Pro Ala
5420 5425 5430
Ser Lys Gln Cys Phe Lys Ile Phe Phe Lys Gly Asn Val Gln Val
5435 5440 5445
Asp Asn Gly Ser Ser Ile Asn Arg Lys Gln Leu Glu Ile Val Lys
5450 5455 5460
Leu Phe Leu Val Lys Asn Pro Ser Trp Ser Lys Ala Val Phe Ile
5465 5470 5475
Ser Pro Tyr Asn Ser Gln Asn Tyr Val Ala Ser Arg Phe Leu Gly
5480 5485 5490
Leu Gln Ile Gln Thr Val Asp Ser Ser Gln Gly Ser Glu Tyr Asp
5495 5500 5505
Tyr Val Ile Tyr Ala Gln Thr Ser Asp Thr Ala His Ala Cys Asn
5510 5515 5520
Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys Lys Gly Ile
5525 5530 5535
Phe Cys Val Met Cys Asp Lys Thr Leu Phe Asp Ser Leu Lys Phe
5540 5545 5550
Phe Glu Ile Lys His Ala Asp Leu His Ser Ser Gln Val Cys Gly
5555 5560 5565
Leu Phe Lys Asn Cys Thr Arg Thr Pro Leu Asn Leu Pro Pro Thr
5570 5575 5580
His Ala His Thr Phe Leu Ser Leu Ser Asp Gln Phe Lys Thr Thr
5585 5590 5595
Gly Asp Leu Ala Val Gln Ile Gly Ser Asn Asn Val Cys Thr Tyr
5600 5605 5610
Glu His Val Ile Ser Phe Met Gly Phe Arg Phe Asp Ile Ser Ile
5615 5620 5625
Pro Gly Ser His Ser Leu Phe Cys Thr Arg Asp Phe Ala Ile Arg
5630 5635 5640
Asn Val Arg Gly Trp Leu Gly Met Asp Val Glu Ser Ala His Val
5645 5650 5655
Cys Gly Asp Asn Ile Gly Thr Asn Val Pro Leu Gln Val Gly Phe
5660 5665 5670
Ser Asn Gly Val Asn Phe Val Val Gln Thr Glu Gly Cys Val Ser
5675 5680 5685
Thr Asn Phe Gly Asp Val Ile Lys Pro Val Cys Ala Lys Ser Pro
5690 5695 5700
Pro Gly Glu Gln Phe Arg His Leu Val Pro Phe Leu Arg Lys Gly
5705 5710 5715
Gln Pro Trp Leu Ile Val Arg Arg Arg Ile Val Gln Met Ile Ser
5720 5725 5730
Asp Tyr Leu Ser Asn Leu Ser Asp Ile Leu Val Phe Val Leu Trp
5735 5740 5745
Ala Gly Ser Leu Glu Leu Thr Thr Met Arg Tyr Phe Val Lys Ile
5750 5755 5760
Gly Pro Ile Lys Tyr Cys Tyr Cys Gly Asn Ser Ala Thr Cys Tyr
5765 5770 5775
Asn Ser Val Ser Asn Glu Tyr Cys Cys Phe Lys His Ala Leu Gly
5780 5785 5790
Cys Asp Tyr Val Tyr Asn Pro Tyr Ala Phe Asp Ile Gln Gln Trp
5795 5800 5805
Gly Tyr Val Gly Ser Leu Ser Gln Asn His His Thr Phe Cys Asn
5810 5815 5820
Ile His Arg Asn Glu His Asp Ala Ser Gly Asp Ala Val Met Thr
5825 5830 5835
Arg Cys Leu Ala Val His Asp Cys Phe Val Lys Asn Val Asp Trp
5840 5845 5850
Thr Val Thr Tyr Pro Phe Ile Ala Asn Glu Lys Phe Ile Asn Gly
5855 5860 5865
Cys Gly Arg Asn Val Gln Gly His Val Val Arg Ala Ala Leu Lys
5870 5875 5880
Leu Tyr Lys Pro Ser Val Ile His Asp Ile Gly Asn Pro Lys Gly
5885 5890 5895
Val Arg Cys Ala Val Thr Asp Ala Lys Trp Tyr Cys Tyr Asp Lys
5900 5905 5910
Gln Pro Val Asn Ser Asn Val Lys Leu Leu Asp Tyr Asp Tyr Ala
5915 5920 5925
Thr His Gly Gln Leu Asp Gly Leu Cys Leu Phe Trp Asn Cys Asn
5930 5935 5940
Val Asp Met Tyr Pro Glu Phe Ser Ile Val Cys Arg Phe Asp Thr
5945 5950 5955
Arg Thr Arg Ser Val Phe Asn Leu Glu Gly Val Asn Gly Gly Ser
5960 5965 5970
Leu Tyr Val Asn Lys His Ala Phe His Thr Pro Ala Tyr Asp Lys
5975 5980 5985
Arg Ala Phe Val Lys Leu Lys Pro Met Pro Phe Phe Tyr Phe Asp
5990 5995 6000
Asp Ser Asp Cys Asp Val Val Gln Glu Gln Val Asn Tyr Val Pro
6005 6010 6015
Leu Arg Ala Ser Ser Cys Val Thr Arg Cys Asn Ile Gly Gly Ala
6020 6025 6030
Val Cys Ser Lys His Ala Asn Leu Tyr Gln Lys Tyr Val Glu Ala
6035 6040 6045
Tyr Asn Thr Phe Thr Gln Ala Gly Phe Asn Ile Trp Val Pro His
6050 6055 6060
Ser Phe Asp Val Tyr Asn Leu Trp Gln Ile Phe Ile Glu Thr Asn
6065 6070 6075
Leu Gln Ser Leu Glu Asn Ile Ala Phe Asn Val Val Lys Lys Gly
6080 6085 6090
Cys Phe Thr Gly Val Asp Gly Glu Leu Pro Val Ala Val Val Asn
6095 6100 6105
Asp Lys Val Phe Val Arg Tyr Gly Asp Val Asp Asn Leu Val Phe
6110 6115 6120
Thr Asn Lys Thr Thr Leu Pro Thr Asn Val Ala Phe Glu Leu Phe
6125 6130 6135
Ala Lys Arg Lys Met Gly Leu Thr Pro Pro Leu Ser Ile Leu Lys
6140 6145 6150
Asn Leu Gly Val Val Ala Thr Tyr Lys Phe Val Leu Trp Asp Tyr
6155 6160 6165
Glu Ala Glu Arg Pro Phe Thr Ser Tyr Thr Lys Ser Val Cys Lys
6170 6175 6180
Tyr Thr Asp Phe Asn Glu Asp Val Cys Val Cys Phe Asp Asn Ser
6185 6190 6195
Ile Gln Gly Ser Tyr Glu Arg Phe Thr Leu Thr Thr Asn Ala Val
6200 6205 6210
Leu Phe Ser Thr Val Val Ile Lys Asn Leu Thr Pro Ile Lys Leu
6215 6220 6225
Asn Phe Gly Met Leu Asn Gly Met Pro Val Ser Ser Ile Lys Ser
6230 6235 6240
Asp Lys Gly Val Glu Lys Leu Val Asn Trp Tyr Thr Tyr Val Arg
6245 6250 6255
Lys Asn Gly Gln Phe Gln Asp His Tyr Asp Gly Phe Tyr Thr Gln
6260 6265 6270
Gly Arg Asn Leu Ser Asp Phe Thr Pro Arg Ser Asp Met Glu Tyr
6275 6280 6285
Asp Phe Leu Asn Met Asp Met Gly Val Phe Ile Asn Lys Tyr Gly
6290 6295 6300
Leu Glu Asp Phe Asn Phe Glu His Val Val Tyr Gly Asp Val Ser
6305 6310 6315
Lys Thr Thr Leu Gly Gly Leu His Leu Leu Ile Ser Gln Phe Arg
6320 6325 6330
Leu Ser Lys Met Gly Val Leu Lys Ala Asp Asp Phe Val Thr Ala
6335 6340 6345
Ser Asp Thr Thr Leu Arg Cys Cys Thr Val Thr Tyr Leu Asn Glu
6350 6355 6360
Leu Ser Ser Lys Val Val Cys Thr Tyr Met Asp Leu Leu Leu Asp
6365 6370 6375
Asp Phe Val Thr Ile Leu Lys Ser Leu Asp Leu Gly Val Ile Ser
6380 6385 6390
Lys Val His Glu Val Ile Ile Asp Asn Lys Pro Tyr Arg Trp Met
6395 6400 6405
Leu Trp Cys Lys Asp Asn His Leu Ser Thr Phe Tyr Pro Gln Leu
6410 6415 6420
Gln Ser Ala Glu Trp Lys Cys Gly Tyr Ala Met Pro Gln Ile Tyr
6425 6430 6435
Lys Leu Gln Arg Met Cys Leu Glu Pro Cys Asn Leu Tyr Asn Tyr
6440 6445 6450
Gly Ala Gly Ile Lys Leu Pro Ser Gly Ile Met Leu Asn Val Val
6455 6460 6465
Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Ser Thr Thr Met Cys
6470 6475 6480
Val Pro His Asn Met Arg Val Leu His Tyr Gly Ala Gly Ser Asp
6485 6490 6495
Lys Gly Val Ala Pro Gly Thr Thr Val Leu Lys Arg Trp Leu Pro
6500 6505 6510
Pro Asp Ala Ile Ile Ile Asp Asn Asp Ile Asn Asp Tyr Val Ser
6515 6520 6525
Asp Ala Asp Phe Ser Ile Thr Gly Asp Cys Ala Thr Val Tyr Leu
6530 6535 6540
Glu Asp Lys Phe Asp Leu Leu Ile Ser Asp Met Tyr Asp Gly Arg
6545 6550 6555
Ile Lys Phe Cys Asp Gly Glu Asn Val Ser Lys Asp Gly Phe Phe
6560 6565 6570
Thr Tyr Leu Asn Gly Val Ile Arg Glu Lys Leu Ala Ile Gly Gly
6575 6580 6585
Ser Val Ala Ile Lys Ile Thr Glu Tyr Ser Trp Asn Lys Tyr Leu
6590 6595 6600
Tyr Glu Leu Ile Gln Arg Phe Ala Phe Trp Thr Leu Phe Cys Thr
6605 6610 6615
Ser Val Asn Thr Ser Ser Ser Glu Ala Phe Leu Ile Gly Ile Asn
6620 6625 6630
Tyr Leu Gly Asp Phe Ile Gln Gly Pro Phe Ile Ala Gly Asn Thr
6635 6640 6645
Val His Ala Asn Tyr Ile Phe Trp Arg Asn Ser Thr Ile Met Ser
6650 6655 6660
Leu Ser Tyr Asn Ser Val Leu Asp Leu Ser Lys Phe Glu Cys Lys
6665 6670 6675
His Lys Ala Thr Val Val Val Thr Leu Lys Asp Ser Asp Val Asn
6680 6685 6690
Asp Met Val Leu Ser Leu Ile Lys Ser Gly Arg Leu Leu Leu Arg
6695 6700 6705
Asn Ser Gly Arg Phe Gly Gly Phe Ser Asn His Leu Val Ser Thr
6710 6715 6720
Lys
<210> 24
<211> 7123
<212> PRT
<213> murine hepatitis virus
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<400> 24
Met Ala Lys Met Gly Lys Tyr Gly Leu Gly Phe Lys Trp Ala Pro Glu
1 5 10 15
Phe Pro Trp Met Leu Pro Asn Ala Ser Glu Lys Leu Gly Ser Pro Glu
20 25 30
Arg Ser Glu Glu Asp Gly Phe Cys Pro Ser Ala Ala Gln Glu Pro Lys
35 40 45
Thr Lys Gly Lys Thr Leu Ile Asn His Val Arg Val Asp Cys Ser Arg
50 55 60
Leu Pro Ala Leu Glu Cys Cys Val Gln Ser Ala Ile Ile Arg Asp Ile
65 70 75 80
Phe Val Asp Glu Asp Pro Leu Asn Val Glu Ala Ser Thr Met Met Ala
85 90 95
Leu Gln Phe Gly Ser Ala Val Leu Val Lys Pro Ser Lys Arg Leu Ser
100 105 110
Ile Gln Ala Trp Ala Lys Leu Gly Val Leu Pro Lys Thr Pro Ala Met
115 120 125
Gly Leu Phe Lys Arg Phe Cys Leu Cys Asn Thr Arg Glu Cys Val Cys
130 135 140
Asp Ala His Val Ala Phe Gln Leu Phe Thr Val Gln Pro Asp Gly Val
145 150 155 160
Cys Leu Gly Asn Gly Arg Phe Ile Gly Trp Phe Val Pro Val Thr Ala
165 170 175
Ile Pro Ala Tyr Ala Lys Gln Trp Leu Gln Pro Trp Ser Ile Leu Leu
180 185 190
Arg Lys Gly Gly Asn Lys Gly Ser Val Thr Ser Gly His Phe Arg Arg
195 200 205
Ala Val Thr Met Pro Val Tyr Asp Phe Asn Val Glu Asp Ala Cys Glu
210 215 220
Glu Val His Leu Asn Pro Lys Gly Lys Tyr Ser Arg Lys Ala Tyr Ala
225 230 235 240
Leu Leu Lys Gly Tyr Arg Gly Val Lys Ser Ile Leu Phe Leu Asp Gln
245 250 255
Tyr Gly Cys Asp Tyr Thr Gly Arg Leu Ala Lys Gly Leu Glu Asp Tyr
260 265 270
Gly Asp Cys Thr Leu Glu Glu Met Lys Glu Leu Phe Pro Val Trp Cys
275 280 285
Asp Ser Leu Asp Asn Glu Val Val Val Ala Trp His Val Asp Arg Asp
290 295 300
Pro Arg Ala Val Met Arg Leu Gln Thr Leu Ala Thr Ile Arg Ser Ile
305 310 315 320
Gly Tyr Val Gly Gln Pro Thr Glu Asp Leu Val Asp Gly Asp Val Val
325 330 335
Val Arg Glu Pro Ala His Leu Leu Ala Ala Asn Ala Ile Val Lys Arg
340 345 350
Leu Pro Arg Leu Val Glu Thr Met Leu Tyr Thr Asp Ser Ser Val Thr
355 360 365
Glu Phe Cys Tyr Lys Thr Lys Leu Cys Asp Cys Gly Phe Ile Thr Gln
370 375 380
Phe Gly Tyr Val Asp Cys Cys Gly Asp Ala Cys Asp Phe Arg Gly Trp
385 390 395 400
Val Pro Gly Asn Met Met Asp Gly Phe Leu Cys Pro Gly Cys Ser Lys
405 410 415
Ser Tyr Met Pro Trp Glu Leu Glu Ala Gln Ser Ser Gly Val Ile Pro
420 425 430
Lys Gly Gly Val Leu Phe Thr Gln Ser Thr Asp Thr Val Asn Arg Glu
435 440 445
Ser Phe Lys Leu Tyr Gly His Ala Val Val Pro Phe Gly Ser Ala Val
450 455 460
Tyr Trp Ser Pro Tyr Pro Gly Met Trp Leu Pro Val Ile Trp Ser Ser
465 470 475 480
Val Lys Ser Tyr Ala Asp Leu Thr Tyr Thr Gly Val Val Gly Cys Lys
485 490 495
Ala Ile Val Gln Glu Thr Asp Ala Ile Cys Arg Ser Leu Tyr Met Asp
500 505 510
Tyr Val Gln His Lys Cys Gly Asn Leu Glu Gln Arg Ala Ile Leu Gly
515 520 525
Leu Asp Asp Val Tyr His Arg Gln Leu Leu Val Asn Arg Gly Asp Tyr
530 535 540
Ser Leu Leu Leu Glu Asn Val Asp Leu Phe Val Lys Arg Arg Ala Glu
545 550 555 560
Phe Ala Cys Lys Phe Ala Thr Cys Gly Asp Gly Leu Val Pro Leu Leu
565 570 575
Leu Asp Gly Leu Val Pro Arg Ser Tyr Tyr Leu Ile Lys Ser Gly Gln
580 585 590
Ala Phe Thr Ser Met Met Val Asn Phe Ser His Glu Val Thr Asp Met
595 600 605
Cys Met Asp Met Ala Leu Leu Phe Met His Asp Val Lys Val Ala Thr
610 615 620
Lys Tyr Val Lys Lys Val Thr Gly Lys Leu Ala Val Arg Phe Lys Ala
625 630 635 640
Leu Gly Val Ala Val Val Arg Lys Ile Thr Glu Trp Phe Asp Leu Ala
645 650 655
Val Asp Thr Ala Ala Ser Ala Ala Gly Trp Leu Cys Tyr Gln Leu Val
660 665 670
Asn Gly Leu Phe Ala Val Ala Asn Gly Gly Ile Thr Phe Leu Ser Asp
675 680 685
Val Pro Glu Leu Val Lys Asn Phe Val Asp Lys Phe Lys Val Phe Phe
690 695 700
Lys Val Leu Ile Asp Ser Met Ser Val Ser Val Leu Ser Gly Leu Thr
705 710 715 720
Val Val Lys Thr Ala Ser Asn Arg Val Cys Leu Ala Gly Cys Lys Val
725 730 735
Tyr Glu Val Val Gln Lys Arg Leu Ser Ala Tyr Val Met Pro Val Gly
740 745 750
Cys Asn Glu Ala Thr Cys Leu Val Gly Glu Ile Glu Pro Ala Val Val
755 760 765
Glu Asp Asp Val Val Asp Val Val Lys Ala Pro Leu Thr Tyr Gln Gly
770 775 780
Cys Cys Lys Pro Pro Thr Ser Phe Glu Lys Ile Cys Val Val Asp Lys
785 790 795 800
Leu Tyr Met Ala Lys Cys Gly Asp Gln Phe Tyr Pro Val Val Val Asp
805 810 815
Asn Asp Thr Ile Gly Val Leu Asp Gln Cys Trp Arg Phe Pro Cys Ala
820 825 830
Gly Lys Lys Val Glu Phe Asn Asp Lys Pro Lys Val Lys Glu Ile Pro
835 840 845
Ser Thr Arg Lys Ile Lys Ile Asn Phe Ala Leu Asp Ala Thr Phe Asp
850 855 860
Ser Val Leu Ser Lys Ala Cys Ser Glu Phe Glu Val Asp Lys Asp Val
865 870 875 880
Thr Leu Asp Glu Leu Leu Asp Val Val Leu Asp Ala Val Glu Ser Thr
885 890 895
Leu Ser Pro Cys Lys Glu His Asp Val Ile Gly Thr Lys Val Cys Ala
900 905 910
Leu Leu Asn Arg Leu Ala Glu Asp Tyr Val Tyr Leu Phe Asp Glu Gly
915 920 925
Gly Glu Glu Val Ile Ala Pro Lys Met Tyr Cys Ser Phe Ser Ala Pro
930 935 940
Asp Asp Glu Asp Cys Val Ala Ala Asp Val Val Asp Ala Asp Glu Asn
945 950 955 960
Gln Gly Asp Asp Ala Asp Asp Ser Ala Ala Leu Val Thr Asp Thr Gln
965 970 975
Glu Glu Asp Gly Val Ala Lys Gly Gln Val Gly Val Ala Glu Ser Asp
980 985 990
Ala Arg Leu Asp Gln Val Glu Ala Phe Asp Ile Glu Lys Val Glu Asp
995 1000 1005
Pro Ile Leu Asn Glu Leu Ser Ala Glu Leu Asn Ala Pro Ala Asp
1010 1015 1020
Lys Thr Tyr Glu Asp Val Leu Ala Phe Asp Ala Ile Tyr Ser Glu
1025 1030 1035
Ala Leu Ser Ala Phe Tyr Ala Val Pro Gly Asp Glu Thr His Phe
1040 1045 1050
Lys Val Cys Gly Phe Tyr Ser Pro Ala Ile Glu Arg Thr Asn Cys
1055 1060 1065
Trp Leu Arg Ser Thr Leu Ile Val Met Gln Ser Leu Pro Leu Glu
1070 1075 1080
Phe Lys Asp Leu Glu Met Gln Lys Leu Trp Leu Ser Tyr Lys Ser
1085 1090 1095
Ser Tyr Asn Lys Glu Phe Val Asp Lys Leu Val Lys Ser Val Pro
1100 1105 1110
Lys Ser Ile Ile Leu Pro Gln Gly Gly Tyr Val Ala Asp Phe Ala
1115 1120 1125
Tyr Phe Phe Leu Ser Gln Cys Ser Phe Lys Ala Tyr Ala Asn Trp
1130 1135 1140
Arg Cys Leu Lys Cys Asp Met Asp Leu Lys Leu Gln Gly Leu Asp
1145 1150 1155
Ala Met Phe Phe Tyr Gly Asp Val Val Ser His Val Cys Lys Cys
1160 1165 1170
Gly Thr Gly Met Thr Leu Leu Ser Ala Asp Ile Pro Tyr Thr Leu
1175 1180 1185
His Phe Gly Leu Arg Asp Asp Lys Phe Cys Ala Phe Tyr Thr Pro
1190 1195 1200
Arg Lys Val Phe Arg Ala Ala Cys Val Val Asp Val Asn Asp Cys
1205 1210 1215
His Ser Met Ala Val Val Asp Gly Lys Gln Ile Asp Gly Lys Val
1220 1225 1230
Val Thr Lys Phe Asn Gly Asp Lys Tyr Asp Phe Met Val Gly His
1235 1240 1245
Gly Met Ala Phe Ser Met Ser Ala Phe Glu Ile Ala Gln Leu Tyr
1250 1255 1260
Gly Ser Cys Ile Thr Pro Asn Val Cys Phe Val Lys Gly Asp Val
1265 1270 1275
Ile Lys Val Leu Arg Arg Val Gly Ala Glu Val Ile Val Asn Pro
1280 1285 1290
Ala Asn Gly Arg Met Ala His Gly Ala Gly Val Ala Gly Ala Ile
1295 1300 1305
Ala Lys Ala Ala Gly Lys Ser Phe Ile Lys Glu Thr Ala Asp Met
1310 1315 1320
Val Lys Asn Gln Gly Val Cys Gln Val Gly Glu Cys Tyr Glu Ser
1325 1330 1335
Thr Gly Gly Asn Leu Cys Lys Thr Val Leu Asn Ile Val Gly Pro
1340 1345 1350
Asp Ala Arg Gly His Gly Lys Gln Cys Tyr Ser Phe Leu Glu Arg
1355 1360 1365
Ala Tyr Gln His Ile Asn Lys Cys Asp Asp Val Val Thr Thr Leu
1370 1375 1380
Ile Ser Ala Gly Ile Phe Ser Val Pro Thr Asp Val Ser Leu Thr
1385 1390 1395
Tyr Leu Ile Gly Val Val Thr Lys Asn Val Ile Leu Val Ser Asn
1400 1405 1410
Asn Lys Asp Asp Phe Asp Val Ile Glu Lys Cys Gln Val Thr Ser
1415 1420 1425
Ile Ala Gly Thr Lys Ala Leu Ser Leu Gln Leu Ala Lys Asn Leu
1430 1435 1440
Cys Arg Asp Val Lys Phe Glu Thr Asn Ala Cys Asp Ser Leu Phe
1445 1450 1455
Ser Asp Ser Cys Phe Val Ser Ser Tyr Asp Val Leu Gln Glu Val
1460 1465 1470
Glu Leu Leu Arg His Asp Ile Gln Leu Asp Asp Asp Ala Arg Val
1475 1480 1485
Phe Val Gln Ala His Met Asp Asn Leu Pro Ala Asp Trp Arg Leu
1490 1495 1500
Val Asn Lys Phe Asp Ser Val Asp Gly Val Arg Thr Val Lys Tyr
1505 1510 1515
Phe Glu Cys Pro Gly Glu Ile Phe Val Ser Ser Gln Gly Lys Lys
1520 1525 1530
Phe Gly Tyr Val Gln Asn Gly Ser Phe Lys Val Ala Ser Val Ser
1535 1540 1545
Gln Ile Arg Ala Leu Leu Ala Asn Lys Val Asp Val Leu Cys Thr
1550 1555 1560
Val Asp Gly Val Asn Phe Arg Ser Cys Cys Val Ala Glu Gly Glu
1565 1570 1575
Val Phe Gly Lys Thr Leu Gly Ser Val Phe Cys Asp Gly Ile Asn
1580 1585 1590
Val Thr Lys Val Arg Cys Ser Ala Ile His Lys Gly Lys Val Phe
1595 1600 1605
Phe Gln Tyr Ser Gly Leu Ser Ala Ala Asp Leu Val Ala Val Thr
1610 1615 1620
Asp Ala Phe Gly Phe Asp Glu Pro Gln Leu Leu Lys Tyr Tyr Asn
1625 1630 1635
Met Leu Gly Met Cys Lys Trp Pro Val Val Val Cys Gly Asn Tyr
1640 1645 1650
Phe Ala Phe Lys Gln Ser Asn Asn Asn Cys Tyr Ile Asn Val Ala
1655 1660 1665
Cys Leu Met Leu Gln His Leu Ser Leu Lys Phe His Lys Trp Gln
1670 1675 1680
Trp Gln Glu Ala Trp Asn Glu Phe Arg Ser Gly Lys Pro Leu Arg
1685 1690 1695
Phe Val Ser Leu Val Leu Ala Lys Gly Ser Phe Lys Phe Asn Glu
1700 1705 1710
Pro Ser Asp Ser Thr Asp Phe Met Arg Val Val Leu Arg Glu Ala
1715 1720 1725
Asp Leu Ser Gly Ala Thr Cys Asp Phe Glu Phe Val Cys Lys Cys
1730 1735 1740
Gly Val Lys Gln Glu Gln Arg Lys Gly Val Asp Ala Val Met His
1745 1750 1755
Phe Gly Thr Leu Asp Lys Gly Asp Leu Ala Lys Gly Tyr Thr Ile
1760 1765 1770
Ala Cys Thr Cys Gly Asn Lys Leu Val His Cys Thr Gln Leu Asn
1775 1780 1785
Val Pro Phe Leu Ile Cys Ser Asn Lys Pro Glu Gly Lys Lys Leu
1790 1795 1800
Pro Asp Asp Val Val Ala Ala Asn Ile Phe Thr Gly Gly Ser Leu
1805 1810 1815
Gly His Tyr Thr His Val Lys Cys Lys Pro Lys Tyr Gln Leu Tyr
1820 1825 1830
Asp Ala Cys Asn Val Ser Lys Val Ser Glu Ala Lys Gly Asn Phe
1835 1840 1845
Thr Asp Cys Leu Tyr Leu Lys Asn Leu Lys Gln Thr Phe Ser Ser
1850 1855 1860
Lys Leu Thr Thr Phe Tyr Leu Asp Asp Val Lys Cys Val Glu Tyr
1865 1870 1875
Asn Pro Asp Leu Ser Gln Tyr Tyr Cys Glu Ser Gly Lys Tyr Tyr
1880 1885 1890
Thr Lys Pro Ile Ile Lys Ala Gln Phe Arg Thr Phe Glu Lys Val
1895 1900 1905
Glu Gly Val Tyr Thr Asn Phe Lys Leu Val Gly His Ser Ile Ala
1910 1915 1920
Glu Lys Phe Asn Ala Lys Leu Gly Phe Asp Cys Asn Ser Pro Phe
1925 1930 1935
Thr Glu Tyr Lys Ile Thr Glu Trp Pro Thr Ala Thr Gly Asp Val
1940 1945 1950
Val Leu Ala Ser Asp Asp Leu Tyr Val Ser Arg Tyr Ser Gly Gly
1955 1960 1965
Cys Val Thr Phe Gly Lys Pro Val Ile Trp Leu Gly His Glu Glu
1970 1975 1980
Ala Ser Leu Lys Ser Leu Thr Tyr Phe Asn Arg Pro Ser Val Val
1985 1990 1995
Cys Glu Asn Lys Phe Asn Val Leu Pro Val Asp Val Ser Glu Pro
2000 2005 2010
Thr Asp Lys Gly Pro Val Pro Ala Ala Val Leu Val Thr Gly Ala
2015 2020 2025
Leu Ser Gly Ala Ala Thr Ala Pro Gly Thr Ala Lys Glu Gln Lys
2030 2035 2040
Val Cys Ala Ser Asp Ser Val Val Asp Gln Val Val Ser Gly Phe
2045 2050 2055
Leu Ser Asp Leu Ser Gly Ala Thr Val Asp Val Lys Glu Val Lys
2060 2065 2070
Leu Asn Gly Val Lys Lys Pro Ile Lys Val Glu Asp Ser Val Val
2075 2080 2085
Val Asn Asp Pro Thr Ser Glu Thr Lys Val Val Lys Ser Leu Ser
2090 2095 2100
Ile Val Asp Val Tyr Asp Met Phe Leu Thr Gly Cys Arg Tyr Val
2105 2110 2115
Val Trp Met Ala Asn Glu Leu Ser Arg Leu Val Asn Ser Pro Thr
2120 2125 2130
Val Arg Glu Tyr Val Lys Trp Gly Met Thr Lys Ile Val Ile Pro
2135 2140 2145
Ala Lys Leu Val Leu Leu Arg Asp Glu Lys Gln Glu Phe Val Ala
2150 2155 2160
Pro Lys Val Val Lys Ala Lys Val Ile Ala Cys Tyr Ser Ala Val
2165 2170 2175
Lys Trp Phe Phe Leu Tyr Cys Phe Ser Trp Ile Lys Phe Asn Thr
2180 2185 2190
Asp Asn Lys Val Ile Tyr Thr Thr Glu Val Ala Ser Lys Leu Thr
2195 2200 2205
Phe Asn Leu Cys Cys Leu Ala Phe Lys Asn Ala Leu Gln Thr Phe
2210 2215 2220
Asn Trp Asn Val Val Ser Arg Gly Phe Phe Leu Val Ala Thr Val
2225 2230 2235
Phe Leu Leu Trp Phe Asn Phe Leu Tyr Ala Asn Val Ile Leu Ser
2240 2245 2250
Asp Phe Tyr Leu Pro Asn Ile Gly Phe Phe Pro Thr Phe Val Gly
2255 2260 2265
Gln Ile Val Ala Trp Val Lys Thr Thr Phe Gly Ile Phe Thr Leu
2270 2275 2280
Cys Asp Leu Tyr Gln Val Ser Asp Val Gly Tyr Arg Ser Ser Phe
2285 2290 2295
Cys Asn Gly Ser Met Val Cys Glu Leu Cys Phe Ser Gly Phe Asp
2300 2305 2310
Met Leu Asp Asn Tyr Asp Ala Ile Asn Val Val Gln His Val Val
2315 2320 2325
Asp Arg Arg Val Ser Phe Asp Tyr Ile Ser Leu Phe Lys Leu Val
2330 2335 2340
Val Glu Leu Val Ile Gly Tyr Ser Leu Tyr Thr Val Cys Phe Tyr
2345 2350 2355
Pro Leu Phe Gly Leu Ile Gly Met Gln Leu Leu Thr Thr Trp Leu
2360 2365 2370
Pro Glu Phe Phe Met Leu Glu Thr Met His Trp Ser Ala Arg Phe
2375 2380 2385
Phe Val Phe Val Ala Asn Met Leu Pro Ala Phe Thr Leu Leu Arg
2390 2395 2400
Phe Tyr Ile Val Val Thr Ala Met Tyr Lys Ile Phe Cys Leu Cys
2405 2410 2415
Arg His Val Met Tyr Gly Cys Ser Arg Pro Gly Cys Leu Phe Cys
2420 2425 2430
Tyr Lys Arg Asn Arg Ser Val Arg Val Lys Cys Ser Thr Val Val
2435 2440 2445
Gly Gly Thr Leu Arg Tyr Tyr Asp Val Met Ala Asn Gly Gly Thr
2450 2455 2460
Gly Phe Cys Ala Lys His Gln Trp Asn Cys Leu Asn Cys Ser Ala
2465 2470 2475
Phe Gly Pro Gly Asn Thr Phe Ile Thr His Glu Ala Ala Ala Asp
2480 2485 2490
Leu Ser Lys Glu Leu Lys Arg Pro Val Asn Pro Thr Asp Ser Ala
2495 2500 2505
Tyr Tyr Leu Val Thr Glu Val Lys Gln Val Gly Cys Ser Met Arg
2510 2515 2520
Leu Phe Tyr Glu Arg Asp Gly Gln Arg Val Tyr Asp Asp Val Ser
2525 2530 2535
Ala Ser Leu Phe Val Asp Met Asn Gly Leu Leu His Ser Lys Val
2540 2545 2550
Lys Gly Val Pro Glu Thr His Val Val Val Val Glu Asn Glu Ala
2555 2560 2565
Asp Lys Ala Gly Phe Leu Asn Ala Ala Val Phe Tyr Ala Gln Ser
2570 2575 2580
Leu Tyr Arg Pro Met Leu Leu Val Glu Lys Lys Leu Ile Thr Thr
2585 2590 2595
Ala Asn Thr Gly Leu Ser Val Ser Gln Thr Met Phe Asp Leu Tyr
2600 2605 2610
Val Asp Ser Leu Leu Gly Val Leu Asp Val Asp Arg Lys Ser Leu
2615 2620 2625
Thr Ser Phe Val Asn Ala Ala His Asn Ser Leu Lys Glu Gly Val
2630 2635 2640
Gln Leu Glu Gln Val Met Asp Thr Phe Ile Gly Cys Ala Arg Arg
2645 2650 2655
Lys Cys Ala Ile Asp Ser Asp Val Glu Thr Lys Ser Ile Thr Lys
2660 2665 2670
Ser Ile Met Ser Ala Val Asn Ala Gly Val Asp Phe Thr Asp Glu
2675 2680 2685
Ser Cys Asn Asn Leu Val Pro Thr Tyr Val Lys Ser Asp Thr Ile
2690 2695 2700
Val Ala Ala Asp Leu Gly Val Leu Ile Gln Asn Asn Ala Lys His
2705 2710 2715
Val Gln Ala Asn Val Ala Lys Ala Ala Asn Val Ala Cys Ile Trp
2720 2725 2730
Ser Val Asp Ala Phe Asn Gln Leu Ser Ala Asp Leu Gln His Arg
2735 2740 2745
Leu Arg Lys Ala Cys Ser Lys Thr Gly Leu Lys Ile Lys Leu Thr
2750 2755 2760
Tyr Asn Lys Gln Glu Ala Asn Val Pro Ile Leu Thr Thr Pro Phe
2765 2770 2775
Ser Leu Lys Gly Gly Ala Val Phe Ser Lys Val Leu Gln Trp Leu
2780 2785 2790
Phe Val Val Asn Leu Ile Cys Phe Ile Val Leu Trp Ala Leu Met
2795 2800 2805
Pro Thr Tyr Ala Val His Lys Ser Asp Met Gln Leu Pro Leu Tyr
2810 2815 2820
Ala Ser Phe Lys Val Ile Asp Asn Gly Val Leu Arg Asp Val Thr
2825 2830 2835
Val Thr Asp Ala Cys Phe Ala Asn Lys Phe Ile Gln Phe Asp Gln
2840 2845 2850
Trp Tyr Glu Ser Thr Phe Gly Leu Val Tyr Tyr Arg Asn Ser Arg
2855 2860 2865
Ala Cys Pro Val Val Val Ala Val Ile Asp Gln Asp Ile Gly Tyr
2870 2875 2880
Thr Leu Phe Asn Val Pro Thr Lys Val Leu Arg Tyr Gly Phe His
2885 2890 2895
Val Leu His Phe Ile Thr His Ala Phe Ala Thr Asp Ser Val Gln
2900 2905 2910
Cys Tyr Thr Pro His Met Gln Ile Pro Tyr Asp Asn Phe Tyr Ala
2915 2920 2925
Ser Gly Cys Val Leu Ser Ser Leu Cys Thr Met Leu Ala His Ala
2930 2935 2940
Asp Gly Thr Pro His Pro Tyr Cys Tyr Thr Glu Gly Ile Met His
2945 2950 2955
Asn Ala Ser Leu Tyr Asp Ser Leu Ala Pro His Val Arg Tyr Asn
2960 2965 2970
Leu Ala Asn Ser Asn Gly Tyr Ile Arg Phe Pro Glu Val Val Ser
2975 2980 2985
Glu Gly Ile Val Arg Ile Val Arg Thr Arg Ser Met Thr Tyr Cys
2990 2995 3000
Arg Val Gly Leu Cys Glu Asp Ala Glu Glu Gly Val Cys Phe Asn
3005 3010 3015
Phe Asn Ser Ser Trp Val Leu Asn Asn Pro Tyr Tyr Arg Ala Met
3020 3025 3030
Pro Gly Thr Phe Cys Gly Arg Asn Ala Phe Asp Leu Ile His Gln
3035 3040 3045
Val Leu Gly Gly Leu Val Arg Pro Ile Asp Phe Phe Ala Leu Thr
3050 3055 3060
Ala Ser Ser Val Ala Gly Ala Ile Leu Ala Ile Ile Val Val Leu
3065 3070 3075
Ala Phe Tyr Tyr Leu Ile Lys Leu Lys Arg Ala Phe Gly Asp Tyr
3080 3085 3090
Thr Ser Val Val Val Ile Asn Val Ile Val Trp Cys Ile Asn Phe
3095 3100 3105
Leu Met Leu Phe Val Phe Gln Val Tyr Pro Thr Leu Ser Cys Leu
3110 3115 3120
Tyr Ala Cys Phe Tyr Phe Tyr Thr Thr Leu Tyr Phe Pro Ser Glu
3125 3130 3135
Ile Ser Val Val Met His Leu Gln Trp Leu Val Met Tyr Gly Ala
3140 3145 3150
Ile Met Pro Leu Trp Phe Cys Ile Ile Tyr Val Ala Val Val Val
3155 3160 3165
Ser Asn His Ala Leu Trp Leu Phe Ser Tyr Cys Arg Lys Leu Gly
3170 3175 3180
Thr Glu Val Arg Ser Asp Gly Thr Phe Glu Glu Met Ser Leu Thr
3185 3190 3195
Thr Phe Met Ile Thr Lys Glu Ser Tyr Cys Lys Leu Lys Asn Ser
3200 3205 3210
Val Ser Asp Val Ala Phe Asn Arg Tyr Leu Ser Leu Tyr Asn Lys
3215 3220 3225
Tyr Arg Tyr Phe Ser Gly Lys Met Asp Thr Ala Ala Tyr Arg Glu
3230 3235 3240
Ala Ala Cys Ser Gln Leu Ala Lys Ala Met Glu Thr Phe Asn His
3245 3250 3255
Asn Asn Gly Asn Asp Val Leu Tyr Gln Pro Pro Thr Ala Ser Val
3260 3265 3270
Thr Thr Ser Phe Leu Gln Ser Gly Ile Val Lys Met Val Phe Pro
3275 3280 3285
Thr Ser Lys Val Glu Pro Cys Val Val Ser Val Thr Tyr Gly Asn
3290 3295 3300
Met Thr Leu Asn Gly Leu Trp Leu Asp Asp Lys Val Tyr Cys Pro
3305 3310 3315
Arg His Val Ile Cys Ser Ser Ala Asp Met Thr Asp Pro Asp Tyr
3320 3325 3330
Ser Asn Leu Leu Cys Arg Val Ile Ser Ser Asp Phe Cys Val Met
3335 3340 3345
Ser Gly Arg Met Ser Leu Thr Val Met Ser Tyr Gln Met Gln Gly
3350 3355 3360
Ser Leu Leu Val Leu Thr Val Thr Leu Gln Asn Pro Asn Thr Pro
3365 3370 3375
Lys Tyr Ser Phe Gly Val Val Lys Pro Gly Glu Thr Phe Thr Val
3380 3385 3390
Leu Ala Ala Tyr Asn Gly Lys Ser Gln Gly Ala Phe His Val Thr
3395 3400 3405
Met Arg Ser Ser Tyr Thr Ile Lys Gly Ser Phe Leu Cys Gly Ser
3410 3415 3420
Cys Gly Ser Val Gly Tyr Val Leu Thr Gly Asp Ser Val Arg Phe
3425 3430 3435
Val Tyr Met His Gln Leu Glu Leu Ser Thr Gly Cys His Thr Gly
3440 3445 3450
Thr Asp Phe Ser Gly Asn Phe Tyr Gly Pro Tyr Arg Asp Ala Gln
3455 3460 3465
Val Val Gln Leu Pro Val Gln Asp Tyr Thr Gln Thr Val Asn Val
3470 3475 3480
Val Ala Trp Leu Tyr Ala Ala Ile Leu Asn Arg Cys Asn Trp Phe
3485 3490 3495
Val Gln Ser Asp Ser Cys Ser Leu Glu Glu Phe Asn Val Trp Ala
3500 3505 3510
Met Thr Asn Gly Phe Ser Ser Ile Lys Ala Asp Leu Val Leu Asp
3515 3520 3525
Ala Leu Ala Ser Met Thr Gly Val Thr Val Glu Gln Ile Leu Ala
3530 3535 3540
Ala Ile Lys Arg Leu Tyr Ser Gly Phe Gln Gly Lys Gln Ile Leu
3545 3550 3555
Gly Ser Cys Val Leu Glu Asp Glu Leu Thr Pro Ser Asp Val Tyr
3560 3565 3570
Gln Gln Leu Ala Gly Val Lys Leu Gln Ser Lys Arg Thr Arg Val
3575 3580 3585
Val Lys Gly Thr Cys Cys Trp Ile Leu Ala Ser Thr Leu Leu Phe
3590 3595 3600
Cys Ser Ile Ile Ser Ala Phe Val Lys Trp Thr Met Phe Met Tyr
3605 3610 3615
Val Thr Thr His Met Leu Gly Val Thr Leu Cys Ala Leu Cys Phe
3620 3625 3630
Val Ser Phe Ala Met Leu Leu Val Lys His Lys His Leu Tyr Leu
3635 3640 3645
Thr Met Phe Ile Met Pro Val Leu Cys Thr Leu Phe Tyr Thr Asn
3650 3655 3660
Tyr Leu Val Val Tyr Lys Gln Ser Phe Arg Gly Leu Ala Tyr Ala
3665 3670 3675
Trp Leu Ser His Phe Val Pro Ala Val Asp Tyr Thr Tyr Met Asp
3680 3685 3690
Glu Val Leu Tyr Gly Val Val Leu Leu Val Ala Met Val Phe Val
3695 3700 3705
Thr Met Arg Ser Ile Asn His Asp Val Phe Ser Val Met Phe Leu
3710 3715 3720
Val Gly Arg Leu Val Ser Leu Val Ser Met Trp Tyr Phe Gly Ala
3725 3730 3735
Asn Leu Glu Glu Glu Val Leu Leu Phe Leu Thr Ser Leu Phe Gly
3740 3745 3750
Thr Tyr Thr Trp Thr Thr Met Leu Ser Leu Ala Thr Ala Lys Val
3755 3760 3765
Ile Ala Lys Trp Leu Ala Val Asn Val Leu Tyr Phe Thr Asp Val
3770 3775 3780
Pro Gln Val Lys Leu Val Leu Leu Ser Tyr Leu Cys Ile Gly Tyr
3785 3790 3795
Val Cys Cys Cys Tyr Trp Gly Val Leu Ser Leu Leu Asn Ser Ile
3800 3805 3810
Phe Arg Met Pro Leu Gly Val Tyr Asn Tyr Lys Ile Ser Val Gln
3815 3820 3825
Glu Leu Arg Tyr Met Asn Ala Asn Gly Leu Arg Pro Pro Arg Asn
3830 3835 3840
Ser Phe Glu Ala Leu Val Leu Asn Phe Lys Leu Leu Gly Ile Gly
3845 3850 3855
Gly Val Pro Val Ile Glu Val Ser Gln Ile Gln Ser Arg Leu Thr
3860 3865 3870
Asp Val Lys Cys Val Asn Val Val Leu Leu Asn Cys Leu Gln His
3875 3880 3885
Leu His Ile Ala Ser Ser Ser Lys Leu Trp Gln Tyr Cys Ser Thr
3890 3895 3900
Leu His Asn Glu Ile Leu Ala Thr Ser Asp Leu Ser Val Ala Phe
3905 3910 3915
Asp Lys Leu Ala Gln Leu Leu Val Val Leu Phe Ala Asn Pro Ala
3920 3925 3930
Ala Val Asp Ser Lys Cys Leu Ala Ser Ile Glu Glu Val Ser Asp
3935 3940 3945
Asp Tyr Val Arg Asp Ser Thr Val Leu Gln Ala Leu Gln Ser Glu
3950 3955 3960
Phe Val Asn Met Ala Ser Phe Val Glu Tyr Glu Leu Ala Lys Lys
3965 3970 3975
Asn Leu Asp Glu Ala Lys Ala Ser Gly Ser Ala Asn Gln Gln Gln
3980 3985 3990
Ile Lys Gln Leu Glu Lys Ala Cys Asn Ile Ala Lys Ser Ala Tyr
3995 4000 4005
Glu Arg Asp Arg Ala Val Ala Arg Lys Leu Glu Arg Met Ala Asp
4010 4015 4020
Leu Ala Leu Thr Asn Met Tyr Lys Glu Ala Arg Ile Asn Asp Lys
4025 4030 4035
Lys Ser Lys Val Val Ser Ala Leu Gln Thr Met Leu Phe Ser Met
4040 4045 4050
Ile Arg Lys Leu Asp Asn Gln Ala Leu Asn Ser Ile Leu Asp Asn
4055 4060 4065
Ala Val Lys Gly Cys Val Pro Leu Asn Ala Ile Pro Ser Leu Thr
4070 4075 4080
Ser Asn Thr Leu Thr Ile Ile Val Pro Asp Lys Gln Val Phe Asp
4085 4090 4095
Gln Val Val Asp Asn Val Tyr Val Thr Tyr Ala Gly Asn Val Trp
4100 4105 4110
His Ile Gln Ser Ile Gln Asp Ala Asp Gly Ala Val Lys Gln Leu
4115 4120 4125
Asn Glu Ile Asp Val Asn Ile Thr Trp Pro Leu Val Ile Ala Ala
4130 4135 4140
Asn Arg His Asn Glu Val Ser Ser Val Val Leu Gln Asn Asn Glu
4145 4150 4155
Leu Met Pro Gln Lys Leu Arg Thr Gln Val Val Asn Ser Gly Ser
4160 4165 4170
Asp Met Asn Cys Asn Thr Pro Thr Gln Cys Tyr Tyr Asn Thr Thr
4175 4180 4185
Gly Met Gly Lys Ile Val Tyr Ala Ile Leu Ser Asp Cys Asp Gly
4190 4195 4200
Leu Lys Tyr Thr Lys Ile Val Lys Glu Asp Gly Asn Cys Val Val
4205 4210 4215
Leu Glu Leu Asp Pro Pro Cys Lys Phe Ser Val Gln Asp Val Lys
4220 4225 4230
Gly Leu Lys Ile Lys Tyr Leu Tyr Phe Val Lys Gly Cys Asn Thr
4235 4240 4245
Leu Ala Arg Gly Trp Val Val Gly Thr Leu Ser Ser Thr Val Arg
4250 4255 4260
Leu Gln Ala Gly Thr Ala Thr Glu Tyr Ala Ser Asn Ser Ala Ile
4265 4270 4275
Arg Ser Leu Cys Ala Phe Ser Val Asp Pro Lys Lys Thr Tyr Leu
4280 4285 4290
Asp Tyr Ile Gln Gln Gly Gly Ala Pro Val Thr Asn Cys Val Lys
4295 4300 4305
Met Leu Cys Asp His Ala Gly Thr Gly Met Ala Ile Thr Ile Lys
4310 4315 4320
Pro Glu Ala Thr Thr Asn Gln Asp Ser Tyr Gly Gly Ala Ser Val
4325 4330 4335
Cys Ile Tyr Cys Arg Ser Arg Val Glu His Pro Asp Val Asp Gly
4340 4345 4350
Leu Cys Lys Leu Arg Gly Lys Phe Val Gln Val Pro Leu Gly Ile
4355 4360 4365
Lys Asp Pro Val Ser Tyr Val Leu Thr His Asp Val Cys Gln Val
4370 4375 4380
Cys Gly Phe Trp Arg Asp Gly Met Phe Leu Cys Arg His Arg Leu
4385 4390 4395
Pro Val Ser Val Lys Arg His Glu Leu Phe Lys Arg Val Arg Gly
4400 4405 4410
Thr Ser Val Asn Ala Arg Leu Val Pro Cys Ala Ser Gly Leu Asp
4415 4420 4425
Thr Asp Val Gln Leu Arg Ala Phe Asp Ile Cys Asn Ala Asn Arg
4430 4435 4440
Ala Gly Ile Gly Leu Tyr Tyr Lys Val Asn Cys Cys Arg Phe Gln
4445 4450 4455
Arg Ala Asp Glu Asp Gly Asn Thr Leu Asp Lys Phe Phe Val Ile
4460 4465 4470
Lys Arg Thr Asn Leu Glu Val Tyr Asn Lys Glu Lys Glu Cys Tyr
4475 4480 4485
Glu Leu Thr Lys Glu Cys Gly Val Val Ala Glu His Glu Phe Phe
4490 4495 4500
Thr Phe Asp Val Glu Gly Ser Arg Val Pro His Ile Val Arg Lys
4505 4510 4515
Asp Leu Ser Lys Tyr Thr Met Leu Asp Leu Cys Tyr Ala Leu Arg
4520 4525 4530
His Phe Asp Arg Asn Asp Cys Ser Thr Leu Lys Glu Ile Leu Leu
4535 4540 4545
Thr Tyr Ala Glu Cys Asp Glu Ser Tyr Phe Gln Lys Lys Asp Trp
4550 4555 4560
Tyr Asp Phe Val Glu Asn Ser Asp Ile Ile Asn Val Tyr Lys Lys
4565 4570 4575
Leu Gly Pro Ile Phe Asn Arg Ala Leu Leu Asn Thr Ala Lys Phe
4580 4585 4590
Ala Asp Thr Leu Val Glu Ala Gly Leu Val Gly Val Leu Thr Leu
4595 4600 4605
Asp Asn Gln Asp Leu Tyr Gly Gln Trp Tyr Asp Phe Gly Asp Phe
4610 4615 4620
Val Lys Thr Val Pro Gly Cys Gly Val Ala Val Ala Asp Ser Tyr
4625 4630 4635
Tyr Ser Tyr Met Met Pro Met Leu Thr Met Cys His Ala Leu Asp
4640 4645 4650
Ser Glu Leu Phe Ile Asn Gly Thr Tyr Arg Glu Phe Asp Leu Val
4655 4660 4665
Gln Tyr Asp Phe Thr Asp Phe Lys Leu Glu Leu Phe Asn Lys Tyr
4670 4675 4680
Phe Lys Tyr Trp Ser Met Thr Tyr His Pro Asn Thr Cys Glu Cys
4685 4690 4695
Glu Asp Asp Arg Cys Ile Ile His Cys Ala Asn Phe Asn Ile Leu
4700 4705 4710
Phe Ser Met Val Leu Pro Lys Thr Cys Phe Gly Pro Leu Val Arg
4715 4720 4725
Gln Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Ile Gly Tyr
4730 4735 4740
His Tyr Lys Glu Leu Gly Val Val Met Asn Met Asp Val Asp Thr
4745 4750 4755
His Arg Tyr Arg Leu Ser Leu Lys Asp Leu Leu Leu Tyr Ala Ala
4760 4765 4770
Asp Pro Ala Leu His Val Ala Ser Ala Ser Ala Leu Leu Asp Leu
4775 4780 4785
Arg Thr Cys Cys Phe Ser Val Ala Ala Ile Thr Ser Gly Val Lys
4790 4795 4800
Phe Gln Thr Val Lys Pro Gly Asn Phe Asn Gln Asp Phe Tyr Glu
4805 4810 4815
Phe Ile Leu Ser Lys Gly Leu Leu Lys Glu Gly Ser Ser Val Asp
4820 4825 4830
Leu Lys His Phe Phe Phe Thr Gln Asp Gly Asn Ala Ala Ile Thr
4835 4840 4845
Asp Tyr Asn Tyr Tyr Lys Tyr Asn Leu Pro Thr Met Val Asp Ile
4850 4855 4860
Lys Gln Leu Leu Phe Val Leu Glu Val Val Asn Lys Tyr Phe Glu
4865 4870 4875
Ile Tyr Asp Gly Gly Cys Ile Pro Ala Thr Gln Val Ile Val Asn
4880 4885 4890
Asn Tyr Asp Lys Ser Ala Gly Tyr Pro Phe Asn Lys Phe Gly Lys
4895 4900 4905
Ala Arg Leu Tyr Tyr Glu Ala Leu Ser Phe Glu Glu Gln Asp Glu
4910 4915 4920
Val Tyr Ala Tyr Thr Lys Arg Asn Val Leu Pro Thr Leu Thr Gln
4925 4930 4935
Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg Ala Arg Thr
4940 4945 4950
Val Ala Gly Val Ser Ile Leu Ser Thr Met Thr Gly Arg Met Phe
4955 4960 4965
His Gln Lys Cys Leu Lys Ser Ile Ala Ala Thr Arg Gly Val Pro
4970 4975 4980
Val Val Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asp Met
4985 4990 4995
Leu Arg Arg Leu Ile Lys Asp Val Asp Ser Pro Val Leu Met Gly
5000 5005 5010
Trp Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Ile Leu Arg
5015 5020 5025
Ile Ile Ser Ser Leu Val Leu Ala Arg Lys His Asp Ser Cys Cys
5030 5035 5040
Ser His Thr Asp Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln
5045 5050 5055
Val Leu Ser Glu Ile Val Met Cys Gly Gly Cys Tyr Tyr Val Lys
5060 5065 5070
Pro Gly Gly Thr Ser Ser Gly Asp Ala Thr Thr Ala Phe Ala Asn
5075 5080 5085
Ser Val Phe Asn Ile Cys Gln Ala Val Ser Ala Asn Val Cys Ser
5090 5095 5100
Leu Met Ala Cys Asn Gly His Lys Ile Glu Asp Leu Ser Ile Arg
5105 5110 5115
Glu Leu Gln Lys Arg Leu Tyr Ser Asn Val Tyr Arg Ala Asp His
5120 5125 5130
Val Asp Pro Ala Phe Val Asn Glu Tyr Tyr Glu Phe Leu Asn Lys
5135 5140 5145
His Phe Ser Met Met Ile Leu Ser Asp Asp Gly Val Val Cys Tyr
5150 5155 5160
Asn Ser Glu Phe Ala Ser Lys Gly Tyr Ile Ala Asn Ile Ser Ala
5165 5170 5175
Phe Gln Gln Val Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Glu
5180 5185 5190
Ala Lys Cys Trp Val Glu Thr Asp Ile Glu Lys Gly Pro His Glu
5195 5200 5205
Phe Cys Ser Gln His Thr Met Leu Val Lys Met Asp Gly Asp Glu
5210 5215 5220
Val Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly Ala Gly
5225 5230 5235
Cys Phe Val Asp Asp Leu Leu Lys Thr Asp Ser Val Leu Leu Ile
5240 5245 5250
Glu Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Val Tyr
5255 5260 5265
His Glu Asn Pro Glu Tyr Gln Asn Val Phe Arg Val Tyr Leu Glu
5270 5275 5280
Tyr Ile Lys Lys Leu Tyr Asn Asp Leu Gly Asn Gln Ile Leu Asp
5285 5290 5295
Ser Tyr Ser Val Ile Leu Ser Thr Cys Asp Gly Gln Lys Phe Thr
5300 5305 5310
Asp Glu Thr Phe Tyr Lys Asn Met Tyr Leu Arg Ser Ala Val Met
5315 5320 5325
Gln Ser Val Gly Ala Cys Val Val Cys Ser Ser Gln Thr Ser Leu
5330 5335 5340
Arg Cys Gly Ser Cys Ile Arg Lys Pro Leu Leu Cys Cys Lys Cys
5345 5350 5355
Ala Tyr Asp His Val Met Ser Thr Asp His Lys Tyr Val Leu Ser
5360 5365 5370
Val Ser Pro Tyr Val Cys Asn Ser Pro Gly Cys Asp Val Asn Asp
5375 5380 5385
Val Thr Lys Leu Tyr Leu Gly Gly Met Ser Tyr Tyr Cys Glu Asp
5390 5395 5400
His Lys Pro Gln Tyr Ser Phe Lys Leu Val Met Asn Gly Met Val
5405 5410 5415
Phe Gly Leu Tyr Lys Gln Ser Cys Thr Gly Ser Pro Tyr Ile Glu
5420 5425 5430
Asp Phe Asn Lys Ile Ala Ser Cys Lys Trp Thr Glu Val Asp Asp
5435 5440 5445
Tyr Val Leu Ala Asn Glu Cys Thr Glu Arg Leu Lys Leu Phe Ala
5450 5455 5460
Ala Glu Thr Gln Lys Ala Thr Glu Glu Ser Phe Lys Gln Cys Tyr
5465 5470 5475
Ala Ser Ala Thr Ile Arg Glu Ile Val Ser Asp Arg Glu Leu Ile
5480 5485 5490
Leu Ser Trp Glu Ile Gly Lys Val Arg Pro Pro Leu Asn Lys Asn
5495 5500 5505
Tyr Val Phe Thr Gly Tyr His Phe Thr Ser Asn Gly Lys Thr Val
5510 5515 5520
Leu Gly Glu Tyr Val Phe Asp Lys Ser Glu Leu Thr Asn Gly Val
5525 5530 5535
Tyr Tyr Arg Ala Thr Thr Thr Tyr Lys Leu Ser Val Gly Asp Val
5540 5545 5550
Phe Ile Leu Thr Ser His Ala Val Ser Ser Leu Ser Ala Pro Thr
5555 5560 5565
Leu Val Pro Gln Glu Asn Tyr Thr Ser Ile Arg Phe Ala Ser Val
5570 5575 5580
Tyr Ser Val Pro Glu Thr Phe Gln Asn Asn Val Pro Asn Tyr Gln
5585 5590 5595
His Ile Gly Met Lys Arg Tyr Cys Thr Val Gln Gly Pro Pro Gly
5600 5605 5610
Thr Gly Lys Ser His Leu Ala Ile Gly Leu Ala Val Tyr Tyr Cys
5615 5620 5625
Thr Ala Arg Val Val Tyr Thr Ala Ala Ser His Ala Ala Val Asp
5630 5635 5640
Ala Leu Cys Glu Lys Ala Tyr Lys Phe Leu Asn Ile Asn Asp Cys
5645 5650 5655
Thr Arg Ile Val Pro Ala Lys Val Arg Val Asp Cys Tyr Asp Lys
5660 5665 5670
Phe Lys Val Asn Asp Thr Thr Arg Lys Tyr Val Phe Thr Thr Ile
5675 5680 5685
Asn Ala Leu Pro Glu Leu Val Thr Asp Ile Ile Val Val Asp Glu
5690 5695 5700
Val Ser Met Leu Thr Asn Tyr Glu Leu Ser Val Ile Asn Ser Arg
5705 5710 5715
Val Arg Ala Lys His Tyr Val Tyr Ile Gly Asp Pro Ala Gln Leu
5720 5725 5730
Pro Ala Pro Arg Val Leu Leu Asn Lys Gly Thr Leu Glu Pro Arg
5735 5740 5745
Tyr Phe Asn Ser Val Thr Lys Leu Met Cys Cys Leu Gly Pro Asp
5750 5755 5760
Ile Phe Leu Gly Thr Cys Tyr Arg Cys Pro Lys Glu Ile Val Asp
5765 5770 5775
Thr Val Ser Ala Leu Val Tyr His Asn Lys Leu Lys Ala Lys Asn
5780 5785 5790
Asp Asn Ser Ser Met Cys Phe Lys Val Tyr Tyr Lys Gly Gln Thr
5795 5800 5805
Thr His Glu Ser Ser Ser Ala Val Asn Met Gln Gln Ile Tyr Leu
5810 5815 5820
Ile Ser Lys Phe Leu Lys Ala Asn Pro Ser Trp Ser Asn Ala Val
5825 5830 5835
Phe Ile Ser Pro Tyr Asn Ser Gln Asn Tyr Val Ala Lys Arg Val
5840 5845 5850
Leu Gly Leu Gln Thr Gln Thr Val Asp Ser Ala Gln Gly Ser Glu
5855 5860 5865
Tyr Asp Phe Val Ile Tyr Ser Gln Thr Ala Glu Thr Ala His Ser
5870 5875 5880
Val Asn Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys Lys
5885 5890 5895
Gly Ile Leu Cys Val Met Ser Ser Met Gln Leu Phe Glu Ser Leu
5900 5905 5910
Asn Phe Ser Thr Leu Thr Leu Asp Lys Ile Asn Asn Pro Arg Leu
5915 5920 5925
Gln Cys Thr Thr Asn Leu Phe Lys Asp Cys Ser Arg Ser Tyr Ala
5930 5935 5940
Gly Tyr His Pro Ala His Ala Pro Ser Phe Leu Ala Val Asp Asp
5945 5950 5955
Lys Tyr Lys Val Gly Gly Asp Leu Ala Val Cys Leu Asn Val Ala
5960 5965 5970
Asp Ser Ala Val Thr Tyr Ser Arg Leu Ile Ser Leu Met Gly Phe
5975 5980 5985
Lys Leu Asp Leu Thr Leu Asp Gly Tyr Cys Lys Leu Phe Ile Thr
5990 5995 6000
Arg Asp Glu Ala Ile Arg Arg Val Arg Ala Trp Val Gly Phe Asp
6005 6010 6015
Ala Glu Gly Ala His Ala Thr Arg Asp Ser Ile Gly Thr Asn Phe
6020 6025 6030
Pro Leu Gln Leu Gly Phe Ser Thr Gly Ile Asp Phe Val Val Glu
6035 6040 6045
Ala Thr Gly Met Phe Ala Glu Arg Asp Gly Tyr Val Phe Lys Lys
6050 6055 6060
Ala Val Ala Arg Ala Pro Pro Gly Glu Gln Phe Lys His Leu Val
6065 6070 6075
Pro Leu Met Ser Arg Gly Gln Lys Trp Asp Val Val Arg Ile Arg
6080 6085 6090
Ile Val Gln Met Leu Ser Asp His Leu Val Asp Leu Ala Asp Ser
6095 6100 6105
Val Val Leu Val Thr Trp Ala Ala Ser Phe Glu Leu Thr Cys Leu
6110 6115 6120
Arg Tyr Phe Ala Lys Val Gly Lys Glu Val Val Cys Ser Val Cys
6125 6130 6135
Asn Lys Arg Ala Thr Cys Phe Asn Ser Arg Thr Gly Tyr Tyr Gly
6140 6145 6150
Cys Trp Arg His Ser Tyr Ser Cys Asp Tyr Leu Tyr Asn Pro Leu
6155 6160 6165
Ile Val Asp Ile Gln Gln Trp Gly Tyr Thr Gly Ser Leu Thr Ser
6170 6175 6180
Asn His Asp Leu Ile Cys Ser Val His Lys Gly Ala His Val Ala
6185 6190 6195
Ser Ser Asp Ala Ile Met Thr Arg Cys Leu Ala Val His Asp Cys
6200 6205 6210
Phe Cys Lys Ser Val Asn Trp Ser Leu Glu Tyr Pro Ile Ile Ser
6215 6220 6225
Asn Glu Val Ser Val Asn Thr Ser Cys Arg Leu Leu Gln Arg Val
6230 6235 6240
Met Phe Arg Ala Ala Met Leu Cys Asn Arg Tyr Asp Val Cys Tyr
6245 6250 6255
Asp Ile Gly Asn Pro Lys Gly Leu Ala Cys Val Lys Gly Tyr Asp
6260 6265 6270
Phe Lys Phe Tyr Asp Ala Ser Pro Val Val Lys Ser Val Lys Gln
6275 6280 6285
Phe Val Tyr Lys Tyr Glu Ala His Lys Asp Gln Phe Leu Asp Gly
6290 6295 6300
Leu Cys Met Phe Trp Asn Cys Asn Val Asp Lys Tyr Pro Ala Asn
6305 6310 6315
Ala Val Val Cys Arg Phe Asp Thr Arg Val Leu Asn Lys Leu Asn
6320 6325 6330
Leu Pro Gly Cys Asn Gly Gly Ser Leu Tyr Val Asn Lys His Ala
6335 6340 6345
Phe His Thr Ser Pro Phe Thr Arg Ala Ala Phe Glu Asn Leu Lys
6350 6355 6360
Pro Met Pro Phe Phe Tyr Tyr Ser Asp Thr Pro Cys Val Tyr Met
6365 6370 6375
Glu Gly Met Glu Ser Lys Gln Val Asp Tyr Val Pro Leu Arg Ser
6380 6385 6390
Ala Thr Cys Ile Thr Arg Cys Asn Leu Gly Gly Ala Val Cys Leu
6395 6400 6405
Lys His Ala Glu Asp Tyr Arg Glu Tyr Leu Glu Ser Tyr Asn Thr
6410 6415 6420
Ala Thr Thr Ala Gly Phe Thr Phe Trp Val Tyr Lys Thr Phe Asp
6425 6430 6435
Phe Tyr Asn Leu Trp Asn Thr Phe Thr Arg Leu Gln Ser Leu Glu
6440 6445 6450
Asn Val Val Tyr Asn Leu Val Asn Ala Gly His Phe Asp Gly Arg
6455 6460 6465
Ala Gly Glu Leu Pro Cys Ala Val Ile Gly Glu Lys Val Ile Ala
6470 6475 6480
Lys Ile Gln Asn Glu Asp Val Val Val Phe Lys Asn Asn Thr Pro
6485 6490 6495
Phe Pro Thr Asn Val Ala Val Glu Leu Phe Ala Lys Arg Ser Ile
6500 6505 6510
Arg Pro His Pro Glu Leu Lys Leu Phe Arg Asn Leu Asn Ile Asp
6515 6520 6525
Val Cys Trp Ser His Val Leu Trp Asp Tyr Ala Lys Asp Ser Val
6530 6535 6540
Phe Cys Ser Ser Thr Tyr Lys Val Cys Lys Tyr Thr Asp Leu Gln
6545 6550 6555
Cys Ile Glu Ser Leu Asn Val Leu Phe Asp Gly Arg Asp Asn Gly
6560 6565 6570
Ala Leu Glu Ala Phe Lys Lys Cys Arg Asp Gly Val Tyr Ile Asn
6575 6580 6585
Thr Thr Lys Ile Lys Ser Leu Ser Met Ile Lys Gly Pro Gln Arg
6590 6595 6600
Ala Asp Leu Asn Gly Val Val Val Glu Lys Val Gly Asp Ser Asp
6605 6610 6615
Val Glu Phe Trp Phe Ala Met Arg Arg Asp Gly Asp Asp Val Ile
6620 6625 6630
Phe Ser Arg Thr Gly Ser Leu Glu Pro Ser His Tyr Arg Ser Pro
6635 6640 6645
Gln Gly Asn Pro Gly Gly Asn Arg Val Gly Asp Leu Ser Gly Asn
6650 6655 6660
Glu Ala Leu Ala Arg Gly Thr Ile Phe Thr Gln Ser Arg Phe Leu
6665 6670 6675
Ser Ser Phe Ala Pro Arg Ser Glu Met Glu Lys Asp Phe Met Asp
6680 6685 6690
Leu Asp Glu Asp Val Phe Ile Ala Lys Tyr Ser Leu Gln Asp Tyr
6695 6700 6705
Ala Phe Glu His Val Val Tyr Gly Ser Phe Asn Gln Lys Ile Ile
6710 6715 6720
Gly Gly Leu His Leu Leu Ile Gly Leu Ala Arg Arg Gln Gln Lys
6725 6730 6735
Ser Asn Leu Val Ile Gln Glu Phe Val Pro Tyr Asp Ser Ser Ile
6740 6745 6750
His Ser Tyr Phe Ile Thr Asp Glu Asn Ser Gly Ser Ser Lys Ser
6755 6760 6765
Val Cys Thr Val Ile Asp Leu Leu Leu Asp Asp Phe Val Asp Ile
6770 6775 6780
Val Lys Ser Leu Asn Leu Asn Cys Val Ser Lys Val Val Asn Val
6785 6790 6795
Asn Val Asp Phe Lys Asp Phe Gln Phe Met Leu Trp Cys Asn Glu
6800 6805 6810
Glu Lys Val Met Thr Phe Tyr Pro Arg Leu Gln Ala Ala Ala Asp
6815 6820 6825
Trp Lys Pro Gly Tyr Val Met Pro Val Leu Tyr Lys Tyr Leu Glu
6830 6835 6840
Ser Pro Leu Glu Arg Val Asn Leu Trp Asn Tyr Gly Lys Pro Ile
6845 6850 6855
Thr Leu Pro Thr Gly Cys Leu Met Asn Val Ala Lys Tyr Thr Gln
6860 6865 6870
Leu Cys Gln Tyr Leu Asn Thr Thr Thr Leu Ala Val Pro Ala Asn
6875 6880 6885
Met Arg Val Leu His Leu Gly Ala Gly Ser Asp Lys Asp Val Ala
6890 6895 6900
Pro Gly Ser Ala Val Leu Arg Gln Trp Leu Pro Ala Gly Ser Ile
6905 6910 6915
Leu Val Asp Asn Asp Ile Asn Pro Phe Val Ser Asp Ser Val Ala
6920 6925 6930
Ser Tyr Tyr Gly Asn Cys Ile Thr Leu Pro Ile Ala Cys Gln Trp
6935 6940 6945
Asp Leu Ile Ile Ser Asp Met Tyr Asp Pro Leu Thr Lys Asn Ile
6950 6955 6960
Gly Glu Tyr Asn Val Ser Lys Asp Gly Phe Phe Thr Tyr Leu Cys
6965 6970 6975
His Leu Ile Arg Asp Lys Leu Ala Leu Gly Gly Ser Val Ala Ile
6980 6985 6990
Lys Ile Thr Glu Phe Ser Trp Asn Ala Glu Leu Tyr Ser Leu Met
6995 7000 7005
Gly Lys Phe Ala Phe Trp Thr Ile Phe Cys Thr Asn Val Asn Ala
7010 7015 7020
Ser Ser Ser Glu Gly Phe Leu Ile Gly Ile Asn Trp Leu Asn Arg
7025 7030 7035
Thr Arg Thr Glu Ile Asp Gly Lys Thr Met His Ala Asn Tyr Leu
7040 7045 7050
Phe Trp Arg Asn Ser Thr Met Trp Asn Gly Gly Ala Tyr Ser Leu
7055 7060 7065
Phe Asp Met Ser Lys Phe Pro Leu Lys Val Ala Gly Thr Ala Val
7070 7075 7080
Val Ser Leu Lys Pro Asp Gln Ile Asn Asp Leu Val Leu Ser Leu
7085 7090 7095
Ile Glu Lys Gly Lys Leu Leu Val Arg Asp Thr Arg Lys Glu Val
7100 7105 7110
Phe Val Gly Asp Ser Leu Val Asn Val Lys
7115 7120
<210> 25
<211> 7095
<212> PRT
<213> human coronavirus OC43
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<400> 25
Met Ser Lys Ile Asn Lys Tyr Gly Leu Glu Leu His Trp Ala Pro Glu
1 5 10 15
Phe Pro Trp Met Phe Glu Asp Ala Glu Glu Lys Leu Asp Asn Pro Ser
20 25 30
Ser Ser Glu Val Asp Met Ile Cys Ser Thr Thr Ala Gln Lys Leu Glu
35 40 45
Thr Asp Gly Ile Cys Pro Glu Asn His Val Met Val Asp Cys Arg Arg
50 55 60
Leu Leu Lys Gln Glu Cys Cys Val Gln Ser Ser Leu Ile Arg Glu Ile
65 70 75 80
Val Met Asn Ala Ser Pro Tyr Asp Leu Glu Val Leu Leu Gln Asp Ala
85 90 95
Leu Gln Ser Arg Glu Ala Val Leu Val Thr Thr Pro Leu Gly Met Ser
100 105 110
Leu Glu Ala Cys Tyr Val Arg Gly Cys Asn Pro Lys Gly Trp Thr Met
115 120 125
Gly Leu Phe Arg Arg Arg Ser Val Cys Asn Thr Gly Arg Cys Thr Val
130 135 140
Asn Lys His Val Ala Tyr Gln Leu Tyr Met Ile Asp Pro Ala Gly Val
145 150 155 160
Cys Leu Gly Ala Gly Gln Phe Val Gly Trp Val Ile Pro Leu Ala Phe
165 170 175
Met Pro Val Gln Ser Arg Lys Phe Ile Val Pro Trp Val Met Tyr Leu
180 185 190
Arg Lys Arg Gly Glu Lys Gly Ala Tyr Asn Lys Asp His Gly Arg Gly
195 200 205
Gly Phe Gly His Val Tyr Asp Phe Lys Val Glu Asp Ala Tyr Asp Gln
210 215 220
Val His Asp Glu Pro Lys Gly Lys Phe Ser Lys Lys Ala Tyr Ala Leu
225 230 235 240
Ile Arg Gly Tyr Arg Gly Val Lys Pro Leu Leu Tyr Val Asp Gln Tyr
245 250 255
Gly Cys Asp Tyr Thr Gly Ser Leu Ala Asp Gly Leu Glu Ala Tyr Ala
260 265 270
Asp Lys Thr Leu Gln Glu Met Lys Ala Leu Phe Pro Thr Trp Ser Gln
275 280 285
Glu Leu Leu Phe Asp Val Ile Val Ala Trp His Val Val Arg Asp Pro
290 295 300
Arg Tyr Val Met Arg Leu Gln Ser Ala Ala Thr Ile Arg Ser Val Ala
305 310 315 320
Tyr Val Ala Asn Pro Thr Glu Asp Leu Cys Asp Gly Ser Val Val Ile
325 330 335
Lys Glu Pro Val His Val Tyr Ala Asp Asp Ser Ile Ile Leu Arg Gln
340 345 350
Tyr Asn Leu Val Asp Ile Met Ser His Phe Tyr Met Glu Ala Asp Thr
355 360 365
Val Val Asn Ala Phe Tyr Gly Val Ala Leu Lys Asp Cys Gly Phe Val
370 375 380
Met Gln Phe Gly Tyr Ile Asp Cys Glu Gln Asp Ser Cys Asp Phe Lys
385 390 395 400
Gly Trp Ile Pro Gly Asn Met Ile Asp Gly Phe Ala Cys Thr Thr Cys
405 410 415
Gly His Val Tyr Glu Val Gly Asp Leu Ile Ala Gln Ser Ser Gly Val
420 425 430
Leu Pro Val Asn Pro Val Leu His Thr Lys Ser Ala Ala Gly Tyr Gly
435 440 445
Gly Phe Gly Cys Lys Asp Ser Phe Thr Leu Tyr Gly Gln Thr Val Val
450 455 460
Tyr Phe Gly Gly Cys Val Tyr Trp Ser Pro Ala Arg Asn Ile Trp Ile
465 470 475 480
Pro Ile Leu Lys Ser Ser Val Lys Ser Tyr Asp Ser Leu Val Tyr Thr
485 490 495
Gly Val Leu Gly Cys Lys Ala Ile Val Lys Glu Thr Asn Leu Ile Cys
500 505 510
Lys Ala Leu Tyr Leu Asp Tyr Val Gln His Lys Cys Gly Asn Leu His
515 520 525
Gln Arg Glu Leu Leu Gly Val Ser Asp Val Trp His Lys Gln Leu Leu
530 535 540
Leu Asn Arg Gly Val Tyr Lys Pro Leu Leu Glu Asn Ile Asp Tyr Phe
545 550 555 560
Asn Met Arg Arg Ala Lys Phe Ser Leu Glu Thr Phe Thr Val Cys Ala
565 570 575
Asp Gly Phe Met Pro Phe Leu Leu Asp Asp Leu Val Pro Arg Ala Tyr
580 585 590
Tyr Leu Ala Val Ser Gly Gln Ala Phe Cys Asp Tyr Ala Asp Lys Leu
595 600 605
Cys His Ala Val Val Ser Lys Ser Lys Glu Leu Leu Asp Val Ser Leu
610 615 620
Asp Ser Leu Gly Ala Ala Ile His Tyr Leu Asn Ser Lys Ile Val Asp
625 630 635 640
Leu Ala Gln His Phe Ser Asp Phe Gly Thr Ser Phe Val Ser Lys Ile
645 650 655
Val His Phe Phe Lys Thr Phe Thr Thr Ser Thr Ala Leu Ala Phe Ala
660 665 670
Trp Val Leu Phe His Val Leu His Gly Ala Tyr Ile Val Val Glu Ser
675 680 685
Asp Ile Tyr Phe Val Lys Asn Ile Pro Arg Tyr Ala Ser Ala Val Ala
690 695 700
Gln Ala Phe Gln Ser Val Ala Lys Val Val Leu Asp Ser Leu Arg Val
705 710 715 720
Thr Phe Ile Asp Gly Leu Ser Cys Phe Lys Ile Gly Arg Arg Arg Ile
725 730 735
Cys Leu Ser Gly Arg Lys Ile Tyr Glu Val Glu Arg Gly Leu Leu His
740 745 750
Ser Ser Gln Leu Pro Leu Asp Val Tyr Asp Leu Thr Met Pro Ser Gln
755 760 765
Val Gln Lys Ala Lys Gln Lys Pro Ile Tyr Leu Lys Gly Ser Gly Ser
770 775 780
Asp Phe Ser Leu Ala Asp Ser Val Val Glu Val Val Thr Thr Ser Leu
785 790 795 800
Thr Pro Cys Gly Tyr Ser Glu Pro Pro Lys Val Ala Asp Lys Ile Cys
805 810 815
Ile Val Asp Asn Val Tyr Met Ala Lys Ala Gly Asp Lys Tyr Tyr Pro
820 825 830
Val Val Val Asp Asp His Val Gly Leu Leu Asp Gln Ala Trp Arg Val
835 840 845
Pro Cys Ala Gly Arg Arg Val Thr Phe Lys Glu Gln Pro Thr Val Lys
850 855 860
Glu Ile Ile Ser Met Pro Lys Ile Ile Lys Val Phe Tyr Glu Leu Asp
865 870 875 880
Asn Asp Phe Asn Thr Ile Leu Asn Thr Ala Cys Gly Val Phe Glu Val
885 890 895
Asp Asp Thr Val Asp Met Glu Glu Phe Tyr Ala Val Val Ile Asp Ala
900 905 910
Ile Glu Glu Lys Leu Ser Pro Cys Lys Glu Leu Glu Gly Val Gly Ala
915 920 925
Lys Val Ser Ala Phe Leu Gln Lys Leu Glu Asp Asn Pro Leu Phe Leu
930 935 940
Phe Asp Glu Ala Gly Glu Glu Val Leu Ala Pro Lys Leu Tyr Cys Ala
945 950 955 960
Phe Thr Ala Pro Glu Asp Asp Asp Phe Leu Glu Glu Ser Asp Val Glu
965 970 975
Glu Asp Asp Val Glu Gly Glu Glu Thr Asp Leu Thr Val Thr Ser Ala
980 985 990
Gly Gln Pro Cys Val Ala Ser Glu Gln Glu Glu Ser Ser Glu Val Leu
995 1000 1005
Glu Asp Thr Leu Asp Asp Gly Pro Ser Val Glu Thr Ser Asp Ser
1010 1015 1020
Gln Val Glu Glu Asp Val Glu Met Ser Asp Phe Val Asp Leu Glu
1025 1030 1035
Ser Val Ile Gln Asp Tyr Glu Asn Val Cys Phe Glu Phe Tyr Thr
1040 1045 1050
Thr Glu Pro Glu Phe Val Lys Val Leu Gly Leu Tyr Val Pro Lys
1055 1060 1065
Ala Thr Arg Asn Asn Cys Trp Leu Arg Ser Val Leu Ala Val Met
1070 1075 1080
Gln Lys Leu Pro Cys Gln Phe Lys Asp Lys Asn Leu Gln Asp Leu
1085 1090 1095
Trp Val Leu Tyr Lys Gln Gln Tyr Ser Gln Leu Phe Val Asp Thr
1100 1105 1110
Leu Val Asn Lys Ile Pro Ala Asn Ile Val Leu Pro Gln Gly Gly
1115 1120 1125
Tyr Val Ala Asp Phe Ala Tyr Trp Phe Leu Thr Leu Cys Asp Trp
1130 1135 1140
Gln Cys Val Ala Tyr Trp Lys Cys Ile Lys Cys Asp Leu Ala Leu
1145 1150 1155
Lys Leu Lys Gly Leu Asp Ala Met Phe Phe Tyr Gly Asp Val Val
1160 1165 1170
Ser His Ile Cys Lys Cys Gly Glu Ser Met Val Leu Ile Asp Val
1175 1180 1185
Asp Val Pro Phe Thr Ala His Phe Ala Leu Lys Asp Lys Leu Phe
1190 1195 1200
Cys Ala Phe Ile Thr Lys Arg Ile Val Tyr Lys Ala Ala Cys Val
1205 1210 1215
Val Asp Val Asn Asp Ser His Ser Met Ala Val Val Asp Gly Lys
1220 1225 1230
Gln Ile Asp Asp His Arg Ile Thr Ser Ile Thr Ser Asp Lys Phe
1235 1240 1245
Asp Phe Ile Ile Gly His Gly Met Ser Phe Ser Met Thr Thr Phe
1250 1255 1260
Glu Ile Ala Gln Leu Tyr Gly Ser Cys Ile Thr Pro Asn Val Cys
1265 1270 1275
Phe Val Lys Gly Asp Ile Ile Lys Val Ser Lys Leu Val Lys Ala
1280 1285 1290
Glu Val Val Val Asn Pro Ala Asn Gly His Met Ala His Gly Gly
1295 1300 1305
Gly Val Ala Lys Ala Ile Ala Val Ala Ala Gly Gln Gln Phe Val
1310 1315 1320
Lys Glu Thr Thr Asp Met Val Lys Ser Lys Gly Val Cys Ala Thr
1325 1330 1335
Gly Asp Cys Tyr Val Ser Thr Gly Gly Lys Leu Cys Lys Thr Val
1340 1345 1350
Leu Asn Val Val Gly Pro Asp Ala Arg Thr Gln Gly Lys Gln Ser
1355 1360 1365
Tyr Val Leu Leu Glu Arg Val Tyr Lys His Leu Asn Asn Tyr Asp
1370 1375 1380
Cys Val Val Thr Thr Leu Ile Ser Ala Gly Ile Phe Ser Val Pro
1385 1390 1395
Ser Asp Val Ser Leu Thr Tyr Leu Leu Gly Thr Ala Lys Lys Gln
1400 1405 1410
Val Val Leu Val Ser Asn Asn Gln Glu Asp Phe Asp Leu Ile Ser
1415 1420 1425
Lys Cys Gln Ile Thr Ala Val Glu Gly Thr Lys Lys Leu Ala Ala
1430 1435 1440
Arg Leu Ser Phe Asn Val Gly Arg Ser Ile Val Tyr Glu Thr Asp
1445 1450 1455
Ala Asn Lys Leu Ile Leu Ile Asn Asp Val Ala Phe Val Ser Thr
1460 1465 1470
Phe Asn Val Leu Gln Asp Val Leu Ser Leu Arg His Asp Ile Ala
1475 1480 1485
Leu Asp Asp Asp Ala Arg Thr Phe Val Gln Ser Asn Val Asp Val
1490 1495 1500
Val Pro Glu Gly Trp Arg Val Val Asn Lys Phe Tyr Gln Ile Asn
1505 1510 1515
Gly Val Arg Thr Val Lys Tyr Phe Glu Cys Thr Gly Gly Ile Asp
1520 1525 1530
Ile Cys Ser Gln Asp Lys Val Phe Gly Tyr Val Gln Gln Gly Ile
1535 1540 1545
Phe Asn Lys Ala Thr Val Ala Gln Ile Lys Ala Leu Phe Leu Asp
1550 1555 1560
Lys Val Asp Ile Leu Leu Thr Val Asp Gly Val Asn Phe Thr Asn
1565 1570 1575
Arg Phe Val Pro Val Gly Glu Ser Phe Gly Lys Ser Leu Gly Asn
1580 1585 1590
Val Phe Cys Asp Gly Val Asn Val Thr Lys His Lys Cys Asp Ile
1595 1600 1605
Asn Tyr Lys Gly Lys Val Phe Phe Gln Phe Asp Asn Leu Ser Ser
1610 1615 1620
Glu Asp Leu Lys Ala Val Arg Ser Ser Phe Asn Phe Asp Gln Lys
1625 1630 1635
Glu Leu Leu Ala Tyr Tyr Asn Met Leu Val Asn Cys Phe Lys Trp
1640 1645 1650
Gln Val Val Val Asn Gly Lys Tyr Phe Thr Phe Lys Gln Ala Asn
1655 1660 1665
Asn Asn Cys Phe Val Asn Val Ser Cys Leu Met Leu Gln Ser Leu
1670 1675 1680
His Leu Thr Phe Lys Ile Val Gln Trp Gln Glu Ala Trp Leu Glu
1685 1690 1695
Phe Arg Ser Gly Arg Pro Ala Arg Phe Val Ala Leu Val Leu Ala
1700 1705 1710
Lys Gly Gly Phe Lys Phe Gly Asp Pro Ala Asp Ser Arg Asp Phe
1715 1720 1725
Leu Arg Val Val Phe Ser Gln Val Asp Leu Thr Gly Ala Ile Cys
1730 1735 1740
Asp Phe Glu Ile Ala Cys Lys Cys Gly Val Lys Gln Glu Gln Arg
1745 1750 1755
Thr Gly Leu Asp Ala Val Met His Phe Gly Thr Leu Ser Arg Glu
1760 1765 1770
Asp Leu Glu Ile Gly Tyr Thr Val Asp Cys Ser Cys Gly Lys Lys
1775 1780 1785
Leu Ile His Cys Val Arg Phe Asp Val Pro Phe Leu Ile Cys Ser
1790 1795 1800
Asn Thr Pro Ala Ser Val Lys Leu Pro Lys Gly Val Gly Ser Ala
1805 1810 1815
Asn Ile Phe Ile Gly Asp Lys Val Gly His Tyr Val His Val Lys
1820 1825 1830
Cys Glu Gln Ser Tyr Gln Leu Tyr Asp Ala Ser Asn Val Lys Lys
1835 1840 1845
Val Thr Asp Val Thr Gly Lys Leu Ser Asp Cys Leu Tyr Leu Lys
1850 1855 1860
Asn Leu Lys Gln Thr Phe Lys Ser Val Leu Thr Thr Tyr Tyr Leu
1865 1870 1875
Asp Asp Val Lys Lys Ile Glu Tyr Lys Pro Asp Leu Ser Gln Tyr
1880 1885 1890
Tyr Cys Asp Gly Gly Lys Tyr Tyr Thr Gln Arg Ile Ile Lys Ala
1895 1900 1905
Gln Phe Lys Thr Phe Glu Lys Val Asp Gly Val Tyr Thr Asn Phe
1910 1915 1920
Lys Leu Ile Gly His Thr Val Cys Asp Ser Leu Asn Ala Lys Leu
1925 1930 1935
Gly Phe Asp Ser Ser Lys Glu Phe Val Glu Tyr Lys Ile Thr Glu
1940 1945 1950
Trp Pro Thr Ala Thr Gly Asp Val Val Leu Ala Thr Asp Asp Leu
1955 1960 1965
Tyr Val Lys Arg Tyr Glu Arg Gly Cys Ile Thr Phe Gly Lys Pro
1970 1975 1980
Val Ile Trp Leu Ser His Glu Lys Ala Ser Leu Asn Ser Leu Thr
1985 1990 1995
Tyr Phe Asn Arg Pro Ser Leu Val Asp Asp Asn Lys Phe Asp Val
2000 2005 2010
Leu Lys Val Asp Asp Val Asp Asp Gly Gly Asp Ser Ser Glu Ser
2015 2020 2025
Gly Ala Lys Glu Thr Lys Glu Ile Asn Ile Ile Lys Leu Ser Gly
2030 2035 2040
Val Lys Lys Pro Phe Lys Val Glu Asp Ser Val Ile Val Asn Asp
2045 2050 2055
Asp Thr Ser Glu Thr Lys Tyr Val Lys Ser Leu Ser Ile Val Asp
2060 2065 2070
Val Tyr Asp Met Trp Leu Thr Gly Cys Lys Tyr Val Val Arg Thr
2075 2080 2085
Ala Asn Ala Leu Ser Arg Ala Val Asn Val Pro Thr Ile Arg Lys
2090 2095 2100
Phe Ile Lys Phe Gly Met Thr Leu Val Ser Ile Pro Ile Asp Leu
2105 2110 2115
Leu Asn Leu Arg Glu Ile Lys Pro Ala Val Asn Val Val Lys Ala
2120 2125 2130
Val Arg Asn Lys Ile Ser Val Cys Phe Asn Phe Ile Lys Trp Leu
2135 2140 2145
Phe Val Leu Leu Phe Gly Trp Ile Lys Ile Ser Ala Asp Asn Lys
2150 2155 2160
Val Ile Tyr Thr Thr Glu Ile Ala Ser Lys Leu Thr Cys Lys Leu
2165 2170 2175
Val Ala Leu Ala Phe Lys Asn Ala Phe Leu Thr Phe Lys Trp Ser
2180 2185 2190
Met Val Ala Arg Gly Ala Cys Ile Ile Ala Thr Ile Phe Leu Leu
2195 2200 2205
Trp Phe Asn Phe Ile Tyr Ala Asn Val Ile Phe Ser Asp Phe Tyr
2210 2215 2220
Leu Pro Lys Ile Gly Phe Leu Pro Thr Phe Val Gly Lys Ile Ala
2225 2230 2235
Gln Trp Ile Lys Asn Thr Phe Ser Leu Val Thr Ile Cys Asp Leu
2240 2245 2250
Tyr Ser Met Gln Asp Val Gly Phe Lys Asn Gln Tyr Cys Asn Gly
2255 2260 2265
Ser Ile Ala Cys Gln Phe Cys Leu Ala Gly Phe Asp Met Leu Asp
2270 2275 2280
Asn Tyr Lys Ala Ile Asp Val Val Gln Tyr Glu Ala Asp Arg Arg
2285 2290 2295
Ala Phe Val Asp Tyr Thr Gly Val Leu Lys Ile Val Ile Glu Leu
2300 2305 2310
Ile Val Ser Tyr Ala Leu Tyr Thr Ala Trp Phe Tyr Pro Leu Phe
2315 2320 2325
Ala Leu Ile Ser Ile Gln Ile Leu Thr Thr Trp Leu Pro Glu Leu
2330 2335 2340
Phe Met Leu Ser Thr Leu His Trp Ser Phe Arg Leu Leu Val Ala
2345 2350 2355
Leu Ala Asn Met Leu Pro Ala His Val Phe Met Arg Phe Tyr Ile
2360 2365 2370
Ile Ile Ala Ser Phe Ile Lys Leu Phe Ser Leu Phe Arg His Val
2375 2380 2385
Ala Tyr Gly Cys Ser Lys Ser Gly Cys Leu Phe Cys Tyr Lys Arg
2390 2395 2400
Asn Arg Ser Leu Arg Val Lys Cys Ser Thr Ile Val Gly Gly Met
2405 2410 2415
Ile Arg Tyr Tyr Asp Val Met Ala Asn Gly Gly Thr Gly Phe Cys
2420 2425 2430
Ser Lys His Gln Trp Asn Cys Ile Asp Cys Asp Ser Tyr Lys Pro
2435 2440 2445
Gly Asn Thr Phe Ile Thr Val Glu Ala Ala Leu Asp Leu Ser Lys
2450 2455 2460
Glu Leu Lys Arg Pro Ile Gln Pro Thr Asp Val Ala Tyr His Thr
2465 2470 2475
Val Thr Asp Val Lys Gln Val Gly Cys Ser Met Arg Leu Phe Tyr
2480 2485 2490
Asp Arg Asp Gly Gln Arg Thr Tyr Asp Asp Val Asn Ala Ser Leu
2495 2500 2505
Phe Val Asp Tyr Ser Asn Leu Leu His Ser Lys Val Lys Ser Val
2510 2515 2520
Pro Asn Met His Val Val Val Val Glu Asn Asp Ala Asp Lys Ala
2525 2530 2535
Asn Phe Leu Asn Ala Ala Val Phe Tyr Ala Gln Ser Leu Phe Arg
2540 2545 2550
Pro Ile Leu Met Val Asp Lys Asn Leu Ile Thr Thr Ala Asn Thr
2555 2560 2565
Gly Thr Ser Val Thr Glu Thr Met Phe Asp Val Tyr Val Asp Thr
2570 2575 2580
Phe Leu Ser Met Phe Asp Val Asp Lys Lys Ser Leu Asn Ala Leu
2585 2590 2595
Ile Ala Thr Ala His Ser Ser Ile Lys Gln Gly Thr Gln Ile Tyr
2600 2605 2610
Lys Val Leu Asp Thr Phe Leu Ser Cys Ala Arg Lys Ser Cys Ser
2615 2620 2625
Ile Asp Ser Asp Val Asp Thr Lys Cys Leu Ala Asp Ser Val Met
2630 2635 2640
Ser Ala Val Ser Ala Gly Leu Glu Leu Thr Asp Glu Ser Cys Asn
2645 2650 2655
Asn Leu Val Pro Thr Tyr Leu Lys Ser Asp Asn Ile Val Ala Ala
2660 2665 2670
Asp Leu Gly Val Leu Ile Gln Asn Ser Ala Lys His Val Gln Gly
2675 2680 2685
Asn Val Ala Lys Ile Ala Gly Val Ser Cys Ile Trp Ser Val Asp
2690 2695 2700
Ala Phe Asn Gln Phe Ser Ser Asp Phe Gln His Lys Leu Lys Lys
2705 2710 2715
Ala Cys Cys Lys Thr Gly Leu Lys Leu Lys Leu Thr Tyr Asn Lys
2720 2725 2730
Gln Met Ala Asn Val Ser Val Leu Thr Thr Pro Phe Ser Leu Lys
2735 2740 2745
Gly Gly Ala Val Phe Ser Tyr Phe Val Tyr Val Cys Phe Val Leu
2750 2755 2760
Ser Leu Val Cys Phe Ile Gly Leu Trp Cys Leu Met Pro Thr Tyr
2765 2770 2775
Thr Val His Lys Ser Asp Phe Gln Leu Pro Val Tyr Ala Ser Tyr
2780 2785 2790
Lys Val Leu Asp Asn Gly Val Ile Arg Asp Val Ser Val Glu Asp
2795 2800 2805
Val Cys Phe Ala Asn Lys Phe Glu Gln Phe Asp Gln Trp Tyr Glu
2810 2815 2820
Ser Thr Phe Gly Leu Ser Tyr Tyr Ser Asn Ser Met Ala Cys Pro
2825 2830 2835
Ile Val Val Ala Val Ile Asp Gln Asp Phe Gly Ser Thr Val Phe
2840 2845 2850
Asn Val Pro Thr Lys Val Leu Arg Tyr Gly Tyr His Val Leu His
2855 2860 2865
Phe Ile Thr His Ala Leu Ser Ala Asp Gly Val Gln Cys Tyr Thr
2870 2875 2880
Pro His Ser Gln Ile Ser Tyr Ser Asn Phe Tyr Ala Ser Gly Cys
2885 2890 2895
Val Leu Ser Ser Ala Cys Thr Met Phe Thr Met Ala Asp Gly Ser
2900 2905 2910
Pro Gln Pro Tyr Cys Tyr Thr Glu Gly Leu Met Gln Asn Ala Ser
2915 2920 2925
Leu Tyr Ser Ser Leu Val Pro His Val Arg Tyr Asn Leu Ala Asn
2930 2935 2940
Ala Lys Gly Phe Ile Arg Phe Pro Glu Val Leu Arg Glu Gly Leu
2945 2950 2955
Val Arg Ile Val Arg Thr Arg Ser Met Ser Tyr Cys Arg Val Gly
2960 2965 2970
Leu Cys Glu Glu Ala Asp Glu Gly Ile Cys Phe Asn Phe Asn Gly
2975 2980 2985
Ser Trp Val Leu Asn Asn Asp Tyr Tyr Arg Ser Leu Pro Gly Thr
2990 2995 3000
Phe Cys Gly Arg Asp Val Phe Asp Leu Ile Tyr Gln Leu Phe Lys
3005 3010 3015
Gly Leu Ala Gln Pro Val Asp Phe Leu Ala Leu Thr Ala Ser Ser
3020 3025 3030
Ile Ala Gly Ala Ile Leu Ala Val Ile Val Val Leu Val Phe Tyr
3035 3040 3045
Tyr Leu Ile Lys Leu Lys Arg Ala Phe Gly Asp Tyr Thr Ser Val
3050 3055 3060
Val Phe Val Asn Val Ile Val Trp Cys Val Asn Phe Met Met Leu
3065 3070 3075
Phe Val Phe Gln Val Tyr Pro Ile Leu Ser Cys Val Tyr Ala Ile
3080 3085 3090
Cys Tyr Phe Tyr Ala Thr Leu Tyr Phe Pro Ser Glu Ile Ser Val
3095 3100 3105
Ile Met His Leu Gln Trp Leu Val Met Tyr Gly Thr Ile Met Pro
3110 3115 3120
Leu Trp Phe Cys Leu Leu Tyr Ile Ala Val Val Val Ser Asn His
3125 3130 3135
Ala Phe Trp Val Phe Ser Tyr Cys Arg Lys Leu Gly Thr Ser Val
3140 3145 3150
Arg Ser Asp Gly Thr Phe Glu Glu Met Ala Leu Thr Thr Phe Met
3155 3160 3165
Ile Thr Lys Asp Ser Tyr Cys Lys Leu Lys Asn Ser Leu Ser Asp
3170 3175 3180
Val Ala Phe Asn Arg Tyr Leu Ser Leu Tyr Asn Lys Tyr Arg Tyr
3185 3190 3195
Tyr Ser Gly Lys Met Asp Thr Ala Ala Tyr Arg Glu Ala Ala Cys
3200 3205 3210
Ser Gln Leu Ala Lys Ala Met Asp Thr Phe Thr Asn Asn Asn Gly
3215 3220 3225
Ser Asp Val Leu Tyr Gln Pro Pro Thr Ala Ser Val Ser Thr Ser
3230 3235 3240
Phe Leu Gln Ser Gly Ile Val Lys Met Val Asn Pro Thr Ser Lys
3245 3250 3255
Val Glu Pro Cys Val Val Ser Val Thr Tyr Gly Asn Met Thr Leu
3260 3265 3270
Asn Gly Leu Trp Leu Asp Asp Lys Val Tyr Cys Pro Arg His Val
3275 3280 3285
Ile Cys Ser Ala Ser Asp Met Thr Asn Pro Asp Tyr Thr Asn Leu
3290 3295 3300
Leu Cys Arg Val Thr Ser Ser Asp Phe Thr Val Leu Phe Asp Arg
3305 3310 3315
Leu Ser Leu Thr Val Met Ser Tyr Gln Met Arg Gly Cys Met Leu
3320 3325 3330
Val Leu Thr Val Thr Leu Gln Asn Ser Arg Thr Pro Lys Tyr Thr
3335 3340 3345
Phe Gly Val Val Lys Pro Gly Glu Thr Phe Thr Val Leu Ala Ala
3350 3355 3360
Tyr Asn Gly Lys Pro Gln Gly Ala Phe His Val Thr Met Arg Ser
3365 3370 3375
Ser Tyr Thr Ile Lys Gly Ser Phe Leu Cys Gly Ser Cys Gly Ser
3380 3385 3390
Val Gly Tyr Val Ile Met Gly Asp Cys Val Lys Phe Val Tyr Met
3395 3400 3405
His Gln Leu Glu Leu Ser Thr Gly Cys His Thr Gly Thr Asp Phe
3410 3415 3420
Asn Gly Asp Phe Tyr Gly Pro Tyr Lys Asp Ala Gln Val Val Gln
3425 3430 3435
Leu Leu Ile Gln Asp Tyr Ile Gln Ser Val Asn Phe Val Ala Trp
3440 3445 3450
Leu Tyr Ala Ala Ile Leu Asn Asn Cys Asn Trp Phe Val Gln Ser
3455 3460 3465
Asp Lys Cys Ser Val Glu Asp Phe Asn Val Trp Ala Leu Ser Asn
3470 3475 3480
Gly Phe Ser Gln Val Lys Ser Asp Leu Val Ile Asp Ala Leu Ala
3485 3490 3495
Ser Met Thr Gly Val Ser Leu Glu Thr Leu Leu Ala Ala Ile Lys
3500 3505 3510
Arg Leu Lys Asn Gly Phe Gln Gly Arg Gln Ile Met Gly Ser Cys
3515 3520 3525
Ser Phe Glu Asp Glu Leu Thr Pro Ser Asp Val Tyr Gln Gln Leu
3530 3535 3540
Ala Gly Ile Lys Leu Gln Ser Lys Arg Thr Arg Leu Phe Lys Gly
3545 3550 3555
Thr Val Cys Trp Ile Met Ala Ser Thr Phe Leu Phe Ser Cys Ile
3560 3565 3570
Ile Thr Ala Phe Val Lys Trp Thr Met Phe Met Tyr Val Thr Thr
3575 3580 3585
Asn Met Phe Ser Ile Thr Phe Cys Ala Leu Cys Val Ile Ser Leu
3590 3595 3600
Ala Met Leu Leu Val Lys His Lys His Leu Tyr Leu Thr Met Tyr
3605 3610 3615
Ile Thr Pro Val Leu Phe Thr Leu Leu Tyr Asn Asn Tyr Leu Val
3620 3625 3630
Val Tyr Lys His Thr Phe Arg Gly Tyr Val Tyr Ala Trp Leu Ser
3635 3640 3645
Tyr Tyr Val Pro Ser Val Glu Tyr Thr Tyr Thr Asp Glu Val Ile
3650 3655 3660
Tyr Gly Met Leu Leu Leu Val Gly Met Val Phe Val Thr Leu Arg
3665 3670 3675
Ser Ile Asn His Asp Leu Phe Ser Phe Ile Met Phe Val Gly Arg
3680 3685 3690
Leu Ile Ser Val Phe Ser Leu Trp Tyr Lys Gly Ser Asn Leu Glu
3695 3700 3705
Glu Glu Ile Leu Leu Met Leu Ala Ser Leu Phe Gly Thr Tyr Thr
3710 3715 3720
Trp Thr Thr Val Leu Ser Met Ala Val Ala Lys Val Ile Ala Lys
3725 3730 3735
Trp Val Ala Val Asn Val Leu Tyr Phe Thr Asp Ile Pro Gln Ile
3740 3745 3750
Lys Ile Val Leu Leu Cys Tyr Leu Phe Ile Gly Tyr Ile Ile Ser
3755 3760 3765
Cys Tyr Trp Gly Leu Phe Ser Leu Met Asn Ser Leu Phe Arg Met
3770 3775 3780
Pro Leu Gly Val Tyr Asn Tyr Lys Ile Ser Val Gln Glu Leu Arg
3785 3790 3795
Tyr Met Asn Ala Asn Gly Leu Arg Pro Pro Lys Asn Ser Phe Glu
3800 3805 3810
Ala Leu Met Leu Asn Phe Lys Leu Leu Gly Ile Gly Gly Val Pro
3815 3820 3825
Ile Ile Glu Val Ser Gln Phe Gln Ser Lys Leu Thr Asp Val Lys
3830 3835 3840
Cys Ala Asn Val Val Leu Leu Asn Cys Leu Gln His Leu His Val
3845 3850 3855
Ala Ser Asn Ser Lys Leu Trp His Tyr Cys Ser Thr Leu His Asn
3860 3865 3870
Glu Ile Leu Ala Thr Ser Asp Leu Ser Val Ala Phe Glu Lys Leu
3875 3880 3885
Ala Gln Leu Leu Ile Val Leu Phe Ala Asn Pro Ala Ala Val Asp
3890 3895 3900
Ser Lys Cys Leu Thr Ser Ile Glu Glu Val Cys Asp Asp Tyr Ala
3905 3910 3915
Lys Asp Asn Thr Val Leu Gln Ala Leu Gln Ser Glu Phe Val Asn
3920 3925 3930
Met Ala Ser Phe Val Glu Tyr Glu Val Ala Lys Lys Asn Leu Asp
3935 3940 3945
Glu Ala Arg Phe Ser Gly Ser Ala Asn Gln Gln Gln Leu Lys Gln
3950 3955 3960
Leu Glu Lys Ala Cys Asn Ile Ala Lys Ser Ala Tyr Glu Arg Asp
3965 3970 3975
Arg Ala Val Ala Lys Lys Leu Glu Arg Met Ala Asp Leu Ala Leu
3980 3985 3990
Thr Asn Met Tyr Lys Glu Ala Arg Ile Asn Asp Lys Lys Ser Lys
3995 4000 4005
Val Val Ser Ala Leu Gln Thr Met Leu Phe Ser Met Val Arg Lys
4010 4015 4020
Leu Asp Asn Gln Ala Leu Asn Ser Ile Leu Asp Asn Ala Val Lys
4025 4030 4035
Gly Cys Val Pro Leu Asn Ala Ile Pro Ser Leu Ala Ala Asn Thr
4040 4045 4050
Leu Asn Ile Ile Val Pro Asp Lys Ser Val Tyr Asp Gln Val Val
4055 4060 4065
Asp Asn Val Tyr Val Thr Tyr Ala Gly Asn Val Trp Gln Ile Gln
4070 4075 4080
Thr Ile Gln Asp Ser Asp Gly Thr Asn Lys Gln Leu Asn Glu Ile
4085 4090 4095
Ser Asp Asp Cys Asn Trp Pro Leu Val Ile Ile Ala Asn Arg Tyr
4100 4105 4110
Asn Glu Val Ser Ala Thr Val Leu Gln Asn Asn Glu Leu Met Pro
4115 4120 4125
Ala Lys Leu Lys Ile Gln Val Val Asn Ser Gly Pro Asp Gln Thr
4130 4135 4140
Cys Asn Thr Pro Thr Gln Cys Tyr Tyr Asn Asn Ser Asn Asn Gly
4145 4150 4155
Lys Ile Val Tyr Ala Ile Leu Ser Asp Val Asp Gly Leu Lys Tyr
4160 4165 4170
Thr Lys Ile Leu Lys Asp Asp Gly Asn Phe Val Val Leu Glu Leu
4175 4180 4185
Asp Pro Pro Cys Lys Phe Thr Val Gln Asp Ala Lys Gly Leu Lys
4190 4195 4200
Ile Lys Tyr Leu Tyr Phe Val Lys Gly Cys Asn Thr Leu Ala Arg
4205 4210 4215
Gly Trp Val Val Gly Thr Ile Ser Ser Thr Val Arg Leu Gln Ala
4220 4225 4230
Gly Thr Ala Thr Glu Tyr Ala Ser Asn Ser Ser Ile Leu Ser Leu
4235 4240 4245
Cys Ala Phe Ser Val Asp Pro Lys Lys Thr Tyr Leu Asp Phe Ile
4250 4255 4260
Gln Gln Gly Gly Thr Pro Ile Ala Asn Cys Val Lys Met Leu Cys
4265 4270 4275
Asp His Ala Gly Thr Gly Met Ala Ile Thr Val Lys Pro Asp Ala
4280 4285 4290
Thr Thr Ser Gln Asp Ser Tyr Gly Gly Ala Ser Val Cys Ile Tyr
4295 4300 4305
Cys Arg Ala Arg Val Glu His Pro Asp Val Asp Gly Leu Cys Lys
4310 4315 4320
Leu Arg Gly Lys Phe Val Gln Val Pro Val Gly Ile Lys Asp Pro
4325 4330 4335
Val Ser Tyr Val Leu Thr His Asp Val Cys Arg Val Cys Gly Phe
4340 4345 4350
Trp Arg Asp Gly Ser Cys Ser Cys Val Ser Thr Asp Thr Thr Val
4355 4360 4365
Gln Ser Lys Asp Thr Asn Phe Phe Lys Arg Val Arg Gly Thr Ser
4370 4375 4380
Val Asp Ala Arg Leu Val Pro Cys Ala Ser Gly Leu Ser Thr Asp
4385 4390 4395
Val Gln Leu Arg Ala Phe Asp Ile Tyr Asn Ala Ser Val Ala Gly
4400 4405 4410
Ile Gly Leu His Leu Lys Val Asn Cys Cys Arg Phe Gln Arg Val
4415 4420 4425
Asp Glu Asn Gly Asp Lys Leu Asp Gln Phe Phe Val Val Lys Arg
4430 4435 4440
Thr Asp Leu Thr Ile Tyr Asn Arg Glu Met Lys Cys Tyr Glu Arg
4445 4450 4455
Val Lys Asp Cys Lys Phe Val Ala Glu His Asp Phe Phe Thr Phe
4460 4465 4470
Asp Val Glu Gly Ser Arg Val Pro His Ile Val Arg Lys Asp Leu
4475 4480 4485
Thr Lys Tyr Thr Met Leu Asp Leu Cys Tyr Ala Leu Arg His Phe
4490 4495 4500
Asp Arg Asn Asp Cys Met Leu Leu Cys Asp Ile Leu Ser Ile Tyr
4505 4510 4515
Ala Gly Cys Glu Gln Ser Tyr Phe Thr Lys Lys Asp Trp Tyr Asp
4520 4525 4530
Phe Val Glu Asn Pro Asp Ile Ile Asn Val Tyr Lys Lys Leu Gly
4535 4540 4545
Pro Ile Phe Asn Arg Ala Leu Val Ser Ala Thr Glu Phe Ala Asp
4550 4555 4560
Lys Leu Val Glu Val Gly Leu Val Gly Val Leu Thr Leu Asp Asn
4565 4570 4575
Gln Asp Leu Asn Gly Lys Trp Tyr Asp Phe Gly Asp Tyr Val Ile
4580 4585 4590
Ala Ala Pro Gly Cys Gly Val Ala Ile Ala Asp Ser Tyr Tyr Ser
4595 4600 4605
Tyr Ile Met Pro Met Leu Thr Met Cys His Ala Leu Asp Cys Glu
4610 4615 4620
Leu Tyr Val Asn Asn Ala Tyr Arg Leu Phe Asp Leu Val Gln Tyr
4625 4630 4635
Asp Phe Thr Asp Tyr Lys Leu Glu Leu Phe Asn Lys Tyr Phe Lys
4640 4645 4650
His Trp Ser Met Pro Tyr His Pro Asn Thr Val Asp Cys Gln Asp
4655 4660 4665
Asp Arg Cys Ile Ile His Cys Ala Asn Phe Asn Ile Leu Phe Ser
4670 4675 4680
Met Val Leu Pro Asn Thr Cys Phe Gly Pro Leu Val Arg Gln Ile
4685 4690 4695
Phe Val Asp Gly Val Pro Phe Val Val Ser Ile Gly Tyr His Tyr
4700 4705 4710
Lys Glu Leu Gly Ile Val Met Asn Met Asp Val Asp Thr His Arg
4715 4720 4725
Tyr Arg Leu Ser Leu Lys Asp Leu Leu Leu Tyr Ala Ala Asp Pro
4730 4735 4740
Ala Leu His Val Ala Ser Ala Ser Ala Leu Tyr Asp Leu Arg Thr
4745 4750 4755
Cys Cys Phe Ser Val Ala Ala Ile Thr Ser Gly Val Lys Phe Gln
4760 4765 4770
Thr Val Lys Pro Gly Asn Phe Asn Gln Asp Phe Tyr Asp Phe Val
4775 4780 4785
Leu Ser Lys Gly Leu Leu Lys Glu Gly Ser Ser Val Asp Leu Lys
4790 4795 4800
His Phe Phe Phe Thr Gln Asp Gly Asn Ala Ala Ile Thr Asp Tyr
4805 4810 4815
Asn Tyr Tyr Lys Tyr Asn Leu Pro Thr Met Val Asp Ile Lys Gln
4820 4825 4830
Leu Leu Phe Val Leu Glu Val Val Tyr Lys Tyr Phe Glu Ile Tyr
4835 4840 4845
Asp Gly Gly Cys Ile Pro Ala Ser Gln Val Ile Val Asn Asn Tyr
4850 4855 4860
Asp Lys Ser Ala Gly Tyr Pro Phe Asn Lys Phe Gly Lys Ala Arg
4865 4870 4875
Leu Tyr Tyr Glu Ala Leu Ser Phe Glu Glu Gln Asp Glu Ile Tyr
4880 4885 4890
Ala Tyr Thr Lys Arg Asn Val Leu Pro Thr Leu Thr Gln Met Asn
4895 4900 4905
Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg Ala Arg Thr Val Ala
4910 4915 4920
Gly Val Ser Ile Leu Ser Thr Met Thr Gly Arg Met Phe His Gln
4925 4930 4935
Lys Cys Leu Lys Ser Ile Ala Ala Thr Arg Gly Val Pro Val Val
4940 4945 4950
Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asp Met Leu Arg
4955 4960 4965
Arg Leu Ile Lys Asp Val Asp Asn Pro Val Leu Met Gly Trp Asp
4970 4975 4980
Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Leu Leu Arg Ile Val
4985 4990 4995
Ser Ser Leu Val Leu Ala Arg Lys His Glu Thr Cys Cys Ser Gln
5000 5005 5010
Ser Asp Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu
5015 5020 5025
Ser Glu Ile Val Met Cys Gly Gly Cys Tyr Tyr Val Lys Pro Gly
5030 5035 5040
Gly Thr Ser Ser Gly Asp Ala Thr Thr Ala Phe Ala Asn Ser Val
5045 5050 5055
Phe Asn Ile Cys Gln Ala Val Ser Ala Asn Val Cys Ala Leu Met
5060 5065 5070
Ser Cys Asn Gly Asn Lys Ile Glu Asp Leu Ser Ile Arg Ala Leu
5075 5080 5085
Gln Lys Arg Leu Tyr Ser His Val Tyr Arg Ser Asp Lys Val Asp
5090 5095 5100
Ser Thr Phe Val Thr Glu Tyr Tyr Glu Phe Leu Asn Lys His Phe
5105 5110 5115
Ser Met Met Ile Leu Ser Asp Asp Gly Val Val Cys Tyr Asn Ser
5120 5125 5130
Asp Tyr Ala Ser Lys Gly Tyr Ile Ala Asn Ile Ser Ala Phe Gln
5135 5140 5145
Gln Val Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Glu Ser Lys
5150 5155 5160
Cys Trp Val Glu His Asp Ile Asn Asn Gly Pro His Glu Phe Cys
5165 5170 5175
Ser Gln His Thr Met Leu Val Lys Met Asp Gly Asp Asp Val Tyr
5180 5185 5190
Leu Pro Tyr Pro Asn Pro Ser Arg Ile Leu Gly Ala Gly Cys Phe
5195 5200 5205
Val Asp Asp Leu Leu Lys Thr Asp Ser Val Leu Leu Ile Glu Arg
5210 5215 5220
Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Val Tyr His Glu
5225 5230 5235
Asn Glu Glu Tyr Gln Lys Val Phe Arg Val Tyr Leu Ala Tyr Ile
5240 5245 5250
Lys Lys Leu Tyr Asn Asp Leu Gly Asn Gln Ile Leu Asp Ser Tyr
5255 5260 5265
Ser Val Ile Leu Ser Thr Cys Asp Gly Gln Lys Phe Thr Asp Glu
5270 5275 5280
Ser Phe Tyr Lys Asn Met Tyr Leu Arg Ser Ala Val Met Gln Ser
5285 5290 5295
Val Gly Ala Cys Val Val Cys Ser Ser Gln Thr Ser Leu Arg Cys
5300 5305 5310
Gly Ser Cys Ile Arg Lys Pro Leu Leu Cys Cys Lys Cys Cys Tyr
5315 5320 5325
Asp His Val Met Ala Thr Asp His Lys Tyr Val Leu Ser Val Ser
5330 5335 5340
Pro Tyr Val Cys Asn Ala Pro Gly Cys Asp Val Asn Asp Val Thr
5345 5350 5355
Lys Leu Tyr Leu Gly Gly Met Ser Tyr Tyr Cys Glu Asp His Lys
5360 5365 5370
Pro Gln Tyr Ser Phe Lys Leu Val Met Asn Gly Leu Val Phe Gly
5375 5380 5385
Leu Tyr Lys Gln Ser Cys Thr Gly Ser Pro Tyr Ile Asp Asp Phe
5390 5395 5400
Asn Arg Ile Ala Ser Cys Lys Trp Thr Asp Val Asp Asp Tyr Ile
5405 5410 5415
Leu Ala Asn Glu Cys Thr Glu Arg Leu Lys Leu Phe Ala Ala Glu
5420 5425 5430
Thr Gln Lys Ala Thr Glu Glu Ala Phe Lys Gln Ser Tyr Ala Ser
5435 5440 5445
Ala Thr Ile Gln Glu Ile Val Ser Glu Arg Glu Leu Ile Leu Ser
5450 5455 5460
Trp Glu Ile Gly Lys Val Lys Pro Pro Leu Asn Lys Asn Tyr Val
5465 5470 5475
Phe Thr Gly Tyr His Phe Thr Lys Asn Gly Lys Thr Val Leu Gly
5480 5485 5490
Glu Tyr Val Phe Asp Lys Ser Glu Leu Thr Asn Gly Val Tyr Tyr
5495 5500 5505
Arg Ala Thr Thr Thr Tyr Lys Leu Ser Val Gly Asp Val Phe Val
5510 5515 5520
Leu Thr Ser His Ser Val Ala Asn Leu Ser Ala Pro Thr Leu Val
5525 5530 5535
Pro Gln Glu Asn Tyr Ser Ser Ile Arg Phe Ala Ser Val Tyr Ser
5540 5545 5550
Val Leu Glu Thr Phe Gln Asn Asn Val Val Asn Tyr Gln His Ile
5555 5560 5565
Gly Met Lys Arg Tyr Cys Thr Val Gln Gly Pro Pro Gly Thr Gly
5570 5575 5580
Lys Ser His Leu Ala Ile Gly Leu Ala Val Phe Tyr Cys Thr Ala
5585 5590 5595
Arg Val Val Tyr Thr Ala Ala Ser His Ala Ala Val Asp Ala Leu
5600 5605 5610
Cys Glu Lys Ala Tyr Lys Phe Leu Asn Ile Asn Asp Cys Thr Arg
5615 5620 5625
Ile Val Pro Ala Lys Val Arg Val Glu Cys Tyr Asp Lys Phe Lys
5630 5635 5640
Ile Asn Asp Thr Thr Arg Lys Tyr Val Phe Thr Thr Ile Asn Ala
5645 5650 5655
Leu Pro Glu Met Val Thr Asp Ile Val Val Val Asp Glu Val Ser
5660 5665 5670
Met Leu Thr Asn Tyr Glu Leu Ser Val Ile Asn Ala Arg Ile Arg
5675 5680 5685
Ala Lys His Tyr Val Tyr Ile Gly Asp Pro Ala Gln Leu Pro Ala
5690 5695 5700
Pro Arg Val Leu Leu Ser Lys Gly Thr Leu Glu Pro Lys Tyr Phe
5705 5710 5715
Asn Thr Val Thr Lys Leu Met Cys Cys Leu Gly Pro Asp Ile Phe
5720 5725 5730
Leu Gly Thr Cys Tyr Arg Cys Pro Lys Glu Ile Val Asp Thr Val
5735 5740 5745
Ser Ala Leu Val Tyr Glu Asn Lys Leu Lys Ala Lys Asn Glu Ser
5750 5755 5760
Ser Ser Leu Cys Phe Lys Val Tyr Tyr Lys Gly Val Thr Thr His
5765 5770 5775
Glu Ser Ser Ser Ala Val Asn Met Gln Gln Ile Tyr Leu Ile Asn
5780 5785 5790
Lys Phe Leu Lys Ala Asn Pro Leu Trp His Lys Ala Val Phe Ile
5795 5800 5805
Ser Pro Tyr Asn Ser Gln Asn Phe Ala Ala Lys Arg Val Leu Gly
5810 5815 5820
Leu Gln Thr Gln Thr Val Asp Ser Ala Gln Gly Ser Glu Tyr Asp
5825 5830 5835
Tyr Val Ile Tyr Ser Gln Thr Ala Glu Thr Ala His Ser Val Asn
5840 5845 5850
Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys Lys Gly Ile
5855 5860 5865
Leu Cys Val Met Ser Asn Met Gln Leu Phe Glu Ala Leu Gln Phe
5870 5875 5880
Thr Thr Leu Thr Leu Asp Lys Val Pro Gln Ala Val Glu Thr Lys
5885 5890 5895
Val Gln Cys Ser Thr Asn Leu Phe Lys Asp Cys Ser Lys Ser Tyr
5900 5905 5910
Ser Gly Tyr His Pro Ala His Ala Pro Ser Phe Leu Ala Val Asp
5915 5920 5925
Asp Lys Tyr Lys Ala Thr Gly Asp Leu Ala Val Cys Leu Gly Ile
5930 5935 5940
Gly Asp Ser Ala Val Thr Tyr Ser Arg Leu Ile Ser Leu Met Gly
5945 5950 5955
Phe Lys Leu Asp Val Thr Leu Asp Gly Tyr Cys Lys Leu Phe Ile
5960 5965 5970
Thr Lys Glu Glu Ala Val Lys Arg Val Arg Ala Trp Val Gly Phe
5975 5980 5985
Asp Ala Glu Gly Ala His Ala Thr Arg Asp Ser Ile Gly Thr Asn
5990 5995 6000
Phe Pro Leu Gln Leu Gly Phe Ser Thr Gly Ile Asp Phe Val Val
6005 6010 6015
Glu Ala Thr Gly Leu Phe Ala Asp Arg Asp Gly Tyr Ser Phe Lys
6020 6025 6030
Lys Ala Val Ala Lys Ala Pro Pro Gly Glu Gln Phe Lys His Leu
6035 6040 6045
Ile Pro Leu Met Thr Arg Gly His Arg Trp Asp Val Val Arg Pro
6050 6055 6060
Arg Ile Val Gln Met Phe Ala Asp His Leu Ile Asp Leu Ser Asp
6065 6070 6075
Cys Val Val Leu Val Thr Trp Ala Ala Asn Phe Glu Leu Thr Cys
6080 6085 6090
Leu Arg Tyr Phe Ala Lys Val Gly Arg Glu Ile Ser Cys Asn Val
6095 6100 6105
Cys Thr Lys Arg Ala Thr Val Tyr Asn Ser Arg Thr Gly Tyr Tyr
6110 6115 6120
Gly Cys Trp Arg His Ser Val Thr Cys Asp Tyr Leu Tyr Asn Pro
6125 6130 6135
Leu Ile Val Asp Ile Gln Gln Trp Gly Tyr Ile Gly Ser Leu Ser
6140 6145 6150
Ser Asn His Asp Leu Tyr Cys Ser Val His Lys Gly Ala His Val
6155 6160 6165
Ala Ser Ser Asp Ala Ile Met Thr Arg Cys Leu Ala Val Tyr Asp
6170 6175 6180
Cys Phe Cys Asn Asn Ile Asn Trp Asn Val Glu Tyr Pro Ile Ile
6185 6190 6195
Ser Asn Glu Leu Ser Ile Asn Thr Ser Cys Arg Val Leu Gln Arg
6200 6205 6210
Val Ile Leu Lys Ala Ala Met Leu Cys Asn Arg Tyr Thr Leu Cys
6215 6220 6225
Tyr Asp Ile Gly Asn Pro Lys Ala Ile Ala Cys Val Lys Asp Phe
6230 6235 6240
Asp Phe Lys Phe Tyr Asp Ala Gln Pro Ile Val Lys Ser Val Lys
6245 6250 6255
Thr Leu Leu Tyr Ser Phe Glu Ala His Lys Asp Ser Phe Lys Asp
6260 6265 6270
Gly Leu Cys Met Phe Trp Asn Cys Asn Val Asp Lys Tyr Pro Pro
6275 6280 6285
Asn Ala Val Val Cys Arg Phe Asp Thr Arg Val Leu Asn Asn Leu
6290 6295 6300
Asn Leu Pro Gly Cys Asn Gly Gly Ser Leu Tyr Val Asn Lys His
6305 6310 6315
Ala Phe His Thr Lys Pro Phe Ala Arg Ala Ala Phe Glu His Leu
6320 6325 6330
Lys Pro Met Pro Phe Phe Tyr Tyr Ser Asp Thr Pro Cys Val Tyr
6335 6340 6345
Met Asp Gly Met Asp Ala Lys Gln Val Asp Tyr Val Pro Leu Lys
6350 6355 6360
Ser Ala Thr Cys Ile Thr Arg Cys Asn Leu Gly Gly Ala Val Cys
6365 6370 6375
Leu Lys His Ala Glu Glu Tyr Arg Glu Tyr Leu Glu Ser Tyr Asn
6380 6385 6390
Thr Ala Thr Thr Ala Gly Phe Thr Phe Trp Val Tyr Lys Thr Phe
6395 6400 6405
Asp Phe Tyr Asn Leu Trp Asn Thr Phe Thr Lys Leu Gln Ser Leu
6410 6415 6420
Glu Asn Val Val Tyr Asn Leu Val Lys Thr Gly His Tyr Thr Gly
6425 6430 6435
Gln Ala Gly Glu Met Pro Cys Ala Ile Ile Asn Asp Lys Val Val
6440 6445 6450
Ala Lys Ile Asp Lys Glu Asp Val Val Ile Phe Ile Asn Asn Thr
6455 6460 6465
Thr Tyr Pro Thr Asn Val Ala Val Glu Leu Phe Ala Lys Arg Ser
6470 6475 6480
Val Arg His His Pro Glu Leu Lys Leu Phe Arg Asn Leu Asn Ile
6485 6490 6495
Asp Val Cys Trp Lys His Val Ile Trp Asp Tyr Ala Arg Glu Ser
6500 6505 6510
Ile Phe Cys Ser Asn Thr Tyr Gly Val Cys Met Tyr Thr Asp Leu
6515 6520 6525
Lys Phe Ile Asp Lys Leu Asn Val Leu Phe Asp Gly Arg Asp Asn
6530 6535 6540
Gly Ala Leu Glu Ala Phe Lys Arg Ser Asn Asn Gly Val Tyr Ile
6545 6550 6555
Ser Thr Thr Lys Val Lys Ser Leu Ser Met Ile Arg Gly Pro Pro
6560 6565 6570
Arg Ala Glu Leu Asn Gly Val Val Val Asp Lys Val Gly Asp Thr
6575 6580 6585
Asp Cys Val Phe Tyr Phe Ala Val Arg Lys Glu Gly Gln Asp Val
6590 6595 6600
Ile Phe Ser Gln Phe Asp Ser Leu Gly Val Ser Ser Asn Gln Ser
6605 6610 6615
Pro Gln Gly Asn Leu Gly Ser Asn Gly Lys Pro Gly Asn Val Gly
6620 6625 6630
Gly Asn Asp Ala Leu Ser Ile Ser Thr Ile Phe Thr Gln Ser Arg
6635 6640 6645
Val Ile Ser Ser Phe Thr Cys Arg Thr Asp Met Glu Lys Asp Phe
6650 6655 6660
Ile Ala Leu Asp Gln Asp Val Phe Ile Gln Lys Tyr Gly Leu Glu
6665 6670 6675
Asp Tyr Ala Phe Glu His Ile Val Tyr Gly Asn Phe Asn Gln Lys
6680 6685 6690
Ile Ile Gly Gly Leu His Leu Leu Ile Gly Leu Tyr Arg Arg Gln
6695 6700 6705
Gln Thr Ser Asn Leu Val Val Gln Glu Phe Val Ser Tyr Asp Ser
6710 6715 6720
Ser Ile His Ser Tyr Phe Ile Thr Asp Glu Lys Ser Gly Gly Ser
6725 6730 6735
Lys Ser Val Cys Thr Val Ile Asp Ile Leu Leu Asp Asp Phe Val
6740 6745 6750
Ala Leu Val Lys Ser Leu Asn Leu Asn Cys Val Ser Lys Val Val
6755 6760 6765
Asn Val Asn Val Asp Phe Lys Asp Phe Gln Phe Met Leu Trp Cys
6770 6775 6780
Asn Asp Glu Lys Val Met Thr Phe Tyr Pro Arg Leu Gln Ala Ala
6785 6790 6795
Ser Asp Trp Lys Pro Gly Tyr Ser Met Pro Val Leu Tyr Lys Tyr
6800 6805 6810
Leu Asn Ser Pro Met Glu Arg Val Ser Leu Trp Asn Tyr Gly Lys
6815 6820 6825
Pro Val Thr Leu Pro Thr Gly Cys Met Met Asn Val Ala Lys Tyr
6830 6835 6840
Thr Gln Leu Cys Gln Tyr Leu Asn Thr Thr Thr Leu Ala Val Pro
6845 6850 6855
Val Asn Met Arg Val Leu His Leu Gly Ala Gly Ser Glu Lys Gly
6860 6865 6870
Val Ala Pro Gly Ser Ala Val Leu Arg Gln Trp Leu Pro Ala Gly
6875 6880 6885
Thr Ile Leu Val Asp Asn Asp Leu Tyr Pro Phe Val Ser Asp Ser
6890 6895 6900
Val Ala Thr Tyr Phe Gly Asp Cys Ile Thr Leu Pro Phe Asp Cys
6905 6910 6915
Gln Trp Asp Leu Ile Ile Ser Asp Met Tyr Asp Pro Ile Thr Lys
6920 6925 6930
Asn Ile Gly Glu Tyr Asn Val Ser Lys Asp Gly Phe Phe Thr Tyr
6935 6940 6945
Ile Cys His Met Ile Arg Asp Lys Leu Ala Leu Gly Gly Ser Val
6950 6955 6960
Ala Ile Lys Ile Thr Glu Phe Ser Trp Asn Ala Glu Leu Tyr Lys
6965 6970 6975
Leu Met Gly Tyr Phe Ala Phe Trp Thr Val Phe Cys Thr Asn Ala
6980 6985 6990
Asn Ala Ser Ser Ser Glu Gly Phe Leu Ile Gly Ile Asn Tyr Leu
6995 7000 7005
Cys Lys Pro Lys Val Glu Ile Asp Gly Asn Val Met His Ala Asn
7010 7015 7020
Tyr Leu Phe Trp Arg Asn Ser Thr Val Trp Asn Gly Gly Ala Tyr
7025 7030 7035
Ser Leu Phe Asp Met Ala Lys Phe Pro Leu Lys Leu Ala Gly Thr
7040 7045 7050
Ala Val Ile Asn Leu Arg Ala Asp Gln Ile Asn Asp Met Val Tyr
7055 7060 7065
Ser Leu Leu Glu Lys Gly Lys Leu Leu Ile Arg Asp Thr Asn Lys
7070 7075 7080
Glu Val Phe Val Gly Asp Ser Leu Val Asn Val Ile
7085 7090 7095
<210> 26
<211> 6781
<212> PRT
<213> porcine epidemic diarhhea virus
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<400> 26
Met Ala Ser Asn His Val Thr Leu Ala Phe Ala Asn Asp Ala Glu Ile
1 5 10 15
Ser Ala Phe Gly Phe Cys Thr Ala Ser Glu Ala Val Ser Tyr Tyr Ser
20 25 30
Glu Ala Ala Ala Ser Gly Phe Met Gln Cys Arg Phe Val Ser Leu Asp
35 40 45
Leu Ala Asp Thr Val Glu Gly Leu Leu Pro Glu Asp Tyr Val Met Val
50 55 60
Val Ile Gly Thr Thr Lys Leu Ser Ala Tyr Val Asp Thr Phe Gly Ser
65 70 75 80
Arg Pro Arg Asn Ile Cys Gly Trp Leu Leu Phe Ser Asn Cys Asn Tyr
85 90 95
Phe Leu Glu Glu Leu Glu Leu Thr Phe Gly Arg Arg Gly Gly Asn Ile
100 105 110
Val Pro Val Asp Gln Tyr Met Cys Gly Ala Asp Gly Lys Pro Val Leu
115 120 125
Gln Glu Ser Glu Trp Glu Tyr Thr Asp Phe Phe Ala Asp Ser Glu Asp
130 135 140
Gly Gln Leu Asn Ile Ala Gly Ile Thr Tyr Val Lys Ala Trp Ile Val
145 150 155 160
Glu Arg Ser Asp Val Ser Tyr Ala Ser Gln Asn Leu Thr Ser Ile Lys
165 170 175
Ser Ile Thr Tyr Cys Ser Thr Tyr Glu His Thr Phe Leu Asp Gly Thr
180 185 190
Ala Met Lys Val Ala Arg Thr Pro Lys Ile Lys Lys Asn Val Val Leu
195 200 205
Ser Glu Pro Leu Ala Thr Ile Tyr Arg Glu Ile Gly Ser Pro Phe Val
210 215 220
Asp Asn Gly Ser Asp Ala Arg Ser Ile Ile Arg Arg Pro Val Phe Leu
225 230 235 240
His Ala Phe Val Lys Cys Lys Cys Gly Ser Tyr His Trp Thr Val Gly
245 250 255
Asp Trp Thr Ser Tyr Val Ser Thr Cys Cys Gly Phe Lys Cys Lys Pro
260 265 270
Val Leu Val Ala Ser Cys Ser Ala Met Pro Gly Ser Val Val Val Thr
275 280 285
Arg Ala Gly Ala Gly Thr Gly Val Lys Tyr Tyr Asn Asn Met Phe Leu
290 295 300
Arg His Val Ala Asp Ile Asp Gly Leu Ala Phe Trp Arg Ile Leu Lys
305 310 315 320
Val Gln Ser Lys Asp Asp Leu Ala Cys Ser Gly Lys Phe Leu Glu His
325 330 335
His Glu Glu Gly Phe Thr Asp Pro Cys Tyr Phe Leu Asn Asp Ser Ser
340 345 350
Leu Ala Thr Lys Leu Lys Phe Asp Ile Leu Ser Gly Lys Phe Ser Asp
355 360 365
Glu Val Lys Gln Ala Ile Ile Ala Gly His Val Val Val Gly Ser Ala
370 375 380
Leu Val Asp Ile Val Asp Asp Ala Leu Gly Gln Pro Trp Phe Ile Arg
385 390 395 400
Lys Leu Gly Asp Leu Ala Ser Ala Pro Trp Glu Gln Leu Lys Ala Val
405 410 415
Val Arg Gly Leu Gly Leu Leu Ser Asp Glu Val Val Leu Phe Gly Lys
420 425 430
Arg Leu Ser Cys Ala Thr Leu Ser Ile Val Asn Gly Val Phe Glu Phe
435 440 445
Leu Ala Asp Val Pro Glu Lys Leu Ala Ala Ala Val Thr Val Phe Val
450 455 460
Asn Phe Leu Asn Glu Phe Phe Glu Ser Ala Cys Asp Cys Leu Lys Val
465 470 475 480
Gly Gly Lys Thr Phe Asn Lys Val Gly Ser Tyr Val Leu Phe Asp Asn
485 490 495
Ala Leu Val Lys Leu Val Lys Ala Lys Ala Arg Gly Pro Arg Gln Ala
500 505 510
Gly Ile Cys Glu Val Arg Tyr Thr Ser Leu Val Val Gly Ser Thr Thr
515 520 525
Lys Val Val Ser Lys Arg Val Glu Asn Ala Asn Val Asn Leu Val Val
530 535 540
Val Asp Glu Asp Val Thr Leu Asn Thr Thr Gly Arg Thr Val Val Val
545 550 555 560
Asp Gly Leu Ala Phe Phe Glu Ser Asp Gly Phe Tyr Arg His Leu Ala
565 570 575
Asp Ala Asp Val Val Ile Glu His Pro Val Tyr Lys Ser Ala Cys Glu
580 585 590
Leu Lys Pro Val Phe Glu Cys Asp Pro Ile Pro Asp Phe Pro Leu Pro
595 600 605
Val Ala Ala Ser Val Ala Glu Leu Cys Val Gln Thr Asp Leu Leu Leu
610 615 620
Lys Asn Tyr Asn Thr Pro Tyr Lys Thr Tyr Ser Cys Val Val Arg Gly
625 630 635 640
Asp Lys Cys Cys Ile Thr Cys Thr Leu Gln Phe Lys Ala Pro Ser Tyr
645 650 655
Val Glu Asp Ala Val Asn Phe Val Asp Leu Cys Thr Lys Asn Ile Gly
660 665 670
Thr Ala Gly Phe His Glu Phe Tyr Ile Thr Ala His Glu Gln Gln Asp
675 680 685
Leu Gln Gly Phe Leu Thr Thr Cys Cys Thr Met Ser Gly Phe Glu Cys
690 695 700
Phe Met Pro Thr Ile Pro Gln Cys Pro Ala Val Leu Glu Glu Ile Asp
705 710 715 720
Gly Gly Ser Ile Trp Arg Ser Phe Ile Thr Gly Leu Asn Thr Met Trp
725 730 735
Asp Phe Cys Lys Arg Leu Lys Val Ser Phe Gly Leu Asp Gly Ile Val
740 745 750
Val Thr Val Ala Arg Lys Phe Lys Arg Leu Gly Ala Leu Leu Ala Glu
755 760 765
Met Tyr Asn Thr Tyr Leu Ser Thr Val Val Glu Asn Leu Val Leu Ala
770 775 780
Gly Val Ser Phe Lys Tyr Tyr Ala Thr Ser Val Pro Lys Ile Val Leu
785 790 795 800
Gly Gly Cys Phe His Ser Val Lys Ser Val Phe Ala Ser Val Phe Gln
805 810 815
Ile Pro Val Gln Ala Gly Ile Glu Lys Phe Lys Val Phe Leu Asn Cys
820 825 830
Val His Pro Val Val Pro Arg Val Ile Glu Thr Ser Phe Val Glu Leu
835 840 845
Glu Glu Thr Thr Phe Lys Pro Pro Ala Leu Asn Gly Gly Ile Ala Ile
850 855 860
Val Asp Gly Phe Ala Phe Tyr Tyr Asp Gly Thr Leu Tyr Tyr Pro Thr
865 870 875 880
Asp Gly Asn Ser Val Val Pro Ile Cys Phe Lys Lys Lys Gly Gly Gly
885 890 895
Asp Val Lys Phe Ser Asp Glu Val Ser Val Lys Thr Ile Asp Pro Val
900 905 910
Tyr Lys Val Ser Leu Glu Phe Glu Phe Glu Ser Glu Thr Ile Met Ala
915 920 925
Val Leu Asn Lys Ala Val Gly Asn Arg Ile Lys Val Thr Gly Gly Trp
930 935 940
Asp Asp Val Val Glu Tyr Ile Asn Val Ala Ile Glu Val Leu Lys Asp
945 950 955 960
His Val Glu Val Pro Lys Tyr Tyr Ile Tyr Asp Glu Glu Gly Gly Thr
965 970 975
Asp Pro Asn Leu Pro Val Met Val Ser Gln Trp Pro Leu Asn Asp Asp
980 985 990
Thr Ile Ser Gln Asp Leu Leu Asp Val Glu Val Val Thr Asp Ala Pro
995 1000 1005
Ile Asp Ser Glu Gly Asp Glu Val Asp Ser Ser Ala Pro Glu Lys
1010 1015 1020
Val Ala Asp Val Ala Asn Ser Glu Pro Gly Asp Asp Gly Leu Pro
1025 1030 1035
Val Ala Pro Glu Thr Asn Val Glu Ser Glu Val Glu Glu Val Ala
1040 1045 1050
Ala Thr Leu Ser Phe Ile Lys Asp Thr Pro Ser Thr Val Thr Lys
1055 1060 1065
Asp Pro Phe Ala Phe Asp Phe Val Ser Tyr Gly Gly Leu Lys Val
1070 1075 1080
Leu Arg Gln Ser His Asn Asn Cys Trp Val Thr Ser Thr Leu Val
1085 1090 1095
Gln Leu Gln Leu Leu Gly Ile Val Asp Asp Pro Ala Met Glu Leu
1100 1105 1110
Phe Ser Ala Gly Arg Val Gly Pro Met Val Arg Lys Cys Tyr Glu
1115 1120 1125
Ser Gln Lys Ala Ile Leu Gly Ser Leu Gly Asp Val Ser Ala Cys
1130 1135 1140
Leu Glu Ser Leu Thr Lys Asp Leu His Thr Leu Lys Ile Thr Cys
1145 1150 1155
Ser Val Val Cys Gly Cys Gly Thr Gly Glu Arg Ile Tyr Glu Gly
1160 1165 1170
Cys Ala Phe Arg Met Thr Pro Thr Leu Glu Pro Phe Pro Tyr Gly
1175 1180 1185
Ala Cys Ala Gln Cys Ala Gln Val Leu Met His Thr Phe Lys Ser
1190 1195 1200
Ile Val Gly Thr Gly Ile Phe Cys Arg Asp Thr Thr Ala Leu Ser
1205 1210 1215
Leu Asp Ser Leu Val Val Lys Pro Leu Cys Ala Ala Ala Phe Ile
1220 1225 1230
Gly Lys Asp Ser Gly His Tyr Val Thr Asn Phe Tyr Asp Ala Ala
1235 1240 1245
Met Ala Ile Asp Gly Tyr Gly Arg His Gln Ile Lys Tyr Asp Thr
1250 1255 1260
Leu Asn Thr Ile Cys Val Lys Asp Val Asn Trp Thr Ala Pro Leu
1265 1270 1275
Val Pro Ala Val Asp Ser Val Val Glu Pro Val Val Lys Pro Phe
1280 1285 1290
Tyr Ser Tyr Lys Asn Val Asp Phe Tyr Gln Gly Asp Phe Ser Asp
1295 1300 1305
Leu Val Lys Leu Pro Cys Asp Phe Val Val Asn Ala Ala Asn Glu
1310 1315 1320
Lys Leu Ser His Gly Gly Gly Ile Ala Lys Ala Ile Asp Val Tyr
1325 1330 1335
Thr Lys Gly Met Leu Gln Lys Cys Ser Asn Asp Tyr Ile Lys Ala
1340 1345 1350
His Gly Pro Ile Lys Val Gly Arg Gly Val Met Leu Glu Ala Leu
1355 1360 1365
Gly Leu Lys Val Phe Asn Val Val Gly Pro Arg Lys Gly Lys His
1370 1375 1380
Ala Pro Glu Leu Leu Val Lys Ala Tyr Lys Ser Val Phe Ala Asn
1385 1390 1395
Ser Gly Val Ala Leu Thr Pro Leu Ile Ser Val Gly Ile Phe Ser
1400 1405 1410
Val Pro Leu Glu Glu Ser Leu Ser Ala Phe Leu Ala Cys Val Gly
1415 1420 1425
Asp Arg His Cys Lys Cys Phe Cys Tyr Gly Asp Lys Glu Arg Glu
1430 1435 1440
Ala Ile Ile Lys Tyr Met Asp Gly Leu Val Asp Ala Ile Phe Lys
1445 1450 1455
Glu Ala Leu Val Asp Thr Thr Pro Val Gln Glu Asp Val Gln Gln
1460 1465 1470
Val Ser Gln Lys Pro Val Leu Pro Asn Phe Glu Pro Phe Arg Ile
1475 1480 1485
Glu Gly Ala His Ala Phe Tyr Glu Cys Asn Pro Glu Gly Leu Met
1490 1495 1500
Ser Leu Gly Ala Asp Lys Leu Val Leu Phe Thr Asn Ser Asn Leu
1505 1510 1515
Asp Phe Cys Ser Val Gly Lys Cys Leu Asn Asp Val Thr Ser Gly
1520 1525 1530
Ala Leu Leu Glu Ala Ile Asn Val Phe Lys Lys Ser Asn Lys Thr
1535 1540 1545
Val Pro Ala Gly Asn Cys Val Thr Leu Asp Cys Ala Asn Met Ile
1550 1555 1560
Ser Ile Thr Met Val Val Leu Pro Phe Asp Gly Asp Ala Asn Tyr
1565 1570 1575
Asp Lys Asn Tyr Ala Arg Ala Val Val Lys Val Ser Lys Leu Lys
1580 1585 1590
Gly Lys Leu Val Leu Ala Val Asp Asp Ala Thr Leu Tyr Ser Lys
1595 1600 1605
Leu Ser His Leu Ser Val Leu Gly Phe Val Ser Thr Pro Asp Asp
1610 1615 1620
Val Glu Arg Phe Tyr Ala Asn Lys Ser Val Val Ile Lys Val Thr
1625 1630 1635
Glu Asp Thr Arg Ser Val Lys Ala Val Lys Val Glu Ser Thr Ala
1640 1645 1650
Thr Tyr Gly Gln Gln Ile Gly Pro Cys Leu Val Asn Asp Thr Val
1655 1660 1665
Val Thr Asp Asn Lys Pro Val Val Ala Asp Val Val Ala Lys Val
1670 1675 1680
Val Pro Asn Ala Asn Trp Asp Ser His Tyr Gly Phe Asp Lys Ala
1685 1690 1695
Gly Glu Phe His Met Leu Asp His Thr Gly Phe Thr Phe Pro Ser
1700 1705 1710
Glu Val Val Asn Gly Arg Arg Val Ile Lys Thr Thr Asp Asn Asn
1715 1720 1725
Cys Trp Val Asn Val Thr Cys Leu Gln Leu Gln Phe Ala Arg Phe
1730 1735 1740
Arg Phe Lys Ser Ala Gly Leu Gln Ala Met Trp Glu Ser Tyr Cys
1745 1750 1755
Thr Gly Asp Val Ala Met Phe Val His Trp Leu Tyr Trp Leu Thr
1760 1765 1770
Gly Val Asp Lys Gly Gln Pro Ser Asp Ser Glu Asn Ala Leu Asn
1775 1780 1785
Met Leu Ser Lys Tyr Ile Val Pro Ala Gly Ser Val Thr Ile Glu
1790 1795 1800
Arg Val Thr His Asp Gly Cys Cys Cys Ser Lys Arg Val Val Thr
1805 1810 1815
Ala Pro Val Val Asn Ala Ser Val Leu Lys Leu Gly Val Glu Asp
1820 1825 1830
Gly Leu Cys Pro His Gly Leu Asn Tyr Ile Gly Lys Val Val Val
1835 1840 1845
Val Lys Gly Thr Thr Ile Val Val Asn Val Gly Lys Pro Val Val
1850 1855 1860
Ala Pro Ser His Leu Phe Leu Lys Gly Val Ser Tyr Thr Thr Phe
1865 1870 1875
Leu Asp Asn Gly Asn Gly Val Val Gly His Tyr Thr Val Phe Asp
1880 1885 1890
His Gly Thr Gly Met Val His Asp Gly Asp Ala Phe Val Pro Gly
1895 1900 1905
Asp Leu Asn Val Ser Pro Val Thr Asn Val Val Val Ser Glu Gln
1910 1915 1920
Thr Ala Val Val Ile Lys Asp Pro Val Lys Lys Ala Glu Leu Asp
1925 1930 1935
Ala Thr Lys Leu Leu Asp Thr Met Asn Tyr Ala Ser Glu Arg Phe
1940 1945 1950
Phe Ser Phe Gly Asp Phe Met Ser Arg Asn Leu Ile Thr Val Phe
1955 1960 1965
Leu Tyr Ile Leu Ser Ile Leu Gly Leu Cys Phe Arg Ala Phe Arg
1970 1975 1980
Lys Arg Asp Val Lys Val Leu Ala Gly Val Pro Gln Arg Thr Gly
1985 1990 1995
Ile Ile Leu Arg Lys Ser Met Arg Tyr Asn Ala Lys Ala Leu Gly
2000 2005 2010
Val Phe Phe Lys Leu Lys Leu Tyr Trp Phe Lys Val Leu Gly Lys
2015 2020 2025
Phe Ser Leu Gly Ile Tyr Ala Leu Tyr Ala Leu Leu Phe Met Thr
2030 2035 2040
Ile Arg Phe Thr Pro Ile Gly Ser Pro Val Cys Asp Asp Val Val
2045 2050 2055
Ala Gly Tyr Ala Asn Ser Ser Phe Asp Lys Asn Glu Tyr Cys Asn
2060 2065 2070
Ser Val Ile Cys Lys Val Cys Leu Tyr Gly Tyr Gln Glu Leu Ser
2075 2080 2085
Asp Phe Ser His Thr Gln Val Val Trp Gln His Leu Arg Asp Pro
2090 2095 2100
Leu Ile Gly Asn Val Met Pro Phe Phe Tyr Leu Ala Phe Leu Ala
2105 2110 2115
Ile Phe Gly Gly Val Tyr Val Lys Ala Ile Thr Leu Tyr Phe Ile
2120 2125 2130
Phe Gln Tyr Leu Asn Ser Leu Gly Val Phe Leu Gly Leu Gln Gln
2135 2140 2145
Ser Ile Trp Phe Leu Gln Leu Val Pro Phe Asp Val Phe Gly Asp
2150 2155 2160
Glu Ile Val Val Phe Phe Ile Val Thr Arg Val Leu Met Phe Ile
2165 2170 2175
Lys His Val Cys Leu Gly Cys Asp Lys Ala Ser Cys Val Ala Cys
2180 2185 2190
Ser Lys Ser Ala Arg Leu Lys Arg Val Pro Val Gln Thr Ile Phe
2195 2200 2205
Gln Gly Thr Ser Lys Ser Phe Tyr Val His Ala Asn Gly Gly Ser
2210 2215 2220
Lys Phe Cys Lys Lys His Asn Phe Phe Cys Leu Asn Cys Asp Ser
2225 2230 2235
Tyr Gly Pro Gly Cys Thr Phe Ile Asn Asp Val Ile Ala Thr Glu
2240 2245 2250
Val Gly Asn Val Val Lys Leu Asn Val Gln Pro Thr Gly Pro Ala
2255 2260 2265
Thr Ile Leu Ile Asp Lys Val Glu Phe Ser Asn Gly Phe Tyr Tyr
2270 2275 2280
Leu Tyr Ser Gly Asp Thr Phe Trp Lys Tyr Asn Phe Asp Ile Thr
2285 2290 2295
Asp Ser Lys Tyr Thr Cys Lys Glu Ala Leu Lys Asn Cys Ser Ile
2300 2305 2310
Ile Thr Asp Phe Ile Val Phe Asn Asn Asn Gly Ser Asn Val Asn
2315 2320 2325
Gln Val Lys Asn Ala Cys Val Tyr Phe Ser Gln Met Leu Cys Lys
2330 2335 2340
Pro Val Lys Leu Val Asp Ser Ala Leu Leu Ala Ser Leu Ser Val
2345 2350 2355
Asp Phe Gly Ala Ser Leu His Ser Ala Phe Val Ser Val Leu Ser
2360 2365 2370
Asn Ser Phe Gly Lys Asp Leu Ser Ser Cys Asn Asp Met Gln Asp
2375 2380 2385
Cys Lys Ser Thr Leu Gly Phe Asp Asp Val Pro Leu Asp Thr Phe
2390 2395 2400
Asn Ala Ala Val Ala Glu Ala His Arg Tyr Asp Val Leu Leu Thr
2405 2410 2415
Asp Met Ser Phe Asn Asn Phe Thr Thr Ser Tyr Ala Lys Pro Glu
2420 2425 2430
Glu Lys Phe Pro Val His Asp Ile Ala Thr Cys Met Arg Val Gly
2435 2440 2445
Ala Lys Ile Val Asn His Asn Val Leu Val Lys Asp Ser Ile Pro
2450 2455 2460
Val Val Trp Leu Val Arg Asp Phe Ile Ala Leu Ser Glu Glu Thr
2465 2470 2475
Arg Lys Tyr Ile Ile Arg Thr Thr Lys Val Lys Gly Ile Thr Phe
2480 2485 2490
Met Leu Thr Phe Asn Asp Cys Arg Met His Thr Thr Ile Pro Thr
2495 2500 2505
Val Cys Ile Ala Asn Lys Lys Gly Ala Gly Leu Pro Ser Phe Ser
2510 2515 2520
Lys Val Lys Lys Phe Phe Trp Phe Leu Cys Leu Phe Ile Val Ala
2525 2530 2535
Ala Phe Phe Ala Leu Ser Phe Leu Asp Phe Ser Thr Gln Val Ser
2540 2545 2550
Ser Asp Ser Asp Tyr Asp Phe Lys Tyr Ile Glu Ser Gly Gln Leu
2555 2560 2565
Lys Thr Phe Asp Asn Pro Leu Ser Cys Val His Asn Val Phe Ile
2570 2575 2580
Asn Phe Asp Gln Trp His Asp Ala Lys Phe Gly Phe Thr Pro Val
2585 2590 2595
Asn Asn Pro Ser Cys Pro Ile Val Val Gly Val Ser Asp Glu Ala
2600 2605 2610
Arg Thr Val Pro Gly Ile Pro Ala Gly Val Tyr Leu Ala Gly Lys
2615 2620 2625
Thr Leu Val Phe Ala Ile Asn Thr Ile Phe Gly Thr Ser Gly Leu
2630 2635 2640
Cys Phe Asp Ala Ser Gly Val Ala Asp Lys Gly Ala Cys Ile Phe
2645 2650 2655
Asn Ser Ala Cys Thr Thr Leu Ser Gly Leu Gly Gly Thr Ala Val
2660 2665 2670
Tyr Cys Tyr Lys Asn Gly Leu Val Glu Gly Ala Lys Leu Tyr Ser
2675 2680 2685
Glu Leu Ala Pro His Ser Tyr Tyr Lys Met Val Asp Gly Asn Ala
2690 2695 2700
Val Ser Leu Pro Glu Ile Ile Ser Arg Gly Phe Gly Ile Arg Thr
2705 2710 2715
Ile Arg Thr Lys Ala Met Thr Tyr Cys Arg Val Gly Gln Cys Val
2720 2725 2730
Gln Ser Ala Glu Gly Val Cys Phe Gly Ala Asp Arg Phe Phe Val
2735 2740 2745
Tyr Asn Ala Glu Ser Gly Ser Asp Phe Val Cys Gly Thr Gly Leu
2750 2755 2760
Phe Thr Leu Leu Met Asn Val Ile Ser Val Phe Ser Lys Thr Val
2765 2770 2775
Pro Val Thr Val Leu Ser Gly Gln Ile Leu Phe Asn Cys Ile Ile
2780 2785 2790
Ala Phe Val Ala Val Ala Val Cys Phe Leu Phe Thr Lys Phe Lys
2795 2800 2805
Arg Met Phe Gly Asp Met Ser Val Gly Val Phe Thr Val Gly Ala
2810 2815 2820
Cys Thr Leu Leu Asn Asn Val Ser Tyr Ile Val Thr Gln Asn Thr
2825 2830 2835
Leu Gly Met Leu Gly Tyr Ala Thr Leu Tyr Phe Leu Cys Thr Lys
2840 2845 2850
Gly Val Arg Tyr Met Trp Ile Trp His Leu Gly Phe Leu Ile Ser
2855 2860 2865
Tyr Ile Leu Ile Ala Pro Trp Trp Val Leu Met Val Tyr Ala Phe
2870 2875 2880
Ser Ala Ile Phe Glu Phe Met Pro Asn Leu Phe Lys Leu Lys Val
2885 2890 2895
Ser Thr Gln Leu Phe Glu Gly Asp Lys Phe Val Gly Ser Phe Glu
2900 2905 2910
Asn Ala Ala Ala Gly Thr Phe Val Leu Asp Met His Ala Tyr Glu
2915 2920 2925
Arg Leu Ala Asn Ser Ile Ser Thr Glu Lys Leu Arg Gln Tyr Ala
2930 2935 2940
Ser Thr Tyr Asn Lys Tyr Lys Tyr Tyr Ser Gly Ser Ala Ser Glu
2945 2950 2955
Ala Asp Tyr Arg Leu Ala Cys Phe Ala His Leu Ala Lys Ala Met
2960 2965 2970
Met Asp Tyr Ala Ser Asn His Asn Asp Thr Leu Tyr Thr Pro Pro
2975 2980 2985
Thr Val Ser Tyr Asn Ser Thr Leu Gln Ala Gly Leu Arg Lys Met
2990 2995 3000
Ala Gln Pro Ser Gly Val Val Glu Lys Cys Ile Val Arg Val Cys
3005 3010 3015
Tyr Gly Asn Met Ala Leu Asn Gly Leu Trp Leu Gly Asp Ile Val
3020 3025 3030
Met Cys Pro Arg His Val Ile Ala Ser Ser Thr Thr Ser Thr Ile
3035 3040 3045
Asp Tyr Asp Tyr Ala Leu Ser Val Leu Arg Leu His Asn Phe Ser
3050 3055 3060
Ile Ser Ser Gly Asn Val Phe Leu Gly Val Val Ser Ala Thr Met
3065 3070 3075
Arg Gly Ala Leu Leu Gln Ile Lys Val Asn Gln Asn Asn Val His
3080 3085 3090
Thr Pro Lys Tyr Thr Tyr Arg Thr Val Arg Pro Gly Glu Ser Phe
3095 3100 3105
Asn Ile Leu Ala Cys Tyr Asp Gly Ala Ala Ala Gly Val Tyr Gly
3110 3115 3120
Val Asn Met Arg Ser Asn Tyr Thr Ile Arg Gly Ser Phe Ile Asn
3125 3130 3135
Gly Ala Cys Gly Ser Pro Gly Tyr Asn Ile Asn Asn Gly Thr Val
3140 3145 3150
Glu Phe Cys Tyr Leu His Gln Leu Glu Leu Gly Ser Gly Cys His
3155 3160 3165
Val Gly Ser Asp Leu Asp Gly Val Met Tyr Gly Gly Tyr Glu Asp
3170 3175 3180
Gln Pro Thr Leu Gln Val Glu Gly Ala Ser Ser Leu Phe Thr Glu
3185 3190 3195
Asn Val Leu Ala Phe Leu Tyr Ala Ala Leu Ile Asn Gly Ser Thr
3200 3205 3210
Trp Trp Leu Ser Ser Ser Arg Ile Ala Val Asp Arg Phe Asn Glu
3215 3220 3225
Trp Ala Val His Asn Gly Met Thr Thr Val Gly Asn Thr Asp Cys
3230 3235 3240
Phe Ser Ile Leu Ala Ala Lys Thr Gly Val Asp Val Gln Arg Leu
3245 3250 3255
Leu Ala Ser Ile Gln Ser Leu His Lys Asn Phe Gly Gly Lys Gln
3260 3265 3270
Ile Leu Gly His Thr Ser Leu Thr Asp Glu Phe Thr Thr Gly Glu
3275 3280 3285
Val Val Arg Gln Met Tyr Gly Val Asn Leu Gln Gly Gly Tyr Val
3290 3295 3300
Ser Arg Ala Cys Arg Asn Val Leu Leu Val Gly Ser Phe Leu Thr
3305 3310 3315
Phe Phe Trp Ser Glu Leu Val Ser Tyr Thr Lys Phe Phe Trp Val
3320 3325 3330
Asn Pro Gly Tyr Val Thr Pro Met Phe Ala Cys Leu Ser Leu Leu
3335 3340 3345
Ser Ser Leu Leu Met Phe Thr Leu Lys His Lys Thr Leu Phe Phe
3350 3355 3360
Gln Val Phe Leu Ile Pro Ala Leu Ile Val Thr Ser Cys Ile Asn
3365 3370 3375
Leu Ala Phe Asp Val Glu Val Tyr Asn Tyr Leu Ala Glu His Phe
3380 3385 3390
Asp Tyr His Val Ser Leu Met Gly Phe Asn Ala Gln Gly Leu Val
3395 3400 3405
Asn Ile Phe Val Cys Phe Val Val Thr Ile Leu His Gly Thr Tyr
3410 3415 3420
Thr Trp Arg Phe Phe Asn Thr Pro Ala Ser Ser Val Thr Tyr Val
3425 3430 3435
Val Ala Leu Leu Thr Ala Ala Tyr Asn Tyr Phe Tyr Ala Ser Asp
3440 3445 3450
Ile Leu Ser Cys Ala Met Thr Leu Phe Ala Ser Val Thr Gly Asn
3455 3460 3465
Trp Phe Val Gly Ala Val Cys Tyr Lys Val Ala Val Tyr Met Ala
3470 3475 3480
Leu Arg Phe Pro Thr Phe Val Ala Ile Phe Gly Asp Ile Lys Ser
3485 3490 3495
Val Met Phe Cys Tyr Leu Val Leu Gly Tyr Phe Thr Cys Cys Phe
3500 3505 3510
Tyr Gly Ile Leu Tyr Trp Phe Asn Arg Phe Phe Lys Val Ser Val
3515 3520 3525
Gly Val Tyr Asp Tyr Thr Val Ser Ala Ala Glu Phe Lys Tyr Met
3530 3535 3540
Val Ala Asn Gly Leu Arg Ala Pro Thr Gly Thr Leu Asp Ser Leu
3545 3550 3555
Leu Leu Ser Ala Lys Leu Ile Gly Ile Gly Gly Glu Arg Asn Ile
3560 3565 3570
Lys Ile Ser Ser Val Gln Ser Lys Leu Thr Asp Ile Lys Cys Ser
3575 3580 3585
Asn Val Val Leu Leu Gly Cys Leu Ser Ser Met Asn Val Ser Ala
3590 3595 3600
Asn Ser Thr Glu Trp Ala Tyr Cys Val Asp Leu His Asn Lys Ile
3605 3610 3615
Asn Leu Cys Asn Asp Pro Glu Lys Ala Gln Glu Met Leu Leu Ala
3620 3625 3630
Leu Leu Ala Phe Phe Leu Ser Lys Asn Ser Ala Phe Gly Leu Asp
3635 3640 3645
Asp Leu Leu Glu Ser Tyr Phe Asn Asp Asn Ser Met Leu Gln Ser
3650 3655 3660
Val Ala Ser Thr Tyr Val Gly Leu Pro Ser Tyr Val Ile Tyr Glu
3665 3670 3675
Asn Ala Arg Gln Gln Tyr Glu Asp Ala Val Asn Asn Gly Ser Pro
3680 3685 3690
Pro Gln Leu Val Lys Gln Leu Arg His Ala Met Asn Val Ala Lys
3695 3700 3705
Ser Glu Phe Asp Arg Glu Ala Ser Thr Gln Arg Lys Leu Asp Arg
3710 3715 3720
Met Ala Glu Gln Ala Ala Ala Gln Met Tyr Lys Glu Ala Arg Ala
3725 3730 3735
Val Asn Arg Lys Ser Lys Val Val Ser Ala Met His Ser Leu Leu
3740 3745 3750
Phe Gly Met Leu Arg Arg Leu Asp Met Ser Ser Val Asp Thr Ile
3755 3760 3765
Leu Asn Leu Ala Lys Asp Gly Val Val Pro Leu Ser Val Ile Pro
3770 3775 3780
Ala Val Ser Ala Thr Lys Leu Asn Ile Val Thr Ser Asp Ile Asp
3785 3790 3795
Ser Tyr Asn Arg Ile Gln Arg Glu Gly Cys Val His Tyr Ala Gly
3800 3805 3810
Thr Ile Trp Asn Ile Ile Asp Ile Lys Asp Asn Asp Gly Lys Val
3815 3820 3825
Val His Val Lys Glu Val Thr Ala Gln Asn Ala Glu Ser Leu Ser
3830 3835 3840
Trp Pro Leu Val Leu Gly Cys Glu Arg Ile Val Lys Leu Gln Asn
3845 3850 3855
Asn Glu Ile Ile Pro Gly Lys Leu Lys Gln Arg Ser Ile Lys Ala
3860 3865 3870
Glu Gly Asp Gly Ile Val Gly Glu Gly Lys Ala Leu Tyr Asn Asn
3875 3880 3885
Glu Gly Gly Arg Thr Phe Met Tyr Ala Phe Ile Ser Asp Lys Pro
3890 3895 3900
Asp Leu Arg Val Val Lys Trp Glu Phe Asp Gly Gly Cys Asn Thr
3905 3910 3915
Ile Glu Leu Glu Pro Pro Arg Lys Phe Leu Val Asp Ser Pro Asn
3920 3925 3930
Gly Ala Gln Ile Lys Tyr Leu Tyr Phe Val Arg Asn Leu Asn Thr
3935 3940 3945
Leu Arg Arg Gly Ala Val Leu Gly Tyr Ile Gly Ala Thr Val Arg
3950 3955 3960
Leu Gln Ala Gly Lys Gln Thr Glu Gln Ala Ile Asn Ser Ser Leu
3965 3970 3975
Leu Thr Leu Cys Ala Phe Ala Val Asp Pro Ala Lys Thr Tyr Ile
3980 3985 3990
Asp Ala Val Lys Ser Gly His Lys Pro Val Gly Asn Cys Val Lys
3995 4000 4005
Met Leu Ala Asn Gly Ser Gly Asn Gly Gln Ala Val Thr Asn Gly
4010 4015 4020
Val Glu Ala Ser Thr Asn Gln Asp Ser Tyr Gly Gly Ala Ser Val
4025 4030 4035
Cys Leu Tyr Cys Arg Ala His Val Glu His Pro Ser Met Asp Gly
4040 4045 4050
Phe Cys Arg Leu Lys Gly Lys Tyr Val Gln Val Pro Leu Gly Thr
4055 4060 4065
Val Asp Pro Ile Arg Phe Val Leu Glu Asn Asp Val Cys Lys Val
4070 4075 4080
Cys Gly Cys Trp Leu Ser Asn Gly Cys Thr Cys Asp Arg Ser Ile
4085 4090 4095
Met Gln Ser Thr Asp Tyr Gly Leu Phe Lys Arg Val Arg Gly Ser
4100 4105 4110
Ser Ala Ala Arg Leu Glu Pro Cys Asn Gly Thr Asp Thr Gln His
4115 4120 4125
Val Tyr Arg Ala Phe Asp Ile Tyr Asn Lys Asp Val Ala Cys Leu
4130 4135 4140
Gly Lys Phe Leu Lys Val Asn Cys Val Arg Leu Lys Asn Leu Asp
4145 4150 4155
Lys His Asp Ala Phe Tyr Val Val Lys Arg Cys Thr Lys Ser Ala
4160 4165 4170
Met Glu His Glu Gln Ser Ile Tyr Ser Arg Leu Glu Lys Cys Gly
4175 4180 4185
Ala Ile Ala Glu His Asp Phe Phe Thr Trp Lys Asp Gly Arg Ala
4190 4195 4200
Ile Tyr Gly Asn Val Cys Arg Lys Asp Leu Thr Glu Tyr Thr Met
4205 4210 4215
Met Asp Leu Cys Tyr Ala Leu Arg Asn Phe Asp Glu Asn Asn Cys
4220 4225 4230
Asp Val Leu Lys Ser Ile Leu Ile Lys Val Gly Ala Cys Glu Glu
4235 4240 4245
Ser Tyr Phe Asn Asn Lys Val Trp Phe Asp Pro Val Glu Asn Glu
4250 4255 4260
Asp Ile His Arg Val Tyr Ala Leu Leu Gly Thr Ile Val Ala Arg
4265 4270 4275
Ala Met Leu Lys Cys Val Lys Phe Cys Asp Ala Met Val Glu Gln
4280 4285 4290
Gly Ile Val Gly Val Val Thr Leu Asp Asn Gln Asp Leu Asn Gly
4295 4300 4305
Asp Phe Tyr Asp Phe Gly Asp Phe Thr Cys Ser Ile Lys Gly Met
4310 4315 4320
Gly Val Pro Ile Cys Thr Ser Tyr Tyr Ser Tyr Met Met Pro Val
4325 4330 4335
Met Gly Met Thr Asn Cys Leu Ala Ser Glu Cys Phe Val Lys Ser
4340 4345 4350
Asp Ile Phe Gly Glu Asp Phe Lys Ser Tyr Asp Leu Leu Glu Tyr
4355 4360 4365
Asp Phe Thr Glu His Lys Thr Ala Leu Phe Asn Lys Tyr Phe Lys
4370 4375 4380
Tyr Trp Gly Leu Gln Tyr His Pro Asn Cys Val Asp Cys Ser Asp
4385 4390 4395
Glu Gln Cys Ile Val His Cys Ala Asn Phe Asn Thr Leu Phe Ser
4400 4405 4410
Thr Thr Ile Pro Ile Thr Ala Phe Gly Pro Leu Cys Arg Lys Cys
4415 4420 4425
Trp Ile Asp Gly Val Pro Leu Val Thr Thr Ala Gly Tyr His Phe
4430 4435 4440
Lys Gln Leu Gly Ile Val Trp Asn Asn Asp Leu Asn Leu His Ser
4445 4450 4455
Ser Arg Leu Ser Ile Asn Glu Leu Leu Gln Phe Cys Ser Asp Pro
4460 4465 4470
Ala Leu Leu Ile Ala Ser Ser Pro Ala Leu Val Asp Gln Arg Thr
4475 4480 4485
Val Cys Phe Ser Val Ala Ala Leu Gly Thr Gly Met Thr Asn Gln
4490 4495 4500
Thr Val Lys Pro Gly His Phe Asn Lys Glu Phe Tyr Asp Phe Leu
4505 4510 4515
Leu Glu Gln Gly Phe Phe Ser Glu Gly Ser Glu Leu Thr Leu Lys
4520 4525 4530
His Phe Phe Phe Ala Gln Lys Val Asp Ala Ala Val Lys Asp Phe
4535 4540 4545
Asp Tyr Tyr Arg Tyr Asn Arg Pro Thr Val Leu Asp Ile Cys Gln
4550 4555 4560
Ala Arg Val Val Tyr Gln Ile Val Gln Arg Tyr Phe Asp Ile Tyr
4565 4570 4575
Glu Gly Gly Cys Ile Thr Ala Lys Glu Val Val Val Thr Asn Leu
4580 4585 4590
Asn Lys Ser Ala Gly Tyr Pro Leu Asn Lys Phe Gly Lys Ala Gly
4595 4600 4605
Leu Tyr Tyr Glu Ser Leu Ser Tyr Glu Glu Gln Asp Glu Leu Tyr
4610 4615 4620
Ala Tyr Thr Lys Arg Asn Ile Leu Pro Thr Met Thr Gln Leu Asn
4625 4630 4635
Leu Lys Tyr Ala Ile Ser Gly Lys Glu Arg Ala Arg Thr Val Gly
4640 4645 4650
Gly Val Ser Leu Leu Ser Thr Met Thr Thr Arg Gln Tyr His Gln
4655 4660 4665
Lys His Leu Lys Ser Ile Val Asn Thr Arg Gly Ala Ser Val Val
4670 4675 4680
Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asn Met Leu Lys
4685 4690 4695
Asn Leu Ile Asp Gly Val Glu Asn Pro Cys Leu Met Gly Trp Asp
4700 4705 4710
Tyr Pro Lys Cys Asp Arg Ala Leu Pro Asn Met Ile Arg Met Ile
4715 4720 4725
Ser Ala Met Ile Leu Gly Ser Lys His Thr Thr Cys Cys Ser Ser
4730 4735 4740
Thr Asp Arg Phe Phe Arg Leu Cys Asn Glu Leu Ala Gln Val Leu
4745 4750 4755
Thr Glu Val Val Tyr Ser Asn Gly Gly Phe Tyr Leu Lys Pro Gly
4760 4765 4770
Gly Thr Thr Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val
4775 4780 4785
Phe Asn Ile Phe Gln Ala Val Ser Ala Asn Val Asn Lys Leu Leu
4790 4795 4800
Ser Val Asp Ser Asn Val Cys His Asn Leu Glu Val Lys Gln Leu
4805 4810 4815
Gln Arg Lys Leu Tyr Glu Cys Cys Tyr Arg Ser Thr Ile Val Asp
4820 4825 4830
Asp Gln Phe Val Val Glu Tyr Tyr Gly Tyr Leu Arg Lys His Phe
4835 4840 4845
Ser Met Met Ile Leu Ser Asp Asp Gly Val Val Cys Tyr Asn Asn
4850 4855 4860
Asp Tyr Ala Ser Leu Gly Tyr Val Ala Asp Leu Asn Ala Phe Lys
4865 4870 4875
Ala Val Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Ala Ser Lys
4880 4885 4890
Cys Trp Ile Glu Pro Asp Ile Asn Lys Gly Pro His Glu Phe Cys
4895 4900 4905
Ser Gln His Thr Met Gln Ile Val Asp Lys Glu Gly Thr Tyr Tyr
4910 4915 4920
Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Ser Ala Gly Val Phe
4925 4930 4935
Val Asp Asp Val Val Lys Thr Asp Ala Val Val Leu Leu Glu Arg
4940 4945 4950
Tyr Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Ser Lys His Glu
4955 4960 4965
Asn Pro Glu Tyr Lys Lys Val Phe Tyr Val Leu Leu Asp Trp Val
4970 4975 4980
Lys His Leu Tyr Lys Thr Leu Asn Ala Gly Val Leu Glu Ser Phe
4985 4990 4995
Ser Val Thr Leu Leu Glu Asp Ser Thr Ala Lys Phe Trp Asp Glu
5000 5005 5010
Ser Phe Tyr Ala Asn Met Tyr Glu Lys Ser Ala Val Leu Gln Ser
5015 5020 5025
Ala Gly Leu Cys Val Val Cys Gly Ser Gln Thr Val Leu Arg Cys
5030 5035 5040
Gly Asp Cys Leu Arg Arg Pro Met Leu Cys Thr Lys Cys Ala Tyr
5045 5050 5055
Asp His Val Ile Gly Thr Thr His Lys Phe Ile Leu Ala Ile Thr
5060 5065 5070
Pro Tyr Val Cys Cys Ala Ser Asp Cys Gly Val Asn Asp Val Thr
5075 5080 5085
Lys Leu Tyr Leu Gly Gly Leu Ser Tyr Trp Cys His Glu His Lys
5090 5095 5100
Pro Arg Leu Ala Phe Pro Leu Cys Ser Ala Gly Asn Val Phe Gly
5105 5110 5115
Leu Tyr Lys Asn Ser Ala Thr Gly Ser Pro Asp Val Glu Asp Phe
5120 5125 5130
Asn Arg Ile Ala Thr Ser Asp Trp Thr Asp Val Ser Asp Tyr Arg
5135 5140 5145
Leu Ala Asn Asp Val Lys Asp Ser Leu Arg Leu Phe Ala Ala Glu
5150 5155 5160
Thr Ile Lys Ala Lys Glu Glu Ser Val Lys Ser Ser Tyr Ala Cys
5165 5170 5175
Ala Thr Leu His Glu Val Val Gly Pro Lys Glu Leu Leu Leu Lys
5180 5185 5190
Trp Glu Val Gly Arg Pro Lys Pro Pro Leu Asn Arg Asn Ser Val
5195 5200 5205
Phe Thr Cys Tyr His Ile Thr Lys Asn Thr Lys Phe Gln Ile Gly
5210 5215 5220
Glu Phe Val Phe Glu Lys Ala Glu Tyr Asp Asn Asp Ala Val Thr
5225 5230 5235
Tyr Lys Thr Thr Ala Thr Thr Lys Leu Val Pro Gly Met Val Phe
5240 5245 5250
Val Leu Thr Ser His Asn Val Gln Pro Leu Arg Ala Pro Thr Ile
5255 5260 5265
Ala Asn Gln Glu Arg Tyr Ser Thr Ile His Lys Leu His Pro Ala
5270 5275 5280
Phe Asn Ile Pro Glu Ala Tyr Ser Ser Leu Val Pro Tyr Tyr Gln
5285 5290 5295
Leu Ile Gly Lys Gln Lys Ile Thr Thr Ile Gln Gly Pro Pro Gly
5300 5305 5310
Ser Gly Lys Ser His Cys Val Ile Gly Leu Gly Leu Tyr Tyr Pro
5315 5320 5325
Gly Ala Arg Ile Val Phe Thr Ala Cys Ser His Ala Ala Val Asp
5330 5335 5340
Ser Leu Cys Val Lys Ala Ser Thr Ala Tyr Ser Asn Asp Lys Cys
5345 5350 5355
Ser Arg Ile Ile Pro Gln Arg Ala Arg Val Glu Cys Tyr Asp Gly
5360 5365 5370
Phe Lys Ser Asn Asn Thr Ser Ala Gln Tyr Leu Phe Ser Thr Val
5375 5380 5385
Asn Ala Leu Pro Glu Cys Asn Ala Asp Ile Val Val Val Asp Glu
5390 5395 5400
Val Ser Met Cys Thr Asn Tyr Asp Leu Ser Val Ile Asn Gln Arg
5405 5410 5415
Ile Ser Tyr Arg His Val Val Tyr Val Gly Asp Pro Gln Gln Leu
5420 5425 5430
Pro Ala Pro Arg Val Met Ile Ser Arg Gly Thr Leu Glu Pro Lys
5435 5440 5445
Asp Tyr Asn Val Val Thr Gln Arg Met Cys Ala Leu Lys Pro Asp
5450 5455 5460
Val Phe Leu His Lys Cys Tyr Arg Cys Pro Ala Glu Ile Val Arg
5465 5470 5475
Thr Val Ser Glu Met Val Tyr Glu Asn Gln Phe Ile Pro Val His
5480 5485 5490
Pro Asp Ser Lys Gln Cys Phe Lys Ile Phe Cys Lys Gly Asn Val
5495 5500 5505
Gln Val Asp Asn Gly Ser Ser Ile Asn Arg Arg Gln Leu Asp Val
5510 5515 5520
Val Arg Met Phe Leu Ala Lys Asn Pro Arg Trp Ser Lys Ala Val
5525 5530 5535
Phe Ile Ser Pro Tyr Asn Ser Gln Asn Tyr Val Ala Ser Arg Leu
5540 5545 5550
Leu Gly Leu Gln Ile Gln Thr Val Asp Ser Ser Gln Gly Ser Glu
5555 5560 5565
Tyr Asp Tyr Val Ile Tyr Ala Gln Thr Ser Asp Thr Ala His Ala
5570 5575 5580
Ser Asn Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys Lys
5585 5590 5595
Gly Ile Leu Cys Ile Met Cys Asp Arg Ser Leu Phe Asp Leu Leu
5600 5605 5610
Lys Phe Phe Glu Leu Lys Leu Ser Asp Leu Gln Ala Asn Glu Gly
5615 5620 5625
Cys Gly Leu Phe Lys Asp Cys Ser Arg Gly Asp Asp Leu Leu Pro
5630 5635 5640
Pro Ser His Ala Asn Thr Phe Met Ser Leu Ala Asp Asn Phe Lys
5645 5650 5655
Thr Asp Gln Tyr Leu Ala Val Gln Ile Gly Val Asn Gly Pro Ile
5660 5665 5670
Lys Tyr Glu His Val Ile Ser Phe Met Gly Phe Arg Phe Asp Ile
5675 5680 5685
Asn Ile Pro Asn His His Thr Leu Phe Cys Thr Arg Asp Phe Ala
5690 5695 5700
Met Arg Asn Val Arg Gly Trp Leu Gly Phe Asp Val Glu Gly Ala
5705 5710 5715
His Val Val Gly Ser Asn Val Gly Thr Asn Val Pro Leu Gln Leu
5720 5725 5730
Gly Phe Ser Asn Gly Val Asp Phe Val Val Arg Pro Glu Gly Cys
5735 5740 5745
Val Val Thr Glu Ser Gly Asp Tyr Ile Lys Pro Val Arg Ala Arg
5750 5755 5760
Ala Pro Pro Gly Glu Gln Phe Ala His Leu Leu Pro Leu Leu Lys
5765 5770 5775
Arg Gly Gln Pro Trp Asp Val Val Arg Lys Arg Ile Val Gln Met
5780 5785 5790
Cys Ser Asp Tyr Leu Ala Asn Leu Ser Asp Ile Leu Ile Phe Val
5795 5800 5805
Leu Trp Ala Gly Gly Leu Glu Leu Thr Thr Met Arg Tyr Phe Val
5810 5815 5820
Lys Ile Gly Pro Ser Lys Ser Cys Asp Cys Gly Lys Val Ala Thr
5825 5830 5835
Cys Tyr Asn Ser Ala Leu His Thr Tyr Cys Cys Phe Lys His Ala
5840 5845 5850
Leu Gly Cys Asp Tyr Leu Tyr Asn Pro Tyr Cys Ile Asp Ile Gln
5855 5860 5865
Gln Trp Gly Tyr Lys Gly Ser Leu Ser Leu Asn His His Glu His
5870 5875 5880
Cys Asn Val His Arg Asn Glu His Val Ala Ser Gly Asp Ala Ile
5885 5890 5895
Met Thr Arg Cys Leu Ala Ile His Asp Cys Phe Val Lys Asn Val
5900 5905 5910
Asp Trp Ser Ile Thr Tyr Pro Phe Ile Gly Asn Glu Ala Val Ile
5915 5920 5925
Asn Lys Ser Gly Arg Ile Val Gln Ser His Thr Met Arg Ser Val
5930 5935 5940
Leu Lys Leu Tyr Asn Pro Lys Ala Ile Tyr Asp Ile Gly Asn Pro
5945 5950 5955
Lys Gly Ile Arg Cys Ala Val Thr Asp Ala Lys Trp Phe Cys Phe
5960 5965 5970
Asp Lys Asn Pro Thr Asn Ser Asn Val Lys Thr Leu Glu Tyr Asp
5975 5980 5985
Tyr Ile Thr His Gly Gln Phe Asp Gly Leu Cys Leu Phe Trp Asn
5990 5995 6000
Cys Asn Val Asp Met Tyr Pro Glu Phe Ser Val Val Cys Arg Phe
6005 6010 6015
Asp Thr Arg Cys Arg Ser Pro Leu Asn Leu Glu Gly Cys Asn Gly
6020 6025 6030
Gly Ser Leu Tyr Val Asn Asn His Ala Phe His Thr Pro Ala Phe
6035 6040 6045
Asp Lys Arg Ala Phe Ala Lys Leu Lys Pro Met Pro Phe Phe Phe
6050 6055 6060
Tyr Asp Asp Thr Glu Cys Asp Lys Leu Gln Asp Ser Ile Asn Tyr
6065 6070 6075
Val Pro Leu Arg Ala Ser Asn Cys Ile Thr Lys Cys Asn Val Gly
6080 6085 6090
Gly Ala Val Cys Ser Lys His Cys Ala Met Tyr His Ser Tyr Val
6095 6100 6105
Asn Ala Tyr Asn Thr Phe Thr Ser Ala Gly Phe Thr Ile Trp Val
6110 6115 6120
Pro Thr Ser Phe Asp Thr Tyr Asn Leu Trp Gln Thr Phe Ser Asn
6125 6130 6135
Asn Leu Gln Gly Leu Glu Asn Ile Ala Phe Asn Val Leu Lys Lys
6140 6145 6150
Gly Ser Phe Val Gly Asp Glu Gly Glu Leu Pro Val Ala Val Val
6155 6160 6165
Asn Asp Lys Val Leu Val Arg Asp Gly Thr Val Asp Thr Leu Val
6170 6175 6180
Phe Thr Asn Lys Thr Ser Leu Pro Thr Asn Val Ala Phe Glu Leu
6185 6190 6195
Tyr Ala Lys Arg Lys Val Gly Leu Thr Pro Pro Ile Thr Ile Leu
6200 6205 6210
Arg Asn Leu Gly Val Val Cys Thr Ser Lys Cys Val Ile Trp Asp
6215 6220 6225
Tyr Glu Ala Glu Arg Pro Leu Thr Thr Phe Thr Lys Asp Val Cys
6230 6235 6240
Lys Tyr Thr Asp Phe Glu Gly Asp Val Cys Thr Leu Phe Asp Asn
6245 6250 6255
Ser Ile Val Gly Ser Leu Glu Arg Phe Ser Met Thr Gln Asn Ala
6260 6265 6270
Val Leu Met Ser Leu Thr Ala Val Lys Lys Leu Thr Gly Ile Lys
6275 6280 6285
Leu Thr Tyr Gly Tyr Leu Asn Gly Val Pro Val Asn Thr His Glu
6290 6295 6300
Asp Lys Pro Phe Thr Trp Tyr Ile Tyr Thr Arg Lys Asn Gly Lys
6305 6310 6315
Phe Glu Asp Tyr Pro Asp Gly Tyr Phe Thr Gln Gly Arg Thr Thr
6320 6325 6330
Ala Asp Phe Ser Pro Arg Ser Asp Met Glu Lys Asp Phe Leu Ser
6335 6340 6345
Met Asp Met Gly Leu Phe Ile Asn Lys Tyr Gly Leu Glu Asp Tyr
6350 6355 6360
Gly Phe Glu His Val Val Tyr Gly Asp Val Ser Lys Thr Thr Leu
6365 6370 6375
Gly Gly Leu His Leu Leu Ile Ser Gln Val Arg Leu Ala Cys Met
6380 6385 6390
Gly Val Leu Lys Ile Asp Glu Phe Val Ser Ser Asn Asp Ser Thr
6395 6400 6405
Leu Lys Ser Cys Thr Val Thr Tyr Ala Asp Asn Pro Ser Ser Lys
6410 6415 6420
Met Val Cys Thr Tyr Met Asp Leu Leu Leu Asp Asp Phe Val Ser
6425 6430 6435
Ile Leu Lys Ser Leu Asp Leu Ser Val Val Ser Lys Val His Glu
6440 6445 6450
Val Met Val Asp Cys Lys Met Trp Arg Trp Met Leu Trp Cys Lys
6455 6460 6465
Asp His Lys Leu Gln Thr Phe Tyr Pro Gln Leu Gln Ala Ser Glu
6470 6475 6480
Trp Lys Cys Gly Tyr Ser Met Pro Ser Ile Tyr Lys Ile Gln Arg
6485 6490 6495
Met Cys Leu Glu Pro Cys Asn Leu Tyr Asn Tyr Gly Ala Gly Val
6500 6505 6510
Lys Leu Pro Asp Gly Ile Met Phe Asn Val Val Lys Tyr Thr Gln
6515 6520 6525
Leu Cys Gln Tyr Leu Asn Ser Thr Thr Met Cys Val Pro His His
6530 6535 6540
Met Arg Val Leu His Leu Gly Ala Gly Ser Asp Lys Gly Val Ala
6545 6550 6555
Pro Gly Thr Ala Val Leu Arg Arg Trp Leu Pro Leu Asp Ala Ile
6560 6565 6570
Ile Val Asp Asn Asp Ser Val Asp Tyr Val Ser Asp Ala Asp Tyr
6575 6580 6585
Ser Val Thr Gly Asp Cys Ser Thr Leu Tyr Leu Ser Asp Lys Phe
6590 6595 6600
Asp Leu Val Ile Ser Asp Met Tyr Asp Gly Lys Ile Lys Ser Cys
6605 6610 6615
Asp Gly Glu Asn Val Ser Lys Glu Gly Phe Phe Pro Tyr Ile Asn
6620 6625 6630
Gly Val Ile Thr Glu Lys Leu Ala Leu Gly Gly Thr Val Ala Ile
6635 6640 6645
Lys Val Thr Glu Phe Ser Trp Asn Lys Lys Leu Tyr Glu Leu Ile
6650 6655 6660
Gln Lys Phe Glu Tyr Trp Thr Met Phe Cys Thr Ser Val Asn Thr
6665 6670 6675
Ser Ser Ser Glu Ala Phe Leu Ile Gly Val His Tyr Leu Gly Asp
6680 6685 6690
Phe Ala Ser Gly Ala Val Ile Asp Gly Asn Thr Met His Ala Asn
6695 6700 6705
Tyr Ile Phe Trp Arg Asn Ser Thr Ile Met Thr Met Ser Tyr Asn
6710 6715 6720
Ser Val Leu Asp Leu Ser Lys Phe Asn Cys Lys His Lys Ala Thr
6725 6730 6735
Val Val Val Asn Leu Lys Asp Ser Ser Ile Ser Asp Val Val Leu
6740 6745 6750
Gly Leu Leu Lys Asn Gly Lys Leu Leu Val Arg Asn Asn Asp Ala
6755 6760 6765
Ile Cys Gly Phe Ser Asn His Leu Val Asn Val Asn Lys
6770 6775 6780
<210> 27
<211> 7073
<212> PRT
<213> human SARS virus
<220>
<221> MISC_FEATURE
<223> ORF 1AB
<400> 27
Met Glu Ser Leu Val Leu Gly Val Asn Glu Lys Thr His Val Gln Leu
1 5 10 15
Ser Leu Pro Val Leu Gln Val Arg Asp Val Leu Val Arg Gly Phe Gly
20 25 30
Asp Ser Val Glu Glu Ala Leu Ser Glu Ala Arg Glu His Leu Lys Asn
35 40 45
Gly Thr Cys Gly Leu Val Glu Leu Glu Lys Gly Val Leu Pro Gln Leu
50 55 60
Glu Gln Pro Tyr Val Phe Ile Lys Arg Ser Asp Ala Leu Ser Thr Asn
65 70 75 80
His Gly His Lys Val Val Glu Leu Val Ala Glu Met Asp Gly Ile Gln
85 90 95
Tyr Gly Arg Ser Gly Ile Thr Leu Gly Val Leu Val Pro His Val Gly
100 105 110
Glu Thr Pro Ile Ala Tyr Arg Asn Val Leu Leu Arg Lys Asn Gly Asn
115 120 125
Lys Gly Ala Gly Gly His Ser Tyr Gly Ile Asp Leu Lys Ser Tyr Asp
130 135 140
Leu Gly Asp Glu Leu Gly Thr Asp Pro Ile Glu Asp Tyr Glu Gln Asn
145 150 155 160
Trp Asn Thr Lys His Gly Ser Gly Ala Leu Arg Glu Leu Thr Arg Glu
165 170 175
Leu Asn Gly Gly Ala Val Thr Arg Tyr Val Asp Asn Asn Phe Cys Gly
180 185 190
Pro Asp Gly Tyr Pro Leu Asp Cys Ile Lys Asp Phe Leu Ala Arg Ala
195 200 205
Gly Lys Ser Met Cys Thr Leu Ser Glu Gln Leu Asp Tyr Ile Glu Ser
210 215 220
Lys Arg Gly Val Tyr Cys Cys Arg Asp His Glu His Glu Ile Ala Trp
225 230 235 240
Phe Thr Glu Arg Ser Asp Lys Ser Tyr Glu His Gln Thr Pro Phe Glu
245 250 255
Ile Lys Ser Ala Lys Lys Phe Asp Thr Phe Lys Gly Glu Cys Pro Lys
260 265 270
Phe Val Phe Pro Leu Asn Ser Lys Val Lys Val Ile Gln Pro Arg Val
275 280 285
Glu Lys Lys Lys Thr Glu Gly Phe Met Gly Arg Ile Arg Ser Val Tyr
290 295 300
Pro Val Ala Ser Pro Gln Glu Cys Asn Asn Met His Leu Ser Thr Leu
305 310 315 320
Met Lys Cys Asn His Cys Asp Glu Val Ser Trp Gln Thr Cys Asp Phe
325 330 335
Leu Lys Ala Thr Cys Glu His Cys Gly Thr Glu Asn Leu Val Ile Glu
340 345 350
Gly Pro Thr Thr Cys Gly Tyr Leu Pro Thr Asn Ala Val Val Lys Met
355 360 365
Pro Cys Pro Ala Cys Gln Asp Pro Glu Ile Gly Pro Glu His Ser Val
370 375 380
Ala Asp Tyr His Asn His Ser Asn Ile Glu Thr Arg Leu Arg Lys Gly
385 390 395 400
Gly Arg Thr Arg Cys Phe Gly Gly Cys Val Phe Ala Tyr Val Gly Cys
405 410 415
Tyr Asn Lys Arg Ala Tyr Trp Val Pro Arg Ala Ser Ala Asp Ile Gly
420 425 430
Ser Gly His Thr Gly Ile Thr Gly Asp Asn Val Glu Thr Leu Asn Glu
435 440 445
Asp Leu Leu Glu Ile Leu Ser Arg Glu Arg Val Asn Ile Asn Ile Val
450 455 460
Gly Asp Phe His Leu Asn Glu Glu Val Ala Ile Ile Leu Ala Ser Phe
465 470 475 480
Ser Ala Ser Thr Ser Ala Phe Ile Asp Thr Ile Lys Ser Leu Asp Tyr
485 490 495
Lys Ser Phe Lys Thr Ile Val Glu Ser Cys Gly Asn Tyr Lys Val Thr
500 505 510
Lys Gly Lys Pro Val Lys Gly Ala Trp Asn Ile Gly Gln Gln Arg Ser
515 520 525
Val Leu Thr Pro Leu Cys Gly Phe Pro Ser Gln Ala Ala Gly Val Ile
530 535 540
Arg Ser Ile Phe Ala Arg Thr Leu Asp Ala Ala Asn His Ser Ile Pro
545 550 555 560
Asp Leu Gln Arg Ala Ala Val Thr Ile Leu Asp Gly Ile Ser Glu Gln
565 570 575
Ser Leu Arg Leu Val Asp Ala Met Val Tyr Thr Ser Asp Leu Leu Thr
580 585 590
Asn Ser Val Ile Ile Met Ala Tyr Val Thr Gly Gly Leu Val Gln Gln
595 600 605
Thr Ser Gln Trp Leu Ser Asn Leu Leu Gly Thr Thr Val Glu Lys Leu
610 615 620
Arg Pro Ile Phe Glu Trp Ile Glu Ala Lys Leu Ser Ala Gly Val Glu
625 630 635 640
Phe Leu Lys Asp Ala Trp Glu Ile Leu Lys Phe Leu Ile Thr Gly Val
645 650 655
Phe Asp Ile Val Lys Gly Gln Ile Gln Val Ala Ser Asp Asn Ile Lys
660 665 670
Asp Cys Val Lys Cys Phe Ile Asp Val Val Asn Lys Ala Leu Glu Met
675 680 685
Cys Ile Asp Gln Val Thr Ile Ala Gly Ala Lys Leu Arg Ser Leu Asn
690 695 700
Leu Gly Glu Val Phe Ile Ala Gln Ser Lys Gly Leu Tyr Arg Gln Cys
705 710 715 720
Ile Arg Gly Lys Glu Gln Leu Gln Leu Leu Met Pro Leu Lys Ala Pro
725 730 735
Lys Glu Val Thr Phe Leu Glu Gly Asp Ser His Asp Thr Val Leu Thr
740 745 750
Ser Glu Glu Val Val Leu Lys Asn Gly Glu Leu Glu Ala Leu Glu Thr
755 760 765
Pro Val Asp Ser Phe Thr Asn Gly Ala Ile Val Gly Thr Pro Val Cys
770 775 780
Val Asn Gly Leu Met Leu Leu Glu Ile Lys Asp Lys Glu Gln Tyr Cys
785 790 795 800
Ala Leu Ser Pro Gly Leu Leu Ala Thr Asn Asn Val Phe Arg Leu Lys
805 810 815
Gly Gly Ala Pro Ile Lys Gly Val Thr Phe Gly Glu Asp Thr Val Trp
820 825 830
Glu Val Gln Gly Tyr Lys Asn Val Arg Ile Thr Phe Glu Leu Asp Glu
835 840 845
Arg Val Asp Lys Val Leu Asn Glu Lys Cys Ser Val Tyr Thr Val Glu
850 855 860
Ser Gly Thr Glu Val Thr Glu Phe Ala Cys Val Val Ala Glu Ala Val
865 870 875 880
Val Lys Thr Leu Gln Pro Val Ser Asp Leu Leu Thr Asn Met Gly Ile
885 890 895
Asp Leu Asp Glu Trp Ser Val Ala Thr Phe Tyr Leu Phe Asp Asp Ala
900 905 910
Gly Glu Glu Asn Phe Ser Ser Arg Met Tyr Cys Ser Phe Tyr Pro Pro
915 920 925
Asp Glu Glu Glu Glu Asp Asp Ala Glu Cys Glu Glu Glu Glu Ile Asp
930 935 940
Glu Thr Cys Glu His Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Leu
945 950 955 960
Pro Leu Glu Phe Gly Ala Ser Ala Glu Thr Val Arg Val Glu Glu Glu
965 970 975
Glu Glu Glu Asp Trp Leu Asp Asp Thr Thr Glu Gln Ser Glu Ile Glu
980 985 990
Pro Glu Pro Glu Pro Thr Pro Glu Glu Pro Val Asn Gln Phe Thr Gly
995 1000 1005
Tyr Leu Lys Leu Thr Asp Asn Val Ala Ile Lys Cys Val Asp Ile
1010 1015 1020
Val Lys Glu Ala Gln Ser Ala Asn Pro Met Val Ile Val Asn Ala
1025 1030 1035
Ala Asn Ile His Leu Lys His Gly Gly Gly Val Ala Gly Ala Leu
1040 1045 1050
Asn Lys Ala Thr Asn Gly Ala Met Gln Lys Glu Ser Asp Asp Tyr
1055 1060 1065
Ile Lys Leu Asn Gly Pro Leu Thr Val Gly Gly Ser Cys Leu Leu
1070 1075 1080
Ser Gly His Asn Leu Ala Lys Lys Cys Leu His Val Val Gly Pro
1085 1090 1095
Asn Leu Asn Ala Gly Glu Asp Ile Gln Leu Leu Lys Ala Ala Tyr
1100 1105 1110
Glu Asn Phe Asn Ser Gln Asp Ile Leu Leu Ala Pro Leu Leu Ser
1115 1120 1125
Ala Gly Ile Phe Gly Ala Lys Pro Leu Gln Ser Leu Gln Val Cys
1130 1135 1140
Val Gln Thr Val Arg Thr Gln Val Tyr Ile Ala Val Asn Asp Lys
1145 1150 1155
Ala Leu Tyr Glu Gln Val Val Met Asp Tyr Leu Asp Asn Leu Lys
1160 1165 1170
Pro Arg Val Glu Ala Pro Lys Gln Glu Glu Pro Pro Asn Thr Glu
1175 1180 1185
Asp Ser Lys Thr Glu Glu Lys Ser Val Val Gln Lys Pro Val Asp
1190 1195 1200
Val Lys Pro Lys Ile Lys Ala Cys Ile Asp Glu Val Thr Thr Thr
1205 1210 1215
Leu Glu Glu Thr Lys Phe Leu Thr Asn Lys Leu Leu Leu Phe Ala
1220 1225 1230
Asp Ile Asn Gly Lys Leu Tyr His Asp Ser Gln Asn Met Leu Arg
1235 1240 1245
Gly Glu Asp Met Ser Phe Leu Glu Lys Asp Ala Pro Tyr Met Val
1250 1255 1260
Gly Asp Val Ile Thr Ser Gly Asp Ile Thr Cys Val Val Ile Pro
1265 1270 1275
Ser Lys Lys Ala Gly Gly Thr Thr Glu Met Leu Ser Arg Ala Leu
1280 1285 1290
Lys Lys Val Pro Val Asp Glu Tyr Ile Thr Thr Tyr Pro Gly Gln
1295 1300 1305
Gly Cys Ala Gly Tyr Thr Leu Glu Glu Ala Lys Thr Ala Leu Lys
1310 1315 1320
Lys Cys Lys Ser Ala Phe Tyr Val Leu Pro Ser Glu Ala Pro Asn
1325 1330 1335
Ala Lys Glu Glu Ile Leu Gly Thr Val Ser Trp Asn Leu Arg Glu
1340 1345 1350
Met Leu Ala His Ala Glu Glu Thr Arg Lys Leu Met Pro Ile Cys
1355 1360 1365
Met Asp Val Arg Ala Ile Met Ala Thr Ile Gln Arg Lys Tyr Lys
1370 1375 1380
Gly Ile Lys Ile Gln Glu Gly Ile Val Asp Tyr Gly Val Arg Phe
1385 1390 1395
Phe Phe Tyr Thr Ser Lys Glu Pro Val Ala Ser Ile Ile Thr Lys
1400 1405 1410
Leu Asn Ser Leu Asn Glu Pro Leu Val Thr Met Pro Ile Gly Tyr
1415 1420 1425
Val Thr His Gly Phe Asn Leu Glu Glu Ala Ala Arg Cys Met Arg
1430 1435 1440
Ser Leu Lys Ala Pro Ala Val Val Ser Val Ser Ser Pro Asp Ala
1445 1450 1455
Val Thr Thr Tyr Asn Gly Tyr Leu Thr Ser Ser Ser Lys Thr Ser
1460 1465 1470
Glu Glu His Phe Val Glu Thr Val Ser Leu Ala Gly Ser Tyr Arg
1475 1480 1485
Asp Trp Ser Tyr Ser Gly Gln Arg Thr Glu Leu Gly Val Glu Phe
1490 1495 1500
Leu Lys Arg Gly Asp Lys Ile Val Tyr His Thr Leu Glu Ser Pro
1505 1510 1515
Val Glu Phe His Leu Asp Gly Glu Val Leu Ser Leu Asp Lys Leu
1520 1525 1530
Lys Ser Leu Leu Ser Leu Arg Glu Val Lys Thr Ile Lys Val Phe
1535 1540 1545
Thr Thr Val Asp Asn Thr Asn Leu His Thr Gln Leu Val Asp Met
1550 1555 1560
Ser Met Thr Tyr Gly Gln Gln Phe Gly Pro Thr Tyr Leu Asp Gly
1565 1570 1575
Ala Asp Val Thr Lys Ile Lys Pro His Val Asn His Glu Gly Lys
1580 1585 1590
Thr Phe Phe Val Leu Pro Ser Asp Asp Thr Leu Arg Ser Glu Ala
1595 1600 1605
Phe Glu Tyr Tyr His Thr Leu Asp Glu Ser Phe Leu Gly Arg Tyr
1610 1615 1620
Met Ser Ala Leu Asn His Thr Lys Lys Trp Lys Phe Pro Gln Val
1625 1630 1635
Gly Gly Leu Thr Ser Ile Lys Trp Ala Asp Asn Asn Cys Tyr Leu
1640 1645 1650
Ser Ser Val Leu Leu Ala Leu Gln Gln Leu Glu Val Lys Phe Asn
1655 1660 1665
Ala Pro Ala Leu Gln Glu Ala Tyr Tyr Arg Ala Arg Ala Gly Asp
1670 1675 1680
Ala Ala Asn Phe Cys Ala Leu Ile Leu Ala Tyr Ser Asn Lys Thr
1685 1690 1695
Val Gly Glu Leu Gly Asp Val Arg Glu Thr Met Thr His Leu Leu
1700 1705 1710
Gln His Ala Asn Leu Glu Ser Ala Lys Arg Val Leu Asn Val Val
1715 1720 1725
Cys Lys His Cys Gly Gln Lys Thr Thr Thr Leu Thr Gly Val Glu
1730 1735 1740
Ala Val Met Tyr Met Gly Thr Leu Ser Tyr Asp Asn Leu Lys Thr
1745 1750 1755
Gly Val Ser Ile Pro Cys Val Cys Gly Arg Asp Ala Thr Gln Tyr
1760 1765 1770
Leu Val Gln Gln Glu Ser Ser Phe Val Met Met Ser Ala Pro Pro
1775 1780 1785
Ala Glu Tyr Lys Leu Gln Gln Gly Thr Phe Leu Cys Ala Asn Glu
1790 1795 1800
Tyr Thr Gly Asn Tyr Gln Cys Gly His Tyr Thr His Ile Thr Ala
1805 1810 1815
Lys Glu Thr Leu Tyr Arg Ile Asp Gly Ala His Leu Thr Lys Met
1820 1825 1830
Ser Glu Tyr Lys Gly Pro Val Thr Asp Val Phe Tyr Lys Glu Thr
1835 1840 1845
Ser Tyr Thr Thr Thr Ile Lys Pro Val Ser Tyr Lys Leu Asp Gly
1850 1855 1860
Val Thr Tyr Thr Glu Ile Glu Pro Lys Leu Asp Gly Tyr Tyr Lys
1865 1870 1875
Lys Asp Asn Ala Tyr Tyr Thr Glu Gln Pro Ile Asp Leu Val Pro
1880 1885 1890
Thr Gln Pro Leu Pro Asn Ala Ser Phe Asp Asn Phe Lys Leu Thr
1895 1900 1905
Cys Ser Asn Thr Lys Phe Ala Asp Asp Leu Asn Gln Met Thr Gly
1910 1915 1920
Phe Thr Lys Pro Ala Ser Arg Glu Leu Ser Val Thr Phe Phe Pro
1925 1930 1935
Asp Leu Asn Gly Asp Val Val Ala Ile Asp Tyr Arg His Tyr Ser
1940 1945 1950
Ala Ser Phe Lys Lys Gly Ala Lys Leu Leu His Lys Pro Ile Val
1955 1960 1965
Trp His Ile Asn Gln Ala Thr Thr Lys Thr Thr Phe Lys Pro Asn
1970 1975 1980
Thr Trp Cys Leu Arg Cys Leu Trp Ser Thr Lys Pro Val Asp Thr
1985 1990 1995
Ser Asn Ser Phe Glu Val Leu Ala Val Glu Asp Thr Gln Gly Met
2000 2005 2010
Asp Asn Leu Ala Cys Glu Ser Gln Gln Pro Thr Ser Glu Glu Val
2015 2020 2025
Val Glu Asn Pro Thr Ile Gln Lys Glu Val Ile Glu Cys Asp Val
2030 2035 2040
Lys Thr Thr Glu Val Val Gly Asn Val Ile Leu Lys Pro Ser Asp
2045 2050 2055
Glu Gly Val Lys Val Thr Gln Glu Leu Gly His Glu Asp Leu Met
2060 2065 2070
Ala Ala Tyr Val Glu Asn Thr Ser Ile Thr Ile Lys Lys Pro Asn
2075 2080 2085
Glu Leu Ser Leu Ala Leu Gly Leu Lys Thr Ile Ala Thr His Gly
2090 2095 2100
Ile Ala Ala Ile Asn Ser Val Pro Trp Ser Lys Ile Leu Ala Tyr
2105 2110 2115
Val Lys Pro Phe Leu Gly Gln Ala Ala Ile Thr Thr Ser Asn Cys
2120 2125 2130
Ala Lys Arg Leu Ala Gln Arg Val Phe Asn Asn Tyr Met Pro Tyr
2135 2140 2145
Val Phe Thr Leu Leu Phe Gln Leu Cys Thr Phe Thr Lys Ser Thr
2150 2155 2160
Asn Ser Arg Ile Arg Ala Ser Leu Pro Thr Thr Ile Ala Lys Asn
2165 2170 2175
Ser Val Lys Ser Val Ala Lys Leu Cys Leu Asp Ala Gly Ile Asn
2180 2185 2190
Tyr Val Lys Ser Pro Lys Phe Ser Lys Leu Phe Thr Ile Ala Met
2195 2200 2205
Trp Leu Leu Leu Leu Ser Ile Cys Leu Gly Ser Leu Ile Cys Val
2210 2215 2220
Thr Ala Ala Phe Gly Val Leu Leu Ser Asn Phe Gly Ala Pro Ser
2225 2230 2235
Tyr Cys Asn Gly Val Arg Glu Leu Tyr Leu Asn Ser Ser Asn Val
2240 2245 2250
Thr Thr Met Asp Phe Cys Glu Gly Ser Phe Pro Cys Ser Ile Cys
2255 2260 2265
Leu Ser Gly Leu Asp Ser Leu Asp Ser Tyr Pro Ala Leu Glu Thr
2270 2275 2280
Ile Gln Val Thr Ile Ser Ser Tyr Lys Leu Asp Leu Thr Ile Leu
2285 2290 2295
Gly Leu Ala Ala Glu Trp Val Leu Ala Tyr Met Leu Phe Thr Lys
2300 2305 2310
Phe Phe Tyr Leu Leu Gly Leu Ser Ala Ile Met Gln Val Phe Phe
2315 2320 2325
Gly Tyr Phe Ala Ser His Phe Ile Ser Asn Ser Trp Leu Met Trp
2330 2335 2340
Phe Ile Ile Ser Ile Val Gln Met Ala Pro Val Ser Ala Met Val
2345 2350 2355
Arg Met Tyr Ile Phe Phe Ala Ser Phe Tyr Tyr Ile Trp Lys Ser
2360 2365 2370
Tyr Val His Ile Met Asp Gly Cys Thr Ser Ser Thr Cys Met Met
2375 2380 2385
Cys Tyr Lys Arg Asn Arg Ala Thr Arg Val Glu Cys Thr Thr Ile
2390 2395 2400
Val Asn Gly Met Lys Arg Ser Phe Tyr Val Tyr Ala Asn Gly Gly
2405 2410 2415
Arg Gly Phe Cys Lys Thr His Asn Trp Asn Cys Leu Asn Cys Asp
2420 2425 2430
Thr Phe Cys Thr Gly Ser Thr Phe Ile Ser Asp Glu Val Ala Arg
2435 2440 2445
Asp Leu Ser Leu Gln Phe Lys Arg Pro Ile Asn Pro Thr Asp Gln
2450 2455 2460
Ser Ser Tyr Ile Val Asp Ser Val Ala Val Lys Asn Gly Ala Leu
2465 2470 2475
His Leu Tyr Phe Asp Lys Ala Gly Gln Lys Thr Tyr Glu Arg His
2480 2485 2490
Pro Leu Ser His Phe Val Asn Leu Asp Asn Leu Arg Ala Asn Asn
2495 2500 2505
Thr Lys Gly Ser Leu Pro Ile Asn Val Ile Val Phe Asp Gly Lys
2510 2515 2520
Ser Lys Cys Asp Glu Ser Ala Ser Lys Ser Ala Ser Val Tyr Tyr
2525 2530 2535
Ser Gln Leu Met Cys Gln Pro Ile Leu Leu Leu Asp Gln Val Leu
2540 2545 2550
Val Ser Asp Val Gly Asp Ser Thr Glu Val Ser Val Lys Met Phe
2555 2560 2565
Asp Ala Tyr Val Asp Thr Phe Ser Ala Thr Phe Ser Val Pro Met
2570 2575 2580
Glu Lys Leu Lys Ala Leu Val Ala Thr Ala His Ser Glu Leu Ala
2585 2590 2595
Lys Gly Val Ala Leu Asp Gly Val Leu Ser Thr Phe Val Ser Ala
2600 2605 2610
Ala Arg Gln Gly Val Val Asp Thr Asp Val Asp Thr Lys Asp Val
2615 2620 2625
Ile Glu Cys Leu Lys Leu Ser His His Ser Asp Leu Glu Val Thr
2630 2635 2640
Gly Asp Ser Cys Asn Asn Phe Met Leu Thr Tyr Asn Lys Val Glu
2645 2650 2655
Asn Met Thr Pro Arg Asp Leu Gly Ala Cys Ile Asp Cys Asn Ala
2660 2665 2670
Arg His Ile Asn Ala Gln Val Ala Lys Ser His Asn Val Ser Leu
2675 2680 2685
Ile Trp Asn Val Lys Asp Tyr Met Ser Leu Ser Glu Gln Leu Arg
2690 2695 2700
Lys Gln Ile Arg Ser Ala Ala Lys Lys Asn Asn Ile Pro Phe Arg
2705 2710 2715
Leu Thr Cys Ala Thr Thr Arg Gln Val Val Asn Val Ile Thr Thr
2720 2725 2730
Lys Ile Ser Leu Lys Gly Gly Lys Ile Val Ser Thr Cys Phe Lys
2735 2740 2745
Leu Met Leu Lys Ala Thr Leu Leu Cys Val Leu Ala Ala Leu Val
2750 2755 2760
Cys Tyr Ile Val Met Pro Val His Thr Leu Ser Ile His Asp Gly
2765 2770 2775
Tyr Thr Asn Glu Ile Ile Gly Tyr Lys Ala Ile Gln Asp Gly Val
2780 2785 2790
Thr Arg Asp Ile Ile Ser Thr Asp Asp Cys Phe Ala Asn Lys His
2795 2800 2805
Ala Gly Phe Asp Ala Trp Phe Ser Gln Arg Gly Gly Ser Tyr Lys
2810 2815 2820
Asn Asp Lys Ser Cys Pro Val Val Ala Ala Ile Ile Thr Arg Glu
2825 2830 2835
Ile Gly Phe Ile Val Pro Gly Leu Pro Gly Thr Val Leu Arg Ala
2840 2845 2850
Ile Asn Gly Asp Phe Leu His Phe Leu Pro Arg Val Phe Ser Ala
2855 2860 2865
Val Gly Asn Ile Cys Tyr Thr Pro Ser Lys Leu Ile Glu Tyr Ser
2870 2875 2880
Asp Phe Ala Thr Ser Ala Cys Val Leu Ala Ala Glu Cys Thr Ile
2885 2890 2895
Phe Lys Asp Ala Met Gly Lys Pro Val Pro Tyr Cys Tyr Asp Thr
2900 2905 2910
Asn Leu Leu Glu Gly Ser Ile Ser Tyr Ser Glu Leu Arg Pro Asp
2915 2920 2925
Thr Arg Tyr Val Leu Met Asp Gly Ser Ile Ile Gln Phe Pro Asn
2930 2935 2940
Thr Tyr Leu Glu Gly Ser Val Arg Val Val Thr Thr Phe Asp Ala
2945 2950 2955
Glu Tyr Cys Arg His Gly Thr Cys Glu Arg Ser Glu Val Gly Ile
2960 2965 2970
Cys Leu Ser Thr Ser Gly Arg Trp Val Leu Asn Asn Glu His Tyr
2975 2980 2985
Arg Ala Leu Ser Gly Val Phe Cys Gly Val Asp Ala Met Asn Leu
2990 2995 3000
Ile Ala Asn Ile Phe Thr Pro Leu Val Gln Pro Val Gly Ala Leu
3005 3010 3015
Asp Val Ser Ala Ser Val Val Ala Gly Gly Ile Ile Ala Ile Leu
3020 3025 3030
Val Thr Cys Ala Ala Tyr Tyr Phe Met Lys Phe Arg Arg Val Phe
3035 3040 3045
Gly Glu Tyr Asn His Val Val Ala Ala Asn Ala Leu Leu Phe Leu
3050 3055 3060
Met Ser Phe Thr Ile Leu Cys Leu Val Pro Ala Tyr Ser Phe Leu
3065 3070 3075
Pro Gly Val Tyr Ser Val Phe Tyr Leu Tyr Leu Thr Phe Tyr Phe
3080 3085 3090
Thr Asn Asp Val Ser Phe Leu Ala His Leu Gln Trp Phe Ala Met
3095 3100 3105
Phe Ser Pro Ile Val Pro Phe Trp Ile Thr Ala Ile Tyr Val Phe
3110 3115 3120
Cys Ile Ser Leu Lys His Cys His Trp Phe Phe Asn Asn Tyr Leu
3125 3130 3135
Arg Lys Arg Val Met Phe Asn Gly Val Thr Phe Ser Thr Phe Glu
3140 3145 3150
Glu Ala Ala Leu Cys Thr Phe Leu Leu Asn Lys Glu Met Tyr Leu
3155 3160 3165
Lys Leu Arg Ser Glu Thr Leu Leu Pro Leu Thr Gln Tyr Asn Arg
3170 3175 3180
Tyr Leu Ala Leu Tyr Asn Lys Tyr Lys Tyr Phe Ser Gly Ala Leu
3185 3190 3195
Asp Thr Thr Ser Tyr Arg Glu Ala Ala Cys Cys His Leu Ala Lys
3200 3205 3210
Ala Leu Asn Asp Phe Ser Asn Ser Gly Ala Asp Val Leu Tyr Gln
3215 3220 3225
Pro Pro Gln Thr Ser Ile Thr Ser Ala Val Leu Gln Ser Gly Phe
3230 3235 3240
Arg Lys Met Ala Phe Pro Ser Gly Lys Val Glu Gly Cys Met Val
3245 3250 3255
Gln Val Thr Cys Gly Thr Thr Thr Leu Asn Gly Leu Trp Leu Asp
3260 3265 3270
Asp Thr Val Tyr Cys Pro Arg His Val Ile Cys Thr Ala Glu Asp
3275 3280 3285
Met Leu Asn Pro Asn Tyr Glu Asp Leu Leu Ile Arg Lys Ser Asn
3290 3295 3300
His Ser Phe Leu Val Gln Ala Gly Asn Val Gln Leu Arg Val Ile
3305 3310 3315
Gly His Ser Met Gln Asn Cys Leu Leu Arg Leu Lys Val Asp Thr
3320 3325 3330
Ser Asn Pro Lys Thr Pro Lys Tyr Lys Phe Val Arg Ile Gln Pro
3335 3340 3345
Gly Gln Thr Phe Ser Val Leu Ala Cys Tyr Asn Gly Ser Pro Ser
3350 3355 3360
Gly Val Tyr Gln Cys Ala Met Arg Pro Asn His Thr Ile Lys Gly
3365 3370 3375
Ser Phe Leu Asn Gly Ser Cys Gly Ser Val Gly Phe Asn Ile Asp
3380 3385 3390
Tyr Asp Cys Val Ser Phe Cys Tyr Met His His Met Glu Leu Pro
3395 3400 3405
Thr Gly Val His Ala Gly Thr Asp Leu Glu Gly Lys Phe Tyr Gly
3410 3415 3420
Pro Phe Val Asp Arg Gln Thr Ala Gln Ala Ala Gly Thr Asp Thr
3425 3430 3435
Thr Ile Thr Leu Asn Val Leu Ala Trp Leu Tyr Ala Ala Val Ile
3440 3445 3450
Asn Gly Asp Arg Trp Phe Leu Asn Arg Phe Thr Thr Thr Leu Asn
3455 3460 3465
Asp Phe Asn Leu Val Ala Met Lys Tyr Asn Tyr Glu Pro Leu Thr
3470 3475 3480
Gln Asp His Val Asp Ile Leu Gly Pro Leu Ser Ala Gln Thr Gly
3485 3490 3495
Ile Ala Val Leu Asp Met Cys Ala Ala Leu Lys Glu Leu Leu Gln
3500 3505 3510
Asn Gly Met Asn Gly Arg Thr Ile Leu Gly Ser Thr Ile Leu Glu
3515 3520 3525
Asp Glu Phe Thr Pro Phe Asp Val Val Arg Gln Cys Ser Gly Val
3530 3535 3540
Thr Phe Gln Gly Lys Phe Lys Lys Ile Val Lys Gly Thr His His
3545 3550 3555
Trp Met Leu Leu Thr Phe Leu Thr Ser Leu Leu Ile Leu Val Gln
3560 3565 3570
Ser Thr Gln Trp Ser Leu Phe Phe Phe Val Tyr Glu Asn Ala Phe
3575 3580 3585
Leu Pro Phe Thr Leu Gly Ile Met Ala Ile Ala Ala Cys Ala Met
3590 3595 3600
Leu Leu Val Lys His Lys His Ala Phe Leu Cys Leu Phe Leu Leu
3605 3610 3615
Pro Ser Leu Ala Thr Val Ala Tyr Phe Asn Met Val Tyr Met Pro
3620 3625 3630
Ala Ser Trp Val Met Arg Ile Met Thr Trp Leu Glu Leu Ala Asp
3635 3640 3645
Thr Ser Leu Ser Gly Tyr Arg Leu Lys Asp Cys Val Met Tyr Ala
3650 3655 3660
Ser Ala Leu Val Leu Leu Ile Leu Met Thr Ala Arg Thr Val Tyr
3665 3670 3675
Asp Asp Ala Ala Arg Arg Val Trp Thr Leu Met Asn Val Ile Thr
3680 3685 3690
Leu Val Tyr Lys Val Tyr Tyr Gly Asn Ala Leu Asp Gln Ala Ile
3695 3700 3705
Ser Met Trp Ala Leu Val Ile Ser Val Thr Ser Asn Tyr Ser Gly
3710 3715 3720
Val Val Thr Thr Ile Met Phe Leu Ala Arg Ala Ile Val Phe Val
3725 3730 3735
Cys Val Glu Tyr Tyr Pro Leu Leu Phe Ile Thr Gly Asn Thr Leu
3740 3745 3750
Gln Cys Ile Met Leu Val Tyr Cys Phe Leu Gly Tyr Cys Cys Cys
3755 3760 3765
Cys Tyr Phe Gly Leu Phe Cys Leu Leu Asn Arg Tyr Phe Arg Leu
3770 3775 3780
Thr Leu Gly Val Tyr Asp Tyr Leu Val Ser Thr Gln Glu Phe Arg
3785 3790 3795
Tyr Met Asn Ser Gln Gly Leu Leu Pro Pro Lys Ser Ser Ile Asp
3800 3805 3810
Ala Phe Lys Leu Asn Ile Lys Leu Leu Gly Ile Gly Gly Lys Pro
3815 3820 3825
Cys Ile Lys Val Ala Thr Val Gln Ser Lys Met Ser Asp Val Lys
3830 3835 3840
Cys Thr Ser Val Val Leu Leu Ser Val Leu Gln Gln Leu Arg Val
3845 3850 3855
Glu Ser Ser Ser Lys Leu Trp Ala Gln Cys Val Gln Leu His Asn
3860 3865 3870
Asp Ile Leu Leu Ala Lys Asp Thr Thr Glu Ala Phe Glu Lys Met
3875 3880 3885
Val Ser Leu Leu Ser Val Leu Leu Ser Met Gln Gly Ala Val Asp
3890 3895 3900
Ile Asn Arg Leu Cys Glu Glu Met Leu Asp Asn Arg Ala Thr Leu
3905 3910 3915
Gln Ala Ile Ala Ser Glu Phe Ser Ser Leu Pro Ser Tyr Ala Ala
3920 3925 3930
Tyr Ala Thr Ala Gln Glu Ala Tyr Glu Gln Ala Val Ala Asn Gly
3935 3940 3945
Asp Ser Glu Val Val Leu Lys Lys Leu Lys Lys Ser Leu Asn Val
3950 3955 3960
Ala Lys Ser Glu Phe Asp Arg Asp Ala Ala Met Gln Arg Lys Leu
3965 3970 3975
Glu Lys Met Ala Asp Gln Ala Met Thr Gln Met Tyr Lys Gln Ala
3980 3985 3990
Arg Ser Glu Asp Lys Arg Ala Lys Val Thr Ser Ala Met Gln Thr
3995 4000 4005
Met Leu Phe Thr Met Leu Arg Lys Leu Asp Asn Asp Ala Leu Asn
4010 4015 4020
Asn Ile Ile Asn Asn Ala Arg Asp Gly Cys Val Pro Leu Asn Ile
4025 4030 4035
Ile Pro Leu Thr Thr Ala Ala Lys Leu Met Val Val Val Pro Asp
4040 4045 4050
Tyr Gly Thr Tyr Lys Asn Thr Cys Asp Gly Asn Thr Phe Thr Tyr
4055 4060 4065
Ala Ser Ala Leu Trp Glu Ile Gln Gln Val Val Asp Ala Asp Ser
4070 4075 4080
Lys Ile Val Gln Leu Ser Glu Ile Asn Met Asp Asn Ser Pro Asn
4085 4090 4095
Leu Ala Trp Pro Leu Ile Val Thr Ala Leu Arg Ala Asn Ser Ala
4100 4105 4110
Val Lys Leu Gln Asn Asn Glu Leu Ser Pro Val Ala Leu Arg Gln
4115 4120 4125
Met Ser Cys Ala Ala Gly Thr Thr Gln Thr Ala Cys Thr Asp Asp
4130 4135 4140
Asn Ala Leu Ala Tyr Tyr Asn Asn Ser Lys Gly Gly Arg Phe Val
4145 4150 4155
Leu Ala Leu Leu Ser Asp His Gln Asp Leu Lys Trp Ala Arg Phe
4160 4165 4170
Pro Lys Ser Asp Gly Thr Gly Thr Ile Tyr Thr Glu Leu Glu Pro
4175 4180 4185
Pro Cys Arg Phe Val Thr Asp Thr Pro Lys Gly Pro Lys Val Lys
4190 4195 4200
Tyr Leu Tyr Phe Ile Lys Gly Leu Asn Asn Leu Asn Arg Gly Met
4205 4210 4215
Val Leu Gly Ser Leu Ala Ala Thr Val Arg Leu Gln Ala Gly Asn
4220 4225 4230
Ala Thr Glu Val Pro Ala Asn Ser Thr Val Leu Ser Phe Cys Ala
4235 4240 4245
Phe Ala Val Asp Pro Ala Lys Ala Tyr Lys Asp Tyr Leu Ala Ser
4250 4255 4260
Gly Gly Gln Pro Ile Thr Asn Cys Val Lys Met Leu Cys Thr His
4265 4270 4275
Thr Gly Thr Gly Gln Ala Ile Thr Val Thr Pro Glu Ala Asn Met
4280 4285 4290
Asp Gln Glu Ser Phe Gly Gly Ala Ser Cys Cys Leu Tyr Cys Arg
4295 4300 4305
Cys His Ile Asp His Pro Asn Pro Lys Gly Phe Cys Asp Leu Lys
4310 4315 4320
Gly Lys Tyr Val Gln Ile Pro Thr Thr Cys Ala Asn Asp Pro Val
4325 4330 4335
Gly Phe Thr Leu Arg Asn Thr Val Cys Thr Val Cys Gly Met Trp
4340 4345 4350
Lys Gly Tyr Gly Cys Ser Cys Asp Gln Leu Arg Glu Pro Leu Met
4355 4360 4365
Gln Ser Ala Asp Ala Ser Thr Phe Leu Asn Arg Val Cys Gly Val
4370 4375 4380
Ser Ala Ala Arg Leu Thr Pro Cys Gly Thr Gly Thr Ser Thr Asp
4385 4390 4395
Val Val Tyr Arg Ala Phe Asp Ile Tyr Asn Glu Lys Val Ala Gly
4400 4405 4410
Phe Ala Lys Phe Leu Lys Thr Asn Cys Cys Arg Phe Gln Glu Lys
4415 4420 4425
Asp Glu Glu Gly Asn Leu Leu Asp Ser Tyr Phe Val Val Lys Arg
4430 4435 4440
His Thr Met Ser Asn Tyr Gln His Glu Glu Thr Ile Tyr Asn Leu
4445 4450 4455
Val Lys Asp Cys Pro Ala Val Ala Val His Asp Phe Phe Lys Phe
4460 4465 4470
Arg Val Asp Gly Asp Met Val Pro His Ile Ser Arg Gln Arg Leu
4475 4480 4485
Thr Lys Tyr Thr Met Ala Asp Leu Val Tyr Ala Leu Arg His Phe
4490 4495 4500
Asp Glu Gly Asn Cys Asp Thr Leu Lys Glu Ile Leu Val Thr Tyr
4505 4510 4515
Asn Cys Cys Asp Asp Asp Tyr Phe Asn Lys Lys Asp Trp Tyr Asp
4520 4525 4530
Phe Val Glu Asn Pro Asp Ile Leu Arg Val Tyr Ala Asn Leu Gly
4535 4540 4545
Glu Arg Val Arg Gln Ser Leu Leu Lys Thr Val Gln Phe Cys Asp
4550 4555 4560
Ala Met Arg Asp Ala Gly Ile Val Gly Val Leu Thr Leu Asp Asn
4565 4570 4575
Gln Asp Leu Asn Gly Asn Trp Tyr Asp Phe Gly Asp Phe Val Gln
4580 4585 4590
Val Ala Pro Gly Cys Gly Val Pro Ile Val Asp Ser Tyr Tyr Ser
4595 4600 4605
Leu Leu Met Pro Ile Leu Thr Leu Thr Arg Ala Leu Ala Ala Glu
4610 4615 4620
Ser His Met Asp Ala Asp Leu Ala Lys Pro Leu Ile Lys Trp Asp
4625 4630 4635
Leu Leu Lys Tyr Asp Phe Thr Glu Glu Arg Leu Cys Leu Phe Asp
4640 4645 4650
Arg Tyr Phe Lys Tyr Trp Asp Gln Thr Tyr His Pro Asn Cys Ile
4655 4660 4665
Asn Cys Leu Asp Asp Arg Cys Ile Leu His Cys Ala Asn Phe Asn
4670 4675 4680
Val Leu Phe Ser Thr Val Phe Pro Pro Thr Ser Phe Gly Pro Leu
4685 4690 4695
Val Arg Lys Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Thr
4700 4705 4710
Gly Tyr His Phe Arg Glu Leu Gly Val Val His Asn Gln Asp Val
4715 4720 4725
Asn Leu His Ser Ser Arg Leu Ser Phe Lys Glu Leu Leu Val Tyr
4730 4735 4740
Ala Ala Asp Pro Ala Met His Ala Ala Ser Gly Asn Leu Leu Leu
4745 4750 4755
Asp Lys Arg Thr Thr Cys Phe Ser Val Ala Ala Leu Thr Asn Asn
4760 4765 4770
Val Ala Phe Gln Thr Val Lys Pro Gly Asn Phe Asn Lys Asp Phe
4775 4780 4785
Tyr Asp Phe Ala Val Ser Lys Gly Phe Phe Lys Glu Gly Ser Ser
4790 4795 4800
Val Glu Leu Lys His Phe Phe Phe Ala Gln Asp Gly Asn Ala Ala
4805 4810 4815
Ile Ser Asp Tyr Asp Tyr Tyr Arg Tyr Asn Leu Pro Thr Met Cys
4820 4825 4830
Asp Ile Arg Gln Leu Leu Phe Val Val Glu Val Val Asp Lys Tyr
4835 4840 4845
Phe Asp Cys Tyr Asp Gly Gly Cys Ile Asn Ala Asn Gln Val Ile
4850 4855 4860
Val Asn Asn Leu Asp Lys Ser Ala Gly Phe Pro Phe Asn Lys Trp
4865 4870 4875
Gly Lys Ala Arg Leu Tyr Tyr Asp Ser Met Ser Tyr Glu Asp Gln
4880 4885 4890
Asp Ala Leu Phe Ala Tyr Thr Lys Arg Asn Val Ile Pro Thr Ile
4895 4900 4905
Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg Ala
4910 4915 4920
Arg Thr Val Ala Gly Val Ser Ile Cys Ser Thr Met Thr Asn Arg
4925 4930 4935
Gln Phe His Gln Lys Leu Leu Lys Ser Ile Ala Ala Thr Arg Gly
4940 4945 4950
Ala Thr Val Val Ile Gly Thr Ser Lys Phe Tyr Gly Gly Trp His
4955 4960 4965
Asn Met Leu Lys Thr Val Tyr Ser Asp Val Glu Thr Pro His Leu
4970 4975 4980
Met Gly Trp Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Met
4985 4990 4995
Leu Arg Ile Met Ala Ser Leu Val Leu Ala Arg Lys His Asn Thr
5000 5005 5010
Cys Cys Asn Leu Ser His Arg Phe Tyr Arg Leu Ala Asn Glu Cys
5015 5020 5025
Ala Gln Val Leu Ser Glu Met Val Met Cys Gly Gly Ser Leu Tyr
5030 5035 5040
Val Lys Pro Gly Gly Thr Ser Ser Gly Asp Ala Thr Thr Ala Tyr
5045 5050 5055
Ala Asn Ser Val Phe Asn Ile Cys Gln Ala Val Thr Ala Asn Val
5060 5065 5070
Asn Ala Leu Leu Ser Thr Asp Gly Asn Lys Ile Ala Asp Lys Tyr
5075 5080 5085
Val Arg Asn Leu Gln His Arg Leu Tyr Glu Cys Leu Tyr Arg Asn
5090 5095 5100
Arg Asp Val Asp His Glu Phe Val Asp Glu Phe Tyr Ala Tyr Leu
5105 5110 5115
Arg Lys His Phe Ser Met Met Ile Leu Ser Asp Asp Ala Val Val
5120 5125 5130
Cys Tyr Asn Ser Asn Tyr Ala Ala Gln Gly Leu Val Ala Ser Ile
5135 5140 5145
Lys Asn Phe Lys Ala Val Leu Tyr Tyr Gln Asn Asn Val Phe Met
5150 5155 5160
Ser Glu Ala Lys Cys Trp Thr Glu Thr Asp Leu Thr Lys Gly Pro
5165 5170 5175
His Glu Phe Cys Ser Gln His Thr Met Leu Val Lys Gln Gly Asp
5180 5185 5190
Asp Tyr Val Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly
5195 5200 5205
Ala Gly Cys Phe Val Asp Asp Ile Val Lys Thr Asp Gly Thr Leu
5210 5215 5220
Met Ile Glu Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu
5225 5230 5235
Thr Lys His Pro Asn Gln Glu Tyr Ala Asp Val Phe His Leu Tyr
5240 5245 5250
Leu Gln Tyr Ile Arg Lys Leu His Asp Glu Leu Thr Gly His Met
5255 5260 5265
Leu Asp Met Tyr Ser Val Met Leu Thr Asn Asp Asn Thr Ser Arg
5270 5275 5280
Tyr Trp Glu Pro Glu Phe Tyr Glu Ala Met Tyr Thr Pro His Thr
5285 5290 5295
Val Leu Gln Ala Val Gly Ala Cys Val Leu Cys Asn Ser Gln Thr
5300 5305 5310
Ser Leu Arg Cys Gly Ala Cys Ile Arg Arg Pro Phe Leu Cys Cys
5315 5320 5325
Lys Cys Cys Tyr Asp His Val Ile Ser Thr Ser His Lys Leu Val
5330 5335 5340
Leu Ser Val Asn Pro Tyr Val Cys Asn Ala Pro Gly Cys Asp Val
5345 5350 5355
Thr Asp Val Thr Gln Leu Tyr Leu Gly Gly Met Ser Tyr Tyr Cys
5360 5365 5370
Lys Ser His Lys Pro Pro Ile Ser Phe Pro Leu Cys Ala Asn Gly
5375 5380 5385
Gln Val Phe Gly Leu Tyr Lys Asn Thr Cys Val Gly Ser Asp Asn
5390 5395 5400
Val Thr Asp Phe Asn Ala Ile Ala Thr Cys Asp Trp Thr Asn Ala
5405 5410 5415
Gly Asp Tyr Ile Leu Ala Asn Thr Cys Thr Glu Arg Leu Lys Leu
5420 5425 5430
Phe Ala Ala Glu Thr Leu Lys Ala Thr Glu Glu Thr Phe Lys Leu
5435 5440 5445
Ser Tyr Gly Ile Ala Thr Val Arg Glu Val Leu Ser Asp Arg Glu
5450 5455 5460
Leu His Leu Ser Trp Glu Val Gly Lys Pro Arg Pro Pro Leu Asn
5465 5470 5475
Arg Asn Tyr Val Phe Thr Gly Tyr Arg Val Thr Lys Asn Ser Lys
5480 5485 5490
Val Gln Ile Gly Glu Tyr Thr Phe Glu Lys Gly Asp Tyr Gly Asp
5495 5500 5505
Ala Val Val Tyr Arg Gly Thr Thr Thr Tyr Lys Leu Asn Val Gly
5510 5515 5520
Asp Tyr Phe Val Leu Thr Ser His Thr Val Met Pro Leu Ser Ala
5525 5530 5535
Pro Thr Leu Val Pro Gln Glu His Tyr Val Arg Ile Thr Gly Leu
5540 5545 5550
Tyr Pro Thr Leu Asn Ile Ser Asp Glu Phe Ser Ser Asn Val Ala
5555 5560 5565
Asn Tyr Gln Lys Val Gly Met Gln Lys Tyr Ser Thr Leu Gln Gly
5570 5575 5580
Pro Pro Gly Thr Gly Lys Ser His Phe Ala Ile Gly Leu Ala Leu
5585 5590 5595
Tyr Tyr Pro Ser Ala Arg Ile Val Tyr Thr Ala Cys Ser His Ala
5600 5605 5610
Ala Val Asp Ala Leu Cys Glu Lys Ala Leu Lys Tyr Leu Pro Ile
5615 5620 5625
Asp Lys Cys Ser Arg Ile Ile Pro Ala Arg Ala Arg Val Glu Cys
5630 5635 5640
Phe Asp Lys Phe Lys Val Asn Ser Thr Leu Glu Gln Tyr Val Phe
5645 5650 5655
Cys Thr Val Asn Ala Leu Pro Glu Thr Thr Ala Asp Ile Val Val
5660 5665 5670
Phe Asp Glu Ile Ser Met Ala Thr Asn Tyr Asp Leu Ser Val Val
5675 5680 5685
Asn Ala Arg Leu Arg Ala Lys His Tyr Val Tyr Ile Gly Asp Pro
5690 5695 5700
Ala Gln Leu Pro Ala Pro Arg Thr Leu Leu Thr Lys Gly Thr Leu
5705 5710 5715
Glu Pro Glu Tyr Phe Asn Ser Val Cys Arg Leu Met Lys Thr Ile
5720 5725 5730
Gly Pro Asp Met Phe Leu Gly Thr Cys Arg Arg Cys Pro Ala Glu
5735 5740 5745
Ile Val Asp Thr Val Ser Ala Leu Val Tyr Asp Asn Lys Leu Lys
5750 5755 5760
Ala His Lys Asp Lys Ser Ala Gln Cys Phe Lys Met Phe Tyr Lys
5765 5770 5775
Gly Val Ile Thr His Asp Val Ser Ser Ala Ile Asn Arg Pro Gln
5780 5785 5790
Ile Gly Val Val Arg Glu Phe Leu Thr Arg Asn Pro Ala Trp Arg
5795 5800 5805
Lys Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn Ala Val Ala
5810 5815 5820
Ser Lys Ile Leu Gly Leu Pro Thr Gln Thr Val Asp Ser Ser Gln
5825 5830 5835
Gly Ser Glu Tyr Asp Tyr Val Ile Phe Thr Gln Thr Thr Glu Thr
5840 5845 5850
Ala His Ser Cys Asn Val Asn Arg Phe Asn Val Ala Ile Thr Arg
5855 5860 5865
Ala Lys Ile Gly Ile Leu Cys Ile Met Ser Asp Arg Asp Leu Tyr
5870 5875 5880
Asp Lys Leu Gln Phe Thr Ser Leu Glu Ile Pro Arg Arg Asn Val
5885 5890 5895
Ala Thr Leu Gln Ala Glu Asn Val Thr Gly Leu Phe Lys Asp Cys
5900 5905 5910
Ser Lys Ile Ile Thr Gly Leu His Pro Thr Gln Ala Pro Thr His
5915 5920 5925
Leu Ser Val Asp Ile Lys Phe Lys Thr Glu Gly Leu Cys Val Asp
5930 5935 5940
Ile Pro Gly Ile Pro Lys Asp Met Thr Tyr Arg Arg Leu Ile Ser
5945 5950 5955
Met Met Gly Phe Lys Met Asn Tyr Gln Val Asn Gly Tyr Pro Asn
5960 5965 5970
Met Phe Ile Thr Arg Glu Glu Ala Ile Arg His Val Arg Ala Trp
5975 5980 5985
Ile Gly Phe Asp Val Glu Gly Cys His Ala Thr Arg Asp Ala Val
5990 5995 6000
Gly Thr Asn Leu Pro Leu Gln Leu Gly Phe Ser Thr Gly Val Asn
6005 6010 6015
Leu Val Ala Val Pro Thr Gly Tyr Val Asp Thr Glu Asn Asn Thr
6020 6025 6030
Glu Phe Thr Arg Val Asn Ala Lys Pro Pro Pro Gly Asp Gln Phe
6035 6040 6045
Lys His Leu Ile Pro Leu Met Tyr Lys Gly Leu Pro Trp Asn Val
6050 6055 6060
Val Arg Ile Lys Ile Val Gln Met Leu Ser Asp Thr Leu Lys Gly
6065 6070 6075
Leu Ser Asp Arg Val Val Phe Val Leu Trp Ala His Gly Phe Glu
6080 6085 6090
Leu Thr Ser Met Lys Tyr Phe Val Lys Ile Gly Pro Glu Arg Thr
6095 6100 6105
Cys Cys Leu Cys Asp Lys Arg Ala Thr Cys Phe Ser Thr Ser Ser
6110 6115 6120
Asp Thr Tyr Ala Cys Trp Asn His Ser Val Gly Phe Asp Tyr Val
6125 6130 6135
Tyr Asn Pro Phe Met Ile Asp Val Gln Gln Trp Gly Phe Thr Gly
6140 6145 6150
Asn Leu Gln Ser Asn His Asp Gln His Cys Gln Val His Gly Asn
6155 6160 6165
Ala His Val Ala Ser Cys Asp Ala Ile Met Thr Arg Cys Leu Ala
6170 6175 6180
Val His Glu Cys Phe Val Lys Arg Val Asp Trp Ser Val Glu Tyr
6185 6190 6195
Pro Ile Ile Gly Asp Glu Leu Arg Val Asn Ser Ala Cys Arg Lys
6200 6205 6210
Val Gln His Met Val Val Lys Ser Ala Leu Leu Ala Asp Lys Phe
6215 6220 6225
Pro Val Leu His Asp Ile Gly Asn Pro Lys Ala Ile Lys Cys Val
6230 6235 6240
Pro Gln Ala Glu Val Glu Trp Lys Phe Tyr Asp Ala Gln Pro Cys
6245 6250 6255
Ser Asp Lys Ala Tyr Lys Ile Glu Glu Leu Phe Tyr Ser Tyr Ala
6260 6265 6270
Thr His His Asp Lys Phe Thr Asp Gly Val Cys Leu Phe Trp Asn
6275 6280 6285
Cys Asn Val Asp Arg Tyr Pro Ala Asn Ala Ile Val Cys Arg Phe
6290 6295 6300
Asp Thr Arg Val Leu Ser Asn Leu Asn Leu Pro Gly Cys Asp Gly
6305 6310 6315
Gly Ser Leu Tyr Val Asn Lys His Ala Phe His Thr Pro Ala Phe
6320 6325 6330
Asp Lys Ser Ala Phe Thr Asn Leu Lys Gln Leu Pro Phe Phe Tyr
6335 6340 6345
Tyr Ser Asp Ser Pro Cys Glu Ser His Gly Lys Gln Val Val Ser
6350 6355 6360
Asp Ile Asp Tyr Val Pro Leu Lys Ser Ala Thr Cys Ile Thr Arg
6365 6370 6375
Cys Asn Leu Gly Gly Ala Val Cys Arg His His Ala Asn Glu Tyr
6380 6385 6390
Arg Gln Tyr Leu Asp Ala Tyr Asn Met Met Ile Ser Ala Gly Phe
6395 6400 6405
Ser Leu Trp Ile Tyr Lys Gln Phe Asp Thr Tyr Asn Leu Trp Asn
6410 6415 6420
Thr Phe Thr Arg Leu Gln Ser Leu Glu Asn Val Ala Tyr Asn Val
6425 6430 6435
Val Asn Lys Gly His Phe Asp Gly His Ala Gly Glu Ala Pro Val
6440 6445 6450
Ser Ile Ile Asn Asn Ala Val Tyr Thr Lys Val Asp Gly Ile Asp
6455 6460 6465
Val Glu Ile Phe Glu Asn Lys Thr Thr Leu Pro Val Asn Val Ala
6470 6475 6480
Phe Glu Leu Trp Ala Lys Arg Asn Ile Lys Pro Val Pro Glu Ile
6485 6490 6495
Lys Ile Leu Asn Asn Leu Gly Val Asp Ile Ala Ala Asn Thr Val
6500 6505 6510
Ile Trp Asp Tyr Lys Arg Glu Ala Pro Ala His Val Ser Thr Ile
6515 6520 6525
Gly Val Cys Thr Met Thr Asp Ile Ala Lys Lys Pro Thr Glu Ser
6530 6535 6540
Ala Cys Ser Ser Leu Thr Val Leu Phe Asp Gly Arg Val Glu Gly
6545 6550 6555
Gln Val Asp Leu Phe Arg Asn Ala Arg Asn Gly Val Leu Ile Thr
6560 6565 6570
Glu Gly Ser Val Lys Gly Leu Thr Pro Ser Lys Gly Pro Ala Gln
6575 6580 6585
Ala Ser Val Asn Gly Val Thr Leu Ile Gly Glu Ser Val Lys Thr
6590 6595 6600
Gln Phe Asn Tyr Phe Lys Lys Val Asp Gly Ile Ile Gln Gln Leu
6605 6610 6615
Pro Glu Thr Tyr Phe Thr Gln Ser Arg Asp Leu Glu Asp Phe Lys
6620 6625 6630
Pro Arg Ser Gln Met Glu Thr Asp Phe Leu Glu Leu Ala Met Asp
6635 6640 6645
Glu Phe Ile Gln Arg Tyr Lys Leu Glu Gly Tyr Ala Phe Glu His
6650 6655 6660
Ile Val Tyr Gly Asp Phe Ser His Gly Gln Leu Gly Gly Leu His
6665 6670 6675
Leu Met Ile Gly Leu Ala Lys Arg Ser Gln Asp Ser Pro Leu Lys
6680 6685 6690
Leu Glu Asp Phe Ile Pro Met Asp Ser Thr Val Lys Asn Tyr Phe
6695 6700 6705
Ile Thr Asp Ala Gln Thr Gly Ser Ser Lys Cys Val Cys Ser Val
6710 6715 6720
Ile Asp Leu Leu Leu Asp Asp Phe Val Glu Ile Ile Lys Ser Gln
6725 6730 6735
Asp Leu Ser Val Ile Ser Lys Val Val Lys Val Thr Ile Asp Tyr
6740 6745 6750
Ala Glu Ile Ser Phe Met Leu Trp Cys Lys Asp Gly His Val Glu
6755 6760 6765
Thr Phe Tyr Pro Lys Leu Gln Ala Ser Gln Ala Trp Gln Pro Gly
6770 6775 6780
Val Ala Met Pro Asn Leu Tyr Lys Met Gln Arg Met Leu Leu Glu
6785 6790 6795
Lys Cys Asp Leu Gln Asn Tyr Gly Glu Asn Ala Val Ile Pro Lys
6800 6805 6810
Gly Ile Met Met Asn Val Ala Lys Tyr Thr Gln Leu Cys Gln Tyr
6815 6820 6825
Leu Asn Thr Leu Thr Leu Ala Val Pro Tyr Asn Met Arg Val Ile
6830 6835 6840
His Phe Gly Ala Gly Ser Asp Lys Gly Val Ala Pro Gly Thr Ala
6845 6850 6855
Val Leu Arg Gln Trp Leu Pro Thr Gly Thr Leu Leu Val Asp Ser
6860 6865 6870
Asp Leu Asn Asp Phe Val Ser Asp Ala Asp Ser Thr Leu Ile Gly
6875 6880 6885
Asp Cys Ala Thr Val His Thr Ala Asn Lys Trp Asp Leu Ile Ile
6890 6895 6900
Ser Asp Met Tyr Asp Pro Arg Thr Lys His Val Thr Lys Glu Asn
6905 6910 6915
Asp Ser Lys Glu Gly Phe Phe Thr Tyr Leu Cys Gly Phe Ile Lys
6920 6925 6930
Gln Lys Leu Ala Leu Gly Gly Ser Ile Ala Val Lys Ile Thr Glu
6935 6940 6945
His Ser Trp Asn Ala Asp Leu Tyr Lys Leu Met Gly His Phe Ser
6950 6955 6960
Trp Trp Thr Ala Phe Val Thr Asn Val Asn Ala Ser Ser Ser Glu
6965 6970 6975
Ala Phe Leu Ile Gly Ala Asn Tyr Leu Gly Lys Pro Lys Glu Gln
6980 6985 6990
Ile Asp Gly Tyr Thr Met His Ala Asn Tyr Ile Phe Trp Arg Asn
6995 7000 7005
Thr Asn Pro Ile Gln Leu Ser Ser Tyr Ser Leu Phe Asp Met Ser
7010 7015 7020
Lys Phe Pro Leu Lys Leu Arg Gly Thr Ala Val Met Ser Leu Lys
7025 7030 7035
Glu Asn Gln Ile Asn Asp Met Ile Tyr Ser Leu Leu Glu Lys Gly
7040 7045 7050
Arg Leu Ile Ile Arg Glu Asn Asn Arg Val Val Val Ser Ser Asp
7055 7060 7065
Ile Leu Val Asn Asn
7070
<210> 28
<211> 6758
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> ORF 1 AB
<400> 28
Met Ala Cys Asn Arg Val Thr Leu Ala Val Ala Ser Asp Ser Glu Ile
1 5 10 15
Ser Ala Asn Gly Cys Ser Thr Ile Ala Gln Ala Val Arg Arg Tyr Ser
20 25 30
Glu Ala Ala Ser Asn Gly Phe Arg Ala Cys Arg Phe Val Ser Leu Asp
35 40 45
Leu Gln Asp Cys Ile Val Gly Ile Ala Asp Asp Thr Tyr Val Met Gly
50 55 60
Leu His Gly Asn Gln Thr Leu Phe Cys Asn Ile Met Lys Phe Ser Asp
65 70 75 80
Arg Pro Phe Met Leu His Gly Trp Leu Val Phe Ser Asn Ser Asn Tyr
85 90 95
Leu Leu Glu Glu Phe Asp Val Val Phe Gly Lys Arg Gly Gly Gly Asn
100 105 110
Val Thr Tyr Thr Asp Gln Tyr Leu Cys Gly Ala Asp Gly Lys Pro Val
115 120 125
Met Ser Glu Asp Leu Trp Gln Phe Val Asp His Phe Gly Glu Asn Glu
130 135 140
Glu Ile Ile Ile Asn Gly His Thr Tyr Val Cys Ala Trp Leu Thr Lys
145 150 155 160
Arg Lys Pro Leu Asp Tyr Lys Arg Gln Asn Asn Leu Ala Ile Glu Glu
165 170 175
Ile Glu Tyr Val His Gly Asp Ala Leu His Thr Leu Arg Asn Gly Ser
180 185 190
Val Leu Glu Met Ala Lys Glu Val Lys Thr Ser Ser Lys Val Val Leu
195 200 205
Ser Asp Ala Leu Asp Lys Leu Tyr Lys Val Phe Gly Ser Pro Val Met
210 215 220
Thr Asn Gly Ser Asn Ile Leu Glu Ala Phe Thr Lys Pro Val Phe Ile
225 230 235 240
Ser Ala Leu Val Gln Cys Thr Cys Gly Thr Lys Ser Trp Ser Val Gly
245 250 255
Asp Trp Thr Gly Phe Lys Ser Ser Cys Cys Asn Val Ile Ser Asn Lys
260 265 270
Leu Cys Val Val Pro Gly Asn Val Lys Pro Gly Asp Ala Val Ile Thr
275 280 285
Thr Gln Gln Ala Gly Ala Gly Ile Lys Tyr Phe Cys Gly Met Thr Leu
290 295 300
Lys Phe Val Ala Asn Ile Glu Gly Val Ser Val Trp Arg Val Ile Ala
305 310 315 320
Leu Gln Ser Val Asp Cys Phe Val Ala Ser Ser Thr Phe Val Glu Glu
325 330 335
Glu His Val Asn Arg Met Asp Thr Phe Cys Phe Asn Val Arg Asn Ser
340 345 350
Val Thr Asp Glu Cys Arg Leu Ala Met Leu Gly Ala Glu Met Thr Ser
355 360 365
Asn Val Arg Arg Gln Val Ala Ser Gly Val Ile Asp Ile Ser Thr Gly
370 375 380
Trp Phe Asp Val Tyr Asp Asp Ile Phe Ala Glu Ser Lys Pro Trp Phe
385 390 395 400
Val Arg Lys Ala Glu Asp Ile Phe Gly Pro Cys Trp Ser Ala Leu Ala
405 410 415
Ser Ala Leu Lys Gln Leu Lys Val Thr Thr Gly Glu Leu Val Arg Phe
420 425 430
Val Lys Ser Ile Cys Asn Ser Ala Val Ala Val Val Gly Gly Thr Ile
435 440 445
Gln Ile Leu Ala Ser Val Pro Glu Lys Phe Leu Asn Ala Phe Asp Val
450 455 460
Phe Val Thr Ala Ile Gln Thr Val Phe Asp Cys Ala Val Glu Thr Cys
465 470 475 480
Thr Ile Ala Gly Lys Ala Phe Asp Lys Val Phe Asp Tyr Val Leu Leu
485 490 495
Asp Asn Ala Leu Val Lys Leu Val Thr Thr Lys Leu Lys Gly Val Arg
500 505 510
Glu Arg Gly Leu Asn Lys Val Lys Tyr Ala Thr Val Val Val Gly Ser
515 520 525
Thr Glu Glu Val Lys Ser Ser Arg Val Glu Arg Ser Thr Ala Val Leu
530 535 540
Thr Ile Ala Asn Asn Tyr Ser Lys Leu Phe Asp Glu Gly Tyr Thr Val
545 550 555 560
Val Ile Gly Asp Val Ala Tyr Phe Val Ser Asp Gly Tyr Phe Arg Leu
565 570 575
Met Ala Ser Pro Asn Ser Val Leu Thr Thr Ala Val Tyr Lys Pro Leu
580 585 590
Phe Ala Phe Asn Val Asn Val Met Gly Thr Arg Pro Glu Lys Phe Pro
595 600 605
Thr Thr Val Thr Cys Glu Asn Leu Glu Ser Ala Val Leu Phe Val Asn
610 615 620
Asp Lys Ile Thr Glu Phe Gln Leu Asp Tyr Ser Ile Asp Val Ile Asp
625 630 635 640
Asn Glu Ile Ile Val Lys Pro Asn Ile Ser Leu Cys Val Pro Leu Tyr
645 650 655
Val Arg Asp Tyr Val Asp Lys Trp Asp Asp Phe Cys Arg Gln Tyr Ser
660 665 670
Asn Glu Ser Trp Phe Glu Asp Asp Tyr Arg Ala Phe Ile Ser Val Leu
675 680 685
Asp Ile Thr Asp Ala Ala Val Lys Ala Ala Glu Ser Lys Ala Phe Val
690 695 700
Asp Thr Ile Val Pro Pro Cys Pro Ser Ile Leu Lys Val Ile Asp Gly
705 710 715 720
Gly Lys Ile Trp Asn Gly Val Ile Lys Asn Val Asn Ser Val Arg Asp
725 730 735
Trp Leu Lys Ser Leu Lys Leu Asn Leu Thr Gln Gln Gly Leu Leu Gly
740 745 750
Thr Cys Ala Lys Arg Phe Lys Arg Trp Leu Gly Ile Leu Leu Glu Ala
755 760 765
Tyr Asn Ala Phe Leu Asp Thr Val Val Ser Thr Val Lys Ile Gly Gly
770 775 780
Leu Thr Phe Lys Thr Tyr Ala Phe Asp Lys Pro Tyr Ile Val Ile Arg
785 790 795 800
Asp Ile Val Cys Lys Val Glu Asn Lys Thr Glu Ala Glu Trp Ile Glu
805 810 815
Leu Phe Pro His Asn Asp Arg Ile Lys Ser Phe Ser Thr Phe Glu Ser
820 825 830
Ala Tyr Met Pro Ile Ala Asp Pro Thr His Phe Asp Ile Glu Glu Val
835 840 845
Glu Leu Leu Asp Ala Glu Phe Val Glu Pro Gly Cys Gly Gly Ile Leu
850 855 860
Ala Val Ile Asp Glu His Val Phe Tyr Lys Lys Asp Gly Val Tyr Tyr
865 870 875 880
Pro Ser Asn Gly Thr Asn Ile Leu Pro Val Ala Phe Thr Lys Ala Ala
885 890 895
Gly Gly Lys Val Ser Phe Ser Asp Asp Val Glu Val Lys Asp Ile Glu
900 905 910
Pro Val Tyr Arg Val Lys Leu Cys Phe Glu Phe Glu Asp Glu Lys Leu
915 920 925
Val Asp Val Cys Glu Lys Ala Ile Gly Lys Lys Ile Lys His Glu Gly
930 935 940
Asp Trp Asp Ser Phe Cys Lys Thr Ile Gln Ser Ala Leu Ser Val Val
945 950 955 960
Ser Cys Tyr Val Asn Leu Pro Thr Tyr Tyr Ile Tyr Asp Glu Glu Gly
965 970 975
Gly Asn Asp Leu Ser Leu Pro Val Met Ile Ser Glu Trp Pro Leu Ser
980 985 990
Val Gln Gln Ala Gln Gln Glu Ala Thr Leu Pro Asp Ile Ala Glu Asp
995 1000 1005
Val Val Asp Gln Val Glu Glu Val Asn Ser Ile Phe Asp Ile Glu
1010 1015 1020
Thr Val Asp Val Lys His Asp Val Ser Pro Phe Glu Met Pro Phe
1025 1030 1035
Glu Glu Leu Asn Gly Leu Lys Ile Leu Lys Gln Leu Asp Asn Asn
1040 1045 1050
Cys Trp Val Asn Ser Val Met Leu Gln Ile Gln Leu Thr Gly Ile
1055 1060 1065
Leu Asp Gly Asp Tyr Ala Met Gln Phe Phe Lys Met Gly Arg Val
1070 1075 1080
Ala Lys Met Ile Glu Arg Cys Tyr Thr Ala Glu Gln Cys Ile Arg
1085 1090 1095
Gly Ala Met Gly Asp Val Gly Leu Cys Met Tyr Arg Leu Leu Lys
1100 1105 1110
Asp Leu His Thr Gly Phe Met Val Met Asp Tyr Lys Cys Ser Cys
1115 1120 1125
Thr Ser Gly Arg Leu Glu Glu Ser Gly Ala Val Leu Phe Cys Thr
1130 1135 1140
Pro Thr Lys Lys Ala Phe Pro Tyr Gly Thr Cys Leu Asn Cys Asn
1145 1150 1155
Ala Pro Arg Met Cys Thr Ile Arg Gln Leu Gln Gly Thr Ile Ile
1160 1165 1170
Phe Val Gln Gln Lys Pro Glu Pro Val Asn Pro Val Ser Phe Val
1175 1180 1185
Val Lys Pro Val Cys Ser Ser Ile Phe Arg Gly Ala Val Ser Cys
1190 1195 1200
Gly His Tyr Gln Thr Asn Ile Tyr Ser Gln Asn Leu Cys Val Asp
1205 1210 1215
Gly Phe Gly Val Asn Lys Ile Gln Pro Trp Thr Asn Asp Ala Leu
1220 1225 1230
Asn Thr Ile Cys Ile Lys Asp Ala Asp Tyr Asn Ala Lys Val Glu
1235 1240 1245
Ile Ser Val Thr Pro Ile Lys Asn Thr Val Asp Thr Thr Pro Lys
1250 1255 1260
Glu Glu Phe Val Val Lys Glu Lys Leu Asn Ala Phe Leu Val His
1265 1270 1275
Asp Asn Val Ala Phe Tyr Gln Gly Asp Val Asp Thr Val Val Asn
1280 1285 1290
Gly Val Asp Phe Asp Phe Ile Val Asn Ala Ala Asn Glu Asn Leu
1295 1300 1305
Ala His Gly Gly Gly Leu Ala Lys Ala Leu Asp Val Tyr Thr Lys
1310 1315 1320
Gly Lys Leu Gln Arg Leu Ser Lys Glu His Ile Gly Leu Ala Gly
1325 1330 1335
Lys Val Lys Val Gly Thr Gly Val Met Val Glu Cys Asp Ser Leu
1340 1345 1350
Arg Ile Phe Asn Val Val Gly Pro Arg Lys Gly Lys His Glu Arg
1355 1360 1365
Asp Leu Leu Ile Lys Ala Tyr Asn Thr Ile Asn Asn Glu Gln Gly
1370 1375 1380
Thr Pro Leu Thr Pro Ile Leu Ser Cys Gly Ile Phe Gly Ile Lys
1385 1390 1395
Leu Glu Thr Ser Leu Glu Val Leu Leu Asp Val Cys Asn Thr Lys
1400 1405 1410
Glu Val Lys Val Phe Val Tyr Thr Asp Thr Glu Val Cys Lys Val
1415 1420 1425
Lys Asp Phe Val Ser Gly Leu Val Asn Val Gln Lys Val Glu Gln
1430 1435 1440
Pro Lys Ile Glu Pro Lys Pro Val Ser Val Ile Lys Val Ala Pro
1445 1450 1455
Lys Pro Tyr Arg Val Asp Gly Lys Phe Ser Tyr Phe Thr Glu Asp
1460 1465 1470
Leu Leu Cys Val Ala Asp Asp Lys Pro Ile Val Leu Phe Thr Asp
1475 1480 1485
Ser Met Leu Thr Leu Asp Asp Arg Gly Leu Ala Leu Asp Asn Ala
1490 1495 1500
Leu Ser Gly Val Leu Ser Ala Ala Ile Lys Asp Cys Val Asp Ile
1505 1510 1515
Asn Lys Ala Ile Pro Ser Gly Asn Leu Ile Lys Phe Asp Ile Gly
1520 1525 1530
Ser Val Val Val Tyr Met Cys Val Val Pro Ser Glu Lys Asp Lys
1535 1540 1545
His Leu Asp Asn Asn Val Gln Arg Cys Thr Arg Lys Leu Asn Arg
1550 1555 1560
Leu Met Cys Asp Ile Val Cys Thr Ile Pro Ala Asp Tyr Ile Leu
1565 1570 1575
Pro Leu Val Leu Ser Ser Leu Thr Cys Asn Val Ser Phe Val Gly
1580 1585 1590
Glu Leu Lys Ala Ala Glu Ala Lys Val Ile Thr Ile Lys Val Thr
1595 1600 1605
Glu Asp Gly Val Asn Val His Asp Val Thr Val Thr Thr Asp Lys
1610 1615 1620
Ser Phe Glu Gln Gln Val Gly Val Ile Ala Asp Lys Asp Lys Asp
1625 1630 1635
Leu Ser Gly Ala Val Pro Ser Asp Leu Asn Thr Ser Glu Leu Leu
1640 1645 1650
Thr Lys Ala Ile Asp Val Asp Trp Val Glu Phe Tyr Gly Phe Lys
1655 1660 1665
Asp Ala Val Thr Phe Ala Thr Val Asp His Ser Ala Phe Ala Tyr
1670 1675 1680
Glu Ser Ala Val Val Asn Gly Ile Arg Val Leu Lys Thr Ser Asp
1685 1690 1695
Asn Asn Cys Trp Val Asn Ala Val Cys Ile Ala Leu Gln Tyr Ser
1700 1705 1710
Lys Pro His Phe Ile Ser Gln Gly Leu Asp Ala Ala Trp Asn Lys
1715 1720 1725
Phe Val Leu Gly Asp Val Glu Ile Phe Val Ala Phe Val Tyr Tyr
1730 1735 1740
Val Ala Arg Leu Met Lys Gly Asp Lys Gly Asp Ala Glu Asp Thr
1745 1750 1755
Leu Thr Lys Leu Ser Lys Tyr Leu Ala Asn Glu Ala Gln Val Gln
1760 1765 1770
Leu Glu His Tyr Ser Ser Cys Val Glu Cys Asp Ala Lys Phe Lys
1775 1780 1785
Asn Ser Val Ala Ser Ile Asn Ser Ala Ile Val Cys Ala Ser Val
1790 1795 1800
Lys Arg Asp Gly Val Gln Val Gly Tyr Cys Val His Gly Ile Lys
1805 1810 1815
Tyr Tyr Ser Arg Val Arg Ser Val Arg Gly Arg Ala Ile Ile Val
1820 1825 1830
Ser Val Glu Gln Leu Glu Pro Cys Ala Gln Ser Arg Leu Leu Ser
1835 1840 1845
Gly Val Ala Tyr Thr Ala Phe Ser Gly Pro Val Asp Lys Gly His
1850 1855 1860
Tyr Thr Val Tyr Asp Thr Ala Lys Lys Ser Met Tyr Asp Gly Asp
1865 1870 1875
Arg Phe Val Lys His Asp Leu Ser Leu Leu Ser Val Thr Ser Val
1880 1885 1890
Val Met Val Gly Gly Tyr Val Ala Pro Val Asn Thr Val Lys Pro
1895 1900 1905
Lys Pro Val Ile Asn Gln Leu Asp Glu Lys Ala Gln Lys Phe Phe
1910 1915 1920
Asp Phe Gly Asp Phe Leu Ile His Asn Phe Val Ile Phe Phe Thr
1925 1930 1935
Trp Leu Leu Ser Met Phe Thr Leu Cys Lys Thr Ala Val Thr Thr
1940 1945 1950
Gly Asp Val Lys Ile Met Ala Lys Ala Pro Gln Arg Thr Gly Val
1955 1960 1965
Val Leu Lys Arg Ser Leu Lys Tyr Asn Leu Lys Ala Ser Ala Ala
1970 1975 1980
Val Leu Lys Ser Lys Trp Trp Leu Leu Ala Lys Phe Thr Lys Leu
1985 1990 1995
Leu Leu Leu Ile Tyr Thr Leu Tyr Ser Val Val Leu Leu Cys Val
2000 2005 2010
Arg Phe Gly Pro Phe Asn Phe Cys Ser Glu Thr Val Asn Gly Tyr
2015 2020 2025
Ala Lys Ser Asn Phe Val Lys Asp Asp Tyr Cys Asp Gly Ser Leu
2030 2035 2040
Gly Cys Lys Met Cys Leu Phe Gly Tyr Gln Glu Leu Ser Gln Phe
2045 2050 2055
Ser His Leu Asp Val Val Trp Lys His Ile Thr Asp Pro Leu Phe
2060 2065 2070
Ser Asn Met Gln Pro Phe Ile Val Met Val Leu Leu Leu Ile Phe
2075 2080 2085
Gly Asp Asn Tyr Leu Arg Cys Phe Leu Leu Tyr Phe Val Ala Gln
2090 2095 2100
Met Ile Ser Thr Val Gly Val Phe Leu Gly Tyr Lys Glu Thr Asn
2105 2110 2115
Trp Phe Leu His Phe Ile Pro Phe Asp Val Ile Cys Asp Glu Leu
2120 2125 2130
Leu Val Thr Val Ile Val Ile Lys Val Ile Ser Phe Val Arg His
2135 2140 2145
Val Leu Phe Gly Cys Glu Asn Pro Asp Cys Ile Ala Cys Ser Lys
2150 2155 2160
Ser Ala Arg Leu Lys Arg Phe Pro Val Asn Thr Ile Val Asn Gly
2165 2170 2175
Val Gln Arg Ser Phe Tyr Val Asn Ala Asn Gly Gly Ser Lys Phe
2180 2185 2190
Cys Lys Lys His Arg Phe Phe Cys Val Asp Cys Asp Ser Tyr Gly
2195 2200 2205
Tyr Gly Ser Thr Phe Ile Thr Pro Glu Val Ser Arg Glu Leu Gly
2210 2215 2220
Asn Ile Thr Lys Thr Asn Val Gln Pro Thr Gly Pro Ala Tyr Val
2225 2230 2235
Met Ile Asp Lys Val Glu Phe Glu Asn Gly Phe Tyr Arg Leu Tyr
2240 2245 2250
Ser Cys Glu Thr Phe Trp Arg Tyr Asn Phe Asp Ile Thr Glu Ser
2255 2260 2265
Lys Tyr Ser Cys Lys Glu Val Phe Lys Asn Cys Asn Val Leu Asp
2270 2275 2280
Asp Phe Ile Val Phe Asn Asn Asn Gly Thr Asn Val Thr Gln Val
2285 2290 2295
Lys Asn Ala Ser Val Tyr Phe Ser Gln Leu Leu Cys Arg Pro Ile
2300 2305 2310
Lys Leu Val Asp Ser Glu Leu Leu Ser Thr Leu Ser Val Asp Phe
2315 2320 2325
Asn Gly Val Leu His Lys Ala Tyr Ile Asp Val Leu Arg Asn Ser
2330 2335 2340
Phe Gly Lys Asp Leu Asn Ala Asn Met Ser Leu Ala Glu Cys Lys
2345 2350 2355
Arg Ala Leu Gly Leu Ser Ile Ser Asp His Glu Phe Thr Ser Ala
2360 2365 2370
Ile Ser Asn Ala His Arg Cys Asp Val Leu Leu Ser Asp Leu Ser
2375 2380 2385
Phe Asn Asn Phe Val Ser Ser Tyr Ala Lys Pro Glu Glu Lys Leu
2390 2395 2400
Ser Ala Tyr Asp Leu Ala Cys Cys Met Arg Ala Gly Ala Lys Val
2405 2410 2415
Val Asn Ala Asn Val Leu Thr Lys Asp Gln Thr Pro Ile Val Trp
2420 2425 2430
His Ala Lys Asp Phe Asn Ser Leu Ser Ala Glu Gly Arg Lys Tyr
2435 2440 2445
Ile Val Lys Thr Ser Lys Ala Lys Gly Leu Thr Phe Leu Leu Thr
2450 2455 2460
Ile Asn Glu Asn Gln Ala Val Thr Gln Ile Pro Ala Thr Ser Ile
2465 2470 2475
Val Ala Lys Gln Gly Ala Gly Asp Ala Gly His Ser Leu Thr Trp
2480 2485 2490
Leu Trp Leu Leu Cys Gly Leu Val Cys Leu Ile Gln Phe Tyr Leu
2495 2500 2505
Cys Phe Phe Met Pro Tyr Phe Met Tyr Asp Ile Val Ser Ser Phe
2510 2515 2520
Glu Gly Tyr Asp Phe Lys Tyr Ile Glu Asn Gly Gln Leu Lys Asn
2525 2530 2535
Phe Glu Ala Pro Leu Lys Cys Val Arg Asn Val Phe Glu Asn Phe
2540 2545 2550
Glu Asp Trp His Tyr Ala Lys Phe Gly Phe Thr Pro Leu Asn Lys
2555 2560 2565
Gln Ser Cys Pro Ile Val Val Gly Val Ser Glu Ile Val Asn Thr
2570 2575 2580
Val Ala Gly Ile Pro Ser Asn Val Tyr Leu Val Gly Lys Thr Leu
2585 2590 2595
Ile Phe Thr Leu Gln Ala Ala Phe Gly Asn Ala Gly Val Cys Tyr
2600 2605 2610
Asp Ile Phe Gly Val Thr Thr Pro Glu Lys Cys Ile Phe Thr Ser
2615 2620 2625
Ala Cys Thr Arg Leu Glu Gly Leu Gly Gly Asn Asn Val Tyr Cys
2630 2635 2640
Tyr Asn Thr Ala Leu Met Glu Gly Ser Leu Pro Tyr Ser Ser Ile
2645 2650 2655
Gln Ala Asn Ala Tyr Tyr Lys Tyr Asp Asn Gly Asn Phe Ile Lys
2660 2665 2670
Leu Pro Glu Val Ile Ala Gln Gly Phe Gly Phe Arg Thr Val Arg
2675 2680 2685
Thr Ile Ala Thr Lys Tyr Cys Arg Val Gly Glu Cys Val Glu Ser
2690 2695 2700
Asn Ala Gly Val Cys Phe Gly Phe Asp Lys Trp Phe Val Asn Asp
2705 2710 2715
Gly Arg Val Ala Asn Gly Tyr Val Cys Gly Thr Gly Leu Trp Asn
2720 2725 2730
Leu Val Phe Asn Ile Leu Ser Met Phe Ser Ser Ser Phe Ser Val
2735 2740 2745
Ala Ala Met Ser Gly Gln Ile Leu Leu Asn Cys Ala Leu Gly Ala
2750 2755 2760
Phe Ala Ile Phe Cys Cys Phe Leu Val Thr Lys Phe Arg Arg Met
2765 2770 2775
Phe Gly Asp Leu Ser Val Gly Val Cys Thr Val Val Val Ala Val
2780 2785 2790
Leu Leu Asn Asn Val Ser Tyr Ile Val Thr Gln Asn Leu Val Thr
2795 2800 2805
Met Ile Ala Tyr Ala Ile Leu Tyr Phe Phe Ala Thr Arg Ser Leu
2810 2815 2820
Arg Tyr Ala Trp Ile Trp Cys Ala Ala Tyr Leu Ile Ala Tyr Ile
2825 2830 2835
Ser Phe Ala Pro Trp Trp Leu Cys Ala Trp Tyr Phe Leu Ala Met
2840 2845 2850
Leu Thr Gly Leu Leu Pro Ser Leu Leu Lys Leu Lys Val Ser Thr
2855 2860 2865
Asn Leu Phe Glu Gly Asp Lys Phe Val Gly Thr Phe Glu Ser Ala
2870 2875 2880
Ala Ala Gly Thr Phe Val Ile Asp Met Arg Ser Tyr Glu Lys Leu
2885 2890 2895
Ala Asn Ser Ile Ser Pro Glu Lys Leu Lys Ser Tyr Ala Ala Ser
2900 2905 2910
Tyr Asn Arg Tyr Lys Tyr Tyr Ser Gly Asn Ala Asn Glu Ala Asp
2915 2920 2925
Tyr Arg Cys Ala Cys Tyr Ala Tyr Leu Ala Lys Ala Met Leu Asp
2930 2935 2940
Phe Ser Arg Asp His Asn Asp Ile Leu Tyr Thr Pro Pro Thr Val
2945 2950 2955
Ser Tyr Gly Ser Thr Leu Gln Ala Gly Leu Arg Lys Met Ala Gln
2960 2965 2970
Pro Ser Gly Phe Val Glu Lys Cys Val Val Arg Val Cys Tyr Gly
2975 2980 2985
Asn Thr Val Leu Asn Gly Leu Trp Leu Gly Asp Ile Val Tyr Cys
2990 2995 3000
Pro Arg His Val Ile Ala Ser Asn Thr Thr Ser Ala Ile Asp Tyr
3005 3010 3015
Asp His Glu Tyr Ser Ile Met Arg Leu His Asn Phe Ser Ile Ile
3020 3025 3030
Ser Gly Thr Ala Phe Leu Gly Val Val Gly Ala Thr Met His Gly
3035 3040 3045
Val Thr Leu Lys Ile Lys Val Ser Gln Thr Asn Met His Thr Pro
3050 3055 3060
Arg His Ser Phe Arg Thr Leu Lys Ser Gly Glu Gly Phe Asn Ile
3065 3070 3075
Leu Ala Cys Tyr Asp Gly Cys Ala Gln Gly Val Phe Gly Val Asn
3080 3085 3090
Met Arg Thr Asn Trp Thr Ile Arg Gly Ser Phe Ile Asn Gly Ala
3095 3100 3105
Cys Gly Ser Pro Gly Tyr Asn Leu Lys Asn Gly Glu Val Glu Phe
3110 3115 3120
Val Tyr Met His Gln Ile Glu Leu Gly Ser Gly Ser His Val Gly
3125 3130 3135
Ser Ser Phe Asp Gly Val Met Tyr Gly Gly Phe Glu Asp Gln Pro
3140 3145 3150
Asn Leu Gln Val Glu Ser Ala Asn Gln Met Leu Thr Val Asn Val
3155 3160 3165
Val Ala Phe Leu Tyr Ala Ala Ile Leu Asn Gly Cys Thr Trp Trp
3170 3175 3180
Leu Lys Gly Glu Lys Leu Phe Val Glu His Tyr Asn Glu Trp Ala
3185 3190 3195
Gln Ala Asn Gly Phe Thr Ala Met Asn Gly Glu Asp Ala Phe Ser
3200 3205 3210
Ile Leu Ala Ala Lys Thr Gly Val Cys Val Glu Arg Leu Leu His
3215 3220 3225
Ala Ile Gln Val Leu Asn Asn Gly Phe Gly Gly Lys Gln Ile Leu
3230 3235 3240
Gly Tyr Ser Ser Leu Asn Asp Glu Phe Ser Ile Asn Glu Val Val
3245 3250 3255
Lys Gln Met Phe Gly Val Asn Leu Gln Ser Gly Lys Thr Thr Ser
3260 3265 3270
Met Phe Lys Ser Ile Ser Leu Phe Ala Gly Phe Phe Val Met Phe
3275 3280 3285
Trp Ala Glu Leu Phe Val Tyr Thr Thr Thr Ile Trp Val Asn Pro
3290 3295 3300
Gly Phe Leu Thr Pro Phe Met Ile Leu Leu Val Ala Leu Ser Leu
3305 3310 3315
Cys Leu Thr Phe Val Val Lys His Lys Val Leu Phe Leu Gln Val
3320 3325 3330
Phe Leu Leu Pro Ser Ile Ile Val Ala Ala Ile Gln Asn Cys Ala
3335 3340 3345
Trp Asp Tyr His Val Thr Lys Val Leu Ala Glu Lys Phe Asp Tyr
3350 3355 3360
Asn Val Ser Val Met Gln Met Asp Ile Gln Gly Phe Val Asn Ile
3365 3370 3375
Phe Ile Cys Leu Phe Val Ala Leu Leu His Thr Trp Arg Phe Ala
3380 3385 3390
Lys Glu Arg Cys Thr His Trp Cys Thr Tyr Leu Phe Ser Leu Ile
3395 3400 3405
Ala Val Leu Tyr Thr Ala Leu Tyr Ser Tyr Asp Tyr Val Ser Leu
3410 3415 3420
Leu Val Met Leu Leu Cys Ala Ile Ser Asn Glu Trp Tyr Ile Gly
3425 3430 3435
Ala Ile Ile Phe Arg Ile Cys Arg Phe Gly Val Ala Phe Leu Pro
3440 3445 3450
Val Glu Tyr Val Ser Tyr Phe Asp Gly Val Lys Thr Val Leu Leu
3455 3460 3465
Phe Tyr Met Leu Leu Gly Phe Val Ser Cys Met Tyr Tyr Gly Leu
3470 3475 3480
Leu Tyr Trp Ile Asn Arg Phe Cys Lys Cys Thr Leu Gly Val Tyr
3485 3490 3495
Asp Phe Cys Val Ser Pro Ala Glu Phe Lys Tyr Met Val Ala Asn
3500 3505 3510
Gly Leu Asn Ala Pro Asn Gly Pro Phe Asp Ala Leu Phe Leu Ser
3515 3520 3525
Phe Lys Leu Met Gly Ile Gly Gly Pro Arg Thr Ile Lys Val Ser
3530 3535 3540
Thr Val Gln Ser Lys Leu Thr Asp Leu Lys Cys Thr Asn Val Val
3545 3550 3555
Leu Met Gly Ile Leu Ser Asn Met Asn Ile Ala Ser Asn Ser Lys
3560 3565 3570
Glu Trp Ala Tyr Cys Val Glu Met His Asn Lys Ile Asn Leu Cys
3575 3580 3585
Asp Asp Pro Glu Thr Ala Gln Glu Leu Leu Leu Ala Leu Leu Ala
3590 3595 3600
Phe Phe Leu Ser Lys His Ser Asp Phe Gly Leu Gly Asp Leu Val
3605 3610 3615
Asp Ser Tyr Phe Glu Asn Asp Ser Ile Leu Gln Ser Val Ala Ser
3620 3625 3630
Ser Phe Val Gly Met Pro Ser Phe Val Ala Tyr Glu Thr Ala Arg
3635 3640 3645
Gln Glu Tyr Glu Asn Ala Val Ala Asn Gly Ser Ser Pro Gln Ile
3650 3655 3660
Ile Lys Gln Leu Lys Lys Ala Met Asn Val Ala Lys Ala Glu Phe
3665 3670 3675
Asp Arg Glu Ser Ser Val Gln Lys Lys Ile Asn Arg Met Ala Glu
3680 3685 3690
Gln Ala Ala Ala Ala Met Tyr Lys Glu Ala Arg Ala Val Asn Arg
3695 3700 3705
Lys Ser Lys Val Val Ser Ala Met His Ser Leu Leu Phe Gly Met
3710 3715 3720
Leu Arg Arg Leu Asp Met Ser Ser Val Asp Thr Ile Leu Asn Met
3725 3730 3735
Ala Arg Asn Gly Val Val Pro Leu Ser Val Ile Pro Ala Thr Ser
3740 3745 3750
Ala Ala Arg Leu Val Val Val Val Pro Asp His Asp Ser Phe Val
3755 3760 3765
Lys Met Met Val Asp Gly Phe Val His Tyr Ala Gly Val Val Trp
3770 3775 3780
Thr Leu Gln Glu Val Lys Asp Asn Asp Gly Lys Asn Val His Leu
3785 3790 3795
Lys Asp Val Thr Lys Glu Asn Gln Glu Ile Leu Val Trp Pro Leu
3800 3805 3810
Ile Leu Thr Cys Glu Arg Val Val Lys Leu Gln Asn Asn Glu Ile
3815 3820 3825
Met Pro Gly Lys Met Lys Val Lys Ala Thr Lys Gly Glu Gly Asp
3830 3835 3840
Gly Gly Ile Thr Ser Glu Gly Asn Ala Leu Tyr Asn Asn Glu Gly
3845 3850 3855
Gly Arg Ala Phe Met Tyr Ala Tyr Val Thr Thr Lys Pro Gly Met
3860 3865 3870
Lys Tyr Val Lys Trp Glu His Asp Ser Gly Val Val Thr Val Glu
3875 3880 3885
Leu Glu Pro Pro Cys Arg Phe Val Ile Asp Thr Pro Thr Gly Pro
3890 3895 3900
Gln Ile Lys Tyr Leu Tyr Phe Val Lys Asn Leu Asn Asn Leu Arg
3905 3910 3915
Arg Gly Ala Val Leu Gly Tyr Ile Gly Ala Thr Val Arg Leu Gln
3920 3925 3930
Ala Gly Lys Gln Thr Glu Phe Val Ser Asn Ser His Leu Leu Thr
3935 3940 3945
His Cys Ser Phe Ala Val Asp Pro Ala Ala Ala Tyr Leu Asp Ala
3950 3955 3960
Val Lys Gln Gly Ala Lys Pro Val Gly Asn Cys Val Lys Met Leu
3965 3970 3975
Thr Asn Gly Ser Gly Ser Gly Gln Ala Ile Thr Cys Thr Ile Asp
3980 3985 3990
Ser Asn Thr Thr Gln Asp Thr Tyr Gly Gly Ala Ser Val Cys Ile
3995 4000 4005
Tyr Cys Arg Ala His Val Ala His Pro Thr Met Asp Gly Phe Cys
4010 4015 4020
Gln Tyr Lys Gly Lys Trp Val Gln Val Pro Ile Gly Thr Asn Asp
4025 4030 4035
Pro Ile Arg Phe Cys Leu Glu Asn Thr Val Cys Lys Val Cys Gly
4040 4045 4050
Cys Trp Leu Asn His Gly Cys Thr Cys Asp Arg Thr Ala Ile Gln
4055 4060 4065
Ser Phe Asp Asn Ser Tyr Leu Asn Arg Val Arg Gly Ser Ser Ala
4070 4075 4080
Ala Arg Leu Glu Pro Cys Asn Gly Thr Asp Ile Asp Tyr Cys Val
4085 4090 4095
Arg Ala Phe Asp Val Tyr Asn Lys Asp Ala Ser Phe Ile Gly Lys
4100 4105 4110
Asn Leu Lys Ser Asn Cys Val Arg Phe Lys Asn Val Asp Lys Asp
4115 4120 4125
Asp Ala Phe Tyr Ile Val Lys Arg Cys Ile Lys Ser Val Met Asp
4130 4135 4140
His Glu Gln Ser Met Tyr Asn Leu Leu Lys Gly Cys Asn Ala Val
4145 4150 4155
Ala Lys His Asp Phe Phe Thr Trp His Glu Gly Arg Thr Ile Tyr
4160 4165 4170
Gly Asn Val Ser Arg Gln Asp Leu Thr Lys Tyr Thr Met Met Asp
4175 4180 4185
Leu Cys Phe Ala Leu Arg Asn Phe Asp Glu Lys Asp Cys Glu Val
4190 4195 4200
Phe Lys Glu Ile Leu Val Leu Thr Gly Cys Cys Ser Thr Asp Tyr
4205 4210 4215
Phe Glu Met Lys Asn Trp Phe Asp Pro Ile Glu Asn Glu Asp Ile
4220 4225 4230
His Arg Val Tyr Ala Ala Leu Gly Lys Val Val Ala Asn Ala Met
4235 4240 4245
Leu Lys Cys Val Ala Phe Cys Asp Glu Met Val Leu Lys Gly Val
4250 4255 4260
Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Asn Gly Asn Phe
4265 4270 4275
Tyr Asp Phe Gly Asp Phe Val Leu Cys Pro Pro Gly Met Gly Ile
4280 4285 4290
Pro Tyr Cys Thr Ser Tyr Tyr Ser Tyr Met Met Pro Val Met Gly
4295 4300 4305
Met Thr Asn Cys Leu Ala Ser Glu Cys Phe Met Lys Ser Asp Ile
4310 4315 4320
Phe Gly Gln Asp Phe Lys Thr Phe Asp Leu Leu Lys Tyr Asp Phe
4325 4330 4335
Thr Glu His Lys Glu Val Leu Phe Asn Lys Tyr Phe Lys Tyr Trp
4340 4345 4350
Gly Gln Asp Tyr His Pro Asp Cys Val Asp Cys His Asp Glu Met
4355 4360 4365
Cys Ile Leu His Cys Ser Asn Phe Asn Thr Leu Phe Ala Thr Thr
4370 4375 4380
Ile Pro Asn Thr Ala Phe Gly Pro Leu Cys Arg Lys Val Phe Ile
4385 4390 4395
Asp Gly Val Pro Val Val Ala Thr Ala Gly Tyr His Phe Lys Gln
4400 4405 4410
Leu Gly Leu Val Trp Asn Lys Asp Val Asn Thr His Ser Thr Arg
4415 4420 4425
Leu Thr Ile Thr Glu Leu Leu Gln Phe Val Thr Asp Pro Thr Leu
4430 4435 4440
Ile Val Ala Ser Ser Pro Ala Leu Val Asp Lys Arg Thr Val Cys
4445 4450 4455
Phe Ser Val Ala Ala Leu Ser Thr Gly Leu Thr Ser Gln Thr Val
4460 4465 4470
Lys Pro Gly His Phe Asn Lys Glu Phe Tyr Asp Phe Leu Arg Ser
4475 4480 4485
Gln Gly Phe Phe Asp Glu Gly Ser Glu Leu Thr Leu Lys His Phe
4490 4495 4500
Phe Phe Thr Gln Lys Gly Asp Ala Ala Ile Lys Asp Phe Asp Tyr
4505 4510 4515
Tyr Arg Tyr Asn Arg Pro Thr Met Leu Asp Ile Gly Gln Ala Arg
4520 4525 4530
Val Ala Tyr Gln Val Ala Ala Arg Tyr Phe Asp Cys Tyr Glu Gly
4535 4540 4545
Gly Cys Ile Thr Ser Arg Glu Val Val Val Thr Asn Leu Asn Lys
4550 4555 4560
Ser Ala Gly Trp Pro Leu Asn Lys Phe Gly Lys Ala Gly Leu Tyr
4565 4570 4575
Tyr Glu Ser Ile Ser Tyr Glu Glu Gln Asp Ala Ile Phe Ser Leu
4580 4585 4590
Thr Lys Arg Asn Ile Leu Pro Thr Met Thr Gln Leu Asn Leu Lys
4595 4600 4605
Tyr Ala Ile Ser Gly Lys Glu Arg Ala Arg Thr Val Gly Gly Val
4610 4615 4620
Ser Leu Leu Ala Thr Met Thr Thr Arg Gln Phe His Gln Lys Cys
4625 4630 4635
Leu Lys Ser Ile Val Ala Thr Arg Asn Ala Thr Val Val Ile Gly
4640 4645 4650
Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asn Met Leu Lys Asn Leu
4655 4660 4665
Met Ala Asp Val Asp Asp Pro Lys Leu Met Gly Trp Asp Tyr Pro
4670 4675 4680
Lys Cys Asp Arg Ala Met Pro Ser Met Ile Arg Met Leu Ser Ala
4685 4690 4695
Met Ile Leu Gly Ser Lys His Val Thr Cys Cys Thr Ala Ser Asp
4700 4705 4710
Lys Phe Tyr Arg Leu Ser Asn Glu Leu Ala Gln Val Leu Thr Glu
4715 4720 4725
Val Val Tyr Ser Asn Gly Gly Phe Tyr Phe Lys Pro Gly Gly Thr
4730 4735 4740
Thr Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val Phe Asn
4745 4750 4755
Ile Phe Gln Ala Val Ser Ser Asn Ile Asn Cys Val Leu Ser Val
4760 4765 4770
Asn Ser Ser Asn Cys Asn Asn Phe Asn Val Lys Lys Leu Gln Arg
4775 4780 4785
Gln Leu Tyr Asp Asn Cys Tyr Arg Asn Ser Asn Val Asp Glu Ser
4790 4795 4800
Phe Val Asp Asp Phe Tyr Gly Tyr Leu Gln Lys His Phe Ser Met
4805 4810 4815
Met Ile Leu Ser Asp Asp Ser Val Val Cys Tyr Asn Lys Thr Tyr
4820 4825 4830
Ala Gly Leu Gly Tyr Ile Ala Asp Ile Ser Ala Phe Lys Ala Thr
4835 4840 4845
Leu Tyr Tyr Gln Asn Gly Val Phe Met Ser Thr Ala Lys Cys Trp
4850 4855 4860
Thr Glu Glu Asp Leu Ser Ile Gly Pro His Glu Phe Cys Ser Gln
4865 4870 4875
His Thr Met Gln Ile Val Asp Glu Asn Gly Lys Tyr Tyr Leu Pro
4880 4885 4890
Tyr Pro Asp Pro Ser Arg Ile Ile Ser Ala Gly Val Phe Val Asp
4895 4900 4905
Asp Ile Thr Lys Thr Asp Ala Val Ile Leu Leu Glu Arg Tyr Val
4910 4915 4920
Ser Leu Ala Ile Asp Ala Tyr Pro Leu Ser Lys His Pro Lys Pro
4925 4930 4935
Glu Tyr Arg Lys Val Phe Tyr Ala Leu Leu Asp Trp Val Lys His
4940 4945 4950
Leu Asn Lys Thr Leu Asn Glu Gly Val Leu Glu Ser Phe Ser Val
4955 4960 4965
Thr Leu Leu Asp Glu His Glu Ser Lys Phe Trp Asp Glu Ser Phe
4970 4975 4980
Tyr Ala Ser Met Tyr Glu Lys Ser Thr Val Leu Gln Ala Ala Gly
4985 4990 4995
Leu Cys Val Val Cys Gly Ser Gln Thr Val Leu Arg Cys Gly Asp
5000 5005 5010
Cys Leu Arg Arg Pro Met Leu Cys Thr Lys Cys Ala Tyr Asp His
5015 5020 5025
Val Phe Gly Thr Asp His Lys Phe Ile Leu Ala Ile Thr Pro Tyr
5030 5035 5040
Val Cys Asn Thr Ser Gly Cys Asn Val Asn Asp Val Thr Lys Leu
5045 5050 5055
Tyr Leu Gly Gly Leu Asn Tyr Tyr Cys Val Asp His Lys Pro His
5060 5065 5070
Leu Ser Phe Pro Leu Cys Ser Ala Gly Asn Val Phe Gly Leu Tyr
5075 5080 5085
Lys Ser Ser Ala Leu Gly Ser Met Asp Ile Asp Val Phe Asn Lys
5090 5095 5100
Leu Ser Thr Ser Asp Trp Ser Asp Ile Arg Asp Tyr Lys Leu Ala
5105 5110 5115
Asn Asp Ala Lys Glu Ser Leu Arg Leu Phe Ala Ala Glu Thr Val
5120 5125 5130
Lys Ala Lys Glu Glu Ser Val Lys Ser Ser Tyr Ala Tyr Ala Thr
5135 5140 5145
Leu Lys Glu Ile Val Gly Pro Lys Glu Leu Leu Leu Leu Trp Glu
5150 5155 5160
Ser Gly Lys Ala Lys Pro Pro Leu Asn Arg Asn Ser Val Phe Thr
5165 5170 5175
Cys Phe Gln Ile Thr Lys Asp Ser Lys Phe Gln Val Gly Glu Phe
5180 5185 5190
Val Phe Glu Lys Val Asp Tyr Gly Ser Asp Thr Val Thr Tyr Lys
5195 5200 5205
Ser Thr Ala Thr Thr Lys Leu Val Pro Gly Met Leu Phe Ile Leu
5210 5215 5220
Thr Ser His Asn Val Ala Pro Leu Arg Ala Pro Thr Met Ala Asn
5225 5230 5235
Gln Glu Lys Tyr Ser Thr Ile Tyr Lys Leu His Pro Ser Phe Asn
5240 5245 5250
Val Ser Asp Ala Tyr Ala Asn Leu Val Pro Tyr Tyr Gln Leu Ile
5255 5260 5265
Gly Lys Gln Arg Ile Thr Thr Ile Gln Gly Pro Pro Gly Ser Gly
5270 5275 5280
Lys Ser His Cys Ser Ile Gly Ile Gly Val Tyr Tyr Pro Gly Ala
5285 5290 5295
Arg Ile Val Phe Thr Ala Cys Ser His Ala Ala Val Asp Ser Leu
5300 5305 5310
Cys Ala Lys Ala Val Thr Ala Tyr Ser Val Asp Lys Cys Thr Arg
5315 5320 5325
Ile Ile Pro Ala Arg Ala Arg Val Glu Cys Tyr Ser Gly Phe Lys
5330 5335 5340
Pro Asn Asn Asn Ser Ala Gln Tyr Val Phe Ser Thr Val Asn Ala
5345 5350 5355
Leu Pro Glu Val Asn Ala Asp Ile Val Val Val Asp Glu Val Ser
5360 5365 5370
Met Cys Thr Asn Tyr Asp Leu Ser Val Ile Asn Gln Arg Ile Ser
5375 5380 5385
Tyr Lys His Ile Val Tyr Val Gly Asp Pro Gln Gln Leu Pro Ala
5390 5395 5400
Pro Arg Val Leu Ile Ser Lys Gly Val Met Glu Pro Ile Asp Tyr
5405 5410 5415
Asn Val Val Thr Gln Arg Met Cys Ala Ile Gly Pro Asp Val Phe
5420 5425 5430
Leu His Lys Cys Tyr Arg Cys Pro Ala Glu Ile Val Asn Thr Val
5435 5440 5445
Ser Glu Leu Val Tyr Glu Asn Lys Phe Val Pro Val Lys Glu Ala
5450 5455 5460
Ser Lys Gln Cys Phe Lys Ile Phe Glu Arg Gly Ser Val Gln Val
5465 5470 5475
Asp Asn Gly Ser Ser Ile Asn Arg Arg Gln Leu Asp Val Val Lys
5480 5485 5490
Arg Phe Ile His Lys Asn Ser Thr Trp Ser Lys Ala Val Phe Ile
5495 5500 5505
Ser Pro Tyr Asn Ser Gln Asn Tyr Val Ala Ala Arg Leu Leu Gly
5510 5515 5520
Leu Gln Thr Gln Thr Val Asp Ser Ala Gln Gly Ser Glu Tyr Asp
5525 5530 5535
Tyr Val Ile Phe Ala Gln Thr Ser Asp Thr Ala His Ala Cys Asn
5540 5545 5550
Ala Asn Arg Phe Asn Val Ala Ile Thr Arg Ala Lys Lys Gly Ile
5555 5560 5565
Phe Cys Ile Met Ser Asp Arg Thr Leu Phe Asp Ala Leu Lys Phe
5570 5575 5580
Phe Glu Ile Thr Met Thr Asp Leu Gln Ser Glu Ser Ser Cys Gly
5585 5590 5595
Leu Phe Lys Asp Cys Ala Arg Asn Pro Ile Asp Leu Pro Pro Ser
5600 5605 5610
His Ala Thr Thr Tyr Leu Ser Leu Ser Asp Arg Phe Lys Thr Ser
5615 5620 5625
Gly Asp Leu Ala Val Gln Ile Gly Asn Asn Asn Val Cys Thr Tyr
5630 5635 5640
Glu His Val Ile Ser Tyr Met Gly Phe Arg Phe Asp Val Ser Met
5645 5650 5655
Pro Gly Ser His Ser Leu Phe Cys Thr Arg Asp Phe Ala Met Arg
5660 5665 5670
His Val Arg Gly Trp Leu Gly Met Asp Val Glu Gly Ala His Val
5675 5680 5685
Thr Gly Asp Asn Val Gly Thr Asn Val Pro Leu Gln Val Gly Phe
5690 5695 5700
Ser Asn Gly Val Asp Phe Val Ala Gln Pro Glu Gly Cys Val Leu
5705 5710 5715
Thr Asn Thr Gly Ser Val Val Lys Pro Val Arg Ala Arg Ala Pro
5720 5725 5730
Pro Gly Glu Gln Phe Thr His Ile Val Pro Leu Leu Arg Lys Gly
5735 5740 5745
Gln Pro Trp Ser Val Leu Arg Lys Arg Ile Val Gln Met Ile Ala
5750 5755 5760
Asp Phe Leu Ala Gly Ser Ser Asp Val Leu Val Phe Val Leu Trp
5765 5770 5775
Ala Gly Gly Leu Glu Leu Thr Thr Met Arg Tyr Phe Val Lys Ile
5780 5785 5790
Gly Ala Val Lys His Cys Gln Cys Gly Thr Val Ala Thr Cys Tyr
5795 5800 5805
Asn Ser Val Ser Asn Asp Tyr Cys Cys Phe Lys His Ala Leu Gly
5810 5815 5820
Cys Asp Tyr Val Tyr Asn Pro Tyr Val Ile Asp Ile Gln Gln Trp
5825 5830 5835
Gly Tyr Val Gly Ser Leu Ser Thr Asn His His Ala Ile Cys Asn
5840 5845 5850
Val His Arg Asn Glu His Val Ala Ser Gly Asp Ala Ile Met Thr
5855 5860 5865
Arg Cys Leu Ala Val Tyr Asp Cys Phe Val Lys Asn Val Asp Trp
5870 5875 5880
Ser Ile Thr Tyr Pro Met Ile Ala Asn Glu Asn Ala Ile Asn Lys
5885 5890 5895
Gly Gly Arg Thr Val Gln Ser His Ile Met Arg Ala Ala Ile Lys
5900 5905 5910
Leu Tyr Asn Pro Lys Ala Ile His Asp Ile Gly Asn Pro Lys Gly
5915 5920 5925
Ile Arg Cys Ala Val Thr Asp Ala Lys Trp Tyr Cys Tyr Asp Lys
5930 5935 5940
Asn Pro Ile Asn Ser Asn Val Lys Thr Leu Glu Tyr Asp Tyr Met
5945 5950 5955
Thr His Gly Gln Met Asp Gly Leu Cys Leu Phe Trp Asn Cys Asn
5960 5965 5970
Val Asp Met Tyr Pro Glu Phe Ser Ile Val Cys Arg Phe Asp Thr
5975 5980 5985
Arg Thr Arg Ser Thr Leu Asn Leu Glu Gly Val Asn Gly Gly Ser
5990 5995 6000
Leu Tyr Val Asn Asn His Ala Phe His Thr Pro Ala Tyr Asp Lys
6005 6010 6015
Arg Ala Met Ala Lys Leu Lys Pro Ala Pro Phe Phe Tyr Tyr Asp
6020 6025 6030
Asp Gly Ser Cys Glu Val Val His Asp Gln Val Asn Tyr Val Pro
6035 6040 6045
Leu Arg Ala Thr Asn Cys Ile Thr Lys Cys Asn Ile Gly Gly Ala
6050 6055 6060
Val Cys Ser Lys His Ala Asn Leu Tyr Arg Ala Tyr Val Glu Ser
6065 6070 6075
Tyr Asn Ile Phe Thr Gln Ala Gly Phe Asn Ile Trp Val Pro Thr
6080 6085 6090
Thr Phe Asp Cys Tyr Asn Leu Trp Gln Thr Phe Thr Glu Val Asn
6095 6100 6105
Leu Gln Gly Leu Glu Asn Ile Ala Phe Asn Val Val Asn Lys Gly
6110 6115 6120
Ser Phe Val Gly Ala Asp Gly Glu Leu Pro Val Ala Ile Ser Gly
6125 6130 6135
Asp Lys Val Phe Val Arg Asp Gly Asn Thr Asp Asn Leu Val Phe
6140 6145 6150
Val Asn Lys Thr Ser Leu Pro Thr Asn Ile Ala Phe Glu Leu Phe
6155 6160 6165
Ala Lys Arg Lys Val Gly Leu Thr Pro Pro Leu Ser Ile Leu Lys
6170 6175 6180
Asn Leu Gly Val Val Ala Thr Tyr Lys Phe Val Leu Trp Asp Tyr
6185 6190 6195
Glu Ala Glu Arg Pro Leu Thr Ser Phe Thr Lys Ser Val Cys Gly
6200 6205 6210
Tyr Thr Asp Phe Ala Glu Asp Val Cys Thr Cys Tyr Asp Asn Ser
6215 6220 6225
Ile Gln Gly Ser Tyr Glu Arg Phe Thr Leu Ser Thr Asn Ala Val
6230 6235 6240
Leu Phe Ser Ala Thr Ala Val Lys Thr Gly Gly Lys Ser Leu Pro
6245 6250 6255
Ala Ile Lys Leu Asn Phe Gly Met Leu Asn Gly Asn Ala Ile Ala
6260 6265 6270
Thr Val Lys Ser Glu Asp Gly Asn Ile Lys Asn Ile Asn Trp Phe
6275 6280 6285
Val Tyr Val Arg Lys Asp Gly Lys Pro Val Asp His Tyr Asp Gly
6290 6295 6300
Phe Tyr Thr Gln Gly Arg Asn Leu Gln Asp Phe Leu Pro Arg Ser
6305 6310 6315
Thr Met Glu Glu Asp Phe Leu Asn Met Asp Ile Gly Val Phe Ile
6320 6325 6330
Gln Lys Tyr Gly Leu Glu Asp Phe Asn Phe Glu His Val Val Tyr
6335 6340 6345
Gly Asp Val Ser Lys Thr Thr Leu Gly Gly Leu His Leu Leu Ile
6350 6355 6360
Ser Gln Val Arg Leu Ser Lys Met Gly Ile Leu Lys Ala Glu Glu
6365 6370 6375
Phe Val Ala Ala Ser Asp Ile Thr Leu Lys Cys Cys Thr Val Thr
6380 6385 6390
Tyr Leu Asn Asp Pro Ser Ser Lys Thr Val Cys Thr Tyr Met Asp
6395 6400 6405
Leu Leu Leu Asp Asp Phe Val Ser Val Leu Lys Ser Leu Asp Leu
6410 6415 6420
Thr Val Val Ser Lys Val His Glu Val Ile Ile Asp Asn Lys Pro
6425 6430 6435
Trp Arg Trp Met Leu Trp Cys Lys Asp Asn Ala Val Ala Thr Phe
6440 6445 6450
Tyr Pro Gln Leu Gln Ser Ala Glu Trp Lys Cys Gly Tyr Ser Met
6455 6460 6465
Pro Gly Ile Tyr Lys Thr Gln Arg Met Cys Leu Glu Pro Cys Asn
6470 6475 6480
Leu Tyr Asn Tyr Gly Ala Gly Leu Lys Leu Pro Ser Gly Ile Met
6485 6490 6495
Phe Asn Val Val Lys Tyr Thr Gln Leu Cys Gln Tyr Phe Asn Ser
6500 6505 6510
Thr Thr Leu Cys Val Pro His Asn Met Arg Val Leu His Leu Gly
6515 6520 6525
Ala Gly Ser Asp Tyr Gly Val Ala Pro Gly Thr Ala Val Leu Lys
6530 6535 6540
Arg Trp Leu Pro His Asp Ala Ile Val Val Asp Asn Asp Val Val
6545 6550 6555
Asp Tyr Val Ser Asp Ala Asp Phe Ser Val Thr Gly Asp Cys Ala
6560 6565 6570
Thr Val Tyr Leu Glu Asp Lys Phe Asp Leu Leu Ile Ser Asp Met
6575 6580 6585
Tyr Asp Gly Arg Thr Lys Ala Ile Asp Gly Glu Asn Val Ser Lys
6590 6595 6600
Glu Gly Phe Phe Thr Tyr Ile Asn Gly Phe Ile Cys Glu Lys Leu
6605 6610 6615
Ala Ile Gly Gly Ser Ile Ala Ile Lys Val Thr Glu Tyr Ser Trp
6620 6625 6630
Asn Lys Lys Leu Tyr Glu Leu Val Gln Arg Phe Ser Phe Trp Thr
6635 6640 6645
Met Phe Cys Thr Ser Val Asn Thr Ser Ser Ser Glu Ala Phe Val
6650 6655 6660
Val Gly Ile Asn Tyr Leu Gly Asp Phe Ala Gln Gly Pro Phe Ile
6665 6670 6675
Asp Gly Asn Ile Ile His Ala Asn Tyr Val Phe Trp Arg Asn Ser
6680 6685 6690
Thr Val Met Ser Leu Ser Tyr Asn Ser Val Leu Asp Leu Ser Lys
6695 6700 6705
Phe Asn Cys Lys His Lys Ala Thr Val Val Val Gln Leu Lys Asp
6710 6715 6720
Ser Asp Ile Asn Glu Met Val Leu Ser Leu Val Arg Ser Gly Lys
6725 6730 6735
Leu Leu Val Arg Gly Asn Gly Lys Cys Leu Ser Phe Ser Asn His
6740 6745 6750
Leu Val Ser Thr Lys
6755
<210> 29
<211> 2666
<212> PRT
<213> transmissible gastroenteritus virus
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 29
Glu Pro Cys Asn Gly Thr Asp Pro Asp His Val Ser Arg Ala Phe Asp
1 5 10 15
Ile Tyr Asn Lys Asp Val Ala Cys Ile Gly Lys Phe Leu Lys Thr Asn
20 25 30
Cys Ser Arg Phe Arg Asn Leu Asp Lys His Asp Ala Tyr Tyr Ile Val
35 40 45
Lys Arg Cys Thr Lys Thr Val Met Asp His Glu Gln Val Cys Tyr Asn
50 55 60
Asp Leu Lys Asp Ser Gly Ala Val Ala Glu His Asp Phe Phe Thr Tyr
65 70 75 80
Lys Glu Gly Arg Cys Glu Phe Gly Asn Val Ala Arg Arg Asn Leu Thr
85 90 95
Lys Tyr Thr Met Met Asp Leu Cys Tyr Ala Ile Arg Asn Phe Asp Glu
100 105 110
Lys Asn Cys Glu Val Leu Lys Glu Ile Leu Val Thr Val Gly Ala Cys
115 120 125
Thr Glu Glu Phe Phe Glu Asn Lys Asp Trp Phe Asp Pro Val Glu Asn
130 135 140
Glu Ala Ile His Glu Val Tyr Ala Lys Leu Gly Pro Ile Val Ala Asn
145 150 155 160
Ala Met Leu Lys Cys Val Ala Phe Cys Asp Ala Ile Val Glu Lys Gly
165 170 175
Tyr Ile Gly Val Ile Thr Leu Asp Asn Gln Asp Leu Asn Gly Asn Phe
180 185 190
Tyr Asp Phe Gly Asp Phe Val Lys Thr Ala Pro Gly Phe Gly Cys Ala
195 200 205
Cys Val Thr Ser Tyr Tyr Ser Tyr Met Met Pro Leu Met Gly Met Thr
210 215 220
Ser Cys Leu Glu Ser Glu Asn Phe Val Lys Ser Asp Ile Tyr Gly Ser
225 230 235 240
Asp Tyr Lys Gln Tyr Asp Leu Leu Ala Tyr Asp Phe Thr Glu His Lys
245 250 255
Glu Tyr Leu Phe Gln Lys Tyr Phe Lys Tyr Trp Asp Arg Thr Tyr His
260 265 270
Pro Asn Cys Ser Asp Cys Thr Ser Asp Glu Cys Ile Ile His Cys Ala
275 280 285
Asn Phe Asn Thr Leu Phe Ser Met Thr Ile Pro Met Thr Ala Phe Gly
290 295 300
Pro Leu Val Arg Lys Val His Ile Asp Gly Val Pro Val Val Val Thr
305 310 315 320
Ala Gly Tyr His Phe Lys Gln Leu Gly Ile Val Trp Asn Leu Asp Val
325 330 335
Lys Leu Asp Thr Met Lys Leu Ser Met Thr Asp Leu Leu Arg Phe Val
340 345 350
Thr Asp Pro Thr Leu Leu Val Ala Ser Ser Pro Ala Leu Leu Asp Gln
355 360 365
Arg Thr Val Cys Phe Ser Ile Ala Ala Leu Ser Thr Gly Ile Thr Tyr
370 375 380
Gln Thr Val Lys Pro Gly His Phe Asn Lys Asp Phe Tyr Asp Phe Ile
385 390 395 400
Thr Glu Arg Gly Phe Phe Glu Glu Gly Ser Glu Leu Thr Leu Lys His
405 410 415
Phe Phe Phe Ala Gln Gly Gly Glu Ala Ala Met Thr Asp Phe Asn Tyr
420 425 430
Tyr Arg Tyr Asn Arg Val Thr Val Leu Asp Ile Cys Gln Ala Gln Phe
435 440 445
Val Tyr Lys Ile Val Gly Lys Tyr Phe Glu Cys Tyr Asp Gly Gly Cys
450 455 460
Ile Asn Ala Arg Glu Val Val Val Thr Asn Tyr Asp Lys Ser Ala Gly
465 470 475 480
Tyr Pro Leu Asn Lys Phe Gly Lys Ala Arg Leu Tyr Tyr Glu Thr Leu
485 490 495
Ser Tyr Glu Glu Gln Asp Ala Leu Phe Ala Leu Thr Lys Arg Asn Val
500 505 510
Leu Pro Thr Met Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Gly Lys
515 520 525
Ala Arg Ala Arg Thr Val Gly Gly Val Ser Leu Leu Ser Thr Met Thr
530 535 540
Thr Arg Gln Tyr His Gln Lys His Leu Lys Ser Ile Ala Ala Thr Arg
545 550 555 560
Asn Ala Thr Val Val Ile Gly Ser Thr Lys Phe Tyr Gly Gly Trp Asp
565 570 575
Asn Met Leu Lys Asn Leu Met Arg Asp Val Asp Asn Gly Cys Leu Met
580 585 590
Gly Trp Asp Tyr Pro Lys Cys Asp Arg Ala Leu Pro Asn Met Ile Arg
595 600 605
Met Ala Ser Ala Met Ile Leu Gly Ser Lys His Val Gly Cys Cys Thr
610 615 620
His Asn Asp Arg Phe Tyr Arg Leu Ser Asn Glu Leu Ala Gln Val Leu
625 630 635 640
Thr Glu Val Val His Cys Thr Gly Gly Phe Tyr Phe Lys Pro Gly Gly
645 650 655
Thr Thr Ser Gly Asp Gly Thr Thr Ala Tyr Ala Asn Ser Ala Phe Asn
660 665 670
Ile Phe Gln Ala Val Ser Ala Asn Val Asn Lys Leu Leu Gly Val Asp
675 680 685
Ser Asn Ala Cys Asn Asn Val Thr Val Lys Ser Ile Gln Arg Lys Ile
690 695 700
Tyr Asp Asn Cys Tyr Arg Ser Ser Ser Ile Asp Glu Glu Phe Val Val
705 710 715 720
Glu Tyr Phe Ser Tyr Leu Arg Lys His Phe Ser Met Met Ile Leu Ser
725 730 735
Asp Asp Gly Val Val Cys Tyr Asn Lys Asp Tyr Ala Asp Leu Gly Tyr
740 745 750
Val Ala Asp Ile Asn Ala Phe Lys Ala Thr Leu Tyr Tyr Gln Asn Asn
755 760 765
Val Phe Met Ser Thr Ser Lys Cys Trp Val Glu Pro Asp Leu Ser Val
770 775 780
Gly Pro His Glu Phe Cys Ser Gln His Thr Leu Gln Ile Val Gly Pro
785 790 795 800
Asp Gly Asp Tyr Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Ser
805 810 815
Ala Gly Val Phe Val Asp Asp Ile Val Lys Thr Asp Asn Val Ile Met
820 825 830
Leu Glu Arg Tyr Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Thr Lys
835 840 845
His Pro Lys Pro Ala Tyr Gln Lys Val Phe Tyr Thr Leu Leu Asp Trp
850 855 860
Val Lys His Leu Gln Lys Asn Leu Asn Ala Gly Val Leu Asp Ser Phe
865 870 875 880
Ser Val Thr Met Leu Glu Glu Gly Gln Asp Lys Phe Trp Ser Glu Glu
885 890 895
Phe Tyr Ala Ser Leu Tyr Glu Lys Ser Thr Val Leu Gln Ala Ala Gly
900 905 910
Met Cys Val Val Cys Gly Ser Gln Thr Val Leu Arg Cys Gly Asp Cys
915 920 925
Leu Arg Arg Pro Leu Leu Cys Thr Lys Cys Ala Tyr Asp His Val Met
930 935 940
Gly Thr Lys His Lys Phe Ile Met Ser Ile Thr Pro Tyr Val Cys Ser
945 950 955 960
Phe Asn Gly Cys Asn Val Asn Asp Val Thr Lys Leu Phe Leu Gly Gly
965 970 975
Leu Ser Tyr Tyr Cys Met Asn His Lys Pro Gln Leu Ser Phe Pro Leu
980 985 990
Cys Ala Asn Gly Asn Val Phe Gly Leu Tyr Lys Ser Ser Ala Val Gly
995 1000 1005
Ser Glu Ala Val Glu Asp Phe Asn Lys Leu Ala Val Ser Asp Trp
1010 1015 1020
Thr Asn Val Glu Asp Tyr Lys Leu Ala Asn Asn Val Lys Glu Ser
1025 1030 1035
Leu Lys Ile Phe Ala Ala Glu Thr Val Lys Ala Lys Glu Glu Ser
1040 1045 1050
Val Lys Ser Glu Tyr Ala Tyr Ala Val Leu Lys Glu Val Ile Gly
1055 1060 1065
Pro Lys Glu Ile Val Leu Gln Trp Glu Ala Ser Lys Thr Lys Pro
1070 1075 1080
Pro Leu Asn Arg Asn Ser Val Phe Thr Cys Phe Gln Ile Ser Lys
1085 1090 1095
Asp Thr Lys Ile Gln Leu Gly Glu Phe Val Phe Glu Gln Ser Glu
1100 1105 1110
Tyr Gly Ser Asp Ser Val Tyr Tyr Lys Ser Thr Ser Thr Tyr Lys
1115 1120 1125
Leu Thr Pro Gly Met Ile Phe Val Leu Thr Ser His Asn Val Ser
1130 1135 1140
Pro Leu Lys Ala Pro Ile Leu Val Asn Gln Glu Lys Tyr Asn Thr
1145 1150 1155
Ile Ser Lys Leu Tyr Pro Val Phe Asn Ile Ala Glu Ala Tyr Asn
1160 1165 1170
Thr Leu Val Pro Tyr Tyr Gln Met Ile Gly Lys Gln Lys Phe Thr
1175 1180 1185
Thr Ile Gln Gly Pro Pro Gly Ser Gly Lys Ser His Cys Val Ile
1190 1195 1200
Gly Leu Gly Leu Tyr Tyr Pro Gln Ala Arg Ile Val Tyr Thr Ala
1205 1210 1215
Cys Ser His Ala Ala Val Asp Ala Leu Cys Glu Lys Ala Ala Lys
1220 1225 1230
Asn Phe Asn Val Asp Arg Cys Ser Arg Ile Ile Pro Gln Arg Ile
1235 1240 1245
Arg Val Asp Cys Tyr Thr Gly Phe Lys Pro Asn Asn Thr Asn Ala
1250 1255 1260
Gln Tyr Leu Phe Cys Thr Val Asn Ala Leu Pro Glu Ala Ser Cys
1265 1270 1275
Asp Ile Val Val Val Asp Glu Val Ser Met Cys Thr Asn Tyr Asp
1280 1285 1290
Leu Ser Val Ile Asn Ser Arg Leu Ser Tyr Lys His Ile Val Tyr
1295 1300 1305
Val Gly Asp Pro Gln Gln Leu Pro Ala Pro Arg Thr Leu Ile Asn
1310 1315 1320
Lys Gly Val Leu Gln Pro Gln Asp Tyr Asn Val Val Thr Lys Arg
1325 1330 1335
Met Cys Thr Leu Gly Pro Asp Val Phe Leu His Lys Cys Tyr Arg
1340 1345 1350
Cys Pro Ala Glu Ile Val Lys Thr Val Ser Ala Leu Val Tyr Glu
1355 1360 1365
Asn Lys Phe Val Pro Val Asn Pro Glu Ser Lys Gln Cys Phe Lys
1370 1375 1380
Met Phe Val Lys Gly Gln Val Gln Ile Glu Ser Asn Ser Ser Ile
1385 1390 1395
Asn Asn Lys Gln Leu Glu Val Val Lys Ala Phe Leu Ala His Asn
1400 1405 1410
Pro Lys Trp Arg Lys Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln
1415 1420 1425
Asn Tyr Val Ala Arg Arg Leu Leu Gly Leu Gln Thr Gln Thr Val
1430 1435 1440
Asp Ser Ala Gln Gly Ser Glu Tyr Asp Tyr Val Ile Tyr Thr Gln
1445 1450 1455
Thr Ser Asp Thr Gln His Ala Thr Asn Val Asn Arg Phe Asn Val
1460 1465 1470
Ala Ile Thr Arg Ala Lys Val Gly Ile Leu Cys Ile Met Cys Asp
1475 1480 1485
Arg Thr Met Tyr Glu Asn Leu Asp Phe Tyr Glu Leu Lys Asp Ser
1490 1495 1500
Lys Ile Gly Leu Gln Ala Lys Pro Glu Thr Cys Gly Leu Phe Lys
1505 1510 1515
Asp Cys Ser Lys Ser Glu Gln Tyr Ile Pro Pro Ala Tyr Ala Thr
1520 1525 1530
Thr Tyr Met Ser Leu Ser Asp Asn Phe Lys Thr Ser Asp Gly Leu
1535 1540 1545
Ala Val Asn Ile Gly Thr Lys Asp Val Lys Tyr Ala Asn Val Ile
1550 1555 1560
Ser Tyr Met Gly Phe Arg Phe Glu Ala Asn Ile Pro Gly Tyr His
1565 1570 1575
Thr Leu Phe Cys Thr Arg Asp Phe Ala Met Arg Asn Val Arg Ala
1580 1585 1590
Trp Leu Gly Phe Asp Val Glu Gly Ala His Val Cys Gly Asp Asn
1595 1600 1605
Val Gly Thr Asn Val Pro Leu Gln Leu Gly Phe Ser Asn Gly Val
1610 1615 1620
Asp Phe Val Val Gln Thr Glu Gly Cys Val Ile Thr Glu Lys Gly
1625 1630 1635
Asn Ser Ile Glu Val Val Lys Ala Arg Ala Pro Pro Gly Glu Gln
1640 1645 1650
Phe Ala His Leu Ile Pro Leu Met Arg Lys Gly Gln Pro Trp His
1655 1660 1665
Ile Val Arg Arg Arg Ile Val Gln Met Val Cys Asp Tyr Phe Asp
1670 1675 1680
Gly Leu Ser Asp Ile Leu Ile Phe Val Leu Trp Ala Gly Gly Leu
1685 1690 1695
Glu Leu Thr Thr Met Arg Tyr Phe Val Lys Ile Gly Arg Pro Gln
1700 1705 1710
Lys Cys Glu Cys Gly Lys Ser Ala Thr Cys Tyr Ser Ser Ser Gln
1715 1720 1725
Ser Val Tyr Ala Cys Phe Lys His Ala Leu Gly Cys Asp Tyr Leu
1730 1735 1740
Tyr Asn Pro Tyr Cys Ile Asp Ile Gln Gln Trp Gly Tyr Thr Gly
1745 1750 1755
Ser Leu Ser Met Asn His His Glu Val Cys Asn Ile His Arg Asn
1760 1765 1770
Glu His Val Ala Ser Gly Asp Ala Ile Met Thr Arg Cys Leu Ala
1775 1780 1785
Ile His Asp Cys Phe Val Lys Arg Val Asp Trp Ser Ile Val Tyr
1790 1795 1800
Pro Phe Ile Asp Asn Glu Glu Lys Ile Asn Lys Ala Gly Arg Ile
1805 1810 1815
Val Gln Ser His Val Met Lys Ala Ala Leu Lys Ile Phe Asn Pro
1820 1825 1830
Ala Ala Ile His Asp Val Gly Asn Pro Lys Gly Ile Arg Cys Ala
1835 1840 1845
Thr Thr Pro Ile Pro Trp Phe Cys Tyr Asp Arg Asp Pro Ile Asn
1850 1855 1860
Asn Asn Val Arg Cys Leu Asp Tyr Asp Tyr Met Val His Gly Gln
1865 1870 1875
Met Asn Gly Leu Met Leu Phe Trp Asn Cys Asn Val Asp Met Tyr
1880 1885 1890
Pro Glu Phe Ser Ile Val Cys Arg Phe Asp Thr Arg Thr Arg Ser
1895 1900 1905
Lys Leu Ser Leu Glu Gly Cys Asn Gly Gly Ala Leu Tyr Val Asn
1910 1915 1920
Asn His Ala Phe His Thr Pro Ala Tyr Asp Arg Arg Ala Phe Ala
1925 1930 1935
Lys Leu Lys Pro Met Pro Phe Phe Tyr Tyr Asp Asp Ser Asn Cys
1940 1945 1950
Glu Leu Val Asp Gly Gln Pro Asn Tyr Val Pro Leu Lys Ser Asn
1955 1960 1965
Val Cys Ile Thr Lys Cys Asn Ile Gly Gly Ala Val Cys Lys Lys
1970 1975 1980
His Ala Ala Leu Tyr Arg Ala Tyr Val Glu Asp Tyr Asn Ile Phe
1985 1990 1995
Met Gln Ala Gly Phe Thr Ile Trp Cys Pro Gln Asn Phe Asp Thr
2000 2005 2010
Tyr Met Leu Trp His Gly Phe Val Asn Ser Lys Ala Leu Gln Ser
2015 2020 2025
Leu Glu Asn Val Ala Phe Asn Val Val Lys Lys Gly Ala Phe Thr
2030 2035 2040
Gly Leu Lys Gly Asp Leu Pro Thr Ala Val Ile Ala Asp Lys Ile
2045 2050 2055
Met Val Arg Asp Gly Pro Thr Asp Lys Cys Ile Phe Thr Asn Lys
2060 2065 2070
Thr Ser Leu Pro Thr Asn Val Ala Phe Glu Leu Tyr Ala Lys Arg
2075 2080 2085
Lys Leu Gly Leu Thr Pro Pro Leu Thr Ile Leu Arg Asn Leu Gly
2090 2095 2100
Val Val Ala Thr Tyr Lys Phe Val Leu Trp Asp Tyr Glu Ala Glu
2105 2110 2115
Arg Pro Phe Ser Asn Phe Thr Lys Gln Val Cys Ser Tyr Thr Asp
2120 2125 2130
Leu Asp Ser Glu Val Val Thr Cys Phe Asp Asn Ser Ile Ala Gly
2135 2140 2145
Ser Phe Glu Arg Phe Thr Thr Thr Arg Asp Ala Val Leu Ile Ser
2150 2155 2160
Asn Asn Ala Val Lys Gly Leu Ser Ala Ile Lys Leu Gln Tyr Gly
2165 2170 2175
Leu Leu Asn Asp Leu Pro Val Ser Thr Val Gly Asn Lys Pro Val
2180 2185 2190
Thr Trp Tyr Ile Tyr Val Arg Lys Asn Gly Glu Tyr Val Glu Gln
2195 2200 2205
Ile Asp Ser Tyr Tyr Thr Gln Gly Arg Thr Phe Glu Thr Phe Lys
2210 2215 2220
Pro Arg Ser Thr Met Glu Glu Asp Phe Leu Ser Met Asp Thr Thr
2225 2230 2235
Leu Phe Ile Gln Lys Tyr Gly Leu Glu Asp Tyr Gly Phe Glu His
2240 2245 2250
Val Val Phe Gly Asp Val Ser Lys Thr Thr Ile Gly Gly Met His
2255 2260 2265
Leu Leu Ile Ser Gln Val Arg Leu Ala Lys Met Gly Leu Phe Ser
2270 2275 2280
Val Gln Glu Phe Met Asn Asn Ser Asp Ser Thr Leu Lys Ser Cys
2285 2290 2295
Cys Ile Thr Tyr Ala Asp Asp Pro Ser Ser Lys Asn Val Cys Thr
2300 2305 2310
Tyr Met Asp Ile Leu Leu Asp Asp Phe Val Thr Ile Ile Lys Ser
2315 2320 2325
Leu Asp Leu Asn Val Val Ser Lys Val Val Asp Val Ile Val Asp
2330 2335 2340
Cys Lys Ala Trp Arg Trp Met Leu Trp Cys Glu Asn Ser His Ile
2345 2350 2355
Lys Thr Phe Tyr Pro Gln Leu Gln Ser Ala Glu Trp Asn Pro Gly
2360 2365 2370
Tyr Ser Met Pro Thr Leu Tyr Lys Ile Gln Arg Met Cys Leu Glu
2375 2380 2385
Arg Cys Asn Leu Tyr Asn Tyr Gly Ala Gln Val Lys Leu Pro Asp
2390 2395 2400
Gly Ile Thr Thr Asn Val Val Lys Tyr Thr Gln Leu Cys Gln Tyr
2405 2410 2415
Leu Asn Thr Thr Thr Leu Cys Val Pro His Lys Met Arg Val Leu
2420 2425 2430
His Leu Gly Ala Ala Gly Ala Ser Gly Val Ala Pro Gly Ser Thr
2435 2440 2445
Val Leu Arg Arg Trp Leu Pro Asp Asp Ala Ile Leu Val Asp Asn
2450 2455 2460
Asp Leu Arg Asp Tyr Val Ser Asp Ala Asp Phe Ser Val Thr Gly
2465 2470 2475
Asp Cys Thr Ser Leu Tyr Ile Glu Asp Lys Phe Asp Leu Leu Val
2480 2485 2490
Ser Asp Leu Tyr Asp Gly Ser Thr Lys Ser Ile Asp Gly Glu Asn
2495 2500 2505
Thr Ser Lys Asp Gly Phe Phe Thr Tyr Ile Asn Gly Phe Ile Lys
2510 2515 2520
Glu Lys Leu Ser Leu Gly Gly Ser Val Ala Ile Lys Ile Thr Glu
2525 2530 2535
Phe Ser Trp Asn Lys Asp Leu Tyr Glu Leu Ile Gln Arg Phe Glu
2540 2545 2550
Tyr Trp Thr Val Phe Cys Thr Ser Val Asn Thr Ser Ser Ser Glu
2555 2560 2565
Gly Phe Leu Ile Gly Ile Asn Tyr Leu Gly Pro Tyr Cys Asp Lys
2570 2575 2580
Ala Ile Val Asp Gly Asn Ile Met His Ala Asn Tyr Ile Phe Trp
2585 2590 2595
Arg Asn Ser Thr Ile Met Ala Leu Ser His Asn Ser Val Leu Asp
2600 2605 2610
Thr Pro Lys Phe Lys Cys Arg Cys Asn Asn Ala Leu Ile Val Asn
2615 2620 2625
Leu Lys Glu Lys Glu Leu Asn Glu Met Val Ile Gly Leu Leu Arg
2630 2635 2640
Lys Gly Lys Leu Leu Ile Arg Asn Asn Gly Lys Leu Leu Asn Phe
2645 2650 2655
Gly Asn His Phe Val Asn Thr Pro
2660 2665
<210> 30
<211> 1447
<212> PRT
<213> transmissible gastroenteritus virus
<220>
<221> MISC_FEATURE
<223> Spike protein
<400> 30
Met Lys Lys Leu Phe Val Val Leu Val Val Met Pro Leu Ile Tyr Gly
1 5 10 15
Asp Asn Phe Pro Cys Ser Lys Leu Thr Asn Arg Thr Ile Gly Asn Gln
20 25 30
Trp Asn Leu Ile Glu Thr Phe Leu Leu Asn Tyr Ser Ser Arg Leu Pro
35 40 45
Pro Asn Ser Asp Val Val Leu Gly Asp Tyr Phe Pro Thr Val Gln Pro
50 55 60
Trp Phe Asn Cys Ile Arg Asn Asp Ser Asn Asp Leu Tyr Val Thr Leu
65 70 75 80
Glu Asn Leu Lys Ala Leu Tyr Trp Asp Tyr Ala Thr Glu Asn Ile Thr
85 90 95
Trp Asn His Arg Gln Arg Leu Asn Val Val Val Asn Gly Tyr Pro Tyr
100 105 110
Ser Ile Thr Val Thr Thr Thr Arg Asn Phe Asn Ser Ala Glu Gly Ala
115 120 125
Ile Ile Cys Ile Cys Lys Gly Ser Pro Pro Thr Thr Thr Thr Glu Ser
130 135 140
Ser Leu Thr Cys Asn Trp Gly Ser Glu Cys Arg Leu Asn His Lys Phe
145 150 155 160
Pro Ile Cys Pro Ser Asn Ser Glu Ala Asn Cys Gly Asn Met Leu Tyr
165 170 175
Gly Leu Gln Trp Phe Ala Asp Glu Val Val Ala Tyr Leu His Gly Ala
180 185 190
Ser Tyr Arg Ile Ser Phe Glu Asn Gln Trp Ser Gly Thr Val Thr Phe
195 200 205
Gly Asp Met Arg Ala Thr Thr Leu Glu Val Ala Gly Thr Leu Val Asp
210 215 220
Leu Trp Trp Phe Asn Pro Val Tyr Asp Val Ser Tyr Tyr Arg Val Asn
225 230 235 240
Asn Lys Asn Gly Thr Thr Val Val Ser Asn Cys Thr Asp Gln Cys Ala
245 250 255
Ser Tyr Val Ala Asn Val Phe Thr Thr Gln Pro Gly Gly Phe Ile Pro
260 265 270
Ser Asp Phe Ser Phe Asn Asn Trp Phe Leu Leu Thr Asn Ser Ser Thr
275 280 285
Leu Val Ser Gly Lys Leu Val Thr Lys Gln Pro Leu Leu Val Asn Cys
290 295 300
Leu Trp Pro Val Pro Ser Phe Glu Glu Ala Ala Ser Thr Phe Cys Phe
305 310 315 320
Glu Gly Ala Gly Phe Asp Gln Cys Asn Gly Ala Val Leu Asn Asn Thr
325 330 335
Val Asp Val Ile Arg Phe Asn Leu Asn Phe Thr Thr Asn Val Gln Ser
340 345 350
Gly Lys Gly Ala Thr Val Phe Ser Leu Asn Thr Thr Gly Gly Val Thr
355 360 365
Leu Glu Ile Ser Cys Tyr Thr Val Ser Asp Ser Ser Phe Phe Ser Tyr
370 375 380
Gly Glu Ile Pro Phe Gly Val Thr Asp Gly Pro Arg Tyr Cys Tyr Val
385 390 395 400
His Tyr Asn Gly Thr Ala Leu Lys Tyr Leu Gly Thr Leu Pro Pro Ser
405 410 415
Val Lys Glu Ile Ala Ile Ser Lys Trp Gly His Phe Tyr Ile Asn Gly
420 425 430
Tyr Asn Phe Phe Ser Thr Phe Pro Ile Asp Cys Ile Ser Phe Asn Leu
435 440 445
Thr Thr Gly Asp Ser Asp Val Phe Trp Thr Ile Ala Tyr Thr Ser Tyr
450 455 460
Thr Glu Ala Leu Val Gln Val Glu Asn Thr Ala Ile Thr Lys Val Thr
465 470 475 480
Tyr Cys Asn Ser His Val Asn Asn Ile Lys Cys Ser Gln Ile Thr Ala
485 490 495
Asn Leu Asn Asn Gly Phe Tyr Pro Val Ser Ser Ser Glu Val Gly Leu
500 505 510
Val Asn Lys Ser Val Val Leu Leu Pro Ser Phe Tyr Thr His Thr Ile
515 520 525
Val Asn Ile Thr Ile Gly Leu Gly Met Lys Arg Ser Gly Tyr Gly Gln
530 535 540
Pro Ile Ala Ser Thr Leu Ser Asn Ile Thr Leu Pro Met Gln Asp His
545 550 555 560
Asn Thr Asp Val Tyr Cys Ile Arg Ser Asp Gln Phe Ser Val Tyr Val
565 570 575
His Ser Thr Cys Lys Ser Ala Leu Trp Asp Asn Ile Phe Lys Arg Asn
580 585 590
Cys Thr Asp Val Leu Asp Ala Thr Ala Val Ile Lys Thr Gly Thr Cys
595 600 605
Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr Leu Thr Phe Asn Lys Phe
610 615 620
Cys Leu Ser Leu Ser Pro Val Gly Ala Asn Cys Lys Phe Asp Val Ala
625 630 635 640
Ala Arg Thr Arg Thr Asn Glu Gln Val Val Arg Ser Leu Tyr Val Ile
645 650 655
Tyr Glu Glu Gly Asp Asn Ile Val Gly Val Pro Ser Asp Asn Ser Gly
660 665 670
Val His Asp Leu Ser Val Leu His Leu Asp Ser Cys Thr Asp Tyr Asn
675 680 685
Ile Tyr Gly Arg Thr Gly Val Gly Ile Ile Arg Gln Thr Asn Arg Thr
690 695 700
Leu Leu Ser Gly Leu Tyr Tyr Thr Ser Leu Ser Gly Asp Leu Leu Gly
705 710 715 720
Phe Lys Asn Val Ser Asp Gly Val Ile Tyr Ser Val Thr Pro Cys Asp
725 730 735
Val Ser Ala Gln Ala Ala Val Ile Asp Gly Thr Ile Val Gly Ala Ile
740 745 750
Thr Ser Ile Asn Ser Glu Leu Leu Gly Leu Thr His Trp Thr Thr Thr
755 760 765
Pro Asn Phe Tyr Tyr Tyr Ser Ile Tyr Asn Tyr Thr Asn Asp Arg Thr
770 775 780
Arg Gly Thr Ala Ile Asp Ser Asn Asp Val Asp Cys Glu Pro Val Ile
785 790 795 800
Thr Tyr Ser Asn Ile Gly Val Cys Lys Asn Gly Ala Phe Val Phe Ile
805 810 815
Asn Val Thr His Ser Asp Gly Asp Val Gln Pro Ile Ser Thr Gly Asn
820 825 830
Val Thr Ile Pro Thr Asn Phe Thr Ile Ser Val Gln Val Glu Tyr Ile
835 840 845
Gln Val Tyr Thr Thr Pro Val Ser Ile Asp Cys Ser Arg Tyr Val Cys
850 855 860
Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu Thr Gln Tyr Val Ser Ala
865 870 875 880
Cys Gln Thr Ile Glu Gln Ala Leu Ala Met Gly Ala Arg Leu Glu Asn
885 890 895
Met Glu Val Asp Ser Met Leu Phe Val Ser Glu Asn Ala Leu Lys Leu
900 905 910
Ala Ser Val Glu Ala Phe Asn Ser Ser Glu Thr Leu Asp Pro Ile Tyr
915 920 925
Lys Glu Trp Pro Asn Ile Gly Gly Ser Trp Leu Glu Gly Leu Lys Tyr
930 935 940
Ile Leu Pro Ser His Asn Ser Lys Arg Lys Tyr Arg Ser Ala Ile Glu
945 950 955 960
Asp Leu Leu Phe Asp Lys Val Val Thr Ser Gly Leu Gly Thr Val Asp
965 970 975
Glu Asp Tyr Lys Arg Cys Thr Gly Gly Tyr Asp Ile Ala Asp Leu Val
980 985 990
Cys Ala Gln Tyr Tyr Asn Gly Ile Met Val Leu Pro Gly Val Ala Asn
995 1000 1005
Ala Asp Lys Met Thr Met Tyr Thr Ala Ser Leu Ala Gly Gly Ile
1010 1015 1020
Thr Leu Gly Ala Leu Gly Gly Gly Ala Val Ala Ile Pro Phe Ala
1025 1030 1035
Val Ala Val Gln Ala Arg Leu Asn Tyr Val Ala Leu Gln Thr Asp
1040 1045 1050
Val Leu Asn Lys Asn Gln Gln Ile Leu Ala Ser Ala Phe Asn Gln
1055 1060 1065
Ala Ile Gly Asn Ile Thr Gln Ser Phe Gly Lys Val Asn Asp Ala
1070 1075 1080
Ile His Gln Thr Ser Arg Gly Leu Ala Thr Val Ala Lys Ala Leu
1085 1090 1095
Ala Lys Val Gln Asp Val Val Asn Ile Gln Gly Gln Ala Leu Ser
1100 1105 1110
His Leu Thr Val Gln Leu Gln Asn Asn Phe Gln Ala Ile Ser Ser
1115 1120 1125
Ser Ile Ser Asp Ile Tyr Asn Arg Leu Asp Glu Leu Ser Ala Asp
1130 1135 1140
Ala Gln Val Asp Arg Leu Ile Thr Gly Arg Leu Thr Ala Leu Asn
1145 1150 1155
Ala Phe Val Ser Gln Thr Leu Thr Arg Gln Ala Glu Val Arg Ala
1160 1165 1170
Ser Arg Gln Leu Ala Lys Asp Lys Val Asn Glu Cys Val Arg Ser
1175 1180 1185
Gln Ser Gln Arg Phe Gly Phe Cys Gly Asn Gly Thr His Leu Phe
1190 1195 1200
Ser Leu Ala Asn Ala Ala Pro Asn Gly Met Ile Phe Phe His Thr
1205 1210 1215
Val Leu Leu Pro Thr Ala Tyr Glu Thr Val Thr Ala Trp Pro Gly
1220 1225 1230
Ile Cys Ala Ser Asp Gly Asp Arg Thr Phe Gly Leu Val Val Lys
1235 1240 1245
Asp Val Gln Leu Thr Leu Phe Arg Asn Leu Asp Asp Lys Phe Tyr
1250 1255 1260
Leu Thr Pro Arg Thr Met Tyr Gln Pro Arg Val Ala Thr Ser Ser
1265 1270 1275
Asp Phe Val Gln Ile Glu Gly Cys Asp Val Leu Phe Val Asn Ala
1280 1285 1290
Thr Val Ser Asp Leu Pro Ser Ile Ile Pro Asp Tyr Ile Asp Ile
1295 1300 1305
Asn Gln Thr Val Gln Asp Ile Leu Glu Asn Phe Arg Pro Asn Trp
1310 1315 1320
Thr Val Pro Glu Leu Thr Phe Asp Ile Phe Asn Ala Thr Tyr Leu
1325 1330 1335
Asn Leu Thr Gly Glu Ile Asp Asp Leu Glu Phe Arg Ser Glu Lys
1340 1345 1350
Leu His Asn Thr Thr Val Glu Leu Ala Ile Leu Ile Asp Asn Ile
1355 1360 1365
Asn Asn Thr Leu Val Asn Leu Glu Trp Leu Asn Arg Ile Glu Thr
1370 1375 1380
Tyr Val Lys Trp Pro Trp Tyr Val Trp Leu Leu Ile Gly Leu Val
1385 1390 1395
Val Ile Phe Cys Ile Pro Leu Leu Leu Phe Cys Cys Cys Ser Thr
1400 1405 1410
Gly Cys Cys Gly Cys Ile Gly Cys Leu Gly Ser Cys Cys His Ser
1415 1420 1425
Ile Cys Ser Arg Arg Gln Phe Glu Asn Tyr Glu Pro Ile Glu Lys
1430 1435 1440
Val His Val His
1445
<210> 31
<211> 1162
<212> PRT
<213> avian infectious bronchitis virus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 31
Met Leu Val Thr Pro Leu Leu Leu Val Thr Leu Leu Cys Ala Leu Cys
1 5 10 15
Ser Ala Val Leu Tyr Asp Ser Ser Ser Tyr Val Tyr Tyr Tyr Gln Ser
20 25 30
Ala Phe Arg Pro Pro Ser Gly Trp His Leu Gln Gly Gly Ala Tyr Ala
35 40 45
Val Val Asn Ile Ser Ser Glu Phe Asn Asn Ala Gly Ser Ser Ser Gly
50 55 60
Cys Thr Val Gly Ile Ile His Gly Gly Arg Val Val Asn Ala Ser Ser
65 70 75 80
Ile Ala Met Thr Ala Pro Ser Ser Gly Met Ala Trp Ser Ser Ser Gln
85 90 95
Phe Cys Thr Ala His Cys Asn Phe Ser Asp Thr Thr Val Phe Val Thr
100 105 110
His Cys Tyr Lys His Gly Gly Cys Pro Leu Thr Gly Met Leu Gln Gln
115 120 125
Asn Leu Ile Arg Val Ser Ala Met Lys Asn Gly Gln Leu Phe Tyr Asn
130 135 140
Leu Thr Val Ser Val Ala Lys Tyr Pro Thr Phe Arg Ser Phe Gln Cys
145 150 155 160
Val Asn Asn Leu Thr Ser Val Tyr Leu Asn Gly Asp Leu Val Tyr Thr
165 170 175
Ser Asn Glu Thr Ile Asp Val Thr Ser Ala Gly Val Tyr Phe Lys Ala
180 185 190
Gly Gly Pro Ile Thr Tyr Lys Val Met Arg Glu Val Lys Ala Leu Ala
195 200 205
Tyr Phe Val Asn Gly Thr Ala Gln Asp Val Ile Leu Cys Asp Gly Ser
210 215 220
Pro Arg Gly Leu Leu Ala Cys Gln Tyr Asn Thr Gly Asn Phe Ser Asp
225 230 235 240
Gly Phe Tyr Pro Phe Thr Asn Ser Ser Leu Val Lys Gln Lys Phe Ile
245 250 255
Val Tyr Arg Glu Asn Ser Val Asn Thr Thr Cys Thr Leu His Asn Phe
260 265 270
Ile Phe His Asn Glu Thr Gly Ala Asn Pro Asn Pro Ser Gly Val Gln
275 280 285
Asn Ile Gln Thr Tyr Gln Thr Lys Thr Ala Gln Ser Gly Tyr Tyr Asn
290 295 300
Phe Asn Phe Ser Phe Leu Ser Ser Phe Val Tyr Lys Glu Ser Asn Phe
305 310 315 320
Met Tyr Gly Ser Tyr His Pro Ser Cys Lys Phe Arg Leu Glu Thr Ile
325 330 335
Asn Asn Gly Leu Trp Phe Asn Ser Leu Ser Val Ser Ile Ala Tyr Gly
340 345 350
Pro Leu Gln Gly Gly Cys Lys Gln Ser Val Phe Lys Gly Arg Ala Thr
355 360 365
Cys Cys Tyr Ala Tyr Ser Tyr Gly Gly Pro Ser Leu Cys Lys Gly Val
370 375 380
Tyr Ser Gly Glu Leu Asp His Asn Phe Glu Cys Gly Leu Leu Val Tyr
385 390 395 400
Val Thr Lys Ser Gly Gly Ser Arg Ile Gln Thr Ala Thr Glu Pro Pro
405 410 415
Val Ile Thr Gln Asn Asn Tyr Asn Asn Ile Thr Leu Asn Thr Cys Val
420 425 430
Asp Tyr Asn Ile Tyr Gly Arg Thr Gly Gln Gly Phe Ile Thr Asn Val
435 440 445
Thr Asp Ser Ala Val Ser Tyr Asn Tyr Leu Ala Asp Ala Gly Leu Ala
450 455 460
Ile Leu Asp Thr Ser Gly Ser Ile Asp Ile Phe Val Val Gln Gly Glu
465 470 475 480
Tyr Gly Leu Asn Tyr Tyr Lys Val Asn Pro Cys Glu Asp Val Asn Gln
485 490 495
Gln Phe Val Val Ser Gly Gly Lys Leu Val Gly Ile Leu Thr Ser Arg
500 505 510
Asn Glu Thr Gly Ser Gln Leu Leu Glu Asn Gln Phe Tyr Ile Lys Ile
515 520 525
Thr Asn Gly Thr Arg Arg Phe Arg Arg Ser Ile Thr Glu Asn Val Ala
530 535 540
Asn Cys Pro Tyr Val Ser Tyr Gly Lys Phe Cys Ile Lys Pro Asp Gly
545 550 555 560
Ser Ile Ala Thr Ile Val Pro Lys Gln Leu Glu Gln Phe Val Ala Pro
565 570 575
Leu Phe Asn Val Thr Glu Asn Val Leu Ile Pro Asn Ser Phe Asn Leu
580 585 590
Thr Val Thr Asp Glu Tyr Ile Gln Thr Arg Met Asp Lys Val Gln Ile
595 600 605
Asn Cys Leu Gln Tyr Val Cys Gly Ser Ser Leu Asp Cys Arg Lys Leu
610 615 620
Phe Gln Gln Tyr Gly Pro Val Cys Asp Asn Ile Leu Ser Val Val Asn
625 630 635 640
Ser Val Gly Gln Lys Glu Asp Met Glu Leu Leu Asn Phe Tyr Ser Ser
645 650 655
Thr Lys Pro Ala Gly Phe Asn Thr Pro Val Leu Ser Asn Val Ser Thr
660 665 670
Gly Glu Phe Asn Ile Ser Leu Leu Leu Thr Asn Pro Ser Ser Arg Arg
675 680 685
Lys Arg Ser Leu Ile Glu Asp Leu Leu Phe Thr Ser Val Glu Ser Val
690 695 700
Gly Leu Pro Thr Asn Asp Ala Tyr Lys Asn Cys Thr Ala Gly Pro Leu
705 710 715 720
Gly Phe Phe Lys Asp Leu Ala Cys Ala Arg Glu Tyr Asn Gly Leu Leu
725 730 735
Val Leu Pro Pro Ile Ile Thr Ala Glu Met Gln Ala Leu Tyr Thr Ser
740 745 750
Ser Leu Val Ala Ser Met Ala Phe Gly Gly Ile Thr Ala Ala Gly Ala
755 760 765
Ile Pro Phe Ala Thr Gln Leu Gln Ala Arg Ile Asn His Leu Gly Ile
770 775 780
Thr Gln Ser Leu Leu Leu Lys Asn Gln Glu Lys Ile Ala Ala Ser Phe
785 790 795 800
Asn Lys Ala Ile Gly His Met Gln Glu Gly Phe Arg Ser Thr Ser Leu
805 810 815
Ala Leu Gln Gln Ile Gln Asp Val Val Ser Lys Gln Ser Ala Ile Leu
820 825 830
Thr Glu Thr Met Ala Ser Leu Asn Lys Asn Phe Gly Ala Ile Ser Ser
835 840 845
Val Ile Gln Glu Ile Tyr Gln Gln Phe Asp Ala Ile Gln Ala Asn Ala
850 855 860
Gln Val Asp Arg Leu Ile Thr Gly Arg Leu Ser Ser Leu Ser Val Leu
865 870 875 880
Ala Ser Ala Lys Gln Ala Glu Tyr Ile Arg Val Ser Gln Gln Arg Glu
885 890 895
Leu Ala Thr Gln Lys Ile Asn Glu Cys Val Lys Ser Gln Ser Ile Arg
900 905 910
Tyr Ser Phe Cys Gly Asn Gly Arg His Val Leu Thr Ile Pro Gln Asn
915 920 925
Ala Pro Asn Gly Ile Val Phe Ile His Phe Ser Tyr Thr Pro Asp Ser
930 935 940
Phe Val Asn Val Thr Ala Ile Val Gly Phe Cys Val Lys Pro Ala Asn
945 950 955 960
Ala Ser Gln Tyr Ala Ile Val Pro Ala Asn Gly Arg Gly Ile Phe Ile
965 970 975
Gln Val Asn Gly Ser Tyr Tyr Ile Thr Ala Arg Asp Met Tyr Met Pro
980 985 990
Arg Ala Ile Thr Ala Gly Asp Val Val Thr Leu Thr Ser Cys Gln Ala
995 1000 1005
Asn Tyr Val Ser Val Asn Lys Thr Val Ile Thr Thr Phe Val Asp
1010 1015 1020
Asn Asp Asp Phe Asp Phe Asn Asp Glu Leu Ser Lys Trp Trp Asn
1025 1030 1035
Asp Thr Lys His Glu Leu Pro Asp Phe Asp Lys Phe Asn Tyr Thr
1040 1045 1050
Val Pro Ile Leu Asp Ile Asp Ser Glu Ile Asp Arg Ile Gln Gly
1055 1060 1065
Val Ile Gln Gly Leu Asn Asp Ser Leu Ile Asp Leu Glu Lys Leu
1070 1075 1080
Ser Ile Leu Lys Thr Tyr Ile Lys Trp Pro Trp Tyr Val Trp Leu
1085 1090 1095
Ala Ile Ala Phe Ala Thr Ile Ile Phe Ile Leu Ile Leu Gly Trp
1100 1105 1110
Val Phe Phe Met Thr Gly Cys Cys Gly Cys Cys Cys Gly Cys Phe
1115 1120 1125
Gly Ile Met Pro Leu Met Ser Lys Cys Gly Lys Lys Ser Ser Tyr
1130 1135 1140
Tyr Thr Thr Phe Asp Asn Asp Val Val Thr Glu Gln Tyr Arg Pro
1145 1150 1155
Lys Lys Ser Val
1160
<210> 32
<211> 1363
<212> PRT
<213> bovine coronavirus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 32
Met Phe Leu Ile Leu Leu Ile Ser Leu Pro Met Ala Phe Ala Val Ile
1 5 10 15
Gly Asp Leu Lys Cys Thr Thr Val Ser Ile Asn Asp Val Asp Thr Gly
20 25 30
Ala Pro Ser Ile Ser Thr Asp Ile Val Asp Val Thr Asn Gly Leu Gly
35 40 45
Thr Tyr Tyr Val Leu Asp Arg Val Tyr Leu Asn Thr Thr Leu Leu Leu
50 55 60
Asn Gly Tyr Tyr Pro Thr Ser Gly Ser Thr Tyr Arg Asn Met Ala Leu
65 70 75 80
Lys Gly Thr Leu Leu Leu Ser Arg Leu Trp Phe Lys Pro Pro Phe Leu
85 90 95
Ser Asp Phe Ile Asn Gly Ile Phe Ala Lys Val Lys Asn Thr Lys Val
100 105 110
Ile Lys Lys Gly Val Met Tyr Ser Glu Phe Pro Ala Ile Thr Ile Gly
115 120 125
Ser Thr Phe Val Asn Thr Ser Tyr Ser Val Val Val Gln Pro His Thr
130 135 140
Thr Asn Leu Asp Asn Lys Leu Gln Gly Leu Leu Glu Ile Ser Val Cys
145 150 155 160
Gln Tyr Thr Met Cys Glu Tyr Pro His Thr Ile Cys His Pro Lys Leu
165 170 175
Gly Asn Lys Arg Val Glu Leu Trp His Trp Asp Thr Gly Val Val Ser
180 185 190
Cys Leu Tyr Lys Arg Asn Phe Thr Tyr Asp Val Asn Ala Asp Tyr Leu
195 200 205
Tyr Phe His Phe Tyr Gln Glu Gly Gly Thr Phe Tyr Ala Tyr Phe Thr
210 215 220
Asp Thr Gly Val Val Thr Lys Phe Leu Phe Asn Val Tyr Leu Gly Thr
225 230 235 240
Val Leu Ser His Tyr Tyr Val Leu Pro Leu Thr Cys Ser Ser Ala Met
245 250 255
Thr Leu Glu Tyr Trp Val Thr Pro Leu Thr Ser Lys Gln Tyr Leu Leu
260 265 270
Ala Phe Asn Gln Asp Gly Val Ile Phe Asn Ala Val Asp Cys Lys Ser
275 280 285
Asp Phe Met Ser Glu Ile Lys Cys Lys Thr Leu Ser Ile Ala Pro Ser
290 295 300
Thr Gly Val Tyr Glu Leu Asn Gly Tyr Thr Val Gln Pro Ile Ala Asp
305 310 315 320
Val Tyr Arg Arg Ile Pro Asn Leu Pro Asp Cys Asn Ile Glu Ala Trp
325 330 335
Leu Asn Asp Lys Ser Val Pro Ser Pro Leu Asn Trp Glu Arg Lys Thr
340 345 350
Phe Ser Asn Cys Asn Phe Asn Met Ser Ser Leu Met Ser Phe Ile Gln
355 360 365
Ala Asp Ser Phe Thr Cys Asn Asn Ile Asp Ala Ala Lys Ile Tyr Gly
370 375 380
Met Cys Phe Ser Ser Ile Thr Ile Asp Lys Phe Ala Ile Pro Asn Gly
385 390 395 400
Arg Lys Val Asp Leu Gln Leu Gly Asn Leu Gly Tyr Leu Gln Ser Phe
405 410 415
Asn Tyr Arg Ile Asp Thr Thr Ala Thr Ser Cys Gln Leu Tyr Tyr Asn
420 425 430
Leu Pro Ala Ala Asn Val Ser Val Ser Arg Phe Asn Pro Ser Thr Trp
435 440 445
Asn Arg Arg Phe Gly Phe Thr Glu Gln Phe Val Phe Lys Pro Gln Pro
450 455 460
Val Gly Val Phe Thr His His Asp Val Val Tyr Ala Gln His Cys Phe
465 470 475 480
Lys Ala Pro Lys Asn Phe Cys Pro Cys Lys Leu Asp Gly Ser Leu Cys
485 490 495
Val Gly Asn Gly Pro Gly Ile Asp Ala Gly Tyr Lys Asn Ser Gly Ile
500 505 510
Gly Thr Cys Pro Ala Gly Thr Asn Tyr Leu Thr Cys His Asn Ala Ala
515 520 525
Gln Cys Asp Cys Leu Cys Thr Pro Asp Pro Ile Thr Ser Lys Ser Thr
530 535 540
Gly Pro Tyr Lys Cys Pro Gln Thr Lys Tyr Leu Val Gly Ile Gly Glu
545 550 555 560
His Cys Ser Gly Leu Ala Ile Lys Ser Asp Tyr Cys Gly Gly Asn Pro
565 570 575
Cys Thr Cys Gln Pro Gln Ala Phe Leu Gly Trp Ser Val Asp Ser Cys
580 585 590
Leu Gln Gly Asp Arg Cys Asn Ile Phe Ala Asn Phe Ile Phe His Asp
595 600 605
Val Asn Ser Gly Thr Thr Cys Ser Thr Asp Leu Gln Lys Ser Asn Thr
610 615 620
Asp Ile Ile Leu Gly Val Cys Val Asn Tyr Asp Leu Tyr Gly Ile Thr
625 630 635 640
Gly Gln Gly Ile Phe Val Glu Val Asn Ala Thr Tyr Tyr Asn Ser Trp
645 650 655
Gln Asn Leu Leu Tyr Asp Ser Asn Gly Asn Leu Tyr Gly Phe Arg Asp
660 665 670
Tyr Leu Thr Asn Arg Thr Phe Met Ile Arg Ser Cys Tyr Ser Gly Arg
675 680 685
Val Ser Ala Ala Phe His Ala Asn Ser Ser Glu Pro Ala Leu Leu Phe
690 695 700
Arg Asn Ile Lys Cys Asn Tyr Val Phe Asn Asn Thr Leu Ser Arg Gln
705 710 715 720
Leu Gln Pro Ile Asn Tyr Phe Asp Ser Tyr Leu Gly Cys Val Val Asn
725 730 735
Ala Asp Asn Ser Thr Ser Ser Val Val Gln Thr Cys Asp Leu Thr Val
740 745 750
Gly Ser Gly Tyr Cys Val Asp Tyr Ser Thr Lys Arg Arg Ser Arg Arg
755 760 765
Ala Ile Thr Thr Gly Tyr Arg Phe Thr Asn Phe Glu Pro Phe Thr Val
770 775 780
Asn Ser Val Asn Asp Ser Leu Glu Pro Val Gly Gly Leu Tyr Glu Ile
785 790 795 800
Gln Ile Pro Ser Glu Phe Thr Ile Gly Asn Met Glu Glu Phe Ile Gln
805 810 815
Thr Ser Ser Pro Lys Val Thr Ile Asp Cys Ser Ala Phe Val Cys Gly
820 825 830
Asp Tyr Ala Ala Cys Lys Ser Gln Leu Val Glu Tyr Gly Ser Phe Cys
835 840 845
Asp Asn Ile Asn Ala Ile Leu Thr Glu Val Asn Glu Leu Leu Asp Thr
850 855 860
Thr Gln Leu Gln Val Ala Asn Ser Leu Met Asn Gly Val Thr Leu Ser
865 870 875 880
Thr Lys Leu Lys Asp Gly Val Asn Phe Asn Val Asp Asp Ile Asn Phe
885 890 895
Ser Pro Val Leu Gly Cys Leu Gly Ser Ala Cys Asn Lys Val Ser Ser
900 905 910
Arg Ser Ala Ile Glu Asp Leu Leu Phe Ser Lys Val Lys Leu Ser Asp
915 920 925
Val Gly Phe Val Glu Ala Tyr Asn Asn Cys Thr Gly Gly Ala Glu Ile
930 935 940
Arg Asp Leu Ile Cys Val Gln Ser Tyr Asn Gly Ile Lys Val Leu Pro
945 950 955 960
Pro Leu Leu Ser Val Asn Gln Ile Ser Gly Tyr Thr Leu Ala Ala Thr
965 970 975
Ser Ala Ser Leu Phe Pro Pro Leu Ser Ala Ala Val Gly Val Pro Phe
980 985 990
Tyr Leu Asn Val Gln Tyr Arg Ile Asn Gly Ile Gly Val Thr Met Asp
995 1000 1005
Val Leu Ser Gln Asn Gln Lys Leu Ile Ala Asn Ala Phe Asn Asn
1010 1015 1020
Ala Leu Asp Ala Ile Gln Glu Gly Phe Asp Ala Thr Asn Ser Ala
1025 1030 1035
Leu Val Lys Ile Gln Ala Val Val Asn Ala Asn Ala Glu Ala Leu
1040 1045 1050
Asn Asn Leu Leu Gln Gln Leu Ser Asn Arg Phe Gly Ala Ile Ser
1055 1060 1065
Ser Ser Leu Gln Glu Ile Leu Ser Arg Leu Asp Ala Leu Glu Ala
1070 1075 1080
Gln Ala Gln Ile Asp Arg Leu Ile Asn Gly Arg Leu Thr Ala Leu
1085 1090 1095
Asn Val Tyr Val Ser Gln Gln Leu Ser Asp Ser Thr Leu Val Lys
1100 1105 1110
Phe Ser Ala Ala Gln Ala Met Glu Lys Val Asn Glu Cys Val Lys
1115 1120 1125
Ser Gln Ser Ser Arg Ile Asn Phe Cys Gly Asn Gly Asn His Ile
1130 1135 1140
Ile Ser Leu Val Gln Asn Ala Pro Tyr Gly Leu Tyr Phe Ile His
1145 1150 1155
Phe Ser Tyr Val Pro Thr Lys Tyr Val Thr Ala Lys Val Ser Pro
1160 1165 1170
Gly Leu Cys Ile Ala Gly Asp Arg Gly Ile Ala Pro Lys Ser Gly
1175 1180 1185
Tyr Phe Val Asn Val Asn Asn Thr Trp Met Phe Thr Gly Ser Gly
1190 1195 1200
Tyr Tyr Tyr Pro Glu Pro Ile Thr Gly Asn Asn Val Val Val Met
1205 1210 1215
Ser Thr Cys Ala Val Asn Tyr Thr Lys Ala Pro Asp Val Met Leu
1220 1225 1230
Asn Ile Ser Thr Pro Asn Leu His Asp Phe Lys Glu Glu Leu Asp
1235 1240 1245
Gln Trp Phe Lys Asn Gln Thr Ser Val Ala Pro Asp Leu Ser Leu
1250 1255 1260
Asp Tyr Ile Asn Val Thr Phe Leu Asp Leu Gln Asp Glu Met Asn
1265 1270 1275
Arg Leu Gln Glu Ala Ile Lys Val Leu Asn Gln Ser Tyr Ile Asn
1280 1285 1290
Leu Lys Asp Ile Gly Thr Tyr Glu Tyr Tyr Val Lys Trp Pro Trp
1295 1300 1305
Tyr Val Trp Leu Leu Ile Gly Phe Ala Gly Val Ala Met Leu Val
1310 1315 1320
Leu Leu Phe Phe Ile Cys Cys Cys Thr Gly Cys Gly Thr Ser Cys
1325 1330 1335
Phe Lys Ile Cys Gly Gly Cys Cys Asp Asp Tyr Thr Gly His Gln
1340 1345 1350
Glu Leu Val Ile Lys Thr Ser His Asp Asp
1355 1360
<210> 33
<211> 1451
<212> PRT
<213> canine coronavirus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 33
Met Ile Val Leu Thr Leu Cys Leu Phe Leu Phe Leu Tyr Ser Ser Val
1 5 10 15
Ser Cys Thr Ser Asn Asn Asp Cys Val Gln Val Asn Val Thr Gln Leu
20 25 30
Pro Gly Asn Glu Asn Ile Ile Lys Asp Phe Leu Phe Gln Asn Phe Lys
35 40 45
Glu Glu Gly Ser Leu Val Val Gly Gly Tyr Tyr Pro Thr Glu Val Trp
50 55 60
Tyr Asn Cys Ser Thr Thr Gln Gln Thr Thr Ala Tyr Lys Tyr Phe Ser
65 70 75 80
Asn Ile His Ala Phe Tyr Phe Asp Met Glu Ala Met Glu Asn Ser Thr
85 90 95
Gly Asn Ala Arg Gly Lys Pro Leu Leu Val His Val His Gly Asn Pro
100 105 110
Val Ser Ile Ile Val Tyr Ile Ser Ala Tyr Arg Asp Asp Val Gln Phe
115 120 125
Arg Pro Leu Leu Lys His Gly Leu Leu Cys Ile Thr Lys Asn Asp Thr
130 135 140
Val Asp Tyr Asn Ser Phe Thr Ile Asn Gln Trp Arg Asp Ile Cys Leu
145 150 155 160
Gly Asp Asp Arg Lys Ile Pro Phe Ser Val Val Pro Thr Asp Asn Gly
165 170 175
Thr Lys Leu Phe Gly Leu Glu Trp Asn Asp Asp Tyr Val Thr Ala Tyr
180 185 190
Ile Ser Asp Glu Ser His Arg Leu Asn Ile Asn Asn Asn Trp Phe Asn
195 200 205
Asn Val Thr Leu Leu Tyr Ser Arg Thr Ser Thr Ala Thr Trp Gln His
210 215 220
Ser Ala Ala Tyr Val Tyr Gln Gly Val Ser Asn Phe Thr Tyr Tyr Lys
225 230 235 240
Leu Asn Lys Thr Ala Gly Leu Lys Ser Tyr Glu Leu Cys Glu Asp Tyr
245 250 255
Glu Tyr Cys Thr Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr Ser Gly
260 265 270
Gly Tyr Ile Pro Asp Gly Phe Ser Phe Asn Asn Trp Phe Met Leu Thr
275 280 285
Asn Ser Ser Thr Phe Val Ser Gly Arg Phe Val Thr Asn Gln Pro Leu
290 295 300
Leu Val Asn Cys Leu Trp Pro Val Pro Ser Phe Gly Val Ala Ala Gln
305 310 315 320
Glu Phe Cys Phe Glu Gly Ala Gln Phe Ser Gln Cys Asn Gly Val Ser
325 330 335
Leu Asn Asn Thr Val Asp Val Ile Arg Phe Asn Leu Asn Phe Thr Thr
340 345 350
Asp Val Gln Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr
355 360 365
Gly Gly Val Ile Leu Glu Ile Ser Cys Tyr Asn Asp Thr Val Ser Glu
370 375 380
Ser Ser Phe Tyr Ser Tyr Gly Glu Ile Pro Phe Gly Val Thr Asp Gly
385 390 395 400
Pro Arg Tyr Cys Tyr Val Leu Tyr Asn Gly Thr Ala Leu Lys Tyr Leu
405 410 415
Gly Thr Leu Pro Pro Ser Val Lys Glu Ile Ala Ile Ser Lys Trp Gly
420 425 430
His Phe Tyr Ile Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro Ile Asp
435 440 445
Cys Ile Ala Phe Asn Leu Thr Thr Gly Ala Ser Gly Ala Phe Trp Thr
450 455 460
Ile Ala Tyr Thr Ser Tyr Thr Glu Ala Leu Val Gln Val Glu Asn Thr
465 470 475 480
Ala Ile Lys Lys Val Thr Tyr Cys Asn Ser His Ile Asn Asn Ile Lys
485 490 495
Cys Ser Gln Leu Thr Ala Asn Leu Gln Asn Gly Phe Tyr Pro Val Ala
500 505 510
Ser Ser Glu Val Gly Leu Val Asn Lys Ser Val Val Leu Leu Pro Ser
515 520 525
Phe Tyr Ser His Thr Ser Val Asn Ile Thr Ile Asp Leu Gly Met Lys
530 535 540
Arg Ser Val Thr Val Thr Ile Ala Ser Pro Leu Ser Asn Ile Thr Leu
545 550 555 560
Pro Met Gln Asp Asn Asn Ile Asp Val Tyr Cys Ile Arg Ser Asn Gln
565 570 575
Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu Trp Asp Asn
580 585 590
Asn Phe Asn Ser Ala Cys Thr Asp Val Leu Asp Ala Thr Ala Val Ile
595 600 605
Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr Leu
610 615 620
Thr Phe Asn Lys Phe Cys Leu Ser Leu Asn Pro Val Gly Ala Asn Cys
625 630 635 640
Lys Leu Asp Val Ala Ala Arg Thr Arg Thr Asn Glu Gln Val Phe Gly
645 650 655
Ser Leu Tyr Val Ile Tyr Glu Glu Gly Asp Asn Ile Val Gly Val Pro
660 665 670
Ser Asp Asn Ser Gly Leu His Asp Leu Ser Val Leu His Leu Asp Ser
675 680 685
Cys Thr Asp Tyr Asn Ile Tyr Gly Arg Thr Gly Val Gly Ile Ile Arg
690 695 700
Lys Thr Asn Ser Thr Leu Leu Ser Gly Leu Tyr Tyr Thr Ser Leu Ser
705 710 715 720
Gly Asp Leu Leu Gly Phe Lys Asn Val Ser Asp Gly Val Val Tyr Ser
725 730 735
Val Thr Pro Cys Asp Val Ser Ala Gln Ala Ala Val Ile Asp Gly Ala
740 745 750
Ile Val Gly Ala Met Thr Ser Ile Asn Ser Glu Leu Leu Gly Leu Thr
755 760 765
His Trp Thr Thr Thr Pro Asn Phe Tyr Tyr Tyr Ser Ile Tyr Asn Tyr
770 775 780
Thr Asn Val Met Asn Arg Gly Thr Ala Ile Asp Asn Asp Ile Asp Cys
785 790 795 800
Glu Pro Ile Ile Thr Tyr Ser Asn Ile Gly Val Cys Lys Asn Gly Ala
805 810 815
Leu Val Phe Ile Asn Val Thr His Ser Asp Gly Asp Val Gln Pro Ile
820 825 830
Ser Thr Gly Asn Val Thr Ile Pro Thr Asn Phe Thr Ile Ser Val Gln
835 840 845
Val Glu Tyr Ile Gln Val Tyr Thr Thr Pro Val Ser Ile Asp Cys Ala
850 855 860
Arg Tyr Val Cys Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu Thr Gln
865 870 875 880
Tyr Val Ser Ala Cys Gln Thr Ile Glu Gln Ala Leu Ala Met Gly Ala
885 890 895
Arg Leu Glu Asn Met Glu Ile Asp Ser Met Leu Phe Val Ser Glu Asn
900 905 910
Ala Leu Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Thr Glu Asn Leu
915 920 925
Asp Pro Ile Tyr Lys Glu Trp Pro Asn Ile Gly Gly Ser Trp Leu Gly
930 935 940
Gly Leu Lys Asp Ile Leu Pro Ser His Asn Ser Lys Arg Lys Tyr Arg
945 950 955 960
Ser Ala Ile Glu Asp Leu Leu Phe Asp Lys Val Val Thr Ser Gly Leu
965 970 975
Gly Thr Val Asp Glu Asp Tyr Lys Arg Ser Ala Gly Gly Tyr Asp Ile
980 985 990
Ala Asp Leu Val Cys Ala Arg Tyr Tyr Asn Gly Ile Met Val Leu Pro
995 1000 1005
Gly Val Ala Asn Asp Asp Lys Met Thr Met Tyr Thr Ala Ser Leu
1010 1015 1020
Thr Gly Gly Ile Thr Leu Gly Ala Leu Ser Gly Gly Ala Val Ala
1025 1030 1035
Ile Pro Phe Ala Val Ala Val Gln Ala Arg Leu Asn Tyr Val Ala
1040 1045 1050
Leu Gln Thr Asp Val Leu Asn Lys Asn Gln Gln Ile Leu Ala Asn
1055 1060 1065
Ala Phe Asn Gln Ala Ile Gly Asn Ile Thr Gln Ala Phe Gly Lys
1070 1075 1080
Val Asn Asp Ala Ile His Gln Thr Ser Lys Gly Leu Ala Thr Val
1085 1090 1095
Ala Lys Ala Leu Ala Lys Val Gln Asp Val Val Asn Thr Gln Gly
1100 1105 1110
Gln Ala Leu Ser His Leu Thr Val Gln Leu Gln Asn Asn Phe Gln
1115 1120 1125
Ala Ile Ser Ser Ser Ile Ser Asp Ile Tyr Asn Arg Leu Asp Glu
1130 1135 1140
Leu Ser Ala Asp Ala Gln Val Asp Arg Leu Ile Thr Gly Arg Leu
1145 1150 1155
Thr Ala Leu Asn Ala Phe Val Ser Gln Thr Leu Thr Arg Gln Ala
1160 1165 1170
Glu Val Arg Ala Ser Arg Gln Leu Ala Lys Asp Lys Val Asn Glu
1175 1180 1185
Cys Val Arg Ser Gln Ser Gln Arg Phe Gly Phe Cys Gly Asn Gly
1190 1195 1200
Thr His Leu Phe Ser Leu Ala Asn Ala Ala Pro Asn Gly Met Ile
1205 1210 1215
Phe Phe His Thr Val Leu Leu Pro Thr Ala Tyr Glu Thr Val Thr
1220 1225 1230
Ala Trp Ser Gly Ile Cys Ala Ser Asp Gly Ser Arg Thr Phe Gly
1235 1240 1245
Leu Val Val Glu Asp Val Gln Leu Thr Leu Phe Arg Asn Leu Asp
1250 1255 1260
Glu Lys Phe Tyr Leu Thr Pro Arg Thr Met Tyr Gln Pro Arg Val
1265 1270 1275
Ala Thr Ser Ser Asp Phe Val Gln Ile Glu Gly Cys Asp Val Leu
1280 1285 1290
Phe Val Asn Gly Thr Val Ile Glu Leu Pro Ser Ile Ile Pro Asp
1295 1300 1305
Tyr Ile Asp Ile Asn Gln Thr Val Gln Asp Ile Leu Glu Asn Phe
1310 1315 1320
Arg Pro Asn Trp Thr Val Pro Glu Leu Pro Leu Asp Ile Phe His
1325 1330 1335
Ala Thr Tyr Leu Asn Leu Thr Gly Glu Ile Asn Asp Leu Glu Phe
1340 1345 1350
Arg Ser Glu Lys Leu His Asn Thr Thr Val Glu Leu Ala Ile Leu
1355 1360 1365
Ile Asp Asn Ile Asn Asn Thr Leu Val Asn Leu Glu Trp Leu Asn
1370 1375 1380
Arg Ile Glu Thr Tyr Val Lys Trp Pro Trp Tyr Val Trp Leu Leu
1385 1390 1395
Ile Gly Leu Val Val Ile Phe Cys Ile Pro Ile Leu Leu Phe Cys
1400 1405 1410
Cys Cys Ser Thr Gly Cys Cys Gly Cys Ile Gly Cys Leu Gly Ser
1415 1420 1425
Cys Cys His Ser Ile Cys Ser Arg Gly Gln Phe Glu Ser Tyr Glu
1430 1435 1440
Pro Ile Glu Lys Val His Val His
1445 1450
<210> 34
<211> 1355
<212> PRT
<213> EMCR Coronavirus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 34
Met Lys Leu Phe Leu Ile Leu Leu Ile Leu Pro Leu Val Ser Cys Phe
1 5 10 15
Ser Thr Cys Asn Ser Asn Ala Ser Ile Ser Met Leu Gln Leu Gly Val
20 25 30
Pro Asp Asn Ser Ser Thr Ile Val Thr Gly Leu Leu Pro Val His Trp
35 40 45
Ile Cys Ala Asn Gln Ser Thr Ser Ser Tyr Pro Ala Asn Gly Phe Phe
50 55 60
Tyr Ile Asp Val Gly Lys His Arg Ser Ala Phe Ala Leu His Ser Gly
65 70 75 80
Tyr Tyr Asp Ala Asn Gln Tyr Tyr Ile Tyr Leu Thr Asn Lys Ile His
85 90 95
Leu Asn Ala Pro Val Thr Leu Lys Ile Cys Lys Phe Gly Asn Thr Ser
100 105 110
Phe Asp Phe Leu Ser Asn Val Ser Thr Ser His Asp Cys Ile Val Asn
115 120 125
Leu Ser Phe Thr Glu Gln Leu Gly Val Pro Leu Gly Ile Thr Ile Ser
130 135 140
Gly Glu Thr Val Arg Leu His Leu Tyr Asn Ala Thr Arg Thr Phe Tyr
145 150 155 160
Val Pro Ala Ala Tyr Lys Leu Thr Lys Leu Ser Val Lys Cys Tyr Phe
165 170 175
Ser Glu Ser Cys Val Phe Ser Val Val Asn Ala Thr Ile Thr Val Asn
180 185 190
Val Thr Thr Leu Asn Gly Arg Ile Val Asn Tyr Thr Val Cys Asp Asp
195 200 205
Cys Asn Gly Tyr Thr Asp Asn Ile Phe Ser Val Gln Gln Asp Gly Arg
210 215 220
Ile Pro Asn Gly Phe Pro Phe Asn Asn Trp Phe Leu Leu Thr Asn Gly
225 230 235 240
Ser Thr Leu Val Asp Gly Val Ser Arg Leu Tyr Gln Pro Leu Arg Leu
245 250 255
Thr Cys Leu Trp Pro Val Pro Gly Leu Lys Ser Ser Thr Gly Phe Val
260 265 270
Tyr Phe Asn Ala Thr Gly Ser Asp Val Asn Cys Asn Gly Tyr Gln His
275 280 285
Asn Ser Val Ala Asp Val Met Arg Tyr Asn Leu Asn Leu Ser Ala Asn
290 295 300
Ser Val Asp Asn Leu Lys Ser Gly Val Ile Val Phe Lys Thr Leu Gln
305 310 315 320
Tyr Asp Val Leu Phe Tyr Cys Ser Asn Ser Ser Ser Gly Val Leu Asp
325 330 335
Thr Thr Ile Pro Phe Gly Pro Ser Ser Gln Pro Tyr Tyr Cys Phe Ile
340 345 350
Asn Ser Thr Ile Asn Thr Thr His Val Ser Thr Phe Val Gly Ile Leu
355 360 365
Pro Pro Thr Val Arg Glu Ile Val Val Ala Arg Thr Gly Gln Phe Tyr
370 375 380
Ile Asn Gly Phe Lys Tyr Phe Asp Leu Gly Phe Ile Glu Ala Val Asn
385 390 395 400
Phe Asn Val Thr Thr Ala Ser Ala Thr Asp Phe Trp Thr Val Ala Phe
405 410 415
Ala Thr Phe Val Asp Val Leu Val Asn Val Ser Ala Thr Asn Ile Gln
420 425 430
Asn Leu Leu Tyr Cys Asp Ser Pro Phe Glu Lys Leu Gln Cys Glu His
435 440 445
Leu Gln Phe Gly Leu Gln Asp Gly Phe Tyr Ser Ala Asn Phe Leu Asp
450 455 460
Asp Asn Val Leu Pro Glu Thr Tyr Val Ala Leu Pro Ile Tyr Tyr Gln
465 470 475 480
His Thr Asp Ile Asn Phe Thr Ala Thr Ala Ser Phe Gly Gly Ser Cys
485 490 495
Tyr Val Cys Lys Pro Arg Gln Val Asn Ile Ser Leu Asn Gly Asn Thr
500 505 510
Ser Val Cys Val Arg Thr Ser His Phe Ser Ile Arg Tyr Ile Tyr Asn
515 520 525
Arg Val Lys Ser Gly Ser Pro Gly Asp Ser Ser Trp His Ile Tyr Leu
530 535 540
Lys Ser Gly Thr Cys Pro Phe Ser Phe Ser Lys Leu Asn Asn Phe Gln
545 550 555 560
Lys Phe Lys Thr Ile Cys Phe Ser Thr Val Glu Val Pro Gly Ser Cys
565 570 575
Asn Phe Pro Leu Glu Ala Thr Trp His Tyr Thr Ser Tyr Thr Ile Val
580 585 590
Gly Ala Leu Tyr Val Thr Trp Ser Glu Gly Asn Ser Ile Thr Gly Val
595 600 605
Pro Tyr Pro Val Ser Gly Ile Arg Glu Phe Ser Asn Leu Val Leu Asn
610 615 620
Asn Cys Thr Lys Tyr Asn Ile Tyr Asp Tyr Val Gly Thr Gly Ile Ile
625 630 635 640
Arg Ser Ser Asn Gln Ser Leu Ala Gly Gly Ile Thr Tyr Val Ser Asn
645 650 655
Ser Gly Asn Leu Leu Gly Phe Lys Asn Val Ser Thr Gly Asn Ile Phe
660 665 670
Ile Val Thr Pro Cys Asn Gln Pro Asp Gln Val Ala Val Tyr Gln Gln
675 680 685
Ser Ile Ile Gly Ala Met Thr Ala Val Asn Glu Ser Arg Tyr Gly Leu
690 695 700
Gln Asn Leu Leu Gln Leu Pro Asn Phe Tyr Tyr Val Ser Asn Gly Gly
705 710 715 720
Asn Asn Cys Thr Thr Ala Val Met Ile Tyr Ser Asn Phe Gly Ile Cys
725 730 735
Ala Asp Gly Ser Leu Ile Pro Val Arg Pro Arg Asn Ser Ser Asp Asn
740 745 750
Gly Ile Ser Ala Ile Ile Thr Ala Asn Leu Ser Ile Pro Ser Asn Trp
755 760 765
Thr Thr Ser Val Gln Val Glu Tyr Leu Gln Ile Thr Ser Thr Pro Ile
770 775 780
Val Val Asp Cys Ala Thr Tyr Val Cys Asn Gly Asn Pro Arg Cys Lys
785 790 795 800
Asn Leu Leu Lys Gln Tyr Thr Ser Ala Cys Lys Thr Ile Glu Asp Ala
805 810 815
Leu Arg Leu Ser Ala His Leu Glu Thr Asn Asp Val Ser Ser Met Leu
820 825 830
Thr Phe Asp Ser Asn Ala Phe Ser Leu Ala Asn Val Thr Ser Phe Gly
835 840 845
Asp Tyr Asn Leu Ser Ser Val Leu Pro Gln Arg Asn Ile His Ser Ser
850 855 860
Arg Ile Ala Gly Arg Ser Ala Leu Glu Asp Leu Leu Phe Ser Lys Val
865 870 875 880
Val Thr Ser Gly Leu Gly Thr Val Asp Val Asp Tyr Lys Ser Cys Thr
885 890 895
Lys Gly Leu Ser Ile Ala Asp Leu Ala Cys Ala Gln Tyr Tyr Asn Gly
900 905 910
Ile Met Val Leu Pro Gly Val Ala Asp Ala Glu Arg Met Ala Met Tyr
915 920 925
Thr Gly Ser Leu Ile Gly Gly Met Val Leu Gly Gly Leu Thr Ser Ala
930 935 940
Ala Ala Ile Pro Phe Ser Leu Ala Leu Gln Ala Arg Leu Asn Tyr Val
945 950 955 960
Ala Leu Gln Thr Asp Val Leu Gln Glu Asn Gln Lys Ile Leu Ala Ala
965 970 975
Ser Phe Asn Lys Ala Ile Asn Asn Ile Val Ala Ser Phe Ser Ser Val
980 985 990
Asn Asp Ala Ile Thr His Thr Ala Glu Ala Ile His Thr Val Thr Ile
995 1000 1005
Ala Leu Asn Lys Ile Gln Asp Val Val Asn Gln Gln Gly Ser Ala
1010 1015 1020
Leu Asn His Leu Thr Ser Gln Leu Arg His Asn Phe Gln Ala Ile
1025 1030 1035
Ser Asn Ser Ile His Ala Ile Tyr Asp Arg Leu Asp Ser Ile Gln
1040 1045 1050
Ala Asp Gln Gln Val Asp Arg Leu Ile Thr Gly Arg Leu Ala Ala
1055 1060 1065
Leu Asn Ala Phe Val Ser Gln Val Leu Asn Lys Tyr Thr Glu Val
1070 1075 1080
Arg Gly Ser Arg Arg Leu Ala Gln Gln Lys Ile Asn Glu Cys Val
1085 1090 1095
Lys Ser Gln Ser Asn Arg Tyr Gly Phe Cys Gly Asn Gly Thr His
1100 1105 1110
Ile Phe Ser Ile Val Asn Ser Ala Pro Asp Gly Leu Leu Phe Leu
1115 1120 1125
His Thr Val Leu Leu Pro Thr Asp Tyr Lys Asn Val Lys Ala Trp
1130 1135 1140
Ser Gly Ile Cys Val Asp Gly Ile Tyr Gly Tyr Val Leu Arg Gln
1145 1150 1155
Pro Asn Leu Val Leu Tyr Ser Asp Asn Gly Val Phe Arg Val Thr
1160 1165 1170
Ser Arg Val Met Phe Gln Pro Arg Leu Pro Val Leu Ser Asp Phe
1175 1180 1185
Val Gln Ile Tyr Asn Cys Asn Val Thr Phe Val Asn Ile Ser Arg
1190 1195 1200
Val Glu Leu His Thr Val Ile Pro Asp Tyr Val Asp Val Asn Lys
1205 1210 1215
Thr Leu Gln Glu Phe Ala Gln Asn Leu Pro Lys Tyr Val Lys Pro
1220 1225 1230
Asn Phe Asp Leu Thr Pro Phe Asn Leu Thr Tyr Leu Asn Leu Ser
1235 1240 1245
Ser Glu Leu Lys Gln Leu Glu Ala Lys Thr Ala Ser Leu Phe Gln
1250 1255 1260
Thr Thr Val Glu Leu Gln Gly Leu Ile Asp Gln Ile Asn Ser Thr
1265 1270 1275
Tyr Val Asp Leu Lys Leu Leu Asn Arg Phe Glu Asn Tyr Ile Lys
1280 1285 1290
Trp Pro Trp Trp Val Trp Leu Ile Ile Ser Val Val Phe Val Val
1295 1300 1305
Leu Leu Ser Leu Leu Val Phe Cys Cys Leu Ser Thr Gly Cys Cys
1310 1315 1320
Gly Cys Cys Asn Cys Leu Thr Ser Ser Met Arg Gly Cys Cys Asp
1325 1330 1335
Cys Gly Ser Thr Lys Leu Pro Tyr Tyr Glu Phe Glu Lys Val His
1340 1345 1350
Val Gln
1355
<210> 35
<211> 1452
<212> PRT
<213> feline coronavirus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 35
Met Ile Val Leu Val Thr Cys Leu Leu Leu Leu Cys Ser Tyr His Thr
1 5 10 15
Val Leu Ser Thr Thr Asn Asn Glu Cys Ile Gln Val Asn Val Thr Gln
20 25 30
Leu Ala Gly Asn Glu Asn Leu Ile Arg Asp Phe Leu Phe Ser Asn Phe
35 40 45
Lys Glu Glu Gly Ser Val Val Val Gly Gly Tyr Tyr Pro Thr Glu Val
50 55 60
Trp Tyr Asn Cys Ser Arg Thr Ala Arg Thr Thr Ala Phe Gln Tyr Phe
65 70 75 80
Asn Asn Ile His Ala Phe Tyr Phe Val Met Glu Ala Met Glu Asn Ser
85 90 95
Thr Gly Asn Ala Arg Gly Lys Pro Leu Leu Phe His Val His Gly Glu
100 105 110
Pro Val Ser Val Ile Ile Ser Ala Tyr Arg Asp Asp Val Gln Gln Arg
115 120 125
Pro Leu Leu Lys His Gly Leu Val Cys Ile Thr Lys Asn Arg His Ile
130 135 140
Asn Tyr Glu Gln Phe Thr Ser Asn Gln Trp Asn Ser Thr Cys Thr Gly
145 150 155 160
Ala Asp Arg Lys Ile Pro Phe Ser Val Ile Pro Thr Asp Asn Gly Thr
165 170 175
Lys Ile Tyr Gly Leu Glu Trp Asn Asp Asp Phe Val Thr Ala Tyr Ile
180 185 190
Ser Gly Arg Ser Tyr His Leu Asn Ile Asn Thr Asn Trp Phe Asn Asn
195 200 205
Val Thr Leu Leu Tyr Ser Arg Ser Ser Thr Ala Thr Trp Glu Tyr Ser
210 215 220
Ala Ala Tyr Ala Tyr Gln Gly Val Ser Asn Phe Thr Tyr Tyr Lys Leu
225 230 235 240
Asn Asn Thr Asn Gly Leu Lys Thr Tyr Glu Leu Cys Glu Asp Tyr Glu
245 250 255
His Cys Thr Gly Tyr Ala Thr Asn Val Phe Ala Pro Thr Ser Gly Gly
260 265 270
Tyr Ile Pro Asp Gly Phe Ser Phe Asn Asn Trp Phe Leu Leu Thr Asn
275 280 285
Ser Ser Thr Phe Val Ser Gly Arg Phe Val Thr Asn Gln Pro Leu Leu
290 295 300
Ile Asn Cys Leu Trp Pro Val Pro Ser Phe Gly Val Ala Ala Gln Glu
305 310 315 320
Phe Cys Phe Glu Gly Ala Gln Phe Ser Gln Cys Asn Gly Val Ser Leu
325 330 335
Asn Asn Thr Val Asp Val Ile Arg Phe Asn Leu Asn Phe Thr Ala Asp
340 345 350
Val Gln Ser Gly Met Gly Ala Thr Val Phe Ser Leu Asn Thr Thr Gly
355 360 365
Gly Val Ile Leu Glu Ile Ser Cys Tyr Ser Asp Thr Val Ser Glu Ser
370 375 380
Ser Ser Tyr Ser Tyr Gly Glu Ile Pro Phe Gly Ile Thr Asp Gly Pro
385 390 395 400
Arg Tyr Cys Tyr Val Leu Tyr Asn Gly Thr Ala Leu Lys Tyr Leu Gly
405 410 415
Thr Leu Pro Pro Ser Val Lys Glu Ile Ala Ile Ser Lys Trp Gly His
420 425 430
Phe Tyr Ile Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro Ile Gly Cys
435 440 445
Ile Ser Phe Asn Leu Thr Thr Gly Val Ser Gly Ala Phe Trp Thr Ile
450 455 460
Ala Tyr Thr Ser Tyr Thr Glu Ala Leu Val Gln Val Glu Asn Thr Ala
465 470 475 480
Ile Lys Asn Val Thr Tyr Cys Asn Ser His Ile Asn Asn Ile Lys Cys
485 490 495
Ser Gln Leu Thr Ala Asn Leu Asn Asn Gly Phe Tyr Pro Val Ala Ser
500 505 510
Ser Glu Val Gly Phe Val Asn Lys Ser Val Val Leu Leu Pro Ser Phe
515 520 525
Phe Thr Tyr Thr Ala Val Asn Ile Thr Ile Asp Leu Gly Met Lys Leu
530 535 540
Ser Gly Tyr Gly Gln Pro Ile Ala Ser Thr Leu Ser Asn Ile Thr Leu
545 550 555 560
Pro Met Gln Asp Asn Asn Thr Asp Val Tyr Cys Ile Arg Ser Asn Gln
565 570 575
Phe Ser Val Tyr Val His Ser Thr Cys Lys Ser Ser Leu Trp Asp Asn
580 585 590
Ile Phe Asn Gln Asp Cys Thr Asp Val Leu Glu Ala Thr Ala Val Ile
595 600 605
Lys Thr Gly Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr Leu
610 615 620
Thr Phe Asn Lys Phe Cys Leu Ser Leu Ser Pro Val Gly Ala Asn Cys
625 630 635 640
Lys Phe Asp Val Ala Ala Arg Thr Arg Thr Asn Glu Gln Val Val Arg
645 650 655
Ser Leu Tyr Val Ile Tyr Glu Glu Gly Asp Asn Ile Val Gly Val Pro
660 665 670
Ser Asp Asn Ser Gly Leu His Asp Leu Ser Val Leu His Leu Asp Ser
675 680 685
Cys Thr Asp Tyr Asn Ile Tyr Gly Arg Thr Gly Val Gly Ile Ile Arg
690 695 700
Arg Thr Asn Ser Thr Leu Leu Ser Gly Leu Tyr Tyr Thr Ser Leu Ser
705 710 715 720
Gly Asp Leu Leu Gly Phe Lys Asn Val Ser Asp Gly Val Ile Tyr Ser
725 730 735
Val Thr Pro Cys Asp Val Ser Ala Gln Ala Ala Val Ile Asp Gly Ala
740 745 750
Ile Val Gly Ala Met Thr Ser Ile Asn Ser Glu Leu Leu Gly Leu Thr
755 760 765
His Trp Thr Thr Thr Pro Asn Phe Tyr Tyr Tyr Ser Ile Tyr Asn Tyr
770 775 780
Thr Ser Glu Arg Thr Arg Gly Thr Ala Ile Asp Ser Asn Asp Val Asp
785 790 795 800
Cys Glu Pro Val Ile Thr Tyr Ser Asn Ile Gly Val Cys Lys Asn Gly
805 810 815
Ala Leu Val Phe Ile Asn Val Thr His Ser Asp Gly Asp Val Gln Pro
820 825 830
Ile Ser Thr Gly Asn Val Thr Ile Pro Thr Asn Phe Thr Ile Ser Val
835 840 845
Gln Val Glu Tyr Met Gln Val Tyr Thr Thr Pro Val Ser Ile Asp Cys
850 855 860
Ala Arg Tyr Val Cys Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu Thr
865 870 875 880
Gln Tyr Val Ser Ala Cys Gln Thr Ile Glu Gln Ala Leu Ala Met Gly
885 890 895
Ala Arg Leu Glu Asn Met Glu Val Asp Ser Met Leu Phe Val Ser Glu
900 905 910
Asn Ala Leu Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Thr Glu Asn
915 920 925
Leu Asp Pro Ile Tyr Lys Glu Trp Pro Ser Ile Gly Gly Ser Trp Leu
930 935 940
Gly Gly Leu Lys Asp Ile Leu Pro Ser His Asn Ser Lys Arg Lys Tyr
945 950 955 960
Gly Ser Ala Ile Glu Asp Leu Leu Phe Asp Lys Val Val Thr Ser Gly
965 970 975
Leu Gly Thr Val Asp Glu Asp Tyr Lys Arg Cys Thr Gly Gly Tyr Asp
980 985 990
Ile Ala Asp Leu Val Cys Ala Gln Tyr Tyr Asn Gly Ile Met Val Leu
995 1000 1005
Pro Gly Val Ala Asn Ala Asp Lys Met Thr Met Tyr Thr Ala Ser
1010 1015 1020
Leu Ala Gly Gly Ile Thr Leu Gly Ala Leu Gly Gly Gly Ala Val
1025 1030 1035
Ala Ile Pro Phe Ala Val Ala Val Gln Ala Arg Leu Asn Tyr Val
1040 1045 1050
Ala Leu Gln Thr Asp Val Leu Asn Lys Asn Gln Gln Ile Leu Ala
1055 1060 1065
Asn Ala Phe Asn Gln Ala Ile Gly Asn Ile Thr Gln Ala Phe Gly
1070 1075 1080
Lys Val Asn Asp Ala Ile His Gln Thr Ser Gln Gly Leu Ala Thr
1085 1090 1095
Val Ala Lys Ala Leu Ala Lys Val Gln Asp Val Val Asn Thr Gln
1100 1105 1110
Gly Gln Ala Leu Ser His Leu Thr Val Gln Leu Gln Asn Asn Phe
1115 1120 1125
Gln Ala Ile Ser Ser Ser Ile Ser Asp Ile Tyr Asn Arg Leu Asp
1130 1135 1140
Glu Leu Ser Ala Asp Ala Gln Val Asp Arg Leu Ile Thr Gly Arg
1145 1150 1155
Leu Thr Ala Leu Asn Ala Phe Val Ser Gln Thr Leu Thr Arg Gln
1160 1165 1170
Ala Glu Val Arg Ala Ser Arg Gln Leu Ala Lys Asp Lys Val Asn
1175 1180 1185
Glu Cys Val Arg Ser Gln Ser Gln Arg Phe Gly Phe Cys Gly Asn
1190 1195 1200
Gly Thr His Leu Phe Ser Leu Ala Asn Ala Ala Pro Asn Gly Met
1205 1210 1215
Ile Phe Phe His Thr Val Leu Leu Pro Thr Ala Tyr Glu Thr Val
1220 1225 1230
Thr Ala Trp Ser Gly Ile Cys Ala Ser Asp Gly Asp Arg Thr Phe
1235 1240 1245
Gly Leu Val Val Lys Asp Val Gln Leu Thr Leu Phe Arg Asn Leu
1250 1255 1260
Asp Asp Lys Phe Tyr Leu Thr Pro Arg Thr Met Tyr Gln Pro Arg
1265 1270 1275
Val Ala Thr Ser Ser Asp Phe Val Gln Ile Glu Gly Cys Asp Val
1280 1285 1290
Leu Phe Val Asn Ala Thr Val Ile Asp Leu Pro Ser Ile Ile Pro
1295 1300 1305
Asp Tyr Ile Asp Ile Asn Gln Thr Val Gln Asp Ile Leu Glu Asn
1310 1315 1320
Tyr Arg Pro Asn Trp Thr Val Pro Glu Phe Thr Leu Asp Ile Phe
1325 1330 1335
Asn Ala Thr Tyr Leu Asn Leu Thr Gly Glu Ile Asp Asp Leu Glu
1340 1345 1350
Phe Arg Ser Glu Lys Leu His Asn Thr Thr Val Glu Leu Ala Ile
1355 1360 1365
Leu Ile Asp Asn Ile Asn Asn Thr Leu Val Asn Leu Glu Trp Leu
1370 1375 1380
Asn Arg Ile Glu Thr Tyr Val Lys Trp Pro Trp Tyr Val Trp Leu
1385 1390 1395
Leu Ile Gly Leu Val Val Val Phe Cys Ile Pro Leu Leu Leu Phe
1400 1405 1410
Cys Cys Phe Ser Thr Gly Cys Cys Gly Cys Ile Gly Cys Leu Gly
1415 1420 1425
Ser Cys Cys His Ser Ile Cys Ser Arg Arg Gln Phe Glu Asn Tyr
1430 1435 1440
Glu Pro Ile Glu Lys Val His Val His
1445 1450
<210> 36
<211> 1361
<212> PRT
<213> murine hepatitis virus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 36
Met Leu Phe Val Phe Leu Thr Leu Leu Pro Ser Ser Leu Gly Tyr Ile
1 5 10 15
Gly Asp Phe Arg Cys Ile Gln Leu Val Asn Thr Asp Thr Ser Asn Ala
20 25 30
Ser Ala Pro Ser Val Ser Thr Glu Val Val Asp Val Ser Lys Gly Ile
35 40 45
Gly Thr Tyr Tyr Val Leu Asp Arg Val Tyr Leu Asn Ala Thr Leu Leu
50 55 60
Leu Thr Gly Tyr Tyr Pro Val Asp Gly Ser Met Tyr Arg Asn Met Ala
65 70 75 80
Leu Thr Gly Ile Asn Thr Ile Ser Leu Asn Trp Tyr Lys Pro Pro Phe
85 90 95
Leu Ser Glu Phe Asn Asp Gly Ile Phe Ala Lys Val Lys Asn Leu Lys
100 105 110
Ala Ser Leu Pro Lys Asp Ser Ile Ser Tyr Phe Pro Thr Ile Ile Ile
115 120 125
Gly Ser Asn Phe Val Thr Thr Ser Tyr Thr Val Val Leu Glu Pro Tyr
130 135 140
Asn Gly Ile Ile Met Ala Ser Ile Cys Gln Tyr Thr Ile Cys Gln Leu
145 150 155 160
Pro Tyr Thr Asp Cys Lys Pro Asn Thr Gly Gly Asn Lys Leu Ile Gly
165 170 175
Phe Trp His Thr Glu Leu Lys Ser Pro Val Cys Ile Leu Lys Arg Asn
180 185 190
Phe Thr Phe Asn Val Asn Ala Glu Trp Leu Tyr Phe His Phe Tyr Gln
195 200 205
Gln Gly Gly Thr Phe Tyr Ala Tyr Tyr Ala Asp Val Ser Ser Ala Thr
210 215 220
Thr Phe Leu Phe Ser Met Tyr Ile Gly Asp Val Leu Thr Gln Tyr Phe
225 230 235 240
Val Leu Pro Tyr Met Cys Thr Leu Thr Thr Thr Gly Val Phe Ser Pro
245 250 255
Gln Tyr Trp Val Thr Pro Leu Val Lys Arg Gln Tyr Leu Phe Asn Phe
260 265 270
Asn Gln Lys Gly Ile Ile Thr Ser Ala Val Asp Cys Ala Ser Ser Tyr
275 280 285
Thr Ser Glu Ile Lys Cys Lys Thr Gln Ser Met Asn Pro Asn Thr Gly
290 295 300
Val Tyr Asp Leu Ser Gly Tyr Thr Val Gln Pro Val Gly Leu Val Tyr
305 310 315 320
Arg Arg Val Arg Asn Leu Pro Asp Cys Lys Ile Glu Glu Trp Leu Thr
325 330 335
Ala Lys Ser Val Pro Ser Pro Leu Asn Trp Glu Arg Lys Thr Phe Gln
340 345 350
Asn Cys Asn Phe Asp Leu Ser Ser Leu Leu Arg Phe Val Gln Ala Glu
355 360 365
Ser Leu Ser Cys Ser Asn Ile Asp Ala Ser Lys Val Tyr Gly Met Cys
370 375 380
Phe Gly Ser Ile Ser Ile Asp Lys Phe Ala Ile Pro Asn Arg Arg Arg
385 390 395 400
Val Asp Leu Gln Leu Gly Asn Ser Gly Phe Leu Gln Ser Phe Asn Tyr
405 410 415
Lys Ile Asp Thr Arg Ala Thr Ser Cys Gln Leu Tyr Tyr Ser Leu Ala
420 425 430
Lys Asn Asn Val Thr Val Asn Asn His Asn Pro Ser Ser Trp Asn Arg
435 440 445
Arg Tyr Gly Phe Asn Asp Val Ala Thr Phe Gly Thr Gly Lys His Asp
450 455 460
Val Ala Tyr Ala Glu Ala Cys Phe Thr Val Gly Ala Ser Tyr Cys Pro
465 470 475 480
Cys Ala Asn Pro Ser Ile Val Ser Pro Cys Thr Thr Gly Lys Pro Asn
485 490 495
Phe Ala Asn Cys Pro Thr Gly Thr Ser Asn Arg Glu Cys Thr Val Met
500 505 510
Pro Leu Ala Asn Asn Gln Phe Lys Cys Asp Cys Thr Cys Asn Pro Ser
515 520 525
Pro Leu Thr Thr Tyr Asp Leu Arg Cys Leu Gln Ala Arg Ser Met Leu
530 535 540
Gly Val Gly Asp His Cys Glu Gly Leu Gly Val Leu Glu Asp Lys Cys
545 550 555 560
Gly Gly Ser Asn Thr Cys Asn Cys Ser Ala His Ala Phe Val Gly Trp
565 570 575
Ala Lys Asp Ser Cys Leu Ala Asn Gly Arg Cys His Ile Phe Ser Asn
580 585 590
Leu Met Leu Asn Gly Ile Asn Ser Gly Thr Thr Cys Ser Met Asp Leu
595 600 605
Gln Leu Pro Asn Thr Glu Val Val Thr Gly Val Cys Val Lys Tyr Asp
610 615 620
Leu Tyr Gly Ile Thr Gly Gln Gly Ile Phe Lys Glu Val Lys Ala Asp
625 630 635 640
Tyr Tyr His Ser Trp Gln Asn Leu Leu Tyr Asp Val Asn Gly Asn Leu
645 650 655
Ile Gly Phe Arg Asp Phe Val Ala Asn Lys Ser Tyr Thr Ile Arg Ser
660 665 670
Cys Tyr Ser Gly Arg Val Ser Ala Ala Tyr His Gln Asp Ala Pro Glu
675 680 685
Pro Ala Leu Leu Tyr Arg Asn Leu Lys Cys Asp Tyr Val Phe Asn Asn
690 695 700
Asn Ile Ser Arg Glu Glu Thr Pro Leu Asn Tyr Phe Asp Ser Tyr Leu
705 710 715 720
Gly Cys Val Val Asn Ala Asp Asn Ser Thr Glu Glu Ala Val Asp Ala
725 730 735
Cys Asp Leu Arg Met Gly Ser Gly Leu Cys Val Asn Tyr Ser Thr Ser
740 745 750
His Arg Ala Arg Ser Ser Val Ser Thr Gly Tyr Lys Leu Thr Thr Phe
755 760 765
Glu Pro Phe Thr Val Arg Ile Val Asn Asp Ser Val Glu Ser Val Asp
770 775 780
Gly Leu Tyr Glu Leu Gln Ile Pro Thr Asn Phe Thr Ile Ala Ser His
785 790 795 800
Gln Glu Phe Val Gln Thr Arg Ser Pro Lys Val Thr Ile Asp Cys Ala
805 810 815
Ala Phe Val Cys Gly Gly His Thr Ala Cys Arg Gln Gln Leu Val Glu
820 825 830
Tyr Gly Ser Phe Cys Asp Asn Ile Asn Ala Ile Leu Gly Glu Val Asn
835 840 845
Asn Leu Ile Asp Thr Met Gln Leu Gln Val Ala Ser Ala Leu Ile Gln
850 855 860
Gly Val Thr Leu Ser Ser Arg Leu Ser Asp Gly Ile Gly Gly Gln Ile
865 870 875 880
Asp Asp Ile Asn Phe Ser Pro Leu Leu Gly Cys Leu Gly Ser Asp Cys
885 890 895
Gly Glu Val Thr Met Ala Ala Gln Thr Gly Arg Ser Ala Ile Glu Asp
900 905 910
Val Leu Phe Asp Lys Val Lys Leu Ser Asp Val Gly Phe Val Glu Ala
915 920 925
Tyr Asn Asn Cys Thr Gly Gly Gln Glu Val Arg Asp Leu Leu Cys Val
930 935 940
Gln Ser Phe Asn Gly Ile Lys Val Leu Pro Pro Val Leu Ser Glu Asn
945 950 955 960
Gln Ile Ser Gly Tyr Thr Ala Gly Ala Thr Val Ser Ala Met Phe Pro
965 970 975
Trp Ser Ala Ala Ala Gly Val Pro Phe Ser Leu Ser Val Gln Tyr Arg
980 985 990
Ile Asn Gly Leu Gly Val Thr Met Asn Val Leu Ser Glu Asn Gln Lys
995 1000 1005
Met Ile Ala Ser Ala Phe Asn Asn Ala Ile Gly Ala Ile Gln Glu
1010 1015 1020
Gly Phe Ala Ala Thr Asn Ser Ala Leu Ala Lys Met Gln Phe Val
1025 1030 1035
Val Asn Ala Asn Ala Glu Ala Leu Asn Asn Leu Leu Asn Gln Leu
1040 1045 1050
Ser Asn Arg Phe Gly Ala Ile Ser Ala Ser Leu Gln Glu Ile Leu
1055 1060 1065
Ser Arg Leu Asp Ala Leu Glu Ala Gln Ala Gln Ile Asp Arg Leu
1070 1075 1080
Ile Asn Gly Arg Leu Thr Ala Leu Asn Ala Tyr Val Ser Lys Gln
1085 1090 1095
Leu Ser Asp Met Thr Leu Val Lys Val Ser Ala Ala Gln Ala Ile
1100 1105 1110
Glu Lys Val Asn Glu Cys Val Lys Ser Gln Ser Ser Arg Ile Asn
1115 1120 1125
Phe Cys Gly Asn Gly Asn His Ile Leu Ser Leu Val Gln Asn Ala
1130 1135 1140
Pro Tyr Gly Leu Tyr Phe Ile His Phe Ser Tyr Val Pro Thr Ser
1145 1150 1155
Phe Thr Thr Ala Asn Val Ser Pro Gly Leu Cys Ile Ser Gly Asp
1160 1165 1170
Arg Gly Leu Ala Pro Lys Ala Gly Tyr Phe Val Gln Asp Asp Gly
1175 1180 1185
Glu Trp Lys Phe Thr Gly Ser Asn Tyr Tyr Tyr Pro Glu Pro Ile
1190 1195 1200
Thr Asp Lys Asn Ser Val Val Met Ser Ser Cys Ala Ala Asn Tyr
1205 1210 1215
Thr Lys Ala Pro Glu Val Phe Leu Asn Thr Ser Ile Pro Asn Leu
1220 1225 1230
Pro Asp Phe Lys Glu Glu Leu Asp Lys Trp Phe Lys Asn Gln Thr
1235 1240 1245
Ser Ile Ala Pro Asp Leu Ser Leu Asp Phe Glu Lys Leu Asn Val
1250 1255 1260
Thr Leu Leu Asp Leu Thr Asp Glu Met Asn Arg Ile Gln Asp Ala
1265 1270 1275
Ile Lys Lys Leu Asn Glu Ser Tyr Ile Asn Leu Lys Asp Val Gly
1280 1285 1290
Thr Tyr Glu Met Tyr Val Lys Trp Pro Trp Tyr Val Trp Leu Leu
1295 1300 1305
Ile Gly Leu Ala Gly Val Ala Val Cys Val Leu Leu Phe Phe Ile
1310 1315 1320
Cys Cys Cys Thr Gly Cys Gly Ser Cys Cys Phe Lys Lys Cys Gly
1325 1330 1335
Asn Cys Cys Asp Glu Cys Gly Gly His Gln Asp Ser Ile Val Ile
1340 1345 1350
His Asn Ile Ser Ser His Glu Asp
1355 1360
<210> 37
<211> 1361
<212> PRT
<213> human coronavirus OC43
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 37
Met Phe Leu Ile Leu Leu Ile Ser Leu Pro Thr Ala Phe Ala Val Ile
1 5 10 15
Gly Asp Leu Lys Cys Thr Ser Asp Thr Ser Tyr Ile Asn Asp Lys Asp
20 25 30
Thr Gly Pro Pro Pro Ile Ser Thr Asp Thr Val Asp Val Thr Asn Gly
35 40 45
Leu Gly Thr Tyr Tyr Val Leu Asp Arg Val Tyr Leu Asn Thr Thr Leu
50 55 60
Phe Leu Asn Gly Tyr Tyr Pro Thr Ser Gly Ser Thr Tyr Arg Asn Met
65 70 75 80
Ala Leu Lys Gly Ser Val Leu Leu Ser Arg Leu Trp Phe Lys Pro Pro
85 90 95
Phe Leu Ser Asp Phe Ile Asn Gly Ile Phe Ala Lys Val Lys Asn Thr
100 105 110
Lys Val Ile Lys Asp Arg Val Met Tyr Ser Glu Phe Pro Ala Ile Thr
115 120 125
Ile Gly Ser Thr Phe Val Asn Thr Ser Tyr Ser Val Val Val Gln Pro
130 135 140
Arg Thr Ile Asn Ser Thr Gln Asp Gly Tyr Asn Lys Leu Gln Gly Leu
145 150 155 160
Leu Glu Val Ser Val Cys Gln Tyr Asn Met Cys Glu Tyr Pro Gln Thr
165 170 175
Ile Cys His Pro Asn Leu Gly Asn His Arg Lys Glu Leu Trp His Leu
180 185 190
Asp Thr Gly Val Val Ser Cys Leu Tyr Lys Arg Asn Phe Thr Tyr Asp
195 200 205
Val Asn Ala Asp Tyr Leu Tyr Phe His Phe Tyr Gln Glu Gly Gly Thr
210 215 220
Phe Tyr Ala Tyr Phe Thr Asp Thr Gly Val Val Thr Lys Phe Leu Phe
225 230 235 240
Asn Val Tyr Leu Gly Met Ala Leu Ser His Tyr Tyr Val Met Pro Leu
245 250 255
Thr Cys Asn Ser Lys Val Lys Asn Gly Phe Thr Leu Glu Tyr Trp Val
260 265 270
Thr Pro Leu Thr Ser Arg Gln Tyr Leu Leu Ala Phe Asn Gln Asp Gly
275 280 285
Ile Ile Phe Asn Ala Val Asp Cys Met Ser Asp Phe Met Ser Glu Ile
290 295 300
Lys Cys Lys Thr Gln Ser Ile Ala Pro Pro Thr Gly Val Tyr Glu Leu
305 310 315 320
Asn Gly Tyr Thr Val Gln Pro Ile Ala Asp Val Tyr Arg Arg Lys Leu
325 330 335
Asn Leu Pro Asn Cys Asn Ile Glu Ala Trp Leu Asn Asp Lys Ser Val
340 345 350
Pro Ser Pro Leu Asn Trp Glu Arg Lys Thr Phe Ser Asn Cys Asn Phe
355 360 365
Asn Met Ser Ser Leu Met Ser Phe Ile Gln Ala Asp Ser Phe Thr Cys
370 375 380
Asn Asn Ile Asp Ala Ala Lys Ile Tyr Gly Met Cys Phe Ser Ser Ile
385 390 395 400
Thr Ile Asp Lys Phe Ala Ile Pro Asn Gly Arg Lys Val Asp Leu Gln
405 410 415
Leu Gly Asn Leu Gly Tyr Leu Gln Ser Phe Asn Tyr Arg Ile Asp Thr
420 425 430
Thr Ala Thr Ser Cys Gln Leu Tyr Tyr Asn Leu Pro Ala Ala Asn Val
435 440 445
Ser Val Ser Arg Phe Asn Pro Ser Thr Trp Asn Lys Arg Phe Gly Phe
450 455 460
Ile Glu Asp Ser Val Phe Lys Pro Arg Pro Ala Gly Val Leu Thr Asn
465 470 475 480
His Asp Val Val Tyr Ala Gln His Cys Phe Lys Ala Pro Lys Asn Phe
485 490 495
Cys Pro Cys Lys Leu Asn Gly Ser Cys Val Gly Ser Gly Pro Gly Lys
500 505 510
Asn Asn Gly Ile Gly Thr Cys Pro Ala Gly Thr Asn Tyr Leu Thr Cys
515 520 525
Asp Asn Leu Cys Thr Pro Asp Pro Ile Thr Phe Lys Ala Thr Gly Thr
530 535 540
Tyr Lys Cys Pro Gln Thr Lys Ser Leu Val Gly Ile Gly Glu His Cys
545 550 555 560
Ser Gly Leu Ala Val Lys Ser Asp Tyr Cys Gly Gly Asn Ser Cys Thr
565 570 575
Cys Arg Pro Gln Ala Phe Leu Gly Trp Ser Ala Asp Ser Cys Leu Gln
580 585 590
Gly Asp Lys Cys Asn Ile Phe Ala Asn Phe Ile Leu His Asp Val Asn
595 600 605
Ser Gly Leu Thr Cys Ser Thr Asp Leu Gln Lys Ala Asn Thr Asp Ile
610 615 620
Ile Leu Gly Val Cys Val Asn Tyr Asp Leu Tyr Gly Ile Leu Gly Gln
625 630 635 640
Gly Ile Phe Val Glu Val Asn Ala Thr Tyr Tyr Asn Ser Trp Gln Asn
645 650 655
Leu Leu Tyr Asp Ser Asn Gly Asn Leu Tyr Gly Phe Arg Asp Tyr Ile
660 665 670
Thr Asn Arg Thr Phe Met Ile Arg Ser Cys Tyr Ser Gly Arg Val Ser
675 680 685
Ala Ala Phe His Ala Asn Ser Ser Glu Pro Ala Leu Leu Phe Arg Asn
690 695 700
Ile Lys Cys Asn Tyr Val Phe Asn Asn Ser Leu Thr Arg Gln Leu Gln
705 710 715 720
Pro Ile Asn Tyr Phe Asp Ser Tyr Leu Gly Cys Val Val Asn Ala Tyr
725 730 735
Asn Ser Thr Ala Ile Ser Val Gln Thr Cys Asp Leu Thr Val Gly Ser
740 745 750
Gly Tyr Cys Val Asp Tyr Ser Lys Asn Arg Arg Ser Arg Gly Ala Ile
755 760 765
Thr Thr Gly Tyr Arg Phe Thr Asn Phe Glu Pro Phe Thr Val Asn Ser
770 775 780
Val Asn Asp Ser Leu Glu Pro Val Gly Gly Leu Tyr Glu Ile Gln Ile
785 790 795 800
Pro Ser Glu Phe Thr Ile Gly Asn Met Glu Glu Phe Ile Gln Thr Ser
805 810 815
Ser Pro Lys Val Thr Ile Asp Cys Ala Ala Phe Val Cys Gly Asp Tyr
820 825 830
Ala Ala Cys Lys Ser Gln Leu Val Glu Tyr Gly Ser Phe Cys Asp Asn
835 840 845
Ile Asn Ala Ile Leu Thr Glu Val Asn Glu Leu Leu Asp Thr Thr Gln
850 855 860
Leu Gln Val Ala Asn Ser Leu Met Asn Gly Val Thr Leu Ser Thr Lys
865 870 875 880
Leu Lys Asp Gly Val Asn Phe Asn Val Asp Asp Ile Asn Phe Ser Pro
885 890 895
Val Leu Gly Cys Leu Gly Ser Glu Cys Ser Lys Ala Ser Ser Arg Ser
900 905 910
Ala Ile Glu Asp Leu Leu Phe Asp Lys Val Lys Leu Ser Asp Val Gly
915 920 925
Phe Val Glu Ala Tyr Asn Asn Cys Thr Gly Gly Ala Glu Ile Arg Asp
930 935 940
Leu Ile Cys Val Gln Ser Tyr Lys Gly Ile Lys Val Leu Pro Pro Leu
945 950 955 960
Leu Ser Glu Asn Gln Ile Ser Gly Tyr Thr Leu Ala Ala Thr Ser Ala
965 970 975
Ser Leu Phe Pro Leu Trp Thr Ala Ala Ala Gly Val Pro Phe Tyr Leu
980 985 990
Asn Val Gln Tyr Arg Ile Asn Gly Leu Gly Val Thr Met Asp Val Leu
995 1000 1005
Ser Gln Asn Gln Lys Leu Ile Ala Asn Ala Phe Asn Asn Ala Leu
1010 1015 1020
Tyr Ala Ile Gln Glu Gly Phe Asp Ala Thr Asn Ser Ala Leu Val
1025 1030 1035
Lys Ile Gln Ala Val Val Asn Ala Asn Ala Glu Ala Leu Asn Asn
1040 1045 1050
Leu Leu Gln Gln Leu Ser Asn Arg Phe Gly Ala Ile Ser Ala Ser
1055 1060 1065
Leu Gln Glu Ile Leu Ser Arg Leu Asp Ala Leu Glu Ala Glu Ala
1070 1075 1080
Gln Ile Asp Arg Leu Ile Asn Gly Arg Leu Thr Ala Leu Asn Ala
1085 1090 1095
Tyr Val Ser Gln Gln Leu Ser Asp Ser Thr Leu Val Lys Phe Ser
1100 1105 1110
Ala Ala Gln Ala Met Glu Lys Val Asn Glu Cys Val Lys Ser Gln
1115 1120 1125
Ser Ser Arg Ile Asn Phe Cys Gly Asn Gly Asn His Ile Ile Ser
1130 1135 1140
Leu Val Gln Asn Ala Pro Tyr Gly Leu Tyr Phe Ile His Phe Ser
1145 1150 1155
Tyr Val Pro Thr Lys Tyr Val Thr Ala Arg Val Ser Pro Gly Leu
1160 1165 1170
Cys Ile Ala Gly Asp Arg Gly Ile Ala Pro Lys Ser Gly Tyr Phe
1175 1180 1185
Val Asn Val Asn Asn Thr Trp Met Tyr Thr Gly Ser Gly Tyr Tyr
1190 1195 1200
Tyr Pro Glu Pro Ile Thr Glu Asn Asn Val Val Val Met Ser Thr
1205 1210 1215
Cys Ala Val Asn Tyr Thr Lys Ala Pro Tyr Val Met Leu Asn Thr
1220 1225 1230
Ser Ile Pro Asn Leu Pro Asp Phe Lys Glu Glu Leu Asp Gln Trp
1235 1240 1245
Phe Lys Asn Gln Thr Ser Val Ala Pro Asp Leu Ser Leu Asp Tyr
1250 1255 1260
Ile Asn Val Thr Phe Leu Asp Leu Gln Val Glu Met Asn Arg Leu
1265 1270 1275
Gln Glu Ala Ile Lys Val Leu Asn Gln Ser Tyr Ile Asn Leu Lys
1280 1285 1290
Asp Ile Gly Thr Tyr Glu Tyr Tyr Val Lys Trp Pro Trp Tyr Val
1295 1300 1305
Trp Leu Leu Ile Cys Leu Ala Gly Val Ala Met Leu Val Leu Leu
1310 1315 1320
Phe Phe Ile Cys Cys Cys Thr Gly Cys Gly Thr Ser Cys Phe Lys
1325 1330 1335
Lys Cys Gly Gly Cys Cys Asp Asp Tyr Thr Gly Tyr Gln Glu Leu
1340 1345 1350
Val Ile Lys Thr Ser His Asp Asp
1355 1360
<210> 38
<211> 1383
<212> PRT
<213> porcine epidemic diarhhea virus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 38
Met Arg Ser Leu Ile Tyr Phe Trp Leu Leu Leu Pro Val Leu Pro Thr
1 5 10 15
Leu Ser Leu Pro Gln Asp Val Thr Arg Cys Gln Ser Thr Thr Asn Phe
20 25 30
Arg Arg Phe Phe Ser Lys Phe Asn Val Gln Ala Pro Ala Val Val Val
35 40 45
Leu Gly Gly Tyr Leu Pro Ser Met Asn Ser Ser Ser Trp Tyr Cys Gly
50 55 60
Thr Gly Ile Glu Thr Ala Ser Gly Val His Gly Ile Phe Leu Ser Tyr
65 70 75 80
Ile Asp Ser Gly Gln Gly Phe Glu Ile Gly Ile Ser Gln Glu Pro Phe
85 90 95
Asp Pro Ser Gly Tyr Gln Leu Tyr Leu His Lys Ala Thr Asn Gly Asn
100 105 110
Thr Asn Ala Ile Ala Arg Leu Arg Ile Cys Gln Phe Pro Asp Asn Lys
115 120 125
Thr Leu Gly Pro Thr Val Asn Asp Val Thr Thr Gly Arg Asn Cys Leu
130 135 140
Phe Asn Lys Ala Ile Pro Ala Tyr Met Arg Asp Gly Lys Asp Ile Val
145 150 155 160
Val Gly Ile Thr Trp Asp Asn Asp Arg Val Thr Val Phe Ala Asp Lys
165 170 175
Ile Tyr His Phe Tyr Leu Lys Asn Asp Trp Ser Arg Val Ala Thr Arg
180 185 190
Cys Tyr Asn Arg Arg Ser Cys Ala Met Gln Tyr Val Tyr Thr Pro Thr
195 200 205
Tyr Tyr Met Leu Asn Val Thr Ser Ala Gly Glu Asp Gly Ile Tyr Tyr
210 215 220
Glu Pro Cys Thr Ala Asn Cys Thr Gly Tyr Ala Ala Asn Val Phe Ala
225 230 235 240
Thr Asp Ser Asn Gly His Ile Pro Glu Gly Phe Ser Phe Asn Asn Trp
245 250 255
Phe Leu Leu Ser Asn Asp Ser Thr Leu Leu His Gly Lys Val Val Ser
260 265 270
Asn Gln Pro Leu Leu Val Asn Cys Leu Leu Ala Ile Pro Lys Ile Tyr
275 280 285
Gly Leu Gly Gln Phe Phe Ser Phe Asn His Thr Met Asp Gly Val Cys
290 295 300
Asn Gly Ala Ala Val Asp Arg Ala Pro Glu Ala Leu Arg Phe Asn Ile
305 310 315 320
Asn Asp Thr Ser Val Ile Leu Ala Glu Gly Ser Ile Val Leu His Thr
325 330 335
Ala Leu Gly Thr Asn Leu Ser Phe Val Cys Ser Asn Ser Ser Asp Pro
340 345 350
His Leu Ala Ile Phe Ala Ile Pro Leu Gly Ala Thr Glu Val Pro Tyr
355 360 365
Tyr Cys Phe Leu Lys Val Asp Thr Tyr Asn Ser Thr Val Tyr Lys Phe
370 375 380
Leu Ala Val Leu Pro Pro Thr Val Arg Glu Ile Val Ile Thr Lys Tyr
385 390 395 400
Gly Asp Val Tyr Val Asn Gly Phe Gly Tyr Leu His Leu Gly Leu Leu
405 410 415
Asp Ala Val Thr Ile Asn Phe Thr Gly His Gly Thr Asp Asp Asp Val
420 425 430
Ser Gly Phe Trp Thr Ile Ala Ser Thr Asn Phe Val Asp Ala Leu Ile
435 440 445
Glu Val Gln Gly Thr Ser Ile Gln Arg Ile Leu Tyr Cys Asp Asp Pro
450 455 460
Val Ser Gln Leu Lys Cys Ser Gln Val Ala Phe Asp Leu Asp Asp Gly
465 470 475 480
Phe Tyr Pro Ile Ser Ser Arg Asn Leu Leu Ser His Glu Gln Pro Ile
485 490 495
Ser Phe Val Thr Leu Pro Ser Phe Asn Asp His Ser Phe Val Asn Ile
500 505 510
Thr Val Ser Ala Ala Phe Gly Gly Leu Ser Ser Ala Asn Leu Val Ala
515 520 525
Ser Asp Thr Thr Ile Asn Gly Phe Ser Ser Phe Cys Val Asp Thr Arg
530 535 540
Gln Phe Thr Ile Thr Leu Phe Tyr Asn Val Thr Asn Ser Tyr Gly Tyr
545 550 555 560
Val Ser Lys Ser Gln Asp Ser Asn Cys Pro Phe Thr Leu Gln Ser Val
565 570 575
Asn Asp Tyr Leu Ser Phe Ser Lys Phe Cys Val Ser Thr Ser Leu Leu
580 585 590
Ala Gly Ala Cys Thr Ile Asp Leu Phe Gly Tyr Pro Ala Phe Gly Ser
595 600 605
Gly Val Lys Leu Thr Ser Leu Tyr Phe Gln Phe Thr Lys Gly Glu Leu
610 615 620
Ile Thr Gly Thr Pro Lys Pro Leu Glu Gly Ile Thr Asp Val Ser Phe
625 630 635 640
Met Thr Leu Asp Val Cys Thr Lys Tyr Thr Ile Tyr Gly Phe Lys Gly
645 650 655
Glu Gly Ile Ile Thr Leu Thr Asn Ser Ser Ile Leu Ala Gly Val Tyr
660 665 670
Tyr Thr Ser Asp Ser Gly Gln Leu Leu Ala Phe Lys Asn Val Thr Ser
675 680 685
Gly Ala Val Tyr Ser Val Thr Pro Cys Ser Phe Ser Glu Gln Ala Ala
690 695 700
Tyr Val Asn Asp Asp Ile Val Gly Val Ile Ser Ser Leu Ser Asn Ser
705 710 715 720
Thr Phe Asn Asn Thr Arg Glu Leu Pro Gly Phe Phe Tyr His Ser Asn
725 730 735
Asp Gly Ser Asn Cys Thr Glu Pro Val Leu Val Tyr Ser Asn Ile Gly
740 745 750
Val Cys Lys Ser Gly Ser Ile Gly Tyr Val Pro Ser Gln Tyr Gly Gln
755 760 765
Val Lys Ile Ala Pro Thr Val Thr Gly Asn Ile Ser Ile Pro Thr Asn
770 775 780
Phe Ser Met Ser Ile Arg Thr Glu Tyr Leu Gln Leu Tyr Asn Thr Pro
785 790 795 800
Val Ser Val Asp Cys Ala Thr Tyr Val Cys Asn Gly Asn Ser Arg Cys
805 810 815
Lys Gln Leu Leu Thr Gln Tyr Thr Ala Ala Cys Lys Thr Ile Glu Ser
820 825 830
Ala Leu Gln Leu Ser Ala Arg Leu Glu Ser Val Glu Val Asn Ser Met
835 840 845
Leu Thr Ile Ser Glu Glu Ala Leu Gln Leu Ala Thr Ile Ser Ser Phe
850 855 860
Asn Gly Asp Gly Tyr Asn Phe Thr Asn Val Leu Gly Ala Ser Val Tyr
865 870 875 880
Asp Pro Ala Ser Gly Arg Val Val Gln Lys Arg Ser Val Ile Glu Asp
885 890 895
Leu Leu Phe Asn Lys Val Val Thr Asn Gly Leu Gly Thr Val Asp Glu
900 905 910
Asp Tyr Lys Arg Cys Ser Asn Gly Arg Ser Val Ala Asp Leu Val Cys
915 920 925
Ala Gln Tyr Tyr Ser Gly Val Met Val Leu Pro Gly Val Val Asp Ala
930 935 940
Glu Lys Leu His Met Tyr Ser Ala Ser Leu Ile Gly Gly Met Ala Leu
945 950 955 960
Gly Gly Ile Thr Ala Ala Ala Ala Leu Pro Phe Ser Tyr Ala Val Gln
965 970 975
Ala Arg Leu Asn Tyr Leu Ala Leu Gln Thr Asp Val Leu Gln Arg Asn
980 985 990
Gln Gln Leu Leu Ala Glu Ser Phe Asn Ser Ala Ile Gly Asn Ile Thr
995 1000 1005
Ser Ala Phe Glu Ser Val Lys Glu Ala Ile Ser Gln Thr Ser Lys
1010 1015 1020
Gly Leu Asn Thr Val Ala His Ala Leu Thr Lys Val Gln Glu Val
1025 1030 1035
Val Asn Ser Gln Gly Ser Ala Leu Asn Gln Leu Thr Val Gln Leu
1040 1045 1050
Gln His Asn Phe Gln Ala Ile Ser Ser Ser Ile Asp Asp Ile Tyr
1055 1060 1065
Ser Arg Leu Asp Ile Leu Ser Ala Asp Val Gln Val Asp Arg Leu
1070 1075 1080
Ile Thr Gly Arg Leu Ser Ala Leu Asn Ala Phe Val Ala Gln Thr
1085 1090 1095
Leu Thr Lys Tyr Thr Glu Val Gln Ala Ser Arg Lys Leu Ala Gln
1100 1105 1110
Gln Lys Val Asn Glu Cys Val Lys Ser Gln Ser Gln Arg Tyr Gly
1115 1120 1125
Phe Cys Gly Gly Asp Gly Glu His Ile Phe Ser Leu Val Gln Ala
1130 1135 1140
Ala Pro Gln Gly Leu Leu Phe Leu His Thr Val Leu Val Pro Gly
1145 1150 1155
Asp Phe Val Asn Val Leu Ala Ile Ala Gly Leu Cys Val Asn Gly
1160 1165 1170
Glu Ile Ala Leu Thr Leu Arg Glu Pro Gly Leu Val Leu Phe Thr
1175 1180 1185
His Glu Leu Gln Thr Tyr Thr Ala Thr Glu Tyr Phe Val Ser Ser
1190 1195 1200
Arg Arg Met Phe Glu Pro Arg Lys Pro Thr Val Ser Asp Phe Val
1205 1210 1215
Gln Ile Glu Ser Cys Val Val Thr Tyr Val Asn Leu Thr Ser Asp
1220 1225 1230
Gln Leu Pro Asp Val Ile Pro Asp Tyr Ile Asp Val Asn Lys Thr
1235 1240 1245
Leu Asp Glu Ile Leu Ala Ser Leu Pro Asn Arg Thr Gly Pro Ser
1250 1255 1260
Leu Pro Leu Asp Val Phe Asn Ala Thr Tyr Leu Asn Leu Thr Gly
1265 1270 1275
Glu Ile Ala Asp Leu Glu Gln Arg Ser Glu Ser Leu Arg Asn Thr
1280 1285 1290
Thr Glu Glu Leu Arg Ser Leu Ile Asn Asn Ile Asn Asn Thr Leu
1295 1300 1305
Val Asp Leu Glu Trp Leu Asn Arg Val Glu Thr Tyr Ile Lys Trp
1310 1315 1320
Pro Trp Trp Val Trp Leu Ile Ile Val Ile Val Leu Ile Phe Val
1325 1330 1335
Val Ser Leu Leu Val Phe Cys Cys Ile Ser Thr Gly Cys Cys Gly
1340 1345 1350
Cys Cys Gly Cys Cys Gly Ala Cys Phe Ser Gly Cys Cys Arg Gly
1355 1360 1365
Pro Arg Leu Gln Pro Tyr Glu Ala Phe Glu Lys Val His Val Gln
1370 1375 1380
<210> 39
<211> 1349
<212> PRT
<213> porcine haemagglutinating encephalomyelitis virus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 39
Met Phe Phe Ile Leu Leu Ile Ser Leu Pro Ser Ala Phe Ala Val Ile
1 5 10 15
Gly Asp Leu Lys Cys Thr Thr Ser Leu Ile Asn Asp Val Asp Thr Gly
20 25 30
Val Pro Ser Ile Ser Ser Glu Val Val Asp Val Thr Asn Gly Leu Gly
35 40 45
Thr Phe Tyr Val Leu Asp Arg Val Tyr Leu Asn Thr Thr Leu Leu Leu
50 55 60
Asn Gly Tyr Tyr Pro Ile Ser Gly Ala Thr Phe Arg Asn Met Ala Leu
65 70 75 80
Lys Gly Thr Arg Leu Leu Ser Thr Leu Trp Phe Lys Pro Pro Phe Leu
85 90 95
Ser Pro Phe Asn Asp Gly Ile Phe Ala Lys Val Lys Asn Ser Arg Phe
100 105 110
Ser Lys Asp Gly Val Ile Tyr Ser Glu Phe Pro Ala Ile Thr Ile Gly
115 120 125
Ser Thr Phe Val Asn Thr Ser Tyr Ser Ile Val Val Glu Pro His Thr
130 135 140
Ser Leu Ile Asn Gly Asn Leu Gln Gly Leu Leu Gln Ile Ser Val Cys
145 150 155 160
Gln Tyr Thr Met Cys Glu Tyr Pro His Thr Ile Cys His Pro Asn Leu
165 170 175
Gly Asn Gln Arg Ile Glu Leu Trp His Tyr Asp Thr Asp Val Val Ser
180 185 190
Cys Leu Tyr Arg Arg Asn Phe Thr Tyr Asp Val Asn Ala Asp Tyr Leu
195 200 205
Tyr Phe His Phe Tyr Gln Glu Gly Gly Thr Phe Tyr Ala Tyr Phe Thr
210 215 220
Asp Thr Gly Phe Val Thr Lys Phe Leu Phe Lys Leu Tyr Leu Gly Thr
225 230 235 240
Val Leu Ser His Tyr Tyr Val Met Pro Leu Thr Cys Asn Ser Ala Leu
245 250 255
Ser Leu Glu Tyr Trp Val Thr Pro Leu Thr Thr Arg Gln Phe Leu Leu
260 265 270
Ala Phe Asp Gln Asp Gly Val Leu Tyr His Ala Val Asp Cys Ala Ser
275 280 285
Asp Phe Met Ser Glu Ile Met Cys Lys Thr Ser Ser Ile Thr Pro Pro
290 295 300
Thr Gly Val Tyr Glu Leu Asn Gly Tyr Thr Val Gln Pro Val Ala Thr
305 310 315 320
Val Tyr Arg Arg Ile Pro Asp Leu Pro Asn Cys Asp Ile Glu Ala Trp
325 330 335
Leu Asn Ser Lys Thr Val Ser Ser Pro Leu Asn Trp Glu Arg Lys Ile
340 345 350
Phe Ser Asn Cys Asn Phe Asn Met Gly Arg Leu Met Ser Phe Ile Gln
355 360 365
Ala Asp Ser Phe Gly Cys Asn Asn Ile Asp Ala Ser Arg Leu Tyr Gly
370 375 380
Met Cys Phe Gly Ser Ile Thr Ile Asp Lys Phe Ala Ile Pro Asn Ser
385 390 395 400
Arg Lys Val Asp Leu Gln Val Gly Lys Ser Gly Tyr Leu Gln Ser Phe
405 410 415
Asn Tyr Lys Ile Asp Thr Ala Val Ser Ser Cys Gln Leu Tyr Tyr Ser
420 425 430
Leu Pro Ala Ala Asn Val Ser Val Thr His Tyr Asn Pro Ser Ser Trp
435 440 445
Asn Arg Arg Tyr Gly Phe Asn Asn Gln Ser Phe Gly Ser Arg Gly Leu
450 455 460
His Asp Ala Val Tyr Ser Gln Gln Cys Phe Asn Thr Pro Asn Thr Tyr
465 470 475 480
Cys Pro Cys Arg Thr Ser Gln Cys Ile Gly Gly Ala Gly Thr Gly Thr
485 490 495
Cys Pro Val Gly Thr Thr Val Arg Lys Cys Phe Ala Ala Val Thr Lys
500 505 510
Ala Thr Lys Cys Thr Cys Trp Cys Gln Pro Asp Pro Ser Thr Tyr Lys
515 520 525
Gly Val Asn Ala Trp Thr Cys Pro Gln Ser Lys Val Ser Ile Gln Pro
530 535 540
Gly Gln His Cys Pro Gly Leu Gly Leu Val Glu Asp Asp Cys Ser Gly
545 550 555 560
Asn Pro Cys Thr Cys Lys Pro Gln Ala Phe Ile Gly Trp Ser Ser Glu
565 570 575
Thr Cys Leu Gln Asn Gly Arg Cys Asn Ile Phe Ala Asn Phe Ile Leu
580 585 590
Asn Asp Val Asn Ser Gly Thr Thr Cys Ser Thr Asp Leu Gln Gln Gly
595 600 605
Asn Thr Ile Ile Thr Thr Asp Val Cys Val Asn Tyr Asp Leu Tyr Gly
610 615 620
Ile Thr Gly Gln Gly Ile Leu Ile Glu Val Asn Ala Thr Tyr Tyr Asn
625 630 635 640
Ser Trp Gln Asn Leu Leu Tyr Asp Ser Ser Gly Asn Leu Tyr Gly Phe
645 650 655
Arg Asp Tyr Leu Ser Asn Arg Thr Phe Leu Ile Arg Ser Cys Tyr Ser
660 665 670
Gly Arg Val Ser Ala Val Phe His Ala Asn Ser Ser Glu Pro Ala Leu
675 680 685
Met Phe Arg Asn Leu Lys Cys Ser His Val Phe Asn Asn Thr Ile Leu
690 695 700
Arg Gln Ile Gln Leu Val Asn Tyr Phe Asp Ser Tyr Leu Gly Cys Val
705 710 715 720
Val Asn Ala Tyr Asn Asn Thr Ala Ser Ala Val Ser Thr Cys Asp Leu
725 730 735
Thr Val Gly Ser Gly Tyr Cys Val Asp Tyr Val Thr Ala Leu Arg Ser
740 745 750
Arg Arg Ser Phe Thr Thr Gly Tyr Arg Phe Thr Asn Phe Glu Pro Phe
755 760 765
Ala Ala Asn Leu Val Asn Asp Ser Ile Glu Pro Val Gly Gly Leu Tyr
770 775 780
Glu Ile Gln Ile Pro Ser Glu Phe Thr Ile Gly Asn Leu Glu Glu Phe
785 790 795 800
Ile Gln Thr Arg Ser Pro Lys Val Thr Ile Asp Cys Ala Thr Phe Val
805 810 815
Cys Gly Asp Tyr Ala Ala Cys Arg Gln Gln Leu Ala Glu Tyr Gly Ser
820 825 830
Phe Cys Glu Asn Ile Asn Ala Ile Leu Thr Glu Val Asn Glu Leu Leu
835 840 845
Asp Thr Thr Gln Leu Gln Val Ala Asn Ser Leu Met Asn Gly Val Thr
850 855 860
Leu Ser Thr Lys Ile Lys Asp Gly Ile Asn Phe Asn Val Asp Asp Ile
865 870 875 880
Asn Phe Ser Pro Val Leu Gly Cys Leu Gly Ser Glu Cys Asn Arg Ala
885 890 895
Ser Thr Arg Ser Ala Ile Glu Asp Leu Leu Phe Asp Lys Val Lys Leu
900 905 910
Ser Asp Val Gly Phe Val Gln Ala Tyr Asn Asn Cys Thr Gly Gly Ala
915 920 925
Glu Ile Arg Asp Leu Ile Cys Val Gln Ser Tyr Asn Gly Ile Lys Val
930 935 940
Leu Pro Pro Leu Leu Ser Glu Asn Gln Ile Ser Gly Tyr Thr Leu Ala
945 950 955 960
Ala Thr Ala Ala Ser Leu Phe Pro Pro Trp Thr Ala Ala Ala Gly Val
965 970 975
Pro Phe Tyr Leu Asn Val Gln Tyr Arg Ile Asn Gly Leu Gly Val Thr
980 985 990
Met Asp Val Leu Ser Gln Asn Gln Lys Leu Ile Ala Ser Ala Phe Asn
995 1000 1005
Asn Ala Leu Asp Ala Ile Gln Glu Gly Phe Asp Ala Thr Asn Ser
1010 1015 1020
Ala Leu Val Lys Ile Gln Ala Val Val Asn Ala Asn Ala Glu Ala
1025 1030 1035
Leu Asn Asn Leu Leu Gln Gln Leu Ser Asn Arg Phe Gly Ala Ile
1040 1045 1050
Ser Ala Ser Leu Gln Glu Ile Leu Ser Arg Leu Asp Ala Leu Glu
1055 1060 1065
Ala Lys Ala Gln Ile Asp Arg Leu Ile Asn Gly Arg Leu Thr Ala
1070 1075 1080
Leu Asn Ala Tyr Val Ser Gln Gln Leu Ser Asp Ser Thr Leu Val
1085 1090 1095
Lys Phe Ser Ala Ala Gln Ala Ile Glu Lys Val Asn Glu Cys Val
1100 1105 1110
Lys Ser Gln Ser Ser Arg Ile Asn Phe Cys Gly Asn Gly Asn His
1115 1120 1125
Ile Ile Ser Leu Val Gln Asn Ala Pro Tyr Gly Leu Tyr Phe Ile
1130 1135 1140
His Phe Ser Tyr Val Pro Thr Lys Tyr Val Thr Ala Lys Val Ser
1145 1150 1155
Pro Gly Leu Cys Ile Ala Gly Asp Ile Gly Ile Ser Pro Lys Ser
1160 1165 1170
Gly Tyr Phe Ile Asn Val Asn Asn Ser Trp Met Phe Thr Gly Ser
1175 1180 1185
Ser Tyr Tyr Tyr Pro Glu Pro Ile Thr Gln Asn Asn Val Val Val
1190 1195 1200
Met Ser Thr Cys Ala Val Asn Tyr Thr Lys Ala Pro Asp Leu Met
1205 1210 1215
Leu Asn Thr Ser Thr Pro Asn Leu Pro Asp Phe Lys Glu Glu Leu
1220 1225 1230
Tyr Gln Trp Phe Lys Asn Gln Ser Ser Val Ala Pro Asp Leu Ser
1235 1240 1245
Leu Asp Tyr Ile Asn Val Thr Phe Leu Asp Leu Gln Asp Glu Met
1250 1255 1260
Asn Arg Leu Gln Glu Ala Ile Lys Val Leu Asn Gln Ser Tyr Ile
1265 1270 1275
Asn Leu Lys Asp Ile Gly Thr Tyr Glu Tyr Tyr Val Lys Trp Pro
1280 1285 1290
Trp Tyr Val Trp Leu Leu Ile Gly Leu Ala Gly Val Ala Met Leu
1295 1300 1305
Val Leu Leu Phe Phe Ile Cys Cys Cys Thr Gly Cys Gly Thr Ser
1310 1315 1320
Cys Phe Lys Lys Cys Gly Gly Cys Cys Asp Asp Tyr Thr Gly His
1325 1330 1335
Gln Glu Phe Val Ile Lys Thr Ser His Asp Asp
1340 1345
<210> 40
<211> 1225
<212> PRT
<213> porcine respiratory coronavirus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 40
Met Lys Lys Leu Phe Val Val Leu Val Val Met Pro Leu Ile Tyr Gly
1 5 10 15
Asp Lys Phe Pro Thr Ser Val Val Ser Asn Cys Thr Asp Gln Cys Ala
20 25 30
Ser Tyr Val Ala Asn Val Phe Thr Ile Leu Pro Gly Gly Phe Ile Pro
35 40 45
Ser Asp Phe Ser Phe Asn Asn Trp Phe Leu Leu Thr Asn Ser Ser Thr
50 55 60
Leu Val Asn Gly Lys Leu Val Thr Lys Gln Pro Leu Leu Val Asn Cys
65 70 75 80
Leu Trp Pro Val Pro Ser Phe Glu Glu Val Ala Ser Thr Phe Cys Phe
85 90 95
Glu Gly Ala Asp Phe Asp Gln Cys Asn Gly Ala Val Leu Asn Asn Thr
100 105 110
Val Asp Val Ile Arg Phe Asn Leu Asn Phe Thr Thr Asn Val Gln Ser
115 120 125
Gly Lys Gly Ala Thr Val Phe Ser Leu Asn Thr Thr Gly Gly Val Thr
130 135 140
Leu Glu Ile Ser Cys Tyr Asn Asp Thr Val Ser Asp Ser Ser Phe Ser
145 150 155 160
Ser Tyr Gly Glu Ile Pro Phe Gly Val Thr Asn Gly Pro Arg Tyr Cys
165 170 175
Tyr Val Leu Tyr Asn Gly Thr Ala Leu Lys Tyr Leu Gly Thr Leu Pro
180 185 190
Pro Ser Val Lys Glu Ile Ala Ile Ser Lys Trp Gly His Phe Tyr Ile
195 200 205
Asn Gly Tyr Asn Phe Phe Ser Thr Phe Pro Ile Asp Cys Ile Ser Phe
210 215 220
Asn Leu Thr Thr Gly Asp Ser Asp Val Phe Trp Thr Ile Ala Tyr Thr
225 230 235 240
Ser Tyr Thr Glu Ala Leu Val Gln Val Glu Asn Thr Ala Ile Thr Asn
245 250 255
Val Thr Tyr Cys Asn Ser Tyr Val Asn Asn Ile Lys Cys Ser Gln Leu
260 265 270
Thr Ala Asn Leu Asn Asn Gly Phe Tyr Pro Val Ser Ser Ser Glu Val
275 280 285
Gly Ser Val Asn Lys Ser Val Val Leu Leu Pro Ser Phe Leu Thr His
290 295 300
Thr Ile Val Asn Ile Thr Ile Gly Leu Gly Met Lys Arg Ser Gly Tyr
305 310 315 320
Gly Gln Pro Ile Ala Ser Thr Leu Ser Asn Ile Thr Leu Pro Met Gln
325 330 335
Asp Asn Asn Asn Asp Val Tyr Cys Val Arg Ser Asp Gln Phe Ser Val
340 345 350
Tyr Val His Ser Thr Cys Lys Ser Val Leu Trp Asp Asn Val Phe Lys
355 360 365
Arg Asn Cys Thr Asp Val Leu Asp Ala Thr Ala Val Ile Lys Thr Gly
370 375 380
Thr Cys Pro Phe Ser Phe Asp Lys Leu Asn Asn Tyr Leu Thr Phe Asn
385 390 395 400
Lys Phe Cys Leu Ser Leu Ser Pro Val Gly Ala Asn Cys Lys Phe Asp
405 410 415
Val Ala Ala Arg Thr Arg Thr Asn Asp Gln Val Val Arg Ser Leu Tyr
420 425 430
Val Ile Tyr Glu Glu Gly Asp Ser Ile Val Gly Val Pro Ser Asp Asn
435 440 445
Ser Gly Leu His Asp Leu Ser Val Leu His Leu Asp Ser Cys Thr Asp
450 455 460
Tyr Asn Ile Tyr Gly Arg Thr Gly Val Gly Ile Ile Arg Gln Thr Asn
465 470 475 480
Arg Thr Ile Leu Ser Gly Leu Tyr Tyr Thr Ser Leu Ser Gly Asp Leu
485 490 495
Leu Gly Phe Thr Asn Val Ser Asp Gly Val Ile Tyr Ser Val Thr Pro
500 505 510
Cys Asp Val Ser Ala Gln Ala Ala Ile Ile Asp Gly Thr Ile Val Gly
515 520 525
Ala Ile Thr Ser Ile Asn Ser Glu Leu Leu Gly Leu Thr His Trp Thr
530 535 540
Thr Thr Pro Asn Phe Tyr Tyr Tyr Ser Ile Tyr Asn Tyr Thr Asn Asp
545 550 555 560
Lys Thr Arg Gly Thr Pro Ile Gly Ser Asn Asp Val Asp Cys Glu Pro
565 570 575
Val Ile Thr Tyr Ser Asn Ile Gly Val Cys Lys Asn Gly Ala Leu Val
580 585 590
Phe Ile Asn Val Thr His Ser Asp Gly Asp Val Gln Pro Ile Ser Thr
595 600 605
Gly Asn Val Thr Ile Pro Thr Asn Phe Thr Ile Ser Val Gln Val Glu
610 615 620
Tyr Ile Gln Val Tyr Thr Thr Pro Val Ser Ile Asp Cys Ser Arg Tyr
625 630 635 640
Val Cys Asn Gly Asn Pro Arg Cys Asn Lys Leu Leu Thr Gln Tyr Val
645 650 655
Ser Ala Cys Gln Thr Ile Glu Gln Ala Leu Ala Met Gly Ala Arg Leu
660 665 670
Glu Asn Met Glu Val Asp Ser Met Leu Phe Val Ser Glu Asn Ala Leu
675 680 685
Lys Leu Ala Ser Val Glu Ala Phe Asn Ser Ser Glu Thr Leu Asp Pro
690 695 700
Ile Tyr Lys Glu Trp Pro Asn Ile Gly Gly Phe Trp Leu Glu Gly Leu
705 710 715 720
Lys Tyr Ile Leu Pro Ser Asp Asn Ser Lys Arg Lys Tyr Arg Ser Ala
725 730 735
Ile Glu Asp Leu Leu Phe Ser Lys Val Val Thr Ser Gly Leu Gly Thr
740 745 750
Val Asp Glu Asp Tyr Lys Arg Cys Thr Gly Gly Tyr Asp Ile Ala Asp
755 760 765
Leu Val Cys Ala Gln Tyr Tyr Asn Gly Ile Met Val Leu Pro Gly Val
770 775 780
Ala Asn Ala Asp Lys Met Thr Met Tyr Thr Ala Ser Leu Ala Gly Gly
785 790 795 800
Ile Thr Leu Gly Ala Leu Gly Gly Gly Ala Val Ala Ile Pro Phe Ala
805 810 815
Val Ala Val Gln Ala Arg Leu Asn Tyr Val Ala Leu Gln Thr Asp Val
820 825 830
Leu Asn Lys Asn Gln Gln Ile Leu Ala Ser Ala Phe Asn Gln Ala Ile
835 840 845
Gly Asn Ile Thr Gln Ser Phe Gly Lys Val Asn Asp Ala Ile His Gln
850 855 860
Thr Ser Arg Gly Leu Thr Thr Val Ala Lys Ala Leu Ala Lys Val Gln
865 870 875 880
Asp Val Val Asn Thr Gln Gly Gln Ala Leu Arg His Leu Thr Val Gln
885 890 895
Leu Gln Asn Asn Phe Gln Ala Ile Ser Ser Ser Ile Ser Asp Ile Tyr
900 905 910
Asn Arg Leu Asp Glu Leu Ser Ala Asp Ala Gln Val Asp Arg Leu Ile
915 920 925
Thr Gly Arg Leu Thr Ala Leu Asn Ala Phe Val Ser Gln Thr Leu Thr
930 935 940
Arg Gln Ala Glu Val Arg Ala Ser Arg Gln Leu Ala Lys Asp Lys Val
945 950 955 960
Asn Glu Cys Val Arg Ser Gln Ser Gln Arg Phe Gly Phe Cys Gly Asn
965 970 975
Gly Thr His Leu Phe Ser Leu Ala Asn Ala Ala Pro Asn Gly Met Ile
980 985 990
Phe Phe His Thr Val Leu Leu Pro Thr Ala Tyr Glu Thr Val Thr Ala
995 1000 1005
Trp Ser Gly Ile Cys Ala Leu Asp Val Asp Arg Thr Phe Gly Leu
1010 1015 1020
Val Val Lys Asp Val Gln Leu Thr Leu Phe Arg Asn Leu Asp Asp
1025 1030 1035
Lys Phe Tyr Leu Thr Pro Arg Thr Met Tyr Gln Pro Arg Val Ala
1040 1045 1050
Thr Ser Ser Asp Phe Val Gln Ile Glu Gly Cys Asp Val Leu Phe
1055 1060 1065
Val Asn Thr Thr Val Ser Asp Leu Pro Ser Ile Ile Pro Asp Tyr
1070 1075 1080
Ile Asp Ile Asn Gln Thr Val Gln Asp Ile Leu Glu Asn Phe Arg
1085 1090 1095
Pro Asn Trp Thr Val Pro Glu Leu Thr Leu Asp Val Phe Asn Ala
1100 1105 1110
Thr Tyr Leu Asn Leu Thr Gly Glu Ile Asp Asp Leu Glu Phe Arg
1115 1120 1125
Ser Glu Lys Leu His Asn Thr Thr Val Glu Leu Ala Ile Leu Ile
1130 1135 1140
Asp Asn Ile Asn Asn Thr Val Val Asn Leu Glu Trp Leu Asn Arg
1145 1150 1155
Ile Glu Thr Tyr Val Lys Trp Pro Trp Tyr Val Trp Leu Leu Ile
1160 1165 1170
Gly Leu Val Val Ile Phe Cys Ile Pro Leu Leu Leu Phe Cys Cys
1175 1180 1185
Cys Ser Thr Gly Cys Cys Gly Cys Ile Gly Cys Leu Gly Ser Cys
1190 1195 1200
Cys His Ser Ile Phe Ser Arg Arg Gln Phe Glu Asn Tyr Glu Pro
1205 1210 1215
Ile Glu Lys Val His Val His
1220 1225
<210> 41
<211> 1360
<212> PRT
<213> rat coronavirus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 41
Met Leu Phe Val Phe Leu Thr Leu Leu Pro Ser Cys Leu Gly Tyr Ile
1 5 10 15
Gly Asp Phe Arg Cys Ile Asn Leu Val Asn Thr Arg Ile Ser Asn Ala
20 25 30
Arg Ala Pro Ser Val Ser Thr Glu Val Val Asp Val Ser Lys Gly Leu
35 40 45
Gly Thr Tyr Tyr Val Leu Asp Arg Val Tyr Leu Asn Ala Thr Leu Leu
50 55 60
Leu Thr Gly Tyr Tyr Pro Val Asp Gly Ser Met Tyr Arg Asn Met Ala
65 70 75 80
Leu Met Gly Thr Asn Thr Leu Ser Leu Asn Trp Phe Glu Pro Pro Phe
85 90 95
Leu Ser Glu Phe Asn Asp Gly Ile Tyr Ala Lys Val Lys Asn Leu Lys
100 105 110
Ala Ser Leu Pro Ile Gly Ser Ala Ser Tyr Phe Pro Thr Ile Ile Ile
115 120 125
Gly Ser Asn Phe Val Asn Thr Ser Tyr Thr Val Val Leu Glu Pro Tyr
130 135 140
Asn Gly Ile Ile Met Ala Ser Ile Cys Gln Tyr Thr Ile Cys Gln Leu
145 150 155 160
Pro His Thr Asp Cys Lys Pro Asn Thr Gly Gly Asn Thr Leu Ile Gly
165 170 175
Phe Trp His Thr Asp Leu Arg Pro Pro Val Cys Ile Leu Lys Arg Asn
180 185 190
Phe Thr Phe Asn Val Asn Ala Glu Trp Leu Tyr Phe His Phe Tyr Gln
195 200 205
Gln Gly Gly Thr Phe Tyr Ala Tyr Tyr Ala Asp Val Ser Ser Ala Thr
210 215 220
Thr Phe Leu Phe Ser Ser Tyr Ile Gly Ala Val Leu Thr Gln Tyr Phe
225 230 235 240
Val Leu Pro Tyr Met Cys Ser Pro Thr Thr Ser Gly Val Ser Ser Pro
245 250 255
Gln Tyr Trp Val Thr Pro Leu Val Lys Arg Gln Tyr Leu Phe Asn Phe
260 265 270
Asn Gln Lys Gly Ile Ile Thr Ser Ala Val Asp Cys Ala Ser Ser Tyr
275 280 285
Thr Ser Glu Ile Lys Cys Lys Thr Gln Ser Met Asn Pro Asn Thr Gly
290 295 300
Val Tyr Asp Leu Ser Gly Tyr Thr Val Gln Pro Val Gly Leu Val Tyr
305 310 315 320
Arg Arg Val Arg Asn Leu Pro Asp Cys Lys Ile Glu Glu Trp Leu Ala
325 330 335
Ala Asn Thr Val Pro Ser Pro Leu Asn Trp Glu Arg Lys Thr Phe Gln
340 345 350
Asn Cys Asn Phe Asn Leu Ser Ser Leu Leu Arg Phe Val Gln Ala Glu
355 360 365
Ser Leu Ser Cys Ser Asn Ile Asp Ala Ser Lys Val Tyr Gly Met Cys
370 375 380
Phe Gly Ser Ile Ser Ile Asp Lys Phe Ala Ile Pro Asn Ser Arg Arg
385 390 395 400
Val Asp Leu Gln Leu Gly Lys Ser Gly Leu Leu Gln Ser Phe Asn Tyr
405 410 415
Lys Ile Asp Thr Arg Ala Thr Ser Cys Gln Leu Tyr Tyr Ser Leu Ala
420 425 430
Gln Asp Asn Val Thr Val Ile Asn His Asn Pro Ser Ser Trp Asn Arg
435 440 445
Arg Tyr Gly Phe Asn Asp Val Ala Thr Phe His Ser Gly Glu His Asp
450 455 460
Val Ala Tyr Ala Glu Ala Cys Phe Thr Val Gly Ala Ser Tyr Cys Pro
465 470 475 480
Cys Ala Lys Pro Ser Thr Val Tyr Ser Cys Val Thr Gly Lys Pro Lys
485 490 495
Ser Ala Asn Cys Pro Thr Gly Thr Ser Asn Arg Glu Cys Asn Val Gln
500 505 510
Ala Ser Gly Phe Lys Ser Lys Cys Asp Cys Thr Cys Asn Pro Ser Pro
515 520 525
Leu Thr Thr Tyr Asp Pro Arg Cys Leu Gln Ala Arg Ser Met Leu Gly
530 535 540
Val Gly Asp His Cys Glu Gly Leu Gly Ile Leu Glu Asp Lys Cys Gly
545 550 555 560
Gly Ser Asn Ile Cys Asn Cys Ser Ala Asp Ala Phe Val Gly Trp Ala
565 570 575
Met Asp Ser Cys Leu Ser Asn Ala Arg Cys His Ile Phe Ser Asn Leu
580 585 590
Met Leu Asn Gly Ile Asn Ser Gly Thr Thr Cys Ser Thr Asp Phe Gln
595 600 605
Leu Pro Asn Thr Glu Val Val Thr Gly Val Cys Val Lys Tyr Asp Leu
610 615 620
Tyr Gly Ser Thr Gly Gln Gly Val Phe Lys Glu Val Lys Ala Asp Tyr
625 630 635 640
Tyr Asn Ser Trp Gln Asn Leu Leu Tyr Asp Val Asn Gly Asn Leu Asn
645 650 655
Gly Phe Arg Asp Ile Val Thr Asn Lys Thr Tyr Leu Leu Arg Ser Cys
660 665 670
Tyr Ser Gly Arg Val Ser Ala Ala Tyr His Gln Asp Ala Pro Glu Pro
675 680 685
Ala Leu Leu Tyr Arg Asn Leu Lys Cys Asp Tyr Val Phe Asn Asn Asn
690 695 700
Ile Ser Arg Glu Glu Thr Pro Leu Asn Tyr Phe Asp Ser Tyr Leu Gly
705 710 715 720
Cys Val Ile Asn Ala Asp Asn Ser Thr Glu Gln Ser Val Asp Ala Cys
725 730 735
Asp Leu Arg Met Gly Ser Gly Leu Cys Val Asn Tyr Ser Ile Ala His
740 745 750
Arg Ala Arg Arg Ser Val Ser Thr Gly Tyr Lys Leu Thr Thr Phe Glu
755 760 765
Pro Phe Thr Val Ser Ile Val Asn Asp Ser Val Glu Ser Val Gly Gly
770 775 780
Leu Tyr Glu Met Gln Ile Pro Thr Asn Phe Thr Ile Ala Ser His Gln
785 790 795 800
Glu Phe Ile Gln Thr Arg Ser Pro Lys Val Thr Ile Asp Cys Ala Ala
805 810 815
Phe Val Cys Gly Asp Tyr Thr Ala Cys Arg Gln Gln Leu Val Asp Tyr
820 825 830
Gly Ser Phe Cys Asp Asn Ile Asn Ala Ile Leu Gly Glu Val Asn Asn
835 840 845
Leu Ile Asp Thr Met Gln Leu Gln Val Ala Ser Ala Leu Ile Gln Gly
850 855 860
Val Thr Leu Ser Ser Arg Leu Ala Asp Gly Ile Ser Gly Gln Ile Asp
865 870 875 880
Asp Ile Asn Phe Ser Pro Leu Leu Gly Cys Leu Gly Ser Asp Cys Ser
885 890 895
Glu Gly Thr Lys Ala Ala Gln Gly Arg Ser Ala Ile Glu Asp Val Leu
900 905 910
Phe Asp Lys Val Lys Leu Ser Asp Val Gly Phe Val Glu Ser Tyr Asn
915 920 925
Asn Cys Thr Gly Gly Gln Glu Val Arg Asp Leu Leu Cys Val Gln Ser
930 935 940
Phe Asn Gly Ile Lys Val Leu Pro Pro Val Leu Ser Glu Ser Gln Ile
945 950 955 960
Ser Gly Tyr Thr Ala Gly Ala Thr Ala Ser Ala Met Phe Pro Pro Trp
965 970 975
Ser Ala Ala Ala Gly Val Pro Phe Ala Leu Ser Val Gln Tyr Arg Ile
980 985 990
Asn Gly Leu Gly Val Thr Met Asn Val Leu Ser Glu Asn Gln Lys Met
995 1000 1005
Ile Ala Ser Ser Phe Asn Asn Ala Ile Gly Ala Ile Gln Glu Gly
1010 1015 1020
Phe Asp Ala Thr Asn Ser Ala Leu Ala Lys Ile Gln Ser Val Val
1025 1030 1035
Asn Ala Asn Ala Glu Ala Leu Asn Asn Leu Leu Asn Gln Leu Ser
1040 1045 1050
Asn Arg Phe Gly Ala Ile Ser Ala Ser Leu Gln Glu Ile Leu Ser
1055 1060 1065
Arg Leu Asp Ala Leu Glu Ala Gln Ala Gln Ile Asp Arg Leu Ile
1070 1075 1080
Asn Gly Arg Leu Thr Ala Leu Asn Ala Tyr Val Ser Lys Gln Leu
1085 1090 1095
Ser Asp Met Thr Leu Ile Lys Val Ser Ala Ala Gln Ala Ile Glu
1100 1105 1110
Lys Val Asn Glu Cys Val Lys Ser Gln Ser Pro Arg Ile Asn Phe
1115 1120 1125
Cys Gly Asn Gly Asn His Ile Leu Ser Leu Val Gln Asn Ala Pro
1130 1135 1140
Tyr Gly Leu Tyr Phe Ile His Phe Ser Tyr Val Pro Thr Ser Phe
1145 1150 1155
Thr Thr Val Asn Val Ser Pro Gly Leu Cys Ile Ser Gly Asp Arg
1160 1165 1170
Gly Leu Ala Pro Lys Ala Gly Tyr Phe Val Gln Asp His Gly Glu
1175 1180 1185
Trp Lys Phe Thr Gly Ser Asn Tyr Tyr Tyr Pro Glu Ser Ile Thr
1190 1195 1200
Asp Lys Asn Ser Val Val Met Ser Ser Cys Ala Val Asn Tyr Thr
1205 1210 1215
Lys Ala Pro Glu Val Phe Leu Asn Thr Ser Ile Thr Asn Leu Pro
1220 1225 1230
Asp Phe Lys Glu Glu Leu Asp Lys Trp Phe Lys Asn Gln Thr Ser
1235 1240 1245
Ile Val Pro Asp Leu Ser Phe Asp Ile Gly Lys Leu Asn Val Thr
1250 1255 1260
Phe Leu Asp Leu Ser Tyr Glu Met Asn Arg Ile Gln Asp Ala Ile
1265 1270 1275
Lys Asn Leu Asn Glu Ser Tyr Ile Asn Leu Lys Glu Ile Gly Thr
1280 1285 1290
Tyr Glu Met Tyr Val Lys Trp Pro Trp Tyr Val Trp Leu Leu Ile
1295 1300 1305
Gly Leu Ala Gly Val Ala Val Cys Val Leu Leu Phe Phe Ile Cys
1310 1315 1320
Cys Cys Thr Gly Cys Gly Ser Cys Cys Phe Lys Lys Cys Gly Asn
1325 1330 1335
Cys Cys Asp Glu Tyr Gly Gly Arg Gln Ala Gly Ile Val Ile His
1340 1345 1350
Asn Ile Ser Ser His Glu Asp
1355 1360
<210> 42
<211> 1255
<212> PRT
<213> human SARS virus
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 42
Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15
Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30
His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45
Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60
Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80
Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95
Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110
Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125
Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140
Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160
Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175
Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190
Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
195 200 205
Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
210 215 220
Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
225 230 235 240
Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
245 250 255
Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
260 265 270
Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
275 280 285
Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
290 295 300
Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
305 310 315 320
Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
325 330 335
Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
340 345 350
Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
355 360 365
Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
370 375 380
Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
385 390 395 400
Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
405 410 415
Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
420 425 430
Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
435 440 445
Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
450 455 460
Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
465 470 475 480
Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
530 535 540
Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
545 550 555 560
Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
580 585 590
Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
610 615 620
Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
660 665 670
Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
675 680 685
Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
690 695 700
Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
705 710 715 720
Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
725 730 735
Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
740 745 750
Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
755 760 765
Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
770 775 780
Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
785 790 795 800
Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
805 810 815
Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
820 825 830
Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
835 840 845
Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
850 855 860
Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
865 870 875 880
Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
885 890 895
Val Leu Tyr Glu Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala
900 905 910
Ile Ser Gln Ile Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly
915 920 925
Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
930 935 940
Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
945 950 955 960
Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp
965 970 975
Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
980 985 990
Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala
995 1000 1005
Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp
1010 1015 1020
Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ala Ala
1025 1030 1035
Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ser Gln
1040 1045 1050
Glu Arg Asn Phe Thr Thr Ala Pro Ala Ile Cys His Glu Gly Lys
1055 1060 1065
Ala Tyr Phe Pro Arg Glu Gly Val Phe Val Phe Asn Gly Thr Ser
1070 1075 1080
Trp Phe Ile Thr Gln Arg Asn Phe Phe Ser Pro Gln Ile Ile Thr
1085 1090 1095
Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly
1100 1105 1110
Ile Ile Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp
1115 1120 1125
Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser
1130 1135 1140
Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val
1145 1150 1155
Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys
1160 1165 1170
Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr
1175 1180 1185
Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Val Trp Leu Gly Phe Ile
1190 1195 1200
Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile Leu Leu Cys Cys
1205 1210 1215
Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Ala Cys Ser Cys Gly
1220 1225 1230
Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu Lys
1235 1240 1245
Gly Val Lys Leu His Tyr Thr
1250 1255
<210> 43
<211> 1173
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> spike protein
<400> 43
Met Phe Val Leu Leu Val Ala Tyr Ala Leu Leu His Ile Ala Gly Cys
1 5 10 15
Gln Thr Thr Asn Gly Leu Asn Thr Ser Tyr Ser Val Cys Asn Gly Cys
20 25 30
Val Gly Tyr Ser Glu Asn Val Phe Ala Val Glu Ser Gly Gly Tyr Ile
35 40 45
Pro Ser Asp Phe Ala Phe Asn Asn Trp Phe Leu Leu Thr Asn Thr Ser
50 55 60
Ser Val Val Asp Gly Val Val Arg Ser Phe Gln Pro Leu Leu Leu Asn
65 70 75 80
Cys Leu Trp Ser Val Ser Gly Leu Arg Phe Thr Thr Gly Phe Val Tyr
85 90 95
Phe Asn Gly Thr Gly Arg Gly Asp Cys Lys Gly Phe Ser Ser Asp Val
100 105 110
Leu Ser Asp Val Ile Arg Tyr Asn Leu Asn Phe Glu Glu Asn Leu Arg
115 120 125
Arg Gly Thr Ile Leu Phe Lys Thr Ser Tyr Gly Val Val Val Phe Tyr
130 135 140
Cys Thr Asn Asn Thr Leu Val Ser Gly Asp Ala His Ile Pro Phe Gly
145 150 155 160
Thr Val Leu Gly Asn Phe Tyr Cys Phe Val Asn Thr Thr Ile Gly Asn
165 170 175
Glu Thr Thr Ser Ala Phe Val Gly Ala Leu Pro Lys Thr Val Arg Glu
180 185 190
Phe Val Ile Ser Arg Thr Gly His Phe Tyr Ile Asn Gly Tyr Arg Tyr
195 200 205
Phe Thr Leu Gly Asn Val Glu Ala Val Asn Phe Asn Val Thr Thr Ala
210 215 220
Glu Thr Thr Asp Phe Cys Thr Val Ala Leu Ala Ser Tyr Ala Asp Val
225 230 235 240
Leu Val Asn Val Ser Gln Thr Ser Ile Ala Asn Ile Ile Tyr Cys Asn
245 250 255
Ser Val Ile Asn Arg Leu Arg Cys Asp Gln Leu Ser Phe Asp Val Pro
260 265 270
Asp Gly Phe Tyr Ser Thr Ser Pro Ile Gln Ser Val Glu Leu Pro Val
275 280 285
Ser Ile Val Ser Leu Pro Val Tyr His Lys His Thr Phe Ile Val Leu
290 295 300
Tyr Val Asp Phe Lys Pro Gln Ser Gly Gly Gly Lys Cys Phe Asn Cys
305 310 315 320
Tyr Pro Ala Gly Val Asn Ile Thr Leu Ala Asn Phe Asn Glu Thr Lys
325 330 335
Gly Pro Leu Cys Val Asp Thr Ser His Phe Thr Thr Lys Tyr Val Ala
340 345 350
Val Tyr Ala Asn Val Gly Arg Trp Ser Ala Ser Ile Asn Thr Gly Asn
355 360 365
Cys Pro Phe Ser Phe Gly Lys Val Asn Asn Phe Val Lys Phe Gly Ser
370 375 380
Val Cys Phe Ser Leu Lys Asp Ile Pro Gly Gly Cys Ala Met Pro Ile
385 390 395 400
Val Ala Asn Trp Ala Tyr Ser Lys Tyr Tyr Thr Ile Gly Ser Leu Tyr
405 410 415
Val Ser Trp Ser Asp Gly Asp Gly Ile Thr Gly Val Pro Gln Pro Val
420 425 430
Glu Gly Val Ser Ser Phe Met Asn Val Thr Leu Asp Lys Cys Thr Lys
435 440 445
Tyr Asn Ile Tyr Asp Val Ser Gly Val Gly Val Ile Arg Val Ser Asn
450 455 460
Asp Thr Phe Leu Asn Gly Ile Thr Tyr Thr Ser Thr Ser Gly Asn Leu
465 470 475 480
Leu Gly Phe Lys Asp Val Thr Lys Gly Thr Ile Tyr Ser Ile Thr Pro
485 490 495
Cys Asn Pro Pro Asp Gln Leu Val Val Tyr Gln Gln Ala Val Val Gly
500 505 510
Ala Met Leu Ser Glu Asn Phe Thr Ser Tyr Gly Phe Ser Asn Val Val
515 520 525
Glu Leu Pro Lys Phe Phe Tyr Ala Ser Asn Gly Thr Tyr Asn Cys Thr
530 535 540
Asp Ala Val Leu Thr Tyr Ser Ser Phe Gly Val Cys Ala Asp Gly Ser
545 550 555 560
Ile Ile Ala Val Gln Pro Arg Asn Val Ser Tyr Asp Ser Val Ser Ala
565 570 575
Ile Val Thr Ala Asn Leu Ser Ile Pro Ser Asn Trp Thr Thr Ser Val
580 585 590
Gln Val Glu Tyr Leu Gln Ile Thr Ser Thr Pro Ile Val Val Asp Cys
595 600 605
Ser Thr Tyr Val Cys Asn Gly Asn Val Arg Cys Val Glu Leu Leu Lys
610 615 620
Gln Tyr Thr Ser Ala Cys Lys Thr Ile Glu Asp Ala Leu Arg Asn Ser
625 630 635 640
Ala Arg Leu Glu Ser Ala Asp Val Ser Glu Met Leu Thr Phe Asp Lys
645 650 655
Lys Ala Phe Thr Leu Ala Asn Val Ser Ser Phe Gly Asp Tyr Asn Leu
660 665 670
Ser Ser Val Ile Pro Ser Leu Pro Thr Ser Gly Ser Arg Val Ala Gly
675 680 685
Arg Ser Ala Ile Glu Asp Ile Leu Phe Ser Lys Leu Val Thr Ser Gly
690 695 700
Leu Gly Thr Val Asp Ala Asp Tyr Lys Lys Cys Thr Lys Gly Leu Ser
705 710 715 720
Ile Ala Asp Leu Ala Cys Ala Gln Tyr Tyr Asn Gly Ile Met Val Leu
725 730 735
Pro Gly Val Ala Asp Ala Glu Arg Met Ala Met Tyr Thr Gly Ser Leu
740 745 750
Ile Gly Gly Ile Ala Leu Gly Gly Leu Thr Ser Ala Val Ser Ile Pro
755 760 765
Phe Ser Leu Ala Ile Gln Ala Arg Leu Asn Tyr Val Ala Leu Gln Thr
770 775 780
Asp Val Leu Gln Glu Asn Gln Lys Ile Leu Ala Ala Ser Phe Asn Lys
785 790 795 800
Ala Met Thr Asn Ile Val Asp Ala Phe Thr Gly Val Asn Asp Ala Ile
805 810 815
Thr Gln Thr Ser Gln Ala Leu Gln Thr Val Ala Thr Ala Leu Asn Lys
820 825 830
Ile Gln Asp Val Val Asn Gln Gln Gly Asn Ser Leu Asn His Leu Thr
835 840 845
Ser Gln Leu Arg Gln Asn Phe Gln Ala Ile Ser Ser Ser Ile Gln Ala
850 855 860
Ile Tyr Asp Arg Leu Asp Thr Ile Gln Ala Asp Gln Gln Val Asp Arg
865 870 875 880
Leu Ile Thr Gly Arg Leu Ala Ala Leu Asn Val Phe Val Ser His Thr
885 890 895
Leu Thr Lys Tyr Thr Glu Val Arg Ala Ser Arg Gln Leu Ala Gln Gln
900 905 910
Lys Val Asn Glu Cys Val Lys Ser Gln Ser Lys Arg Tyr Gly Phe Cys
915 920 925
Gly Asn Gly Thr His Ile Phe Ser Ile Val Asn Ala Ala Pro Glu Gly
930 935 940
Leu Val Phe Leu His Thr Val Leu Leu Pro Thr Gln Tyr Lys Asp Val
945 950 955 960
Glu Ala Trp Ser Gly Leu Cys Val Asp Gly Thr Asn Gly Tyr Val Leu
965 970 975
Arg Gln Pro Asn Leu Ala Leu Tyr Lys Glu Gly Asn Tyr Tyr Arg Ile
980 985 990
Thr Ser Arg Ile Met Phe Glu Pro Arg Ile Pro Thr Met Ala Asp Phe
995 1000 1005
Val Gln Ile Glu Asn Cys Asn Val Thr Phe Val Asn Ile Ser Arg
1010 1015 1020
Ser Glu Leu Gln Thr Ile Val Pro Glu Tyr Ile Asp Val Asn Lys
1025 1030 1035
Thr Leu Gln Glu Leu Ser Tyr Lys Leu Pro Asn Tyr Thr Val Pro
1040 1045 1050
Asp Leu Val Val Glu Gln Tyr Asn Gln Thr Ile Leu Asn Leu Thr
1055 1060 1065
Ser Glu Ile Ser Thr Leu Glu Asn Lys Ser Ala Glu Leu Asn Tyr
1070 1075 1080
Thr Val Gln Lys Leu Gln Thr Leu Ile Asp Asn Ile Asn Ser Thr
1085 1090 1095
Leu Val Asp Leu Lys Trp Leu Asn Arg Val Glu Thr Tyr Ile Lys
1100 1105 1110
Trp Pro Trp Trp Val Trp Leu Cys Ile Ser Val Val Leu Ile Phe
1115 1120 1125
Val Val Ser Met Leu Leu Leu Cys Cys Cys Ser Thr Gly Cys Cys
1130 1135 1140
Gly Phe Phe Ser Cys Phe Ala Ser Ser Ile Arg Gly Cys Cys Glu
1145 1150 1155
Ser Thr Lys Leu Pro Tyr Tyr Asp Val Glu Lys Ile His Ile Gln
1160 1165 1170
<210> 44
<211> 226
<212> PRT
<213> EMCR coronvirus
<220>
<221> MISC_FEATURE
<223> ORF 4ab
<400> 44
Met Pro Phe Gly Gly Leu Phe Gln Leu Thr Leu Glu Ser Thr Ile Asn
1 5 10 15
Lys Ser Val Ala Asn Leu Lys Leu Pro Pro His Asp Val Thr Val Leu
20 25 30
Arg Asp Asn Leu Lys Pro Val Thr Thr Leu Ser Thr Ile Thr Ala Tyr
35 40 45
Leu Leu Val Ser Leu Phe Val Thr Tyr Phe Ala Leu Phe Lys Pro Leu
50 55 60
Thr Ala Arg Gly Arg Val Ala Cys Phe Val Leu Lys Leu Leu Thr Leu
65 70 75 80
Ser Val Tyr Val Pro Leu Leu Val Leu Phe Gly Met Tyr Leu Asp Ser
85 90 95
Phe Ile Ile Phe Phe Leu Arg Cys Cys Phe Asp Ser Tyr Met Leu Ala
100 105 110
Ile Met Pro Ile Ser Asn Lys Asn Phe Ser Phe Val Leu Phe Asn Val
115 120 125
Thr Lys Leu Cys Phe Val Ser Gly Lys Cys Trp Tyr Leu Glu Gln Ser
130 135 140
Phe Tyr Glu Asn Arg Phe Ala Ala Ile Tyr Gly Gly Asp His Tyr Val
145 150 155 160
Val Leu Gly Gly Glu Thr Ile Thr Phe Val Ser Phe Asp Asp Leu Tyr
165 170 175
Val Ala Ile Arg Gly Ser Cys Glu Lys Asn Leu Gln Leu Met Arg Lys
180 185 190
Val Asp Leu Tyr Asn Gly Ala Val Ile Tyr Ile Phe Ala Glu Glu Pro
195 200 205
Val Val Gly Ile Val Tyr Ser Ser Gln Leu Tyr Glu Asp Val Pro Ser
210 215 220
Ile Asn
225
<210> 45
<211> 88
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> ORF 4b
<400> 45
Met Gln Gly Lys Cys Trp Phe Leu Glu Asn Lys Ala Leu Lys Pro Phe
1 5 10 15
Val Cys Phe Tyr Gly Gly Asp Gln Phe Leu Tyr Ile Gly Asp Arg Ile
20 25 30
Val Ser Tyr Phe Ser Thr Asn Asp Leu Tyr Val Ala Leu Arg Gly Arg
35 40 45
Ile Asp Lys Asp Leu Ser Leu Ser Arg Lys Val Glu Leu Tyr Asn Gly
50 55 60
Glu Cys Val Tyr Leu Phe Cys Glu His Pro Ala Val Gly Ile Val Asn
65 70 75 80
Thr Asp Phe Lys Leu Glu Ile His
85
<210> 46
<211> 133
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> ORF 4a
<400> 46
Met Ala Leu Gly Leu Phe Thr Leu Gln Leu Val Ser Ala Val Asn Gln
1 5 10 15
Ser Leu Ser Asn Ala Lys Val Ser Ala Glu Val Ser Arg Gln Val Ile
20 25 30
Gln Asp Val Lys Asp Gly Thr Val Thr Phe Asn Leu Leu Ala Tyr Thr
35 40 45
Leu Met Ser Leu Phe Val Val Tyr Phe Ala Leu Phe Lys Ala Arg Ser
50 55 60
His Arg Gly Arg Ala Ala Leu Ile Val Phe Lys Ile Leu Ile Leu Phe
65 70 75 80
Val Tyr Val Pro Leu Leu Tyr Trp Ser Gln Ala Tyr Ile Tyr Ala Thr
85 90 95
Leu Ile Ala Val Ile Leu Leu Gly Arg Phe Phe His Thr Ala Trp His
100 105 110
Cys Trp Leu Tyr Lys Thr Trp Asp Phe Ile Val Phe Asn Val Thr Thr
115 120 125
Leu Cys Tyr Ala Arg
130
<210> 47
<211> 82
<212> PRT
<213> transmissable gastroenteritis virus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 47
Met Thr Phe Pro Arg Ala Leu Thr Val Ile Asp Asp Asn Gly Met Val
1 5 10 15
Ile Asn Ile Ile Phe Trp Phe Leu Leu Ile Ile Ile Leu Ile Leu Leu
20 25 30
Ser Ile Ala Leu Leu Asn Ile Ile Lys Leu Cys Met Val Cys Cys Asn
35 40 45
Leu Gly Arg Thr Val Ile Ile Val Pro Ala Gln His Ala Tyr Asp Ala
50 55 60
Tyr Lys Asn Phe Met Arg Ile Lys Ala Tyr Asn Pro Asp Gly Ala Leu
65 70 75 80
Leu Ala
<210> 48
<211> 108
<212> PRT
<213> avian infectious bronchitis virus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 48
Met Asn Leu Leu Asn Lys Ser Leu Glu Glu Asn Gly Ser Phe Leu Thr
1 5 10 15
Ala Leu Tyr Ile Ile Val Gly Phe Leu Ala Leu Tyr Leu Leu Gly Arg
20 25 30
Ala Leu Gln Ala Phe Val Gln Ala Ala Asp Ala Cys Cys Leu Phe Trp
35 40 45
Tyr Thr Trp Val Val Ile Pro Gly Ala Lys Gly Thr Ala Phe Val Tyr
50 55 60
Lys Tyr Thr Tyr Gly Arg Lys Leu Asn Asn Pro Glu Leu Glu Ala Val
65 70 75 80
Ile Val Asn Glu Phe Pro Lys Asn Gly Trp Asn Asn Lys Asn Pro Ala
85 90 95
Asn Phe Gln Asp Ala Gln Arg Asp Lys Leu Tyr Ser
100 105
<210> 49
<211> 84
<212> PRT
<213> bovine coronavirus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 49
Met Phe Met Ala Asp Ala Tyr Phe Ala Asp Thr Val Trp Tyr Val Gly
1 5 10 15
Gln Ile Ile Phe Ile Val Ala Ile Cys Leu Leu Val Ile Ile Val Val
20 25 30
Val Ala Phe Leu Ala Thr Phe Lys Leu Cys Ile Gln Leu Cys Gly Met
35 40 45
Cys Asn Thr Leu Val Leu Ser Pro Ser Ile Tyr Val Phe Asn Arg Gly
50 55 60
Arg Gln Phe Tyr Glu Phe Tyr Asn Asp Val Lys Pro Pro Val Leu Asp
65 70 75 80
Val Asp Asp Val
<210> 50
<211> 82
<212> PRT
<213> canine coronavirus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 50
Met Thr Phe Pro Arg Ala Leu Thr Val Ile Asp Asp Asn Gly Met Val
1 5 10 15
Ile Ser Ile Ile Phe Trp Phe Leu Leu Ile Ile Ile Leu Ile Leu Phe
20 25 30
Ser Ile Ala Leu Leu Asn Ile Ile Lys Leu Cys Met Val Cys Cys Asn
35 40 45
Leu Gly Arg Thr Val Ile Ile Val Pro Ala Arg His Ala Tyr Asp Ala
50 55 60
Tyr Lys Asn Phe Met Gln Ile Arg Ala Tyr Asn Pro Asp Glu Ala Leu
65 70 75 80
Leu Val
<210> 51
<211> 77
<212> PRT
<213> EMCR coronavirus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 51
Met Phe Leu Arg Leu Ile Asp Asp Asn Gly Ile Val Leu Asn Ser Ile
1 5 10 15
Leu Trp Leu Leu Val Met Ile Phe Phe Phe Val Leu Ala Met Thr Phe
20 25 30
Ile Lys Leu Ile Gln Leu Cys Phe Thr Cys His Tyr Phe Phe Ser Arg
35 40 45
Thr Leu Tyr Gln Pro Val Tyr Lys Ile Phe Leu Ala Tyr Gln Asp Tyr
50 55 60
Met Gln Ile Ala Pro Val Pro Ala Glu Val Leu Asn Val
65 70 75
<210> 52
<211> 82
<212> PRT
<213> feline coronavirus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 52
Met Thr Phe Pro Arg Ala Phe Thr Ile Ile Asp Asp His Gly Met Val
1 5 10 15
Val Ser Val Phe Phe Trp Leu Leu Leu Ile Ile Ile Leu Ile Leu Phe
20 25 30
Ser Ile Ala Leu Leu Asn Val Ile Lys Leu Cys Met Val Cys Cys Asn
35 40 45
Leu Gly Lys Thr Ile Ile Val Leu Pro Ala Arg His Ala Tyr Asp Ala
50 55 60
Tyr Lys Thr Phe Met Gln Thr Lys Ala Tyr Asn Pro Asp Glu Ala Phe
65 70 75 80
Leu Val
<210> 53
<211> 88
<212> PRT
<213> murine hepatitis virus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 53
Met Phe Asn Leu Phe Leu Thr Asp Thr Val Trp Tyr Val Gly Gln Ile
1 5 10 15
Ile Phe Ile Val Ala Val Cys Leu Met Val Thr Ile Ile Val Val Ala
20 25 30
Phe Leu Ala Ser Ile Lys Leu Cys Ile Gln Leu Cys Gly Leu Cys Asn
35 40 45
Thr Leu Leu Leu Ser Pro Ser Ile Cys Val Tyr Asn Arg Ser Lys Gln
50 55 60
Leu Tyr Lys Tyr Tyr Asn Glu Glu Val Arg Pro Pro Pro Leu Glu Val
65 70 75 80
Asp Asp Ile Ile Ile Gln Thr Leu
85
<210> 54
<211> 84
<212> PRT
<213> human coronavirus OC43
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 54
Met Phe Met Ala Asp Ala Tyr Leu Ala Asp Thr Val Trp Tyr Val Gly
1 5 10 15
Gln Ile Ile Phe Ile Val Ala Ile Cys Leu Leu Val Thr Ile Val Val
20 25 30
Val Ala Phe Leu Ala Thr Phe Lys Leu Cys Ile Gln Leu Cys Gly Met
35 40 45
Cys Asn Thr Leu Val Leu Ser Pro Ser Ile Tyr Val Phe Asn Arg Gly
50 55 60
Arg Gln Phe Tyr Glu Phe Tyr Asn Asp Val Lys Pro Pro Val Leu Asp
65 70 75 80
Val Asp Asp Val
<210> 55
<211> 76
<212> PRT
<213> porcine epidemic diarrhea virus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 55
Met Leu Gln Leu Val Asn Asp Asn Gly Leu Val Val Asn Val Ile Leu
1 5 10 15
Trp Leu Phe Val Leu Phe Phe Leu Leu Ile Ile Ser Ile Thr Phe Val
20 25 30
Gln Leu Val Asn Leu Cys Phe Thr Cys His Arg Leu Cys Asn Ser Ala
35 40 45
Val Tyr Thr Pro Ile Gly Arg Leu Tyr Arg Val Tyr Lys Ser Tyr Met
50 55 60
Arg Ile Asp Pro Leu Pro Ser Thr Val Ile Asp Val
65 70 75
<210> 56
<211> 84
<212> PRT
<213> porcine haemagglutinating encephalomyelitis virus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 56
Met Phe Met Ala Asp Ala Tyr Leu Ala Asp Thr Val Trp Tyr Val Gly
1 5 10 15
Gln Ile Ile Phe Ile Val Ala Ile Cys Leu Leu Val Ile Ile Val Val
20 25 30
Val Ala Phe Leu Ala Thr Phe Lys Leu Cys Ile Gln Leu Cys Gly Met
35 40 45
Cys Asn Thr Leu Val Leu Ser Pro Ser Ile Tyr Val Phe Asn Arg Gly
50 55 60
Arg Gln Phe Tyr Glu Phe Tyr Asn Asp Val Lys Pro Pro Val Leu Asp
65 70 75 80
Val Asp Asp Val
<210> 57
<211> 82
<212> PRT
<213> porcine respiratory coronavirus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 57
Met Thr Phe Pro Arg Ala Leu Thr Val Ile Asp Asp Asn Gly Met Val
1 5 10 15
Ile Ser Ile Ile Phe Trp Phe Leu Leu Ile Ile Ile Leu Ile Leu Leu
20 25 30
Ser Ile Ala Leu Leu Asn Ile Ile Lys Leu Cys Met Val Cys Cys Asn
35 40 45
Leu Gly Arg Thr Val Ile Ile Val Pro Val Gln His Ala Tyr Asp Ala
50 55 60
Tyr Lys Asn Phe Met Arg Ile Lys Ala Tyr Asn Pro Asp Gly Ala Leu
65 70 75 80
Leu Val
<210> 58
<211> 88
<212> PRT
<213> rat coronavirus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 58
Met Phe Asn Leu Phe Leu Ile Asp Thr Val Trp Tyr Val Gly Gln Ile
1 5 10 15
Ile Phe Ile Val Ala Val Cys Leu Met Val Thr Ile Ile Val Val Ala
20 25 30
Phe Leu Ala Ser Ile Lys Leu Cys Ile Gln Leu Cys Gly Leu Cys Asn
35 40 45
Thr Leu Leu Leu Ser Pro Ser Ile Tyr Val Tyr Asn Arg Ser Lys Gln
50 55 60
Leu Tyr Lys Tyr Tyr Asn Glu Glu Val Arg Pro Pro Pro Leu Glu Val
65 70 75 80
Asp Asp Ile Ile Ile Gln Thr Leu
85
<210> 59
<211> 76
<212> PRT
<213> human SARS virus
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 59
Met Tyr Ser Phe Val Ser Glu Glu Thr Gly Thr Leu Ile Val Asn Ser
1 5 10 15
Val Leu Leu Phe Leu Ala Phe Val Val Phe Leu Leu Val Thr Leu Ala
20 25 30
Ile Leu Thr Ala Leu Arg Leu Cys Ala Tyr Cys Cys Asn Ile Val Asn
35 40 45
Val Ser Leu Val Lys Pro Thr Val Tyr Val Tyr Ser Arg Val Lys Asn
50 55 60
Leu Asn Ser Ser Glu Gly Val Pro Asp Leu Leu Val
65 70 75
<210> 60
<211> 77
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> ORF E
<400> 60
Met Phe Leu Lys Leu Val Asp Asp His Ala Leu Val Val Asn Val Leu
1 5 10 15
Leu Trp Cys Val Val Leu Ile Val Ile Leu Leu Val Cys Ile Thr Ile
20 25 30
Ile Lys Leu Ile Lys Leu Cys Phe Thr Cys His Met Phe Cys Asn Arg
35 40 45
Thr Val Tyr Gly Pro Ile Lys Asn Val Tyr His Ile Tyr Gln Ser Tyr
50 55 60
Met His Ile Asp Pro Phe Pro Lys Arg Val Ile Asp Phe
65 70 75
<210> 61
<211> 262
<212> PRT
<213> transmissible gastroenteritis virus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 61
Met Lys Ile Leu Leu Ile Leu Ala Cys Val Ile Ala Cys Ala Cys Gly
1 5 10 15
Glu Arg Tyr Cys Ala Met Lys Ser Asp Thr Asp Leu Ser Cys Arg Asn
20 25 30
Ser Thr Ala Ser Asp Cys Glu Ser Cys Phe Asn Gly Gly Asp Leu Ile
35 40 45
Trp His Leu Ala Asn Trp Asn Phe Ser Trp Ser Ile Ile Leu Ile Val
50 55 60
Phe Ile Thr Val Leu Gln Tyr Gly Arg Pro Gln Phe Ser Trp Phe Val
65 70 75 80
Tyr Gly Ile Lys Met Leu Ile Met Trp Leu Leu Trp Pro Val Val Leu
85 90 95
Ala Leu Thr Ile Phe Asn Ala Tyr Ser Glu Tyr Gln Val Ser Arg Tyr
100 105 110
Val Met Phe Gly Phe Ser Ile Ala Gly Ala Ile Val Thr Phe Val Leu
115 120 125
Trp Ile Met Tyr Phe Val Arg Ser Ile Gln Leu Tyr Arg Arg Thr Lys
130 135 140
Ser Trp Trp Ser Phe Asn Pro Glu Thr Lys Ala Ile Leu Cys Val Ser
145 150 155 160
Ala Leu Gly Arg Ser Tyr Val Leu Pro Leu Glu Gly Val Pro Thr Gly
165 170 175
Val Thr Leu Thr Leu Leu Ser Gly Asn Leu Tyr Ala Glu Gly Phe Lys
180 185 190
Ile Ala Gly Gly Met Asn Ile Asp Asn Leu Pro Lys Tyr Val Met Val
195 200 205
Ala Leu Pro Ser Arg Thr Ile Val Tyr Thr Leu Val Gly Lys Lys Leu
210 215 220
Lys Ala Ser Ser Ala Thr Gly Trp Ala Tyr Tyr Val Lys Ser Lys Ala
225 230 235 240
Gly Asp Tyr Ser Thr Glu Ala Arg Thr Asp Asn Leu Ser Glu Gln Glu
245 250 255
Lys Leu Leu His Met Val
260
<210> 62
<211> 225
<212> PRT
<213> avian infectious bronchitis virus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 62
Met Pro Asn Glu Thr Asn Cys Thr Leu Asp Phe Glu Gln Ser Val Gln
1 5 10 15
Leu Phe Lys Glu Tyr Asn Leu Phe Ile Thr Ala Phe Leu Leu Phe Leu
20 25 30
Thr Ile Ile Leu Gln Tyr Gly Tyr Ala Thr Arg Ser Lys Val Ile Tyr
35 40 45
Thr Leu Lys Met Ile Val Leu Trp Cys Phe Trp Pro Leu Asn Ile Ala
50 55 60
Val Gly Val Ile Ser Cys Thr Tyr Pro Pro Asn Thr Gly Gly Leu Val
65 70 75 80
Ala Ala Ile Ile Leu Thr Val Phe Ala Cys Leu Ser Phe Val Gly Tyr
85 90 95
Trp Ile Gln Ser Ile Arg Leu Phe Lys Arg Cys Arg Ser Trp Trp Ser
100 105 110
Phe Asn Pro Glu Ser Asn Ala Val Gly Ser Ile Leu Leu Thr Asn Gly
115 120 125
Gln Gln Cys Asn Phe Ala Ile Glu Ser Val Pro Met Val Leu Ser Pro
130 135 140
Ile Ile Lys Asn Gly Val Leu Tyr Cys Glu Gly Gln Trp Leu Ala Lys
145 150 155 160
Cys Glu Pro Asp His Leu Pro Lys Asp Ile Phe Val Cys Thr Pro Asp
165 170 175
Arg Arg Asn Ile Tyr Arg Met Val Gln Lys Tyr Thr Gly Asp Gln Ser
180 185 190
Gly Asn Lys Lys Arg Phe Ala Thr Phe Val Tyr Ala Lys Gln Ser Val
195 200 205
Asp Thr Gly Glu Leu Glu Ser Val Ala Thr Gly Gly Ser Ser Leu Tyr
210 215 220
Thr
225
<210> 63
<211> 230
<212> PRT
<213> bovina coronavirus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 63
Met Ser Ser Val Thr Thr Pro Ala Pro Val Tyr Thr Trp Thr Ala Asp
1 5 10 15
Glu Ala Ile Lys Phe Leu Lys Glu Trp Asn Phe Ser Leu Gly Ile Ile
20 25 30
Leu Leu Phe Ile Thr Ile Ile Leu Gln Phe Gly Tyr Thr Ser Arg Ser
35 40 45
Met Phe Val Tyr Val Ile Lys Met Ile Ile Leu Trp Leu Met Trp Pro
50 55 60
Leu Thr Ile Ile Leu Thr Ile Phe Asn Cys Val Tyr Ala Leu Asn Asn
65 70 75 80
Val Tyr Leu Gly Phe Ser Ile Val Phe Thr Ile Val Ala Ile Ile Met
85 90 95
Trp Ile Val Tyr Phe Val Asn Ser Ile Arg Leu Phe Ile Arg Thr Gly
100 105 110
Ser Trp Trp Ser Phe Asn Pro Glu Thr Asn Asn Leu Met Cys Ile Asp
115 120 125
Met Lys Gly Arg Met Tyr Val Arg Pro Ile Ile Glu Asp Tyr His Thr
130 135 140
Leu Thr Val Thr Ile Ile Arg Gly His Leu Tyr Met Gln Gly Ile Lys
145 150 155 160
Leu Gly Thr Gly Tyr Ser Leu Ser Asp Leu Pro Ala Tyr Val Thr Val
165 170 175
Ala Lys Val Ser His Leu Leu Thr Tyr Lys Arg Gly Phe Leu Asp Lys
180 185 190
Ile Gly Asp Thr Ser Gly Phe Ala Val Tyr Val Lys Ser Lys Val Gly
195 200 205
Asn Tyr Arg Leu Pro Ser Thr Gln Lys Gly Ser Gly Met Asp Thr Ala
210 215 220
Leu Leu Arg Asn Asn Ile
225 230
<210> 64
<211> 262
<212> PRT
<213> canine coronavirus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 64
Met Lys Lys Ile Leu Phe Leu Leu Ala Cys Ala Ile Ala Cys Val Tyr
1 5 10 15
Gly Glu Arg Tyr Cys Ala Met Thr Glu Ser Ser Thr Ser Cys Arg Asn
20 25 30
Ser Thr Ala Gly Asn Cys Ala Ser Cys Phe Glu Thr Gly Asp Leu Ile
35 40 45
Trp His Leu Ala Asn Trp Asn Phe Ser Trp Ser Val Ile Leu Ile Ile
50 55 60
Phe Ile Thr Val Leu Gln Tyr Gly Arg Pro Gln Phe Ser Trp Phe Val
65 70 75 80
Cys Gly Ile Lys Met Leu Ile Met Trp Leu Leu Trp Pro Ile Val Leu
85 90 95
Ala Leu Thr Ile Phe Asn Ala Tyr Leu Glu Tyr Arg Val Ser Arg Tyr
100 105 110
Val Met Phe Gly Phe Ser Val Ala Gly Ala Thr Val Thr Phe Ile Leu
115 120 125
Trp Ile Met Tyr Phe Val Arg Ser Ile Gln Leu Tyr Arg Arg Thr Lys
130 135 140
Ser Trp Trp Ser Phe Asn Pro Glu Thr Ser Ala Ile Leu Cys Val Ser
145 150 155 160
Ala Leu Gly Arg Ser Tyr Val Leu Pro Leu Glu Gly Val Pro Thr Gly
165 170 175
Val Thr Leu Thr Leu Leu Ser Gly Asn Leu Cys Ala Glu Gly Phe Lys
180 185 190
Ile Ala Gly Gly Met Asn Ile Asp Asn Leu Pro Lys Tyr Val Met Val
195 200 205
Ala Leu Pro Val Arg Thr Ile Val Tyr Thr Leu Val Gly Lys Lys Leu
210 215 220
Lys Ala Ser Ser Ala Thr Gly Trp Ala Tyr Tyr Val Lys Ser Lys Ala
225 230 235 240
Gly Asp Tyr Ser Thr Asp Ala Arg Thr Asp Asn Leu Ser Glu His Glu
245 250 255
Lys Leu Leu His Met Val
260
<210> 65
<211> 226
<212> PRT
<213> EMCR coronavirus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 65
Met Ser Asn Ser Ser Val Pro Leu Ser Glu Val Tyr Val His Leu Arg
1 5 10 15
Asn Trp Asn Phe Ser Trp Asn Leu Ile Leu Thr Val Phe Ile Val Val
20 25 30
Leu Gln Tyr Gly His Tyr Lys Tyr Ser Arg Leu Leu Tyr Gly Leu Lys
35 40 45
Met Ser Val Leu Trp Cys Leu Trp Pro Leu Val Leu Ala Leu Ser Ile
50 55 60
Phe Asp Cys Phe Val Asn Phe Asn Val Asp Trp Val Phe Phe Gly Phe
65 70 75 80
Ser Ile Leu Met Ser Ile Ile Thr Leu Cys Leu Trp Val Met Tyr Phe
85 90 95
Val Asn Ser Phe Arg Leu Trp Arg Arg Val Lys Thr Phe Trp Ala Phe
100 105 110
Asn Pro Glu Thr Asn Ala Ile Ile Ser Leu Gln Val Tyr Gly His Asn
115 120 125
Tyr Tyr Leu Pro Val Met Ala Ala Pro Thr Gly Val Thr Leu Thr Leu
130 135 140
Leu Ser Gly Val Leu Leu Val Asp Gly His Lys Ile Ala Thr Arg Val
145 150 155 160
Gln Val Gly Gln Leu Pro Lys Tyr Val Ile Val Ala Thr Pro Ser Thr
165 170 175
Thr Ile Val Cys Asp Arg Val Gly Arg Ser Val Asn Glu Thr Ser Gln
180 185 190
Thr Gly Trp Ala Phe Tyr Val Arg Ala Lys His Gly Asp Phe Ser Gly
195 200 205
Val Ala Ser Gln Glu Gly Val Leu Ser Glu Arg Glu Lys Leu Leu His
210 215 220
Leu Ile
225
<210> 66
<211> 289
<212> PRT
<213> feline coronavirus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 66
Met His Met Met Pro Ile Arg Pro Leu Cys Lys Pro Arg His Ile Ile
1 5 10 15
Pro Thr Lys His Phe Trp Phe Glu Leu Asn Lys Met Lys Tyr Ile Leu
20 25 30
Leu Ile Leu Ala Cys Ile Ile Ala Cys Val Tyr Gly Glu Arg Tyr Cys
35 40 45
Ala Met Gln Asp Ser Gly Leu Gln Cys Ile Asn Gly Thr Asn Ser Arg
50 55 60
Cys Gln Thr Cys Phe Glu Arg Gly Asp Leu Ile Trp His Leu Ala Asn
65 70 75 80
Trp Asn Phe Ser Trp Ser Val Ile Leu Ile Val Phe Ile Thr Val Leu
85 90 95
Gln Tyr Gly Arg Pro Gln Phe Ser Trp Leu Val Tyr Gly Ile Lys Met
100 105 110
Leu Ile Met Trp Leu Leu Trp Pro Ile Val Leu Ala Leu Thr Ile Phe
115 120 125
Asn Ala Tyr Ser Glu Tyr Gln Val Ser Arg Tyr Val Met Phe Gly Phe
130 135 140
Ser Val Ala Gly Ala Val Val Thr Phe Ala Leu Trp Met Met Tyr Phe
145 150 155 160
Val Arg Ser Val Gln Leu Tyr Arg Arg Thr Lys Ser Trp Trp Ser Phe
165 170 175
Asn Pro Glu Thr Asn Ala Ile Leu Cys Val Asn Ala Leu Gly Arg Ser
180 185 190
Tyr Val Leu Pro Leu Asp Gly Thr Pro Thr Gly Val Thr Leu Thr Leu
195 200 205
Leu Ser Gly Asn Leu Tyr Ala Glu Gly Phe Lys Met Ala Gly Gly Leu
210 215 220
Thr Ile Glu His Leu Pro Lys Tyr Val Met Ile Ala Thr Pro Ser Arg
225 230 235 240
Thr Ile Val Tyr Thr Leu Val Gly Lys Gln Leu Lys Ala Thr Thr Ala
245 250 255
Thr Gly Trp Ala Tyr Tyr Val Lys Ser Lys Ala Gly Asp Tyr Ser Thr
260 265 270
Glu Ala Arg Thr Asp Asn Leu Ser Glu His Glu Lys Leu Leu His Met
275 280 285
Val
<210> 67
<211> 228
<212> PRT
<213> murine hepatitis virus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 67
Met Thr Ser Thr Thr Gln Ala Pro Gln Pro Val Tyr Gln Trp Thr Ala
1 5 10 15
Asp Glu Ala Ile Arg Phe Leu Lys Glu Trp Asn Phe Ser Leu Gly Ile
20 25 30
Ile Leu Leu Phe Val Thr Ile Ile Leu Gln Phe Gly Tyr Thr Ser Arg
35 40 45
Ser Met Phe Val Tyr Val Val Lys Met Ile Leu Leu Trp Leu Met Trp
50 55 60
Pro Leu Thr Ile Val Leu Cys Ile Phe Asn Cys Val Tyr Ala Leu Asn
65 70 75 80
Asn Val Tyr Leu Gly Phe Ser Ile Val Phe Thr Ile Val Ser Ile Ile
85 90 95
Met Trp Ile Met Tyr Phe Val Asn Ser Ile Arg Leu Phe Ile Arg Thr
100 105 110
Gly Ser Trp Trp Ser Phe Asn Pro Glu Thr Asn Asn Leu Met Cys Ile
115 120 125
Asp Met Lys Gly Thr Val Tyr Val Arg Pro Ile Ile Glu Asp Tyr His
130 135 140
Thr Leu Thr Ala Thr Ile Ile Arg Gly His Leu Tyr Met Gln Gly Val
145 150 155 160
Lys Leu Gly Thr Gly Phe Ser Leu Ser Asp Leu Pro Ala Tyr Val Thr
165 170 175
Val Ala Lys Val Ser His Leu Cys Thr Tyr Lys Arg Ala Phe Leu Asp
180 185 190
Lys Val Asp Gly Val Ser Gly Phe Ala Val Tyr Val Lys Ser Lys Val
195 200 205
Gly Asn Tyr Arg Leu Pro Ser Asn Lys Pro Ser Gly Met Asp Thr Ala
210 215 220
Leu Leu Arg Ile
225
<210> 68
<211> 230
<212> PRT
<213> human coronavirus OC43
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 68
Met Ser Ser Lys Thr Thr Pro Ala Pro Val Tyr Ile Trp Thr Ala Asp
1 5 10 15
Glu Ala Ile Lys Phe Leu Lys Glu Trp Asn Phe Ser Leu Gly Ile Ile
20 25 30
Leu Leu Phe Ile Thr Ile Ile Leu Gln Phe Gly Tyr Thr Ser Arg Ser
35 40 45
Met Phe Val Tyr Val Ile Lys Met Ile Ile Leu Trp Leu Met Trp Pro
50 55 60
Leu Thr Ile Ile Leu Thr Ile Phe Asn Cys Val Tyr Ala Leu Asn Asn
65 70 75 80
Val Tyr Leu Gly Leu Ser Ile Val Phe Thr Ile Val Ala Ile Ile Met
85 90 95
Trp Ile Val Tyr Phe Val Asn Ser Ile Arg Leu Phe Ile Arg Thr Gly
100 105 110
Ser Phe Trp Ser Phe Asn Pro Glu Thr Asn Asn Leu Met Cys Ile Asp
115 120 125
Met Lys Gly Thr Met Tyr Val Arg Pro Ile Ile Glu Asp Tyr His Thr
130 135 140
Leu Thr Val Thr Ile Ile Arg Gly His Leu Tyr Ile Gln Gly Ile Lys
145 150 155 160
Leu Gly Thr Gly Tyr Ser Leu Ala Asp Leu Pro Ala Tyr Met Thr Val
165 170 175
Ala Lys Val Thr His Leu Cys Thr Tyr Lys Arg Gly Phe Leu Asp Arg
180 185 190
Ile Ser Asp Thr Ser Gly Phe Ala Val Tyr Val Lys Ser Lys Val Gly
195 200 205
Asn Tyr Arg Leu Pro Ser Thr Gln Lys Gly Ser Gly Met Asp Thr Ala
210 215 220
Leu Leu Arg Asn Asn Ile
225 230
<210> 69
<211> 226
<212> PRT
<213> porcine epidemic diarrhea virus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 69
Met Ser Asn Gly Ser Ile Pro Val Asp Glu Val Ile Glu His Leu Arg
1 5 10 15
Asn Trp Asn Phe Thr Trp Asn Ile Ile Leu Thr Ile Leu Leu Val Val
20 25 30
Leu Gln Tyr Gly His Tyr Lys Tyr Ser Val Phe Leu Tyr Gly Val Lys
35 40 45
Met Ala Ile Leu Trp Ile Leu Trp Pro Leu Val Leu Ala Leu Ser Leu
50 55 60
Phe Asp Ala Trp Ala Ser Phe Gln Val Asn Trp Val Phe Phe Ala Phe
65 70 75 80
Ser Ile Leu Met Ala Cys Ile Thr Leu Met Leu Trp Ile Met Tyr Phe
85 90 95
Val Asn Ser Ile Arg Leu Trp Arg Arg Thr His Ser Trp Trp Ser Phe
100 105 110
Asn Pro Glu Thr Asp Ala Leu Leu Thr Thr Ser Val Met Gly Arg Gln
115 120 125
Val Cys Ile Pro Val Leu Gly Ala Pro Thr Gly Val Thr Leu Thr Leu
130 135 140
Leu Ser Gly Thr Leu Leu Val Glu Gly Tyr Lys Val Ala Thr Gly Val
145 150 155 160
Gln Val Ser Gln Leu Pro Asn Phe Val Thr Val Ala Lys Ala Thr Thr
165 170 175
Thr Ile Val Tyr Gly Arg Val Gly Arg Ser Val Asn Ala Ser Ser Gly
180 185 190
Thr Gly Trp Ala Phe Tyr Val Arg Ser Lys His Gly Asp Tyr Ser Ala
195 200 205
Val Ser Asn Pro Ser Ala Val Leu Thr Asp Ser Glu Lys Val Leu His
210 215 220
Leu Val
225
<210> 70
<211> 230
<212> PRT
<213> porcine haemagglutinating encephalomyelitis virus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 70
Met Ser Ser Pro Thr Thr Pro Val Pro Val Ile Ser Trp Thr Ala Asp
1 5 10 15
Glu Ala Ile Lys Phe Leu Lys Glu Trp Asn Phe Ser Leu Gly Ile Ile
20 25 30
Val Leu Phe Ile Thr Ile Ile Leu Gln Phe Gly Tyr Thr Ser Arg Ser
35 40 45
Met Phe Val Tyr Val Ile Lys Met Val Ile Leu Trp Leu Met Trp Pro
50 55 60
Leu Thr Ile Ile Leu Thr Ile Phe Asn Cys Val Tyr Ala Leu Asn Asn
65 70 75 80
Val Tyr Leu Gly Phe Ser Ile Val Phe Thr Ile Val Ala Ile Ile Met
85 90 95
Trp Val Val Tyr Phe Val Asn Ser Ile Arg Leu Phe Ile Arg Thr Gly
100 105 110
Ser Trp Trp Ser Phe Asn Pro Glu Thr Asn Asn Leu Met Cys Ile Asp
115 120 125
Met Lys Gly Arg Met Tyr Val Arg Pro Ile Ile Glu Asp Tyr His Thr
130 135 140
Leu Thr Ala Thr Ile Ile Arg Gly His Leu Tyr Ile Gln Gly Ile Lys
145 150 155 160
Leu Gly Thr Gly Tyr Ser Leu Ser Asp Leu Pro Ala Tyr Val Thr Val
165 170 175
Ala Lys Val Thr His Leu Cys Thr Tyr Lys Arg Gly Phe Leu Asp Arg
180 185 190
Ile Gly Asp Thr Ser Gly Phe Ala Val Tyr Val Lys Ser Lys Val Gly
195 200 205
Asn Tyr Arg Leu Pro Ser Thr His Lys Gly Ser Gly Met Asp Thr Ala
210 215 220
Leu Leu Arg Asn Asn Ile
225 230
<210> 71
<211> 262
<212> PRT
<213> porcine respiratory coronavirus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 71
Met Lys Ile Leu Leu Ile Leu Ala Cys Ala Ile Ala Cys Thr Cys Gly
1 5 10 15
Glu Arg Tyr Cys Ala Met Lys Asp Asp Thr Gly Leu Ser Cys Arg Asn
20 25 30
Gly Thr Ala Ser Asp Cys Glu Ser Cys Phe Asn Arg Gly Asp Leu Ile
35 40 45
Trp Leu Leu Ala Asn Trp Asn Phe Ser Trp Ser Ile Ile Leu Ile Ile
50 55 60
Phe Ile Thr Val Leu Gln Tyr Gly Arg Pro Gln Phe Ser Trp Phe Val
65 70 75 80
Tyr Gly Ile Lys Met Leu Ile Met Trp Leu Leu Trp Pro Ile Val Leu
85 90 95
Ala Leu Thr Ile Phe Asn Ala Tyr Ser Glu Tyr Gln Val Ser Arg Tyr
100 105 110
Val Met Phe Gly Phe Ser Ile Ala Gly Ala Ile Val Thr Phe Val Leu
115 120 125
Trp Ile Met Tyr Phe Val Arg Ser Ile Gln Leu Tyr Arg Arg Thr Lys
130 135 140
Ser Trp Trp Ser Phe Asn Pro Glu Thr Asn Ala Ile Leu Cys Val Ser
145 150 155 160
Ala Leu Gly Arg Ser Tyr Val Leu Pro Leu Glu Gly Val Pro Thr Gly
165 170 175
Val Thr Leu Thr Leu Leu Ser Gly Asn Leu Tyr Ala Glu Gly Phe Lys
180 185 190
Ile Ala Gly Gly Met Thr Ile Asp Asn Leu Pro Lys Tyr Val Met Val
195 200 205
Ala Leu Pro Ser Arg Thr Ile Val Tyr Thr Leu Val Gly Lys Lys Leu
210 215 220
Lys Ala Ser Ser Ala Thr Gly Trp Ala Tyr Tyr Val Lys Ser Lys Ala
225 230 235 240
Gly Asp Tyr Ser Thr Glu Ala Arg Thr Asp Asn Leu Ser Glu Gln Glu
245 250 255
Lys Leu Leu His Met Val
260
<210> 72
<211> 228
<212> PRT
<213> rat coronavirus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 72
Met Ser Ser Thr Thr Pro Ala Pro Gln Thr Val Tyr Gln Trp Thr Ala
1 5 10 15
Asp Val Ala Val Arg Phe Leu Lys Glu Trp Asn Phe Leu Leu Gly Ile
20 25 30
Ile Leu Leu Phe Ile Thr Ile Ile Leu Gln Phe Gly Tyr Thr Ser Arg
35 40 45
Ser Met Phe Ile Tyr Val Val Lys Met Ile Ile Leu Trp Leu Met Trp
50 55 60
Pro Leu Thr Ile Val Leu Cys Ile Phe Asn Cys Val Tyr Ala Leu Asn
65 70 75 80
Asn Val Tyr Leu Gly Phe Ser Ile Val Phe Thr Ile Val Ser Ile Val
85 90 95
Met Trp Ile Met Tyr Phe Val Asn Ser Ile Arg Leu Phe Ile Arg Thr
100 105 110
Gly Ser Trp Trp Ser Phe Asn Pro Glu Thr Asn Asn Leu Met Cys Ile
115 120 125
Asp Val Lys Gly Thr Val Tyr Val Arg Pro Ile Ile Glu Asp Tyr His
130 135 140
Thr Leu Thr Ala Thr Asn Val Arg Gly His Leu Tyr Met Gln Gly Val
145 150 155 160
Lys Leu Gly Thr Gly Phe Ser Leu Ser Asp Leu Pro Ala Tyr Val Thr
165 170 175
Val Ala Lys Val Ser His Leu Cys Thr Tyr Lys Arg Ala Phe Leu Asp
180 185 190
Lys Val Asp Gly Val Ser Gly Phe Ala Val Tyr Val Lys Ser Lys Val
195 200 205
Gly Asn Tyr Arg Leu Pro Ser Asn Lys Pro Ser Gly Ala Asp Thr Ala
210 215 220
Leu Leu Arg Ile
225
<210> 73
<211> 221
<212> PRT
<213> human SARS virus
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 73
Met Ala Asp Asn Gly Thr Ile Thr Val Glu Glu Leu Lys Gln Leu Leu
1 5 10 15
Glu Gln Trp Asn Leu Val Ile Gly Phe Leu Phe Leu Ala Trp Ile Met
20 25 30
Leu Leu Gln Phe Ala Tyr Ser Asn Arg Asn Arg Phe Leu Tyr Ile Ile
35 40 45
Lys Leu Val Phe Leu Trp Leu Leu Trp Pro Val Thr Leu Ala Cys Phe
50 55 60
Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Val Thr Gly Gly Ile Ala
65 70 75 80
Ile Ala Met Ala Cys Ile Val Gly Leu Met Trp Leu Ser Tyr Phe Val
85 90 95
Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe Asn
100 105 110
Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu Arg Gly Thr Ile Val
115 120 125
Thr Arg Pro Leu Met Glu Ser Glu Leu Val Ile Gly Ala Val Ile Ile
130 135 140
Arg Gly His Leu Arg Met Ala Gly His Pro Leu Gly Arg Cys Asp Ile
145 150 155 160
Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu Ser
165 170 175
Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Gly Thr Asp Ser Gly Phe
180 185 190
Ala Ala Tyr Asn Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr Asp
195 200 205
His Ala Gly Ser Asn Asp Asn Ile Ala Leu Leu Val Gln
210 215 220
<210> 74
<211> 225
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> ORF M
<400> 74
Met Ser Asn Asp Asn Cys Thr Gly Asp Ile Val Thr His Leu Lys Asn
1 5 10 15
Trp Asn Phe Gly Trp Asn Val Ile Leu Thr Ile Phe Ile Val Ile Leu
20 25 30
Gln Phe Gly His Tyr Lys Tyr Ser Arg Leu Phe Tyr Gly Leu Lys Met
35 40 45
Leu Val Leu Trp Leu Leu Trp Pro Leu Val Leu Ala Leu Ser Ile Phe
50 55 60
Asp Thr Trp Ala Asn Trp Asp Ser Asn Trp Ala Phe Val Ala Phe Ser
65 70 75 80
Phe Phe Met Ala Val Ser Thr Leu Val Met Trp Val Met Tyr Phe Ala
85 90 95
Asn Ser Phe Arg Leu Phe Arg Arg Ala Arg Thr Phe Trp Ala Trp Asn
100 105 110
Pro Glu Val Asn Ala Ile Thr Val Thr Thr Val Leu Gly Gln Thr Tyr
115 120 125
Tyr Gln Pro Ile Gln Gln Ala Pro Thr Gly Ile Thr Val Thr Leu Leu
130 135 140
Ser Gly Val Leu Tyr Val Asp Gly His Arg Leu Ala Ser Gly Val Gln
145 150 155 160
Val His Asn Leu Pro Glu Tyr Met Thr Val Ala Val Pro Ser Thr Thr
165 170 175
Ile Ile Tyr Ser Arg Val Gly Arg Ser Val Asn Ser Gln Asn Ser Thr
180 185 190
Gly Trp Val Phe Tyr Val Arg Val Lys His Gly Asp Phe Ser Ala Val
195 200 205
Ser Ser Pro Met Ser Asn Met Thr Glu Asn Glu Arg Leu Leu His Phe
210 215 220
Phe
225
<210> 75
<211> 382
<212> PRT
<213> transmissible gastroenteritis virus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 75
Met Ala Asn Gln Gly Gln Arg Val Ser Trp Gly Asp Glu Ser Thr Lys
1 5 10 15
Thr Arg Gly Arg Ser Asn Ser Arg Gly Arg Lys Asn Asn Asn Ile Pro
20 25 30
Leu Ser Phe Phe Asn Pro Ile Thr Leu Gln Gln Gly Ser Lys Phe Trp
35 40 45
Asn Leu Cys Pro Arg Asp Phe Val Pro Lys Gly Ile Gly Asn Arg Asp
50 55 60
Gln Gln Ile Gly Tyr Trp Asn Arg Gln Thr Arg Tyr Arg Met Val Lys
65 70 75 80
Gly Gln Arg Lys Glu Leu Pro Glu Arg Trp Phe Phe Tyr Tyr Leu Gly
85 90 95
Thr Gly Pro His Ala Asp Ala Lys Phe Lys Asp Lys Leu Asp Gly Val
100 105 110
Val Trp Val Ala Lys Asp Gly Ala Met Asn Lys Pro Thr Thr Leu Gly
115 120 125
Ser Arg Gly Ala Asn Asn Glu Ser Lys Ala Leu Lys Phe Asp Gly Lys
130 135 140
Val Pro Gly Glu Phe Gln Leu Glu Val Asn Gln Ser Arg Asp Asn Ser
145 150 155 160
Arg Ser Arg Ser Gln Ser Arg Ser Arg Ser Arg Asn Arg Ser Gln Ser
165 170 175
Arg Gly Arg Gln Gln Phe Asn Asn Lys Lys Asp Asp Ser Val Glu Gln
180 185 190
Ala Val Leu Ala Ala Leu Lys Lys Leu Gly Val Asp Thr Glu Lys Gln
195 200 205
Gln Gln Arg Ser Arg Ser Lys Ser Lys Glu Arg Ser Asn Ser Lys Thr
210 215 220
Arg Asp Thr Thr Pro Lys Asn Glu Asn Lys His Thr Trp Lys Arg Thr
225 230 235 240
Ala Gly Lys Gly Asp Val Thr Arg Phe Tyr Gly Ala Arg Ser Ser Ser
245 250 255
Ala Asn Phe Gly Asp Thr Asp Leu Val Ala Asn Gly Ser Ser Ala Lys
260 265 270
His Tyr Pro Gln Leu Ala Glu Cys Val Pro Ser Val Ser Ser Ile Leu
275 280 285
Phe Gly Ser Tyr Trp Thr Ser Lys Glu Asp Gly Asp Gln Ile Glu Val
290 295 300
Thr Phe Thr His Lys Tyr His Leu Pro Lys Asp Asp Pro Lys Thr Gly
305 310 315 320
Gln Phe Leu Gln Gln Ile Asn Ala Tyr Ala Arg Pro Ser Glu Val Ala
325 330 335
Lys Glu Gln Arg Lys Arg Lys Ser Arg Ser Lys Ser Ala Glu Arg Ser
340 345 350
Glu Gln Asp Val Val Pro Asp Ala Leu Ile Glu Asn Tyr Thr Asp Val
355 360 365
Phe Asp Asp Thr Gln Val Glu Ile Ile Asp Glu Val Thr Asn
370 375 380
<210> 76
<211> 381
<212> PRT
<213> canine coronavirus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 76
Met Ala Ser Gln Gly Gln Arg Val Ser Trp Gly Asp Glu Ser Thr Lys
1 5 10 15
Arg Arg Gly Arg Ser Asn Ser Arg Gly Arg Lys Asn Asn Asp Ile Pro
20 25 30
Leu Ser Phe Phe Asn Pro Ile Thr Leu Glu Gln Gly Ser Lys Phe Trp
35 40 45
Asp Leu Cys Pro Arg Asp Phe Val Pro Lys Gly Ile Gly Asn Lys Asp
50 55 60
Gln Gln Ile Gly Tyr Trp Asn Arg Gln Thr Arg Tyr Arg Met Val Lys
65 70 75 80
Gly Arg Arg Lys Asn Leu Pro Glu Lys Trp Phe Phe Tyr Tyr Leu Gly
85 90 95
Thr Gly Pro His Ala Asp Ala Lys Phe Lys Gln Lys Leu Asp Gly Val
100 105 110
Val Trp Val Ala Arg Gly Asp Ser Met Thr Lys Pro Thr Thr Leu Gly
115 120 125
Thr Arg Gly Thr Asn Asn Glu Ser Lys Ala Leu Lys Phe Asp Val Lys
130 135 140
Val Pro Ser Glu Phe His Leu Glu Val Asn Gln Leu Arg Asp Asn Ser
145 150 155 160
Arg Ser Arg Ser Gln Ser Arg Ser Gln Ser Arg Asn Arg Ser Gln Ser
165 170 175
Arg Gly Arg Gln Leu Ser Asn Asn Lys Lys Asp Asp Asn Val Glu Gln
180 185 190
Ala Val Leu Ala Ala Leu Lys Lys Leu Gly Val Asp Thr Glu Lys Gln
195 200 205
Gln Arg Ser Arg Ser Lys Ser Lys Glu Arg Ser Ser Ser Lys Thr Arg
210 215 220
Asp Thr Thr Pro Lys Asn Glu Asn Lys His Thr Trp Lys Arg Thr Ala
225 230 235 240
Gly Lys Gly Asp Val Thr Lys Phe Tyr Gly Ala Arg Ser Ser Ser Ala
245 250 255
Asn Phe Gly Asp Ser Asp Leu Val Ala Asn Gly Asn Gly Ala Lys His
260 265 270
Tyr Pro Gln Leu Ala Glu Cys Val Pro Ser Val Ser Ser Ile Leu Phe
275 280 285
Gly Ser His Trp Thr Ala Lys Glu Asp Gly Asp Gln Ile Glu Val Thr
290 295 300
Phe Thr His Lys Tyr His Leu Pro Lys Asp Asp Pro Lys Thr Gly Gln
305 310 315 320
Phe Leu Gln Gln Ile Asn Ala Tyr Ala Arg Pro Ser Glu Val Ala Lys
325 330 335
Glu Gln Arg Gln Arg Lys Ala Arg Ser Lys Ser Val Glu Arg Val Glu
340 345 350
Gln Glu Val Val Pro Asp Ala Leu Thr Glu Asn Tyr Thr Asp Val Phe
355 360 365
Asp Asp Thr Gln Val Glu Ile Ile Asp Glu Val Thr Asn
370 375 380
<210> 77
<211> 377
<212> PRT
<213> EMCR coronavirus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 77
Met Ala Ser Val Asn Trp Ala Asp Asp Arg Ala Ala Arg Lys Lys Phe
1 5 10 15
Pro Pro Pro Ser Phe Tyr Met Pro Leu Leu Val Ser Ser Asp Lys Ala
20 25 30
Pro Tyr Arg Val Ile Pro Arg Asn Leu Val Pro Ile Gly Lys Gly Asn
35 40 45
Lys Asp Glu Gln Ile Gly Tyr Trp Asn Val Gln Glu Arg Trp Arg Met
50 55 60
Arg Arg Gly Gln Arg Val Asp Leu Pro Pro Lys Val His Phe Tyr Tyr
65 70 75 80
Leu Gly Thr Gly Pro His Lys Asp Leu Lys Phe Arg Gln Arg Ser Asp
85 90 95
Gly Val Val Trp Val Ala Lys Glu Gly Ala Lys Thr Val Asn Thr Ser
100 105 110
Leu Gly Asn Arg Lys Arg Asn Gln Lys Pro Leu Glu Pro Lys Phe Ser
115 120 125
Ile Ala Leu Pro Pro Glu Leu Ser Val Val Glu Phe Glu Asp Arg Ser
130 135 140
Asn Asn Ser Ser Arg Ala Ser Ser Arg Ser Ser Thr Arg Asn Asn Ser
145 150 155 160
Arg Asp Ser Ser Arg Ser Thr Ser Arg Gln Gln Ser Arg Thr Arg Ser
165 170 175
Asp Ser Asn Gln Ser Ser Ser Asp Leu Val Ala Ala Val Thr Leu Ala
180 185 190
Leu Lys Asn Leu Gly Phe Asp Asn Gln Ser Lys Ser Pro Ser Ser Ser
195 200 205
Gly Thr Ser Thr Pro Lys Lys Pro Asn Lys Pro Leu Ser Gln Pro Arg
210 215 220
Ala Asp Lys Pro Ser Gln Leu Lys Lys Pro Arg Trp Lys Arg Val Pro
225 230 235 240
Thr Arg Glu Glu Asn Val Ile Gln Cys Phe Gly Pro Arg Asp Phe Asn
245 250 255
His Asn Met Gly Asp Ser Asp Leu Val Gln Asn Gly Val Asp Ala Lys
260 265 270
Gly Phe Pro Gln Leu Ala Glu Leu Ile Pro Asn Gln Ala Ala Leu Phe
275 280 285
Phe Asp Ser Glu Val Ser Thr Asp Glu Val Gly Asp Asn Val Gln Ile
290 295 300
Thr Tyr Thr Tyr Lys Met Leu Val Ala Lys Asp Asn Lys Asn Leu Pro
305 310 315 320
Lys Phe Ile Glu Gln Ile Ser Ala Phe Thr Lys Pro Ser Ser Ile Lys
325 330 335
Glu Met Gln Ser Gln Ser Ser His Val Ala Gln Asn Thr Val Leu Asn
340 345 350
Ala Ser Ile Pro Glu Ser Lys Pro Leu Ala Asp Asp Asp Ser Ala Ile
355 360 365
Ile Glu Ile Val Asn Glu Val Leu His
370 375
<210> 78
<211> 377
<212> PRT
<213> feline coronavirus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 78
Met Ala Thr Gln Gly Gln Arg Val Asn Trp Gly Asp Glu Pro Ser Lys
1 5 10 15
Arg Arg Gly Arg Ser Asn Ser Arg Gly Arg Lys Asn Asn Asp Ile Pro
20 25 30
Leu Ser Phe Tyr Asn Pro Ile Thr Leu Glu Gln Gly Ser Lys Phe Trp
35 40 45
Asn Leu Cys Pro Arg Asp Leu Val Pro Lys Gly Ile Gly Asn Lys Asp
50 55 60
Gln Gln Ile Gly Tyr Trp Asn Arg Gln Ile Arg Tyr Arg Ile Val Lys
65 70 75 80
Gly Gln Arg Lys Glu Leu Ala Glu Arg Trp Phe Phe Tyr Phe Leu Gly
85 90 95
Thr Gly Pro His Ala Asp Ala Lys Phe Lys Asp Lys Ile Asp Gly Val
100 105 110
Phe Trp Val Ala Arg Asp Gly Ala Met Asn Lys Pro Thr Thr Leu Gly
115 120 125
Thr Arg Gly Thr Asn Asn Glu Ser Lys Pro Leu Arg Phe Asp Gly Lys
130 135 140
Ile Pro Pro Gln Phe Gln Leu Glu Val Asn Arg Ser Arg Asn Asn Ser
145 150 155 160
Arg Ser Gly Ser Gln Ser Arg Ser Val Ser Arg Asn Arg Ser Gln Ser
165 170 175
Arg Gly Arg His His Ser Asn Asn Gln Asn Asn Asn Val Glu Asp Thr
180 185 190
Ile Val Ala Val Leu Glu Lys Leu Gly Val Thr Asp Lys Gln Arg Ser
195 200 205
Arg Ser Lys Pro Arg Glu Arg Ser Asp Ser Lys Pro Arg Asp Thr Thr
210 215 220
Pro Lys Asn Ala Asn Lys His Thr Trp Lys Lys Thr Ala Gly Lys Gly
225 230 235 240
Asp Val Thr Thr Phe Tyr Gly Ala Arg Ser Ser Ser Ala Asn Phe Gly
245 250 255
Asp Ser Asp Leu Val Ala Asn Gly Asn Ala Ala Lys Cys Tyr Pro Gln
260 265 270
Ile Ala Glu Cys Val Pro Ser Val Ser Ser Ile Ile Phe Gly Ser Gln
275 280 285
Trp Ser Ala Glu Glu Ala Gly Asp Gln Val Lys Val Thr Leu Thr His
290 295 300
Thr Tyr Tyr Leu Pro Lys Asp Asp Ala Lys Thr Ser Gln Phe Leu Glu
305 310 315 320
Gln Ile Asp Ala Tyr Lys Arg Pro Ser Glu Val Ala Lys Asp Gln Arg
325 330 335
Gln Arg Arg Ser Arg Ser Lys Ser Ala Asp Lys Lys Pro Glu Glu Leu
340 345 350
Ser Val Thr Leu Val Glu Ala Tyr Thr Asp Val Phe Asp Asp Thr Gln
355 360 365
Val Glu Met Ile Asp Glu Val Thr Asn
370 375
<210> 79
<211> 441
<212> PRT
<213> porcine epidemic diahrrea virus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 79
Met Ala Ser Val Ser Phe Gln Asp Arg Gly Arg Lys Arg Val Pro Leu
1 5 10 15
Ser Leu Tyr Ala Pro Leu Arg Val Thr Asn Asp Lys Pro Leu Ser Lys
20 25 30
Val Leu Ala Asn Asn Ala Val Pro Thr Asn Lys Gly Asn Lys Asp Gln
35 40 45
Gln Ile Gly Tyr Trp Asn Glu Gln Ile Arg Trp Arg Met Arg Arg Gly
50 55 60
Glu Arg Ile Glu Gln Pro Ser Asn Trp His Phe Tyr Tyr Leu Gly Thr
65 70 75 80
Gly Pro His Gly Asp Leu Arg Tyr Arg Thr Arg Thr Glu Gly Val Phe
85 90 95
Trp Val Ala Lys Glu Gly Ala Lys Thr Glu Pro Thr Asn Leu Gly Val
100 105 110
Arg Lys Ala Ser Glu Lys Pro Ile Ile Pro Lys Phe Ser Gln Gln Leu
115 120 125
Pro Ser Val Val Glu Ile Val Glu Pro Asn Thr Pro Pro Ala Ser Arg
130 135 140
Ala Asn Ser Arg Ser Arg Ser Arg Gly Asn Gly Asn Asn Arg Ser Arg
145 150 155 160
Ser Pro Ser Asn Asn Arg Gly Asn Asn Gln Ser Arg Gly Asn Ser Gln
165 170 175
Asn Arg Gly Asn Asn Gln Gly Arg Gly Ala Ser Gln Asn Arg Gly Gly
180 185 190
Asn Asn Asn Asn Asn Asn Lys Ser Arg Asn Gln Ser Asn Asn Arg Asn
195 200 205
Gln Ser Asn Asp Arg Gly Gly Val Thr Ser Arg Asp Asp Leu Val Ala
210 215 220
Ala Val Lys Asp Ala Leu Lys Ser Leu Gly Ile Gly Glu Asn Pro Asp
225 230 235 240
Arg His Lys Gln Gln Gln Lys Pro Lys Gln Glu Lys Ser Asp Asn Ser
245 250 255
Gly Lys Asn Thr Pro Lys Lys Asn Lys Ser Arg Ala Thr Ser Lys Glu
260 265 270
Arg Asp Leu Lys Asp Ile Pro Glu Trp Arg Arg Ile Pro Lys Gly Glu
275 280 285
Asn Ser Val Ala Ala Cys Phe Gly Pro Arg Gly Gly Phe Lys Asn Phe
290 295 300
Gly Asp Ala Glu Phe Val Glu Lys Gly Val Asp Ala Ser Gly Tyr Ala
305 310 315 320
Gln Ile Ala Ser Leu Ala Pro Asn Val Ala Ala Leu Leu Phe Gly Gly
325 330 335
Asn Val Ala Val Arg Glu Leu Ala Asp Ser Tyr Glu Ile Thr Tyr Asn
340 345 350
Tyr Lys Met Thr Val Pro Lys Ser Asp Pro Asn Val Glu Leu Leu Val
355 360 365
Ser Gln Val Asp Ala Phe Lys Thr Gly Asn Ala Lys Leu Gln Arg Lys
370 375 380
Lys Glu Lys Lys Asn Lys Arg Glu Thr Thr Leu Gln Gln His Glu Glu
385 390 395 400
Ala Ile Tyr Asp Asp Val Gly Ala Pro Ser Asp Val Thr His Ala Asn
405 410 415
Leu Glu Trp Asp Thr Ala Val Asp Gly Gly Asp Thr Ala Val Glu Ile
420 425 430
Ile Asn Glu Ile Phe Asp Thr Gly Asn
435 440
<210> 80
<211> 382
<212> PRT
<213> porcine respiratory coronavirus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 80
Met Ala Asn Gln Gly Gln Arg Val Ser Trp Gly Asp Glu Ser Thr Lys
1 5 10 15
Ile Arg Gly Arg Ser Asn Ser Arg Gly Arg Lys Ile Asn Asn Ile Pro
20 25 30
Leu Ser Phe Phe Asn Pro Ile Thr Leu Gln Gln Gly Ala Lys Phe Trp
35 40 45
Asn Ser Cys Pro Arg Asp Phe Val Pro Lys Gly Ile Gly Asn Arg Asp
50 55 60
Gln Gln Ile Gly Tyr Trp Asn Arg Gln Thr Arg Tyr Arg Met Val Lys
65 70 75 80
Gly Gln Arg Lys Glu Leu Pro Glu Arg Trp Phe Phe Tyr Tyr Leu Gly
85 90 95
Thr Gly Pro His Ala Asp Ala Lys Phe Lys Asp Lys Leu Asp Gly Val
100 105 110
Val Trp Val Ala Lys Asp Gly Ala Met Asn Lys Pro Thr Thr Leu Gly
115 120 125
Ser Arg Gly Ala Asn Asn Glu Ser Lys Ala Leu Lys Phe Asp Gly Lys
130 135 140
Val Pro Gly Glu Phe Gln Leu Glu Val Asn Gln Ser Arg Asp Asn Ser
145 150 155 160
Arg Ser Arg Ser Gln Ser Arg Ser Arg Ser Arg Asn Arg Ser Gln Ser
165 170 175
Arg Gly Arg Gln Gln Ser Asn Asn Lys Lys Asp Asp Ser Val Glu Gln
180 185 190
Ala Val Leu Ala Ala Leu Lys Lys Leu Gly Val Tyr Thr Glu Lys Gln
195 200 205
Gln Gln Arg Ser Arg Ser Lys Ser Lys Glu Arg Ser Asn Ser Lys Thr
210 215 220
Arg Asp Thr Thr Pro Lys Asn Glu Asn Lys His Thr Trp Lys Arg Thr
225 230 235 240
Ala Gly Lys Gly Asp Val Thr Arg Phe Tyr Gly Ala Arg Ser Ser Ser
245 250 255
Ala Asn Phe Gly Asp Ser Asp Leu Val Ala Asn Gly Ser Ser Ala Lys
260 265 270
His Tyr Pro Gln Leu Ala Glu Cys Val Pro Ser Val Ser Ser Ile Leu
275 280 285
Phe Gly Ser Tyr Trp Thr Ser Lys Glu Asp Gly Asp Gln Ile Glu Val
290 295 300
Thr Phe Thr His Lys Tyr His Leu Pro Lys Asp His Pro Lys Thr Glu
305 310 315 320
Gln Phe Leu Gln Gln Ile Asn Ala Tyr Ala Ser Pro Ser Glu Leu Ala
325 330 335
Lys Glu Gln Arg Lys Arg Lys Ser Arg Ser Lys Ser Ala Glu Arg Ser
340 345 350
Glu Gln Glu Val Val Pro Asp Ser Leu Ile Glu Asn Tyr Thr Asp Val
355 360 365
Phe Asp Asp Thr Gln Val Glu Met Ile Asp Glu Val Thr Asn
370 375 380
<210> 81
<211> 454
<212> PRT
<213> rat coronavirus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 81
Met Ser Phe Val Pro Gly Gln Glu Asn Ala Gly Ser Arg Ser Ser Ser
1 5 10 15
Gly Asn Arg Ala Gly Asn Gly Ile Leu Lys Lys Thr Thr Trp Ala Asp
20 25 30
Gln Thr Glu Arg Gly Gln Asn Asn Gly Asn Arg Gly Arg Arg Asn Gln
35 40 45
Pro Lys Gln Thr Ala Thr Thr Gln Pro Asn Thr Gly Ser Val Val Pro
50 55 60
His Tyr Ser Trp Phe Ser Gly Ile Thr Gln Phe Gln Lys Gly Lys Glu
65 70 75 80
Phe Gln Phe Ala Gly Gly Gln Gly Val Pro Ile Ala Asn Gly Ile Pro
85 90 95
Pro Ser Glu Gln Lys Gly Tyr Trp Tyr Arg His Asn Arg Arg Ser Phe
100 105 110
Lys Thr Pro Asp Gly Gln Gln Lys Gln Leu Leu Pro Arg Trp Tyr Phe
115 120 125
Tyr Tyr Leu Gly Thr Gly Pro His Ala Gly Ala Ser Phe Gly Asp Ser
130 135 140
Ile Glu Gly Val Phe Trp Val Ala Asn Ser Gln Ala Asp Thr Asn Thr
145 150 155 160
Ser Ala Asp Ile Val Glu Arg Asp Pro Ser Ser His Glu Ala Ile Pro
165 170 175
Thr Arg Phe Ala Pro Gly Thr Val Leu Pro Gln Gly Phe Tyr Val Glu
180 185 190
Gly Ser Gly Arg Ser Ala Pro Ala Ser Arg Ser Gly Ser Arg Ser Gln
195 200 205
Ser Arg Gly Pro Asn Asn Arg Ala Arg Ser Ser Ser Asn Gln Arg Gln
210 215 220
Pro Ala Ser Thr Val Lys Pro Asp Met Ala Glu Glu Ile Ala Ala Leu
225 230 235 240
Val Leu Ala Asn Leu Gly Lys Asp Ala Gly Gln Pro Lys Gln Val Thr
245 250 255
Lys Gln Ser Ala Lys Glu Val Arg Gln Lys Ile Leu Asn Lys Pro Arg
260 265 270
Gln Lys Arg Thr Pro Asn Lys Gln Cys Pro Val Gln Gln Cys Phe Gly
275 280 285
Lys Arg Gly Pro Asn Gln Asn Phe Gly Gly Pro Glu Met Leu Lys Leu
290 295 300
Gly Thr Ser Asp Pro Gln Phe Pro Ile Leu Ala Glu Leu Ala Pro Thr
305 310 315 320
Pro Gly Ala Phe Phe Phe Gly Ser Lys Leu Glu Leu Val Lys Lys Asn
325 330 335
Ser Gly Gly Val Asp Glu Pro Thr Lys Asp Val Tyr Glu Leu Gln Tyr
340 345 350
Ser Gly Ala Val Arg Phe Asp Ser Thr Leu Pro Gly Phe Glu Thr Ile
355 360 365
Met Lys Val Leu Asn Glu Asn Leu Asn Ala Tyr Gln Asn Gln Ala Gly
370 375 380
Gly Ala Asp Val Val Ser Pro Lys Pro Gln Arg Lys Arg Gly Thr Lys
385 390 395 400
Gln Thr Ala Gln Lys Glu Glu Leu Asp Ser Ile Ser Val Ala Lys Pro
405 410 415
Lys Ser Ala Val Gln Arg Asn Val Ser Arg Glu Leu Thr Pro Glu Asp
420 425 430
Arg Ser Leu Leu Ala Gln Ile Leu Asp Asp Gly Val Val Pro Asp Gly
435 440 445
Leu Asp Asp Ser Asn Val
450
<210> 82
<211> 389
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 82
Met Ala Thr Val Lys Trp Ala Asp Ala Ser Glu Pro Gln Arg Gly Arg
1 5 10 15
Gln Gly Arg Ile Pro Tyr Ser Leu Tyr Ser Pro Leu Leu Val Asp Ser
20 25 30
Glu Gln Pro Trp Lys Val Ile Pro Arg Asn Leu Val Pro Ile Asn Lys
35 40 45
Lys Asp Lys Asn Lys Leu Ile Gly Tyr Trp Asn Val Gln Lys Arg Phe
50 55 60
Arg Thr Arg Lys Gly Lys Arg Val Asp Leu Ser Pro Lys Leu His Phe
65 70 75 80
Tyr Tyr Leu Gly Thr Gly Pro His Lys Asp Ala Lys Phe Arg Glu Arg
85 90 95
Val Glu Gly Val Val Trp Val Ala Val Asp Gly Ala Lys Thr Glu Pro
100 105 110
Thr Gly Tyr Gly Val Arg Arg Lys Asn Ser Glu Pro Glu Ile Pro His
115 120 125
Phe Asn Gln Lys Leu Pro Asn Gly Val Thr Val Val Glu Glu Pro Asp
130 135 140
Ser Arg Ala Pro Ser Arg Ser Gln Ser Arg Ser Gln Ser Arg Gly Arg
145 150 155 160
Gly Glu Ser Lys Pro Gln Ser Arg Asn Pro Ser Ser Asp Arg Asn His
165 170 175
Asn Ser Gln Asp Asp Ile Met Lys Ala Val Ala Ala Ala Leu Lys Ser
180 185 190
Leu Gly Phe Asp Lys Pro Gln Glu Lys Asp Lys Lys Ser Ala Lys Thr
195 200 205
Gly Thr Pro Lys Pro Ser Arg Asn Gln Ser Pro Ala Ser Ser Gln Thr
210 215 220
Ser Ala Lys Ser Leu Ala Arg Ser Gln Ser Ser Glu Thr Lys Glu Gln
225 230 235 240
Lys His Glu Met Gln Lys Pro Arg Trp Lys Arg Gln Pro Asn Asp Asp
245 250 255
Val Thr Ser Asn Val Thr Gln Cys Phe Gly Pro Arg Asp Leu Asp His
260 265 270
Asn Phe Gly Ser Ala Gly Val Val Ala Asn Gly Val Lys Ala Lys Gly
275 280 285
Tyr Pro Gln Phe Ala Glu Leu Val Pro Ser Thr Ala Ala Met Leu Phe
290 295 300
Asp Ser His Ile Val Ser Lys Glu Ser Gly Asn Thr Val Val Leu Thr
305 310 315 320
Phe Thr Thr Arg Val Thr Val Pro Lys Asp His Pro His Leu Gly Lys
325 330 335
Phe Leu Glu Glu Leu Asn Ala Phe Thr Arg Glu Met Gln Gln His Pro
340 345 350
Leu Leu Asn Pro Ser Ala Leu Glu Phe Asn Pro Ser Gln Thr Ser Pro
355 360 365
Ala Thr Ala Glu Pro Val Arg Asp Glu Val Ser Ile Glu Thr Asp Ile
370 375 380
Ile Asp Glu Val Asn
385
<210> 83
<211> 264
<212> DNA
<213> EMCR Coronavirus
<220>
<221> 5'UTR
<222> (1)..(264)
<400> 83
agatagagaa ttttcttatt tagactttgt gtctactcct ctcaactaaa cgaaattttt 60
ctagtgctgt catttgttat ggcagtccta gtgtaattga aatttcgtca agtttgtaaa 120
ctggttaggc aagtgttgta ttttctgtgt ttaagcactg gtggttctgt ccactagtgc 180
acacattgat acttaagtgg tgttctgtca ctgcttattg tggaagcaac gttctgtcgt 240
tgtggaaacc aataactgct aacc 264
<210> 84
<211> 2685
<212> PRT
<213> human SARS virus
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 84
Thr Pro Cys Gly Thr Gly Thr Ser Thr Asp Val Val Tyr Arg Ala Phe
1 5 10 15
Asp Ile Tyr Asn Glu Lys Val Ala Gly Phe Ala Lys Phe Leu Lys Thr
20 25 30
Asn Cys Cys Arg Phe Gln Glu Lys Asp Glu Glu Gly Asn Leu Leu Asp
35 40 45
Ser Tyr Phe Val Val Lys Arg His Thr Met Ser Asn Tyr Gln His Glu
50 55 60
Glu Thr Ile Tyr Asn Leu Val Lys Asp Cys Pro Ala Val Ala Val His
65 70 75 80
Asp Phe Phe Lys Phe Arg Val Asp Gly Asp Met Val Pro His Ile Ser
85 90 95
Arg Gln Arg Leu Thr Lys Tyr Thr Met Ala Asp Leu Val Tyr Ala Leu
100 105 110
Arg His Phe Asp Glu Gly Asn Cys Asp Thr Leu Lys Glu Ile Leu Val
115 120 125
Thr Tyr Asn Cys Cys Asp Asp Asp Tyr Phe Asn Lys Lys Asp Trp Tyr
130 135 140
Asp Phe Val Glu Asn Pro Asp Ile Leu Arg Val Tyr Ala Asn Leu Gly
145 150 155 160
Glu Arg Val Arg Gln Ser Leu Leu Lys Thr Val Gln Phe Cys Asp Ala
165 170 175
Met Arg Asp Ala Gly Ile Val Gly Val Leu Thr Leu Asp Asn Gln Asp
180 185 190
Leu Asn Gly Asn Trp Tyr Asp Phe Gly Asp Phe Val Gln Val Ala Pro
195 200 205
Gly Cys Gly Val Pro Ile Val Asp Ser Tyr Tyr Ser Leu Leu Met Pro
210 215 220
Ile Leu Thr Leu Thr Arg Ala Leu Ala Ala Glu Ser His Met Asp Ala
225 230 235 240
Asp Leu Ala Lys Pro Leu Ile Lys Trp Asp Leu Leu Lys Tyr Asp Phe
245 250 255
Thr Glu Glu Arg Leu Cys Leu Phe Asp Arg Tyr Phe Lys Tyr Trp Asp
260 265 270
Gln Thr Tyr His Pro Asn Cys Ile Asn Cys Leu Asp Asp Arg Cys Ile
275 280 285
Leu His Cys Ala Asn Phe Asn Val Leu Phe Ser Thr Val Phe Pro Pro
290 295 300
Thr Ser Phe Gly Pro Leu Val Arg Lys Ile Phe Val Asp Gly Val Pro
305 310 315 320
Phe Val Val Ser Thr Gly Tyr His Phe Arg Glu Leu Gly Val Val His
325 330 335
Asn Gln Asp Val Asn Leu His Ser Ser Arg Leu Ser Phe Lys Glu Leu
340 345 350
Leu Val Tyr Ala Ala Asp Pro Ala Met His Ala Ala Ser Gly Asn Leu
355 360 365
Leu Leu Asp Lys Arg Thr Thr Cys Phe Ser Val Ala Ala Leu Thr Asn
370 375 380
Asn Val Ala Phe Gln Thr Val Lys Pro Gly Asn Phe Asn Lys Asp Phe
385 390 395 400
Tyr Asp Phe Ala Val Ser Lys Gly Phe Phe Lys Glu Gly Ser Ser Val
405 410 415
Glu Leu Lys His Phe Phe Phe Ala Gln Asp Gly Asn Ala Ala Ile Ser
420 425 430
Asp Tyr Asp Tyr Tyr Arg Tyr Asn Leu Pro Thr Met Cys Asp Ile Arg
435 440 445
Gln Leu Leu Phe Val Val Glu Val Val Asp Lys Tyr Phe Asp Cys Tyr
450 455 460
Asp Gly Gly Cys Ile Asn Ala Asn Gln Val Ile Val Asn Asn Leu Asp
465 470 475 480
Lys Ser Ala Gly Phe Pro Phe Asn Lys Trp Gly Lys Ala Arg Leu Tyr
485 490 495
Tyr Asp Ser Met Ser Tyr Glu Asp Gln Asp Ala Leu Phe Ala Tyr Thr
500 505 510
Lys Arg Asn Val Ile Pro Thr Ile Thr Gln Met Asn Leu Lys Tyr Ala
515 520 525
Ile Ser Ala Lys Asn Arg Ala Arg Thr Val Ala Gly Val Ser Ile Cys
530 535 540
Ser Thr Met Thr Asn Arg Gln Phe His Gln Lys Leu Leu Lys Ser Ile
545 550 555 560
Ala Ala Thr Arg Gly Ala Thr Val Val Ile Gly Thr Ser Lys Phe Tyr
565 570 575
Gly Gly Trp His Asn Met Leu Lys Thr Val Tyr Ser Asp Val Glu Thr
580 585 590
Pro His Leu Met Gly Trp Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro
595 600 605
Asn Met Leu Arg Ile Met Ala Ser Leu Val Leu Ala Arg Lys His Asn
610 615 620
Thr Cys Cys Asn Leu Ser His Arg Phe Tyr Arg Leu Ala Asn Glu Cys
625 630 635 640
Ala Gln Val Leu Ser Glu Met Val Met Cys Gly Gly Ser Leu Tyr Val
645 650 655
Lys Pro Gly Gly Thr Ser Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn
660 665 670
Ser Val Phe Asn Ile Cys Gln Ala Val Thr Ala Asn Val Asn Ala Leu
675 680 685
Leu Ser Thr Asp Gly Asn Lys Ile Ala Asp Lys Tyr Val Arg Asn Leu
690 695 700
Gln His Arg Leu Tyr Glu Cys Leu Tyr Arg Asn Arg Asp Val Asp His
705 710 715 720
Glu Phe Val Asp Glu Phe Tyr Ala Tyr Leu Arg Lys His Phe Ser Met
725 730 735
Met Ile Leu Ser Asp Asp Ala Val Val Cys Tyr Asn Ser Asn Tyr Ala
740 745 750
Ala Gln Gly Leu Val Ala Ser Ile Lys Asn Phe Lys Ala Val Leu Tyr
755 760 765
Tyr Gln Asn Asn Val Phe Met Ser Glu Ala Lys Cys Trp Thr Glu Thr
770 775 780
Asp Leu Thr Lys Gly Pro His Glu Phe Cys Ser Gln His Thr Met Leu
785 790 795 800
Val Lys Gln Gly Asp Asp Tyr Val Tyr Leu Pro Tyr Pro Asp Pro Ser
805 810 815
Arg Ile Leu Gly Ala Gly Cys Phe Val Asp Asp Ile Val Lys Thr Asp
820 825 830
Gly Thr Leu Met Ile Glu Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr
835 840 845
Pro Leu Thr Lys His Pro Asn Gln Glu Tyr Ala Asp Val Phe His Leu
850 855 860
Tyr Leu Gln Tyr Ile Arg Lys Leu His Asp Glu Leu Thr Gly His Met
865 870 875 880
Leu Asp Met Tyr Ser Val Met Leu Thr Asn Asp Asn Thr Ser Arg Tyr
885 890 895
Trp Glu Pro Glu Phe Tyr Glu Ala Met Tyr Thr Pro His Thr Val Leu
900 905 910
Gln Ala Val Gly Ala Cys Val Leu Cys Asn Ser Gln Thr Ser Leu Arg
915 920 925
Cys Gly Ala Cys Ile Arg Arg Pro Phe Leu Cys Cys Lys Cys Cys Tyr
930 935 940
Asp His Val Ile Ser Thr Ser His Lys Leu Val Leu Ser Val Asn Pro
945 950 955 960
Tyr Val Cys Asn Ala Pro Gly Cys Asp Val Thr Asp Val Thr Gln Leu
965 970 975
Tyr Leu Gly Gly Met Ser Tyr Tyr Cys Lys Ser His Lys Pro Pro Ile
980 985 990
Ser Phe Pro Leu Cys Ala Asn Gly Gln Val Phe Gly Leu Tyr Lys Asn
995 1000 1005
Thr Cys Val Gly Ser Asp Asn Val Thr Asp Phe Asn Ala Ile Ala
1010 1015 1020
Thr Cys Asp Trp Thr Asn Ala Gly Asp Tyr Ile Leu Ala Asn Thr
1025 1030 1035
Cys Thr Glu Arg Leu Lys Leu Phe Ala Ala Glu Thr Leu Lys Ala
1040 1045 1050
Thr Glu Glu Thr Phe Lys Leu Ser Tyr Gly Ile Ala Thr Val Arg
1055 1060 1065
Glu Val Leu Ser Asp Arg Glu Leu His Leu Ser Trp Glu Val Gly
1070 1075 1080
Lys Pro Arg Pro Pro Leu Asn Arg Asn Tyr Val Phe Thr Gly Tyr
1085 1090 1095
Arg Val Thr Lys Asn Ser Lys Val Gln Ile Gly Glu Tyr Thr Phe
1100 1105 1110
Glu Lys Gly Asp Tyr Gly Asp Ala Val Val Tyr Arg Gly Thr Thr
1115 1120 1125
Thr Tyr Lys Leu Asn Val Gly Asp Tyr Phe Val Leu Thr Ser His
1130 1135 1140
Thr Val Met Pro Leu Ser Ala Pro Thr Leu Val Pro Gln Glu His
1145 1150 1155
Tyr Val Arg Ile Thr Gly Leu Tyr Pro Thr Leu Asn Ile Ser Asp
1160 1165 1170
Glu Phe Ser Ser Asn Val Ala Asn Tyr Gln Lys Val Gly Met Gln
1175 1180 1185
Lys Tyr Ser Thr Leu Gln Gly Pro Pro Gly Thr Gly Lys Ser His
1190 1195 1200
Phe Ala Ile Gly Leu Ala Leu Tyr Tyr Pro Ser Ala Arg Ile Val
1205 1210 1215
Tyr Thr Ala Cys Ser His Ala Ala Val Asp Ala Leu Cys Glu Lys
1220 1225 1230
Ala Leu Lys Tyr Leu Pro Ile Asp Lys Cys Ser Arg Ile Ile Pro
1235 1240 1245
Ala Arg Ala Arg Val Glu Cys Phe Asp Lys Phe Lys Val Asn Ser
1250 1255 1260
Thr Leu Glu Gln Tyr Val Phe Cys Thr Val Asn Ala Leu Pro Glu
1265 1270 1275
Thr Thr Ala Asp Ile Val Val Phe Asp Glu Ile Ser Met Ala Thr
1280 1285 1290
Asn Tyr Asp Leu Ser Val Val Asn Ala Arg Leu Arg Ala Lys His
1295 1300 1305
Tyr Val Tyr Ile Gly Asp Pro Ala Gln Leu Pro Ala Pro Arg Thr
1310 1315 1320
Leu Leu Thr Lys Gly Thr Leu Glu Pro Glu Tyr Phe Asn Ser Val
1325 1330 1335
Cys Arg Leu Met Lys Thr Ile Gly Pro Asp Met Phe Leu Gly Thr
1340 1345 1350
Cys Arg Arg Cys Pro Ala Glu Ile Val Asp Thr Val Ser Ala Leu
1355 1360 1365
Val Tyr Asp Asn Lys Leu Lys Ala His Lys Asp Lys Ser Ala Gln
1370 1375 1380
Cys Phe Lys Met Phe Tyr Lys Gly Val Ile Thr His Asp Val Ser
1385 1390 1395
Ser Ala Ile Asn Arg Pro Gln Ile Gly Val Val Arg Glu Phe Leu
1400 1405 1410
Thr Arg Asn Pro Ala Trp Arg Lys Ala Val Phe Ile Ser Pro Tyr
1415 1420 1425
Asn Ser Gln Asn Ala Val Ala Ser Lys Ile Leu Gly Leu Pro Thr
1430 1435 1440
Gln Thr Val Asp Ser Ser Gln Gly Ser Glu Tyr Asp Tyr Val Ile
1445 1450 1455
Phe Thr Gln Thr Thr Glu Thr Ala His Ser Cys Asn Val Asn Arg
1460 1465 1470
Phe Asn Val Ala Ile Thr Arg Ala Lys Ile Gly Ile Leu Cys Ile
1475 1480 1485
Met Ser Asp Arg Asp Leu Tyr Asp Lys Leu Gln Phe Thr Ser Leu
1490 1495 1500
Glu Ile Pro Arg Arg Asn Val Ala Thr Leu Gln Ala Glu Asn Val
1505 1510 1515
Thr Gly Leu Phe Lys Asp Cys Ser Lys Ile Ile Thr Gly Leu His
1520 1525 1530
Pro Thr Gln Ala Pro Thr His Leu Ser Val Asp Ile Lys Phe Lys
1535 1540 1545
Thr Glu Gly Leu Cys Val Asp Ile Pro Gly Ile Pro Lys Asp Met
1550 1555 1560
Thr Tyr Arg Arg Leu Ile Ser Met Met Gly Phe Lys Met Asn Tyr
1565 1570 1575
Gln Val Asn Gly Tyr Pro Asn Met Phe Ile Thr Arg Glu Glu Ala
1580 1585 1590
Ile Arg His Val Arg Ala Trp Ile Gly Phe Asp Val Glu Gly Cys
1595 1600 1605
His Ala Thr Arg Asp Ala Val Gly Thr Asn Leu Pro Leu Gln Leu
1610 1615 1620
Gly Phe Ser Thr Gly Val Asn Leu Val Ala Val Pro Thr Gly Tyr
1625 1630 1635
Val Asp Thr Glu Asn Asn Thr Glu Phe Thr Arg Val Asn Ala Lys
1640 1645 1650
Pro Pro Pro Gly Asp Gln Phe Lys His Leu Ile Pro Leu Met Tyr
1655 1660 1665
Lys Gly Leu Pro Trp Asn Val Val Arg Ile Lys Ile Val Gln Met
1670 1675 1680
Leu Ser Asp Thr Leu Lys Gly Leu Ser Asp Arg Val Val Phe Val
1685 1690 1695
Leu Trp Ala His Gly Phe Glu Leu Thr Ser Met Lys Tyr Phe Val
1700 1705 1710
Lys Ile Gly Pro Glu Arg Thr Cys Cys Leu Cys Asp Lys Arg Ala
1715 1720 1725
Thr Cys Phe Ser Thr Ser Ser Asp Thr Tyr Ala Cys Trp Asn His
1730 1735 1740
Ser Val Gly Phe Asp Tyr Val Tyr Asn Pro Phe Met Ile Asp Val
1745 1750 1755
Gln Gln Trp Gly Phe Thr Gly Asn Leu Gln Ser Asn His Asp Gln
1760 1765 1770
His Cys Gln Val His Gly Asn Ala His Val Ala Ser Cys Asp Ala
1775 1780 1785
Ile Met Thr Arg Cys Leu Ala Val His Glu Cys Phe Val Lys Arg
1790 1795 1800
Val Asp Trp Ser Val Glu Tyr Pro Ile Ile Gly Asp Glu Leu Arg
1805 1810 1815
Val Asn Ser Ala Cys Arg Lys Val Gln His Met Val Val Lys Ser
1820 1825 1830
Ala Leu Leu Ala Asp Lys Phe Pro Val Leu His Asp Ile Gly Asn
1835 1840 1845
Pro Lys Ala Ile Lys Cys Val Pro Gln Ala Glu Val Glu Trp Lys
1850 1855 1860
Phe Tyr Asp Ala Gln Pro Cys Ser Asp Lys Ala Tyr Lys Ile Glu
1865 1870 1875
Glu Leu Phe Tyr Ser Tyr Ala Thr His His Asp Lys Phe Thr Asp
1880 1885 1890
Gly Val Cys Leu Phe Trp Asn Cys Asn Val Asp Arg Tyr Pro Ala
1895 1900 1905
Asn Ala Ile Val Cys Arg Phe Asp Thr Arg Val Leu Ser Asn Leu
1910 1915 1920
Asn Leu Pro Gly Cys Asp Gly Gly Ser Leu Tyr Val Asn Lys His
1925 1930 1935
Ala Phe His Thr Pro Ala Phe Asp Lys Ser Ala Phe Thr Asn Leu
1940 1945 1950
Lys Gln Leu Pro Phe Phe Tyr Tyr Ser Asp Ser Pro Cys Glu Ser
1955 1960 1965
His Gly Lys Gln Val Val Ser Asp Ile Asp Tyr Val Pro Leu Lys
1970 1975 1980
Ser Ala Thr Cys Ile Thr Arg Cys Asn Leu Gly Gly Ala Val Cys
1985 1990 1995
Arg His His Ala Asn Glu Tyr Arg Gln Tyr Leu Asp Ala Tyr Asn
2000 2005 2010
Met Met Ile Ser Ala Gly Phe Ser Leu Trp Ile Tyr Lys Gln Phe
2015 2020 2025
Asp Thr Tyr Asn Leu Trp Asn Thr Phe Thr Arg Leu Gln Ser Leu
2030 2035 2040
Glu Asn Val Ala Tyr Asn Val Val Asn Lys Gly His Phe Asp Gly
2045 2050 2055
His Ala Gly Glu Ala Pro Val Ser Ile Ile Asn Asn Ala Val Tyr
2060 2065 2070
Thr Lys Val Asp Gly Ile Asp Val Glu Ile Phe Glu Asn Lys Thr
2075 2080 2085
Thr Leu Pro Val Asn Val Ala Phe Glu Leu Trp Ala Lys Arg Asn
2090 2095 2100
Ile Lys Pro Val Pro Glu Ile Lys Ile Leu Asn Asn Leu Gly Val
2105 2110 2115
Asp Ile Ala Ala Asn Thr Val Ile Trp Asp Tyr Lys Arg Glu Ala
2120 2125 2130
Pro Ala His Val Ser Thr Ile Gly Val Cys Thr Met Thr Asp Ile
2135 2140 2145
Ala Lys Lys Pro Thr Glu Ser Ala Cys Ser Ser Leu Thr Val Leu
2150 2155 2160
Phe Asp Gly Arg Val Glu Gly Gln Val Asp Leu Phe Arg Asn Ala
2165 2170 2175
Arg Asn Gly Val Leu Ile Thr Glu Gly Ser Val Lys Gly Leu Thr
2180 2185 2190
Pro Ser Lys Gly Pro Ala Gln Ala Ser Val Asn Gly Val Thr Leu
2195 2200 2205
Ile Gly Glu Ser Val Lys Thr Gln Phe Asn Tyr Phe Lys Lys Val
2210 2215 2220
Asp Gly Ile Ile Gln Gln Leu Pro Glu Thr Tyr Phe Thr Gln Ser
2225 2230 2235
Arg Asp Leu Glu Asp Phe Lys Pro Arg Ser Gln Met Glu Thr Asp
2240 2245 2250
Phe Leu Glu Leu Ala Met Asp Glu Phe Ile Gln Arg Tyr Lys Leu
2255 2260 2265
Glu Gly Tyr Ala Phe Glu His Ile Val Tyr Gly Asp Phe Ser His
2270 2275 2280
Gly Gln Leu Gly Gly Leu His Leu Met Ile Gly Leu Ala Lys Arg
2285 2290 2295
Ser Gln Asp Ser Pro Leu Lys Leu Glu Asp Phe Ile Pro Met Asp
2300 2305 2310
Ser Thr Val Lys Asn Tyr Phe Ile Thr Asp Ala Gln Thr Gly Ser
2315 2320 2325
Ser Lys Cys Val Cys Ser Val Ile Asp Leu Leu Leu Asp Asp Phe
2330 2335 2340
Val Glu Ile Ile Lys Ser Gln Asp Leu Ser Val Ile Ser Lys Val
2345 2350 2355
Val Lys Val Thr Ile Asp Tyr Ala Glu Ile Ser Phe Met Leu Trp
2360 2365 2370
Cys Lys Asp Gly His Val Glu Thr Phe Tyr Pro Lys Leu Gln Ala
2375 2380 2385
Ser Gln Ala Trp Gln Pro Gly Val Ala Met Pro Asn Leu Tyr Lys
2390 2395 2400
Met Gln Arg Met Leu Leu Glu Lys Cys Asp Leu Gln Asn Tyr Gly
2405 2410 2415
Glu Asn Ala Val Ile Pro Lys Gly Ile Met Met Asn Val Ala Lys
2420 2425 2430
Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Thr Leu Thr Leu Ala Val
2435 2440 2445
Pro Tyr Asn Met Arg Val Ile His Phe Gly Ala Gly Ser Asp Lys
2450 2455 2460
Gly Val Ala Pro Gly Thr Ala Val Leu Arg Gln Trp Leu Pro Thr
2465 2470 2475
Gly Thr Leu Leu Val Asp Ser Asp Leu Asn Asp Phe Val Ser Asp
2480 2485 2490
Ala Asp Ser Thr Leu Ile Gly Asp Cys Ala Thr Val His Thr Ala
2495 2500 2505
Asn Lys Trp Asp Leu Ile Ile Ser Asp Met Tyr Asp Pro Arg Thr
2510 2515 2520
Lys His Val Thr Lys Glu Asn Asp Ser Lys Glu Gly Phe Phe Thr
2525 2530 2535
Tyr Leu Cys Gly Phe Ile Lys Gln Lys Leu Ala Leu Gly Gly Ser
2540 2545 2550
Ile Ala Val Lys Ile Thr Glu His Ser Trp Asn Ala Asp Leu Tyr
2555 2560 2565
Lys Leu Met Gly His Phe Ser Trp Trp Thr Ala Phe Val Thr Asn
2570 2575 2580
Val Asn Ala Ser Ser Ser Glu Ala Phe Leu Ile Gly Ala Asn Tyr
2585 2590 2595
Leu Gly Lys Pro Lys Glu Gln Ile Asp Gly Tyr Thr Met His Ala
2600 2605 2610
Asn Tyr Ile Phe Trp Arg Asn Thr Asn Pro Ile Gln Leu Ser Ser
2615 2620 2625
Tyr Ser Leu Phe Asp Met Ser Lys Phe Pro Leu Lys Leu Arg Gly
2630 2635 2640
Thr Ala Val Met Ser Leu Lys Glu Asn Gln Ile Asn Asp Met Ile
2645 2650 2655
Tyr Ser Leu Leu Glu Lys Gly Arg Leu Ile Ile Arg Glu Asn Asn
2660 2665 2670
Arg Val Val Val Ser Ser Asp Ile Leu Val Asn Asn
2675 2680 2685
<210> 85
<211> 2652
<212> PRT
<213> avian infectious bronchitis virus
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 85
Met Phe Gln Asn Leu Lys Arg Asn Cys Ala Arg Phe Gln Glu Leu Arg
1 5 10 15
Asp Thr Glu Asp Gly Asn Leu Glu Tyr Leu Asp Ser Tyr Phe Val Val
20 25 30
Lys Gln Thr Thr Pro Ser Asn Tyr Glu His Glu Lys Ser Cys Tyr Glu
35 40 45
Asp Leu Lys Ser Glu Val Thr Ala Asp His Asp Phe Phe Val Phe Asn
50 55 60
Lys Asn Ile Tyr Asn Ile Ser Arg Gln Arg Leu Thr Lys Tyr Thr Met
65 70 75 80
Met Asp Phe Cys Tyr Ala Leu Arg His Phe Asp Pro Lys Asp Cys Glu
85 90 95
Val Leu Lys Glu Ile Leu Val Thr Tyr Gly Cys Ile Glu Asp Tyr His
100 105 110
Pro Lys Trp Phe Glu Glu Asn Lys Asp Trp Tyr Asp Pro Ile Glu Asn
115 120 125
Ser Lys Tyr Tyr Val Met Leu Ala Lys Met Gly Pro Ile Val Arg Arg
130 135 140
Ala Leu Leu Asn Ala Ile Glu Phe Gly Asn Leu Met Val Glu Lys Gly
145 150 155 160
Tyr Val Gly Val Ile Thr Leu Asp Asn Gln Asp Leu Asn Gly Lys Phe
165 170 175
Tyr Asp Phe Gly Asp Phe Gln Lys Thr Ala Pro Gly Ala Gly Val Pro
180 185 190
Val Phe Asp Thr Tyr Tyr Ser Tyr Met Met Pro Ile Ile Ala Met Thr
195 200 205
Asp Ala Leu Ala Pro Glu Arg Tyr Phe Glu Tyr Asp Val His Lys Gly
210 215 220
Tyr Lys Ser Tyr Asp Leu Leu Lys Tyr Asp Tyr Thr Glu Glu Lys Gln
225 230 235 240
Glu Leu Phe Gln Lys Tyr Phe Lys Tyr Trp Asp Gln Glu Tyr His Pro
245 250 255
Asn Cys Arg Asp Cys Ser Asp Asp Arg Cys Leu Ile His Cys Ala Asn
260 265 270
Phe Asn Ile Leu Phe Ser Thr Leu Ile Pro Gln Thr Ser Phe Gly Asn
275 280 285
Leu Cys Arg Lys Val Phe Val Asp Gly Val Pro Phe Ile Ala Thr Cys
290 295 300
Gly Tyr His Ser Lys Glu Leu Gly Val Ile Met Asn Gln Asp Asn Thr
305 310 315 320
Met Ser Phe Ser Lys Met Gly Leu Ser Gln Leu Met Gln Phe Val Gly
325 330 335
Asp Pro Ala Leu Leu Val Gly Thr Ser Asn Asn Leu Val Asp Leu Arg
340 345 350
Thr Ser Cys Phe Ser Val Cys Ala Leu Thr Ser Gly Ile Thr His Gln
355 360 365
Thr Val Lys Pro Gly His Phe Asn Lys Asp Phe Tyr Asp Phe Ala Glu
370 375 380
Lys Ala Gly Met Phe Lys Glu Gly Ser Ser Ile Pro Leu Lys His Phe
385 390 395 400
Phe Tyr Pro Gln Thr Gly Asn Ala Ala Ile Asn Asp Tyr Asp Tyr Tyr
405 410 415
Arg Tyr Asn Arg Pro Thr Met Phe Asp Ile Cys Gln Leu Leu Phe Cys
420 425 430
Leu Glu Val Thr Ser Lys Tyr Phe Glu Cys Tyr Glu Gly Gly Cys Ile
435 440 445
Pro Ala Ser Gln Val Val Val Asn Asn Leu Asp Lys Ser Ala Gly Tyr
450 455 460
Pro Phe Asn Lys Phe Gly Lys Ala Arg Leu Tyr Tyr Glu Met Ser Leu
465 470 475 480
Glu Glu Gln Asp Gln Leu Phe Glu Ile Thr Lys Lys Asn Val Leu Pro
485 490 495
Thr Ile Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg
500 505 510
Ala Arg Thr Val Ala Gly Val Ser Ile Leu Ser Thr Met Thr Asn Arg
515 520 525
Gln Phe His Gln Lys Ile Leu Lys Ser Ile Val Asn Thr Arg Asn Ala
530 535 540
Ser Val Val Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asn Met
545 550 555 560
Leu Arg Asn Leu Ile Gln Gly Val Glu Asp Pro Ile Leu Met Gly Trp
565 570 575
Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Leu Leu Arg Ile Ala
580 585 590
Ala Ser Leu Val Leu Ala Arg Lys His Thr Asn Cys Cys Ser Trp Ser
595 600 605
Glu Arg Ile Tyr Arg Leu Tyr Asn Glu Cys Ala Gln Val Leu Ser Glu
610 615 620
Thr Val Leu Ala Thr Gly Gly Ile Tyr Val Lys Pro Gly Gly Thr Ser
625 630 635 640
Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val Phe Asn Ile Ile
645 650 655
Gln Ala Thr Ser Ala Asn Val Ala Arg Leu Leu Ser Val Ile Thr Arg
660 665 670
Asp Ile Val Tyr Asp Asn Ile Lys Ser Leu Gln Tyr Glu Leu Tyr Gln
675 680 685
Gln Val Tyr Arg Arg Val Asn Phe Asp Pro Ala Phe Val Glu Lys Phe
690 695 700
Tyr Ser Tyr Leu Cys Lys Asn Phe Ser Leu Met Ile Leu Ser Asp Asp
705 710 715 720
Gly Val Val Cys Tyr Asn Asn Thr Leu Ala Lys Gln Gly Leu Val Ala
725 730 735
Asp Ile Ser Gly Phe Arg Glu Val Leu Tyr Tyr Gln Asn Asn Val Phe
740 745 750
Met Ala Asp Ser Lys Cys Trp Val Glu Pro Asp Leu Glu Lys Gly Pro
755 760 765
His Glu Phe Cys Ser Gln His Thr Met Leu Val Glu Val Asp Gly Glu
770 775 780
Pro Lys Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly Ala Cys
785 790 795 800
Val Phe Val Asp Asp Val Asp Lys Thr Glu Pro Val Ala Val Met Glu
805 810 815
Arg Tyr Ile Ala Leu Ala Ile Asp Ala Tyr Pro Leu Val His His Glu
820 825 830
Asn Glu Glu Tyr Lys Lys Val Phe Phe Val Leu Leu Ala Tyr Ile Arg
835 840 845
Lys Leu Tyr Gln Glu Leu Ser Gln Asn Met Leu Met Asp Tyr Ser Phe
850 855 860
Val Met Asp Ile Asp Lys Gly Ser Lys Phe Trp Glu Gln Glu Phe Tyr
865 870 875 880
Glu Asn Met Tyr Arg Ala Pro Thr Thr Leu Gln Ser Cys Gly Val Cys
885 890 895
Val Val Cys Asn Ser Gln Thr Ile Leu Arg Cys Gly Asn Cys Ile Arg
900 905 910
Lys Pro Phe Leu Cys Cys Lys Cys Cys Tyr Asp His Val Met His Thr
915 920 925
Asp His Lys Asn Val Leu Ser Ile Asn Pro Tyr Ile Cys Ser Gln Leu
930 935 940
Gly Cys Gly Glu Ala Asp Val Thr Lys Leu Tyr Leu Gly Gly Met Ser
945 950 955 960
Tyr Phe Cys Gly Asn His Lys Pro Lys Leu Ser Ile Pro Leu Val Ser
965 970 975
Asn Gly Thr Val Phe Gly Ile Tyr Arg Ala Asn Cys Ala Gly Ser Glu
980 985 990
Asn Val Asp Asp Phe Asn Gln Leu Ala Thr Thr Asn Trp Ser Ile Val
995 1000 1005
Glu Pro Tyr Ile Leu Ala Asn Arg Cys Ser Asp Ser Leu Arg Arg
1010 1015 1020
Phe Ala Ala Glu Thr Val Lys Ala Thr Glu Glu Leu His Lys Gln
1025 1030 1035
Gln Phe Ala Ser Ala Glu Val Arg Glu Val Phe Ser Asp Arg Glu
1040 1045 1050
Leu Ile Leu Ser Trp Glu Pro Gly Lys Thr Arg Pro Pro Leu Asn
1055 1060 1065
Arg Asn Tyr Val Phe Thr Gly Tyr His Phe Thr Arg Thr Ser Lys
1070 1075 1080
Val Gln Leu Gly Asp Phe Thr Phe Glu Lys Gly Glu Gly Lys Asp
1085 1090 1095
Val Val Tyr Tyr Lys Ala Thr Ser Thr Ala Lys Leu Ser Val Gly
1100 1105 1110
Asp Ile Phe Val Leu Thr Ser His Asn Val Val Ser Leu Val Ala
1115 1120 1125
Pro Thr Leu Cys Pro Gln Gln Thr Phe Ser Arg Phe Val Asn Leu
1130 1135 1140
Arg Pro Asn Val Met Val Pro Glu Cys Phe Val Asn Asn Ile Pro
1145 1150 1155
Leu Tyr His Leu Val Gly Lys Gln Lys Arg Thr Thr Val Gln Gly
1160 1165 1170
Pro Pro Gly Ser Gly Lys Ser His Phe Ala Ile Gly Leu Ala Val
1175 1180 1185
Tyr Phe Ser Ser Ala Arg Val Val Phe Thr Ala Cys Ser His Ala
1190 1195 1200
Ala Val Asp Ala Leu Cys Glu Lys Ala Phe Lys Phe Leu Lys Val
1205 1210 1215
Asp Asp Cys Thr Arg Ile Val Pro Gln Arg Thr Thr Val Asp Cys
1220 1225 1230
Phe Ser Lys Phe Lys Ala Asn Asp Thr Gly Lys Lys Tyr Ile Phe
1235 1240 1245
Ser Thr Ile Asn Ala Leu Pro Glu Val Ser Cys Asp Ile Leu Leu
1250 1255 1260
Val Asp Glu Val Ser Met Leu Thr Asn Tyr Glu Leu Ser Phe Ile
1265 1270 1275
Asn Gly Lys Ile Asn Tyr Gln Tyr Val Val Tyr Val Gly Asp Pro
1280 1285 1290
Ala Gln Leu Pro Ala Pro Arg Thr Leu Leu Asn Gly Ser Leu Ser
1295 1300 1305
Pro Lys Asp Tyr Asn Val Val Thr Asn Leu Met Val Cys Val Lys
1310 1315 1320
Pro Asp Ile Phe Leu Ala Lys Cys Tyr Arg Cys Pro Lys Glu Ile
1325 1330 1335
Val Asp Thr Val Ser Thr Leu Val Tyr Asp Gly Lys Phe Ile Ala
1340 1345 1350
Asn Asn Pro Glu Ser Arg Glu Cys Phe Lys Val Ile Val Asn Asn
1355 1360 1365
Gly Asn Ser Asp Val Gly His Glu Ser Gly Ser Ala Tyr Asn Thr
1370 1375 1380
Thr Gln Leu Glu Phe Val Lys Asp Phe Val Cys Arg Asn Lys Gln
1385 1390 1395
Trp Arg Glu Ala Ile Phe Ile Ser Pro Tyr Asn Ala Met Asn Gln
1400 1405 1410
Arg Ala Tyr Arg Met Leu Gly Leu Asn Val Gln Thr Val Asp Ser
1415 1420 1425
Ser Gln Gly Ser Glu Tyr Asp Tyr Val Ile Phe Cys Val Thr Ala
1430 1435 1440
Asp Ser Gln His Ala Leu Asn Ile Asn Arg Phe Asn Val Ala Leu
1445 1450 1455
Thr Arg Ala Lys Arg Gly Ile Leu Val Val Met Arg Gln Arg Asp
1460 1465 1470
Glu Leu Tyr Ser Ala Leu Lys Phe Thr Glu Leu Asp Ser Glu Thr
1475 1480 1485
Ser Leu Gln Gly Thr Gly Leu Phe Lys Ile Cys Asn Lys Glu Phe
1490 1495 1500
Ser Gly Val His Pro Ala Tyr Ala Val Thr Thr Lys Ala Leu Ala
1505 1510 1515
Ala Thr Tyr Lys Val Asn Asp Glu Leu Ala Ala Leu Val Asn Val
1520 1525 1530
Glu Ala Gly Ser Glu Ile Thr Tyr Lys His Leu Ile Ser Leu Leu
1535 1540 1545
Gly Phe Lys Met Ser Val Asn Val Glu Gly Cys His Asn Met Phe
1550 1555 1560
Ile Thr Arg Asp Glu Ala Ile Arg Asn Val Arg Gly Trp Val Gly
1565 1570 1575
Phe Asp Val Glu Ala Thr His Ala Cys Gly Thr Asn Ile Gly Thr
1580 1585 1590
Asn Leu Pro Phe Gln Val Gly Phe Ser Thr Gly Ala Asp Phe Val
1595 1600 1605
Val Thr Pro Glu Gly Leu Val Asp Thr Ser Ile Gly Asn Asn Phe
1610 1615 1620
Glu Pro Val Asn Ser Lys Ala Pro Pro Gly Glu Gln Phe Asn His
1625 1630 1635
Leu Arg Val Leu Phe Lys Ser Ala Lys Pro Trp His Val Ile Arg
1640 1645 1650
Pro Arg Ile Val Gln Met Leu Ala Asp Asn Leu Cys Asn Val Ser
1655 1660 1665
Asp Cys Val Val Phe Val Thr Trp Cys His Gly Leu Glu Leu Thr
1670 1675 1680
Thr Leu Arg Tyr Phe Val Lys Ile Gly Lys Glu Gln Val Cys Ser
1685 1690 1695
Cys Gly Ser Arg Ala Thr Thr Phe Asn Ser His Thr Gln Ala Tyr
1700 1705 1710
Ala Cys Trp Lys His Cys Leu Gly Phe Asp Phe Val Tyr Asn Pro
1715 1720 1725
Leu Leu Val Asp Ile Gln Gln Trp Gly Tyr Ser Gly Asn Leu Gln
1730 1735 1740
Phe Asn His Asp Leu His Cys Asn Val His Gly His Ala His Val
1745 1750 1755
Ala Ser Val Asp Ala Ile Met Thr Arg Cys Leu Ala Ile Asn Asn
1760 1765 1770
Ala Phe Cys Gln Asp Val Asn Trp Asp Leu Thr Tyr Pro His Ile
1775 1780 1785
Ala Asn Glu Asp Glu Val Asn Ser Ser Cys Arg Tyr Leu Gln Arg
1790 1795 1800
Met Tyr Leu Asn Ala Cys Val Asp Ala Leu Lys Val Asn Val Val
1805 1810 1815
Tyr Asp Ile Gly Asn Pro Lys Gly Ile Lys Cys Val Arg Arg Gly
1820 1825 1830
Asp Val Asn Phe Arg Phe Tyr Asp Lys Asn Pro Ile Val Arg Asn
1835 1840 1845
Val Lys Gln Phe Glu Tyr Asp Tyr Asn Gln His Lys Asp Lys Phe
1850 1855 1860
Ala Asp Gly Leu Cys Met Phe Trp Asn Cys Asn Val Asp Cys Tyr
1865 1870 1875
Pro Asp Asn Ser Leu Val Cys Arg Tyr Asp Thr Arg Asn Leu Ser
1880 1885 1890
Val Phe Asn Leu Pro Gly Cys Asn Gly Gly Ser Leu Tyr Val Asn
1895 1900 1905
Lys His Ala Phe Tyr Thr Pro Lys Phe Asp Arg Ile Ser Phe Arg
1910 1915 1920
Asn Leu Lys Ala Met Pro Phe Phe Phe Tyr Asp Ser Ser Pro Cys
1925 1930 1935
Glu Thr Ile Gln Val Asp Gly Val Ala Gln Asp Leu Val Ser Leu
1940 1945 1950
Ala Thr Lys Asp Cys Ile Thr Lys Cys Asn Ile Gly Gly Ala Val
1955 1960 1965
Cys Lys Lys His Ala Gln Met Tyr Ala Glu Phe Val Thr Ser Tyr
1970 1975 1980
Asn Ala Ala Val Thr Ala Gly Phe Thr Phe Trp Val Thr Asn Lys
1985 1990 1995
Leu Asn Pro Tyr Asn Leu Trp Lys Ser Phe Ser Ala Leu Gln Ser
2000 2005 2010
Ile Asp Asn Ile Ala Tyr Asn Met Tyr Lys Gly Gly His Tyr Asp
2015 2020 2025
Ala Ile Ala Gly Glu Met Pro Thr Val Ile Thr Gly Asp Lys Val
2030 2035 2040
Phe Val Ile Asp Gln Gly Val Glu Lys Ala Val Phe Val Asn Gln
2045 2050 2055
Thr Thr Leu Pro Thr Ser Val Ala Phe Glu Leu Tyr Ala Lys Arg
2060 2065 2070
Asn Ile Arg Thr Leu Pro Asn Asn Arg Ile Leu Lys Gly Leu Gly
2075 2080 2085
Val Asp Val Thr Asn Gly Phe Val Ile Trp Asp Tyr Ala Asn Gln
2090 2095 2100
Thr Pro Leu Tyr Arg Asn Thr Val Lys Val Cys Ala Tyr Thr Asp
2105 2110 2115
Ile Glu Pro Asn Gly Leu Val Val Leu Tyr Asp Asp Arg Tyr Gly
2120 2125 2130
Asp Tyr Gln Ser Phe Leu Ala Ala Asp Asn Ala Val Leu Val Ser
2135 2140 2145
Thr Gln Cys Tyr Lys Arg Tyr Ser Tyr Val Glu Ile Pro Ser Asn
2150 2155 2160
Leu Leu Val Gln Asn Gly Met Pro Leu Lys Asp Gly Ala Asn Leu
2165 2170 2175
Tyr Val Tyr Lys Arg Val Asn Gly Ala Phe Val Thr Leu Pro Asn
2180 2185 2190
Thr Ile Asn Thr Gln Gly Arg Ser Tyr Glu Thr Phe Glu Pro Arg
2195 2200 2205
Ser Asp Ile Glu Arg Asp Phe Leu Ala Met Ser Glu Glu Ser Phe
2210 2215 2220
Val Glu Arg Tyr Gly Lys Asp Leu Gly Leu Gln His Ile Leu Tyr
2225 2230 2235
Gly Glu Val Asp Lys Pro Gln Leu Gly Gly Leu His Thr Val Ile
2240 2245 2250
Gly Met Tyr Arg Leu Leu Arg Ala Asn Lys Leu Asn Ala Lys Ser
2255 2260 2265
Val Thr Asn Ser Asp Ser Asp Val Met Gln Asn Tyr Phe Val Leu
2270 2275 2280
Ser Asp Asn Gly Ser Tyr Lys Gln Val Cys Thr Val Val Asp Leu
2285 2290 2295
Leu Leu Asp Asp Phe Leu Glu Leu Leu Arg Asn Ile Leu Lys Glu
2300 2305 2310
Tyr Gly Thr Asn Lys Ser Lys Val Val Thr Val Ser Ile Asp Tyr
2315 2320 2325
His Ser Ile Asn Phe Met Thr Trp Phe Glu Asp Gly Ser Ile Lys
2330 2335 2340
Thr Cys Tyr Pro Gln Leu Gln Ser Ala Trp Thr Cys Gly Tyr Asn
2345 2350 2355
Met Pro Glu Leu Tyr Lys Val Gln Asn Cys Val Met Glu Pro Cys
2360 2365 2370
Asn Ile Pro Asn Tyr Gly Val Gly Ile Thr Leu Pro Ser Gly Ile
2375 2380 2385
Leu Met Asn Val Ala Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Ser
2390 2395 2400
Lys Thr Thr Ile Cys Val Pro His Asn Met Arg Val Met His Phe
2405 2410 2415
Gly Ala Gly Ser Asp Lys Gly Val Ala Pro Gly Ser Thr Val Leu
2420 2425 2430
Lys Gln Trp Leu Pro Glu Gly Thr Leu Leu Val Asp Asn Asp Ile
2435 2440 2445
Val Asp Tyr Val Ser Asp Ala His Val Ser Val Leu Ser Asp Cys
2450 2455 2460
Asn Lys Tyr Asn Thr Glu His Lys Phe Asp Leu Val Ile Ser Asp
2465 2470 2475
Met Tyr Thr Asp Asn Asp Ser Lys Arg Lys His Glu Gly Val Ile
2480 2485 2490
Ala Asn Asn Gly Asn Asp Asp Val Phe Ile Tyr Leu Ser Ser Phe
2495 2500 2505
Leu Arg Asn Asn Leu Ala Leu Gly Gly Ser Phe Ala Val Lys Val
2510 2515 2520
Thr Glu Thr Ser Trp His Glu Val Leu Tyr Asp Ile Ala Gln Asp
2525 2530 2535
Cys Ala Trp Trp Thr Met Phe Cys Thr Ala Val Asn Ala Ser Ser
2540 2545 2550
Ser Glu Ala Phe Leu Ile Gly Val Asn Tyr Leu Gly Ala Ser Glu
2555 2560 2565
Lys Val Lys Val Ser Gly Lys Thr Leu His Ala Asn Tyr Ile Phe
2570 2575 2580
Trp Arg Asn Cys Asn Tyr Leu Gln Thr Ser Ala Tyr Ser Ile Phe
2585 2590 2595
Asp Val Ala Lys Phe Asp Leu Arg Leu Lys Ala Thr Pro Val Val
2600 2605 2610
Asn Leu Lys Thr Glu Gln Lys Thr Asp Leu Val Phe Asn Leu Ile
2615 2620 2625
Lys Cys Gly Lys Leu Leu Val Arg Asp Val Gly Asn Thr Ser Phe
2630 2635 2640
Thr Ser Asp Ser Phe Val Cys Thr Met
2645 2650
<210> 86
<211> 2685
<212> PRT
<213> bovine coronavirus
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 86
Phe Phe Lys Arg Val Arg Gly Thr Ser Val Asp Ala Arg Leu Val Pro
1 5 10 15
Cys Ala Ser Gly Leu Ser Thr Asp Val Gln Leu Arg Ala Phe Asp Ile
20 25 30
Cys Asn Ala Ser Val Ala Gly Ile Gly Leu His Leu Lys Val Asn Cys
35 40 45
Cys Arg Phe Gln Arg Val Asp Glu Asn Gly Asp Lys Leu Asp Gln Phe
50 55 60
Phe Val Val Lys Arg Thr Asp Leu Thr Ile Tyr Asn Arg Glu Met Glu
65 70 75 80
Cys Tyr Glu Arg Val Lys Asp Cys Lys Phe Val Ala Glu His Asp Phe
85 90 95
Phe Thr Phe Asp Val Glu Gly Ser Arg Val Pro His Ile Val Arg Lys
100 105 110
Asp Leu Thr Lys Tyr Thr Met Leu Asp Leu Cys Tyr Ala Leu Arg His
115 120 125
Phe Asp Arg Asn Asp Cys Met Leu Leu Cys Asp Ile Leu Ser Ile Tyr
130 135 140
Ala Gly Cys Glu Gln Ser Tyr Phe Thr Lys Lys Asp Trp Tyr Asp Phe
145 150 155 160
Val Glu Asn Pro Asp Ile Ile Asn Val Tyr Lys Lys Leu Gly Pro Ile
165 170 175
Phe Asn Arg Ala Leu Val Ser Ala Thr Glu Phe Ala Asp Lys Leu Val
180 185 190
Glu Val Gly Leu Val Gly Ile Leu Thr Leu Asp Asn Gln Asp Leu Asn
195 200 205
Gly Lys Trp Tyr Asp Phe Gly Asp Tyr Val Ile Ala Ala Pro Gly Cys
210 215 220
Gly Val Ala Ile Ala Asp Ser Tyr Tyr Ser Tyr Met Met Pro Met Leu
225 230 235 240
Thr Met Cys His Ala Leu Asp Cys Glu Leu Tyr Val Asn Asn Ala Tyr
245 250 255
Arg Leu Phe Asp Leu Val Gln Tyr Asp Phe Thr Asp Tyr Lys Leu Glu
260 265 270
Leu Phe Asn Lys Tyr Phe Lys His Trp Ser Met Pro Tyr His Pro Asn
275 280 285
Thr Val Asp Cys Gln Asp Asp Arg Cys Ile Ile His Cys Ala Asn Phe
290 295 300
Asn Ile Leu Phe Ser Met Val Leu Pro Asn Thr Cys Phe Gly Pro Leu
305 310 315 320
Val Arg Gln Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Ile Gly
325 330 335
Tyr His Tyr Lys Glu Leu Gly Ile Val Met Asn Met Asp Val Asp Thr
340 345 350
His Arg Tyr Arg Leu Ser Leu Lys Asp Leu Leu Leu Tyr Ala Ala Asp
355 360 365
Pro Ala Leu His Val Ala Ser Ala Ser Ala Leu Tyr Asp Leu Arg Thr
370 375 380
Cys Cys Phe Ser Val Ala Ala Ile Thr Ser Gly Val Lys Phe Gln Thr
385 390 395 400
Val Lys Pro Gly Asn Phe Asn Gln Asp Phe Tyr Asp Phe Ile Leu Ser
405 410 415
Lys Gly Leu Leu Lys Glu Gly Ser Ser Val Asp Leu Lys His Phe Phe
420 425 430
Phe Thr Gln Asp Gly Asn Ala Ala Ile Thr Asp Tyr Asn Tyr Tyr Lys
435 440 445
Tyr Asn Leu Pro Thr Met Val Asp Ile Lys Gln Leu Leu Phe Val Leu
450 455 460
Glu Val Val Tyr Lys Tyr Phe Glu Ile Tyr Asp Gly Gly Cys Ile Pro
465 470 475 480
Ala Ala Gln Val Ile Val Asn Asn Tyr Asp Lys Ser Ala Gly Tyr Pro
485 490 495
Phe Asn Lys Phe Gly Lys Ala Arg Leu Tyr Tyr Glu Ala Leu Ser Phe
500 505 510
Glu Glu Gln Asp Glu Ile Tyr Ala Tyr Thr Lys Arg Asn Val Leu Pro
515 520 525
Thr Leu Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg
530 535 540
Ala Arg Thr Val Ala Gly Val Ser Ile Leu Ser Thr Met Thr Gly Arg
545 550 555 560
Met Phe His Gln Lys Cys Leu Lys Ser Ile Ala Ala Thr Arg Gly Val
565 570 575
Pro Val Val Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asp Met
580 585 590
Leu Arg Arg Leu Ile Lys Asp Val Asp Asn Pro Val Leu Met Gly Trp
595 600 605
Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Ile Leu Arg Ile Val
610 615 620
Ser Ser Leu Val Leu Ala Arg Lys His Glu Ala Cys Cys Ser Gln Ser
625 630 635 640
Asp Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu Ser Glu
645 650 655
Ile Val Met Cys Gly Gly Cys Tyr Tyr Val Lys Pro Gly Gly Thr Ser
660 665 670
Ser Gly Asp Ala Thr Thr Ala Phe Ala Asn Ser Val Phe Asn Ile Cys
675 680 685
Gln Ala Val Ser Ala Asn Val Cys Ala Leu Met Ser Cys Asn Gly Asn
690 695 700
Lys Ile Glu Asp Leu Ser Ile Arg Ala Leu Gln Lys Arg Leu Tyr Ser
705 710 715 720
His Val Tyr Arg Ser Asp Met Val Asp Ser Thr Phe Val Thr Glu Tyr
725 730 735
Tyr Glu Phe Leu Asn Lys His Phe Ser Met Met Ile Leu Ser Asp Asp
740 745 750
Gly Val Val Cys Tyr Asn Ser Asp Tyr Ala Ser Lys Gly Tyr Ile Ala
755 760 765
Asn Ile Ser Ala Phe Gln Gln Val Leu Tyr Tyr Gln Asn Asn Val Phe
770 775 780
Met Ser Glu Ser Lys Cys Trp Val Glu Asn Asp Ile Asn Asn Gly Pro
785 790 795 800
His Glu Phe Cys Ser Gln His Thr Met Leu Val Lys Met Asp Gly Asp
805 810 815
Asp Val Tyr Leu Pro Tyr Pro Val Pro Ser Arg Ile Leu Gly Ala Gly
820 825 830
Cys Phe Val Asp Asp Leu Leu Lys Thr Asp Ser Val Leu Leu Ile Glu
835 840 845
Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Val Tyr His Glu
850 855 860
Asn Glu Glu Tyr Gln Lys Val Phe Arg Val Tyr Leu Glu Tyr Ile Lys
865 870 875 880
Lys Leu Tyr Asn Glu Leu Gly Asn Gln Ile Leu Asp Ser Tyr Ser Val
885 890 895
Ile Leu Ser Thr Cys Asp Gly Gln Lys Phe Thr Asp Glu Ser Phe Tyr
900 905 910
Lys Asn Met Tyr Leu Arg Ser Ala Val Met Gln Ser Val Gly Ala Cys
915 920 925
Val Val Cys Ser Ser Gln Thr Ser Leu Arg Cys Gly Ser Cys Ile Arg
930 935 940
Lys Pro Leu Leu Cys Cys Lys Cys Cys Tyr Asp His Val Met Ala Thr
945 950 955 960
Asp His Lys Tyr Val Leu Ser Val Ser Pro Tyr Val Cys Asn Ala Pro
965 970 975
Gly Cys Asp Val Asn Asp Val Thr Lys Leu Tyr Leu Gly Gly Met Ser
980 985 990
Tyr Tyr Cys Glu Asp His Lys Pro Gln Tyr Ser Phe Lys Leu Val Met
995 1000 1005
Asn Gly Met Val Phe Gly Leu Tyr Lys Gln Ser Cys Thr Gly Ser
1010 1015 1020
Pro Tyr Ile Asp Asp Phe Asn Arg Ile Ala Ser Cys Lys Trp Thr
1025 1030 1035
Asp Val Asp Asp Tyr Ile Leu Ala Asn Glu Cys Thr Glu Arg Leu
1040 1045 1050
Lys Leu Phe Ala Ala Glu Thr Gln Lys Ala Thr Glu Glu Ala Phe
1055 1060 1065
Lys Gln Ser Tyr Ala Ser Ala Thr Ile Gln Glu Ile Val Ser Glu
1070 1075 1080
Arg Glu Leu Ile Leu Ser Trp Glu Ile Gly Lys Val Lys Pro Pro
1085 1090 1095
Leu Asn Lys Asn Tyr Val Phe Thr Gly Tyr His Phe Thr Lys Asn
1100 1105 1110
Gly Lys Thr Val Leu Gly Glu Tyr Val Phe Asp Lys Ser Glu Leu
1115 1120 1125
Thr Asn Gly Val Tyr Tyr Arg Ala Thr Thr Thr Tyr Lys Leu Ser
1130 1135 1140
Val Gly Asp Val Phe Val Leu Thr Ser His Ser Val Ala Asn Leu
1145 1150 1155
Ser Ala Pro Thr Leu Val Pro Gln Glu Asn Tyr Ser Ser Ile Arg
1160 1165 1170
Phe Ala Ser Val Tyr Ser Val Leu Glu Thr Phe Gln Asn Asn Val
1175 1180 1185
Val Asn Tyr Gln His Ile Gly Met Lys Arg Tyr Cys Thr Val Gln
1190 1195 1200
Gly Pro Pro Gly Thr Gly Lys Ser His Leu Ala Ile Gly Leu Ala
1205 1210 1215
Val Tyr Tyr Cys Thr Ala Arg Val Val Tyr Thr Ala Ala Ser His
1220 1225 1230
Ala Ala Val Asp Ala Leu Cys Glu Lys Ala Tyr Lys Phe Leu Asn
1235 1240 1245
Ile Asn Asp Cys Thr Arg Ile Val Pro Ala Lys Val Arg Val Glu
1250 1255 1260
Cys Tyr Asp Lys Phe Lys Ile Asn Asp Thr Thr Arg Lys Tyr Val
1265 1270 1275
Phe Thr Thr Ile Asn Ala Leu Pro Glu Met Val Thr Asp Ile Val
1280 1285 1290
Val Val Asp Glu Val Ser Met Leu Thr Asn Tyr Glu Leu Ser Val
1295 1300 1305
Ile Asn Ala Arg Ile Arg Ala Lys His Tyr Val Tyr Ile Gly Asp
1310 1315 1320
Pro Ala Gln Leu Pro Ala Pro Arg Val Leu Leu Ser Lys Gly Thr
1325 1330 1335
Leu Glu Pro Lys Tyr Phe Asn Thr Val Thr Lys Leu Met Cys Cys
1340 1345 1350
Leu Gly Pro Asp Ile Phe Leu Gly Thr Cys Tyr Arg Cys Pro Lys
1355 1360 1365
Glu Ile Val Asp Thr Val Ser Ala Leu Val Tyr Glu Asn Lys Leu
1370 1375 1380
Lys Ala Lys Asn Glu Ser Ser Ser Leu Cys Phe Lys Val Tyr Tyr
1385 1390 1395
Lys Gly Val Thr Thr His Glu Ser Ser Ser Ala Val Asn Met Gln
1400 1405 1410
Gln Ile Tyr Leu Ile Asn Lys Phe Leu Lys Ala Asn Pro Leu Trp
1415 1420 1425
His Lys Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn Phe Ala
1430 1435 1440
Ala Lys Arg Val Leu Gly Leu Gln Thr Gln Thr Val Asp Ser Ala
1445 1450 1455
Gln Gly Ser Glu Tyr Asp Tyr Val Ile Tyr Ser Gln Thr Ala Glu
1460 1465 1470
Thr Ala His Ser Val Asn Val Asn Arg Phe Asn Val Ala Ile Thr
1475 1480 1485
Arg Ala Lys Lys Gly Ile Leu Cys Val Met Ser Asn Met Gln Leu
1490 1495 1500
Phe Glu Ala Leu Gln Phe Thr Thr Leu Thr Val Asp Lys Val Pro
1505 1510 1515
Gln Ala Val Glu Thr Arg Val Gln Cys Ser Thr Asn Leu Phe Lys
1520 1525 1530
Asp Cys Ser Lys Ser Tyr Ser Gly Tyr His Pro Ala His Ala Pro
1535 1540 1545
Ser Phe Leu Ala Val Asp Asp Lys Tyr Lys Ala Thr Gly Asp Leu
1550 1555 1560
Ala Val Cys Leu Gly Ile Gly Asp Ser Ala Val Thr Tyr Ser Arg
1565 1570 1575
Leu Ile Ser Leu Met Gly Phe Lys Leu Asp Val Thr Leu Asp Gly
1580 1585 1590
Tyr Cys Lys Leu Phe Ile Thr Lys Glu Glu Ala Val Lys Arg Val
1595 1600 1605
Arg Ala Trp Val Gly Phe Asp Ala Glu Gly Ala His Ala Thr Arg
1610 1615 1620
Asp Ser Ile Gly Thr Asn Phe Pro Leu Gln Leu Gly Phe Ser Thr
1625 1630 1635
Gly Ile Asp Phe Val Val Glu Ala Thr Gly Leu Phe Ala Asp Arg
1640 1645 1650
Asp Gly Tyr Ser Phe Lys Lys Ala Val Ala Lys Ala Pro Pro Gly
1655 1660 1665
Glu Gln Phe Lys His Leu Ile Pro Leu Met Thr Arg Gly Gln Arg
1670 1675 1680
Trp Asp Val Val Arg Pro Arg Ile Val Gln Met Phe Ala Asp His
1685 1690 1695
Leu Ile Asp Leu Ser Asp Cys Val Val Leu Val Thr Trp Ala Ala
1700 1705 1710
Asn Phe Glu Leu Thr Cys Leu Arg Tyr Phe Ala Lys Val Gly Arg
1715 1720 1725
Glu Ile Ser Cys Asn Val Ser Thr Lys Arg Ala Thr Ala Tyr Asn
1730 1735 1740
Ser Arg Thr Gly Tyr Tyr Gly Cys Trp Arg His Ser Val Thr Cys
1745 1750 1755
Asp Tyr Leu Tyr Asn Pro Leu Ile Val Asp Ile Gln Gln Trp Gly
1760 1765 1770
Tyr Ile Gly Ser Leu Ser Ser Asn His Asp Leu Tyr Cys Ser Val
1775 1780 1785
His Lys Gly Ala His Val Ala Ser Ser Asp Ala Ile Met Thr Arg
1790 1795 1800
Cys Leu Ala Val Tyr Asp Cys Phe Cys Asn Asn Ile Asn Trp Asn
1805 1810 1815
Val Glu Tyr Pro Ile Ile Ser Asn Glu Leu Ser Ile Asn Thr Ser
1820 1825 1830
Cys Arg Val Leu Gln Arg Val Met Leu Lys Ala Ala Met Leu Cys
1835 1840 1845
Asn Arg Tyr Thr Leu Cys Tyr Asp Ile Gly Asn Pro Lys Ala Ile
1850 1855 1860
Ala Cys Val Lys Asp Phe Asp Phe Lys Phe Tyr Asp Ala Gln Pro
1865 1870 1875
Ile Val Lys Ser Val Lys Thr Leu Leu Tyr Phe Phe Glu Ala His
1880 1885 1890
Lys Asp Ser Phe Lys Asp Gly Leu Cys Met Phe Trp Asn Cys Asn
1895 1900 1905
Val Asp Lys Tyr Pro Pro Asn Ala Val Val Cys Arg Phe Asp Thr
1910 1915 1920
Arg Val Leu Asn Asn Leu Asn Leu Pro Gly Cys Asn Gly Gly Ser
1925 1930 1935
Leu Tyr Val Asn Lys His Ala Phe His Thr Lys Pro Phe Ser Arg
1940 1945 1950
Ala Ala Phe Glu His Leu Lys Pro Met Pro Phe Phe Tyr Tyr Ser
1955 1960 1965
Asp Thr Pro Cys Val Tyr Met Asp Gly Met Asp Ala Lys Gln Val
1970 1975 1980
Asp Tyr Val Pro Leu Lys Ser Ala Thr Cys Ile Thr Arg Cys Asn
1985 1990 1995
Leu Gly Gly Ala Val Cys Leu Lys His Ala Glu Glu Tyr Arg Glu
2000 2005 2010
Tyr Leu Glu Ser Tyr Asn Thr Ala Thr Thr Ala Gly Phe Thr Phe
2015 2020 2025
Trp Val Tyr Lys Thr Phe Asp Phe Tyr Asn Leu Trp Asn Thr Phe
2030 2035 2040
Thr Lys Leu Gln Ser Leu Glu Asn Val Val Tyr Asn Leu Val Lys
2045 2050 2055
Thr Gly His Tyr Thr Gly Gln Ala Gly Glu Met Pro Cys Ala Ile
2060 2065 2070
Ile Asn Asp Lys Val Val Ala Lys Ile Asp Lys Glu Asp Val Val
2075 2080 2085
Ile Phe Ile Asn Asn Thr Thr Tyr Pro Thr Asn Val Ala Val Glu
2090 2095 2100
Leu Phe Ala Lys Arg Ser Ile Arg His His Pro Glu Leu Lys Leu
2105 2110 2115
Phe Arg Asn Leu Asn Ile Asp Val Cys Trp Lys His Val Ile Trp
2120 2125 2130
Asp Tyr Ala Arg Glu Ser Ile Phe Cys Ser Asn Thr Tyr Gly Val
2135 2140 2145
Cys Met Tyr Thr Asp Leu Lys Leu Ile Asp Lys Leu Asn Val Leu
2150 2155 2160
Phe Asp Gly Arg Asp Asn Gly Ala Leu Glu Ala Phe Lys Arg Ser
2165 2170 2175
Asn Asn Gly Val Tyr Ile Ser Thr Thr Lys Val Lys Ser Leu Ser
2180 2185 2190
Met Ile Arg Gly Pro Pro Arg Ala Glu Leu Asn Gly Val Val Val
2195 2200 2205
Asp Lys Val Gly Asp Thr Asp Cys Val Phe Tyr Phe Ala Val Arg
2210 2215 2220
Lys Glu Gly Gln Asp Val Ile Phe Ser Gln Phe Asp Ser Leu Arg
2225 2230 2235
Val Ser Ser Asn Gln Ser Pro Gln Gly Asn Leu Gly Ser Asn Glu
2240 2245 2250
Pro Gly Asn Val Gly Gly Asn Asp Ala Leu Ala Thr Ser Thr Ile
2255 2260 2265
Phe Thr Gln Ser Arg Val Ile Ser Ser Phe Thr Cys Arg Thr Asp
2270 2275 2280
Met Glu Lys Asp Phe Ile Ala Leu Asp Gln Asp Val Phe Ile Gln
2285 2290 2295
Lys Tyr Gly Leu Glu Asp Tyr Ala Phe Glu His Ile Val Tyr Gly
2300 2305 2310
Asn Phe Asn Gln Lys Ile Ile Gly Gly Leu His Leu Leu Ile Gly
2315 2320 2325
Leu Tyr Arg Arg Gln Gln Thr Ser Asn Leu Val Ile Gln Glu Phe
2330 2335 2340
Val Ser Tyr Asp Ser Ser Ile His Ser Tyr Phe Ile Thr Asp Glu
2345 2350 2355
Lys Ser Gly Gly Ser Lys Ser Val Cys Thr Val Ile Asp Ile Leu
2360 2365 2370
Leu Asp Asp Phe Val Ala Leu Val Lys Ser Leu Asn Leu Asn Cys
2375 2380 2385
Val Ser Lys Val Val Asn Val Asn Val Asp Phe Lys Asp Phe Gln
2390 2395 2400
Phe Met Leu Trp Cys Asn Asp Glu Lys Val Met Thr Phe Tyr Pro
2405 2410 2415
Arg Leu Gln Ala Ala Ser Asp Trp Lys Pro Gly Tyr Ser Met Pro
2420 2425 2430
Val Leu Tyr Lys Tyr Leu Asn Ser Pro Met Glu Arg Val Ser Leu
2435 2440 2445
Trp Asn Tyr Gly Lys Pro Val Thr Leu Pro Thr Gly Cys Met Met
2450 2455 2460
Asn Val Ala Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Thr Thr
2465 2470 2475
Thr Leu Ala Val Pro Val Asn Thr Arg Val Leu His Leu Gly Ala
2480 2485 2490
Gly Ser Glu Lys Gly Val Ala Pro Gly Ser Ala Val Leu Arg Gln
2495 2500 2505
Trp Leu Pro Ala Gly Thr Ile Leu Arg Gln Trp Leu Pro Ala Gly
2510 2515 2520
Thr Ile Leu Val His Asn Asp Leu Tyr Pro Phe Val Ser Asp Ser
2525 2530 2535
Val Ala Thr Tyr Phe Gly Asp Cys Ile Thr Leu Pro Phe Asp Cys
2540 2545 2550
Gln Trp Asp Leu Ile Ile Ser Asp Met Tyr Asp Leu Leu Leu Asp
2555 2560 2565
Ile Gly Val His Val Val Arg Cys Ser Tyr Ile His Cys His Met
2570 2575 2580
Ile Arg Asp Lys Leu Ala Leu Gly Gly Ser Val Ala Ile Lys Ile
2585 2590 2595
Thr Glu Phe Ser Trp Asn Ala Glu Leu Tyr Lys Leu Met Gly Tyr
2600 2605 2610
Phe Ala Phe Trp Thr Val Phe Cys Thr Asn Ala Asn Ala Ser Ser
2615 2620 2625
Ser Glu Gly Phe Leu Ile Gly Ile Asn Tyr Leu Gly Lys Pro Lys
2630 2635 2640
Val Glu Ile Asp Gly Asn Val Met His Ala Ile Ile Cys Phe Gly
2645 2650 2655
Glu Ile Pro Gln Phe Gly Thr Gly Val Leu Ile Ala Cys Leu Ile
2660 2665 2670
Trp Leu Asn Ser Arg Leu Ser Trp Leu Val Met Pro
2675 2680 2685
<210> 87
<211> 2678
<212> PRT
<213> EMCR Coronavirus
<220>
<221> MISC_FEATURE
<223> ORF 1B
<220>
<221> MISC_FEATURE
<222> (844)..(844)
<223> Unknown amino acid
<400> 87
Arg Ala Arg Gly Ser Ser Ala Ala Arg Leu Glu Pro Cys Asn Gly Thr
1 5 10 15
Asp Ile Asp Lys Cys Val Arg Ala Phe Asp Ile Tyr Asn Lys Asn Val
20 25 30
Ser Phe Leu Gly Lys Cys Leu Lys Met Asn Cys Val Arg Phe Lys Asn
35 40 45
Ala Asp Leu Lys Asp Gly Tyr Phe Val Ile Lys Arg Cys Thr Lys Ser
50 55 60
Val Met Glu His Glu Gln Ser Met Tyr Asn Leu Leu Asn Phe Ser Gly
65 70 75 80
Ala Leu Ala Glu His Asp Phe Phe Thr Trp Lys Asp Gly Arg Val Ile
85 90 95
Tyr Gly Asn Val Ser Arg His Asn Leu Thr Lys Tyr Thr Met Met Asp
100 105 110
Leu Val Tyr Ala Met Arg Asn Phe Asp Glu Gln Asn Cys Asp Val Leu
115 120 125
Lys Glu Val Leu Val Leu Thr Gly Cys Cys Asp Asn Ser Tyr Phe Asp
130 135 140
Ser Lys Gly Trp Tyr Asp Pro Val Glu Asn Glu Asp Ile His Arg Val
145 150 155 160
Tyr Ala Ser Leu Gly Lys Ile Val Ala Arg Ala Met Leu Lys Cys Val
165 170 175
Ala Leu Cys Asp Ala Met Val Ala Lys Gly Val Val Gly Val Leu Thr
180 185 190
Leu Asp Asn Gln Asp Leu Asn Gly Asn Phe Tyr Asp Phe Gly Asp Phe
195 200 205
Val Val Ser Leu Pro Asn Met Gly Val Pro Cys Cys Thr Ser Tyr Tyr
210 215 220
Ser Tyr Met Met Pro Ile Met Gly Leu Thr Asn Cys Leu Ala Ser Glu
225 230 235 240
Cys Phe Val Lys Ser Asp Ile Phe Gly Ser Asp Phe Lys Thr Phe Asp
245 250 255
Leu Leu Lys Tyr Asp Phe Thr Glu His Lys Glu Asn Leu Phe Asn Lys
260 265 270
Tyr Phe Lys His Trp Ser Phe Asp Tyr His Pro Asn Cys Ser Asp Cys
275 280 285
Tyr Asp Asp Met Cys Val Ile His Cys Ala Asn Phe Asn Thr Leu Phe
290 295 300
Ala Thr Thr Ile Pro Gly Thr Ala Phe Gly Pro Leu Cys Arg Lys Val
305 310 315 320
Phe Ile Asp Gly Val Pro Leu Val Thr Thr Ala Gly Tyr His Phe Lys
325 330 335
Gln Leu Gly Leu Val Trp Asn Lys Asp Val Asn Thr His Ser Val Arg
340 345 350
Leu Thr Ile Thr Glu Leu Leu Gln Phe Val Thr Asp Pro Ser Leu Ile
355 360 365
Ile Ala Ser Ser Pro Ala Leu Val Asp Gln Arg Thr Ile Cys Phe Ser
370 375 380
Val Ala Ala Leu Ser Thr Gly Leu Thr Asn Gln Val Val Lys Pro Gly
385 390 395 400
His Phe Asn Glu Glu Phe Tyr Asn Phe Leu Arg Leu Arg Gly Phe Phe
405 410 415
Asp Glu Gly Ser Glu Leu Thr Leu Lys His Phe Phe Phe Ala Gln Asn
420 425 430
Gly Asp Ala Ala Val Lys Asp Phe Asp Phe Tyr Arg Tyr Asn Lys Pro
435 440 445
Thr Ile Leu Asp Ile Cys Gln Ala Arg Val Thr Tyr Lys Ile Val Ser
450 455 460
Arg Tyr Phe Asp Ile Tyr Glu Gly Gly Cys Ile Lys Ala Cys Glu Val
465 470 475 480
Val Val Thr Asn Leu Asn Lys Ser Ala Gly Trp Pro Leu Asn Lys Phe
485 490 495
Gly Lys Ala Ser Leu Tyr Tyr Glu Ser Ile Ser Tyr Glu Glu Gln Asp
500 505 510
Ala Leu Phe Ala Leu Thr Lys Arg Asn Val Leu Pro Thr Met Thr Gln
515 520 525
Leu Asn Leu Lys Tyr Ala Ile Ser Gly Lys Glu Arg Ala Arg Thr Val
530 535 540
Gly Gly Val Ser Leu Leu Ser Thr Met Thr Thr Arg Gln Tyr His Gln
545 550 555 560
Lys His Leu Lys Ser Ile Val Asn Thr Arg Asn Ala Thr Val Val Ile
565 570 575
Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asn Asn Met Leu Arg Thr Leu
580 585 590
Ile Asp Gly Val Glu Asn Pro Met Leu Met Gly Trp Asp Tyr Pro Lys
595 600 605
Cys Asp Arg Ala Leu Pro Asn Met Ile Arg Met Ile Ser Ala Met Val
610 615 620
Leu Gly Ser Lys His Val Asn Cys Cys Thr Val Thr Asp Arg Phe Tyr
625 630 635 640
Arg Leu Gly Asn Glu Leu Ala Gln Val Leu Thr Glu Val Val Tyr Ser
645 650 655
Asn Gly Gly Phe Tyr Phe Lys Pro Gly Gly Thr Thr Ser Gly Asp Ala
660 665 670
Ser Thr Ala Tyr Ala Asn Ser Ile Phe Asn Ile Phe Gln Ala Val Ser
675 680 685
Ser Asn Ile Asn Arg Leu Leu Ser Val Pro Ser Asp Ser Cys Asn Asn
690 695 700
Val Asn Val Arg Asp Leu Gln Arg Arg Leu Tyr Asp Asn Cys Tyr Arg
705 710 715 720
Leu Thr Ser Val Glu Glu Ser Phe Ile Asp Asp Tyr Tyr Gly Tyr Leu
725 730 735
Arg Lys His Phe Ser Met Met Ile Leu Ser Asp Asp Gly Val Val Cys
740 745 750
Tyr Asn Lys Asp Tyr Ala Glu Leu Gly Tyr Ile Ala Asp Ile Ser Ala
755 760 765
Phe Lys Ala Thr Leu Tyr Tyr Gln Asn Asn Val Phe Met Ser Thr Ser
770 775 780
Lys Cys Trp Val Glu Glu Asp Leu Thr Lys Gly Pro His Glu Phe Cys
785 790 795 800
Ser Gln His Thr Met Gln Ile Val Asp Lys Asp Gly Thr Tyr Tyr Leu
805 810 815
Pro Tyr Pro Asp Pro Ser Arg Ile Leu Ser Ala Gly Val Phe Val Asp
820 825 830
Asp Val Val Lys Thr Asp Ala Val Val Leu Leu Xaa Arg Tyr Val Ser
835 840 845
Leu Ala Ile Asp Ala Tyr Pro Leu Ser Lys His Pro Asn Ser Glu Tyr
850 855 860
Arg Lys Val Phe Tyr Val Leu Leu Asp Trp Val Lys His Leu Asn Lys
865 870 875 880
Asn Leu Asn Glu Gly Val Leu Glu Ser Phe Ser Val Thr Leu Leu Asp
885 890 895
Asn Gln Glu Asp Lys Phe Trp Cys Glu Asp Phe Tyr Ala Ser Met Tyr
900 905 910
Glu Asn Ser Thr Ile Leu Gln Ala Ala Gly Leu Cys Val Val Cys Gly
915 920 925
Ser Gln Thr Val Leu Arg Cys Gly Asp Cys Leu Arg Lys Pro Met Leu
930 935 940
Cys Thr Lys Cys Ala Tyr Asp His Val Phe Gly Thr Asp His Lys Phe
945 950 955 960
Ile Leu Ala Ile Thr Pro Tyr Val Cys Asn Ala Ser Gly Cys Gly Val
965 970 975
Ser Asp Val Lys Lys Leu Tyr Leu Gly Gly Leu Asn Tyr Tyr Cys Thr
980 985 990
Asn His Lys Pro Gln Leu Ser Phe Pro Leu Cys Ser Ala Gly Asn Ile
995 1000 1005
Phe Gly Leu Tyr Lys Asn Ser Ala Thr Gly Ser Leu Asp Val Glu
1010 1015 1020
Val Phe Asn Arg Leu Ala Thr Ser Asp Trp Thr Asp Val Arg Asp
1025 1030 1035
Tyr Lys Leu Ala Asn Asp Val Lys Asp Thr Leu Arg Leu Phe Ala
1040 1045 1050
Ala Glu Thr Ile Lys Ala Lys Glu Glu Ser Val Lys Ser Ser Tyr
1055 1060 1065
Ala Phe Ala Thr Leu Lys Glu Val Val Gly Pro Lys Glu Leu Leu
1070 1075 1080
Leu Ser Trp Glu Ser Gly Lys Val Lys Pro Pro Leu Asn Arg Asn
1085 1090 1095
Ser Val Phe Thr Cys Phe Gln Ile Ser Lys Asp Ser Lys Phe Gln
1100 1105 1110
Ile Gly Glu Phe Ile Phe Glu Lys Val Glu Tyr Gly Ser Asp Thr
1115 1120 1125
Val Thr Tyr Lys Ser Thr Val Thr Thr Lys Leu Val Pro Gly Met
1130 1135 1140
Ile Phe Val Leu Thr Ser His Asn Val Gln Pro Leu Arg Ala Pro
1145 1150 1155
Thr Ile Ala Asn Gln Glu Lys Tyr Ser Ser Ile Tyr Lys Leu His
1160 1165 1170
Pro Ala Phe Asn Val Ser Asp Ala Tyr Ala Asn Leu Val Pro Tyr
1175 1180 1185
Tyr Gln Leu Ile Gly Lys Gln Lys Ile Thr Thr Ile Gln Gly Pro
1190 1195 1200
Pro Gly Ser Gly Lys Ser His Cys Ser Ile Gly Leu Gly Leu Tyr
1205 1210 1215
Tyr Pro Gly Ala Arg Ile Val Phe Val Ala Cys Ala His Ala Ala
1220 1225 1230
Val Asp Ser Leu Cys Ala Lys Ala Met Thr Val Tyr Ser Ile Asp
1235 1240 1245
Lys Cys Thr Arg Ile Ile Pro Ala Arg Ala Arg Val Glu Cys Tyr
1250 1255 1260
Ser Gly Phe Lys Pro Asn Asn Thr Ser Ala Gln Tyr Ile Phe Ser
1265 1270 1275
Thr Val Asn Ala Leu Pro Glu Cys Asn Ala Asp Ile Val Val Val
1280 1285 1290
Asp Glu Val Ser Met Cys Thr Asn Tyr Asp Leu Ser Val Ile Asn
1295 1300 1305
Gln Arg Leu Ser Tyr Lys His Ile Val Tyr Val Gly Asp Pro Gln
1310 1315 1320
Gln Leu Pro Ala Pro Arg Val Met Ile Thr Lys Gly Val Met Glu
1325 1330 1335
Pro Val Asp Tyr Asn Val Val Thr Gln Arg Met Cys Ala Ile Gly
1340 1345 1350
Pro Asp Val Phe Leu His Lys Cys Tyr Arg Cys Pro Ala Glu Ile
1355 1360 1365
Val Asn Thr Val Ser Glu Leu Val Tyr Glu Asn Lys Phe Val Pro
1370 1375 1380
Val Lys Pro Ala Ser Lys Gln Cys Phe Lys Ile Phe Phe Lys Gly
1385 1390 1395
Asn Val Gln Val Asp Asn Gly Ser Ser Ile Asn Arg Lys Gln Leu
1400 1405 1410
Glu Ile Val Lys Leu Phe Leu Val Lys Asn Pro Ser Trp Ser Lys
1415 1420 1425
Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn Tyr Val Ala Ser
1430 1435 1440
Arg Phe Leu Gly Leu Gln Ile Gln Thr Val Asp Ser Ser Gln Gly
1445 1450 1455
Ser Glu Tyr Asp Tyr Val Ile Tyr Ala Gln Thr Ser Asp Thr Ala
1460 1465 1470
His Ala Cys Asn Val Asn Arg Phe Asn Val Ala Ile Thr Arg Ala
1475 1480 1485
Lys Lys Gly Ile Phe Cys Val Met Cys Asp Lys Thr Leu Phe Asp
1490 1495 1500
Ser Leu Lys Phe Phe Glu Ile Lys His Ala Asp Leu His Ser Ser
1505 1510 1515
Gln Val Cys Gly Leu Phe Lys Asn Cys Thr Arg Thr Pro Leu Asn
1520 1525 1530
Leu Pro Pro Thr His Ala His Thr Phe Leu Ser Leu Ser Asp Gln
1535 1540 1545
Phe Lys Thr Thr Gly Asp Leu Ala Val Gln Ile Gly Ser Asn Asn
1550 1555 1560
Val Cys Thr Tyr Glu His Val Ile Ser Phe Met Gly Phe Arg Phe
1565 1570 1575
Asp Ile Ser Ile Pro Gly Ser His Ser Leu Phe Cys Thr Arg Asp
1580 1585 1590
Phe Ala Ile Arg Asn Val Arg Gly Trp Leu Gly Met Asp Val Glu
1595 1600 1605
Ser Ala His Val Cys Gly Asp Asn Ile Gly Thr Asn Val Pro Leu
1610 1615 1620
Gln Val Gly Phe Ser Asn Gly Val Asn Phe Val Val Gln Thr Glu
1625 1630 1635
Gly Cys Val Ser Thr Asn Phe Gly Asp Val Ile Lys Pro Val Cys
1640 1645 1650
Ala Lys Ser Pro Pro Gly Glu Gln Phe Arg His Leu Val Pro Phe
1655 1660 1665
Leu Arg Lys Gly Gln Pro Trp Leu Ile Val Arg Arg Arg Ile Val
1670 1675 1680
Gln Met Ile Ser Asp Tyr Leu Ser Asn Leu Ser Asp Ile Leu Val
1685 1690 1695
Phe Val Leu Trp Ala Gly Ser Leu Glu Leu Thr Thr Met Arg Tyr
1700 1705 1710
Phe Val Lys Ile Gly Pro Ile Lys Tyr Cys Tyr Cys Gly Asn Ser
1715 1720 1725
Ala Thr Cys Tyr Asn Ser Val Ser Asn Glu Tyr Cys Cys Phe Lys
1730 1735 1740
His Ala Leu Gly Cys Asp Tyr Val Tyr Asn Pro Tyr Ala Phe Asp
1745 1750 1755
Ile Gln Gln Trp Gly Tyr Val Gly Ser Leu Ser Gln Asn His His
1760 1765 1770
Thr Phe Cys Asn Ile His Arg Asn Glu His Asp Ala Ser Gly Asp
1775 1780 1785
Ala Val Met Thr Arg Cys Leu Ala Val His Asp Cys Phe Val Lys
1790 1795 1800
Asn Val Asp Trp Thr Val Thr Tyr Pro Phe Ile Ala Asn Glu Lys
1805 1810 1815
Phe Ile Asn Gly Cys Gly Arg Asn Val Gln Gly His Val Val Arg
1820 1825 1830
Ala Ala Leu Lys Leu Tyr Lys Pro Ser Val Ile His Asp Ile Gly
1835 1840 1845
Asn Pro Lys Gly Val Arg Cys Ala Val Thr Asp Ala Lys Trp Tyr
1850 1855 1860
Cys Tyr Asp Lys Gln Pro Val Asn Ser Asn Val Lys Leu Leu Asp
1865 1870 1875
Tyr Asp Tyr Ala Thr His Gly Gln Leu Asp Gly Leu Cys Leu Phe
1880 1885 1890
Trp Asn Cys Asn Val Asp Met Tyr Pro Glu Phe Ser Ile Val Cys
1895 1900 1905
Arg Phe Asp Thr Arg Thr Arg Ser Val Phe Asn Leu Glu Gly Val
1910 1915 1920
Asn Gly Gly Ser Leu Tyr Val Asn Lys His Ala Phe His Thr Pro
1925 1930 1935
Ala Tyr Asp Lys Arg Ala Phe Val Lys Leu Lys Pro Met Pro Phe
1940 1945 1950
Phe Tyr Phe Asp Asp Ser Asp Cys Asp Val Val Gln Glu Gln Val
1955 1960 1965
Asn Tyr Val Pro Leu Arg Ala Ser Ser Cys Val Thr Arg Cys Asn
1970 1975 1980
Ile Gly Gly Ala Val Cys Ser Lys His Ala Asn Leu Tyr Gln Lys
1985 1990 1995
Tyr Val Glu Ala Tyr Asn Thr Phe Thr Gln Ala Gly Phe Asn Ile
2000 2005 2010
Trp Val Pro His Ser Phe Asp Val Tyr Asn Leu Trp Gln Ile Phe
2015 2020 2025
Ile Glu Thr Asn Leu Gln Ser Leu Glu Asn Ile Ala Phe Asn Val
2030 2035 2040
Val Lys Lys Gly Cys Phe Thr Gly Val Asp Gly Glu Leu Pro Val
2045 2050 2055
Ala Val Val Asn Asp Lys Val Phe Val Arg Tyr Gly Asp Val Asp
2060 2065 2070
Asn Leu Val Phe Thr Asn Lys Thr Thr Leu Pro Thr Asn Val Ala
2075 2080 2085
Phe Glu Leu Phe Ala Lys Arg Lys Met Gly Leu Thr Pro Pro Leu
2090 2095 2100
Ser Ile Leu Lys Asn Leu Gly Val Val Ala Thr Tyr Lys Phe Val
2105 2110 2115
Leu Trp Asp Tyr Glu Ala Glu Arg Pro Phe Thr Ser Tyr Thr Lys
2120 2125 2130
Ser Val Cys Lys Tyr Thr Asp Phe Asn Glu Asp Val Cys Val Cys
2135 2140 2145
Phe Asp Asn Ser Ile Gln Gly Ser Tyr Glu Arg Phe Thr Leu Thr
2150 2155 2160
Thr Asn Ala Val Leu Phe Ser Thr Val Val Ile Lys Asn Leu Thr
2165 2170 2175
Pro Ile Lys Leu Asn Phe Gly Met Leu Asn Gly Met Pro Val Ser
2180 2185 2190
Ser Ile Lys Ser Asp Lys Gly Val Glu Lys Leu Val Asn Trp Tyr
2195 2200 2205
Thr Tyr Val Arg Lys Asn Gly Gln Phe Gln Asp His Tyr Asp Gly
2210 2215 2220
Phe Tyr Thr Gln Gly Arg Asn Leu Ser Asp Phe Thr Pro Arg Ser
2225 2230 2235
Asp Met Glu Tyr Asp Phe Leu Asn Met Asp Met Gly Val Phe Ile
2240 2245 2250
Asn Lys Tyr Gly Leu Glu Asp Phe Asn Phe Glu His Val Val Tyr
2255 2260 2265
Gly Asp Val Ser Lys Thr Thr Leu Gly Gly Leu His Leu Leu Ile
2270 2275 2280
Ser Gln Phe Arg Leu Ser Lys Met Gly Val Leu Lys Ala Asp Asp
2285 2290 2295
Phe Val Thr Ala Ser Asp Thr Thr Leu Arg Cys Cys Thr Val Thr
2300 2305 2310
Tyr Leu Asn Glu Leu Ser Ser Lys Val Val Cys Thr Tyr Met Asp
2315 2320 2325
Leu Leu Leu Asp Asp Phe Val Thr Ile Leu Lys Ser Leu Asp Leu
2330 2335 2340
Gly Val Ile Ser Lys Val His Glu Val Ile Ile Asp Asn Lys Pro
2345 2350 2355
Tyr Arg Trp Met Leu Trp Cys Lys Asp Asn His Leu Ser Thr Phe
2360 2365 2370
Tyr Pro Gln Leu Gln Ser Ala Glu Trp Lys Cys Gly Tyr Ala Met
2375 2380 2385
Pro Gln Ile Tyr Lys Leu Gln Arg Met Cys Leu Glu Pro Cys Asn
2390 2395 2400
Leu Tyr Asn Tyr Gly Ala Gly Ile Lys Leu Pro Ser Gly Ile Met
2405 2410 2415
Leu Asn Val Val Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Ser
2420 2425 2430
Thr Thr Met Cys Val Pro His Asn Met Arg Val Leu His Tyr Gly
2435 2440 2445
Ala Gly Ser Asp Lys Gly Val Ala Pro Gly Thr Thr Val Leu Lys
2450 2455 2460
Arg Trp Leu Pro Pro Asp Ala Ile Ile Ile Asp Asn Asp Ile Asn
2465 2470 2475
Asp Tyr Val Ser Asp Ala Asp Phe Ser Ile Thr Gly Asp Cys Ala
2480 2485 2490
Thr Val Tyr Leu Glu Asp Lys Phe Asp Leu Leu Ile Ser Asp Met
2495 2500 2505
Tyr Asp Gly Arg Ile Lys Phe Cys Asp Gly Glu Asn Val Ser Lys
2510 2515 2520
Asp Gly Phe Phe Thr Tyr Leu Asn Gly Val Ile Arg Glu Lys Leu
2525 2530 2535
Ala Ile Gly Gly Ser Val Ala Ile Lys Ile Thr Glu Tyr Ser Trp
2540 2545 2550
Asn Lys Tyr Leu Tyr Glu Leu Ile Gln Arg Phe Ala Phe Trp Thr
2555 2560 2565
Leu Phe Cys Thr Ser Val Asn Thr Ser Ser Ser Glu Ala Phe Leu
2570 2575 2580
Ile Gly Ile Asn Tyr Leu Gly Asp Phe Ile Gln Gly Pro Phe Ile
2585 2590 2595
Ala Gly Asn Thr Val His Ala Asn Tyr Ile Phe Trp Arg Asn Ser
2600 2605 2610
Thr Ile Met Ser Leu Ser Tyr Asn Ser Val Leu Asp Leu Ser Lys
2615 2620 2625
Phe Glu Cys Lys His Lys Ala Thr Val Val Val Thr Leu Lys Asp
2630 2635 2640
Ser Asp Val Asn Asp Met Val Leu Ser Leu Ile Lys Ser Gly Arg
2645 2650 2655
Leu Leu Leu Arg Asn Ser Gly Arg Phe Gly Gly Phe Ser Asn His
2660 2665 2670
Leu Val Ser Thr Lys
2675
<210> 88
<211> 2733
<212> PRT
<213> murine hepatitis virus
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 88
Leu Phe Leu Cys Arg His Arg Leu Pro Val Ser Val Lys Arg His Glu
1 5 10 15
Leu Phe Lys Arg Val Arg Gly Thr Ser Val Asn Ala Arg Leu Val Pro
20 25 30
Cys Ala Ser Gly Leu Asp Thr Asp Val Gln Leu Arg Ala Phe Asp Ile
35 40 45
Cys Asn Ala Asn Arg Ala Gly Ile Gly Leu Tyr Tyr Lys Val Asn Cys
50 55 60
Cys Arg Phe Gln Arg Ala Asp Glu Asp Gly Asn Thr Leu Asp Lys Phe
65 70 75 80
Phe Val Ile Lys Arg Thr Asn Leu Glu Val Tyr Asn Lys Glu Lys Glu
85 90 95
Cys Tyr Glu Leu Thr Lys Glu Cys Gly Val Val Ala Glu His Glu Phe
100 105 110
Phe Thr Phe Asp Val Glu Gly Ser Arg Val Pro His Ile Val Arg Lys
115 120 125
Asp Leu Ser Lys Tyr Thr Met Leu Asp Leu Cys Tyr Ala Leu Arg His
130 135 140
Phe Asp Arg Asn Asp Cys Ser Thr Leu Lys Glu Ile Leu Leu Thr Tyr
145 150 155 160
Ala Glu Cys Asp Glu Ser Tyr Phe Gln Lys Lys Asp Trp Tyr Asp Phe
165 170 175
Val Glu Asn Ser Asp Ile Ile Asn Val Tyr Lys Lys Leu Gly Pro Ile
180 185 190
Phe Asn Arg Ala Leu Leu Asn Thr Ala Lys Phe Ala Asp Thr Leu Val
195 200 205
Glu Ala Gly Leu Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Tyr
210 215 220
Gly Gln Trp Tyr Asp Phe Gly Asp Phe Val Lys Thr Val Pro Gly Cys
225 230 235 240
Gly Val Ala Val Ala Asp Ser Tyr Tyr Ser Tyr Met Met Pro Met Leu
245 250 255
Thr Met Cys His Ala Leu Asp Ser Glu Leu Phe Ile Asn Gly Thr Tyr
260 265 270
Arg Glu Phe Asp Leu Val Gln Tyr Asp Phe Thr Asp Phe Lys Leu Glu
275 280 285
Leu Phe Asn Lys Tyr Phe Lys Tyr Trp Ser Met Thr Tyr His Pro Asn
290 295 300
Thr Cys Glu Cys Glu Asp Asp Arg Cys Ile Ile His Cys Ala Asn Phe
305 310 315 320
Asn Ile Leu Phe Ser Met Val Leu Pro Lys Thr Cys Phe Gly Pro Leu
325 330 335
Val Arg Gln Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Ile Gly
340 345 350
Tyr His Tyr Lys Glu Leu Gly Val Val Met Asn Met Asp Val Asp Thr
355 360 365
His Arg Tyr Arg Leu Ser Leu Lys Asp Leu Leu Leu Tyr Ala Ala Asp
370 375 380
Pro Ala Leu His Val Ala Ser Ala Ser Ala Leu Leu Asp Leu Arg Thr
385 390 395 400
Cys Cys Phe Ser Val Ala Ala Ile Thr Ser Gly Val Lys Phe Gln Thr
405 410 415
Val Lys Pro Gly Asn Phe Asn Gln Asp Phe Tyr Glu Phe Ile Leu Ser
420 425 430
Lys Gly Leu Leu Lys Glu Gly Ser Ser Val Asp Leu Lys His Phe Phe
435 440 445
Phe Thr Gln Asp Gly Asn Ala Ala Ile Thr Asp Tyr Asn Tyr Tyr Lys
450 455 460
Tyr Asn Leu Pro Thr Met Val Asp Ile Lys Gln Leu Leu Phe Val Leu
465 470 475 480
Glu Val Val Asn Lys Tyr Phe Glu Ile Tyr Asp Gly Gly Cys Ile Pro
485 490 495
Ala Thr Gln Val Ile Val Asn Asn Tyr Asp Lys Ser Ala Gly Tyr Pro
500 505 510
Phe Asn Lys Phe Gly Lys Ala Arg Leu Tyr Tyr Glu Ala Leu Ser Phe
515 520 525
Glu Glu Gln Asp Glu Val Tyr Ala Tyr Thr Lys Arg Asn Val Leu Pro
530 535 540
Thr Leu Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg
545 550 555 560
Ala Arg Thr Val Ala Gly Val Ser Ile Leu Ser Thr Met Thr Gly Arg
565 570 575
Met Phe His Gln Lys Cys Leu Lys Ser Ile Ala Ala Thr Arg Gly Val
580 585 590
Pro Val Val Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asp Met
595 600 605
Leu Arg Arg Leu Ile Lys Asp Val Asp Ser Pro Val Leu Met Gly Trp
610 615 620
Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Ile Leu Arg Ile Ile
625 630 635 640
Ser Ser Leu Val Leu Ala Arg Lys His Asp Ser Cys Cys Ser His Thr
645 650 655
Asp Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu Ser Glu
660 665 670
Ile Val Met Cys Gly Gly Cys Tyr Tyr Val Lys Pro Gly Gly Thr Ser
675 680 685
Ser Gly Asp Ala Thr Thr Ala Phe Ala Asn Ser Val Phe Asn Ile Cys
690 695 700
Gln Ala Val Ser Ala Asn Val Cys Ser Leu Met Ala Cys Asn Gly His
705 710 715 720
Lys Ile Glu Asp Leu Ser Ile Arg Glu Leu Gln Lys Arg Leu Tyr Ser
725 730 735
Asn Val Tyr Arg Ala Asp His Val Asp Pro Ala Phe Val Asn Glu Tyr
740 745 750
Tyr Glu Phe Leu Asn Lys His Phe Ser Met Met Ile Leu Ser Asp Asp
755 760 765
Gly Val Val Cys Tyr Asn Ser Glu Phe Ala Ser Lys Gly Tyr Ile Ala
770 775 780
Asn Ile Ser Ala Phe Gln Gln Val Leu Tyr Tyr Gln Asn Asn Val Phe
785 790 795 800
Met Ser Glu Ala Lys Cys Trp Val Glu Thr Asp Ile Glu Lys Gly Pro
805 810 815
His Glu Phe Cys Ser Gln His Thr Met Leu Val Lys Met Asp Gly Asp
820 825 830
Glu Val Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly Ala Gly
835 840 845
Cys Phe Val Asp Asp Leu Leu Lys Thr Asp Ser Val Leu Leu Ile Glu
850 855 860
Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Val Tyr His Glu
865 870 875 880
Asn Pro Glu Tyr Gln Asn Val Phe Arg Val Tyr Leu Glu Tyr Ile Lys
885 890 895
Lys Leu Tyr Asn Asp Leu Gly Asn Gln Ile Leu Asp Ser Tyr Ser Val
900 905 910
Ile Leu Ser Thr Cys Asp Gly Gln Lys Phe Thr Asp Glu Thr Phe Tyr
915 920 925
Lys Asn Met Tyr Leu Arg Ser Ala Val Met Gln Ser Val Gly Ala Cys
930 935 940
Val Val Cys Ser Ser Gln Thr Ser Leu Arg Cys Gly Ser Cys Ile Arg
945 950 955 960
Lys Pro Leu Leu Cys Cys Lys Cys Ala Tyr Asp His Val Met Ser Thr
965 970 975
Asp His Lys Tyr Val Leu Ser Val Ser Pro Tyr Val Cys Asn Ser Pro
980 985 990
Gly Cys Asp Val Asn Asp Val Thr Lys Leu Tyr Leu Gly Gly Met Ser
995 1000 1005
Tyr Tyr Cys Glu Asp His Lys Pro Gln Tyr Ser Phe Lys Leu Val
1010 1015 1020
Met Asn Gly Met Val Phe Gly Leu Tyr Lys Gln Ser Cys Thr Gly
1025 1030 1035
Ser Pro Tyr Ile Glu Asp Phe Asn Lys Ile Ala Ser Cys Lys Trp
1040 1045 1050
Thr Glu Val Asp Asp Tyr Val Leu Ala Asn Glu Cys Thr Glu Arg
1055 1060 1065
Leu Lys Leu Phe Ala Ala Glu Thr Gln Lys Ala Thr Glu Glu Ser
1070 1075 1080
Phe Lys Gln Cys Tyr Ala Ser Ala Thr Ile Arg Glu Ile Val Ser
1085 1090 1095
Asp Arg Glu Leu Ile Leu Ser Trp Glu Ile Gly Lys Val Arg Pro
1100 1105 1110
Pro Leu Asn Lys Asn Tyr Val Phe Thr Gly Tyr His Phe Thr Ser
1115 1120 1125
Asn Gly Lys Thr Val Leu Gly Glu Tyr Val Phe Asp Lys Ser Glu
1130 1135 1140
Leu Thr Asn Gly Val Tyr Tyr Arg Ala Thr Thr Thr Tyr Lys Leu
1145 1150 1155
Ser Val Gly Asp Val Phe Ile Leu Thr Ser His Ala Val Ser Ser
1160 1165 1170
Leu Ser Ala Pro Thr Leu Val Pro Gln Glu Asn Tyr Thr Ser Ile
1175 1180 1185
Arg Phe Ala Ser Val Tyr Ser Val Pro Glu Thr Phe Gln Asn Asn
1190 1195 1200
Val Pro Asn Tyr Gln His Ile Gly Met Lys Arg Tyr Cys Thr Val
1205 1210 1215
Gln Gly Pro Pro Gly Thr Gly Lys Ser His Leu Ala Ile Gly Leu
1220 1225 1230
Ala Val Tyr Tyr Cys Thr Ala Arg Val Val Tyr Thr Ala Ala Ser
1235 1240 1245
His Ala Ala Val Asp Ala Leu Cys Glu Lys Ala Tyr Lys Phe Leu
1250 1255 1260
Asn Ile Asn Asp Cys Thr Arg Ile Val Pro Ala Lys Val Arg Val
1265 1270 1275
Asp Cys Tyr Asp Lys Phe Lys Val Asn Asp Thr Thr Arg Lys Tyr
1280 1285 1290
Val Phe Thr Thr Ile Asn Ala Leu Pro Glu Leu Val Thr Asp Ile
1295 1300 1305
Ile Val Val Asp Glu Val Ser Met Leu Thr Asn Tyr Glu Leu Ser
1310 1315 1320
Val Ile Asn Ser Arg Val Arg Ala Lys His Tyr Val Tyr Ile Gly
1325 1330 1335
Asp Pro Ala Gln Leu Pro Ala Pro Arg Val Leu Leu Asn Lys Gly
1340 1345 1350
Thr Leu Glu Pro Arg Tyr Phe Asn Ser Val Thr Lys Leu Met Cys
1355 1360 1365
Cys Leu Gly Pro Asp Ile Phe Leu Gly Thr Cys Tyr Arg Cys Pro
1370 1375 1380
Lys Glu Ile Val Asp Thr Val Ser Ala Leu Val Tyr His Asn Lys
1385 1390 1395
Leu Lys Ala Lys Asn Asp Asn Ser Ser Met Cys Phe Lys Val Tyr
1400 1405 1410
Tyr Lys Gly Gln Thr Thr His Glu Ser Ser Ser Ala Val Asn Met
1415 1420 1425
Gln Gln Ile Tyr Leu Ile Ser Lys Phe Leu Lys Ala Asn Pro Ser
1430 1435 1440
Trp Ser Asn Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn Tyr
1445 1450 1455
Val Ala Lys Arg Val Leu Gly Leu Gln Thr Gln Thr Val Asp Ser
1460 1465 1470
Ala Gln Gly Ser Glu Tyr Asp Phe Val Ile Tyr Ser Gln Thr Ala
1475 1480 1485
Glu Thr Ala His Ser Val Asn Val Asn Arg Phe Asn Val Ala Ile
1490 1495 1500
Thr Arg Ala Lys Lys Gly Ile Leu Cys Val Met Ser Ser Met Gln
1505 1510 1515
Leu Phe Glu Ser Leu Asn Phe Ser Thr Leu Thr Leu Asp Lys Ile
1520 1525 1530
Asn Asn Pro Arg Leu Gln Cys Thr Thr Asn Leu Phe Lys Asp Cys
1535 1540 1545
Ser Arg Ser Tyr Ala Gly Tyr His Pro Ala His Ala Pro Ser Phe
1550 1555 1560
Leu Ala Val Asp Asp Lys Tyr Lys Val Gly Gly Asp Leu Ala Val
1565 1570 1575
Cys Leu Asn Val Ala Asp Ser Ala Val Thr Tyr Ser Arg Leu Ile
1580 1585 1590
Ser Leu Met Gly Phe Lys Leu Asp Leu Thr Leu Asp Gly Tyr Cys
1595 1600 1605
Lys Leu Phe Ile Thr Arg Asp Glu Ala Ile Arg Arg Val Arg Ala
1610 1615 1620
Trp Val Gly Phe Asp Ala Glu Gly Ala His Ala Thr Arg Asp Ser
1625 1630 1635
Ile Gly Thr Asn Phe Pro Leu Gln Leu Gly Phe Ser Thr Gly Ile
1640 1645 1650
Asp Phe Val Val Glu Ala Thr Gly Met Phe Ala Glu Arg Asp Gly
1655 1660 1665
Tyr Val Phe Lys Lys Ala Val Ala Arg Ala Pro Pro Gly Glu Gln
1670 1675 1680
Phe Lys His Leu Val Pro Leu Met Ser Arg Gly Gln Lys Trp Asp
1685 1690 1695
Val Val Arg Ile Arg Ile Val Gln Met Leu Ser Asp His Leu Val
1700 1705 1710
Asp Leu Ala Asp Ser Val Val Leu Val Thr Trp Ala Ala Ser Phe
1715 1720 1725
Glu Leu Thr Cys Leu Arg Tyr Phe Ala Lys Val Gly Lys Glu Val
1730 1735 1740
Val Cys Ser Val Cys Asn Lys Arg Ala Thr Cys Phe Asn Ser Arg
1745 1750 1755
Thr Gly Tyr Tyr Gly Cys Trp Arg His Ser Tyr Ser Cys Asp Tyr
1760 1765 1770
Leu Tyr Asn Pro Leu Ile Val Asp Ile Gln Gln Trp Gly Tyr Thr
1775 1780 1785
Gly Ser Leu Thr Ser Asn His Asp Leu Ile Cys Ser Val His Lys
1790 1795 1800
Gly Ala His Val Ala Ser Ser Asp Ala Ile Met Thr Arg Cys Leu
1805 1810 1815
Ala Val His Asp Cys Phe Cys Lys Ser Val Asn Trp Ser Leu Glu
1820 1825 1830
Tyr Pro Ile Ile Ser Asn Glu Val Ser Val Asn Thr Ser Cys Arg
1835 1840 1845
Leu Leu Gln Arg Val Met Phe Arg Ala Ala Met Leu Cys Asn Arg
1850 1855 1860
Tyr Asp Val Cys Tyr Asp Ile Gly Asn Pro Lys Gly Leu Ala Cys
1865 1870 1875
Val Lys Gly Tyr Asp Phe Lys Phe Tyr Asp Ala Ser Pro Val Val
1880 1885 1890
Lys Ser Val Lys Gln Phe Val Tyr Lys Tyr Glu Ala His Lys Asp
1895 1900 1905
Gln Phe Leu Asp Gly Leu Cys Met Phe Trp Asn Cys Asn Val Asp
1910 1915 1920
Lys Tyr Pro Ala Asn Ala Val Val Cys Arg Phe Asp Thr Arg Val
1925 1930 1935
Leu Asn Lys Leu Asn Leu Pro Gly Cys Asn Gly Gly Ser Leu Tyr
1940 1945 1950
Val Asn Lys His Ala Phe His Thr Ser Pro Phe Thr Arg Ala Ala
1955 1960 1965
Phe Glu Asn Leu Lys Pro Met Pro Phe Phe Tyr Tyr Ser Asp Thr
1970 1975 1980
Pro Cys Val Tyr Met Glu Gly Met Glu Ser Lys Gln Val Asp Tyr
1985 1990 1995
Val Pro Leu Arg Ser Ala Thr Cys Ile Thr Arg Cys Asn Leu Gly
2000 2005 2010
Gly Ala Val Cys Leu Lys His Ala Glu Asp Tyr Arg Glu Tyr Leu
2015 2020 2025
Glu Ser Tyr Asn Thr Ala Thr Thr Ala Gly Phe Thr Phe Trp Val
2030 2035 2040
Tyr Lys Thr Phe Asp Phe Tyr Asn Leu Trp Asn Thr Phe Thr Arg
2045 2050 2055
Leu Gln Ser Leu Glu Asn Val Val Tyr Asn Leu Val Asn Ala Gly
2060 2065 2070
His Phe Asp Gly Arg Ala Gly Glu Leu Pro Cys Ala Val Ile Gly
2075 2080 2085
Glu Lys Val Ile Ala Lys Ile Gln Asn Glu Asp Val Val Val Phe
2090 2095 2100
Lys Asn Asn Thr Pro Phe Pro Thr Asn Val Ala Val Glu Leu Phe
2105 2110 2115
Ala Lys Arg Ser Ile Arg Pro His Pro Glu Leu Lys Leu Phe Arg
2120 2125 2130
Asn Leu Asn Ile Asp Val Cys Trp Ser His Val Leu Trp Asp Tyr
2135 2140 2145
Ala Lys Asp Ser Val Phe Cys Ser Ser Thr Tyr Lys Val Cys Lys
2150 2155 2160
Tyr Thr Asp Leu Gln Cys Ile Glu Ser Leu Asn Val Leu Phe Asp
2165 2170 2175
Gly Arg Asp Asn Gly Ala Leu Glu Ala Phe Lys Lys Cys Arg Asp
2180 2185 2190
Gly Val Tyr Ile Asn Thr Thr Lys Ile Lys Ser Leu Ser Met Ile
2195 2200 2205
Lys Gly Pro Gln Arg Ala Asp Leu Asn Gly Val Val Val Glu Lys
2210 2215 2220
Val Gly Asp Ser Asp Val Glu Phe Trp Phe Ala Met Arg Arg Asp
2225 2230 2235
Gly Asp Asp Val Ile Phe Ser Arg Thr Gly Ser Leu Glu Pro Ser
2240 2245 2250
His Tyr Arg Ser Pro Gln Gly Asn Pro Gly Gly Asn Arg Val Gly
2255 2260 2265
Asp Leu Ser Gly Asn Glu Ala Leu Ala Arg Gly Thr Ile Phe Thr
2270 2275 2280
Gln Ser Arg Phe Leu Ser Ser Phe Ala Pro Arg Ser Glu Met Glu
2285 2290 2295
Lys Asp Phe Met Asp Leu Asp Glu Asp Val Phe Ile Ala Lys Tyr
2300 2305 2310
Ser Leu Gln Asp Tyr Ala Phe Glu His Val Val Tyr Gly Ser Phe
2315 2320 2325
Asn Gln Lys Ile Ile Gly Gly Leu His Leu Leu Ile Gly Leu Ala
2330 2335 2340
Arg Arg Gln Gln Lys Ser Asn Leu Val Ile Gln Glu Phe Val Pro
2345 2350 2355
Tyr Asp Ser Ser Ile His Ser Tyr Phe Ile Thr Asp Glu Asn Ser
2360 2365 2370
Gly Ser Ser Lys Ser Val Cys Thr Val Ile Asp Leu Leu Leu Asp
2375 2380 2385
Asp Phe Val Asp Ile Val Lys Ser Leu Asn Leu Asn Cys Val Ser
2390 2395 2400
Lys Val Val Asn Val Asn Val Asp Phe Lys Asp Phe Gln Phe Met
2405 2410 2415
Leu Trp Cys Asn Glu Glu Lys Val Met Thr Phe Tyr Pro Arg Leu
2420 2425 2430
Gln Ala Ala Ala Asp Trp Lys Pro Gly Tyr Val Met Pro Val Leu
2435 2440 2445
Tyr Lys Tyr Leu Glu Ser Pro Leu Glu Arg Val Asn Leu Trp Asn
2450 2455 2460
Tyr Gly Lys Pro Ile Thr Leu Pro Thr Gly Cys Leu Met Asn Val
2465 2470 2475
Ala Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Thr Thr Thr Leu
2480 2485 2490
Ala Val Pro Ala Asn Met Arg Val Leu His Leu Gly Ala Gly Ser
2495 2500 2505
Asp Lys Asp Val Ala Pro Gly Ser Ala Val Leu Arg Gln Trp Leu
2510 2515 2520
Pro Ala Gly Ser Ile Leu Val Asp Asn Asp Ile Asn Pro Phe Val
2525 2530 2535
Ser Asp Ser Val Ala Ser Tyr Tyr Gly Asn Cys Ile Thr Leu Pro
2540 2545 2550
Ile Ala Cys Gln Trp Asp Leu Ile Ile Ser Asp Met Tyr Asp Pro
2555 2560 2565
Leu Thr Lys Asn Ile Gly Glu Tyr Asn Val Ser Lys Asp Gly Phe
2570 2575 2580
Phe Thr Tyr Leu Cys His Leu Ile Arg Asp Lys Leu Ala Leu Gly
2585 2590 2595
Gly Ser Val Ala Ile Lys Ile Thr Glu Phe Ser Trp Asn Ala Glu
2600 2605 2610
Leu Tyr Ser Leu Met Gly Lys Phe Ala Phe Trp Thr Ile Phe Cys
2615 2620 2625
Thr Asn Val Asn Ala Ser Ser Ser Glu Gly Phe Leu Ile Gly Ile
2630 2635 2640
Asn Trp Leu Asn Arg Thr Arg Thr Glu Ile Asp Gly Lys Thr Met
2645 2650 2655
His Ala Asn Tyr Leu Phe Trp Arg Asn Ser Thr Met Trp Asn Gly
2660 2665 2670
Gly Ala Tyr Ser Leu Phe Asp Met Ser Lys Phe Pro Leu Lys Val
2675 2680 2685
Ala Gly Thr Ala Val Val Ser Leu Lys Pro Asp Gln Ile Asn Asp
2690 2695 2700
Leu Val Leu Ser Leu Ile Glu Lys Gly Lys Leu Leu Val Arg Asp
2705 2710 2715
Thr Arg Lys Glu Val Phe Val Gly Asp Ser Leu Val Asn Val Lys
2720 2725 2730
<210> 89
<211> 2721
<212> PRT
<213> human coronavirus OC43
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 89
Phe Phe Lys Arg Val Arg Gly Thr Ser Val Asp Ala Arg Leu Val Pro
1 5 10 15
Cys Ala Ser Gly Leu Ser Thr Asp Val Gln Leu Arg Ala Phe Asp Ile
20 25 30
Tyr Asn Ala Ser Val Ala Gly Ile Gly Leu His Leu Lys Val Asn Cys
35 40 45
Cys Arg Phe Gln Arg Val Asp Glu Asn Gly Asp Lys Leu Asp Gln Phe
50 55 60
Phe Val Val Lys Arg Thr Asp Leu Thr Ile Tyr Asn Arg Glu Met Lys
65 70 75 80
Cys Tyr Glu Arg Val Lys Asp Cys Lys Phe Val Ala Glu His Asp Phe
85 90 95
Phe Thr Phe Asp Val Glu Gly Ser Arg Val Pro His Ile Val Arg Lys
100 105 110
Asp Leu Thr Lys Tyr Thr Met Leu Asp Leu Cys Tyr Ala Leu Arg His
115 120 125
Phe Asp Arg Asn Asp Cys Met Leu Leu Cys Asp Ile Leu Ser Ile Tyr
130 135 140
Ala Gly Cys Glu Gln Ser Tyr Phe Thr Lys Lys Asp Trp Tyr Asp Phe
145 150 155 160
Val Glu Asn Pro Asp Ile Ile Asn Val Tyr Lys Lys Leu Gly Pro Ile
165 170 175
Phe Asn Arg Ala Leu Val Ser Ala Thr Glu Phe Ala Asp Lys Leu Val
180 185 190
Glu Val Gly Leu Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Asn
195 200 205
Gly Lys Trp Tyr Asp Phe Gly Asp Tyr Val Ile Ala Ala Pro Gly Cys
210 215 220
Gly Val Ala Ile Ala Asp Ser Tyr Tyr Ser Tyr Ile Met Pro Met Leu
225 230 235 240
Thr Met Cys His Ala Leu Asp Cys Glu Leu Tyr Val Asn Asn Ala Tyr
245 250 255
Arg Leu Phe Asp Leu Val Gln Tyr Asp Phe Thr Asp Tyr Lys Leu Glu
260 265 270
Leu Phe Asn Lys Tyr Phe Lys His Trp Ser Met Pro Tyr His Pro Asn
275 280 285
Thr Val Asp Cys Gln Asp Asp Arg Cys Ile Ile His Cys Ala Asn Phe
290 295 300
Asn Ile Leu Phe Ser Met Val Leu Pro Asn Thr Cys Phe Gly Pro Leu
305 310 315 320
Val Arg Gln Ile Phe Val Asp Gly Val Pro Phe Val Val Ser Ile Gly
325 330 335
Tyr His Tyr Lys Glu Leu Gly Ile Val Met Asn Met Asp Val Asp Thr
340 345 350
His Arg Tyr Arg Leu Ser Leu Lys Asp Leu Leu Leu Tyr Ala Ala Asp
355 360 365
Pro Ala Leu His Val Ala Ser Ala Ser Ala Leu Tyr Asp Leu Arg Thr
370 375 380
Cys Cys Phe Ser Val Ala Ala Ile Thr Ser Gly Val Lys Phe Gln Thr
385 390 395 400
Val Lys Pro Gly Asn Phe Asn Gln Asp Phe Tyr Asp Phe Val Leu Ser
405 410 415
Lys Gly Leu Leu Lys Glu Gly Ser Ser Val Asp Leu Lys His Phe Phe
420 425 430
Phe Thr Gln Asp Gly Asn Ala Ala Ile Thr Asp Tyr Asn Tyr Tyr Lys
435 440 445
Tyr Asn Leu Pro Thr Met Val Asp Ile Lys Gln Leu Leu Phe Val Leu
450 455 460
Glu Val Val Tyr Lys Tyr Phe Glu Ile Tyr Asp Gly Gly Cys Ile Pro
465 470 475 480
Ala Ser Gln Val Ile Val Asn Asn Tyr Asp Lys Ser Ala Gly Tyr Pro
485 490 495
Phe Asn Lys Phe Gly Lys Ala Arg Leu Tyr Tyr Glu Ala Leu Ser Phe
500 505 510
Glu Glu Gln Asp Glu Ile Tyr Ala Tyr Thr Lys Arg Asn Val Leu Pro
515 520 525
Thr Leu Thr Gln Met Asn Leu Lys Tyr Ala Ile Ser Ala Lys Asn Arg
530 535 540
Ala Arg Thr Val Ala Gly Val Ser Ile Leu Ser Thr Met Thr Gly Arg
545 550 555 560
Met Phe His Gln Lys Cys Leu Lys Ser Ile Ala Ala Thr Arg Gly Val
565 570 575
Pro Val Val Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asp Met
580 585 590
Leu Arg Arg Leu Ile Lys Asp Val Asp Asn Pro Val Leu Met Gly Trp
595 600 605
Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Asn Leu Leu Arg Ile Val
610 615 620
Ser Ser Leu Val Leu Ala Arg Lys His Glu Thr Cys Cys Ser Gln Ser
625 630 635 640
Asp Arg Phe Tyr Arg Leu Ala Asn Glu Cys Ala Gln Val Leu Ser Glu
645 650 655
Ile Val Met Cys Gly Gly Cys Tyr Tyr Val Lys Pro Gly Gly Thr Ser
660 665 670
Ser Gly Asp Ala Thr Thr Ala Phe Ala Asn Ser Val Phe Asn Ile Cys
675 680 685
Gln Ala Val Ser Ala Asn Val Cys Ala Leu Met Ser Cys Asn Gly Asn
690 695 700
Lys Ile Glu Asp Leu Ser Ile Arg Ala Leu Gln Lys Arg Leu Tyr Ser
705 710 715 720
His Val Tyr Arg Ser Asp Lys Val Asp Ser Thr Phe Val Thr Glu Tyr
725 730 735
Tyr Glu Phe Leu Asn Lys His Phe Ser Met Met Ile Leu Ser Asp Asp
740 745 750
Gly Val Val Cys Tyr Asn Ser Asp Tyr Ala Ser Lys Gly Tyr Ile Ala
755 760 765
Asn Ile Ser Ala Phe Gln Gln Val Leu Tyr Tyr Gln Asn Asn Val Phe
770 775 780
Met Ser Glu Ser Lys Cys Trp Val Glu His Asp Ile Asn Asn Gly Pro
785 790 795 800
His Glu Phe Cys Ser Gln His Thr Met Leu Val Lys Met Asp Gly Asp
805 810 815
Asp Val Tyr Leu Pro Tyr Pro Asn Pro Ser Arg Ile Leu Gly Ala Gly
820 825 830
Cys Phe Val Asp Asp Leu Leu Lys Thr Asp Ser Val Leu Leu Ile Glu
835 840 845
Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Val Tyr His Glu
850 855 860
Asn Glu Glu Tyr Gln Lys Val Phe Arg Val Tyr Leu Ala Tyr Ile Lys
865 870 875 880
Lys Leu Tyr Asn Asp Leu Gly Asn Gln Ile Leu Asp Ser Tyr Ser Val
885 890 895
Ile Leu Ser Thr Cys Asp Gly Gln Lys Phe Thr Asp Glu Ser Phe Tyr
900 905 910
Lys Asn Met Tyr Leu Arg Ser Ala Val Met Gln Ser Val Gly Ala Cys
915 920 925
Val Val Cys Ser Ser Gln Thr Ser Leu Arg Cys Gly Ser Cys Ile Arg
930 935 940
Lys Pro Leu Leu Cys Cys Lys Cys Cys Tyr Asp His Val Met Ala Thr
945 950 955 960
Asp His Lys Tyr Val Leu Ser Val Ser Pro Tyr Val Cys Asn Ala Pro
965 970 975
Gly Cys Asp Val Asn Asp Val Thr Lys Leu Tyr Leu Gly Gly Met Ser
980 985 990
Tyr Tyr Cys Glu Asp His Lys Pro Gln Tyr Ser Phe Lys Leu Val Met
995 1000 1005
Asn Gly Leu Val Phe Gly Leu Tyr Lys Gln Ser Cys Thr Gly Ser
1010 1015 1020
Pro Tyr Ile Asp Asp Phe Asn Arg Ile Ala Ser Cys Lys Trp Thr
1025 1030 1035
Asp Val Asp Asp Tyr Ile Leu Ala Asn Glu Cys Thr Glu Arg Leu
1040 1045 1050
Lys Leu Phe Ala Ala Glu Thr Gln Lys Ala Thr Glu Glu Ala Phe
1055 1060 1065
Lys Gln Ser Tyr Ala Ser Ala Thr Ile Gln Glu Ile Val Ser Glu
1070 1075 1080
Arg Glu Leu Ile Leu Ser Trp Glu Ile Gly Lys Val Lys Pro Pro
1085 1090 1095
Leu Asn Lys Asn Tyr Val Phe Thr Gly Tyr His Phe Thr Lys Asn
1100 1105 1110
Gly Lys Thr Val Leu Gly Glu Tyr Val Phe Asp Lys Ser Glu Leu
1115 1120 1125
Thr Asn Gly Val Tyr Tyr Arg Ala Thr Thr Thr Tyr Lys Leu Ser
1130 1135 1140
Val Gly Asp Val Phe Val Leu Thr Ser His Ser Val Ala Asn Leu
1145 1150 1155
Ser Ala Pro Thr Leu Val Pro Gln Glu Asn Tyr Ser Ser Ile Arg
1160 1165 1170
Phe Ala Ser Val Tyr Ser Val Leu Glu Thr Phe Gln Asn Asn Val
1175 1180 1185
Val Asn Tyr Gln His Ile Gly Met Lys Arg Tyr Cys Thr Val Gln
1190 1195 1200
Gly Pro Pro Gly Thr Gly Lys Ser His Leu Ala Ile Gly Leu Ala
1205 1210 1215
Val Phe Tyr Cys Thr Ala Arg Val Val Tyr Thr Ala Ala Ser His
1220 1225 1230
Ala Ala Val Asp Ala Leu Cys Glu Lys Ala Tyr Lys Phe Leu Asn
1235 1240 1245
Ile Asn Asp Cys Thr Arg Ile Val Pro Ala Lys Val Arg Val Glu
1250 1255 1260
Cys Tyr Asp Lys Phe Lys Ile Asn Asp Thr Thr Arg Lys Tyr Val
1265 1270 1275
Phe Thr Thr Ile Asn Ala Leu Pro Glu Met Val Thr Asp Ile Val
1280 1285 1290
Val Val Asp Glu Val Ser Met Leu Thr Asn Tyr Glu Leu Ser Val
1295 1300 1305
Ile Asn Ala Arg Ile Arg Ala Lys His Tyr Val Tyr Ile Gly Asp
1310 1315 1320
Pro Ala Gln Leu Pro Ala Pro Arg Val Leu Leu Ser Lys Gly Thr
1325 1330 1335
Leu Glu Pro Lys Tyr Phe Asn Thr Val Thr Lys Leu Met Cys Cys
1340 1345 1350
Leu Gly Pro Asp Ile Phe Leu Gly Thr Cys Tyr Arg Cys Pro Lys
1355 1360 1365
Glu Ile Val Asp Thr Val Ser Ala Leu Val Tyr Glu Asn Lys Leu
1370 1375 1380
Lys Ala Lys Asn Glu Ser Ser Ser Leu Cys Phe Lys Val Tyr Tyr
1385 1390 1395
Lys Gly Val Thr Thr His Glu Ser Ser Ser Ala Val Asn Met Gln
1400 1405 1410
Gln Ile Tyr Leu Ile Asn Lys Phe Leu Lys Ala Asn Pro Leu Trp
1415 1420 1425
His Lys Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln Asn Phe Ala
1430 1435 1440
Ala Lys Arg Val Leu Gly Leu Gln Thr Gln Thr Val Asp Ser Ala
1445 1450 1455
Gln Gly Ser Glu Tyr Asp Tyr Val Ile Tyr Ser Gln Thr Ala Glu
1460 1465 1470
Thr Ala His Ser Val Asn Val Asn Arg Phe Asn Val Ala Ile Thr
1475 1480 1485
Arg Ala Lys Lys Gly Ile Leu Cys Val Met Ser Asn Met Gln Leu
1490 1495 1500
Phe Glu Ala Leu Gln Phe Thr Thr Leu Thr Leu Asp Lys Val Pro
1505 1510 1515
Gln Ala Val Glu Thr Lys Val Gln Cys Ser Thr Asn Leu Phe Lys
1520 1525 1530
Asp Cys Ser Lys Ser Tyr Ser Gly Tyr His Pro Ala His Ala Pro
1535 1540 1545
Ser Phe Leu Ala Val Asp Asp Lys Tyr Lys Ala Thr Gly Asp Leu
1550 1555 1560
Ala Val Cys Leu Gly Ile Gly Asp Ser Ala Val Thr Tyr Ser Arg
1565 1570 1575
Leu Ile Ser Leu Met Gly Phe Lys Leu Asp Val Thr Leu Asp Gly
1580 1585 1590
Tyr Cys Lys Leu Phe Ile Thr Lys Glu Glu Ala Val Lys Arg Val
1595 1600 1605
Arg Ala Trp Val Gly Phe Asp Ala Glu Gly Ala His Ala Thr Arg
1610 1615 1620
Asp Ser Ile Gly Thr Asn Phe Pro Leu Gln Leu Gly Phe Ser Thr
1625 1630 1635
Gly Ile Asp Phe Val Val Glu Ala Thr Gly Leu Phe Ala Asp Arg
1640 1645 1650
Asp Gly Tyr Ser Phe Lys Lys Ala Val Ala Lys Ala Pro Pro Gly
1655 1660 1665
Glu Gln Phe Lys His Leu Ile Pro Leu Met Thr Arg Gly His Arg
1670 1675 1680
Trp Asp Val Val Arg Pro Arg Ile Val Gln Met Phe Ala Asp His
1685 1690 1695
Leu Ile Asp Leu Ser Asp Cys Val Val Leu Val Thr Trp Ala Ala
1700 1705 1710
Asn Phe Glu Leu Thr Cys Leu Arg Tyr Phe Ala Lys Val Gly Arg
1715 1720 1725
Glu Ile Ser Cys Asn Val Cys Thr Lys Arg Ala Thr Val Tyr Asn
1730 1735 1740
Ser Arg Thr Gly Tyr Tyr Gly Cys Trp Arg His Ser Val Thr Cys
1745 1750 1755
Asp Tyr Leu Tyr Asn Pro Leu Ile Val Asp Ile Gln Gln Trp Gly
1760 1765 1770
Tyr Ile Gly Ser Leu Ser Ser Asn His Asp Leu Tyr Cys Ser Val
1775 1780 1785
His Lys Gly Ala His Val Ala Ser Ser Asp Ala Ile Met Thr Arg
1790 1795 1800
Cys Leu Ala Val Tyr Asp Cys Phe Cys Asn Asn Ile Asn Trp Asn
1805 1810 1815
Val Glu Tyr Pro Ile Ile Ser Asn Glu Leu Ser Ile Asn Thr Ser
1820 1825 1830
Cys Arg Val Leu Gln Arg Val Ile Leu Lys Ala Ala Met Leu Cys
1835 1840 1845
Asn Arg Tyr Thr Leu Cys Tyr Asp Ile Gly Asn Pro Lys Ala Ile
1850 1855 1860
Ala Cys Val Lys Asp Phe Asp Phe Lys Phe Tyr Asp Ala Gln Pro
1865 1870 1875
Ile Val Lys Ser Val Lys Thr Leu Leu Tyr Ser Phe Glu Ala His
1880 1885 1890
Lys Asp Ser Phe Lys Asp Gly Leu Cys Met Phe Trp Asn Cys Asn
1895 1900 1905
Val Asp Lys Tyr Pro Pro Asn Ala Val Val Cys Arg Phe Asp Thr
1910 1915 1920
Arg Val Leu Asn Asn Leu Asn Leu Pro Gly Cys Asn Gly Gly Ser
1925 1930 1935
Leu Tyr Val Asn Lys His Ala Phe His Thr Lys Pro Phe Ala Arg
1940 1945 1950
Ala Ala Phe Glu His Leu Lys Pro Met Pro Phe Phe Tyr Tyr Ser
1955 1960 1965
Asp Thr Pro Cys Val Tyr Met Asp Gly Met Asp Ala Lys Gln Val
1970 1975 1980
Asp Tyr Val Pro Leu Lys Ser Ala Thr Cys Ile Thr Arg Cys Asn
1985 1990 1995
Leu Gly Gly Ala Val Cys Leu Lys His Ala Glu Glu Tyr Arg Glu
2000 2005 2010
Tyr Leu Glu Ser Tyr Asn Thr Ala Thr Thr Ala Gly Phe Thr Phe
2015 2020 2025
Trp Val Tyr Lys Thr Phe Asp Phe Tyr Asn Leu Trp Asn Thr Phe
2030 2035 2040
Thr Lys Leu Gln Ser Leu Glu Asn Val Val Tyr Asn Leu Val Lys
2045 2050 2055
Thr Gly His Tyr Thr Gly Gln Ala Gly Glu Met Pro Cys Ala Ile
2060 2065 2070
Ile Asn Asp Lys Val Val Ala Lys Ile Asp Lys Glu Asp Val Val
2075 2080 2085
Ile Phe Ile Asn Asn Thr Thr Tyr Pro Thr Asn Val Ala Val Glu
2090 2095 2100
Leu Phe Ala Lys Arg Ser Val Arg His His Pro Glu Leu Lys Leu
2105 2110 2115
Phe Arg Asn Leu Asn Ile Asp Val Cys Trp Lys His Val Ile Trp
2120 2125 2130
Asp Tyr Ala Arg Glu Ser Ile Phe Cys Ser Asn Thr Tyr Gly Val
2135 2140 2145
Cys Met Tyr Thr Asp Leu Lys Phe Ile Asp Lys Leu Asn Val Leu
2150 2155 2160
Phe Asp Gly Arg Asp Asn Gly Ala Leu Glu Ala Phe Lys Arg Ser
2165 2170 2175
Asn Asn Gly Val Tyr Ile Ser Thr Thr Lys Val Lys Ser Leu Ser
2180 2185 2190
Met Ile Arg Gly Pro Pro Arg Ala Glu Leu Asn Gly Val Val Val
2195 2200 2205
Asp Lys Val Gly Asp Thr Asp Cys Val Phe Tyr Phe Ala Val Arg
2210 2215 2220
Lys Glu Gly Gln Asp Val Ile Phe Ser Gln Phe Asp Ser Leu Gly
2225 2230 2235
Val Ser Ser Asn Gln Ser Pro Gln Gly Asn Leu Gly Ser Asn Gly
2240 2245 2250
Lys Pro Gly Asn Val Gly Gly Asn Asp Ala Leu Ser Ile Ser Thr
2255 2260 2265
Ile Phe Thr Gln Ser Arg Val Ile Ser Ser Phe Thr Cys Arg Thr
2270 2275 2280
Asp Met Glu Lys Asp Phe Ile Ala Leu Asp Gln Asp Val Phe Ile
2285 2290 2295
Gln Lys Tyr Gly Leu Glu Asp Tyr Ala Phe Glu His Ile Val Tyr
2300 2305 2310
Gly Asn Phe Asn Gln Lys Ile Ile Gly Gly Leu His Leu Leu Ile
2315 2320 2325
Gly Leu Tyr Arg Arg Gln Gln Thr Ser Asn Leu Val Val Gln Glu
2330 2335 2340
Phe Val Ser Tyr Asp Ser Ser Ile His Ser Tyr Phe Ile Thr Asp
2345 2350 2355
Glu Lys Ser Gly Gly Ser Lys Ser Val Cys Thr Val Ile Asp Ile
2360 2365 2370
Leu Leu Asp Asp Phe Val Ala Leu Val Lys Ser Leu Asn Leu Asn
2375 2380 2385
Cys Val Ser Lys Val Val Asn Val Asn Val Asp Phe Lys Asp Phe
2390 2395 2400
Gln Phe Met Leu Trp Cys Asn Asp Glu Lys Val Met Thr Phe Tyr
2405 2410 2415
Pro Arg Leu Gln Ala Ala Ser Asp Trp Lys Pro Gly Tyr Ser Met
2420 2425 2430
Pro Val Leu Tyr Lys Tyr Leu Asn Ser Pro Met Glu Arg Val Ser
2435 2440 2445
Leu Trp Asn Tyr Gly Lys Pro Val Thr Leu Pro Thr Gly Cys Met
2450 2455 2460
Met Asn Val Ala Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Thr
2465 2470 2475
Thr Thr Leu Ala Val Pro Val Asn Met Arg Val Leu His Leu Gly
2480 2485 2490
Ala Gly Ser Glu Lys Gly Val Ala Pro Gly Ser Ala Val Leu Arg
2495 2500 2505
Gln Trp Leu Pro Ala Gly Thr Ile Leu Val Asp Asn Asp Leu Tyr
2510 2515 2520
Pro Phe Val Ser Asp Ser Val Ala Thr Tyr Phe Gly Asp Cys Ile
2525 2530 2535
Thr Leu Pro Phe Asp Cys Gln Trp Asp Leu Ile Ile Ser Asp Met
2540 2545 2550
Tyr Asp Pro Ile Thr Lys Asn Ile Gly Glu Tyr Asn Val Ser Lys
2555 2560 2565
Asp Gly Phe Phe Thr Tyr Ile Cys His Met Ile Arg Asp Lys Leu
2570 2575 2580
Ala Leu Gly Gly Ser Val Ala Ile Lys Ile Thr Glu Phe Ser Trp
2585 2590 2595
Asn Ala Glu Leu Tyr Lys Leu Met Gly Tyr Phe Ala Phe Trp Thr
2600 2605 2610
Val Phe Cys Thr Asn Ala Asn Ala Ser Ser Ser Glu Gly Phe Leu
2615 2620 2625
Ile Gly Ile Asn Tyr Leu Cys Lys Pro Lys Val Glu Ile Asp Gly
2630 2635 2640
Asn Val Met His Ala Asn Tyr Leu Phe Trp Arg Asn Ser Thr Val
2645 2650 2655
Trp Asn Gly Gly Ala Tyr Ser Leu Phe Asp Met Ala Lys Phe Pro
2660 2665 2670
Leu Lys Leu Ala Gly Thr Ala Val Ile Asn Leu Arg Ala Asp Gln
2675 2680 2685
Ile Asn Asp Met Val Tyr Ser Leu Leu Glu Lys Gly Lys Leu Leu
2690 2695 2700
Ile Arg Asp Thr Asn Lys Glu Val Phe Val Gly Asp Ser Leu Val
2705 2710 2715
Asn Val Ile
2720
<210> 90
<211> 2678
<212> PRT
<213> porcine epidemic diarrhea virus
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 90
Tyr Gly Leu Phe Lys Arg Val Arg Gly Ser Ser Ala Ala Arg Leu Glu
1 5 10 15
Pro Cys Asn Gly Thr Asp Thr Gln His Val Tyr Arg Ala Phe Asp Ile
20 25 30
Tyr Asn Lys Asp Val Ala Cys Leu Gly Lys Phe Leu Lys Val Asn Cys
35 40 45
Val Arg Leu Lys Asn Leu Asp Lys His Asp Ala Phe Tyr Val Val Lys
50 55 60
Arg Cys Thr Lys Ser Ala Met Glu His Glu Gln Ser Ile Tyr Ser Arg
65 70 75 80
Leu Glu Lys Cys Gly Ala Ile Ala Glu His Asp Phe Phe Thr Trp Lys
85 90 95
Asp Gly Arg Ala Ile Tyr Gly Asn Val Cys Arg Lys Asp Leu Thr Glu
100 105 110
Tyr Thr Met Met Asp Leu Cys Tyr Ala Leu Arg Asn Phe Asp Glu Asn
115 120 125
Asn Cys Asp Val Leu Lys Ser Ile Leu Ile Lys Val Gly Ala Cys Glu
130 135 140
Glu Ser Tyr Phe Asn Asn Lys Val Trp Phe Asp Pro Val Glu Asn Glu
145 150 155 160
Asp Ile His Arg Val Tyr Ala Leu Leu Gly Thr Ile Val Ala Arg Ala
165 170 175
Met Leu Lys Cys Val Lys Phe Cys Asp Ala Met Val Glu Gln Gly Ile
180 185 190
Val Gly Val Val Thr Leu Asp Asn Gln Asp Leu Asn Gly Asp Phe Tyr
195 200 205
Asp Phe Gly Asp Phe Thr Cys Ser Ile Lys Gly Met Gly Val Pro Ile
210 215 220
Cys Thr Ser Tyr Tyr Ser Tyr Met Met Pro Val Met Gly Met Thr Asn
225 230 235 240
Cys Leu Ala Ser Glu Cys Phe Val Lys Ser Asp Ile Phe Gly Glu Asp
245 250 255
Phe Lys Ser Tyr Asp Leu Leu Glu Tyr Asp Phe Thr Glu His Lys Thr
260 265 270
Ala Leu Phe Asn Lys Tyr Phe Lys Tyr Trp Gly Leu Gln Tyr His Pro
275 280 285
Asn Cys Val Asp Cys Ser Asp Glu Gln Cys Ile Val His Cys Ala Asn
290 295 300
Phe Asn Thr Leu Phe Ser Thr Thr Ile Pro Ile Thr Ala Phe Gly Pro
305 310 315 320
Leu Cys Arg Lys Cys Trp Ile Asp Gly Val Pro Leu Val Thr Thr Ala
325 330 335
Gly Tyr His Phe Lys Gln Leu Gly Ile Val Trp Asn Asn Asp Leu Asn
340 345 350
Leu His Ser Ser Arg Leu Ser Ile Asn Glu Leu Leu Gln Phe Cys Ser
355 360 365
Asp Pro Ala Leu Leu Ile Ala Ser Ser Pro Ala Leu Val Asp Gln Arg
370 375 380
Thr Val Cys Phe Ser Val Ala Ala Leu Gly Thr Gly Met Thr Asn Gln
385 390 395 400
Thr Val Lys Pro Gly His Phe Asn Lys Glu Phe Tyr Asp Phe Leu Leu
405 410 415
Glu Gln Gly Phe Phe Ser Glu Gly Ser Glu Leu Thr Leu Lys His Phe
420 425 430
Phe Phe Ala Gln Lys Val Asp Ala Ala Val Lys Asp Phe Asp Tyr Tyr
435 440 445
Arg Tyr Asn Arg Pro Thr Val Leu Asp Ile Cys Gln Ala Arg Val Val
450 455 460
Tyr Gln Ile Val Gln Arg Tyr Phe Asp Ile Tyr Glu Gly Gly Cys Ile
465 470 475 480
Thr Ala Lys Glu Val Val Val Thr Asn Leu Asn Lys Ser Ala Gly Tyr
485 490 495
Pro Leu Asn Lys Phe Gly Lys Ala Gly Leu Tyr Tyr Glu Ser Leu Ser
500 505 510
Tyr Glu Glu Gln Asp Glu Leu Tyr Ala Tyr Thr Lys Arg Asn Ile Leu
515 520 525
Pro Thr Met Thr Gln Leu Asn Leu Lys Tyr Ala Ile Ser Gly Lys Glu
530 535 540
Arg Ala Arg Thr Val Gly Gly Val Ser Leu Leu Ser Thr Met Thr Thr
545 550 555 560
Arg Gln Tyr His Gln Lys His Leu Lys Ser Ile Val Asn Thr Arg Gly
565 570 575
Ala Ser Val Val Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp Asn
580 585 590
Met Leu Lys Asn Leu Ile Asp Gly Val Glu Asn Pro Cys Leu Met Gly
595 600 605
Trp Asp Tyr Pro Lys Cys Asp Arg Ala Leu Pro Asn Met Ile Arg Met
610 615 620
Ile Ser Ala Met Ile Leu Gly Ser Lys His Thr Thr Cys Cys Ser Ser
625 630 635 640
Thr Asp Arg Phe Phe Arg Leu Cys Asn Glu Leu Ala Gln Val Leu Thr
645 650 655
Glu Val Val Tyr Ser Asn Gly Gly Phe Tyr Leu Lys Pro Gly Gly Thr
660 665 670
Thr Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val Phe Asn Ile
675 680 685
Phe Gln Ala Val Ser Ala Asn Val Asn Lys Leu Leu Ser Val Asp Ser
690 695 700
Asn Val Cys His Asn Leu Glu Val Lys Gln Leu Gln Arg Lys Leu Tyr
705 710 715 720
Glu Cys Cys Tyr Arg Ser Thr Ile Val Asp Asp Gln Phe Val Val Glu
725 730 735
Tyr Tyr Gly Tyr Leu Arg Lys His Phe Ser Met Met Ile Leu Ser Asp
740 745 750
Asp Gly Val Val Cys Tyr Asn Asn Asp Tyr Ala Ser Leu Gly Tyr Val
755 760 765
Ala Asp Leu Asn Ala Phe Lys Ala Val Leu Tyr Tyr Gln Asn Asn Val
770 775 780
Phe Met Ser Ala Ser Lys Cys Trp Ile Glu Pro Asp Ile Asn Lys Gly
785 790 795 800
Pro His Glu Phe Cys Ser Gln His Thr Met Gln Ile Val Asp Lys Glu
805 810 815
Gly Thr Tyr Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Ser Ala
820 825 830
Gly Val Phe Val Asp Asp Val Val Lys Thr Asp Ala Val Val Leu Leu
835 840 845
Glu Arg Tyr Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Ser Lys His
850 855 860
Glu Asn Pro Glu Tyr Lys Lys Val Phe Tyr Val Leu Leu Asp Trp Val
865 870 875 880
Lys His Leu Tyr Lys Thr Leu Asn Ala Gly Val Leu Glu Ser Phe Ser
885 890 895
Val Thr Leu Leu Glu Asp Ser Thr Ala Lys Phe Trp Asp Glu Ser Phe
900 905 910
Tyr Ala Asn Met Tyr Glu Lys Ser Ala Val Leu Gln Ser Ala Gly Leu
915 920 925
Cys Val Val Cys Gly Ser Gln Thr Val Leu Arg Cys Gly Asp Cys Leu
930 935 940
Arg Arg Pro Met Leu Cys Thr Lys Cys Ala Tyr Asp His Val Ile Gly
945 950 955 960
Thr Thr His Lys Phe Ile Leu Ala Ile Thr Pro Tyr Val Cys Cys Ala
965 970 975
Ser Asp Cys Gly Val Asn Asp Val Thr Lys Leu Tyr Leu Gly Gly Leu
980 985 990
Ser Tyr Trp Cys His Glu His Lys Pro Arg Leu Ala Phe Pro Leu Cys
995 1000 1005
Ser Ala Gly Asn Val Phe Gly Leu Tyr Lys Asn Ser Ala Thr Gly
1010 1015 1020
Ser Pro Asp Val Glu Asp Phe Asn Arg Ile Ala Thr Ser Asp Trp
1025 1030 1035
Thr Asp Val Ser Asp Tyr Arg Leu Ala Asn Asp Val Lys Asp Ser
1040 1045 1050
Leu Arg Leu Phe Ala Ala Glu Thr Ile Lys Ala Lys Glu Glu Ser
1055 1060 1065
Val Lys Ser Ser Tyr Ala Cys Ala Thr Leu His Glu Val Val Gly
1070 1075 1080
Pro Lys Glu Leu Leu Leu Lys Trp Glu Val Gly Arg Pro Lys Pro
1085 1090 1095
Pro Leu Asn Arg Asn Ser Val Phe Thr Cys Tyr His Ile Thr Lys
1100 1105 1110
Asn Thr Lys Phe Gln Ile Gly Glu Phe Val Phe Glu Lys Ala Glu
1115 1120 1125
Tyr Asp Asn Asp Ala Val Thr Tyr Lys Thr Thr Ala Thr Thr Lys
1130 1135 1140
Leu Val Pro Gly Met Val Phe Val Leu Thr Ser His Asn Val Gln
1145 1150 1155
Pro Leu Arg Ala Pro Thr Ile Ala Asn Gln Glu Arg Tyr Ser Thr
1160 1165 1170
Ile His Lys Leu His Pro Ala Phe Asn Ile Pro Glu Ala Tyr Ser
1175 1180 1185
Ser Leu Val Pro Tyr Tyr Gln Leu Ile Gly Lys Gln Lys Ile Thr
1190 1195 1200
Thr Ile Gln Gly Pro Pro Gly Ser Gly Lys Ser His Cys Val Ile
1205 1210 1215
Gly Leu Gly Leu Tyr Tyr Pro Gly Ala Arg Ile Val Phe Thr Ala
1220 1225 1230
Cys Ser His Ala Ala Val Asp Ser Leu Cys Val Lys Ala Ser Thr
1235 1240 1245
Ala Tyr Ser Asn Asp Lys Cys Ser Arg Ile Ile Pro Gln Arg Ala
1250 1255 1260
Arg Val Glu Cys Tyr Asp Gly Phe Lys Ser Asn Asn Thr Ser Ala
1265 1270 1275
Gln Tyr Leu Phe Ser Thr Val Asn Ala Leu Pro Glu Cys Asn Ala
1280 1285 1290
Asp Ile Val Val Val Asp Glu Val Ser Met Cys Thr Asn Tyr Asp
1295 1300 1305
Leu Ser Val Ile Asn Gln Arg Ile Ser Tyr Arg His Val Val Tyr
1310 1315 1320
Val Gly Asp Pro Gln Gln Leu Pro Ala Pro Arg Val Met Ile Ser
1325 1330 1335
Arg Gly Thr Leu Glu Pro Lys Asp Tyr Asn Val Val Thr Gln Arg
1340 1345 1350
Met Cys Ala Leu Lys Pro Asp Val Phe Leu His Lys Cys Tyr Arg
1355 1360 1365
Cys Pro Ala Glu Ile Val Arg Thr Val Ser Glu Met Val Tyr Glu
1370 1375 1380
Asn Gln Phe Ile Pro Val His Pro Asp Ser Lys Gln Cys Phe Lys
1385 1390 1395
Ile Phe Cys Lys Gly Asn Val Gln Val Asp Asn Gly Ser Ser Ile
1400 1405 1410
Asn Arg Arg Gln Leu Asp Val Val Arg Met Phe Leu Ala Lys Asn
1415 1420 1425
Pro Arg Trp Ser Lys Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln
1430 1435 1440
Asn Tyr Val Ala Ser Arg Leu Leu Gly Leu Gln Ile Gln Thr Val
1445 1450 1455
Asp Ser Ser Gln Gly Ser Glu Tyr Asp Tyr Val Ile Tyr Ala Gln
1460 1465 1470
Thr Ser Asp Thr Ala His Ala Ser Asn Val Asn Arg Phe Asn Val
1475 1480 1485
Ala Ile Thr Arg Ala Lys Lys Gly Ile Leu Cys Ile Met Cys Asp
1490 1495 1500
Arg Ser Leu Phe Asp Leu Leu Lys Phe Phe Glu Leu Lys Leu Ser
1505 1510 1515
Asp Leu Gln Ala Asn Glu Gly Cys Gly Leu Phe Lys Asp Cys Ser
1520 1525 1530
Arg Gly Asp Asp Leu Leu Pro Pro Ser His Ala Asn Thr Phe Met
1535 1540 1545
Ser Leu Ala Asp Asn Phe Lys Thr Asp Gln Tyr Leu Ala Val Gln
1550 1555 1560
Ile Gly Val Asn Gly Pro Ile Lys Tyr Glu His Val Ile Ser Phe
1565 1570 1575
Met Gly Phe Arg Phe Asp Ile Asn Ile Pro Asn His His Thr Leu
1580 1585 1590
Phe Cys Thr Arg Asp Phe Ala Met Arg Asn Val Arg Gly Trp Leu
1595 1600 1605
Gly Phe Asp Val Glu Gly Ala His Val Val Gly Ser Asn Val Gly
1610 1615 1620
Thr Asn Val Pro Leu Gln Leu Gly Phe Ser Asn Gly Val Asp Phe
1625 1630 1635
Val Val Arg Pro Glu Gly Cys Val Val Thr Glu Ser Gly Asp Tyr
1640 1645 1650
Ile Lys Pro Val Arg Ala Arg Ala Pro Pro Gly Glu Gln Phe Ala
1655 1660 1665
His Leu Leu Pro Leu Leu Lys Arg Gly Gln Pro Trp Asp Val Val
1670 1675 1680
Arg Lys Arg Ile Val Gln Met Cys Ser Asp Tyr Leu Ala Asn Leu
1685 1690 1695
Ser Asp Ile Leu Ile Phe Val Leu Trp Ala Gly Gly Leu Glu Leu
1700 1705 1710
Thr Thr Met Arg Tyr Phe Val Lys Ile Gly Pro Ser Lys Ser Cys
1715 1720 1725
Asp Cys Gly Lys Val Ala Thr Cys Tyr Asn Ser Ala Leu His Thr
1730 1735 1740
Tyr Cys Cys Phe Lys His Ala Leu Gly Cys Asp Tyr Leu Tyr Asn
1745 1750 1755
Pro Tyr Cys Ile Asp Ile Gln Gln Trp Gly Tyr Lys Gly Ser Leu
1760 1765 1770
Ser Leu Asn His His Glu His Cys Asn Val His Arg Asn Glu His
1775 1780 1785
Val Ala Ser Gly Asp Ala Ile Met Thr Arg Cys Leu Ala Ile His
1790 1795 1800
Asp Cys Phe Val Lys Asn Val Asp Trp Ser Ile Thr Tyr Pro Phe
1805 1810 1815
Ile Gly Asn Glu Ala Val Ile Asn Lys Ser Gly Arg Ile Val Gln
1820 1825 1830
Ser His Thr Met Arg Ser Val Leu Lys Leu Tyr Asn Pro Lys Ala
1835 1840 1845
Ile Tyr Asp Ile Gly Asn Pro Lys Gly Ile Arg Cys Ala Val Thr
1850 1855 1860
Asp Ala Lys Trp Phe Cys Phe Asp Lys Asn Pro Thr Asn Ser Asn
1865 1870 1875
Val Lys Thr Leu Glu Tyr Asp Tyr Ile Thr His Gly Gln Phe Asp
1880 1885 1890
Gly Leu Cys Leu Phe Trp Asn Cys Asn Val Asp Met Tyr Pro Glu
1895 1900 1905
Phe Ser Val Val Cys Arg Phe Asp Thr Arg Cys Arg Ser Pro Leu
1910 1915 1920
Asn Leu Glu Gly Cys Asn Gly Gly Ser Leu Tyr Val Asn Asn His
1925 1930 1935
Ala Phe His Thr Pro Ala Phe Asp Lys Arg Ala Phe Ala Lys Leu
1940 1945 1950
Lys Pro Met Pro Phe Phe Phe Tyr Asp Asp Thr Glu Cys Asp Lys
1955 1960 1965
Leu Gln Asp Ser Ile Asn Tyr Val Pro Leu Arg Ala Ser Asn Cys
1970 1975 1980
Ile Thr Lys Cys Asn Val Gly Gly Ala Val Cys Ser Lys His Cys
1985 1990 1995
Ala Met Tyr His Ser Tyr Val Asn Ala Tyr Asn Thr Phe Thr Ser
2000 2005 2010
Ala Gly Phe Thr Ile Trp Val Pro Thr Ser Phe Asp Thr Tyr Asn
2015 2020 2025
Leu Trp Gln Thr Phe Ser Asn Asn Leu Gln Gly Leu Glu Asn Ile
2030 2035 2040
Ala Phe Asn Val Leu Lys Lys Gly Ser Phe Val Gly Asp Glu Gly
2045 2050 2055
Glu Leu Pro Val Ala Val Val Asn Asp Lys Val Leu Val Arg Asp
2060 2065 2070
Gly Thr Val Asp Thr Leu Val Phe Thr Asn Lys Thr Ser Leu Pro
2075 2080 2085
Thr Asn Val Ala Phe Glu Leu Tyr Ala Lys Arg Lys Val Gly Leu
2090 2095 2100
Thr Pro Pro Ile Thr Ile Leu Arg Asn Leu Gly Val Val Cys Thr
2105 2110 2115
Ser Lys Cys Val Ile Trp Asp Tyr Glu Ala Glu Arg Pro Leu Thr
2120 2125 2130
Thr Phe Thr Lys Asp Val Cys Lys Tyr Thr Asp Phe Glu Gly Asp
2135 2140 2145
Val Cys Thr Leu Phe Asp Asn Ser Ile Val Gly Ser Leu Glu Arg
2150 2155 2160
Phe Ser Met Thr Gln Asn Ala Val Leu Met Ser Leu Thr Ala Val
2165 2170 2175
Lys Lys Leu Thr Gly Ile Lys Leu Thr Tyr Gly Tyr Leu Asn Gly
2180 2185 2190
Val Pro Val Asn Thr His Glu Asp Lys Pro Phe Thr Trp Tyr Ile
2195 2200 2205
Tyr Thr Arg Lys Asn Gly Lys Phe Glu Asp Tyr Pro Asp Gly Tyr
2210 2215 2220
Phe Thr Gln Gly Arg Thr Thr Ala Asp Phe Ser Pro Arg Ser Asp
2225 2230 2235
Met Glu Lys Asp Phe Leu Ser Met Asp Met Gly Leu Phe Ile Asn
2240 2245 2250
Lys Tyr Gly Leu Glu Asp Tyr Gly Phe Glu His Val Val Tyr Gly
2255 2260 2265
Asp Val Ser Lys Thr Thr Leu Gly Gly Leu His Leu Leu Ile Ser
2270 2275 2280
Gln Val Arg Leu Ala Cys Met Gly Val Leu Lys Ile Asp Glu Phe
2285 2290 2295
Val Ser Ser Asn Asp Ser Thr Leu Lys Ser Cys Thr Val Thr Tyr
2300 2305 2310
Ala Asp Asn Pro Ser Ser Lys Met Val Cys Thr Tyr Met Asp Leu
2315 2320 2325
Leu Leu Asp Asp Phe Val Ser Ile Leu Lys Ser Leu Asp Leu Ser
2330 2335 2340
Val Val Ser Lys Val His Glu Val Met Val Asp Cys Lys Met Trp
2345 2350 2355
Arg Trp Met Leu Trp Cys Lys Asp His Lys Leu Gln Thr Phe Tyr
2360 2365 2370
Pro Gln Leu Gln Ala Ser Glu Trp Lys Cys Gly Tyr Ser Met Pro
2375 2380 2385
Ser Ile Tyr Lys Ile Gln Arg Met Cys Leu Glu Pro Cys Asn Leu
2390 2395 2400
Tyr Asn Tyr Gly Ala Gly Val Lys Leu Pro Asp Gly Ile Met Phe
2405 2410 2415
Asn Val Val Lys Tyr Thr Gln Leu Cys Gln Tyr Leu Asn Ser Thr
2420 2425 2430
Thr Met Cys Val Pro His His Met Arg Val Leu His Leu Gly Ala
2435 2440 2445
Gly Ser Asp Lys Gly Val Ala Pro Gly Thr Ala Val Leu Arg Arg
2450 2455 2460
Trp Leu Pro Leu Asp Ala Ile Ile Val Asp Asn Asp Ser Val Asp
2465 2470 2475
Tyr Val Ser Asp Ala Asp Tyr Ser Val Thr Gly Asp Cys Ser Thr
2480 2485 2490
Leu Tyr Leu Ser Asp Lys Phe Asp Leu Val Ile Ser Asp Met Tyr
2495 2500 2505
Asp Gly Lys Ile Lys Ser Cys Asp Gly Glu Asn Val Ser Lys Glu
2510 2515 2520
Gly Phe Phe Pro Tyr Ile Asn Gly Val Ile Thr Glu Lys Leu Ala
2525 2530 2535
Leu Gly Gly Thr Val Ala Ile Lys Val Thr Glu Phe Ser Trp Asn
2540 2545 2550
Lys Lys Leu Tyr Glu Leu Ile Gln Lys Phe Glu Tyr Trp Thr Met
2555 2560 2565
Phe Cys Thr Ser Val Asn Thr Ser Ser Ser Glu Ala Phe Leu Ile
2570 2575 2580
Gly Val His Tyr Leu Gly Asp Phe Ala Ser Gly Ala Val Ile Asp
2585 2590 2595
Gly Asn Thr Met His Ala Asn Tyr Ile Phe Trp Arg Asn Ser Thr
2600 2605 2610
Ile Met Thr Met Ser Tyr Asn Ser Val Leu Asp Leu Ser Lys Phe
2615 2620 2625
Asn Cys Lys His Lys Ala Thr Val Val Val Asn Leu Lys Asp Ser
2630 2635 2640
Ser Ile Ser Asp Val Val Leu Gly Leu Leu Lys Asn Gly Lys Leu
2645 2650 2655
Leu Val Arg Asn Asn Asp Ala Ile Cys Gly Phe Ser Asn His Leu
2660 2665 2670
Val Asn Val Asn Lys
2675
<210> 91
<211> 2672
<212> PRT
<213> human coronavirus 229E
<220>
<221> MISC_FEATURE
<223> ORF 1B
<400> 91
Glu Pro Cys Asn Gly Thr Asp Ile Asp Tyr Cys Val Arg Ala Phe Asp
1 5 10 15
Val Tyr Asn Lys Asp Ala Ser Phe Ile Gly Lys Asn Leu Lys Ser Asn
20 25 30
Cys Val Arg Phe Lys Asn Val Asp Lys Asp Asp Ala Phe Tyr Ile Val
35 40 45
Lys Arg Cys Ile Lys Ser Val Met Asp His Glu Gln Ser Met Tyr Asn
50 55 60
Leu Leu Lys Gly Cys Asn Ala Val Ala Lys His Asp Phe Phe Thr Trp
65 70 75 80
His Glu Gly Arg Thr Ile Tyr Gly Asn Val Ser Arg Gln Asp Leu Thr
85 90 95
Lys Tyr Thr Met Met Asp Leu Cys Phe Ala Leu Arg Asn Phe Asp Glu
100 105 110
Lys Asp Cys Glu Val Phe Lys Glu Ile Leu Val Leu Thr Gly Cys Cys
115 120 125
Ser Thr Asp Tyr Phe Glu Met Lys Asn Trp Phe Asp Pro Ile Glu Asn
130 135 140
Glu Asp Ile His Arg Val Tyr Ala Ala Leu Gly Lys Val Val Ala Asn
145 150 155 160
Ala Met Leu Lys Cys Val Ala Phe Cys Asp Glu Met Val Leu Lys Gly
165 170 175
Val Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Asn Gly Asn Phe
180 185 190
Tyr Asp Phe Gly Asp Phe Val Leu Cys Pro Pro Gly Met Gly Ile Pro
195 200 205
Tyr Cys Thr Ser Tyr Tyr Ser Tyr Met Met Pro Val Met Gly Met Thr
210 215 220
Asn Cys Leu Ala Ser Glu Cys Phe Met Lys Ser Asp Ile Phe Gly Gln
225 230 235 240
Asp Phe Lys Thr Phe Asp Leu Leu Lys Tyr Asp Phe Thr Glu His Lys
245 250 255
Glu Val Leu Phe Asn Lys Tyr Phe Lys Tyr Trp Gly Gln Asp Tyr His
260 265 270
Pro Asp Cys Val Asp Cys His Asp Glu Met Cys Ile Leu His Cys Ser
275 280 285
Asn Phe Asn Thr Leu Phe Ala Thr Thr Ile Pro Asn Thr Ala Phe Gly
290 295 300
Pro Leu Cys Arg Lys Val Phe Ile Asp Gly Val Pro Val Val Ala Thr
305 310 315 320
Ala Gly Tyr His Phe Lys Gln Leu Gly Leu Val Trp Asn Lys Asp Val
325 330 335
Asn Thr His Ser Thr Arg Leu Thr Ile Thr Glu Leu Leu Gln Phe Val
340 345 350
Thr Asp Pro Thr Leu Ile Val Ala Ser Ser Pro Ala Leu Val Asp Lys
355 360 365
Arg Thr Val Cys Phe Ser Val Ala Ala Leu Ser Thr Gly Leu Thr Ser
370 375 380
Gln Thr Val Lys Pro Gly His Phe Asn Lys Glu Phe Tyr Asp Phe Leu
385 390 395 400
Arg Ser Gln Gly Phe Phe Asp Glu Gly Ser Glu Leu Thr Leu Lys His
405 410 415
Phe Phe Phe Thr Gln Lys Gly Asp Ala Ala Ile Lys Asp Phe Asp Tyr
420 425 430
Tyr Arg Tyr Asn Arg Pro Thr Met Leu Asp Ile Gly Gln Ala Arg Val
435 440 445
Ala Tyr Gln Val Ala Ala Arg Tyr Phe Asp Cys Tyr Glu Gly Gly Cys
450 455 460
Ile Thr Ser Arg Glu Val Val Val Thr Asn Leu Asn Lys Ser Ala Gly
465 470 475 480
Trp Pro Leu Asn Lys Phe Gly Lys Ala Gly Leu Tyr Tyr Glu Ser Ile
485 490 495
Ser Tyr Glu Glu Gln Asp Ala Ile Phe Ser Leu Thr Lys Arg Asn Ile
500 505 510
Leu Pro Thr Met Thr Gln Leu Asn Leu Lys Tyr Ala Ile Ser Gly Lys
515 520 525
Glu Arg Ala Arg Thr Val Gly Gly Val Ser Leu Leu Ala Thr Met Thr
530 535 540
Thr Arg Gln Phe His Gln Lys Cys Leu Lys Ser Ile Val Ala Thr Arg
545 550 555 560
Asn Ala Thr Val Val Ile Gly Thr Thr Lys Phe Tyr Gly Gly Trp Asp
565 570 575
Asn Met Leu Lys Asn Leu Met Ala Asp Val Asp Asp Pro Lys Leu Met
580 585 590
Gly Trp Asp Tyr Pro Lys Cys Asp Arg Ala Met Pro Ser Met Ile Arg
595 600 605
Met Leu Ser Ala Met Ile Leu Gly Ser Lys His Val Thr Cys Cys Thr
610 615 620
Ala Ser Asp Lys Phe Tyr Arg Leu Ser Asn Glu Leu Ala Gln Val Leu
625 630 635 640
Thr Glu Val Val Tyr Ser Asn Gly Gly Phe Tyr Phe Lys Pro Gly Gly
645 650 655
Thr Thr Ser Gly Asp Ala Thr Thr Ala Tyr Ala Asn Ser Val Phe Asn
660 665 670
Ile Phe Gln Ala Val Ser Ser Asn Ile Asn Cys Val Leu Ser Val Asn
675 680 685
Ser Ser Asn Cys Asn Asn Phe Asn Val Lys Lys Leu Gln Arg Gln Leu
690 695 700
Tyr Asp Asn Cys Tyr Arg Asn Ser Asn Val Asp Glu Ser Phe Val Asp
705 710 715 720
Asp Phe Tyr Gly Tyr Leu Gln Lys His Phe Ser Met Met Ile Leu Ser
725 730 735
Asp Asp Ser Val Val Cys Tyr Asn Lys Thr Tyr Ala Gly Leu Gly Tyr
740 745 750
Ile Ala Asp Ile Ser Ala Phe Lys Ala Thr Leu Tyr Tyr Gln Asn Gly
755 760 765
Val Phe Met Ser Thr Ala Lys Cys Trp Thr Glu Glu Asp Leu Ser Ile
770 775 780
Gly Pro His Glu Phe Cys Ser Gln His Thr Met Gln Ile Val Asp Glu
785 790 795 800
Asn Gly Lys Tyr Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Ile Ser
805 810 815
Ala Gly Val Phe Val Asp Asp Ile Thr Lys Thr Asp Ala Val Ile Leu
820 825 830
Leu Glu Arg Tyr Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Ser Lys
835 840 845
His Pro Lys Pro Glu Tyr Arg Lys Val Phe Tyr Ala Leu Leu Asp Trp
850 855 860
Val Lys His Leu Asn Lys Thr Leu Asn Glu Gly Val Leu Glu Ser Phe
865 870 875 880
Ser Val Thr Leu Leu Asp Glu His Glu Ser Lys Phe Trp Asp Glu Ser
885 890 895
Phe Tyr Ala Ser Met Tyr Glu Lys Ser Thr Val Leu Gln Ala Ala Gly
900 905 910
Leu Cys Val Val Cys Gly Ser Gln Thr Val Leu Arg Cys Gly Asp Cys
915 920 925
Leu Arg Arg Pro Met Leu Cys Thr Lys Cys Ala Tyr Asp His Val Phe
930 935 940
Gly Thr Asp His Lys Phe Ile Leu Ala Ile Thr Pro Tyr Val Cys Asn
945 950 955 960
Thr Ser Gly Cys Asn Val Asn Asp Val Thr Lys Leu Tyr Leu Gly Gly
965 970 975
Leu Asn Tyr Tyr Cys Val Asp His Lys Pro His Leu Ser Phe Pro Leu
980 985 990
Cys Ser Ala Gly Asn Val Phe Gly Leu Tyr Lys Ser Ser Ala Leu Gly
995 1000 1005
Ser Met Asp Ile Asp Val Phe Asn Lys Leu Ser Thr Ser Asp Trp
1010 1015 1020
Ser Asp Ile Arg Asp Tyr Lys Leu Ala Asn Asp Ala Lys Glu Ser
1025 1030 1035
Leu Arg Leu Phe Ala Ala Glu Thr Val Lys Ala Lys Glu Glu Ser
1040 1045 1050
Val Lys Ser Ser Tyr Ala Tyr Ala Thr Leu Lys Glu Ile Val Gly
1055 1060 1065
Pro Lys Glu Leu Leu Leu Leu Trp Glu Ser Gly Lys Ala Lys Pro
1070 1075 1080
Pro Leu Asn Arg Asn Ser Val Phe Thr Cys Phe Gln Ile Thr Lys
1085 1090 1095
Asp Ser Lys Phe Gln Val Gly Glu Phe Val Phe Glu Lys Val Asp
1100 1105 1110
Tyr Gly Ser Asp Thr Val Thr Tyr Lys Ser Thr Ala Thr Thr Lys
1115 1120 1125
Leu Val Pro Gly Met Leu Phe Ile Leu Thr Ser His Asn Val Ala
1130 1135 1140
Pro Leu Arg Ala Pro Thr Met Ala Asn Gln Glu Lys Tyr Ser Thr
1145 1150 1155
Ile Tyr Lys Leu His Pro Ser Phe Asn Val Ser Asp Ala Tyr Ala
1160 1165 1170
Asn Leu Val Pro Tyr Tyr Gln Leu Ile Gly Lys Gln Arg Ile Thr
1175 1180 1185
Thr Ile Gln Gly Pro Pro Gly Ser Gly Lys Ser His Cys Ser Ile
1190 1195 1200
Gly Ile Gly Val Tyr Tyr Pro Gly Ala Arg Ile Val Phe Thr Ala
1205 1210 1215
Cys Ser His Ala Ala Val Asp Ser Leu Cys Ala Lys Ala Val Thr
1220 1225 1230
Ala Tyr Ser Val Asp Lys Cys Thr Arg Ile Ile Pro Ala Arg Ala
1235 1240 1245
Arg Val Glu Cys Tyr Ser Gly Phe Lys Pro Asn Asn Asn Ser Ala
1250 1255 1260
Gln Tyr Val Phe Ser Thr Val Asn Ala Leu Pro Glu Val Asn Ala
1265 1270 1275
Asp Ile Val Val Val Asp Glu Val Ser Met Cys Thr Asn Tyr Asp
1280 1285 1290
Leu Ser Val Ile Asn Gln Arg Ile Ser Tyr Lys His Ile Val Tyr
1295 1300 1305
Val Gly Asp Pro Gln Gln Leu Pro Ala Pro Arg Val Leu Ile Ser
1310 1315 1320
Lys Gly Val Met Glu Pro Ile Asp Tyr Asn Val Val Thr Gln Arg
1325 1330 1335
Met Cys Ala Ile Gly Pro Asp Val Phe Leu His Lys Cys Tyr Arg
1340 1345 1350
Cys Pro Ala Glu Ile Val Asn Thr Val Ser Glu Leu Val Tyr Glu
1355 1360 1365
Asn Lys Phe Val Pro Val Lys Glu Ala Ser Lys Gln Cys Phe Lys
1370 1375 1380
Ile Phe Glu Arg Gly Ser Val Gln Val Asp Asn Gly Ser Ser Ile
1385 1390 1395
Asn Arg Arg Gln Leu Asp Val Val Lys Arg Phe Ile His Lys Asn
1400 1405 1410
Ser Thr Trp Ser Lys Ala Val Phe Ile Ser Pro Tyr Asn Ser Gln
1415 1420 1425
Asn Tyr Val Ala Ala Arg Leu Leu Gly Leu Gln Thr Gln Thr Val
1430 1435 1440
Asp Ser Ala Gln Gly Ser Glu Tyr Asp Tyr Val Ile Phe Ala Gln
1445 1450 1455
Thr Ser Asp Thr Ala His Ala Cys Asn Ala Asn Arg Phe Asn Val
1460 1465 1470
Ala Ile Thr Arg Ala Lys Lys Gly Ile Phe Cys Ile Met Ser Asp
1475 1480 1485
Arg Thr Leu Phe Asp Ala Leu Lys Phe Phe Glu Ile Thr Met Thr
1490 1495 1500
Asp Leu Gln Ser Glu Ser Ser Cys Gly Leu Phe Lys Asp Cys Ala
1505 1510 1515
Arg Asn Pro Ile Asp Leu Pro Pro Ser His Ala Thr Thr Tyr Leu
1520 1525 1530
Ser Leu Ser Asp Arg Phe Lys Thr Ser Gly Asp Leu Ala Val Gln
1535 1540 1545
Ile Gly Asn Asn Asn Val Cys Thr Tyr Glu His Val Ile Ser Tyr
1550 1555 1560
Met Gly Phe Arg Phe Asp Val Ser Met Pro Gly Ser His Ser Leu
1565 1570 1575
Phe Cys Thr Arg Asp Phe Ala Met Arg His Val Arg Gly Trp Leu
1580 1585 1590
Gly Met Asp Val Glu Gly Ala His Val Thr Gly Asp Asn Val Gly
1595 1600 1605
Thr Asn Val Pro Leu Gln Val Gly Phe Ser Asn Gly Val Asp Phe
1610 1615 1620
Val Ala Gln Pro Glu Gly Cys Val Leu Thr Asn Thr Gly Ser Val
1625 1630 1635
Val Lys Pro Val Arg Ala Arg Ala Pro Pro Gly Glu Gln Phe Thr
1640 1645 1650
His Ile Val Pro Leu Leu Arg Lys Gly Gln Pro Trp Ser Val Leu
1655 1660 1665
Arg Lys Arg Ile Val Gln Met Ile Ala Asp Phe Leu Ala Gly Ser
1670 1675 1680
Ser Asp Val Leu Val Phe Val Leu Trp Ala Gly Gly Leu Glu Leu
1685 1690 1695
Thr Thr Met Arg Tyr Phe Val Lys Ile Gly Ala Val Lys His Cys
1700 1705 1710
Gln Cys Gly Thr Val Ala Thr Cys Tyr Asn Ser Val Ser Asn Asp
1715 1720 1725
Tyr Cys Cys Phe Lys His Ala Leu Gly Cys Asp Tyr Val Tyr Asn
1730 1735 1740
Pro Tyr Val Ile Asp Ile Gln Gln Trp Gly Tyr Val Gly Ser Leu
1745 1750 1755
Ser Thr Asn His His Ala Ile Cys Asn Val His Arg Asn Glu His
1760 1765 1770
Val Ala Ser Gly Asp Ala Ile Met Thr Arg Cys Leu Ala Val Tyr
1775 1780 1785
Asp Cys Phe Val Lys Asn Val Asp Trp Ser Ile Thr Tyr Pro Met
1790 1795 1800
Ile Ala Asn Glu Asn Ala Ile Asn Lys Gly Gly Arg Thr Val Gln
1805 1810 1815
Ser His Ile Met Arg Ala Ala Ile Lys Leu Tyr Asn Pro Lys Ala
1820 1825 1830
Ile His Asp Ile Gly Asn Pro Lys Gly Ile Arg Cys Ala Val Thr
1835 1840 1845
Asp Ala Lys Trp Tyr Cys Tyr Asp Lys Asn Pro Ile Asn Ser Asn
1850 1855 1860
Val Lys Thr Leu Glu Tyr Asp Tyr Met Thr His Gly Gln Met Asp
1865 1870 1875
Gly Leu Cys Leu Phe Trp Asn Cys Asn Val Asp Met Tyr Pro Glu
1880 1885 1890
Phe Ser Ile Val Cys Arg Phe Asp Thr Arg Thr Arg Ser Thr Leu
1895 1900 1905
Asn Leu Glu Gly Val Asn Gly Gly Ser Leu Tyr Val Asn Asn His
1910 1915 1920
Ala Phe His Thr Pro Ala Tyr Asp Lys Arg Ala Met Ala Lys Leu
1925 1930 1935
Lys Pro Ala Pro Phe Phe Tyr Tyr Asp Asp Gly Ser Cys Glu Val
1940 1945 1950
Val His Asp Gln Val Asn Tyr Val Pro Leu Arg Ala Thr Asn Cys
1955 1960 1965
Ile Thr Lys Cys Asn Ile Gly Gly Ala Val Cys Ser Lys His Ala
1970 1975 1980
Asn Leu Tyr Arg Ala Tyr Val Glu Ser Tyr Asn Ile Phe Thr Gln
1985 1990 1995
Ala Gly Phe Asn Ile Trp Val Pro Thr Thr Phe Asp Cys Tyr Asn
2000 2005 2010
Leu Trp Gln Thr Phe Thr Glu Val Asn Leu Gln Gly Leu Glu Asn
2015 2020 2025
Ile Ala Phe Asn Val Val Asn Lys Gly Ser Phe Val Gly Ala Asp
2030 2035 2040
Gly Glu Leu Pro Val Ala Ile Ser Gly Asp Lys Val Phe Val Arg
2045 2050 2055
Asp Gly Asn Thr Asp Asn Leu Val Phe Val Asn Lys Thr Ser Leu
2060 2065 2070
Pro Thr Asn Ile Ala Phe Glu Leu Phe Ala Lys Arg Lys Val Gly
2075 2080 2085
Leu Thr Pro Pro Leu Ser Ile Leu Lys Asn Leu Gly Val Val Ala
2090 2095 2100
Thr Tyr Lys Phe Val Leu Trp Asp Tyr Glu Ala Glu Arg Pro Leu
2105 2110 2115
Thr Ser Phe Thr Lys Ser Val Cys Gly Tyr Thr Asp Phe Ala Glu
2120 2125 2130
Asp Val Cys Thr Cys Tyr Asp Asn Ser Ile Gln Gly Ser Tyr Glu
2135 2140 2145
Arg Phe Thr Leu Ser Thr Asn Ala Val Leu Phe Ser Ala Thr Ala
2150 2155 2160
Val Lys Thr Gly Gly Lys Ser Leu Pro Ala Ile Lys Leu Asn Phe
2165 2170 2175
Gly Met Leu Asn Gly Asn Ala Ile Ala Thr Val Lys Ser Glu Asp
2180 2185 2190
Gly Asn Ile Lys Asn Ile Asn Trp Phe Val Tyr Val Arg Lys Asp
2195 2200 2205
Gly Lys Pro Val Asp His Tyr Asp Gly Phe Tyr Thr Gln Gly Arg
2210 2215 2220
Asn Leu Gln Asp Phe Leu Pro Arg Ser Thr Met Glu Glu Asp Phe
2225 2230 2235
Leu Asn Met Asp Ile Gly Val Phe Ile Gln Lys Tyr Gly Leu Glu
2240 2245 2250
Asp Phe Asn Phe Glu His Val Val Tyr Gly Asp Val Ser Lys Thr
2255 2260 2265
Thr Leu Gly Gly Leu His Leu Leu Ile Ser Gln Val Arg Leu Ser
2270 2275 2280
Lys Met Gly Ile Leu Lys Ala Glu Glu Phe Val Ala Ala Ser Asp
2285 2290 2295
Ile Thr Leu Lys Cys Cys Thr Val Thr Tyr Leu Asn Asp Pro Ser
2300 2305 2310
Ser Lys Thr Val Cys Thr Tyr Met Asp Leu Leu Leu Asp Asp Phe
2315 2320 2325
Val Ser Val Leu Lys Ser Leu Asp Leu Thr Val Val Ser Lys Val
2330 2335 2340
His Glu Val Ile Ile Asp Asn Lys Pro Trp Arg Trp Met Leu Trp
2345 2350 2355
Cys Lys Asp Asn Ala Val Ala Thr Phe Tyr Pro Gln Leu Gln Ser
2360 2365 2370
Ala Glu Trp Lys Cys Gly Tyr Ser Met Pro Gly Ile Tyr Lys Thr
2375 2380 2385
Gln Arg Met Cys Leu Glu Pro Cys Asn Leu Tyr Asn Tyr Gly Ala
2390 2395 2400
Gly Leu Lys Leu Pro Ser Gly Ile Met Phe Asn Val Val Lys Tyr
2405 2410 2415
Thr Gln Leu Cys Gln Tyr Phe Asn Ser Thr Thr Leu Cys Val Pro
2420 2425 2430
His Asn Met Arg Val Leu His Leu Gly Ala Gly Ser Asp Tyr Gly
2435 2440 2445
Val Ala Pro Gly Thr Ala Val Leu Lys Arg Trp Leu Pro His Asp
2450 2455 2460
Ala Ile Val Val Asp Asn Asp Val Val Asp Tyr Val Ser Asp Ala
2465 2470 2475
Asp Phe Ser Val Thr Gly Asp Cys Ala Thr Val Tyr Leu Glu Asp
2480 2485 2490
Lys Phe Asp Leu Leu Ile Ser Asp Met Tyr Asp Gly Arg Thr Lys
2495 2500 2505
Ala Ile Asp Gly Glu Asn Val Ser Lys Glu Gly Phe Phe Thr Tyr
2510 2515 2520
Ile Asn Gly Phe Ile Cys Glu Lys Leu Ala Ile Gly Gly Ser Ile
2525 2530 2535
Ala Ile Lys Val Thr Glu Tyr Ser Trp Asn Lys Lys Leu Tyr Glu
2540 2545 2550
Leu Val Gln Arg Phe Ser Phe Trp Thr Met Phe Cys Thr Ser Val
2555 2560 2565
Asn Thr Ser Ser Ser Glu Ala Phe Val Val Gly Ile Asn Tyr Leu
2570 2575 2580
Gly Asp Phe Ala Gln Gly Pro Phe Ile Asp Gly Asn Ile Ile His
2585 2590 2595
Ala Asn Tyr Val Phe Trp Arg Asn Ser Thr Val Met Ser Leu Ser
2600 2605 2610
Tyr Asn Ser Val Leu Asp Leu Ser Lys Phe Asn Cys Lys His Lys
2615 2620 2625
Ala Thr Val Val Val Gln Leu Lys Asp Ser Asp Ile Asn Glu Met
2630 2635 2640
Val Leu Ser Leu Val Arg Ser Gly Lys Leu Leu Val Arg Gly Asn
2645 2650 2655
Gly Lys Cys Leu Ser Phe Ser Asn His Leu Val Ser Thr Lys
2660 2665 2670
<210> 92
<211> 448
<212> PRT
<213> human coronavirus OC43
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 92
Met Ser Phe Thr Pro Gly Lys Gln Ser Ser Ser Arg Ala Ser Ser Gly
1 5 10 15
Asn Arg Ser Gly Asn Gly Ile Leu Lys Trp Ala Asp Gln Ser Asp Gln
20 25 30
Phe Arg Asn Val Gln Thr Arg Gly Arg Arg Ala Gln Pro Lys Gln Thr
35 40 45
Ala Thr Ser Gln Gln Pro Ser Gly Gly Asn Val Val Pro Tyr Tyr Ser
50 55 60
Trp Phe Ser Gly Ile Thr Gln Phe Gln Lys Gly Lys Glu Phe Glu Phe
65 70 75 80
Val Glu Gly Gln Gly Val Pro Ile Ala Pro Gly Val Pro Ala Thr Glu
85 90 95
Ala Lys Gly Tyr Trp Tyr Arg His Asn Arg Arg Ser Phe Lys Thr Ala
100 105 110
Asp Gly Asn Gln Arg Gln Leu Leu Pro Arg Trp Tyr Phe Tyr Tyr Leu
115 120 125
Gly Thr Gly Pro His Ala Lys Asp Gln Tyr Gly Thr Asp Ile Asp Gly
130 135 140
Val Tyr Trp Val Ala Ser Asn Gln Ala Asp Val Asn Thr Pro Ala Asp
145 150 155 160
Ile Val Asp Arg Asp Pro Ser Ser Asp Glu Ala Ile Pro Thr Arg Phe
165 170 175
Pro Pro Gly Thr Val Leu Pro Gln Gly Tyr Tyr Ile Glu Gly Ser Gly
180 185 190
Arg Ser Ala Pro Asn Ser Arg Ser Thr Ser Arg Thr Ser Ser Arg Ala
195 200 205
Ser Ser Ala Gly Ser Arg Ser Arg Ala Asn Ser Gly Asn Arg Thr Pro
210 215 220
Thr Ser Gly Val Thr Pro Asp Met Ala Asp Gln Ile Ala Ser Leu Val
225 230 235 240
Leu Ala Lys Leu Gly Lys Asp Ala Thr Lys Pro Gln Gln Val Thr Lys
245 250 255
His Thr Ala Lys Glu Val Arg Gln Lys Ile Leu Asn Lys Pro Arg Gln
260 265 270
Lys Arg Ser Pro Asn Lys Gln Cys Thr Val Gln Gln Cys Phe Gly Lys
275 280 285
Arg Gly Pro Asn Gln Asn Phe Gly Gly Gly Glu Met Leu Lys Leu Gly
290 295 300
Thr Ser Asp Pro Gln Phe Pro Ile Leu Ala Glu Leu Ala Pro Thr Ala
305 310 315 320
Gly Ala Phe Phe Phe Gly Ser Arg Leu Glu Leu Ala Lys Val Gln Asn
325 330 335
Leu Ser Gly Asn Pro Asp Glu Pro Gln Lys Asp Val Tyr Glu Leu Arg
340 345 350
Tyr Asn Gly Ala Ile Arg Phe Asp Ser Thr Leu Ser Gly Phe Glu Thr
355 360 365
Ile Met Lys Val Leu Asn Glu Asn Leu Asn Ala Tyr Gln Gln Gln Asp
370 375 380
Gly Met Met Asn Met Ser Pro Lys Pro Gln Arg Gln Arg Gly His Lys
385 390 395 400
Asn Gly Gln Gly Glu Asn Asp Asn Ile Ser Val Ala Val Pro Lys Ser
405 410 415
Arg Val Gln Gln Asn Lys Ser Arg Glu Leu Thr Ala Glu Asp Ile Ser
420 425 430
Leu Leu Lys Lys Met Asp Glu Pro Tyr Thr Glu Asp Thr Ser Glu Ile
435 440 445
<210> 93
<211> 451
<212> PRT
<213> murine hepatitis virus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 93
Met Ser Phe Val Pro Gly Gln Glu Asn Ala Gly Ser Arg Ser Ser Ser
1 5 10 15
Gly Asn Arg Ala Gly Asn Gly Ile Leu Lys Lys Thr Thr Trp Ala Asp
20 25 30
Gln Thr Glu Arg Gly Asn Arg Gly Arg Arg Asn His Pro Lys Gln Thr
35 40 45
Ala Thr Thr Gln Pro Asn Ala Gly Ser Val Val Pro His Tyr Ser Trp
50 55 60
Phe Ser Gly Ile Thr Gln Phe Gln Lys Gly Lys Glu Phe Gln Phe Ala
65 70 75 80
Gln Gly Gln Gly Val Pro Ile Ala Ser Gly Ile Pro Ala Ser Glu Gln
85 90 95
Lys Gly Tyr Trp Tyr Arg His Asn Arg Arg Ser Phe Lys Thr Pro Asp
100 105 110
Gly Gln His Lys Gln Leu Leu Pro Arg Trp Tyr Phe Tyr Tyr Leu Gly
115 120 125
Thr Gly Pro His Ala Gly Ala Glu Tyr Gly Asp Asp Ile Glu Gly Val
130 135 140
Val Trp Val Ala Ser Gln Gln Ala Asp Thr Lys Thr Thr Ala Asp Val
145 150 155 160
Val Glu Arg Asp Pro Ser Ser His Glu Ala Ile Pro Thr Arg Phe Ala
165 170 175
Pro Gly Thr Val Leu Pro Gln Gly Phe Tyr Val Glu Gly Ser Gly Arg
180 185 190
Ser Ala Pro Ala Ser Arg Ser Gly Ser Arg Ser Gln Ser Arg Gly Pro
195 200 205
Asn Asn Arg Ala Arg Ser Ser Ser Asn Gln Arg Gln Pro Ala Ser Ala
210 215 220
Val Lys Pro Asp Met Ala Glu Glu Ile Ala Ala Leu Val Leu Ala Lys
225 230 235 240
Leu Gly Lys Asp Ala Gly Gln Pro Lys Gln Val Thr Lys Gln Ser Ala
245 250 255
Lys Glu Val Arg Gln Lys Ile Leu Thr Lys Pro Arg Gln Lys Arg Thr
260 265 270
Pro Asn Lys Gln Cys Pro Val Gln Gln Cys Phe Gly Lys Arg Gly Pro
275 280 285
Asn Gln Asn Phe Gly Gly Ser Glu Met Leu Lys Leu Gly Thr Ser Asp
290 295 300
Pro Gln Phe Pro Ile Leu Ala Glu Leu Ala Pro Thr Pro Ser Ala Phe
305 310 315 320
Phe Phe Gly Ser Lys Leu Glu Leu Val Lys Lys Asn Ser Gly Gly Ala
325 330 335
Asp Glu Pro Thr Lys Asp Val Tyr Glu Leu Gln Tyr Ser Gly Ala Ile
340 345 350
Arg Phe Asp Ser Thr Leu Pro Gly Phe Glu Thr Ile Met Lys Val Leu
355 360 365
Thr Glu Asn Leu Asn Ala Tyr Gln Asp Gln Ala Gly Ser Val Asp Leu
370 375 380
Val Ser Pro Lys Pro Pro Arg Arg Gly Arg Arg Gln Ala Gln Glu Lys
385 390 395 400
Lys Asp Glu Val Asp Asn Val Ser Val Ala Lys Pro Lys Ser Leu Val
405 410 415
Gln Arg Asn Val Ser Arg Glu Leu Thr Pro Glu Asp Arg Ser Leu Leu
420 425 430
Ala Gln Ile Leu Asp Asp Gly Val Val Pro Asp Gly Leu Glu Asp Asp
435 440 445
Ser Asn Val
450
<210> 94
<211> 409
<212> PRT
<213> avian infectious bronchitis virus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 94
Met Ala Ser Gly Lys Ala Ala Gly Lys Thr Asp Ala Pro Ala Pro Val
1 5 10 15
Ile Lys Leu Gly Gly Pro Lys Pro Pro Lys Val Gly Ser Ser Gly Asn
20 25 30
Ala Ser Trp Phe Gln Ala Ile Lys Ala Lys Lys Leu Asn Thr Pro Pro
35 40 45
Pro Lys Phe Glu Gly Ser Gly Val Pro Asp Asn Glu Asn Ile Lys Pro
50 55 60
Ser Gln Gln His Gly Tyr Trp Arg Arg Gln Ala Arg Phe Lys Pro Gly
65 70 75 80
Lys Gly Gly Arg Lys Pro Val Pro Asp Ala Trp Tyr Phe Tyr Tyr Thr
85 90 95
Gly Thr Gly Pro Ala Ala Asp Leu Asn Trp Gly Asp Thr Gln Asp Gly
100 105 110
Ile Val Trp Val Ala Ala Lys Gly Ala Asp Thr Lys Ser Arg Ser Asn
115 120 125
Gln Gly Thr Arg Asp Pro Asp Lys Phe Asp Gln Tyr Pro Leu Arg Phe
130 135 140
Ser Asp Gly Gly Pro Asp Gly Asn Phe Arg Trp Asp Phe Ile Pro Leu
145 150 155 160
Asn Arg Gly Arg Ser Gly Arg Ser Thr Ala Ala Ser Ser Ala Ala Ala
165 170 175
Ser Arg Ala Pro Ser Arg Glu Gly Ser Arg Gly Arg Arg Ser Asp Ser
180 185 190
Gly Asp Asp Leu Ile Ala Arg Ala Ala Lys Ile Ile Gln Asp Gln Gln
195 200 205
Lys Lys Gly Ser Arg Ile Thr Lys Ala Lys Ala Asp Glu Met Ala His
210 215 220
Arg Arg Tyr Cys Lys Arg Thr Ile Pro Pro Asn Tyr Arg Val Asp Gln
225 230 235 240
Val Phe Gly Pro Arg Thr Lys Gly Lys Glu Gly Asn Phe Gly Asp Asp
245 250 255
Lys Met Asn Glu Glu Gly Ile Lys Asp Gly Arg Val Thr Ala Met Leu
260 265 270
Asn Leu Val Pro Ser Ser His Ala Cys Leu Phe Gly Ser Arg Val Thr
275 280 285
Pro Lys Leu Gln Leu Asp Gly Leu His Leu Arg Phe Glu Phe Thr Thr
290 295 300
Val Val Pro Cys Asp Asp Pro Gln Phe Asp Asn Tyr Val Lys Ile Cys
305 310 315 320
Asp Gln Cys Val Asp Gly Val Gly Thr Arg Pro Lys Asp Asp Glu Pro
325 330 335
Lys Pro Lys Ser Arg Ser Ser Ser Arg Pro Ala Thr Arg Gly Asn Ser
340 345 350
Pro Ala Pro Arg Gln Gln Arg Pro Lys Lys Glu Lys Lys Leu Lys Lys
355 360 365
Gln Asp Asp Glu Ala Asp Lys Ala Leu Thr Ser Asp Glu Glu Arg Asn
370 375 380
Asn Ala Gln Leu Glu Phe Tyr Asp Glu Pro Lys Val Ile Asn Trp Gly
385 390 395 400
Asp Ala Ala Leu Gly Glu Asn Glu Leu
405
<210> 95
<211> 448
<212> PRT
<213> bovine corona virus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 95
Met Ser Phe Thr Pro Gly Lys Gln Ser Ser Ser Arg Ala Ser Phe Gly
1 5 10 15
Asn Arg Ser Gly Asn Gly Ile Leu Lys Trp Ala Asp Gln Ser Asp Gln
20 25 30
Ser Arg Asn Val Gln Thr Arg Gly Arg Arg Ala Gln Pro Lys Gln Thr
35 40 45
Ala Thr Ser Gln Leu Pro Ser Gly Gly Asn Val Val Pro Tyr Tyr Ser
50 55 60
Trp Phe Ser Gly Ile Thr Gln Phe Gln Lys Gly Lys Glu Phe Glu Phe
65 70 75 80
Ala Glu Gly Gln Gly Val Pro Ile Ala Pro Gly Val Pro Ala Thr Glu
85 90 95
Ala Lys Gly Tyr Trp Tyr Arg His Asn Arg Arg Ser Phe Lys Thr Ala
100 105 110
Asp Gly Asn Gln Arg Gln Leu Leu Pro Arg Trp Tyr Phe Tyr Tyr Leu
115 120 125
Gly Thr Gly Pro His Ala Lys Asp Gln Tyr Gly Thr Asp Ile Asp Gly
130 135 140
Val Phe Trp Val Ala Ser Asn Gln Ala Asp Val Asn Thr Pro Ala Asp
145 150 155 160
Ile Leu Asp Arg Asp Pro Ser Ser Asp Glu Ala Ile Pro Thr Arg Phe
165 170 175
Pro Pro Gly Thr Val Leu Pro Gln Gly Tyr Tyr Ile Glu Gly Ser Gly
180 185 190
Arg Ser Ala Pro Asn Ser Arg Ser Thr Ser Arg Ala Ser Ser Arg Ala
195 200 205
Ser Ser Ala Gly Ser Arg Ser Arg Ala Asn Ser Gly Asn Arg Thr Pro
210 215 220
Thr Ser Gly Val Thr Pro Asp Met Ala Asp Gln Ile Ala Ser Leu Val
225 230 235 240
Leu Ala Lys Leu Gly Lys Asp Ala Thr Lys Pro Gln Gln Val Thr Lys
245 250 255
Gln Thr Ala Lys Glu Ile Arg Gln Lys Ile Leu Asn Lys Pro Arg Gln
260 265 270
Lys Arg Ser Pro Asn Lys Gln Cys Thr Val Gln Gln Cys Phe Gly Lys
275 280 285
Arg Gly Pro Asn Gln Asn Phe Gly Gly Gly Glu Met Leu Lys Leu Gly
290 295 300
Thr Ser Asp Pro Gln Phe Pro Ile Leu Ala Glu Leu Ala Pro Thr Ala
305 310 315 320
Gly Ala Phe Phe Phe Gly Ser Arg Leu Glu Leu Ala Lys Val Gln Asn
325 330 335
Leu Ser Gly Asn Leu Asp Glu Pro Gln Lys Asp Val Tyr Glu Leu Arg
340 345 350
Tyr Asn Gly Ala Ile Arg Phe Asp Ser Thr Leu Ser Gly Phe Glu Thr
355 360 365
Ile Met Lys Val Leu Asn Glu Asn Leu Asn Ala Tyr Gln Gln Gln Asp
370 375 380
Gly Met Met Asn Met Ser Pro Lys Pro Gln Arg Gln Arg Gly Gln Lys
385 390 395 400
Asn Gly Gln Gly Glu Asn Asp Asn Ile Ser Val Ala Ala Pro Lys Ser
405 410 415
Arg Val Gln Gln Asn Lys Ser Arg Glu Leu Thr Ala Glu Asp Ile Ser
420 425 430
Leu Leu Lys Lys Met Asp Glu Pro Tyr Thr Glu Asp Thr Ser Glu Ile
435 440 445
<210> 96
<211> 449
<212> PRT
<213> porcine haemagglutinating encaphalomyelitis virus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 96
Met Ser Phe Thr Pro Gly Lys Gln Ser Ser Ser Arg Ala Ser Ser Gly
1 5 10 15
Asn Arg Ser Gly Asn Gly Ile Leu Lys Trp Ala Asp Gln Ser Asp Gln
20 25 30
Ser Arg Asn Val Gln Thr Arg Gly Arg Arg Val Gln Ser Lys Gln Thr
35 40 45
Ala Thr Ser Gln Gln Pro Ser Gly Gly Thr Val Val Pro Tyr Tyr Ser
50 55 60
Trp Phe Ser Gly Ile Thr Gln Phe Gln Lys Gly Lys Glu Phe Glu Phe
65 70 75 80
Ala Glu Gly Gln Gly Val Pro Ile Ala Pro Gly Val Pro Ser Thr Glu
85 90 95
Ala Lys Gly Tyr Trp Tyr Arg His Asn Arg Arg Ser Phe Lys Thr Ala
100 105 110
Asp Gly Asn Gln Arg Gln Leu Leu Pro Arg Trp Tyr Phe Tyr Tyr Leu
115 120 125
Gly Thr Gly Pro His Ala Lys Asp Gln Tyr Gly Thr Asp Ile Asp Gly
130 135 140
Val Phe Trp Val Ala Ser Asn Gln Ala Asp Ile Asn Thr Pro Ala Asp
145 150 155 160
Ile Val Asp Arg Asp Pro Ser Ser Asp Glu Ala Ile Pro Thr Arg Phe
165 170 175
Pro Pro Gly Thr Val Leu Pro Gln Gly Tyr Tyr Ile Glu Gly Ser Gly
180 185 190
Arg Ser Ala Pro Asn Ser Arg Ser Thr Ser Arg Ala Pro Asn Arg Ala
195 200 205
Pro Ser Ala Gly Ser Arg Ser Arg Ala Asn Ser Gly Asn Arg Thr Ser
210 215 220
Thr Pro Gly Val Thr Pro Asp Met Ala Asp Gln Ile Ala Ser Leu Val
225 230 235 240
Leu Ala Lys Leu Gly Lys Asp Ala Thr Lys Pro Gln Gln Val Thr Lys
245 250 255
Gln Thr Ala Lys Glu Val Arg Gln Lys Ile Leu Asn Lys Pro Arg Gln
260 265 270
Lys Arg Ser Pro Asn Lys Gln Cys Thr Val Gln Gln Cys Phe Gly Lys
275 280 285
Arg Gly Pro Asn Gln Asn Phe Gly Gly Gly Glu Met Leu Lys Leu Gly
290 295 300
Thr Ser Asp Pro Gln Phe Pro Ile Leu Ala Glu Leu Ala Pro Thr Ala
305 310 315 320
Gly Ala Phe Phe Phe Gly Ser Arg Leu Glu Leu Ala Lys Val Gln Asn
325 330 335
Leu Ser Gly Asn Pro Asp Glu Pro Gln Lys Asp Val Tyr Glu Leu Arg
340 345 350
Tyr Asn Gly Ala Ile Arg Phe Asp Ser Thr Leu Ser Gly Phe Glu Thr
355 360 365
Ile Met Lys Val Leu Asn Gln Asn Leu Asn Ala Tyr Gln His Gln Glu
370 375 380
Asp Gly Met Met Asn Ile Ser Pro Lys Pro Gln Arg Gln Arg Gly Gln
385 390 395 400
Lys Asn Gly Gln Val Glu Asn Asp Asn Val Ser Val Ala Ala Pro Lys
405 410 415
Ser Arg Val Gln Gln Asn Lys Ser Arg Glu Leu Thr Ala Glu Asp Ile
420 425 430
Ser Leu Leu Lys Lys Met Asp Glu Pro Tyr Thr Glu Asp Thr Ser Glu
435 440 445
Ile
<210> 97
<211> 422
<212> PRT
<213> human SARS virus
<220>
<221> MISC_FEATURE
<223> ORF N
<400> 97
Met Ser Asp Asn Gly Pro Gln Ser Asn Gln Arg Ser Ala Pro Arg Ile
1 5 10 15
Thr Phe Gly Gly Pro Thr Asp Ser Thr Asp Asn Asn Gln Asn Gly Gly
20 25 30
Arg Asn Gly Ala Arg Pro Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn
35 40 45
Asn Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Glu
50 55 60
Leu Arg Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Gly
65 70 75 80
Pro Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Val Arg
85 90 95
Gly Gly Asp Gly Lys Met Lys Glu Leu Ser Pro Arg Trp Tyr Phe Tyr
100 105 110
Tyr Leu Gly Thr Gly Pro Glu Ala Ser Leu Pro Tyr Gly Ala Asn Lys
115 120 125
Glu Gly Ile Val Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys
130 135 140
Asp His Ile Gly Thr Arg Asn Pro Asn Asn Asn Ala Ala Thr Val Leu
145 150 155 160
Gln Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly
165 170 175
Ser Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg
180 185 190
Gly Asn Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Asn Ser Pro
195 200 205
Ala Arg Met Ala Ser Gly Gly Gly Glu Thr Ala Leu Ala Leu Leu Leu
210 215 220
Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys Val Ser Gly Lys Gly Gln
225 230 235 240
Gln Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser
245 250 255
Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Gln Tyr Asn Val Thr
260 265 270
Gln Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly
275 280 285
Asp Gln Asp Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln
290 295 300
Ile Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg
305 310 315 320
Ile Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr His Gly
325 330 335
Ala Ile Lys Leu Asp Asp Lys Asp Pro Gln Phe Lys Asp Asn Val Ile
340 345 350
Leu Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu
355 360 365
Pro Lys Lys Asp Lys Lys Lys Lys Thr Asp Glu Ala Gln Pro Leu Pro
370 375 380
Gln Arg Gln Lys Lys Gln Pro Thr Val Thr Leu Leu Pro Ala Ala Asp
385 390 395 400
Met Asp Asp Phe Ser Arg Gln Leu Gln Asn Ser Met Ser Gly Ala Ser
405 410 415
Ala Asp Ser Thr Gln Ala
420
Claims (27)
- 도 1의 서열 또는 그것의 동족체를 포함하는 분리된 본질적으로 포유동물의 양성-센스 단일가닥 RNA 바이러스(EMCR-CoV).
- 바이러스의 핵산 서열을 결정하고, 그것을 최대 가능도 트리가 100 부트스트랩과 3 점블을 이용하여 생성되는 계통트리 분석에서 시험하고, 그리고 그것이 PEDV(돼지 유행성 설사 바이러스), HCoV-229E(인간 코로나바이러스 229E), PRCoV(돼지 호흡기 코로나바이러스), TGEV(전염성 위장염 바이러스), CaCoV(개 코로나바이러스) 및 FeCoV(고양이 코로나바이러스)의 바이러스 분리물에 상응하는 것보다 도 1에 나타낸 바와 같은 서열을 갖는 바이러스 분리물에 더욱 밀접히 계통적으로 상응한다는 것을 확인함에 의해, 그것에 계통적으로 상응한다고 동정될 수 있고 그리고 코로나바이러스에 속하는 분리된 양성-센스 단일 가닥 RNA 바이러스(EMCR-CoV).
- 제1항 또는 제2항에 있어서, 상기 핵산 서열은 상기 바이러스의 바이러스 단백질을 암호화하는 개방형해독틀(ORF)을 포함하는 바이러스.
- 제3항에 있어서, 상기 개방형해독틀은 바이러스 리플리카제, 핵 캡시드 단백질, 매트릭스 단백질 및 스파이크 단백질을 암호화하는 ORF들의 군으로부터 선택되 는 바이러스.
- 제1항 내지 제4항 중 어느 한 항에 있어서, 비정형 폐렴의 인간으로부터 분리될 수 있는 바이러스.
- 제1항 내지 제5항 중 어느 한 항에 따른 바이러스로부터 얻을 수 있는 분리된 또는 재조합 핵산 또는 그것의 EMCR-CoV 바이러스-특이 기능적 단편.
- 제6항에 따른 핵산을 포함하는 벡터.
- 제6항에 따른 핵산 또는 제7항에 따른 벡터를 포함하는 숙주세포.
- 제6항에 따른 핵산에 의해 암호화된 분리된 또는 재조합 단백질성 분자 또는 그것의 EMCR-CoV 바이러스-특이 기능적 단편.
- 제 9항에 따른 단백질성 분자 또는 그것의 EMCR-CoV 바이러스-특이 기능적 단편을 포함하는 항원.
- 제10항에 따른 항원에 대해 특이적으로 지향된 항체.
- 바이러스 분리물 또는 그것의 성분을 제11항에 따른 항체와 반응시키는 것을 포함하는, 바이러스 분리물이 EMCR-CoC 바이러스임을 동정하는 방법.
- 바이러스 분리물 또는 그것의 성분을 제6항에 따른 핵산과 반응시키는 것을포함하는, 바이러스 분리물이 EMCR-CoV 바이러스임을 동정하는 방법.
- 포유동물의 샘플에서 바이러스 분리물 또는 그것의 성분의 존재를, 상기 샘플과 제6항에 따른 핵산 또는 제11항에 따른 항체를 반응시키는 것에 의해 결정하는 것을 포함하는, 포유동물의 EMCR-CoV 감염을 바이러스학적으로 진단하는 방법.
- 포유동물의 샘플에서 EMCR-CoV 바이러스 또는 그것의 성분에 대해 특이적으로 지향된 항체의 존재를, 상기 샘플과 제9항에 따른 단백질성 분자 또는 그것의 단편 또는 제 10항에 따른 항원을 반응시키는 것에 의해 결정하는 것을 포함하는, 포유동물의 EMCR-CoV 감염을 혈청학적으로 진단하는 방법.
- 제1항 내지 제5항 중 어느 한 항에 따른 바이러스, 제 6항에 따른 핵산, 제9항에 따른 단백질성 분자 또는 그것의 단편, 제 10항에 따른 항원 및/또는 제11항에 따른 항체를 포함하는 EMCR-CoV 감염을 진단하기 위한 진단 키트.
- 약제학적 조성물의 생산을 위한, 제1항 내지 제5항 중 어느 한 항에 따른 바 이러스, 제 6항에 따른 핵산, 제한 7항에 따른 벡터, 제8항에 따른 숙주 세포, 제9항에 따른 단백질성 분자 또는 그것의 단편, 제 10항에 따른 항원 또는 제11항에 따른 항체의 용도.
- 제17항에 있어서, EMCR-CoV 바이러스 감염의 치료 및 예방용 약제학적 조성물의 생산을 위한 용도.
- 제17항 또는 제18항에 있어서, 비정형 폐렴의 치료 및 예방용 약제학적 조성물의 생산을 위한 용도.
- 제1항 내지 제5항 중 어느 한 항에 따른 바이러스, 제6항에 따른 핵산, 제7항에 따른 벡터, 제8항에 따른 숙주세포, 제9항에 따른 단백질성 분자 또는 그것의 단편, 제10항에 따른 항원 또는 제11항에 따른 항체를 포함하는 약제학적 조성물.
- 개체에게 제20항에 따른 약제학적 조성물을 제공하는 것을 포함하는 EMCR-CoV 바이러스 감염의 치료 및 예방방법.
- 개체에게 제20항에 따른 약제학적 조성물을 제공하는 것을 포함하는 비정형 폐렴의 치료 및 예방방법.
- 도 1에 나타낸 바와 같이 표시된 서열 또는 그것의 동족체를 포함하는 RNA 서열에 의해 암호화되는 바이러스 리플리카제.
- 도 1에 나타낸 바와 같이 표시된 아미노산 서열 또는 그것의 단편을 포함하는 바이러스 스파이크 단백질.
- 도 1에 나타낸 바와 같이 표시된 서열 또는 그것의 동족체를 포함하는 RNA 서열에 의해 암호화된 바이러스 핵 캡시드 단백질.
- 도 1에 나타낸 바와 같이 표시된 서열 또는 그것의 단편을 포함하는 RNA 서열에 의해 암호화된 바이러스 nsp 3 또는 엔벨롭프 단백질.
- 도 1에 나타낸 바와 같은 별개의 바이러스 단백질들을 암호화하는 하나 이상의 서열 또는 스트린전트 조건에서 이들 서열 중 어느 것과 혼성화할 수 있는 핵산서열을 포함하는 핵산서열.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03078613 | 2003-11-18 | ||
EP03078613.1 | 2003-11-18 | ||
EP03078772.5 | 2003-12-01 | ||
EP03078772A EP1533370A1 (en) | 2003-11-18 | 2003-12-01 | Novel atypical pneumonia-causing virus |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20060123291A true KR20060123291A (ko) | 2006-12-01 |
Family
ID=34436724
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020067011389A KR20060123291A (ko) | 2003-11-18 | 2004-11-18 | 신규한 비정형 폐렴-원인성 바이러스 |
Country Status (6)
Country | Link |
---|---|
US (1) | US20080044426A1 (ko) |
EP (2) | EP1533370A1 (ko) |
JP (1) | JP2007511237A (ko) |
KR (1) | KR20060123291A (ko) |
CA (1) | CA2546355A1 (ko) |
WO (1) | WO2005049814A2 (ko) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
PL213926B1 (pl) | 2001-01-19 | 2013-05-31 | Vironovative Bv | Wyizolowany ssaczy metapneumowirus o ujemnej pojedynczej nici RNA (MPV), kompozycja immunogenna, wyizolowane kwasy nukleinowe, sposoby wykrywania ssaczego metapneumowirusa, wektor, komórka gospodarza, wyizolowane bialko, przeciwcialo, sposób wirologicznego diagnozowania infekcji MPV, sposób serologicznego diagnozowania infekcji MPV, kompozycja farmaceutyczna, zestaw diagnostyczny oraz zastosowanie kompozycji farmaceutycznej |
CN101098958A (zh) | 2002-02-21 | 2008-01-02 | 免疫医疗疫苗公司 | 间质肺病毒株及其在疫苗制剂中以及用作抗原性序列表达载体的用途 |
WO2005017133A1 (en) | 2003-08-18 | 2005-02-24 | Amsterdam Institute Of Viral Genomics B.V. | Coronavirus, nucleic acid, protein, and methods for the generation of vaccine, medicaments and diagnostics |
WO2006076007A2 (en) * | 2004-04-22 | 2006-07-20 | Vanderbilt University | Methods of detecting coronavirus infections |
US20160008457A1 (en) * | 2014-07-11 | 2016-01-14 | Merial, Inc. | Inactivated Vaccine for Porcine Epidemic Diarrhea Virus (PEDV) |
GB201413020D0 (en) * | 2014-07-23 | 2014-09-03 | Pribright The Inst | Coronavirus |
ES2788393T3 (es) * | 2014-09-03 | 2020-10-21 | Intervet Int Bv | Coronavirus de bovino atenuado y vacunas relacionadas |
KR20170103874A (ko) * | 2015-01-23 | 2017-09-13 | 더 트러스티스 오브 더 유니버시티 오브 펜실바니아 | 돼지 유행성 설사병 바이러스를 위한 최적화된 합성 컨센서스 dna 백신의 면역원성 |
CN107831317A (zh) * | 2017-11-01 | 2018-03-23 | 杭州微瑞科技有限公司 | 犬冠状病毒抗体快速定量检测卡及使用方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005017133A1 (en) * | 2003-08-18 | 2005-02-24 | Amsterdam Institute Of Viral Genomics B.V. | Coronavirus, nucleic acid, protein, and methods for the generation of vaccine, medicaments and diagnostics |
-
2003
- 2003-12-01 EP EP03078772A patent/EP1533370A1/en not_active Withdrawn
-
2004
- 2004-11-18 KR KR1020067011389A patent/KR20060123291A/ko not_active Application Discontinuation
- 2004-11-18 CA CA002546355A patent/CA2546355A1/en not_active Abandoned
- 2004-11-18 US US10/579,614 patent/US20080044426A1/en not_active Abandoned
- 2004-11-18 EP EP04808721A patent/EP1694830A2/en not_active Withdrawn
- 2004-11-18 JP JP2006541061A patent/JP2007511237A/ja active Pending
- 2004-11-18 WO PCT/NL2004/000805 patent/WO2005049814A2/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
EP1694830A2 (en) | 2006-08-30 |
US20080044426A1 (en) | 2008-02-21 |
JP2007511237A (ja) | 2007-05-10 |
WO2005049814A2 (en) | 2005-06-02 |
WO2005049814A3 (en) | 2006-08-17 |
EP1533370A1 (en) | 2005-05-25 |
CA2546355A1 (en) | 2005-06-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110423844B (zh) | 检测bk病毒的方法和组合物 | |
AU2008266120B2 (en) | Vaccines containing canine parvovirus genetic variants | |
Toplak et al. | Genetic typing of bovine viral diarrhoea virus: most Slovenian isolates are of genotypes 1d and 1f | |
CN103451158B (zh) | 用于犬科动物中呼吸系统疾病控制的物质和方法 | |
KR20070028547A (ko) | 바이러스에 감염되고 백신접종된 생물의 동정 | |
KR20150064104A (ko) | 사람 베타 코로나바이러스의 계통 c 및 이의 바이러스 수용체로서 n-말단 디펩티딜 펩티다아제의 확인 | |
MXPA05001098A (es) | Proteina dentada, polimerasa y hemaglutinina/esterasa del coronavirus respiratorio canino. | |
JP2007502612A (ja) | コロナウイルス、核酸、蛋白質、ならびにワクチンの生成方法、薬剤および診断 | |
CN107002047B (zh) | 瘟病毒 | |
CN113293145B (zh) | 一种麻疹病毒活载体新冠疫苗 | |
US7220852B1 (en) | Coronavirus isolated from humans | |
JPH09511914A (ja) | ペスチウイルス株のヌクレオチド配列、それらの配列によりコードされるポリペプチド、ならびにペスチウイルス感染の診断および予防のためのそれらの使用 | |
IE912586A1 (en) | Bovine respiratory syncytial virus vaccines | |
CN101821284B (zh) | 猪痢疾短螺旋体的基因和蛋白质及其用途 | |
KR20060123291A (ko) | 신규한 비정형 폐렴-원인성 바이러스 | |
Kleiboeker | Sequence analysis of the fiber genomic region of a porcine adenovirus predicts a novel fiber protein | |
CN106191215B (zh) | 肌肉萎缩相关的蛋白质分子标记Dkk-3的筛选及其应用 | |
KR20100121288A (ko) | 유전자재조합 돼지열병 백신바이러스 Flc―LOM―BErns virus 및 이의 제조방법 | |
JP3262273B2 (ja) | ブタコレラを含むペスチウイルス感染の如き感染に対する保護法、ヌクレオチド配列およびワクチンの開発および診断に使用されるポリペプチド | |
US6805867B1 (en) | Bovine rotavirus genes | |
RU2813731C2 (ru) | Новый коронавирус рыб | |
KR20100121289A (ko) | 유전자재조합 돼지열병 백신바이러스 Flc―LOM virus 및 이의 제조방법 | |
KR102314100B1 (ko) | 신규한 개 아데노바이러스 2형 항원 및 이의 용도 | |
Welzel et al. | Stable expression of nucleocapsid proteins of Puumala and Hantaan virus in mammalian cells | |
JPH04506747A (ja) | リサウィルス感染症の検出法および/または同定法、モコラリサウィルスのペプチドおよび/またはペプチドの断片をコードする遺伝子のクローン化と発現、モコラウィルスおよび/またはリサウィルス群に対するワクチン、および遺伝子工学による前記ワクチンの製造方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
N231 | Notification of change of applicant | ||
WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |