CN109851678A - A kind of inferior stable state bovine respiratory syncytial virus of improvement merges DNA molecular and its application of precursor F protein matter and coding - Google Patents
A kind of inferior stable state bovine respiratory syncytial virus of improvement merges DNA molecular and its application of precursor F protein matter and coding Download PDFInfo
- Publication number
- CN109851678A CN109851678A CN201910171850.0A CN201910171850A CN109851678A CN 109851678 A CN109851678 A CN 109851678A CN 201910171850 A CN201910171850 A CN 201910171850A CN 109851678 A CN109851678 A CN 109851678A
- Authority
- CN
- China
- Prior art keywords
- ser
- leu
- ile
- val
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108010068327 4-hydroxyphenylpyruvate dioxygenase Proteins 0.000 title claims abstract description 79
- 239000002243 precursor Substances 0.000 title claims abstract description 53
- 241000711895 Bovine orthopneumovirus Species 0.000 title claims abstract description 44
- 230000006872 improvement Effects 0.000 title abstract description 21
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 59
- 230000004927 fusion Effects 0.000 claims abstract description 56
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 55
- 229940023143 protein vaccine Drugs 0.000 claims abstract description 13
- 230000009466 transformation Effects 0.000 claims abstract description 13
- 238000011238 DNA vaccination Methods 0.000 claims abstract description 12
- 239000013612 plasmid Substances 0.000 claims abstract description 10
- 235000018102 proteins Nutrition 0.000 claims description 50
- 150000001413 amino acids Chemical class 0.000 claims description 37
- 230000035772 mutation Effects 0.000 claims description 20
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 16
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 15
- 235000001014 amino acid Nutrition 0.000 claims description 12
- 229940024606 amino acid Drugs 0.000 claims description 12
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 10
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 10
- 235000018417 cysteine Nutrition 0.000 claims description 10
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 10
- 239000002253 acid Substances 0.000 claims description 9
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 claims description 8
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 8
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 6
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 6
- 239000000178 monomer Substances 0.000 claims description 6
- 239000004474 valine Substances 0.000 claims description 6
- 108091005804 Peptidases Proteins 0.000 claims description 5
- 239000004365 Protease Substances 0.000 claims description 5
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 5
- 239000004220 glutamic acid Substances 0.000 claims description 5
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- 239000013600 plasmid vector Substances 0.000 claims description 5
- 239000004471 Glycine Substances 0.000 claims description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 4
- 229910021529 ammonia Inorganic materials 0.000 claims description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 4
- 230000002265 prevention Effects 0.000 claims description 4
- DWNBOPVKNPVNQG-LURJTMIESA-N (2s)-4-hydroxy-2-(propylamino)butanoic acid Chemical compound CCCN[C@H](C(O)=O)CCO DWNBOPVKNPVNQG-LURJTMIESA-N 0.000 claims description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 3
- 230000002209 hydrophobic effect Effects 0.000 claims description 3
- 229960000310 isoleucine Drugs 0.000 claims description 3
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- 108091008146 restriction endonucleases Proteins 0.000 claims description 3
- 238000000844 transformation Methods 0.000 claims 1
- 108091007433 antigens Proteins 0.000 abstract description 29
- 102000036639 antigens Human genes 0.000 abstract description 29
- 229960005486 vaccine Drugs 0.000 abstract description 29
- 241000700605 Viruses Species 0.000 abstract description 25
- 239000000427 antigen Substances 0.000 abstract description 25
- 210000004027 cell Anatomy 0.000 abstract description 22
- 238000013461 design Methods 0.000 abstract description 22
- 230000000241 respiratory effect Effects 0.000 abstract description 14
- 241000283690 Bos taurus Species 0.000 abstract description 13
- 238000000034 method Methods 0.000 abstract description 13
- 238000010353 genetic engineering Methods 0.000 abstract description 8
- 241001465754 Metazoa Species 0.000 abstract description 7
- 230000012743 protein tagging Effects 0.000 abstract description 4
- 230000007321 biological mechanism Effects 0.000 abstract description 3
- 230000001900 immune effect Effects 0.000 abstract description 3
- 108020004414 DNA Proteins 0.000 description 41
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 39
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 31
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 28
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 28
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 27
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 25
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 21
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 20
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 19
- 108010031719 prolyl-serine Proteins 0.000 description 19
- 108010036211 5-HT-moduline Proteins 0.000 description 18
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 18
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 18
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 18
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 18
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 18
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 17
- 244000309466 calf Species 0.000 description 17
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 17
- 108010017391 lysylvaline Proteins 0.000 description 17
- 108010073969 valyllysine Proteins 0.000 description 17
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 16
- 241000880493 Leptailurus serval Species 0.000 description 16
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 16
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 16
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 15
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 15
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 15
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 15
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 15
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 15
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 15
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 15
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 15
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 15
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 15
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 15
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 15
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 15
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 15
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 15
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 15
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 15
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 15
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 15
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 15
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 15
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 15
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 15
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 15
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 15
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 14
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 14
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 14
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 14
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 14
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 14
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 14
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 14
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 14
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 14
- 108010064235 lysylglycine Proteins 0.000 description 14
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 13
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 13
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 13
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 13
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 13
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 13
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 13
- VMSSYINFMOFLJM-KJEVXHAQSA-N Thr-Tyr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O VMSSYINFMOFLJM-KJEVXHAQSA-N 0.000 description 13
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 13
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 13
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 13
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 12
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 12
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 12
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 12
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 12
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 12
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 12
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 12
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 12
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 12
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 12
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 12
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 12
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 12
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 12
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 12
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 12
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 12
- 108010084389 glycyltryptophan Proteins 0.000 description 12
- 108010037850 glycylvaline Proteins 0.000 description 12
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 12
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 11
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 11
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 11
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 11
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 11
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 108010012058 leucyltyrosine Proteins 0.000 description 11
- UEFODXNXUAVPTC-VEVYYDQMSA-N Asp-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UEFODXNXUAVPTC-VEVYYDQMSA-N 0.000 description 10
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 10
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 10
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 10
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 10
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 10
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 10
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 10
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 10
- 239000002671 adjuvant Substances 0.000 description 10
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 10
- 108010044940 alanylglutamine Proteins 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- 108010057821 leucylproline Proteins 0.000 description 10
- 108010051242 phenylalanylserine Proteins 0.000 description 10
- 108010015796 prolylisoleucine Proteins 0.000 description 10
- 108010003137 tyrosyltyrosine Proteins 0.000 description 10
- OFHXPCLWHLXQHT-JKQORVJESA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN OFHXPCLWHLXQHT-JKQORVJESA-N 0.000 description 9
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 9
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 9
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 9
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 9
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 9
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 9
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 9
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 9
- ACEDJCOOPZFUBU-CIUDSAMLSA-N Asp-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N ACEDJCOOPZFUBU-CIUDSAMLSA-N 0.000 description 9
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 9
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 9
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 9
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 9
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- QQQHYJFKDLDUNK-CIUDSAMLSA-N His-Asp-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QQQHYJFKDLDUNK-CIUDSAMLSA-N 0.000 description 9
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 9
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 9
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 9
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 9
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 9
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 9
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 9
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 9
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 9
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 9
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 9
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 9
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 9
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 9
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 9
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 9
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 9
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 9
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 9
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 9
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 9
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 9
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 9
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 9
- XIFAHCUNWWKUDE-DCAQKATOSA-N Val-Cys-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XIFAHCUNWWKUDE-DCAQKATOSA-N 0.000 description 9
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 9
- 108010092854 aspartyllysine Proteins 0.000 description 9
- 201000010099 disease Diseases 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 108010050848 glycylleucine Proteins 0.000 description 9
- 108010015792 glycyllysine Proteins 0.000 description 9
- 108010081551 glycylphenylalanine Proteins 0.000 description 9
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 8
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 8
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 8
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 8
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 8
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 8
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 8
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 8
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 8
- 241000699666 Mus <mouse, genus> Species 0.000 description 8
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 8
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 8
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 8
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 230000008859 change Effects 0.000 description 8
- 230000014509 gene expression Effects 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 108010018006 histidylserine Proteins 0.000 description 8
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 7
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 7
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 7
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 7
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 7
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 7
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 7
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 7
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 7
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 7
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 7
- QYKJOVAXAKTKBR-FXQIFTODSA-N Cys-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N QYKJOVAXAKTKBR-FXQIFTODSA-N 0.000 description 7
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 7
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 7
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 7
- PXEGEYISOXISDV-XIRDDKMYSA-N Cys-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 PXEGEYISOXISDV-XIRDDKMYSA-N 0.000 description 7
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 7
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 7
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 7
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 7
- WZAYJXZPSJOXCP-QAETUUGQSA-N Glu-Phe-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)N)CC1=CC=CC=C1 WZAYJXZPSJOXCP-QAETUUGQSA-N 0.000 description 7
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 7
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 7
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 7
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 7
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 7
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 7
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 7
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 7
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 7
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 7
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 7
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 7
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 7
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 7
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 7
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 7
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 7
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 7
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 7
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 7
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 7
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 7
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 7
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 7
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 7
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 7
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 7
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 7
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 6
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 6
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 6
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 6
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 6
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 6
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 6
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 6
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 6
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 6
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 6
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 6
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 6
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 6
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 6
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 6
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 6
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 6
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 6
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 6
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 6
- CKXMGSJPDQXBPG-JYJNAYRXSA-N Pro-Cys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O CKXMGSJPDQXBPG-JYJNAYRXSA-N 0.000 description 6
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 6
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 6
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 6
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 6
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 6
- STJXERBCEWQLKS-IHPCNDPISA-N Trp-Tyr-Cys Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 STJXERBCEWQLKS-IHPCNDPISA-N 0.000 description 6
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 6
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 6
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 6
- 230000000840 anti-viral effect Effects 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 108010069495 cysteinyltyrosine Proteins 0.000 description 6
- 235000013601 eggs Nutrition 0.000 description 6
- 230000028993 immune response Effects 0.000 description 6
- 238000002649 immunization Methods 0.000 description 6
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010090894 prolylleucine Proteins 0.000 description 6
- 210000002966 serum Anatomy 0.000 description 6
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 5
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 5
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 5
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 5
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 5
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 5
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 5
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 5
- UWXFFVQPAMBETM-ZLUOBGJFSA-N Cys-Asp-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UWXFFVQPAMBETM-ZLUOBGJFSA-N 0.000 description 5
- XRTISHJEPHMBJG-SRVKXCTJSA-N Cys-Asp-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XRTISHJEPHMBJG-SRVKXCTJSA-N 0.000 description 5
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 5
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 5
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 5
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 5
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 5
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 5
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 5
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 5
- IROABALAWGJQGM-OALUTQOASA-N Gly-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)CN IROABALAWGJQGM-OALUTQOASA-N 0.000 description 5
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 5
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 5
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 5
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 5
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 5
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 5
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 5
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 5
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 5
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 5
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 5
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 5
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 5
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 5
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 5
- ZBYHVSHBZYHQBW-SRVKXCTJSA-N Phe-Cys-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZBYHVSHBZYHQBW-SRVKXCTJSA-N 0.000 description 5
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 5
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 5
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 5
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 5
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 5
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 5
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 5
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 5
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 5
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 5
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 5
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 5
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 5
- HGEHWFGAKHSIDY-SRVKXCTJSA-N Tyr-Asp-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O HGEHWFGAKHSIDY-SRVKXCTJSA-N 0.000 description 5
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 5
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 5
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 5
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 5
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 5
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 5
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 5
- 230000003053 immunization Effects 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 5
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 4
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 4
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 4
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 4
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 4
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 4
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 4
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 4
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 4
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 4
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 4
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 4
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 4
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 4
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 4
- 108010070783 alanyltyrosine Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 229940125575 vaccine candidate Drugs 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 3
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 3
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 3
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 3
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 3
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 3
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 3
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 3
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 3
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 3
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 3
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 3
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 3
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 3
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 3
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 3
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 3
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 3
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 3
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 3
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 3
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 3
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 3
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 3
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 3
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 3
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 3
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 3
- KCPOQGRVVXYLAC-KKUMJFAQSA-N Cys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KCPOQGRVVXYLAC-KKUMJFAQSA-N 0.000 description 3
- WTEACWBAULENKE-SRVKXCTJSA-N Cys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N WTEACWBAULENKE-SRVKXCTJSA-N 0.000 description 3
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 3
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 3
- 229940021995 DNA vaccine Drugs 0.000 description 3
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 3
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 3
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 3
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 3
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 3
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 3
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 3
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 3
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 3
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 3
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 3
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 3
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 3
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 3
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 3
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 3
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 3
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 3
- 241000711920 Human orthopneumovirus Species 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 3
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 3
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 3
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 3
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 3
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 3
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 3
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 3
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 3
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 3
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 3
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 3
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 3
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 3
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 3
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 3
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 3
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 3
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 3
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 3
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 3
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 3
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 3
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 3
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 3
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 3
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 3
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 3
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 3
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 3
- RJEFZSIVBHGRQJ-SRVKXCTJSA-N Met-Arg-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RJEFZSIVBHGRQJ-SRVKXCTJSA-N 0.000 description 3
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 3
- YKWHHKDMBZBMLG-GUBZILKMSA-N Met-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)N YKWHHKDMBZBMLG-GUBZILKMSA-N 0.000 description 3
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 3
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 3
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 3
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 3
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 3
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 3
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 3
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 3
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 3
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 3
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 3
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 3
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 3
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 3
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 3
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 3
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 3
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 3
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 3
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 3
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 3
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 3
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 3
- WDIJBEWLXLQQKD-ULQDDVLXSA-N Tyr-Arg-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O WDIJBEWLXLQQKD-ULQDDVLXSA-N 0.000 description 3
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 3
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 3
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 3
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 3
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 3
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 3
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 3
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 3
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 3
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 3
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 3
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 3
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 3
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 3
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 230000036039 immunity Effects 0.000 description 3
- 230000002779 inactivation Effects 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 108010009920 neokyotorphin (1-4) Proteins 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 108010044292 tryptophyltyrosine Proteins 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 2
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 2
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- 108010041986 DNA Vaccines Proteins 0.000 description 2
- 230000006820 DNA synthesis Effects 0.000 description 2
- 102000002322 Egg Proteins Human genes 0.000 description 2
- 108010000912 Egg Proteins Proteins 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 2
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 2
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 2
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 2
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 2
- 206010035664 Pneumonia Diseases 0.000 description 2
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 2
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 2
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 2
- 206010061603 Respiratory syncytial virus infection Diseases 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 2
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 2
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 2
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 235000014103 egg white Nutrition 0.000 description 2
- 210000000969 egg white Anatomy 0.000 description 2
- 238000013467 fragmentation Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000002969 morbid Effects 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 150000007523 nucleic acids Chemical class 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 239000002574 poison Substances 0.000 description 2
- 231100000614 poison Toxicity 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 230000001681 protective effect Effects 0.000 description 2
- 238000012827 research and development Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000007921 spray Substances 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- FMWHSNJMHUNLAG-FXQIFTODSA-N Asp-Cys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FMWHSNJMHUNLAG-FXQIFTODSA-N 0.000 description 1
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 206010006448 Bronchiolitis Diseases 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- 208000029523 Interstitial Lung disease Diseases 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 1
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 1
- WRXOPYNEKGZWAZ-FXQIFTODSA-N Met-Ser-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O WRXOPYNEKGZWAZ-FXQIFTODSA-N 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 241000711504 Paramyxoviridae Species 0.000 description 1
- 241000144300 Peromyscus gossypinus Species 0.000 description 1
- VHDNDCPMHQMXIR-IHRRRGAJSA-N Phe-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHDNDCPMHQMXIR-IHRRRGAJSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- 206010035742 Pneumonitis Diseases 0.000 description 1
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- -1 Propylhomoserin-serine-glutamic acid Chemical compound 0.000 description 1
- 229940124679 RSV vaccine Drugs 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- 108010046722 Thrombospondin 1 Proteins 0.000 description 1
- 102100036034 Thrombospondin-1 Human genes 0.000 description 1
- QHONGSVIVOFKAC-ULQDDVLXSA-N Tyr-Pro-His Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QHONGSVIVOFKAC-ULQDDVLXSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- 108010003533 Viral Envelope Proteins Proteins 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- 239000004411 aluminium Substances 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000009833 antibody interaction Effects 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 229940031567 attenuated vaccine Drugs 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 206010006451 bronchitis Diseases 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000000799 fusogenic effect Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 230000008105 immune reaction Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000008595 infiltration Effects 0.000 description 1
- 238000001764 infiltration Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 231100000225 lethality Toxicity 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 244000000010 microbial pathogen Species 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000002808 molecular sieve Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 210000001989 nasopharynx Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 210000000440 neutrophil Anatomy 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 229940068196 placebo Drugs 0.000 description 1
- 239000000902 placebo Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 230000003449 preventive effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000002633 protecting effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000001533 respiratory mucosa Anatomy 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 208000023504 respiratory system disease Diseases 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108091069025 single-strand RNA Proteins 0.000 description 1
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000005728 strengthening Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000005829 trimerization reaction Methods 0.000 description 1
- 239000012646 vaccine adjuvant Substances 0.000 description 1
- 229940124931 vaccine adjuvant Drugs 0.000 description 1
- 229940126580 vector vaccine Drugs 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- 210000004885 white matter Anatomy 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/155—Paramyxoviridae, e.g. parainfluenza virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
- C07K14/08—RNA viruses
- C07K14/115—Paramyxoviridae, e.g. parainfluenza virus
- C07K14/135—Respiratory syncytial virus
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
Abstract
The invention belongs to field of biotechnology, disclose a kind of bovine respiratory syncytial virus fusion precursor F protein matter of inferior stable state by improvement of genes, the DNA molecular of coding and its application.The method of structure biology is applied in vaccine design and improvement by the present invention, by the observation to bovine respiratory syncytial virus F protein matter three-dimensional structure, finds the biological mechanism of its occurred conformation variation.ANTIGEN DESIGNThe and genetic engineering transformation are carried out on the basis of F protein matter three-dimensional structure, obtain the bovine respiratory syncytial virus F protein matter vaccine for being maintained at inferior stable state F protein matter fusion precursor.Progress genetic engineering transformation on the basis of this protein vaccine, building can make to receive the good protection of immune animal acquisition in the plasmid that cell surface expresses bovine respiratory born of the same parents zoarium virus F protein fusion precursor protein as DNA vaccination.By molecular biology, biology, cell biology, immunologic method determined the stability of vaccine, validity and safety.
Description
Technical field
The invention belongs to field of biotechnology, and in particular to a kind of bovine respiratory syncytial virus fusion precursor F protein matter and
The DNA molecular of the protein coding and its application.
Background technique
Bovine respiratory syncytial virus (Bovine Respiratory Syncytial Virus, BRSV) mainly causes 2~6
The capillary bronchitis and interstitial pneumonia of monthly age calf.Even if there are the cow antibody of medium level, First Year calf after birth
Illness rate still may be up to 50% or more.Although there are many BRSV to secure permission inactivation, attenuated vaccines in the market.But due to
The reasons such as the stability of these vaccines is low with the protective rate to animal, it is still available without ideal vaccine at present.Although
BRSV lethality is not high, but the lung because that can not be reversed caused by its complication and respiratory tract injury and medical expense, raising at
This increase can cause serious economic loss to cattle-raising.Only U.S.'s current year loss in 2015 is close to 1,000,000,000 dollars.Domestic phase
The data of pass have not been reported, but amount of livestock on hand nearly 100,000,000 of domestic ox in 2017, the slightly above quantity in the U.S..
Member during RSV belongs to Paramyxoviridae, pneumonitis virus belongs to, genome are Nonsegmented single strand RNA, by
15225 nucleotide compositions.RSV genome encoding mainly has 11 kinds of protein, the genome albumen that end encodes from 3' to 5' according to
Secondary is NSl, NS2, N, P, M, SH, G, F, M2 (M2-l, M2-2) and L etc., and wherein surface protein glycoprotein G and fusion protein F exist
Virus envelope surface forms furcella.When rsv infection host cell, viromembrane is needed to merge with host cell membrane.Based on working as
The preceding research merged to paramyxovirus, RSV F protein matter are folded into a kind of " before fusion " conformation (Pro- when originally forming
fusion).When the film of the two merges, the conformation of the fusion precursor, which carries out refolding and conformation change, to be become " after fusion "
Conformation (Post-fusion).Therefore, what which merged precursor protein is a kind of protein of inferior stable state, it passes through initial
It is folded into a kind of metastable form (conformation before merging), which then carries out discontinuously, gradually becomes with irreversible conformation
It is melted into the conformation (conformation after fusion) of a lower energy stabilization state.The biological mechanism of conformation change is that virus passes through F egg
White conformation change mediate retroviral and host cell membrane merge, and cause to borrow in the inhereditary material intrusion host cell of virus
Host cell systems are helped to complete the duplication of virus.So F protein fusion precursor is optimal vaccine candidate object, it is that main neutralize resists
Ultimate constituent.After organism infection RSV, F the and G antigen protein of encoding viral can stimulate immune response, generate serum IgG and neutralize
The secretory IgA antibody of antibody and respiratory mucosa plays a significant role in anti-rsv infection.There is problems in that natural F egg
White fusion precursor is very unstable.This is also that can not obtain permanently effective and stable epidemic disease using the method for traditional inactivation and attenuation
The reason of seedling.Need to carry out it improvement of conformational stability.
Before the fusion as the result is shown observed using Electron Microscopy RSV F protein with merge after tripolymer it
Between there are huge architectural differences.The variation of these structures is confirmed that McLellan also by protein crystallography method
J.S.et al.Science.136 (2013) and McLellan J.S.et al.158 (2013).Zoopery is as the result is shown
RSV F protein matter is unique Calder L.J.et al.Virology 271 in antigenicity before fusion and after fusion
(2000)。
Human airway syncytial virus (Human Respiratory Syncytial Virus, HRSV) and bovine respiratory
Syncytial virus have genetic similarity, up to 85%.Mankind RSV is First Year bronchiolitis and pneumonia after children's birth
Most common reason.RSV can also cause repeated infection, including serious lower respiratory illness, this is likely to occur in any age,
The people of especially the elderly or heart, lung or compromised immune.There are the cause of 4 300 ten thousand people childs and the elderly every year seriously
Respiratory disease leads to hospitalization.Relevant HRSV vaccine is in mouse, cotton mouse and non-human primate (NHP) animal
It is assessed in model.But the inactivation hRSV vaccine clinical experiment of eighties of last century the seventies Merck company research and development causes
The major accident of tested baby death.National Institutes of Health vaccine research center in 2013 is by the side of structure biology
The field of method application vaccine research and development.(F) glycoprotein before their metastable state fusion based on the design of protein atomic structure
(Pre-fusion F) vaccine candidate object achieves good result in the experiment of preclinical non-human primate.It is this at present
Mankind's RSV vaccine has come into the clinical I phase stage.(see, for example, WO20101149745, WO2010/1149743,
WO2009/1079796,WO 2012/158613).The vaccine candidate object of ox back scape is converted in physics and chemistry by these vaccine designs
It learns, also shows extraordinary prospect in biology and calf zoopery.However, the vaccine that they improve is in yield and antigen
Still there is improvable space in activity.
There are several bRSV vaccines for being recognized and having used on overseas market at present.The development and production of these traditional vaccines
Mode mainly passes through change condition of culture, or the attenuation epidemic disease that passage weakens pathogenic microorganisms toxicity on different culture cells
Seedling, such as (ZOETIS, Bovi-Shield Gold 5, SKU:540492).They the shortcomings that, are that validity period is short, and protective rate is low.
The limitation of this kind of vaccine is also manifested by: (1) virus of animals and humans needs to cultivate in zooblast, this makes vaccine raw
The cost of production is very high;(2) morbid substance in vaccine is possible to not be attenuated sufficiently in vaccine production process, this will lead to epidemic disease
Contain virulent property morbid substance in seedling, so that disease is propagated in the larger context;(3) attenuated strain is possible to occur
Mutation, causes disease.
Summary of the invention
In view of this, it is an object of the invention to existing vaccine there are aiming at the problem that, a kind of ox of conformational stability is provided
The fusion precursor protein of respiratory syncystial virus F protein, the DNA of encoding said proteins and its application.
To achieve the purpose of the present invention, the present invention adopts the following technical scheme:
A kind of fusion precursor protein of bovine respiratory syncytial virus F protein includes selected from the group below to F protein at least one
The transformation of wild type:
A, increase the connection quantity of disulfide bond between each monomer inside of F protein tripolymer and monomer;
B, mutation F protein tripolymer inside at least one amino acid, by the lesser amino acid mutation of side chain become side chain compared with
Hydrophobic binding inside big amino acid or increase;
C, the restriction enzyme site of at least one protease is rejected in mutation;
D, at least one larger amino acid of dynamic of F protein tripolymer is cut off, instead shorter link peptide;
E, extend spiral structure in the C- terminal of F- protein.
Preferably, the transformation to F protein wild type includes the 143rd glycine, the 404th serine, the 103rd
Position serine, the 262nd mutant serine are cysteine, and the 288th isoleucine mutation is phenylalanine, the 187th figured silk fabrics
Histidine mutations are leucine.It is highly preferred that the fusion precursor protein of the bovine respiratory syncytial virus F protein is set at following three kinds
Yield, stability and the affinity in conjunction with specific antibody that lower antigen is transformed in meter have preferable performance:
1) M1 is designed: the 159th hyte propylhomoserin, the 291st valine mutation are cysteine, wipe out from 109 to 137 amino
Link peptide serine-glutamic acid-Ser-Ser-glutamic acid-Ser-Ser-paddy ammonia is added in acid sequence
Acid;It is abbreviated as H159C, V291C;G143C, S404C;S103C, S262C;I288F;V187L;Δ109-137(SGSSGSSG);
2) M2 is designed: the 158th leucine, the 290th mutant serine are cysteine, wipe out from 114 to 132 amino
Link peptide serine-glutamic acid-Ser-Ser-glutamic acid is added in acid sequence;It is abbreviated as L158C, S290C;
G143C, S404C;S103C, S262C;I288F;V187L;Δ114-132(SGSSG);
3) M3 is designed: the 158th leucine, the 290th mutant serine are cysteine, wipe out from 114 to 132 amino
Link peptide serine-glutamic acid-Ser-Ser-glutamic acid-Ser-Ser-paddy ammonia is added in acid sequence
Acid;It is abbreviated as L158C, S290C;G143C, S404C;S103C, S262C;I288F;V187L;Δ109-137(SGSSGSSG).
The gene order of bovine respiratory born of the same parents' zoarium virus F protein matter of different virus strain is different, improved bovine respiratory
The sequence of the fusion precursor protein of syncystial virus F protein is also just different.Further, bovine respiratory syncytial virus F protein is melted
Precursor protein is closed, selected from one of following sequences:
(1) the F egg of tri- kinds of design improvements of M1, M2, M3 of bovine respiratory syncytial virus NP_048055.1 (ATue51908)
The amino acid sequence of white fusion precursor protein is successively as shown in SEQ ID NO:1,2,3;
(2) F of tri- kinds of design improvements of M1, M2, M3 of bovine respiratory syncytial virus YP_009505455 (ATCC 51908)
The amino acid sequence of the fusion precursor protein of albumen is successively as shown in SEQ ID NO:4,5,6;
(3) F proteins of tri- kinds of design improvements of M1, M2, M3 of bovine respiratory syncytial virus ANN02895 (USII/S1) is melted
The amino acid sequence of conjunction precursor protein is successively as shown in SEQ ID NO:7,8,9;
(4) F proteins of tri- kinds of design improvements of M1, M2, M3 of bovine respiratory syncytial virus AAL49410 (A51908) is melted
The amino acid sequence of conjunction precursor protein is successively as shown in SEQ ID NO:10,11,12;
(5) F proteins of tri- kinds of design improvements of M1, M2, M3 of bovine respiratory syncytial virus BAA00798.1 (RB94) is melted
The amino acid sequence of conjunction precursor protein is successively as shown in SEQ ID NO:13,14,15;
(6) F protein of tri- kinds of design improvements of M1, M2, M3 of bovine respiratory syncytial virus P22167.1 (Copenhagen)
Fusion precursor protein amino acid sequence successively as shown in SEQ ID NO:16,17,18;
(7) fusion of the F protein of tri- kinds of design improvements of M1, M2, M3 of bovine respiratory syncytial virus AAB22601 (391-2)
The amino acid sequence of precursor protein is successively as shown in SEQ ID NO:19,20,21.
The present invention also provides the DNA moleculars for the fusion precursor protein for encoding above-mentioned F protein.
In some embodiments, the nucleotide of the fusion precursor egg of F protein shown in the above-mentioned SEQ ID NO:1-21
Sequence is successively as shown in SEQ ID NO:22-42.
Experiment shows that the fusion precursor protein conformational stability of bovine respiratory syncytial virus F protein of the present invention can be used as
Vaccine candidate object is for preventing bovine respiratory syncytial virus.Therefore the present invention also provides bovine respiratories of the present invention to close born of the same parents
The fusion precursor protein of virus F protein and the fusion precursor for encoding the bovine respiratory syncytial virus F protein of the present invention
Application of the DNA molecular of albumen in the product of preparation prevention bovine respiratory syncytial virus.
The present invention also provides a kind of protein vaccines, by the fusion precursor egg of above-mentioned bovine respiratory syncytial virus F protein
It is at least one of white to be made.Further, the protein vaccine, by any one in amino acid sequence such as SEQ ID NO.1-21
The fusion precursor protein of bovine respiratory syncytial virus F protein shown in kind is made.
The present invention also provides a kind of DNA vaccinations, before the fusion comprising encoding above-mentioned bovine respiratory syncytial virus F protein
At least one of DNA molecular of body protein and plasmid vector.
Further, the DNA vaccination, as nucleotide sequence ox as shown in any one in SEQ ID NO.22-42
The DNA molecular and plasmid vector of the fusion precursor protein of respiratory syncystial virus F protein.
Preferably, plasmid vector described in the DNA vaccination is pVAC1-mcs plasmid.
As shown from the above technical solution, the present invention provides a kind of fusion precursor eggs of bovine respiratory syncytial virus F protein
White, encoding said proteins DNA moleculars and its application.The method of structure biology is applied to vaccine design and improvement by the present invention
In, by the observation to F- protein structure, find the biological mechanism of F- albumen occurred conformation variation.Born of the same parents are closed in bovine respiratory
ANTIGEN DESIGNThe and genetic engineering transformation are carried out on the basis of the three-dimensional structure of virus F protein, acquisition is maintained at inferior stable state F- egg
The fusion precursor protein of the bovine respiratory syncytial virus F protein of white fusion precursor.On the basis of this protein vaccine further into
The transformation of row genetic engineering, building can express bovine respiratory born of the same parents zoarium virus F protein in host cell surface and merge precursor protein
Plasmid as DNA vaccination, be able to maintain it largely in fusion precursor in the protein that secrets out of of mammalian cell expression
Metastable condition.By molecular biology, biology, cell biology, immunologic method determines the stability of vaccine, effectively
Property and safety.
Detailed description of the invention
Fig. 1 shows spatial position signal of the F protein of tri- kinds of M1, M2 and M3 designs on protein three-dimensional structure (5TGD)
Figure;
Fig. 2 shows the gel-filtration purified chromatogram of Strain 391-2F protein mutant M1;
Fig. 3 shows Strain 391-2F protein mutant M1SDS-PAGE gel electrophoresis figure;
Fig. 4 shows pVAC1-mcs plasmid map;
Fig. 5 shows the DNA vaccination and albumen epidemic disease of tri- kinds of mutant designs of M1, M2 and M3 of Strain 391-2 and ATue51908
The antiviral Activity Results figure of seedling the 5th week serum in mouse animal experiment;
Fig. 6 show Strain ATue51908 DNA and its protein mutant M1 immunogene in calf immunization experiment result
Figure;
Fig. 7 shows the experimental result picture of the BRSV challenge viral dosage by immune calf.
Specific embodiment
The present invention provides a kind of fusion precursor proteins of the bovine respiratory syncytial virus F protein of conformational stability, coding institute
DNA and its application of albumen are stated, those skilled in the art can use for reference present disclosure, be suitably modified realization of process parameters.Especially
It should be pointed out that all similar substitutions and modifications are apparent to those skilled in the art, they are all regarded
To be included in the present invention.Method and application of the invention is described by preferred embodiment, the obvious energy of related personnel
It is not departing from the content of present invention, in spirit and scope to methods herein and application is modified or appropriate changes and combinations, is coming
Implementation and application the technology of the present invention.
Cell culture used herein, molecular genetics, nucleic acid chemistry, biochemistry, Immunology Lab operation step
It suddenly is widely used conventional steps in corresponding field.Meanwhile for a better understanding of the present invention, relational language is provided below
Definition and explanation.
Antigen (antigen, Ag) is to refer to stimulation body to generate (specificity) immune response, and energy and immune response
Product antibodies and sensitized lymphocyte combine in vitro, and the substance of immunological effect (specific reaction) occurs.Antigen is can be
In being handed to the section of DNA or DNA fragmentation that induce immune response after host animal;Polypeptide, epitope, haptens or any combination thereof.
" polypeptide " and " protein " is used interchangeably herein, it is intended that the polymer of continuous amino acid residue.Term " core
Acid ", " nucleotide " are used interchangeably, and refer to RNA, DNA, cDNA (complementary DNA) or cRNA (complementary RNA) and its derivative, example
Such as comprising the form through modifying main chain.
" fusion " refers to the technology that two or more protein fusion expressions are formed to fusion protein.In general, by making
The DNA fragmentation for encoding two or more albumen is linked together with meeting frame with recombinant DNA technology, and carries out protein
Expression is to obtain fusion protein.Fusion, which uses, refers to that the sequence by MS and sPD-1 is fused together, the method that vaccine is made.
" prevention " refers to prevention disease or the associated patient's condition or symptom.
" carrier (vector) " refers to a kind of nucleic acid delivery vehicle that can be inserted polynucleotide.A kind of carrier can
With the element expressed containing various control, including but not limited to promoter sequence, transcriptional initiation sequence, enhancer sequence, selection
Element and reporter gene.In addition, carrier can also contain replication origin.
" host cell " refers to the cell that can be used for importing carrier comprising but it is not limited to such as Escherichia coli or withered grass bacterium
Prokaryotic cell, such as the fungal cell of yeast cells or Aspergillus, such as the insect cell of S2 drosophila cell or Sf9, or
Such as zooblast of bhk cell, HEK293 cell or people's cell.
" adjuvant " refers to nonspecific immunity strengthening agent, when it is delivered together with antigen or in advance into body, can increase
The immune response of strong body fight original changes type of immune response.There are many kinds of adjuvants, including but not limited to aluminium adjuvant (such as
Aluminium hydroxide), Freund's adjuvant (such as complete Freund's adjuvant and incomplete Freund's adjuvant), corynebacterium, lipopolysaccharides, cell
Factor etc..Freund's adjuvant is most common adjuvant in current animal experiment.Aluminum hydroxide adjuvant then in clinical trial use compared with
It is more.
Vaccine (vaccine) refers to the epidemic disease in order to prevent, control the generation of infectious disease, prevalence, for human body immunization campaign
Seedling class preventive biological products." DNA vector vaccine " refers to the vaccine based on DNA or RNA (such as plasmid, such as expression plasmid),
It optionally also includes adjuvant.
Test material that the present invention uses, reagent, instrument are all common commercially available product, can all be bought in market.
Below with reference to embodiment, the present invention is further explained:
Embodiment 1: the design and screening of the fusion precursor protein of the bovine respiratory syncytial virus F protein of conformational stability
(1) according to the principle of structure biology, the three-dimensional structure PDB for observing three protein is encoded to 5TGD, 5TDL and
3RRR determines the conformation change of body after RSV virus F protein fusion precursor and fusion, right on the basis of the three-dimensional structure of F protein
Seven kinds of wild BRSV Strain carry out ANTIGEN DESIGNThe and genetic engineering transformation, so that the F protein of expression is enough maintained at as far as possible
Merge the metastable condition of precursor.
Wherein the gene pool of the gene order of the wild virus strain of seven kinds of bovine respiratories born of the same parents' zoarium virus F protein logs in
Number (strain name) is respectively as follows: (1) NP_048055.1 (ATue51908);(2)YP_009505455(ATCC 51908);(3)
ANN02895(USII/S1);(4)AAL49410(A51908);(5)AM746678.1(RB94);(6)P22167.1
(Copenhagen);(7)AAB22601(391-2).
The target that ANTIGEN DESIGNThe and genetic engineering transformation are carried out on the basis of the three-dimensional structure of F protein is:
1) the connection quantity for increasing disulfide bond between each monomer inside of F protein tripolymer and monomer, hinders its generation
Conformation change locks it in the metastable condition of fusion precursor.
2) some amino acid inside mutation F protein tripolymer, mainly become the lesser amino acid mutation of some side chains
The biggish amino acid filling of side chain is internal to work as gap, increases internal hydrophobic binding, mobile space needed for reducing conformation change,
It is limited to change to body occurred conformation after fusion.
3) restriction enzyme site of certain protease is rejected in mutation, protects fusogenic peptide, prevents the structure because causing after protease digestion
As variation.
4) some larger amino acid of dynamic inside excision F protein tripolymer, shorter link peptide, improves egg
The stability of white matter.
5) extend spiral structure, the probability and and its stability that enhancing tripolymer is formed in the C- terminal of F- protein.
According to above-mentioned target, applicant carries out first round ANTIGEN DESIGNThe to virus on the basis of the three-dimensional structure of F protein
The original series genetic engineering transformation of the F protein of strain ATu51908 obtains 56 kinds of mutant antigens, wherein each antigen is set
Meter and improvement theory, the form of F protein, the sequence site of mutation, the affinity in conjunction with some specific antibodies and protein
Expression, as shown in table 1.Protein vaccine be by using Expi293F cell Thermo Fisher Scientific,
The mutant of MA and 293Fectin (Thermo Fisher Scientific) expression RSV F protein.The plasmid used is
PcDNA3.0. cell culture supernatant is harvested after transiently transfecting 5 days.It is centrifuged with 10,000 × g to remove cell fragment.Supernatant
Liquid is by being sterile filtered, using nickel (Ni) (Roche) and Strep-Tactin (iba) affinity chromatography to RSV F mutant protein
It is isolated and purified.Then it is further isolated and purified by molecular sieve pillar, obtains the protein vaccine of RSV F.
56 kinds of antigens of the acquisition of 1 first round of table ANTIGEN DESIGNThe and genetic engineering transformation
In conjunction with the well-designed of the first round, the mainly expression of antigen and the affinity in conjunction with specific antibody,
The design and improvement that the second wheel F protein has been carried out on the basis of Strain ATu51908 original series, obtain 20 by screening
Kind of antigen is in some extreme environments, such as high and low temperature (50-70 DEG C), freeze-thaw five times, soda acid (pH3-10) and salinity
(10-3000mM) and for a long time in the Physicochemical property and the affinity in conjunction with some specific antibodies for the conditions such as being placed at room temperature for
And its protein expression level.
Table 2 second takes turns stability of the 20 kinds of antigens of the acquisition of F protein ANTIGEN DESIGNThe under specific physical and electrochemical conditions
Matter
Table 3 second takes turns 20 kinds of antigens of the acquisition of F protein ANTIGEN DESIGNThe and the affinity of antibody interaction
As a result, it has been found that the design of three kinds of mutant antigens is in yield, stability and the affinity side in conjunction with specific antibody
There is preferable performance in face, is respectively as follows:
M1:H159C, V291C, G143C, S404C, S103C, N262C, I288F, V187L, Δ 109-137
(SGSSGSSG);That is the 159th hyte propylhomoserin of bovine respiratory born of the same parents zoarium virus F protein amino acid sequence, the 291st valine,
143 glycine, the 404th serine, the 103rd serine, the 262nd mutant serine are cysteine, and the 288th different
Leucine sports phenylalanine, and the 187th valine mutation is leucine, wipes out from 109 to 137 amino acid sequences, and
It wipes out site same position and link peptide SGSSGSSG (serine-glutamic acid-Ser-Ser-glutamic acid-silk is added
Propylhomoserin-serine-glutamic acid).
M2:L158C, S290C, G143C, S404C, S103C, N262C, I288F, V187L, Δ 114-132 (SGSSG);
I.e. the 158th leucine of bovine respiratory born of the same parents zoarium virus F protein amino acid sequence, the 290th serine, the 143rd glycine,
404th serine, the 103rd serine, the 262nd mutant serine are cysteine, and the 288th isoleucine mutation is
Phenylalanine, the 187th valine become leucine, wipe out from 114 to 132 amino acid sequences, and link peptide SGSSG (silk is added
Propylhomoserin-glutamic acid-Ser-Ser-glutamic acid);
M3:L158C, S290C, G143C, S404C, S103C, N262C, I288F, V187L, Δ 109-137
(SGSSGSSG);That is bovine respiratory born of the same parents zoarium virus F protein amino acid sequence length that link peptide is extended on the basis of M2,
Mutation becomes SGSSGSSG (serine-glutamic acid-Ser-Ser-glutamic acid-Ser-Ser-paddy ammonia
Acid).
Spatial position of tri- kinds of M1, the M2 and M3 designs on protein three-dimensional structure (5TGD) is as shown in Figure 1.
The amino acid sequence and corresponding DNA sequence dna of the fusion precursor protein of seven kinds of bovine respiratory born of the same parents' zoarium virus F proteins
As shown in table 4
The amino acid sequence and corresponding DNA sequence dna of the fusion precursor protein of 4 BRSV F protein of table
By taking Strain 391-2 as an example, the stability of the F protein of 391-2M1 design is detected, as a result such as Fig. 2 and figure
3。
The F protein of 391-2M1 design is largely retained in the trimerization of fusion precursor known to the gel filtration chromatography figure of Fig. 2
In the conformation of body, more single absorption peak is shown as.And gel images shown in Fig. 3 show that the F protein of 391-2M1 design is kept
In more stable tripolymer, only one band in the presence of reducing agent.But the 391-2 strain that do not improved
F protein is then in state that is unstable, easily being cut off by protease, shows as having multiple bands.
The F protein stability result of 391-2M2 and M3 design is similar to 391-2M1.
The preparation of embodiment 2:DNA vaccine
The corresponding DNA synthesis of antigen protein sequence progress, purification and detection that the screening of verifying obtains will be passed through, using next
Derived from InvivoGen, the pVAC1-mcs plasmid (Fig. 4) of USA is as DNA vaccine vector.Clone can in the antigen protein of this plasmid
It is anchored on cell surface with what is expressed and target, humoral immune reaction is stimulated by intramuscular injection.What muscle cell surface generated
Antigen is considered as being absorbed by antigen-expressing cells (APCs), and pass through main histocompatibility complex (MHC) II
Classpath is handled.
Embodiment 3: mouse immune experiment
By antigen protein yield is high and antigen-antibody affinity preferable 391-2M1,391-2M2,391-2M3 and
ATue51908M1, ATue51908M2, ATue51908M3 candidate carry out corresponding DNA synthesis, and sequence is as shown in table 3.Then
Respectively with the enzyme cutting clone of BamHI and EcoRI to the DNA vaccination immunization experiment for being used for mouse is added in pVAC1-mcs plasmid, together
When homologous protein vaccine immunization experiment is set, compare antiviral activity result such as Fig. 5 institute that mouse receives Post-immunisation serum
Show.
DNA vaccination through tri- kinds of M1, M2 and M3 design improvement on the basis of the F protein of Bovine 391-2 and A51908
It is immune in zero circle in mouse with homologous protein vaccine, reinforce after two weeks, homologous DNA vaccination and protein vaccine after five weeks
The antiviral activity of the serum of induction is very close.The F protein of body state after fusion than not improveing compares mean height
Hundreds times.The serum that display receives the mouse for the vaccine that improvement vaccine immunity is less improved has very strong antiviral activity.Often
The neutral titre of a animal is shown as a single point, indicates that geometry is equal with black level line.
Embodiment 4: calf immunization experiment
In order to further look at designed DNA and protein vaccine in the performance of calf, carry out following preclinical
Zoopery.The M1 immunogene of highly stable ATue51908DNA and its protein are selected, they are obtained in mouse experiment
Highest dilution factor, respectively EC50 26,716.8 and 19,605.8.As control, body after merging has been selected
ATue51908DNA and proteantigen, and placebo is immunized using phosphate buffered saline (PBS).10
One group of calf, immunity inoculation was carried out to 1-3 weeks big male calf.Twice at the 0th week and the 4th week, blood is collected after 2 weeks weekly
It is clear immune.Per injection is by DNA antigen or protein vaccine and vaccine adjuvant Montanide TM ISA71VG is added
(Seppic,France).EC50 dilution factor is observed, calf exponentially increased the 2nd week, the 4th week.The fusion of injection in 6th week
The calf serum antiviral activity of precursor dna vaccine and proteantigen is body after the year-on-year fusion not improved respectively
894 times of vaccine and 642 times.Maintain an equal level weighing apparatus (Fig. 6) after peaking within 6th week.
Embodiment 5: by the BRSV challenge viral dosage of immune calf
In order to prove that the vaccine by improvement can induce the immune system of calf to generate higher antiviral activity, carry out
BRSV challenge viral dosage method.The calf of embodiment 4 all by intranasal and endotracheal spray poison, carries out Disease Clinical to calf daily
Symptom monitoring and virus titer, calf is euthanized after nasopharynx sprays poison challenge 6 days.Sampling analysis bronchovesicular from lung three
A region lavation (bal) and lung bioplsy obtain, to determine virus titer, neutrophil infiltration, the journey of micro and macro lesion
Degree.The remaining situation of the BRSV of linked groups is observed, as a result such as Fig. 7.The organ-tissue of observation includes right heart (rc), Zuo Xin
(lc), tracheal epithelium (trsc), lung clean cell (lwc) sample.
The results show that DNA vaccination and protein vaccine that the present invention improves play fine protecting effect to calf,
It is fused remnants virus-free in histoorgan observed by the immune individual of precursor.But body is gentle after the fusion not improved
That rushes many viruses of control group discovery of solution infects remnants.
The above is only the preferred embodiment of the present invention, it is noted that those skilled in the art are come
It says, various improvements and modifications may be made without departing from the principle of the present invention, these improvements and modifications also should be regarded as
Protection scope of the present invention.
Sequence table
<110>Suzhou Yu Zhibo Biotechnology Co., Ltd
<120>a kind of improvement inferior stable state bovine respiratory syncytial virus fusion precursor F protein matter and coding DNA molecular and
It is applied
<130> MP1832488
<160> 42
<170> SIPOSequenceListing 1.0
<210> 1
<211> 494
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 1
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Ser Gly Ser Ser
100 105 110
Gly Ser Ser Gly Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Val Ala
115 120 125
Ser Gly Val Ala Val Ser Lys Val Leu Cys Leu Glu Gly Glu Val Asn
130 135 140
Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu
145 150 155 160
Ser Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn
165 170 175
Tyr Ile Asp Lys Glu Leu Leu Pro Gln Val Asn Asn His Asp Cys Arg
180 185 190
Ile Ser Asn Ile Glu Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg
195 200 205
Leu Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr
210 215 220
Pro Leu Ser Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile
225 230 235 240
Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn
245 250 255
Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Ser Cys Val Lys
260 265 270
Glu Glu Val Ile Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile
275 280 285
Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp
290 295 300
Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp
305 310 315 320
Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Thr Glu Thr
325 330 335
Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu
340 345 350
Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr
355 360 365
Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser
370 375 380
Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys
385 390 395 400
Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn
405 410 415
Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly
420 425 430
Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile
435 440 445
Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser
450 455 460
Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln
465 470 475 480
Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490
<210> 2
<211> 501
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 2
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Ser Gly Ser Ser Gly Lys Arg Lys Arg Arg Phe Leu Gly Phe Leu
115 120 125
Leu Cys Ile Gly Ser Ala Val Ala Ser Gly Val Ala Val Ser Lys Val
130 135 140
Cys His Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala Leu Leu Ser
145 150 155 160
Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Leu Leu Thr
165 170 175
Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu Leu Leu Pro
180 185 190
Gln Val Asn Asn His Asp Cys Arg Ile Ser Asn Ile Glu Thr Val Ile
195 200 205
Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala Arg Glu Phe
210 215 220
Ser Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr Met Leu Thr
225 230 235 240
Asn Ser Glu Leu Leu Ser Leu Ile Cys Asp Met Pro Ile Thr Asn Asp
245 250 255
Gln Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg Gln Gln Ser
260 265 270
Tyr Ser Phe Met Cys Val Val Lys Glu Glu Val Ile Ala Tyr Val Val
275 280 285
Gln Leu Pro Ile Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu His
290 295 300
Thr Ser Pro Leu Cys Thr Thr Asp Asn Lys Glu Gly Ser Asn Ile Cys
305 310 315 320
Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val
325 330 335
Ser Phe Phe Pro Gln Thr Glu Thr Cys Lys Val Gln Ser Asn Arg Val
340 345 350
Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp Val Asn Leu
355 360 365
Cys Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile Met Thr
370 375 380
Ser Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly Ala Ile
385 390 395 400
Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg
405 410 415
Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys
420 425 430
Gly Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys
435 440 445
Leu Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr
450 455 460
Tyr Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala
465 470 475 480
Gln Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser
485 490 495
Asp Glu Leu Leu His
500
<210> 3
<211> 504
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 3
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Ser Gly Ser Ser Gly Ser Ser Gly Lys Arg Lys Arg Arg Phe Leu
115 120 125
Gly Phe Leu Leu Cys Ile Gly Ser Ala Val Ala Ser Gly Val Ala Val
130 135 140
Ser Lys Val Cys His Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala
145 150 155 160
Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser
165 170 175
Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu
180 185 190
Leu Leu Pro Gln Val Asn Asn His Asp Cys Arg Ile Ser Asn Ile Glu
195 200 205
Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala
210 215 220
Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr
225 230 235 240
Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile Cys Asp Met Pro Ile
245 250 255
Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg
260 265 270
Gln Gln Ser Tyr Ser Phe Met Cys Val Val Lys Glu Glu Val Ile Ala
275 280 285
Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile Asp Thr Pro Cys Trp
290 295 300
Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp Asn Lys Glu Gly Ser
305 310 315 320
Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala
325 330 335
Gly Ser Val Ser Phe Phe Pro Gln Thr Glu Thr Cys Lys Val Gln Ser
340 345 350
Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp
355 360 365
Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys
370 375 380
Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile
385 390 395 400
Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn
405 410 415
Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val
420 425 430
Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr
435 440 445
Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile
450 455 460
Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala
465 470 475 480
Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile
485 490 495
Arg Arg Ser Asp Glu Leu Leu His
500
<210> 4
<211> 494
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 4
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Asn Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Ser Gly Ser Ser
100 105 110
Gly Ser Ser Gly Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala
115 120 125
Ser Gly Val Ala Val Ser Lys Val Leu Cys Leu Glu Gly Glu Val Asn
130 135 140
Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser Ala
145 150 155 160
Gly Ile Thr Thr Pro Leu Ser Leu Ser Asn Gly Val Ser Leu Leu Thr
165 170 175
Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu Leu Leu Pro
180 185 190
Lys Val Asn Asn His Asp Cys Arg Ile Ser Lys Ile Glu Thr Val Ile
195 200 205
Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala Arg Glu Phe
210 215 220
Ser Val Asn Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile
225 230 235 240
Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn
245 250 255
Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Ser Cys Val Lys
260 265 270
Glu Glu Val Ile Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile
275 280 285
Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp
290 295 300
Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp
305 310 315 320
Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Thr Glu Thr
325 330 335
Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu
340 345 350
Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr
355 360 365
Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser
370 375 380
Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys
385 390 395 400
Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn
405 410 415
Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly
420 425 430
Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile
435 440 445
Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser
450 455 460
Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln
465 470 475 480
Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490
<210> 5
<211> 501
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 5
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Asn Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Ser Gly Ser Ser Gly Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu
115 120 125
Cys Ile Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser Lys Val Cys
130 135 140
His Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala Leu Leu Ser Thr
145 150 155 160
Asn Lys Ala Val Val Ser Ala Gly Ile Thr Thr Pro Leu Ser Leu Ser
165 170 175
Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr
180 185 190
Ile Asp Lys Glu Leu Leu Pro Lys Val Asn Asn His Asp Cys Arg Ile
195 200 205
Ser Lys Ile Glu Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu
210 215 220
Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Thr Tyr Met Leu Thr Asn
225 230 235 240
Ser Glu Leu Leu Ser Leu Ile Cys Asp Met Pro Ile Thr Asn Asp Gln
245 250 255
Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg Gln Gln Ser Tyr
260 265 270
Ser Phe Met Cys Val Val Lys Glu Glu Val Ile Ala Tyr Val Val Gln
275 280 285
Leu Pro Ile Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu His Thr
290 295 300
Ser Pro Leu Cys Thr Thr Asp Asn Lys Glu Gly Ser Asn Ile Cys Leu
305 310 315 320
Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser
325 330 335
Phe Phe Pro Gln Thr Glu Thr Cys Lys Val Gln Ser Asn Arg Val Phe
340 345 350
Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp Val Asn Leu Cys
355 360 365
Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile Met Thr Ser
370 375 380
Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly Ala Ile Val
385 390 395 400
Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly
405 410 415
Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly
420 425 430
Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu
435 440 445
Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr
450 455 460
Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln
465 470 475 480
Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp
485 490 495
Glu Leu Leu His Ser
500
<210> 6
<211> 494
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 6
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Asn Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Ser Gly Ser Ser
100 105 110
Gly Ser Ser Gly Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala
115 120 125
Ser Gly Val Ala Val Ser Lys Val Cys His Leu Glu Gly Glu Val Asn
130 135 140
Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser Ala
145 150 155 160
Gly Ile Thr Thr Pro Leu Ser Leu Ser Asn Gly Val Ser Leu Leu Thr
165 170 175
Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu Leu Leu Pro
180 185 190
Lys Val Asn Asn His Asp Cys Arg Ile Ser Lys Ile Glu Thr Val Ile
195 200 205
Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala Arg Glu Phe
210 215 220
Ser Val Asn Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile
225 230 235 240
Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn
245 250 255
Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Cys Val Val Lys
260 265 270
Glu Glu Val Ile Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile
275 280 285
Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp
290 295 300
Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp
305 310 315 320
Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Thr Glu Thr
325 330 335
Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu
340 345 350
Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr
355 360 365
Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser
370 375 380
Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys
385 390 395 400
Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn
405 410 415
Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly
420 425 430
Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile
435 440 445
Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser
450 455 460
Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln
465 470 475 480
Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490
<210> 7
<211> 494
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 7
Met Ala Ala Met Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Met Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asp Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Thr Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Val Pro Ala Cys Phe Asn Arg Ala Lys Ser Gly Ser Ser
100 105 110
Gly Ser Ser Gly Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala
115 120 125
Ser Gly Val Ala Val Ser Lys Val Leu Cys Leu Glu Gly Glu Val Asn
130 135 140
Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu
145 150 155 160
Ser Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn
165 170 175
Tyr Ile Asp Lys Glu Leu Leu Pro Lys Val Asn Asn His Asp Cys Arg
180 185 190
Ile Ser Asn Ile Glu Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg
195 200 205
Leu Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr
210 215 220
Pro Leu Ser Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile
225 230 235 240
Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn
245 250 255
Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Ser Cys Val Lys
260 265 270
Glu Glu Val Ile Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile
275 280 285
Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp
290 295 300
Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp
305 310 315 320
Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Ala Glu Thr
325 330 335
Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu
340 345 350
Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr
355 360 365
Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser
370 375 380
Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys
385 390 395 400
Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn
405 410 415
Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly
420 425 430
Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile
435 440 445
Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser
450 455 460
Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln
465 470 475 480
Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490
<210> 8
<211> 501
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 8
Met Ala Ala Met Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Met Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asp Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Thr Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Val Pro Ala Cys Phe Asn Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Ser Gly Ser Ser Gly Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu
115 120 125
Cys Ile Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser Lys Val Cys
130 135 140
His Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala Leu Leu Ser Thr
145 150 155 160
Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Leu Leu Thr Ser
165 170 175
Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu Leu Leu Pro Lys
180 185 190
Val Asn Asn His Asp Cys Arg Ile Ser Asn Ile Glu Thr Val Ile Glu
195 200 205
Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala Arg Glu Phe Ser
210 215 220
Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr Met Leu Thr Asn
225 230 235 240
Ser Glu Leu Leu Ser Leu Ile Cys Asp Met Pro Ile Thr Asn Asp Gln
245 250 255
Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg Gln Gln Ser Tyr
260 265 270
Ser Phe Met Cys Val Val Lys Glu Glu Val Ile Ala Tyr Val Val Gln
275 280 285
Leu Pro Ile Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu His Thr
290 295 300
Ser Pro Leu Cys Thr Thr Asp Asn Lys Glu Gly Ser Asn Ile Cys Leu
305 310 315 320
Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser
325 330 335
Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn Arg Val Phe
340 345 350
Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp Val Asn Leu Cys
355 360 365
Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile Met Thr Ser
370 375 380
Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly Ala Ile Val
385 390 395 400
Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly
405 410 415
Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly
420 425 430
Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu
435 440 445
Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr
450 455 460
Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln
465 470 475 480
Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp
485 490 495
Glu Leu Leu His Ser
500
<210> 9
<211> 494
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 9
Met Ala Ala Met Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Met Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asp Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Thr Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Val Pro Ala Cys Phe Asn Arg Ala Lys Ser Gly Ser Ser
100 105 110
Gly Ser Ser Gly Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala
115 120 125
Ser Gly Val Ala Val Ser Lys Val Cys His Leu Glu Gly Glu Val Asn
130 135 140
Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu
145 150 155 160
Ser Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn
165 170 175
Tyr Ile Asp Lys Glu Leu Leu Pro Lys Val Asn Asn His Asp Cys Arg
180 185 190
Ile Ser Asn Ile Glu Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg
195 200 205
Leu Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr
210 215 220
Pro Leu Ser Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile
225 230 235 240
Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn
245 250 255
Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Cys Leu Val Lys
260 265 270
Glu Glu Val Ile Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile
275 280 285
Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp
290 295 300
Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp
305 310 315 320
Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Ala Glu Thr
325 330 335
Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu
340 345 350
Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr
355 360 365
Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser
370 375 380
Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys
385 390 395 400
Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn
405 410 415
Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly
420 425 430
Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile
435 440 445
Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser
450 455 460
Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln
465 470 475 480
Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490
<210> 10
<211> 489
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 10
Met Arg Met Ile Ile Ser Ile Ile Leu Ile Ser Thr Tyr Val Pro His
1 5 10 15
Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe Tyr Gln Ser Thr Cys
20 25 30
Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu Arg Thr Gly Trp Tyr
35 40 45
Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile Gln Lys Asn Val Cys
50 55 60
Asn Gly Thr Asp Ser Lys Val Lys Leu Ile Lys Gln Glu Leu Glu Arg
65 70 75 80
Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu Met Gln Asn Glu Pro
85 90 95
Thr Cys Ser Ser Arg Ala Lys Ser Gly Ser Ser Gly Ser Ser Gly Leu
100 105 110
Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala Ser Gly Val Ala Val
115 120 125
Ser Lys Val Leu Cys Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala
130 135 140
Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser
145 150 155 160
Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu
165 170 175
Leu Leu Pro Lys Val Asn Asn His Asp Cys Arg Ile Ser Asn Ile Ala
180 185 190
Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala
195 200 205
Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr
210 215 220
Met Leu Thr Asn Ser Glu Leu Leu Ser Ile Ile Cys Asp Met Pro Ile
225 230 235 240
Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg
245 250 255
Gln Gln Ser Tyr Ser Phe Met Ser Cys Val Lys Glu Glu Val Ile Ala
260 265 270
Tyr Val Val Gln Leu Pro Leu Tyr Gly Val Ile Asp Thr Pro Cys Trp
275 280 285
Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp Asn Glu Glu Gly Ser
290 295 300
Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala
305 310 315 320
Gly Ser Val Ser Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser
325 330 335
Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp
340 345 350
Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Ala Lys Tyr Asp Cys Lys
355 360 365
Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile
370 375 380
Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn
385 390 395 400
Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val
405 410 415
Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr
420 425 430
Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile
435 440 445
Ile Asn Tyr Tyr Asn Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala
450 455 460
Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile
465 470 475 480
Arg Arg Ser Asp Glu Leu Leu His Ser
485
<210> 11
<211> 496
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 11
Met Arg Met Ile Ile Ser Ile Ile Leu Ile Ser Thr Tyr Val Pro His
1 5 10 15
Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe Tyr Gln Ser Thr Cys
20 25 30
Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu Arg Thr Gly Trp Tyr
35 40 45
Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile Gln Lys Asn Val Cys
50 55 60
Asn Gly Thr Asp Ser Lys Val Lys Leu Ile Lys Gln Glu Leu Glu Arg
65 70 75 80
Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu Met Gln Asn Glu Pro
85 90 95
Thr Cys Ser Ser Arg Ala Lys Arg Gly Ile Pro Glu Ser Gly Ser Ser
100 105 110
Gly Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala
115 120 125
Ile Ala Ser Gly Val Ala Val Ser Lys Val Cys His Leu Glu Gly Glu
130 135 140
Val Asn Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val
145 150 155 160
Ser Leu Ser Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu
165 170 175
Lys Asn Tyr Ile Asp Lys Glu Leu Leu Pro Lys Val Asn Asn His Asp
180 185 190
Cys Arg Ile Ser Asn Ile Ala Thr Val Ile Glu Phe Gln Gln Lys Asn
195 200 205
Asn Arg Leu Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Ala Gly Ile
210 215 220
Thr Thr Pro Leu Ser Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser
225 230 235 240
Ile Ile Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser
245 250 255
Ser Asn Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Cys Val
260 265 270
Val Lys Glu Glu Val Ile Ala Tyr Val Val Gln Leu Pro Leu Tyr Gly
275 280 285
Val Ile Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr
290 295 300
Thr Asp Asn Glu Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg
305 310 315 320
Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Ala
325 330 335
Glu Thr Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn
340 345 350
Ser Leu Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe
355 360 365
Asn Ala Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser
370 375 380
Cys Ser Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys
385 390 395 400
Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe
405 410 415
Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr Val Ser
420 425 430
Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu
435 440 445
Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asn Pro Leu Val Phe
450 455 460
Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile
465 470 475 480
Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490 495
<210> 12
<211> 489
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 12
Met Arg Met Ile Ile Ser Ile Ile Leu Ile Ser Thr Tyr Val Pro His
1 5 10 15
Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe Tyr Gln Ser Thr Cys
20 25 30
Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu Arg Thr Gly Trp Tyr
35 40 45
Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile Gln Lys Asn Val Cys
50 55 60
Asn Gly Thr Asp Ser Lys Val Lys Leu Ile Lys Gln Glu Leu Glu Arg
65 70 75 80
Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu Met Gln Asn Glu Pro
85 90 95
Thr Cys Ser Ser Arg Ala Lys Ser Gly Ser Ser Gly Ser Ser Gly Leu
100 105 110
Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala Ser Gly Val Ala Val
115 120 125
Ser Lys Val Cys His Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala
130 135 140
Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser
145 150 155 160
Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu
165 170 175
Leu Leu Pro Lys Val Asn Asn His Asp Cys Arg Ile Ser Asn Ile Ala
180 185 190
Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala
195 200 205
Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr
210 215 220
Met Leu Thr Asn Ser Glu Leu Leu Ser Ile Ile Cys Asp Met Pro Ile
225 230 235 240
Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg
245 250 255
Gln Gln Ser Tyr Ser Phe Met Cys Val Val Lys Glu Glu Val Ile Ala
260 265 270
Tyr Val Val Gln Leu Pro Leu Tyr Gly Val Ile Asp Thr Pro Cys Trp
275 280 285
Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp Asn Glu Glu Gly Ser
290 295 300
Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala
305 310 315 320
Gly Ser Val Ser Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser
325 330 335
Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp
340 345 350
Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Ala Lys Tyr Asp Cys Lys
355 360 365
Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile
370 375 380
Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn
385 390 395 400
Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val
405 410 415
Ser Asn Lys Gly Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr
420 425 430
Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile
435 440 445
Ile Asn Tyr Tyr Asn Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala
450 455 460
Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile
465 470 475 480
Arg Arg Ser Asp Glu Leu Leu His Ser
485
<210> 13
<211> 494
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 13
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Asn Ser Thr Asp Ser Asn Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Ser Ser Arg Ala Lys Ser Gly Ser Ser
100 105 110
Gly Ser Ser Gly Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala
115 120 125
Ser Gly Val Ala Val Ser Lys Val Leu Cys Leu Glu Gly Glu Val Asn
130 135 140
Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu
145 150 155 160
Ser Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn
165 170 175
Tyr Ile Asp Lys Glu Leu Leu Pro Lys Val Asn Asn His Asp Cys Lys
180 185 190
Ile Ser Asn Ile Ala Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg
195 200 205
Leu Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr
210 215 220
Pro Leu Ser Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile
225 230 235 240
Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn
245 250 255
Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Ser Cys Val Lys
260 265 270
Glu Glu Val Met Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile
275 280 285
Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp
290 295 300
Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp
305 310 315 320
Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Ala Glu Thr
325 330 335
Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu
340 345 350
Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Ala
355 360 365
Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser
370 375 380
Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys
385 390 395 400
Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn
405 410 415
Gly Cys Asp Tyr Val Ser Asn Arg Gly Val Asp Thr Val Ser Val Gly
420 425 430
Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile
435 440 445
Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser
450 455 460
Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln
465 470 475 480
Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490
<210> 14
<211> 501
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 14
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Asn Ser Thr Asp Ser Asn Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Ser Ser Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Ser Gly Ser Ser Gly Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu
115 120 125
Cys Ile Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser Lys Val Cys
130 135 140
His Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala Leu Leu Ser Thr
145 150 155 160
Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Leu Leu Thr Ser
165 170 175
Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu Leu Leu Pro Lys
180 185 190
Val Asn Asn His Asp Cys Lys Ile Ser Asn Ile Ala Thr Val Ile Glu
195 200 205
Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala Arg Glu Phe Ser
210 215 220
Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr Met Leu Thr Asn
225 230 235 240
Ser Glu Leu Leu Ser Leu Ile Cys Asp Met Pro Ile Thr Asn Asp Gln
245 250 255
Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg Gln Gln Ser Tyr
260 265 270
Ser Phe Met Cys Val Val Lys Glu Glu Val Met Ala Tyr Val Val Gln
275 280 285
Leu Pro Ile Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu His Thr
290 295 300
Ser Pro Leu Cys Thr Thr Asp Asn Lys Glu Gly Ser Asn Ile Cys Leu
305 310 315 320
Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser
325 330 335
Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn Arg Val Phe
340 345 350
Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp Val Asn Leu Cys
355 360 365
Asn Thr Asp Ile Phe Asn Ala Lys Tyr Asp Cys Lys Ile Met Thr Ser
370 375 380
Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly Ala Ile Val
385 390 395 400
Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly
405 410 415
Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Arg Gly
420 425 430
Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu
435 440 445
Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr
450 455 460
Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln
465 470 475 480
Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp
485 490 495
Glu Leu Leu His Ser
500
<210> 15
<211> 494
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 15
Met Ala Thr Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Val Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Asn Ser Thr Asp Ser Asn Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Val Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Ser Ser Arg Ala Lys Ser Gly Ser Ser
100 105 110
Gly Ser Ser Gly Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala
115 120 125
Ser Gly Val Ala Val Ser Lys Val Cys His Leu Glu Gly Glu Val Asn
130 135 140
Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser Leu
145 150 155 160
Ser Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu Lys Asn
165 170 175
Tyr Ile Asp Lys Glu Leu Leu Pro Lys Val Asn Asn His Asp Cys Lys
180 185 190
Ile Ser Asn Ile Ala Thr Val Ile Glu Phe Gln Gln Lys Asn Asn Arg
195 200 205
Leu Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Ala Gly Ile Thr Thr
210 215 220
Pro Leu Ser Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile
225 230 235 240
Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser Asn
245 250 255
Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Cys Val Val Lys
260 265 270
Glu Glu Val Met Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val Ile
275 280 285
Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr Asp
290 295 300
Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp
305 310 315 320
Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Ala Glu Thr
325 330 335
Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser Leu
340 345 350
Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn Ala
355 360 365
Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys Ser
370 375 380
Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys
385 390 395 400
Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn
405 410 415
Gly Cys Asp Tyr Val Ser Asn Arg Gly Val Asp Thr Val Ser Val Gly
420 425 430
Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile
435 440 445
Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser
450 455 460
Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn Gln
465 470 475 480
Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
485 490
<210> 16
<211> 504
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 16
Met Ala Ala Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Met Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Ile Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Ser Gly Ser Ser Gly Ser Ser Gly Arg Lys Arg Arg Phe Leu Gly
115 120 125
Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser
130 135 140
Lys Val Leu Cys Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala Leu
145 150 155 160
Leu Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Leu
165 170 175
Leu Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu Leu
180 185 190
Leu Pro Lys Val Asn Asn His Asp Cys Arg Ile Ser Asn Ile Glu Thr
195 200 205
Val Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala Arg
210 215 220
Glu Phe Ser Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr Met
225 230 235 240
Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile Cys Asp Met Pro Ile Thr
245 250 255
Asn Asp Gln Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg Gln
260 265 270
Gln Ser Tyr Ser Phe Met Leu Cys Val Lys Glu Glu Val Ile Ala Tyr
275 280 285
Val Val Gln Leu Pro Ile Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys
290 295 300
Leu His Thr Ser Pro Leu Cys Thr Thr Asp Asn Lys Glu Gly Ser Asn
305 310 315 320
Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly
325 330 335
Ser Val Ser Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn
340 345 350
Arg Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp Val
355 360 365
Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile
370 375 380
Met Thr Ser Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly
385 390 395 400
Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys
405 410 415
Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser
420 425 430
Asn Lys Gly Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val
435 440 445
Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile
450 455 460
Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser
465 470 475 480
Ile Ala Gln Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg
485 490 495
Arg Ser Asp Glu Leu Leu His Ser
500
<210> 17
<211> 511
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 17
Met Ala Ala Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Met Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Ile Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Arg Gly Ile Pro Glu Ser Gly Ser Ser Gly Arg Lys Arg Arg Phe
115 120 125
Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu Cys Ile Gly Ser Ala Ile
130 135 140
Ala Ser Gly Val Ala Val Ser Lys Val Cys His Leu Glu Gly Glu Val
145 150 155 160
Asn Lys Ile Lys Asn Ala Leu Leu Ser Thr Asn Lys Ala Val Val Ser
165 170 175
Leu Ser Asn Gly Val Ser Leu Leu Thr Ser Lys Val Leu Asp Leu Lys
180 185 190
Asn Tyr Ile Asp Lys Glu Leu Leu Pro Lys Val Asn Asn His Asp Cys
195 200 205
Arg Ile Ser Asn Ile Glu Thr Val Ile Glu Phe Gln Gln Lys Asn Asn
210 215 220
Arg Leu Leu Glu Ile Ala Arg Glu Phe Ser Val Asn Ala Gly Ile Thr
225 230 235 240
Thr Pro Leu Ser Thr Tyr Met Leu Thr Asn Ser Glu Leu Leu Ser Leu
245 250 255
Ile Cys Asp Met Pro Ile Thr Asn Asp Gln Lys Lys Leu Met Ser Ser
260 265 270
Asn Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Phe Met Cys Val Val
275 280 285
Lys Glu Glu Val Ile Ala Tyr Val Val Gln Leu Pro Ile Tyr Gly Val
290 295 300
Ile Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro Leu Cys Thr Thr
305 310 315 320
Asp Asn Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg Thr Asp Arg Gly
325 330 335
Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe Pro Gln Ala Glu
340 345 350
Thr Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp Thr Met Asn Ser
355 360 365
Leu Thr Leu Pro Thr Asp Val Asn Leu Cys Asn Thr Asp Ile Phe Asn
370 375 380
Thr Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp Ile Ser Cys
385 390 395 400
Ser Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr Gly Lys Thr
405 410 415
Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys Thr Phe Ser
420 425 430
Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr Val Ser Val
435 440 445
Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys Ala Leu Tyr
450 455 460
Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu Val Phe Pro
465 470 475 480
Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala Lys Ile Asn
485 490 495
Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu His Ser
500 505 510
<210> 18
<211> 504
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 18
Met Ala Ala Thr Ala Met Arg Met Ile Ile Ser Ile Ile Phe Ile Ser
1 5 10 15
Thr Tyr Met Thr His Ile Thr Leu Cys Gln Asn Ile Thr Glu Glu Phe
20 25 30
Tyr Gln Ser Thr Cys Ser Ala Val Ser Arg Gly Tyr Leu Ser Ala Leu
35 40 45
Arg Thr Gly Trp Tyr Thr Ser Val Val Thr Ile Glu Leu Ser Lys Ile
50 55 60
Gln Lys Asn Val Cys Lys Ser Thr Asp Ser Lys Val Lys Leu Ile Lys
65 70 75 80
Gln Glu Leu Glu Arg Tyr Asn Asn Ala Val Ile Glu Leu Gln Ser Leu
85 90 95
Met Gln Asn Glu Pro Ala Cys Phe Ser Arg Ala Lys Arg Gly Ile Pro
100 105 110
Glu Ser Gly Ser Ser Gly Ser Ser Gly Arg Lys Arg Arg Phe Leu Gly
115 120 125
Phe Leu Leu Cys Ile Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser
130 135 140
Lys Val Cys His Leu Glu Gly Glu Val Asn Lys Ile Lys Asn Ala Leu
145 150 155 160
Leu Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Leu
165 170 175
Leu Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Glu Leu
180 185 190
Leu Pro Lys Val Asn Asn His Asp Cys Arg Ile Ser Asn Ile Glu Thr
195 200 205
Val Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Ala Arg
210 215 220
Glu Phe Ser Val Asn Ala Gly Ile Thr Thr Pro Leu Ser Thr Tyr Met
225 230 235 240
Leu Thr Asn Ser Glu Leu Leu Ser Leu Ile Cys Asp Met Pro Ile Thr
245 250 255
Asn Asp Gln Lys Lys Leu Met Ser Ser Asn Val Gln Ile Val Arg Gln
260 265 270
Gln Ser Tyr Ser Phe Met Cys Val Val Lys Glu Glu Val Ile Ala Tyr
275 280 285
Val Val Gln Leu Pro Ile Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys
290 295 300
Leu His Thr Ser Pro Leu Cys Thr Thr Asp Asn Lys Glu Gly Ser Asn
305 310 315 320
Ile Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly
325 330 335
Ser Val Ser Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn
340 345 350
Arg Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Thr Asp Val
355 360 365
Asn Leu Cys Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile
370 375 380
Met Thr Ser Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly
385 390 395 400
Ala Ile Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys
405 410 415
Asn Arg Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser
420 425 430
Asn Lys Gly Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val
435 440 445
Asn Lys Leu Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile
450 455 460
Asn Tyr Tyr Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser
465 470 475 480
Ile Ala Gln Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg
485 490 495
Arg Ser Asp Glu Leu Leu His Ser
500
<210> 19
<211> 517
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 19
Met Ala Leu Ser Lys Val Lys Leu Asn Asp Thr Phe Asn Lys Asp Gln
1 5 10 15
Leu Leu Ser Thr Ser Lys Tyr Thr Ile Gln Arg Ser Thr Gly Asp Asn
20 25 30
Ile Asp Ile Pro Asn Tyr Asp Val Gln Lys His Leu Asn Lys Leu Cys
35 40 45
Gly Met Leu Leu Ile Thr Glu Asp Ala Asn His Lys Phe Thr Gly Leu
50 55 60
Ile Gly Met Leu Tyr Ala Met Ser Arg Leu Gly Arg Glu Asp Thr Leu
65 70 75 80
Lys Ile Leu Lys Asp Ala Gly Tyr Gln Val Arg Ala Asn Gly Val Asp
85 90 95
Val Ile Thr His Arg Gln Cys Val Asn Gly Lys Ser Gly Ser Ser Gly
100 105 110
Ser Ser Gly Gln Gly Asn Ile Glu Cys Glu Ser Arg Lys Ser Tyr Lys
115 120 125
Lys Met Leu Lys Glu Met Gly Glu Val Ala Cys Glu Tyr Arg His Asp
130 135 140
Phe Pro Asp Cys Gly Met Ile Val Leu Cys Val Ala Ala Leu Val Ile
145 150 155 160
Thr Lys Leu Leu Ala Gly Asp Arg Ser Gly Leu Thr Ala Val Ile Arg
165 170 175
Arg Ala Asn Asn Val Leu Arg Asn Glu Met Lys Arg Tyr Lys Gly Leu
180 185 190
Ile Pro Lys Asp Ile Ala Asn Ser Phe Tyr Glu Val Phe Glu Lys Tyr
195 200 205
Pro His Tyr Ile Asp Val Phe Val His Phe Gly Ile Ala Gln Ser Ser
210 215 220
Thr Arg Gly Gly Ser Arg Val Glu Gly Ile Phe Ala Cys Leu Phe Met
225 230 235 240
Asn Ala Tyr Gly Ala Gly Gln Val Met Leu Arg Trp Gly Val Leu Ala
245 250 255
Lys Ser Val Lys Asn Phe Met Leu Cys His Ala Ser Val Gln Ala Glu
260 265 270
Met Glu Gln Val Val Glu Val Tyr Glu Tyr Ala Gln Lys Leu Gly Gly
275 280 285
Glu Ala Gly Phe Tyr His Ile Leu Asn Asn Pro Lys Ala Ser Leu Leu
290 295 300
Ser Leu Thr Gln Phe Pro Asn Phe Ser Ser Val Val Leu Gly Asn Ala
305 310 315 320
Ala Gly Leu Gly Ile Met Gly Glu Tyr Arg Gly Thr Pro Arg Asn Gln
325 330 335
Asp Leu Tyr Asp Ala Ala Lys Ala Tyr Ala Glu Gln Leu Lys Glu Asn
340 345 350
Gly Val Ile Asn Tyr Ser Val Leu Asp Leu Thr Thr Glu Glu Leu Glu
355 360 365
Ala Ile Lys Asn Gln Leu Asn Pro Lys Asp Asn Asp Val Glu Leu Cys
370 375 380
Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile Met Thr Ser
385 390 395 400
Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly Ala Ile Val
405 410 415
Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly
420 425 430
Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly
435 440 445
Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu
450 455 460
Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr
465 470 475 480
Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln
485 490 495
Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp
500 505 510
Glu Leu Leu His Ser
515
<210> 20
<211> 514
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 20
Met Ala Leu Ser Lys Val Lys Leu Asn Asp Thr Phe Asn Lys Asp Gln
1 5 10 15
Leu Leu Ser Thr Ser Lys Tyr Thr Ile Gln Arg Ser Thr Gly Asp Asn
20 25 30
Ile Asp Ile Pro Asn Tyr Asp Val Gln Lys His Leu Asn Lys Leu Cys
35 40 45
Gly Met Leu Leu Ile Thr Glu Asp Ala Asn His Lys Phe Thr Gly Leu
50 55 60
Ile Gly Met Leu Tyr Ala Met Ser Arg Leu Gly Arg Glu Asp Thr Leu
65 70 75 80
Lys Ile Leu Lys Asp Ala Gly Tyr Gln Val Arg Ala Asn Gly Val Asp
85 90 95
Val Ile Thr His Arg Gln Cys Val Asn Gly Lys Ser Gly Ser Ser Gly
100 105 110
Gln Gly Asn Ile Glu Cys Glu Ser Arg Lys Ser Tyr Lys Lys Met Leu
115 120 125
Lys Glu Met Gly Glu Val Ala Cys Glu Tyr Arg His Asp Phe Pro Asp
130 135 140
Cys Gly Met Ile Val Leu Cys Val Ala Ala Leu Val Ile Thr Lys Leu
145 150 155 160
Leu Ala Gly Asp Arg Ser Gly Leu Thr Ala Val Ile Arg Arg Ala Asn
165 170 175
Asn Val Leu Arg Asn Glu Met Lys Arg Tyr Lys Gly Leu Ile Pro Lys
180 185 190
Asp Ile Ala Asn Ser Phe Tyr Glu Val Phe Glu Lys Tyr Pro His Tyr
195 200 205
Ile Asp Val Phe Val His Phe Gly Ile Ala Gln Ser Ser Thr Arg Gly
210 215 220
Gly Ser Arg Val Glu Gly Ile Phe Ala Cys Leu Phe Met Asn Ala Tyr
225 230 235 240
Gly Ala Gly Gln Val Met Leu Arg Trp Gly Val Leu Ala Lys Ser Val
245 250 255
Lys Asn Phe Met Leu Cys His Ala Ser Val Gln Ala Glu Met Glu Gln
260 265 270
Val Val Glu Val Tyr Glu Tyr Ala Gln Lys Leu Gly Gly Glu Ala Gly
275 280 285
Phe Tyr His Ile Leu Asn Asn Pro Lys Ala Ser Leu Leu Ser Leu Thr
290 295 300
Gln Phe Pro Asn Phe Ser Ser Val Val Leu Gly Asn Ala Ala Gly Leu
305 310 315 320
Gly Ile Met Gly Glu Tyr Arg Gly Thr Pro Arg Asn Gln Asp Leu Tyr
325 330 335
Asp Ala Ala Lys Ala Tyr Ala Glu Gln Leu Lys Glu Asn Gly Val Ile
340 345 350
Asn Tyr Ser Val Leu Asp Leu Thr Thr Glu Glu Leu Glu Ala Ile Lys
355 360 365
Asn Gln Leu Asn Pro Lys Asp Asn Asp Val Glu Leu Cys Asn Thr Asp
370 375 380
Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr Asp
385 390 395 400
Ile Ser Cys Ser Val Ile Thr Ser Ile Gly Ala Ile Val Ser Cys Tyr
405 410 415
Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile Lys
420 425 430
Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp Thr
435 440 445
Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu Glu Gly Lys
450 455 460
Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr Asp Pro Leu
465 470 475 480
Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln Val Asn Ala
485 490 495
Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp Glu Leu Leu
500 505 510
His Ser
<210> 21
<211> 517
<212> PRT
<213>artificial sequence (Artificial Sequence)
<400> 21
Met Ala Leu Ser Lys Val Lys Leu Asn Asp Thr Phe Asn Lys Asp Gln
1 5 10 15
Leu Leu Ser Thr Ser Lys Tyr Thr Ile Gln Arg Ser Thr Gly Asp Asn
20 25 30
Ile Asp Ile Pro Asn Tyr Asp Val Gln Lys His Leu Asn Lys Leu Cys
35 40 45
Gly Met Leu Leu Ile Thr Glu Asp Ala Asn His Lys Phe Thr Gly Leu
50 55 60
Ile Gly Met Leu Tyr Ala Met Ser Arg Leu Gly Arg Glu Asp Thr Leu
65 70 75 80
Lys Ile Leu Lys Asp Ala Gly Tyr Gln Val Arg Ala Asn Gly Val Asp
85 90 95
Val Ile Thr His Arg Gln Cys Val Asn Gly Lys Ser Gly Ser Ser Gly
100 105 110
Ser Ser Gly Gln Gly Asn Ile Glu Cys Glu Ser Arg Lys Ser Tyr Lys
115 120 125
Lys Met Leu Lys Glu Met Gly Glu Val Ala Cys Glu Tyr Arg His Asp
130 135 140
Phe Pro Asp Cys Gly Met Ile Val Leu Cys Val Ala Ala Leu Val Ile
145 150 155 160
Thr Lys Leu Leu Ala Gly Asp Arg Ser Gly Leu Thr Ala Val Ile Arg
165 170 175
Arg Ala Asn Asn Val Leu Arg Asn Glu Met Lys Arg Tyr Lys Gly Leu
180 185 190
Ile Pro Lys Asp Ile Ala Asn Ser Phe Tyr Glu Val Phe Glu Lys Tyr
195 200 205
Pro His Tyr Ile Asp Val Phe Val His Phe Gly Ile Ala Gln Ser Ser
210 215 220
Thr Arg Gly Gly Ser Arg Val Glu Gly Ile Phe Ala Cys Leu Phe Met
225 230 235 240
Asn Ala Tyr Gly Ala Gly Gln Val Met Leu Arg Trp Gly Val Leu Ala
245 250 255
Lys Ser Val Lys Asn Phe Met Leu Cys His Ala Ser Val Gln Ala Glu
260 265 270
Met Glu Gln Val Val Glu Val Tyr Glu Tyr Ala Gln Lys Leu Gly Gly
275 280 285
Glu Ala Gly Phe Tyr His Ile Leu Asn Asn Pro Lys Ala Ser Leu Leu
290 295 300
Ser Leu Thr Gln Phe Pro Asn Phe Ser Ser Val Val Leu Gly Asn Ala
305 310 315 320
Ala Gly Leu Gly Ile Met Gly Glu Tyr Arg Gly Thr Pro Arg Asn Gln
325 330 335
Asp Leu Tyr Asp Ala Ala Lys Ala Tyr Ala Glu Gln Leu Lys Glu Asn
340 345 350
Gly Val Ile Asn Tyr Ser Val Leu Asp Leu Thr Thr Glu Glu Leu Glu
355 360 365
Ala Ile Lys Asn Gln Leu Asn Pro Lys Asp Asn Asp Val Glu Leu Cys
370 375 380
Asn Thr Asp Ile Phe Asn Thr Lys Tyr Asp Cys Lys Ile Met Thr Ser
385 390 395 400
Lys Thr Asp Ile Ser Cys Ser Val Ile Thr Ser Ile Gly Ala Ile Val
405 410 415
Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly
420 425 430
Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly
435 440 445
Val Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Leu
450 455 460
Glu Gly Lys Ala Leu Tyr Ile Lys Gly Glu Pro Ile Ile Asn Tyr Tyr
465 470 475 480
Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ala Gln
485 490 495
Val Asn Ala Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Arg Ser Asp
500 505 510
Glu Leu Leu His Ser
515
<210> 22
<211> 1482
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 22
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagagcggc agcagcggca gcagcggcct gggcttcctg 360
ctgtgcatcg gcagcgccgt ggccagcggc gtggccgtga gcaaggtgct gtgcctggag 420
ggcgaggtga acaagatcaa gaacgccctg ctgagcacca acaaggccgt ggtgagcctg 480
agcaacggcg tgagcctgct gaccagcaag gtgctggacc tgaagaacta catcgacaag 540
gagctgctgc cccaggtgaa caaccacgac tgcaggatca gcaacatcga gaccgtgatc 600
gagttccagc agaagaacaa caggctgctg gagatcgcca gggagttcag cgtgaacgcc 660
ggcatcacca cccccctgag cacctacatg ctgaccaaca gcgagctgct gagcctgatc 720
tgcgacatgc ccatcaccaa cgaccagaag aagctgatga gcagcaacgt gcagatcgtg 780
aggcagcaga gctacagctt catgagctgc gtgaaggagg aggtgatcgc ctacgtggtg 840
cagctgccca tctacggcgt gatcgacacc ccctgctgga agctgcacac cagccccctg 900
tgcaccaccg acaacaagga gggcagcaac atctgcctga ccaggaccga caggggctgg 960
tactgcgaca acgccggcag cgtgagcttc ttcccccaga ccgagacctg caaggtgcag 1020
agcaacaggg tgttctgcga caccatgaac agcctgaccc tgcccaccga cgtgaacctg 1080
tgcaacaccg acatcttcaa caccaagtac gactgcaaga tcatgaccag caagaccgac 1140
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1200
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1260
gtgagcaaca agggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1320
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1380
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1440
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1482
<210> 23
<211> 1503
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 23
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagaggggc atccccgaga gcggcagcag cggcaagagg 360
aagaggaggt tcctgggctt cctgctgtgc atcggcagcg ccgtggccag cggcgtggcc 420
gtgagcaagg tgtgccacct ggagggcgag gtgaacaaga tcaagaacgc cctgctgagc 480
accaacaagg ccgtggtgag cctgagcaac ggcgtgagcc tgctgaccag caaggtgctg 540
gacctgaaga actacatcga caaggagctg ctgccccagg tgaacaacca cgactgcagg 600
atcagcaaca tcgagaccgt gatcgagttc cagcagaaga acaacaggct gctggagatc 660
gccagggagt tcagcgtgaa cgccggcatc accacccccc tgagcaccta catgctgacc 720
aacagcgagc tgctgagcct gatctgcgac atgcccatca ccaacgacca gaagaagctg 780
atgagcagca acgtgcagat cgtgaggcag cagagctaca gcttcatgtg cgtggtgaag 840
gaggaggtga tcgcctacgt ggtgcagctg cccatctacg gcgtgatcga caccccctgc 900
tggaagctgc acaccagccc cctgtgcacc accgacaaca aggagggcag caacatctgc 960
ctgaccagga ccgacagggg ctggtactgc gacaacgccg gcagcgtgag cttcttcccc 1020
cagaccgaga cctgcaaggt gcagagcaac agggtgttct gcgacaccat gaacagcctg 1080
accctgccca ccgacgtgaa cctgtgcaac accgacatct tcaacaccaa gtacgactgc 1140
aagatcatga ccagcaagac cgacatcagc tgcagcgtga tcaccagcat cggcgccatc 1200
gtgagctgct acggcaagac caagtgcacc gccagcaaca agaacagggg catcatcaag 1260
accttcagca acggctgcga ctacgtgagc aacaagggcg tggacaccgt gagcgtgggc 1320
aacaccctgt actacgtgaa caagctggag ggcaaggccc tgtacatcaa gggcgagccc 1380
atcatcaact actacgaccc cctggtgttc cccagcgacg agttcgacgc cagcatcgcc 1440
caggtgaacg ccaagatcaa ccagagcctg gccttcatca ggaggagcga cgagctgctg 1500
cac 1503
<210> 24
<211> 1512
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 24
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagaggggc atccccgaga gcggcagcag cggcagcagc 360
ggcaagagga agaggaggtt cctgggcttc ctgctgtgca tcggcagcgc cgtggccagc 420
ggcgtggccg tgagcaaggt gtgccacctg gagggcgagg tgaacaagat caagaacgcc 480
ctgctgagca ccaacaaggc cgtggtgagc ctgagcaacg gcgtgagcct gctgaccagc 540
aaggtgctgg acctgaagaa ctacatcgac aaggagctgc tgccccaggt gaacaaccac 600
gactgcagga tcagcaacat cgagaccgtg atcgagttcc agcagaagaa caacaggctg 660
ctggagatcg ccagggagtt cagcgtgaac gccggcatca ccacccccct gagcacctac 720
atgctgacca acagcgagct gctgagcctg atctgcgaca tgcccatcac caacgaccag 780
aagaagctga tgagcagcaa cgtgcagatc gtgaggcagc agagctacag cttcatgtgc 840
gtggtgaagg aggaggtgat cgcctacgtg gtgcagctgc ccatctacgg cgtgatcgac 900
accccctgct ggaagctgca caccagcccc ctgtgcacca ccgacaacaa ggagggcagc 960
aacatctgcc tgaccaggac cgacaggggc tggtactgcg acaacgccgg cagcgtgagc 1020
ttcttccccc agaccgagac ctgcaaggtg cagagcaaca gggtgttctg cgacaccatg 1080
aacagcctga ccctgcccac cgacgtgaac ctgtgcaaca ccgacatctt caacaccaag 1140
tacgactgca agatcatgac cagcaagacc gacatcagct gcagcgtgat caccagcatc 1200
ggcgccatcg tgagctgcta cggcaagacc aagtgcaccg ccagcaacaa gaacaggggc 1260
atcatcaaga ccttcagcaa cggctgcgac tacgtgagca acaagggcgt ggacaccgtg 1320
agcgtgggca acaccctgta ctacgtgaac aagctggagg gcaaggccct gtacatcaag 1380
ggcgagccca tcatcaacta ctacgacccc ctggtgttcc ccagcgacga gttcgacgcc 1440
agcatcgccc aggtgaacgc caagatcaac cagagcctgg ccttcatcag gaggagcgac 1500
gagctgctgc ac 1512
<210> 25
<211> 1482
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 25
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaac agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagagcggc agcagcggca gcagcggcct gggcttcctg 360
ctgtgcatcg gcagcgccat cgccagcggc gtggccgtga gcaaggtgct gtgcctggag 420
ggcgaggtga acaagatcaa gaacgccctg ctgagcacca acaaggccgt ggtgagcgcc 480
ggcatcacca cccccctgag cctgagcaac ggcgtgagcc tgctgaccag caaggtgctg 540
gacctgaaga actacatcga caaggagctg ctgcccaagg tgaacaacca cgactgcagg 600
atcagcaaga tcgagaccgt gatcgagttc cagcagaaga acaacaggct gctggagatc 660
gccagggagt tcagcgtgaa cacctacatg ctgaccaaca gcgagctgct gagcctgatc 720
tgcgacatgc ccatcaccaa cgaccagaag aagctgatga gcagcaacgt gcagatcgtg 780
aggcagcaga gctacagctt catgagctgc gtgaaggagg aggtgatcgc ctacgtggtg 840
cagctgccca tctacggcgt gatcgacacc ccctgctgga agctgcacac cagccccctg 900
tgcaccaccg acaacaagga gggcagcaac atctgcctga ccaggaccga caggggctgg 960
tactgcgaca acgccggcag cgtgagcttc ttcccccaga ccgagacctg caaggtgcag 1020
agcaacaggg tgttctgcga caccatgaac agcctgaccc tgcccaccga cgtgaacctg 1080
tgcaacaccg acatcttcaa caccaagtac gactgcaaga tcatgaccag caagaccgac 1140
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1200
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1260
gtgagcaaca agggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1320
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1380
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1440
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1482
<210> 26
<211> 1503
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 26
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaac agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagaggggc atccccgaga gcggcagcag cggcaggaag 360
aggaggttcc tgggcttcct gctgtgcatc ggcagcgcca tcgccagcgg cgtggccgtg 420
agcaaggtgt gccacctgga gggcgaggtg aacaagatca agaacgccct gctgagcacc 480
aacaaggccg tggtgagcgc cggcatcacc acccccctga gcctgagcaa cggcgtgagc 540
ctgctgacca gcaaggtgct ggacctgaag aactacatcg acaaggagct gctgcccaag 600
gtgaacaacc acgactgcag gatcagcaag atcgagaccg tgatcgagtt ccagcagaag 660
aacaacaggc tgctggagat cgccagggag ttcagcgtga acacctacat gctgaccaac 720
agcgagctgc tgagcctgat ctgcgacatg cccatcacca acgaccagaa gaagctgatg 780
agcagcaacg tgcagatcgt gaggcagcag agctacagct tcatgtgcgt ggtgaaggag 840
gaggtgatcg cctacgtggt gcagctgccc atctacggcg tgatcgacac cccctgctgg 900
aagctgcaca ccagccccct gtgcaccacc gacaacaagg agggcagcaa catctgcctg 960
accaggaccg acaggggctg gtactgcgac aacgccggca gcgtgagctt cttcccccag 1020
accgagacct gcaaggtgca gagcaacagg gtgttctgcg acaccatgaa cagcctgacc 1080
ctgcccaccg acgtgaacct gtgcaacacc gacatcttca acaccaagta cgactgcaag 1140
atcatgacca gcaagaccga catcagctgc agcgtgatca ccagcatcgg cgccatcgtg 1200
agctgctacg gcaagaccaa gtgcaccgcc agcaacaaga acaggggcat catcaagacc 1260
ttcagcaacg gctgcgacta cgtgagcaac aagggcgtgg acaccgtgag cgtgggcaac 1320
accctgtact acgtgaacaa gctggagggc aaggccctgt acatcaaggg cgagcccatc 1380
atcaactact acgaccccct ggtgttcccc agcgacgagt tcgacgccag catcgcccag 1440
gtgaacgcca agatcaacca gagcctggcc ttcatcagga ggagcgacga gctgctgcac 1500
agc 1503
<210> 27
<211> 1482
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 27
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaac agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagagcggc agcagcggca gcagcggcct gggcttcctg 360
ctgtgcatcg gcagcgccat cgccagcggc gtggccgtga gcaaggtgtg ccacctggag 420
ggcgaggtga acaagatcaa gaacgccctg ctgagcacca acaaggccgt ggtgagcgcc 480
ggcatcacca cccccctgag cctgagcaac ggcgtgagcc tgctgaccag caaggtgctg 540
gacctgaaga actacatcga caaggagctg ctgcccaagg tgaacaacca cgactgcagg 600
atcagcaaga tcgagaccgt gatcgagttc cagcagaaga acaacaggct gctggagatc 660
gccagggagt tcagcgtgaa cacctacatg ctgaccaaca gcgagctgct gagcctgatc 720
tgcgacatgc ccatcaccaa cgaccagaag aagctgatga gcagcaacgt gcagatcgtg 780
aggcagcaga gctacagctt catgtgcgtg gtgaaggagg aggtgatcgc ctacgtggtg 840
cagctgccca tctacggcgt gatcgacacc ccctgctgga agctgcacac cagccccctg 900
tgcaccaccg acaacaagga gggcagcaac atctgcctga ccaggaccga caggggctgg 960
tactgcgaca acgccggcag cgtgagcttc ttcccccaga ccgagacctg caaggtgcag 1020
agcaacaggg tgttctgcga caccatgaac agcctgaccc tgcccaccga cgtgaacctg 1080
tgcaacaccg acatcttcaa caccaagtac gactgcaaga tcatgaccag caagaccgac 1140
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1200
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1260
gtgagcaaca agggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1320
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1380
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1440
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1482
<210> 28
<211> 1482
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 28
atggccgcca tggccatgag gatgatcatc agcatcatct tcatcagcac ctacatgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagga cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg accgagctgc agagcctgat gcagaacgtg 300
cccgcctgct tcaacagggc caagagcggc agcagcggca gcagcggcct gggcttcctg 360
ctgtgcatcg gcagcgccat cgccagcggc gtggccgtga gcaaggtgct gtgcctggag 420
ggcgaggtga acaagatcaa gaacgccctg ctgagcacca acaaggccgt ggtgagcctg 480
agcaacggcg tgagcctgct gaccagcaag gtgctggacc tgaagaacta catcgacaag 540
gagctgctgc ccaaggtgaa caaccacgac tgcaggatca gcaacatcga gaccgtgatc 600
gagttccagc agaagaacaa caggctgctg gagatcgcca gggagttcag cgtgaacgcc 660
ggcatcacca cccccctgag cacctacatg ctgaccaaca gcgagctgct gagcctgatc 720
tgcgacatgc ccatcaccaa cgaccagaag aagctgatga gcagcaacgt gcagatcgtg 780
aggcagcaga gctacagctt catgagctgc gtgaaggagg aggtgatcgc ctacgtggtg 840
cagctgccca tctacggcgt gatcgacacc ccctgctgga agctgcacac cagccccctg 900
tgcaccaccg acaacaagga gggcagcaac atctgcctga ccaggaccga caggggctgg 960
tactgcgaca acgccggcag cgtgagcttc ttcccccagg ccgagacctg caaggtgcag 1020
agcaacaggg tgttctgcga caccatgaac agcctgaccc tgcccaccga cgtgaacctg 1080
tgcaacaccg acatcttcaa caccaagtac gactgcaaga tcatgaccag caagaccgac 1140
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1200
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1260
gtgagcaaca agggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1320
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1380
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1440
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1482
<210> 29
<211> 1503
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 29
atggccgcca tggccatgag gatgatcatc agcatcatct tcatcagcac ctacatgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagga cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg accgagctgc agagcctgat gcagaacgtg 300
cccgcctgct tcaacagggc caagaggggc atccccgaga gcggcagcag cggcaggaag 360
aggaggttcc tgggcttcct gctgtgcatc ggcagcgcca tcgccagcgg cgtggccgtg 420
agcaaggtgt gccacctgga gggcgaggtg aacaagatca agaacgccct gctgagcacc 480
aacaaggccg tggtgagcct gagcaacggc gtgagcctgc tgaccagcaa ggtgctggac 540
ctgaagaact acatcgacaa ggagctgctg cccaaggtga acaaccacga ctgcaggatc 600
agcaacatcg agaccgtgat cgagttccag cagaagaaca acaggctgct ggagatcgcc 660
agggagttca gcgtgaacgc cggcatcacc acccccctga gcacctacat gctgaccaac 720
agcgagctgc tgagcctgat ctgcgacatg cccatcacca acgaccagaa gaagctgatg 780
agcagcaacg tgcagatcgt gaggcagcag agctacagct tcatgtgcgt ggtgaaggag 840
gaggtgatcg cctacgtggt gcagctgccc atctacggcg tgatcgacac cccctgctgg 900
aagctgcaca ccagccccct gtgcaccacc gacaacaagg agggcagcaa catctgcctg 960
accaggaccg acaggggctg gtactgcgac aacgccggca gcgtgagctt cttcccccag 1020
gccgagacct gcaaggtgca gagcaacagg gtgttctgcg acaccatgaa cagcctgacc 1080
ctgcccaccg acgtgaacct gtgcaacacc gacatcttca acaccaagta cgactgcaag 1140
atcatgacca gcaagaccga catcagctgc agcgtgatca ccagcatcgg cgccatcgtg 1200
agctgctacg gcaagaccaa gtgcaccgcc agcaacaaga acaggggcat catcaagacc 1260
ttcagcaacg gctgcgacta cgtgagcaac aagggcgtgg acaccgtgag cgtgggcaac 1320
accctgtact acgtgaacaa gctggagggc aaggccctgt acatcaaggg cgagcccatc 1380
atcaactact acgaccccct ggtgttcccc agcgacgagt tcgacgccag catcgcccag 1440
gtgaacgcca agatcaacca gagcctggcc ttcatcagga ggagcgacga gctgctgcac 1500
agc 1503
<210> 30
<211> 1482
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 30
atggccgcca tggccatgag gatgatcatc agcatcatct tcatcagcac ctacatgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagga cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg accgagctgc agagcctgat gcagaacgtg 300
cccgcctgct tcaacagggc caagagcggc agcagcggca gcagcggcct gggcttcctg 360
ctgtgcatcg gcagcgccat cgccagcggc gtggccgtga gcaaggtgtg ccacctggag 420
ggcgaggtga acaagatcaa gaacgccctg ctgagcacca acaaggccgt ggtgagcctg 480
agcaacggcg tgagcctgct gaccagcaag gtgctggacc tgaagaacta catcgacaag 540
gagctgctgc ccaaggtgaa caaccacgac tgcaggatca gcaacatcga gaccgtgatc 600
gagttccagc agaagaacaa caggctgctg gagatcgcca gggagttcag cgtgaacgcc 660
ggcatcacca cccccctgag cacctacatg ctgaccaaca gcgagctgct gagcctgatc 720
tgcgacatgc ccatcaccaa cgaccagaag aagctgatga gcagcaacgt gcagatcgtg 780
aggcagcaga gctacagctt catgtgcctg gtgaaggagg aggtgatcgc ctacgtggtg 840
cagctgccca tctacggcgt gatcgacacc ccctgctgga agctgcacac cagccccctg 900
tgcaccaccg acaacaagga gggcagcaac atctgcctga ccaggaccga caggggctgg 960
tactgcgaca acgccggcag cgtgagcttc ttcccccagg ccgagacctg caaggtgcag 1020
agcaacaggg tgttctgcga caccatgaac agcctgaccc tgcccaccga cgtgaacctg 1080
tgcaacaccg acatcttcaa caccaagtac gactgcaaga tcatgaccag caagaccgac 1140
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1200
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1260
gtgagcaaca agggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1320
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1380
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1440
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1482
<210> 31
<211> 1467
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 31
atgaggatga tcatcagcat catcctgatc agcacctacg tgccccacat caccctgtgc 60
cagaacatca ccgaggagtt ctaccagagc acctgcagcg ccgtgagcag gggctacctg 120
agcgccctga ggaccggctg gtacaccagc gtggtgacca tcgagctgag caagatccag 180
aagaacgtgt gcaacggcac cgacagcaag gtgaagctga tcaagcagga gctggagagg 240
tacaacaacg ccgtggtgga gctgcagagc ctgatgcaga acgagcccac ctgcagcagc 300
agggccaaga gcggcagcag cggcagcagc ggcctgggct tcctgctgtg catcggcagc 360
gccatcgcca gcggcgtggc cgtgagcaag gtgctgtgcc tggagggcga ggtgaacaag 420
atcaagaacg ccctgctgag caccaacaag gccgtggtga gcctgagcaa cggcgtgagc 480
ctgctgacca gcaaggtgct ggacctgaag aactacatcg acaaggagct gctgcccaag 540
gtgaacaacc acgactgcag gatcagcaac atcgccaccg tgatcgagtt ccagcagaag 600
aacaacaggc tgctggagat cgccagggag ttcagcgtga acgccggcat caccaccccc 660
ctgagcacct acatgctgac caacagcgag ctgctgagca tcatctgcga catgcccatc 720
accaacgacc agaagaagct gatgagcagc aacgtgcaga tcgtgaggca gcagagctac 780
agcttcatga gctgcgtgaa ggaggaggtg atcgcctacg tggtgcagct gcccctgtac 840
ggcgtgatcg acaccccctg ctggaagctg cacaccagcc ccctgtgcac caccgacaac 900
gaggagggca gcaacatctg cctgaccagg accgacaggg gctggtactg cgacaacgcc 960
ggcagcgtga gcttcttccc ccaggccgag acctgcaagg tgcagagcaa cagggtgttc 1020
tgcgacacca tgaacagcct gaccctgccc accgacgtga acctgtgcaa caccgacatc 1080
ttcaacgcca agtacgactg caagatcatg accagcaaga ccgacatcag ctgcagcgtg 1140
atcaccagca tcggcgccat cgtgagctgc tacggcaaga ccaagtgcac cgccagcaac 1200
aagaacaggg gcatcatcaa gaccttcagc aacggctgcg actacgtgag caacaagggc 1260
gtggacaccg tgagcgtggg caacaccctg tactacgtga acaagctgga gggcaaggcc 1320
ctgtacatca agggcgagcc catcatcaac tactacaacc ccctggtgtt ccccagcgac 1380
gagttcgacg ccagcatcgc ccaggtgaac gccaagatca accagagcct ggccttcatc 1440
aggaggagcg acgagctgct gcacagc 1467
<210> 32
<211> 1488
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 32
atgaggatga tcatcagcat catcctgatc agcacctacg tgccccacat caccctgtgc 60
cagaacatca ccgaggagtt ctaccagagc acctgcagcg ccgtgagcag gggctacctg 120
agcgccctga ggaccggctg gtacaccagc gtggtgacca tcgagctgag caagatccag 180
aagaacgtgt gcaacggcac cgacagcaag gtgaagctga tcaagcagga gctggagagg 240
tacaacaacg ccgtggtgga gctgcagagc ctgatgcaga acgagcccac ctgcagcagc 300
agggccaaga ggggcatccc cgagagcggc agcagcggca ggaagaggag gttcctgggc 360
ttcctgctgt gcatcggcag cgccatcgcc agcggcgtgg ccgtgagcaa ggtgtgccac 420
ctggagggcg aggtgaacaa gatcaagaac gccctgctga gcaccaacaa ggccgtggtg 480
agcctgagca acggcgtgag cctgctgacc agcaaggtgc tggacctgaa gaactacatc 540
gacaaggagc tgctgcccaa ggtgaacaac cacgactgca ggatcagcaa catcgccacc 600
gtgatcgagt tccagcagaa gaacaacagg ctgctggaga tcgccaggga gttcagcgtg 660
aacgccggca tcaccacccc cctgagcacc tacatgctga ccaacagcga gctgctgagc 720
atcatctgcg acatgcccat caccaacgac cagaagaagc tgatgagcag caacgtgcag 780
atcgtgaggc agcagagcta cagcttcatg tgcgtggtga aggaggaggt gatcgcctac 840
gtggtgcagc tgcccctgta cggcgtgatc gacaccccct gctggaagct gcacaccagc 900
cccctgtgca ccaccgacaa cgaggagggc agcaacatct gcctgaccag gaccgacagg 960
ggctggtact gcgacaacgc cggcagcgtg agcttcttcc cccaggccga gacctgcaag 1020
gtgcagagca acagggtgtt ctgcgacacc atgaacagcc tgaccctgcc caccgacgtg 1080
aacctgtgca acaccgacat cttcaacgcc aagtacgact gcaagatcat gaccagcaag 1140
accgacatca gctgcagcgt gatcaccagc atcggcgcca tcgtgagctg ctacggcaag 1200
accaagtgca ccgccagcaa caagaacagg ggcatcatca agaccttcag caacggctgc 1260
gactacgtga gcaacaaggg cgtggacacc gtgagcgtgg gcaacaccct gtactacgtg 1320
aacaagctgg agggcaaggc cctgtacatc aagggcgagc ccatcatcaa ctactacaac 1380
cccctggtgt tccccagcga cgagttcgac gccagcatcg cccaggtgaa cgccaagatc 1440
aaccagagcc tggccttcat caggaggagc gacgagctgc tgcacagc 1488
<210> 33
<211> 1467
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 33
atgaggatga tcatcagcat catcctgatc agcacctacg tgccccacat caccctgtgc 60
cagaacatca ccgaggagtt ctaccagagc acctgcagcg ccgtgagcag gggctacctg 120
agcgccctga ggaccggctg gtacaccagc gtggtgacca tcgagctgag caagatccag 180
aagaacgtgt gcaacggcac cgacagcaag gtgaagctga tcaagcagga gctggagagg 240
tacaacaacg ccgtggtgga gctgcagagc ctgatgcaga acgagcccac ctgcagcagc 300
agggccaaga gcggcagcag cggcagcagc ggcctgggct tcctgctgtg catcggcagc 360
gccatcgcca gcggcgtggc cgtgagcaag gtgtgccacc tggagggcga ggtgaacaag 420
atcaagaacg ccctgctgag caccaacaag gccgtggtga gcctgagcaa cggcgtgagc 480
ctgctgacca gcaaggtgct ggacctgaag aactacatcg acaaggagct gctgcccaag 540
gtgaacaacc acgactgcag gatcagcaac atcgccaccg tgatcgagtt ccagcagaag 600
aacaacaggc tgctggagat cgccagggag ttcagcgtga acgccggcat caccaccccc 660
ctgagcacct acatgctgac caacagcgag ctgctgagca tcatctgcga catgcccatc 720
accaacgacc agaagaagct gatgagcagc aacgtgcaga tcgtgaggca gcagagctac 780
agcttcatgt gcgtggtgaa ggaggaggtg atcgcctacg tggtgcagct gcccctgtac 840
ggcgtgatcg acaccccctg ctggaagctg cacaccagcc ccctgtgcac caccgacaac 900
gaggagggca gcaacatctg cctgaccagg accgacaggg gctggtactg cgacaacgcc 960
ggcagcgtga gcttcttccc ccaggccgag acctgcaagg tgcagagcaa cagggtgttc 1020
tgcgacacca tgaacagcct gaccctgccc accgacgtga acctgtgcaa caccgacatc 1080
ttcaacgcca agtacgactg caagatcatg accagcaaga ccgacatcag ctgcagcgtg 1140
atcaccagca tcggcgccat cgtgagctgc tacggcaaga ccaagtgcac cgccagcaac 1200
aagaacaggg gcatcatcaa gaccttcagc aacggctgcg actacgtgag caacaagggc 1260
gtggacaccg tgagcgtggg caacaccctg tactacgtga acaagctgga gggcaaggcc 1320
ctgtacatca agggcgagcc catcatcaac tactacaacc ccctggtgtt ccccagcgac 1380
gagttcgacg ccagcatcgc ccaggtgaac gccaagatca accagagcct ggccttcatc 1440
aggaggagcg acgagctgct gcacagc 1467
<210> 34
<211> 1482
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 34
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaac agcaccgaca gcaacgtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgca gcagcagggc caagagcggc agcagcggca gcagcggcct gggcttcctg 360
ctgtgcatcg gcagcgccat cgccagcggc gtggccgtga gcaaggtgct gtgcctggag 420
ggcgaggtga acaagatcaa gaacgccctg ctgagcacca acaaggccgt ggtgagcctg 480
agcaacggcg tgagcctgct gaccagcaag gtgctggacc tgaagaacta catcgacaag 540
gagctgctgc ccaaggtgaa caaccacgac tgcaagatca gcaacatcgc caccgtgatc 600
gagttccagc agaagaacaa caggctgctg gagatcgcca gggagttcag cgtgaacgcc 660
ggcatcacca cccccctgag cacctacatg ctgaccaaca gcgagctgct gagcctgatc 720
tgcgacatgc ccatcaccaa cgaccagaag aagctgatga gcagcaacgt gcagatcgtg 780
aggcagcaga gctacagctt catgagctgc gtgaaggagg aggtgatggc ctacgtggtg 840
cagctgccca tctacggcgt gatcgacacc ccctgctgga agctgcacac cagccccctg 900
tgcaccaccg acaacaagga gggcagcaac atctgcctga ccaggaccga caggggctgg 960
tactgcgaca acgccggcag cgtgagcttc ttcccccagg ccgagacctg caaggtgcag 1020
agcaacaggg tgttctgcga caccatgaac agcctgaccc tgcccaccga cgtgaacctg 1080
tgcaacaccg acatcttcaa cgccaagtac gactgcaaga tcatgaccag caagaccgac 1140
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1200
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1260
gtgagcaaca ggggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1320
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1380
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1440
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1482
<210> 35
<211> 1503
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 35
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaac agcaccgaca gcaacgtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgca gcagcagggc caagaggggc atccccgaga gcggcagcag cggcaggaag 360
aggaggttcc tgggcttcct gctgtgcatc ggcagcgcca tcgccagcgg cgtggccgtg 420
agcaaggtgt gccacctgga gggcgaggtg aacaagatca agaacgccct gctgagcacc 480
aacaaggccg tggtgagcct gagcaacggc gtgagcctgc tgaccagcaa ggtgctggac 540
ctgaagaact acatcgacaa ggagctgctg cccaaggtga acaaccacga ctgcaagatc 600
agcaacatcg ccaccgtgat cgagttccag cagaagaaca acaggctgct ggagatcgcc 660
agggagttca gcgtgaacgc cggcatcacc acccccctga gcacctacat gctgaccaac 720
agcgagctgc tgagcctgat ctgcgacatg cccatcacca acgaccagaa gaagctgatg 780
agcagcaacg tgcagatcgt gaggcagcag agctacagct tcatgtgcgt ggtgaaggag 840
gaggtgatgg cctacgtggt gcagctgccc atctacggcg tgatcgacac cccctgctgg 900
aagctgcaca ccagccccct gtgcaccacc gacaacaagg agggcagcaa catctgcctg 960
accaggaccg acaggggctg gtactgcgac aacgccggca gcgtgagctt cttcccccag 1020
gccgagacct gcaaggtgca gagcaacagg gtgttctgcg acaccatgaa cagcctgacc 1080
ctgcccaccg acgtgaacct gtgcaacacc gacatcttca acgccaagta cgactgcaag 1140
atcatgacca gcaagaccga catcagctgc agcgtgatca ccagcatcgg cgccatcgtg 1200
agctgctacg gcaagaccaa gtgcaccgcc agcaacaaga acaggggcat catcaagacc 1260
ttcagcaacg gctgcgacta cgtgagcaac aggggcgtgg acaccgtgag cgtgggcaac 1320
accctgtact acgtgaacaa gctggagggc aaggccctgt acatcaaggg cgagcccatc 1380
atcaactact acgaccccct ggtgttcccc agcgacgagt tcgacgccag catcgcccag 1440
gtgaacgcca agatcaacca gagcctggcc ttcatcagga ggagcgacga gctgctgcac 1500
agc 1503
<210> 36
<211> 1482
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 36
atggccacca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacgtgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaac agcaccgaca gcaacgtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg gtggagctgc agagcctgat gcagaacgag 300
cccgcctgca gcagcagggc caagagcggc agcagcggca gcagcggcct gggcttcctg 360
ctgtgcatcg gcagcgccat cgccagcggc gtggccgtga gcaaggtgtg ccacctggag 420
ggcgaggtga acaagatcaa gaacgccctg ctgagcacca acaaggccgt ggtgagcctg 480
agcaacggcg tgagcctgct gaccagcaag gtgctggacc tgaagaacta catcgacaag 540
gagctgctgc ccaaggtgaa caaccacgac tgcaagatca gcaacatcgc caccgtgatc 600
gagttccagc agaagaacaa caggctgctg gagatcgcca gggagttcag cgtgaacgcc 660
ggcatcacca cccccctgag cacctacatg ctgaccaaca gcgagctgct gagcctgatc 720
tgcgacatgc ccatcaccaa cgaccagaag aagctgatga gcagcaacgt gcagatcgtg 780
aggcagcaga gctacagctt catgtgcgtg gtgaaggagg aggtgatggc ctacgtggtg 840
cagctgccca tctacggcgt gatcgacacc ccctgctgga agctgcacac cagccccctg 900
tgcaccaccg acaacaagga gggcagcaac atctgcctga ccaggaccga caggggctgg 960
tactgcgaca acgccggcag cgtgagcttc ttcccccagg ccgagacctg caaggtgcag 1020
agcaacaggg tgttctgcga caccatgaac agcctgaccc tgcccaccga cgtgaacctg 1080
tgcaacaccg acatcttcaa cgccaagtac gactgcaaga tcatgaccag caagaccgac 1140
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1200
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1260
gtgagcaaca ggggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1320
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1380
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1440
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1482
<210> 37
<211> 1512
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 37
atggccgcca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacatgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg atcgagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagaggggc atccccgaga gcggcagcag cggcagcagc 360
ggcaggaaga ggaggttcct gggcttcctg ctgtgcatcg gcagcgccat cgccagcggc 420
gtggccgtga gcaaggtgct gtgcctggag ggcgaggtga acaagatcaa gaacgccctg 480
ctgagcacca acaaggccgt ggtgagcctg agcaacggcg tgagcctgct gaccagcaag 540
gtgctggacc tgaagaacta catcgacaag gagctgctgc ccaaggtgaa caaccacgac 600
tgcaggatca gcaacatcga gaccgtgatc gagttccagc agaagaacaa caggctgctg 660
gagatcgcca gggagttcag cgtgaacgcc ggcatcacca cccccctgag cacctacatg 720
ctgaccaaca gcgagctgct gagcctgatc tgcgacatgc ccatcaccaa cgaccagaag 780
aagctgatga gcagcaacgt gcagatcgtg aggcagcaga gctacagctt catgctgtgc 840
gtgaaggagg aggtgatcgc ctacgtggtg cagctgccca tctacggcgt gatcgacacc 900
ccctgctgga agctgcacac cagccccctg tgcaccaccg acaacaagga gggcagcaac 960
atctgcctga ccaggaccga caggggctgg tactgcgaca acgccggcag cgtgagcttc 1020
ttcccccagg ccgagacctg caaggtgcag agcaacaggg tgttctgcga caccatgaac 1080
agcctgaccc tgcccaccga cgtgaacctg tgcaacaccg acatcttcaa caccaagtac 1140
gactgcaaga tcatgaccag caagaccgac atcagctgca gcgtgatcac cagcatcggc 1200
gccatcgtga gctgctacgg caagaccaag tgcaccgcca gcaacaagaa caggggcatc 1260
atcaagacct tcagcaacgg ctgcgactac gtgagcaaca agggcgtgga caccgtgagc 1320
gtgggcaaca ccctgtacta cgtgaacaag ctggagggca aggccctgta catcaagggc 1380
gagcccatca tcaactacta cgaccccctg gtgttcccca gcgacgagtt cgacgccagc 1440
atcgcccagg tgaacgccaa gatcaaccag agcctggcct tcatcaggag gagcgacgag 1500
ctgctgcaca gc 1512
<210> 38
<211> 1533
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 38
atggccgcca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacatgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg atcgagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagaggggc atccccgaga ggggcatccc cgagagcggc 360
agcagcggca ggaagaggag gttcaggaag aggaggttcc tgggcttcct gctgtgcatc 420
ggcagcgcca tcgccagcgg cgtggccgtg agcaaggtgt gccacctgga gggcgaggtg 480
aacaagatca agaacgccct gctgagcacc aacaaggccg tggtgagcct gagcaacggc 540
gtgagcctgc tgaccagcaa ggtgctggac ctgaagaact acatcgacaa ggagctgctg 600
cccaaggtga acaaccacga ctgcaggatc agcaacatcg agaccgtgat cgagttccag 660
cagaagaaca acaggctgct ggagatcgcc agggagttca gcgtgaacgc cggcatcacc 720
acccccctga gcacctacat gctgaccaac agcgagctgc tgagcctgat ctgcgacatg 780
cccatcacca acgaccagaa gaagctgatg agcagcaacg tgcagatcgt gaggcagcag 840
agctacagct tcatgtgcgt ggtgaaggag gaggtgatcg cctacgtggt gcagctgccc 900
atctacggcg tgatcgacac cccctgctgg aagctgcaca ccagccccct gtgcaccacc 960
gacaacaagg agggcagcaa catctgcctg accaggaccg acaggggctg gtactgcgac 1020
aacgccggca gcgtgagctt cttcccccag gccgagacct gcaaggtgca gagcaacagg 1080
gtgttctgcg acaccatgaa cagcctgacc ctgcccaccg acgtgaacct gtgcaacacc 1140
gacatcttca acaccaagta cgactgcaag atcatgacca gcaagaccga catcagctgc 1200
agcgtgatca ccagcatcgg cgccatcgtg agctgctacg gcaagaccaa gtgcaccgcc 1260
agcaacaaga acaggggcat catcaagacc ttcagcaacg gctgcgacta cgtgagcaac 1320
aagggcgtgg acaccgtgag cgtgggcaac accctgtact acgtgaacaa gctggagggc 1380
aaggccctgt acatcaaggg cgagcccatc atcaactact acgaccccct ggtgttcccc 1440
agcgacgagt tcgacgccag catcgcccag gtgaacgcca agatcaacca gagcctggcc 1500
ttcatcagga ggagcgacga gctgctgcac agc 1533
<210> 39
<211> 1512
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 39
atggccgcca ccgccatgag gatgatcatc agcatcatct tcatcagcac ctacatgacc 60
cacatcaccc tgtgccagaa catcaccgag gagttctacc agagcacctg cagcgccgtg 120
agcaggggct acctgagcgc cctgaggacc ggctggtaca ccagcgtggt gaccatcgag 180
ctgagcaaga tccagaagaa cgtgtgcaag agcaccgaca gcaaggtgaa gctgatcaag 240
caggagctgg agaggtacaa caacgccgtg atcgagctgc agagcctgat gcagaacgag 300
cccgcctgct tcagcagggc caagaggggc atccccgaga gcggcagcag cggcagcagc 360
ggcaggaaga ggaggttcct gggcttcctg ctgtgcatcg gcagcgccat cgccagcggc 420
gtggccgtga gcaaggtgtg ccacctggag ggcgaggtga acaagatcaa gaacgccctg 480
ctgagcacca acaaggccgt ggtgagcctg agcaacggcg tgagcctgct gaccagcaag 540
gtgctggacc tgaagaacta catcgacaag gagctgctgc ccaaggtgaa caaccacgac 600
tgcaggatca gcaacatcga gaccgtgatc gagttccagc agaagaacaa caggctgctg 660
gagatcgcca gggagttcag cgtgaacgcc ggcatcacca cccccctgag cacctacatg 720
ctgaccaaca gcgagctgct gagcctgatc tgcgacatgc ccatcaccaa cgaccagaag 780
aagctgatga gcagcaacgt gcagatcgtg aggcagcaga gctacagctt catgtgcgtg 840
gtgaaggagg aggtgatcgc ctacgtggtg cagctgccca tctacggcgt gatcgacacc 900
ccctgctgga agctgcacac cagccccctg tgcaccaccg acaacaagga gggcagcaac 960
atctgcctga ccaggaccga caggggctgg tactgcgaca acgccggcag cgtgagcttc 1020
ttcccccagg ccgagacctg caaggtgcag agcaacaggg tgttctgcga caccatgaac 1080
agcctgaccc tgcccaccga cgtgaacctg tgcaacaccg acatcttcaa caccaagtac 1140
gactgcaaga tcatgaccag caagaccgac atcagctgca gcgtgatcac cagcatcggc 1200
gccatcgtga gctgctacgg caagaccaag tgcaccgcca gcaacaagaa caggggcatc 1260
atcaagacct tcagcaacgg ctgcgactac gtgagcaaca agggcgtgga caccgtgagc 1320
gtgggcaaca ccctgtacta cgtgaacaag ctggagggca aggccctgta catcaagggc 1380
gagcccatca tcaactacta cgaccccctg gtgttcccca gcgacgagtt cgacgccagc 1440
atcgcccagg tgaacgccaa gatcaaccag agcctggcct tcatcaggag gagcgacgag 1500
ctgctgcaca gc 1512
<210> 40
<211> 1551
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 40
atggccctga gcaaggtgaa gctgaacgac accttcaaca aggaccagct gctgagcacc 60
agcaagtaca ccatccagag gagcaccggc gacaacatcg acatccccaa ctacgacgtg 120
cagaagcacc tgaacaagct gtgcggcatg ctgctgatca ccgaggacgc caaccacaag 180
ttcaccggcc tgatcggcat gctgtacgcc atgagcaggc tgggcaggga ggacaccctg 240
aagatcctga aggacgccgg ctaccaggtg agggccaacg gcgtggacgt gatcacccac 300
aggcagtgcg tgaacggcaa gagcggcagc agcggcagca gcggccaggg caacatcgag 360
tgcgagagca ggaagagcta caagaagatg ctgaaggaga tgggcgaggt ggcctgcgag 420
tacaggcacg acttccccga ctgcggcatg atcgtgctgt gcgtggccgc cctggtgatc 480
accaagctgc tggccggcga caggagcggc ctgaccgccg tgatcaggag ggccaacaac 540
gtgctgagga acgagatgaa gaggtacaag ggcctgatcc ccaaggacat cgccaacagc 600
ttctacgagg tgttcgagaa gtacccccac tacatcgacg tgttcgtgca cttcggcatc 660
gcccagagca gcaccagggg cggcagcagg gtggagggca tcttcgcctg cctgttcatg 720
aacgcctacg gcgccggcca ggtgatgctg aggtggggcg tgctggccaa gagcgtgaag 780
aacttcatgc tgtgccacgc cagcgtgcag gccgagatgg agcaggtggt ggaggtgtac 840
gagtacgccc agaagctggg cggcgaggcc ggcttctacc acatcctgaa caaccccaag 900
gccagcctgc tgagcctgac ccagttcccc aacttcagca gcgtggtgct gggcaacgcc 960
gccggcctgg gcatcatggg cgagtacagg ggcaccccca ggaaccagga cctgtacgac 1020
gccgccaagg cctacgccga gcagctgaag gagaacggcg tgatcaacta cagcgtgctg 1080
gacctgacca ccgaggagct ggaggccatc aagaaccagc tgaaccccaa ggacaacgac 1140
gtggagctgt gcaacaccga catcttcaac accaagtacg actgcaagat catgaccagc 1200
aagaccgaca tcagctgcag cgtgatcacc agcatcggcg ccatcgtgag ctgctacggc 1260
aagaccaagt gcaccgccag caacaagaac aggggcatca tcaagacctt cagcaacggc 1320
tgcgactacg tgagcaacaa gggcgtggac accgtgagcg tgggcaacac cctgtactac 1380
gtgaacaagc tggagggcaa ggccctgtac atcaagggcg agcccatcat caactactac 1440
gaccccctgg tgttccccag cgacgagttc gacgccagca tcgcccaggt gaacgccaag 1500
atcaaccaga gcctggcctt catcaggagg agcgacgagc tgctgcacag c 1551
<210> 41
<211> 1542
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 41
atggccctga gcaaggtgaa gctgaacgac accttcaaca aggaccagct gctgagcacc 60
agcaagtaca ccatccagag gagcaccggc gacaacatcg acatccccaa ctacgacgtg 120
cagaagcacc tgaacaagct gtgcggcatg ctgctgatca ccgaggacgc caaccacaag 180
ttcaccggcc tgatcggcat gctgtacgcc atgagcaggc tgggcaggga ggacaccctg 240
aagatcctga aggacgccgg ctaccaggtg agggccaacg gcgtggacgt gatcacccac 300
aggcagtgcg tgaacggcaa gagcggcagc agcggccagg gcaacatcga gtgcgagagc 360
aggaagagct acaagaagat gctgaaggag atgggcgagg tggcctgcga gtacaggcac 420
gacttccccg actgcggcat gatcgtgctg tgcgtggccg ccctggtgat caccaagctg 480
ctggccggcg acaggagcgg cctgaccgcc gtgatcagga gggccaacaa cgtgctgagg 540
aacgagatga agaggtacaa gggcctgatc cccaaggaca tcgccaacag cttctacgag 600
gtgttcgaga agtaccccca ctacatcgac gtgttcgtgc acttcggcat cgcccagagc 660
agcaccaggg gcggcagcag ggtggagggc atcttcgcct gcctgttcat gaacgcctac 720
ggcgccggcc aggtgatgct gaggtggggc gtgctggcca agagcgtgaa gaacttcatg 780
ctgtgccacg ccagcgtgca ggccgagatg gagcaggtgg tggaggtgta cgagtacgcc 840
cagaagctgg gcggcgaggc cggcttctac cacatcctga acaaccccaa ggccagcctg 900
ctgagcctga cccagttccc caacttcagc agcgtggtgc tgggcaacgc cgccggcctg 960
ggcatcatgg gcgagtacag gggcaccccc aggaaccagg acctgtacga cgccgccaag 1020
gcctacgccg agcagctgaa ggagaacggc gtgatcaact acagcgtgct ggacctgacc 1080
accgaggagc tggaggccat caagaaccag ctgaacccca aggacaacga cgtggagctg 1140
tgcaacaccg acatcttcaa caccaagtac gactgcaaga tcatgaccag caagaccgac 1200
atcagctgca gcgtgatcac cagcatcggc gccatcgtga gctgctacgg caagaccaag 1260
tgcaccgcca gcaacaagaa caggggcatc atcaagacct tcagcaacgg ctgcgactac 1320
gtgagcaaca agggcgtgga caccgtgagc gtgggcaaca ccctgtacta cgtgaacaag 1380
ctggagggca aggccctgta catcaagggc gagcccatca tcaactacta cgaccccctg 1440
gtgttcccca gcgacgagtt cgacgccagc atcgcccagg tgaacgccaa gatcaaccag 1500
agcctggcct tcatcaggag gagcgacgag ctgctgcaca gc 1542
<210> 42
<211> 1551
<212> DNA
<213>artificial sequence (Artificial Sequence)
<400> 42
atggccctga gcaaggtgaa gctgaacgac accttcaaca aggaccagct gctgagcacc 60
agcaagtaca ccatccagag gagcaccggc gacaacatcg acatccccaa ctacgacgtg 120
cagaagcacc tgaacaagct gtgcggcatg ctgctgatca ccgaggacgc caaccacaag 180
ttcaccggcc tgatcggcat gctgtacgcc atgagcaggc tgggcaggga ggacaccctg 240
aagatcctga aggacgccgg ctaccaggtg agggccaacg gcgtggacgt gatcacccac 300
aggcagtgcg tgaacggcaa gagcggcagc agcggcagca gcggccaggg caacatcgag 360
tgcgagagca ggaagagcta caagaagatg ctgaaggaga tgggcgaggt ggcctgcgag 420
tacaggcacg acttccccga ctgcggcatg atcgtgctgt gcgtggccgc cctggtgatc 480
accaagctgc tggccggcga caggagcggc ctgaccgccg tgatcaggag ggccaacaac 540
gtgctgagga acgagatgaa gaggtacaag ggcctgatcc ccaaggacat cgccaacagc 600
ttctacgagg tgttcgagaa gtacccccac tacatcgacg tgttcgtgca cttcggcatc 660
gcccagagca gcaccagggg cggcagcagg gtggagggca tcttcgcctg cctgttcatg 720
aacgcctacg gcgccggcca ggtgatgctg aggtggggcg tgctggccaa gagcgtgaag 780
aacttcatgc tgtgccacgc cagcgtgcag gccgagatgg agcaggtggt ggaggtgtac 840
gagtacgccc agaagctggg cggcgaggcc ggcttctacc acatcctgaa caaccccaag 900
gccagcctgc tgagcctgac ccagttcccc aacttcagca gcgtggtgct gggcaacgcc 960
gccggcctgg gcatcatggg cgagtacagg ggcaccccca ggaaccagga cctgtacgac 1020
gccgccaagg cctacgccga gcagctgaag gagaacggcg tgatcaacta cagcgtgctg 1080
gacctgacca ccgaggagct ggaggccatc aagaaccagc tgaaccccaa ggacaacgac 1140
gtggagctgt gcaacaccga catcttcaac accaagtacg actgcaagat catgaccagc 1200
aagaccgaca tcagctgcag cgtgatcacc agcatcggcg ccatcgtgag ctgctacggc 1260
aagaccaagt gcaccgccag caacaagaac aggggcatca tcaagacctt cagcaacggc 1320
tgcgactacg tgagcaacaa gggcgtggac accgtgagcg tgggcaacac cctgtactac 1380
gtgaacaagc tggagggcaa ggccctgtac atcaagggcg agcccatcat caactactac 1440
gaccccctgg tgttccccag cgacgagttc gacgccagca tcgcccaggt gaacgccaag 1500
atcaaccaga gcctggcctt catcaggagg agcgacgagc tgctgcacag c 1551
Claims (10)
1. a kind of fusion precursor protein of bovine respiratory syncytial virus F protein includes selected from the group below to wild type F at least one
The transformation of albumen:
A, increase the connection quantity of disulfide bond between each monomer inside of F protein tripolymer and monomer;
B, it is biggish to be become side chain by least one amino acid inside mutation F protein tripolymer for the lesser amino acid mutation of side chain
Hydrophobic binding inside amino acid or increase;
C, the restriction enzyme site of at least one protease is rejected in mutation;
D, at least one larger amino acid of dynamic of F protein tripolymer is cut off, instead shorter link peptide;
E, extend C- terminal Alpha (α) spiral structure of F- protein.
2. fusion precursor protein according to claim 1, the transformation includes the 143rd glycine, the 404th silk ammonia
Acid, the 103rd serine, the 262nd mutant serine are cysteine, and the 288th isoleucine mutation is phenylalanine, the
187 valine mutations are leucine.
3. fusion precursor protein according to claim 2, also comprising one of following transformations:
1) the 159th hyte propylhomoserin, the 291st valine mutation are cysteine, wipe out from 109 to 137 amino acid sequences, are added
Link peptide serine-glutamic acid-Ser-Ser-glutamic acid-Ser-Ser-glutamic acid;
2) the 158th leucine, the 290th mutant serine are cysteine, wipe out from 114 to 132 amino acid sequences, are added
Link peptide serine-glutamic acid-Ser-Ser-glutamic acid;
3) the 158th leucine, the 290th mutant serine are cysteine, wipe out from 114 to 132 amino acid sequences, are added
Link peptide serine-glutamic acid-Ser-Ser-glutamic acid-Ser-Ser-glutamic acid.
4. fusion precursor protein according to claim 3, selected from one of following sequences:
(1) its amino acid sequence is as shown in SEQ ID NO:1,2 or 3;
(2) its amino acid sequence is as shown in SEQ ID NO:4,5 or 6;
(3) its amino acid sequence is as shown in SEQ ID NO:7,8 or 9;
(4) its amino acid sequence is as shown in SEQ ID NO:10,11 or 12;
(5) its amino acid sequence is as shown in SEQ ID NO:13,14 or 15;
(6) its amino acid sequence is as shown in SEQ ID NO:16,17 or 18;
(7) its amino acid sequence is as shown in SEQ ID NO:19,20 or 21.
5. encoding the DNA molecular for merging precursor protein described in claim 1-4 any one.
6. DNA molecular according to claim 5, nucleotide sequence is in sequence shown in SEQ ID NO:22-42
It is a kind of.
7. the fusion precursor protein or claim of bovine respiratory syncytial virus F protein described in claim 1-4 any one
Application of the DNA molecular described in 5-6 any one in the product of preparation prevention bovine respiratory syncytial virus.
8. a kind of protein vaccine, the fusion precursor of the bovine respiratory syncytial virus F protein as described in claim 1-4 any one
At least one of albumen is made.
9. a kind of DNA vaccination includes at least one of DNA molecular described in claim 5-6 any one and plasmid vector.
10. DNA vaccination according to claim 4, the plasmid vector is pVAC1-mcs plasmid.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910171850.0A CN109851678A (en) | 2019-03-07 | 2019-03-07 | A kind of inferior stable state bovine respiratory syncytial virus of improvement merges DNA molecular and its application of precursor F protein matter and coding |
PCT/CN2019/081693 WO2020177179A1 (en) | 2019-03-07 | 2019-04-08 | Modified metastable bovine respiratory syncytial virus fusion precursor f protein and coding dna molecule and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910171850.0A CN109851678A (en) | 2019-03-07 | 2019-03-07 | A kind of inferior stable state bovine respiratory syncytial virus of improvement merges DNA molecular and its application of precursor F protein matter and coding |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109851678A true CN109851678A (en) | 2019-06-07 |
Family
ID=66900241
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910171850.0A Pending CN109851678A (en) | 2019-03-07 | 2019-03-07 | A kind of inferior stable state bovine respiratory syncytial virus of improvement merges DNA molecular and its application of precursor F protein matter and coding |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN109851678A (en) |
WO (1) | WO2020177179A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117304278A (en) * | 2023-11-28 | 2023-12-29 | 江苏瑞科生物技术股份有限公司 | Recombinant RSV F protein and application thereof |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040005545A1 (en) * | 2002-02-21 | 2004-01-08 | Fouchier Ronaldus Adrianus Maria | Recombinant parainfluenza virus expression systems and vaccines comprising heterologous antigens derived from metapneumovirus |
CN106518988A (en) * | 2016-11-28 | 2017-03-22 | 烟台偌帝生物工程有限公司 | Bovine respiratory syncytial cell virus antigen protein |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3508505A1 (en) * | 2007-12-24 | 2019-07-10 | ID Biomedical Corporation of Quebec | Recombinant rsv antigens |
DK3275892T3 (en) * | 2011-05-13 | 2020-04-06 | Glaxosmithkline Biologicals Sa | PRÆFUSIONS-RSV F ANTIGENS |
CN105188745B (en) * | 2013-04-25 | 2019-10-18 | 扬森疫苗与预防公司 | RSV F polypeptide before stabilized soluble fusion |
-
2019
- 2019-03-07 CN CN201910171850.0A patent/CN109851678A/en active Pending
- 2019-04-08 WO PCT/CN2019/081693 patent/WO2020177179A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040005545A1 (en) * | 2002-02-21 | 2004-01-08 | Fouchier Ronaldus Adrianus Maria | Recombinant parainfluenza virus expression systems and vaccines comprising heterologous antigens derived from metapneumovirus |
CN106518988A (en) * | 2016-11-28 | 2017-03-22 | 烟台偌帝生物工程有限公司 | Bovine respiratory syncytial cell virus antigen protein |
Non-Patent Citations (3)
Title |
---|
AMANN,V.L. ET AL.: ""N [Bovine orthopneumovirus],Accession:AAB22601.1"", 《GENBANK》 * |
BUCHHOLZ,U.J. ET AL.: ""F0 [Bovine orthopneumovirus],Accession:NP_048055.1"", 《GENBANK》 * |
YUNUS,A.S. ET AL.: ""fusion protein [Bovine respiratory syncytial virus ATCC51908],Accession:AAL49410.1"", 《GENBANK》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117304278A (en) * | 2023-11-28 | 2023-12-29 | 江苏瑞科生物技术股份有限公司 | Recombinant RSV F protein and application thereof |
CN117304278B (en) * | 2023-11-28 | 2024-04-16 | 江苏瑞科生物技术股份有限公司 | Recombinant RSV F protein and application thereof |
Also Published As
Publication number | Publication date |
---|---|
WO2020177179A1 (en) | 2020-09-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11034731B2 (en) | Stabilized soluble pre-fusion RSV F polypeptides | |
JP7362819B2 (en) | Stabilized soluble prefusion RSV F protein | |
TWI663175B (en) | Stabilized soluble pre-fusion rsv f polypeptides | |
KR101801213B1 (en) | Modified rsv f proteins and methods of their use | |
AU2021232702B2 (en) | Stabilized pre-fusion RSV F proteins | |
KR20170102905A (en) | New multivalent nanoparticle-based vaccine | |
US8372963B2 (en) | RSV F-protein and its use | |
WO2017207477A1 (en) | Stabilized pre-fusion rsv f proteins | |
WO2023138333A1 (en) | Recombinant sars-cov-2 protein vaccine, and preparation method therefor and use thereof | |
CN109851678A (en) | A kind of inferior stable state bovine respiratory syncytial virus of improvement merges DNA molecular and its application of precursor F protein matter and coding | |
CN111166881B (en) | Recombinant respiratory syncytial virus multi-epitope chimeric vaccine and preparation method and application thereof | |
CN111138550B (en) | Recombinant respiratory syncytial virus chimeric vaccine and preparation method and application thereof | |
WO2023236822A1 (en) | Development and use of h5n6 avian influenza broad-spectrum vaccine | |
CN112592410B (en) | Canine adenovirus gene engineering subunit vaccine, preparation method and application thereof | |
WO2023207717A1 (en) | Development and use of broad-spectrum vaccine for h5n8 avian influenza | |
CN116162156A (en) | Preparation and application of influenza and new crown bigeminal polyclonal antibody |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 215125 unit 401, C19 building, biomedical industrial park, 218 Xinghu street, Suzhou Industrial Park, Suzhou area, China (Jiangsu) pilot Free Trade Zone, Suzhou City, Jiangsu Province Applicant after: Yikang Biology (Suzhou) Co.,Ltd. Address before: Room a2-327, 218 Xinghu street, Suzhou Industrial Park, Jiangsu 215000 Applicant before: SUZHOU YUZHIBO BIOTECHNOLOGY Co.,Ltd. |
|
CB02 | Change of applicant information | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190607 |
|
RJ01 | Rejection of invention patent application after publication |