CN117716036A - Temperature controllable self-replicating RNA vaccine for viral diseases - Google Patents
Temperature controllable self-replicating RNA vaccine for viral diseases Download PDFInfo
- Publication number
- CN117716036A CN117716036A CN202280043353.2A CN202280043353A CN117716036A CN 117716036 A CN117716036 A CN 117716036A CN 202280043353 A CN202280043353 A CN 202280043353A CN 117716036 A CN117716036 A CN 117716036A
- Authority
- CN
- China
- Prior art keywords
- virus
- composition
- ser
- gly
- ala
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000003612 virological effect Effects 0.000 title claims abstract description 75
- 201000010099 disease Diseases 0.000 title claims description 13
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 title claims description 13
- 108700021021 mRNA Vaccine Proteins 0.000 title description 6
- 229940022005 RNA vaccine Drugs 0.000 title description 4
- 241000700605 Viruses Species 0.000 claims abstract description 219
- 108020004999 messenger RNA Proteins 0.000 claims abstract description 96
- 108090001074 Nucleocapsid Proteins Proteins 0.000 claims abstract description 88
- 108010076504 Protein Sorting Signals Proteins 0.000 claims abstract description 65
- 241000282414 Homo sapiens Species 0.000 claims abstract description 29
- 108700002099 Coronavirus Nucleocapsid Proteins Proteins 0.000 claims abstract description 14
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 12
- 230000003053 immunization Effects 0.000 claims abstract description 12
- 238000002649 immunization Methods 0.000 claims abstract description 12
- 102000036639 antigens Human genes 0.000 claims description 257
- 108091007433 antigens Proteins 0.000 claims description 257
- 239000000427 antigen Substances 0.000 claims description 233
- 239000000203 mixture Substances 0.000 claims description 217
- 229960005486 vaccine Drugs 0.000 claims description 143
- 241001678559 COVID-19 virus Species 0.000 claims description 121
- 108020001507 fusion proteins Proteins 0.000 claims description 101
- 102000037865 fusion proteins Human genes 0.000 claims description 101
- 238000000034 method Methods 0.000 claims description 86
- 230000028993 immune response Effects 0.000 claims description 81
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 77
- 241001115402 Ebolavirus Species 0.000 claims description 76
- 241000127282 Middle East respiratory syndrome-related coronavirus Species 0.000 claims description 73
- 108010074328 Interferon-gamma Proteins 0.000 claims description 61
- 102000008070 Interferon-gamma Human genes 0.000 claims description 60
- 229960003130 interferon gamma Drugs 0.000 claims description 60
- 239000002773 nucleotide Substances 0.000 claims description 57
- 125000003729 nucleotide group Chemical group 0.000 claims description 57
- 102000004388 Interleukin-4 Human genes 0.000 claims description 55
- 108090000978 Interleukin-4 Proteins 0.000 claims description 55
- 229940028885 interleukin-4 Drugs 0.000 claims description 55
- 206010022000 influenza Diseases 0.000 claims description 47
- 108700026244 Open Reading Frames Proteins 0.000 claims description 43
- 230000024932 T cell mediated immunity Effects 0.000 claims description 41
- 241000008904 Betacoronavirus Species 0.000 claims description 40
- 241000711573 Coronaviridae Species 0.000 claims description 36
- 229920001661 Chitosan Polymers 0.000 claims description 35
- 150000001413 amino acids Chemical class 0.000 claims description 35
- 239000012634 fragment Substances 0.000 claims description 35
- 241000315672 SARS coronavirus Species 0.000 claims description 32
- 108010052285 Membrane Proteins Proteins 0.000 claims description 30
- 102000018697 Membrane Proteins Human genes 0.000 claims description 30
- 230000014509 gene expression Effects 0.000 claims description 28
- 230000004936 stimulating effect Effects 0.000 claims description 27
- 208000025370 Middle East respiratory syndrome Diseases 0.000 claims description 23
- 241000712431 Influenza A virus Species 0.000 claims description 20
- 239000003795 chemical substances by application Substances 0.000 claims description 20
- 230000004044 response Effects 0.000 claims description 20
- 241000710929 Alphavirus Species 0.000 claims description 18
- 201000011001 Ebola Hemorrhagic Fever Diseases 0.000 claims description 18
- 241000713196 Influenza B virus Species 0.000 claims description 18
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 17
- 101710154606 Hemagglutinin Proteins 0.000 claims description 16
- 101710093908 Outer capsid protein VP4 Proteins 0.000 claims description 16
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 claims description 16
- 101710176177 Protein A56 Proteins 0.000 claims description 16
- 239000000185 hemagglutinin Substances 0.000 claims description 16
- 108091026890 Coding region Proteins 0.000 claims description 14
- 241000710959 Venezuelan equine encephalitis virus Species 0.000 claims description 14
- 102000004127 Cytokines Human genes 0.000 claims description 13
- 108090000695 Cytokines Proteins 0.000 claims description 13
- 230000028996 humoral immune response Effects 0.000 claims description 13
- 208000001528 Coronaviridae Infections Diseases 0.000 claims description 12
- 210000000612 antigen-presenting cell Anatomy 0.000 claims description 12
- 108091006197 SARS-CoV-2 Nucleocapsid Protein Proteins 0.000 claims description 11
- 108020003175 receptors Proteins 0.000 claims description 11
- 102000005962 receptors Human genes 0.000 claims description 11
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims description 9
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims description 9
- 210000002443 helper t lymphocyte Anatomy 0.000 claims description 9
- 239000002105 nanoparticle Substances 0.000 claims description 9
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 claims description 8
- 230000036039 immunity Effects 0.000 claims description 8
- 208000015181 infectious disease Diseases 0.000 claims description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 7
- 230000028327 secretion Effects 0.000 claims description 7
- 101001033280 Homo sapiens Cytokine receptor common subunit beta Proteins 0.000 claims description 6
- 240000007839 Kleinhovia hospita Species 0.000 claims description 6
- 101710172711 Structural protein Proteins 0.000 claims description 6
- 102000055647 human CSF2RB Human genes 0.000 claims description 6
- 150000002632 lipids Chemical class 0.000 claims description 6
- 101710144128 Non-structural protein 2 Proteins 0.000 claims description 5
- 101710199667 Nuclear export protein Proteins 0.000 claims description 5
- 101800000980 Protease nsP2 Proteins 0.000 claims description 5
- 241000712464 Orthomyxoviridae Species 0.000 claims description 4
- 241000710961 Semliki Forest virus Species 0.000 claims description 4
- 241000710960 Sindbis virus Species 0.000 claims description 4
- 108010087302 Viral Structural Proteins Proteins 0.000 claims description 4
- 230000004927 fusion Effects 0.000 claims description 4
- 230000036541 health Effects 0.000 claims description 4
- 230000008520 organization Effects 0.000 claims description 4
- 241000894007 species Species 0.000 claims description 4
- 108010061994 Coronavirus Spike Glycoprotein Proteins 0.000 claims description 3
- 239000002502 liposome Substances 0.000 claims description 3
- 241000712461 unidentified influenza virus Species 0.000 abstract description 14
- 102000011931 Nucleoproteins Human genes 0.000 description 167
- 108010061100 Nucleoproteins Proteins 0.000 description 167
- 108090000765 processed proteins & peptides Proteins 0.000 description 119
- 241000699670 Mus sp. Species 0.000 description 79
- 210000004027 cell Anatomy 0.000 description 71
- 108090000623 proteins and genes Proteins 0.000 description 68
- 230000007969 cellular immunity Effects 0.000 description 63
- 102000004169 proteins and genes Human genes 0.000 description 58
- 102000004196 processed proteins & peptides Human genes 0.000 description 51
- 239000000902 placebo Substances 0.000 description 45
- 229940068196 placebo Drugs 0.000 description 45
- 238000002347 injection Methods 0.000 description 40
- 239000007924 injection Substances 0.000 description 40
- 238000002255 vaccination Methods 0.000 description 40
- 210000004989 spleen cell Anatomy 0.000 description 36
- 210000001744 T-lymphocyte Anatomy 0.000 description 33
- 238000011510 Elispot assay Methods 0.000 description 27
- 238000003114 enzyme-linked immunosorbent spot assay Methods 0.000 description 26
- 230000001413 cellular effect Effects 0.000 description 25
- 230000003248 secreting effect Effects 0.000 description 25
- 238000011725 BALB/c mouse Methods 0.000 description 24
- 239000013598 vector Substances 0.000 description 22
- 101001024637 Severe acute respiratory syndrome coronavirus 2 Nucleoprotein Proteins 0.000 description 21
- 239000000872 buffer Substances 0.000 description 21
- 238000000338 in vitro Methods 0.000 description 20
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 19
- 108010078144 glutaminyl-glycine Proteins 0.000 description 19
- 108010061238 threonyl-glycine Proteins 0.000 description 18
- 238000013518 transcription Methods 0.000 description 18
- 230000035897 transcription Effects 0.000 description 18
- 210000004988 splenocyte Anatomy 0.000 description 17
- 230000006698 induction Effects 0.000 description 16
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 16
- 108010054155 lysyllysine Proteins 0.000 description 16
- 230000003472 neutralizing effect Effects 0.000 description 16
- 101100273730 Homo sapiens CD5 gene Proteins 0.000 description 15
- 206010028980 Neoplasm Diseases 0.000 description 15
- RQFQJYYMBWVMQG-IXDPLRRUSA-N chitotriose Chemical compound O[C@@H]1[C@@H](N)[C@H](O)O[C@H](CO)[C@H]1O[C@H]1[C@H](N)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)[C@@H](CO)O1 RQFQJYYMBWVMQG-IXDPLRRUSA-N 0.000 description 15
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 15
- 230000004727 humoral immunity Effects 0.000 description 15
- 239000000463 material Substances 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 14
- 108091005634 SARS-CoV-2 receptor-binding domains Proteins 0.000 description 14
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 14
- 101710198474 Spike protein Proteins 0.000 description 14
- 108010087924 alanylproline Proteins 0.000 description 14
- 108010089804 glycyl-threonine Proteins 0.000 description 14
- 108010050848 glycylleucine Proteins 0.000 description 14
- 239000000243 solution Substances 0.000 description 14
- 239000002671 adjuvant Substances 0.000 description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 13
- 230000001939 inductive effect Effects 0.000 description 13
- 229940096437 Protein S Drugs 0.000 description 12
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 12
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 12
- 239000003153 chemical reaction reagent Substances 0.000 description 12
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 12
- 238000001727 in vivo Methods 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 238000011740 C57BL/6 mouse Methods 0.000 description 11
- 101150049278 US20 gene Proteins 0.000 description 11
- 108010038633 aspartylglutamate Proteins 0.000 description 11
- 238000003556 assay Methods 0.000 description 11
- 108010015792 glycyllysine Proteins 0.000 description 11
- 208000037797 influenza A Diseases 0.000 description 11
- 208000037798 influenza B Diseases 0.000 description 11
- 241000880493 Leptailurus serval Species 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 210000002966 serum Anatomy 0.000 description 10
- 210000003491 skin Anatomy 0.000 description 10
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 9
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 9
- 108010010147 glycylglutamine Proteins 0.000 description 9
- 230000005847 immunogenicity Effects 0.000 description 9
- 229940027941 immunoglobulin g Drugs 0.000 description 9
- 108010005942 methionylglycine Proteins 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 238000012360 testing method Methods 0.000 description 9
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 8
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 8
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 8
- 108060001084 Luciferase Proteins 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- 102000007999 Nuclear Proteins Human genes 0.000 description 8
- 108010089610 Nuclear Proteins Proteins 0.000 description 8
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 8
- 108010068380 arginylarginine Proteins 0.000 description 8
- 108010060035 arginylproline Proteins 0.000 description 8
- 210000003719 b-lymphocyte Anatomy 0.000 description 8
- 238000012258 culturing Methods 0.000 description 8
- 108010087823 glycyltyrosine Proteins 0.000 description 8
- 235000014655 lactic acid Nutrition 0.000 description 8
- 239000004310 lactic acid Substances 0.000 description 8
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 7
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 7
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 7
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 7
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 7
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 7
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 7
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 7
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 7
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 7
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 7
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 7
- 102100021947 Survival motor neuron protein Human genes 0.000 description 7
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 7
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 7
- 102000044159 Ubiquitin Human genes 0.000 description 7
- 108090000848 Ubiquitin Proteins 0.000 description 7
- 241001115400 Zaire ebolavirus Species 0.000 description 7
- 125000000539 amino acid group Chemical group 0.000 description 7
- 108010008355 arginyl-glutamine Proteins 0.000 description 7
- 108010077245 asparaginyl-proline Proteins 0.000 description 7
- 108010093581 aspartyl-proline Proteins 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 238000011161 development Methods 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 108010012058 leucyltyrosine Proteins 0.000 description 7
- 244000052769 pathogen Species 0.000 description 7
- 108010004914 prolylarginine Proteins 0.000 description 7
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 7
- 108010003137 tyrosyltyrosine Proteins 0.000 description 7
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 6
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 6
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 6
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 6
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 6
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 6
- 206010006187 Breast cancer Diseases 0.000 description 6
- 208000026310 Breast neoplasm Diseases 0.000 description 6
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 6
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 6
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 6
- 108090000288 Glycoproteins Proteins 0.000 description 6
- 102000003886 Glycoproteins Human genes 0.000 description 6
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 6
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 6
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 6
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 6
- 239000005089 Luciferase Substances 0.000 description 6
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 6
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 6
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 6
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 6
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 230000002708 enhancing effect Effects 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 108010085203 methionylmethionine Proteins 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 108010020532 tyrosyl-proline Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- QEZYFJFOYSPYQE-OWNCGHDPSA-N (2s,3s)-3-methyl-2-[[(2s)-1-[2-[[(2s)-1-[(2s)-3-phenyl-2-[[(2s)-pyrrolidine-2-carbonyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carbonyl]amino]pentanoic acid Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1N(C(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H]2NCCC2)CCC1 QEZYFJFOYSPYQE-OWNCGHDPSA-N 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 5
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 5
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 5
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 5
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 5
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 5
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 5
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 5
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 5
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 5
- 102100031673 Corneodesmosin Human genes 0.000 description 5
- 101710139375 Corneodesmosin Proteins 0.000 description 5
- 238000002965 ELISA Methods 0.000 description 5
- 229940124722 Ebola vaccine Drugs 0.000 description 5
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 5
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 5
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 5
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 5
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 5
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 5
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 5
- 241000282412 Homo Species 0.000 description 5
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 5
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 5
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 5
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 5
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 5
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 5
- 241000699666 Mus <mouse, genus> Species 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 5
- NUZHSNLQJDYSRW-BZSNNMDCSA-N Pro-Arg-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NUZHSNLQJDYSRW-BZSNNMDCSA-N 0.000 description 5
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 5
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 5
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 5
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 5
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 239000013604 expression vector Substances 0.000 description 5
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 5
- 108010036413 histidylglycine Proteins 0.000 description 5
- 229960003971 influenza vaccine Drugs 0.000 description 5
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 4
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 4
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 4
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 4
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 4
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 4
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 4
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 4
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 4
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 4
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 4
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 4
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 4
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 4
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 4
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 4
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 4
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 4
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 4
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 4
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 4
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 4
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 4
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 4
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 4
- 210000001239 CD8-positive, alpha-beta cytotoxic T lymphocyte Anatomy 0.000 description 4
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 4
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 4
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 4
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 4
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 4
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 4
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 4
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 4
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 4
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 4
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 4
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 4
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 4
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 4
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 4
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 4
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 4
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 4
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 4
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 4
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 4
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 4
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 4
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 4
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 4
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 4
- 102000043129 MHC class I family Human genes 0.000 description 4
- 108091054437 MHC class I family Proteins 0.000 description 4
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 4
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- 101710141454 Nucleoprotein Proteins 0.000 description 4
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 4
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 4
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 4
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 4
- RTQKBZIRDWZLDF-BZSNNMDCSA-N Pro-Pro-Trp Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)O)CCN1C(=O)[C@@H]1CCCN1 RTQKBZIRDWZLDF-BZSNNMDCSA-N 0.000 description 4
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 4
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 4
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 4
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 4
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 4
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 4
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 4
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 4
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 4
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 4
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 4
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 4
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 4
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 4
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 4
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 4
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 4
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 4
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 4
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 4
- MXFPBNFKVBHIRW-BZSNNMDCSA-N Tyr-Lys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O MXFPBNFKVBHIRW-BZSNNMDCSA-N 0.000 description 4
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 4
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 4
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 4
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 4
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 231100000673 dose–response relationship Toxicity 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 238000003384 imaging method Methods 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 108010053037 kyotorphin Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 238000006386 neutralization reaction Methods 0.000 description 4
- 238000009401 outcrossing Methods 0.000 description 4
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 230000001681 protective effect Effects 0.000 description 4
- 239000003161 ribonuclease inhibitor Substances 0.000 description 4
- YYGNTYWPHWGJRM-AAJYLUCBSA-N squalene Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C=C(/C)CC\C=C(/C)CCC=C(C)C YYGNTYWPHWGJRM-AAJYLUCBSA-N 0.000 description 4
- 229940031626 subunit vaccine Drugs 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010044292 tryptophyltyrosine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 3
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 3
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 3
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 3
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 3
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 3
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 3
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 3
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 3
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 3
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 3
- PSOPJDUQUVFSLS-GUBZILKMSA-N Arg-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PSOPJDUQUVFSLS-GUBZILKMSA-N 0.000 description 3
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 3
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 3
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 3
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 3
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 3
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 3
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 3
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 3
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 3
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 3
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 3
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 3
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 3
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 3
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 3
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 3
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 3
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 3
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 238000012286 ELISA Assay Methods 0.000 description 3
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 3
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 3
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 3
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 3
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 3
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 3
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 3
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 3
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 3
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 3
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 3
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 3
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 3
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 3
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 3
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 3
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 3
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 3
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 3
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 3
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 3
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 3
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 3
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 3
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 3
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 3
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 3
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 3
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 3
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 3
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 3
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 3
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 3
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 3
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 3
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- 102220617446 Leucine-rich repeat-containing protein 34_H10N_mutation Human genes 0.000 description 3
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 3
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 3
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 3
- WOEDRPCHKPSFDT-MXAVVETBSA-N Lys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N WOEDRPCHKPSFDT-MXAVVETBSA-N 0.000 description 3
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 3
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 3
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 3
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 3
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 3
- 102000043131 MHC class II family Human genes 0.000 description 3
- 108091054438 MHC class II family Proteins 0.000 description 3
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 3
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 3
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 3
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 3
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 3
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 3
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 3
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 3
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 3
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 3
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 3
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 3
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 3
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 3
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 3
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 3
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 3
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 3
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 3
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 3
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 3
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 3
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- 230000005867 T cell response Effects 0.000 description 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 3
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 3
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 3
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 3
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 3
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 3
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 3
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 3
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 3
- NENACTSCXYHPOX-ULQDDVLXSA-N Tyr-His-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O NENACTSCXYHPOX-ULQDDVLXSA-N 0.000 description 3
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 3
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 3
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 3
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 3
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000006196 deacetylation Effects 0.000 description 3
- 238000003381 deacetylation reaction Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000004744 fabric Substances 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 230000007774 longterm Effects 0.000 description 3
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 150000007523 nucleic acids Chemical group 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 230000003389 potentiating effect Effects 0.000 description 3
- 230000037452 priming Effects 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 230000009257 reactivity Effects 0.000 description 3
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 3
- 239000012096 transfection reagent Substances 0.000 description 3
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 2
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 2
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 2
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 2
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 2
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 2
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 2
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 2
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- ZRNWJUAQKFUUKV-SRVKXCTJSA-N Arg-Met-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZRNWJUAQKFUUKV-SRVKXCTJSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 2
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 2
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 2
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 2
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 2
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 2
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 2
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 2
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 2
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 2
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 2
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- XSXVLWBWIPKUSN-UHFFFAOYSA-N Asp-Leu-Glu-Asp Chemical compound OC(=O)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(O)=O)C(O)=O XSXVLWBWIPKUSN-UHFFFAOYSA-N 0.000 description 2
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 2
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 2
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 2
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 2
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 2
- 229940022962 COVID-19 vaccine Drugs 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 2
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 2
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 2
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 2
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 2
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 2
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 2
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 2
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 2
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 2
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 2
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 2
- PDXIOFXRBVDSHD-JBACZVJFSA-N Gln-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)N)N PDXIOFXRBVDSHD-JBACZVJFSA-N 0.000 description 2
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 2
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 2
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 2
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 2
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 2
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 206010069767 H1N1 influenza Diseases 0.000 description 2
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 2
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 2
- BILZDIPAKWZFSG-PYJNHQTQSA-N His-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BILZDIPAKWZFSG-PYJNHQTQSA-N 0.000 description 2
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 2
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 2
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 2
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 2
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 2
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 2
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 2
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 2
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 2
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- 229940124873 Influenza virus vaccine Drugs 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 2
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 2
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 2
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 2
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 2
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 2
- BQVJARUIXRXDKN-DCAQKATOSA-N Met-Asn-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BQVJARUIXRXDKN-DCAQKATOSA-N 0.000 description 2
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 2
- SDTSLIMYROCDNS-FXQIFTODSA-N Met-Cys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O SDTSLIMYROCDNS-FXQIFTODSA-N 0.000 description 2
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 2
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 2
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 2
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 102000005348 Neuraminidase Human genes 0.000 description 2
- 108010006232 Neuraminidase Proteins 0.000 description 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 2
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 2
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 2
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 2
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 2
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 2
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 2
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 2
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 2
- 101150062156 RBD2 gene Proteins 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 241000831652 Salinivibrio sharmensis Species 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 2
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 108091027544 Subgenomic mRNA Proteins 0.000 description 2
- 241001115376 Sudan ebolavirus Species 0.000 description 2
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 2
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 2
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 2
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 2
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 2
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 2
- UPOGHWJJZAZNSW-XIRDDKMYSA-N Trp-His-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O UPOGHWJJZAZNSW-XIRDDKMYSA-N 0.000 description 2
- JEYRCNVVYHTZMY-SZMVWBNQSA-N Trp-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JEYRCNVVYHTZMY-SZMVWBNQSA-N 0.000 description 2
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 2
- VNRTXOUAOUZCFW-WDSOQIARSA-N Trp-Val-His Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O VNRTXOUAOUZCFW-WDSOQIARSA-N 0.000 description 2
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 2
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 2
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 2
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 2
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 2
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 2
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 2
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 2
- AFWXOGHZEKARFH-ACRUOGEOSA-N Tyr-Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 AFWXOGHZEKARFH-ACRUOGEOSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 2
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 2
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- 241000711975 Vesicular stomatitis virus Species 0.000 description 2
- 208000036142 Viral infection Diseases 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 230000005875 antibody response Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 230000029918 bioluminescence Effects 0.000 description 2
- 238000005415 bioluminescence Methods 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 210000003068 cdc Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 230000009918 complex formation Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 230000036757 core body temperature Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- 239000000945 filler Substances 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 210000002865 immune cell Anatomy 0.000 description 2
- 230000002163 immunogen Effects 0.000 description 2
- 230000016784 immunoglobulin production Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010026228 mRNA guanylyltransferase Proteins 0.000 description 2
- 229940126582 mRNA vaccine Drugs 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000010172 mouse model Methods 0.000 description 2
- 229920001542 oligosaccharide Polymers 0.000 description 2
- 150000002482 oligosaccharides Chemical class 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 230000000737 periodic effect Effects 0.000 description 2
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000004043 responsiveness Effects 0.000 description 2
- 230000001932 seasonal effect Effects 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 229940031439 squalene Drugs 0.000 description 2
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 201000010740 swine influenza Diseases 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 229940125575 vaccine candidate Drugs 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- CEHZCZCQHUNAJF-AVGNSLFASA-N (2s)-1-[2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N1[C@H](C(O)=O)CCC1 CEHZCZCQHUNAJF-AVGNSLFASA-N 0.000 description 1
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- MLNSNVLOEIYJIU-ZUDIRPEPSA-N Ala-Leu-Thr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLNSNVLOEIYJIU-ZUDIRPEPSA-N 0.000 description 1
- FCXAUASCMJOFEY-NDKCEZKHSA-N Ala-Leu-Thr-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O FCXAUASCMJOFEY-NDKCEZKHSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- PYDIIVKGTBRIEL-SZMVWBNQSA-N Arg-Trp-Pro Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(O)=O PYDIIVKGTBRIEL-SZMVWBNQSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- JYHIVHINLJUIEG-BVSLBCMMSA-N Arg-Tyr-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYHIVHINLJUIEG-BVSLBCMMSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- UEONJSPBTSWKOI-CIUDSAMLSA-N Asn-Gln-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O UEONJSPBTSWKOI-CIUDSAMLSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 1
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 1
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 1
- BKZFBJYIVSBXCO-KKUMJFAQSA-N Asn-Phe-His Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O BKZFBJYIVSBXCO-KKUMJFAQSA-N 0.000 description 1
- UOUHBHOBGDCQPQ-IHPCNDPISA-N Asn-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)N)N UOUHBHOBGDCQPQ-IHPCNDPISA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 210000001266 CD8-positive T-lymphocyte Anatomy 0.000 description 1
- 208000025721 COVID-19 Diseases 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 108010088468 Ebola virus envelope glycoprotein Proteins 0.000 description 1
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 1
- XGKNQFOKIBKFTR-CIUDSAMLSA-N Gln-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCC(N)=O XGKNQFOKIBKFTR-CIUDSAMLSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 1
- CMBXOSFZCFGDLE-IHRRRGAJSA-N Gln-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O CMBXOSFZCFGDLE-IHRRRGAJSA-N 0.000 description 1
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- 101710114810 Glycoprotein Proteins 0.000 description 1
- 206010061192 Haemorrhagic fever Diseases 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- CTCFZNBRZBNKAX-YUMQZZPRSA-N His-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 CTCFZNBRZBNKAX-YUMQZZPRSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 1
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 1
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- KQJBFMJFUXAYPK-AVGNSLFASA-N His-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KQJBFMJFUXAYPK-AVGNSLFASA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- PFOUFRJYHWZJKW-NKIYYHGXSA-N His-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O PFOUFRJYHWZJKW-NKIYYHGXSA-N 0.000 description 1
- PBJOQLUVSGXRSW-YTQUADARSA-N His-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N)C(=O)O PBJOQLUVSGXRSW-YTQUADARSA-N 0.000 description 1
- 101000878605 Homo sapiens Low affinity immunoglobulin epsilon Fc receptor Proteins 0.000 description 1
- 241000598171 Human adenovirus sp. Species 0.000 description 1
- GRRNUXAQVGOGFE-UHFFFAOYSA-N Hygromycin-B Natural products OC1C(NC)CC(N)C(O)C1OC1C2OC3(C(C(O)C(O)C(C(N)CO)O3)O)OC2C(O)C(CO)O1 GRRNUXAQVGOGFE-UHFFFAOYSA-N 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- YTRFFJUOYBMLPN-UHFFFAOYSA-N Ile-Lys-Lys-Ser Chemical compound CCC(C)C(N)C(=O)NC(CCCCN)C(=O)NC(CCCCN)C(=O)NC(CO)C(O)=O YTRFFJUOYBMLPN-UHFFFAOYSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- WJBOZUVRPOIQNN-KJYZGMDISA-N Ile-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)C1=CN=CN1 WJBOZUVRPOIQNN-KJYZGMDISA-N 0.000 description 1
- CRYJOCSSSACEAA-VKOGCVSHSA-N Ile-Trp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CRYJOCSSSACEAA-VKOGCVSHSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- 241000481324 Influenza A virus (A/New Caledonia/20/1999(H1N1)) Species 0.000 description 1
- 241001111758 Influenza B virus (B/Florida/4/2006) Species 0.000 description 1
- 102100037850 Interferon gamma Human genes 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- JVTAAEKCZFNVCJ-UHFFFAOYSA-M Lactate Chemical compound CC(O)C([O-])=O JVTAAEKCZFNVCJ-UHFFFAOYSA-M 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- 102100038007 Low affinity immunoglobulin epsilon Fc receptor Human genes 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- PGIAYBDRSREMJY-UHFFFAOYSA-N Lys-Lys-Met-Phe Chemical compound NCCCCC(N)C(=O)NC(CCCCN)C(=O)NC(CCSC)C(=O)NC(C(O)=O)CC1=CC=CC=C1 PGIAYBDRSREMJY-UHFFFAOYSA-N 0.000 description 1
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- VIZLHGTVGKBBKO-AVGNSLFASA-N Met-Arg-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VIZLHGTVGKBBKO-AVGNSLFASA-N 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- RMHHNLKYPOOKQN-FXQIFTODSA-N Met-Cys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O RMHHNLKYPOOKQN-FXQIFTODSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- VAGCEUUEMMXFEX-GUBZILKMSA-N Met-Met-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O VAGCEUUEMMXFEX-GUBZILKMSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- KBTQZYASLSUFJR-KKUMJFAQSA-N Met-Phe-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KBTQZYASLSUFJR-KKUMJFAQSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 1
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- CKAVKDJBSNTJDB-SRVKXCTJSA-N Met-Val-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCSC CKAVKDJBSNTJDB-SRVKXCTJSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 101800000515 Non-structural protein 3 Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- IPVPGAADZXRZSH-RNXOBYDBSA-N Phe-Tyr-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IPVPGAADZXRZSH-RNXOBYDBSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 101800001758 RNA-directed RNA polymerase nsP4 Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 108091029810 SaRNA Proteins 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- 101710167605 Spike glycoprotein Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 108010092262 T-Cell Antigen Receptors Proteins 0.000 description 1
- 210000000447 Th1 cell Anatomy 0.000 description 1
- 210000004241 Th2 cell Anatomy 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- IBBBOLAPFHRDHW-BPUTZDHNSA-N Trp-Asn-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N IBBBOLAPFHRDHW-BPUTZDHNSA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 1
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- GAKBTSMAPGLQFA-JNPHEJMOSA-N Tyr-Thr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 GAKBTSMAPGLQFA-JNPHEJMOSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- WGHVMKFREWGCGR-SRVKXCTJSA-N Val-Arg-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WGHVMKFREWGCGR-SRVKXCTJSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- AZDRQVAHHNSJOQ-UHFFFAOYSA-N alumane Chemical class [AlH3] AZDRQVAHHNSJOQ-UHFFFAOYSA-N 0.000 description 1
- 230000008350 antigen-specific antibody response Effects 0.000 description 1
- 239000008135 aqueous vehicle Substances 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 229960000190 bacillus calmette–guérin vaccine Drugs 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000036760 body temperature Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 239000000571 coke Substances 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 229940028617 conventional vaccine Drugs 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 210000004443 dendritic cell Anatomy 0.000 description 1
- 210000003595 dermal dendritic cell Anatomy 0.000 description 1
- 210000004207 dermis Anatomy 0.000 description 1
- 108010060455 des-Tyr- beta-casomorphin Proteins 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000001647 drug administration Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 210000002615 epidermis Anatomy 0.000 description 1
- 150000002270 gangliosides Chemical class 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 102000054999 human core Human genes 0.000 description 1
- 108700026469 human core Proteins 0.000 description 1
- 230000008348 humoral response Effects 0.000 description 1
- GRRNUXAQVGOGFE-NZSRVPFOSA-N hygromycin B Chemical compound O[C@@H]1[C@@H](NC)C[C@@H](N)[C@H](O)[C@H]1O[C@H]1[C@H]2O[C@@]3([C@@H]([C@@H](O)[C@@H](O)[C@@H](C(N)CO)O3)O)O[C@H]2[C@@H](O)[C@@H](CO)O1 GRRNUXAQVGOGFE-NZSRVPFOSA-N 0.000 description 1
- 229940097277 hygromycin b Drugs 0.000 description 1
- 230000008088 immune pathway Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 229940033324 influenza A vaccine Drugs 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000015788 innate immune response Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 210000001821 langerhans cell Anatomy 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
- 230000005877 long-lasting humoral immunity Effects 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 210000001806 memory b lymphocyte Anatomy 0.000 description 1
- 210000003071 memory t lymphocyte Anatomy 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229960005030 other vaccine in atc Drugs 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000007479 persistent immune response Effects 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 108010008595 sarcoma-associated antigen S1 Proteins 0.000 description 1
- 229940078677 sarna Drugs 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000009450 sialylation Effects 0.000 description 1
- 210000004927 skin cell Anatomy 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- 230000029069 type 2 immune response Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Landscapes
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
The present disclosure relates to mRNA, self-replicating RNA, and temperature-sensitive self-replicating RNA encoding a coronavirus nucleocapsid protein or an influenza virus nucleocapsid protein operably combined with a mammalian signal peptide. The present disclosure relates to mRNA, self-replicating RNA, and temperature-sensitive self-replicating RNA encoding one or more other viral nucleocapsid proteins in operable combination with a mammalian signal peptide. The RNA construct is suitable for active immunization against a virus in a mammalian subject, such as a human subject.
Description
Cross Reference to Related Applications
The present application claims the benefits of U.S. provisional application No. 63/275,398, U.S. provisional application No. 63/240,278, and U.S. provisional application No. 63/211,974, filed on 3, 11, 2021, 9, 2, 2021, 6, 17, each of which are hereby incorporated by reference in their entirety.
Submission of ASCII text file sequence Listing
The contents of the following submitted ASCII text files are incorporated herein by reference in their entirety: a sequence listing in Computer Readable Form (CRF) (file name: 699442001440seqlist. Txt, date recorded: 2022, 6, 16 days, size: 126,113 bytes).
Technical Field
The present disclosure relates to mRNA, self-replicating RNA, and temperature-sensitive self-replicating RNA encoding a coronavirus nucleocapsid protein or an influenza virus nucleocapsid protein operably combined with a mammalian signal peptide. The present disclosure relates to mRNA, self-replicating RNA, and temperature-sensitive self-replicating RNA encoding one or more other viral nucleocapsid proteins in operable combination with a mammalian signal peptide. The RNA construct is suitable for active immunization against a virus in a mammalian subject, such as a human subject.
Background
The genus beta coronavirus encompasses Severe Acute Respiratory Syndrome (SARS) -CoV-2, which causes a pandemic in COVID-19, SARS-CoV-1, which causes a 2002-2004SARS outbreak, and Middle East Respiratory Syndrome (MERS) -CoV. The popularity of covd-19 makes vaccine design and production an urgent need for immunization of a large population worldwide.
The SARS-CoV-2 vaccine currently approved by the U.S. food and drug administration (U.S. food & Drug Administration) is designed to elicit neutralizing antibodies (nAb) against spike (S) protein or the Receptor Binding Domain (RBD) of S protein prior to infection. However, this approach presents a significant challenge because the S protein is not very conserved even between SARS-CoV-1 and SARS-CoV-2 strains. In particular, small amino acid changes that occur between variants often result in conformational changes in the S protein, which may significantly reduce the effectiveness of the nAb triggered by a particular S protein of the covd-19 vaccine.
Thus, continued vaccine development targeting only the β -coronavirus S protein is expected to follow the path of seasonal influenza vaccines. This means that the continued advent of varieties would likely require the periodic development and production of new vaccines. While the annual production of beta coronavirus vaccines may be technically feasible, global vaccination efforts involving annual administration of new vaccines are economically and logistically impractical. The problem with the annual administration of new vaccines places a particularly undue burden on low and medium income countries.
Thus, there is a need in the art for a beta coronavirus vaccine that safely induces a sustained immune response with broad responsiveness against SARS-CoV-2 variants. Preferably, the persistent immune response is broadly responsive to other beta coronaviruses causing human disease. There is also a need in the art for an influenza virus vaccine that is safe and effective in inducing a broad reactive immune response against influenza a virus and/or influenza b virus.
Disclosure of Invention
The present disclosure relates to the use of nucleoprotein (also referred to herein as nucleocapsid protein) from a beta coronavirus as a vaccine antigen to induce a cellular immune response with broad responsiveness to a beta coronavirus variant. In some embodiments, temperature-controlled self-replicating RNA (referred to herein as srRNAts and c-srrrna) vaccine platforms are utilized. The c-srRNA vaccine platform is advantageous for inducing an effective cellular immune response following intradermal administration. In some embodiments, the nucleoprotein from SARS-CoV-2 is expressed in the host cell to address the infection caused by both SARS-CoV-2 and SARS-CoV-1, as well as variants thereof. In some embodiments, a nucleoprotein from a coronavirus is fused to a signal peptide of a human CD5 antigen and expressed in a host cell to enhance a cellular immune response elicited against the coronavirus. In some embodiments, the nucleoprotein from a first coronavirus is fused to a nucleoprotein from a second coronavirus, the second coronavirus being different from the first coronavirus. In some embodiments, the fusion protein comprises a tandem array of two or three coronavirus nucleoproteins. In a subset of these embodiments, the fusion protein comprises SARS-CoV-2 nucleoprotein and MERS-CoV nucleoprotein. In some embodiments, the fusion protein further comprises a coronavirus spike protein or fragment thereof. In this way, a more broadly reactive coronavirus-specific immune response is stimulated.
The present disclosure also relates to the use of nucleoprotein (also referred to herein as nucleocapsid protein) from influenza virus as vaccine antigen to induce a cellular immune response with broad reactivity to influenza a virus and/or influenza b virus that changes rapidly over time due to antigen drift and antigen shift. In some embodiments, a temperature-controllable self-replicating RNA vaccine platform is utilized. The c-srRNA vaccine platform is advantageous for inducing an effective cellular immune response following intradermal administration. In some embodiments, a nucleoprotein from one subtype of influenza a (FluA) virus is expressed in a host cell to address infections caused by the same and different subtypes of FluA. In some embodiments, a nucleoprotein from one lineage of influenza b (FluB) virus is expressed in a host cell to address infections caused by the same and different lineages of FluB. In some embodiments, a nucleoprotein from influenza virus is fused to a signal peptide of human CD5 antigen and expressed in a host cell to enhance the cellular immune response elicited against influenza virus. In some embodiments, the nucleoprotein from the FluA virus is fused to the nucleoprotein from the FluB virus. In some embodiments, the fusion protein comprises a tandem array of two or three nuclear proteins from one or more strains of FluA and/or one or more lineages of FluB. In some embodiments, the fusion protein further comprises influenza hemagglutinin or a fragment thereof. In this way, a more broadly reactive influenza-specific immune response is stimulated.
The present disclosure also relates to the use of a nucleoprotein (also referred to herein as a nucleocapsid protein) from ebola virus as a vaccine antigen to induce a broadly reactive cellular immune response to two, three or four ebola virus species that infect humans. In some embodiments, a temperature-controllable self-replicating RNA vaccine platform is utilized. The c-srRNA vaccine platform is advantageous for inducing an effective cellular immune response following intradermal administration. In some embodiments, a nucleoprotein from ebola virus is fused to a signal peptide of human CD5 antigen and expressed in a host cell to enhance the cellular immune response elicited against the ebola virus. In some embodiments, a nucleoprotein from a first ebola virus species is fused to a nucleoprotein from a second ebola virus species, the nucleoprotein from the second ebola virus species is optionally fused to a nucleoprotein of a third ebola virus species, and the nucleoprotein of the third ebola virus species is optionally fused to a nucleoprotein of a fourth ebola virus species. In some embodiments, the fusion protein comprises a tandem array of two, three, or four nucleoproteins, or fragments thereof, from two or more ebola virus species. In some embodiments, the fusion protein further comprises ebola virus envelope glycoprotein or a fragment thereof. In this way, a more broadly reactive ebola virus-specific immune response was stimulated.
In other embodiments, the present disclosure provides compositions comprising an excipient and a temperature-controllable self-replicating RNA. In some embodiments, the composition comprises chitosan. In some embodiments, the chitosan is a low molecular weight (about 3-5 kDa) chitosan oligosaccharide, such as chitosan oligosaccharide lactic acid. In some embodiments, the composition does not comprise a liposome or lipid nanoparticle.
Drawings
FIG. 1 shows a schematic representation of the mechanism by which the immune response of cells (CD4+ and CD8+ T cells) is induced following intradermal injection of temperature controlled self-replicating RNA (herein referred to as srRNAts and c-srRNA) vaccines.
FIG. 2 shows a schematic representation of SARS-CoV-2 nucleocapsid (N) protein expressed by mRNA, self-replicating RNA or temperature-sensitive self-replicating RNA (srRNAts) delivered to mammalian host cells. In an exemplary embodiment, the coding region of the N protein is a gene of interest (GOI) inserted into srrrnats. The amino acid sequence of the G5004 antigen is shown as SEQ ID NO. 5. The G5004 antigen is SARS-CoV-2N protein lacking a signal peptide. The amino acid sequence of the G5005 antigen is shown as SEQ ID NO. 6. The G5005 antigen is a fusion protein comprising a signal peptide (CD 5-SP) sequence from the human CD5 antigen as shown in SEQ ID NO. 8 and the SARS-CoV-2N protein, wherein CD5-SP replaces the initiating methionine at position 1 of the N protein. The amino acid sequence of the G5006 antigen is shown as SEQ ID NO. 7. The G5006 antigen is a fusion protein comprising the signal peptide sequence of CD5-SP, the SARS-CoV-2N protein and the MERS-CoV N protein. The nucleotide sequence of the coding G5004 antigen is shown as SEQ ID NO. 1. The nucleotide sequence of the coding G5005 antigen is shown as SEQ ID NO. 2. The nucleotide sequence encoding the G5006 antigen is shown in SEQ ID NO. 3 and in codon optimized form in SEQ ID NO. 4.
Fig. 3 shows a schematic diagram of an exemplary method for stimulating an immune response against coronavirus in a human subject. Temperature sensitive agents (ts agents) such as srrrnats are functional at licensed temperatures but non-functional at unlicensed temperatures. The temperature at or below the body surface of a human subject (surface body temperature) is the permissible temperature, while the core body temperature of a human subject is the higher unlicensed temperature. Thus, ts agents administered intradermally to a human subject are functional at a permissible temperature below the body surface of the human subject.
FIGS. 4A and 4B show the frequency of cytokine secreting cells in spleen cell samples obtained from CD-1-outcrossing mice that have been immunized by a single intradermal injection with 100. Mu.L of a solution containing 5. Mu.g or 25. Mu.g of temperature controllable self-replicating RNA encoding the G5004 antigen (srRNA 1ts2[ PCT/US20/67506 ]) or placebo (PBO: buffer only). FIG. 4A shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFCs) in 1x 10 x 6 spleen cells after restimulation by culturing the spleen cells in the presence or absence of SARS-CoV-2 nucleoprotein peptide pool, and FIG. 4B shows the frequency of interleukin-4 (IL-4) SFCs in the spleen cells. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Splenocytes were isolated 14 days after intradermal injection.
FIGS. 5A and 5B show the frequency of cytokine secreting cells in spleen cell samples obtained from CD-1-outcrossing mice that have been immunized by a single intradermal injection with 100. Mu.L of a solution containing 5. Mu.g or 25. Mu.g of temperature controllable self-replicating RNA encoding the G5005 antigen (srRNA 1ts2[ PCT/US20/67506 ]) or placebo (PBO: buffer only). FIG. 5A shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFCs) in 1x 10 x 6 spleen cells after restimulation by culturing the spleen cells in the presence or absence of SARS-CoV-2 nucleoprotein peptide pool, and FIG. 5B shows the frequency of interleukin-4 (IL-4) SFCs in the spleen cells. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Splenocytes were isolated 14 days after intradermal injection.
FIGS. 6A and 6B show the frequency of cytokine secreting cells in spleen cell samples obtained from BALB/c mice that have been immunized by a single intradermal injection of 100. Mu.L solution containing 5. Mu.g or 25. Mu.g of temperature controllable self replicating RNA encoding G5005 antigen (srRNA 1ts2[ PCT/US20/67506 ]) or placebo (PBO: buffer only). FIG. 6A shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFCs) in 1x 10 x 6 spleen cells after restimulation by culturing the spleen cells in the presence or absence of SARS-CoV-2 nucleoprotein peptide pool, and FIG. 6B shows the frequency of interleukin-4 (IL-4) SFCs in the spleen cells. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Splenocytes were isolated 30 days after vaccination.
FIG. 7 shows the level of SARS-CoV-2 antigen reactive immunoglobulin G (IgG) in serum of BALB/c mice that have been immunized by a single intradermal injection with 100. Mu.L of a solution containing 5. Mu.g or 25. Mu.g of temperature controllable self-replicating RNA encoding G5005 antigen (srRNA 1ts2[ PCT/US20/67506 ]) or placebo (PBO: buffer only). IgG levels are represented by OD450 in ELISA. IgG levels before (day-1) and after (day 30) vaccination are shown. Mean and standard deviation (error bars) of five mice per group (n=5) are shown.
FIG. 8 shows the frequency of interferon-gamma (INF-gamma) secreting cells in spleen cell samples obtained from BALB/c mice that have been immunized by a single intradermal injection of 100. Mu.L solution containing 5. Mu.g or 25. Mu.g of temperature controlled self-replicating RNA encoding G5006 antigen (srRNA 1ts2[ PCT/US20/67506 ]) or placebo (PBO: buffer only). Specifically, FIG. 8 shows the frequency of INF-gamma Spot Forming Cells (SFC) in 1x 10≡6 spleen cells after restimulation by culturing the spleen cells in the presence or absence of SARS-CoV-2 nucleoprotein peptide pool. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Splenocytes were isolated 14 days after vaccination.
Fig. 9 shows a schematic diagram of an exemplary pandemic influenza vaccine. Briefly, fusion proteins comprising a nucleoprotein from influenza a virus (FluA) and a nucleoprotein from influenza b virus (FluB) are expressed by mRNA, self-replicating RNA, or temperature-sensitive self-replicating RNA (srRNAts) delivered to mammalian host cells. In an exemplary embodiment, the coding region of the fusion protein is a gene of interest (GOI) inserted into srRNAts. Specifically, G5010 is a fusion protein comprising: the signal peptide sequence (CD 5-SP) from the human CD5 antigen, fluA nucleoprotein (influenza A, subtype H5N8 [ A/b.sub.der dock/Korea/Gochang 1/2014], genBank number KJ413835.1, proteinID number AHL 21420.1) and FluB nucleoprotein (influenza B [ B/Florida/4/2006], genBank number CY033879.1, proteinID number ACF 54251.1) as shown in SEQ ID No. 8. In G5010, CD5-SP replaces the initiator methionine of the FluA nucleoprotein and the FluA nucleoprotein was fused to the methionine of the initiation codon of the FluB nucleoprotein.
FIG. 10 shows an alignment of the nucleoprotein (SEQ ID NO: 13) of influenza A (H5N 8 strain; proteinID AHL 21420.1) used as vaccine antigen in G5010 and the nucleoprotein (SEQ ID NO: 17) of influenza A (NP/Annarbor H2N2; proteinID P21433) used as a source of the peptide pool for the ELISPot assay.
FIGS. 11A and 11B show the frequency of cytokine secreting cells in spleen cell samples obtained from BALB/c mice that have been immunized by a single intradermal injection of 100. Mu.L solution containing 5. Mu.g or 25. Mu.g of temperature controllable self replicating RNA encoding G5010 antigen (srRNA 1ts2[ PCT/US20/67506 ]) or placebo (PBO: buffer only). FIG. 11A shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFC) in 1x 10≡6 spleen cells after restimulation by culturing the spleen cells in the presence or absence of influenza A (H2N 2) nucleoprotein peptide pool, and FIG. 11B shows the frequency of interleukin-4 (IL-4) SFC in the spleen cells. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Splenocytes were isolated 14 days after vaccination.
Fig. 12 shows a schematic diagram of an exemplary generalized ebola virus vaccine. Briefly, fusion proteins comprising the nucleoproteins of four different ebola strains are expressed from mRNA, self-replicating RNA, or temperature-sensitive self-replicating RNA (srrrnats) delivered to mammalian host cells. In an exemplary embodiment, the coding region of the fusion protein is a gene of interest (GOI) inserted into srRNAts. Specifically, exemplary pan-ebola antigens are fusion proteins comprising: the signal peptide sequence from human CD5 antigen (CD 5-SP) as shown in SEQ ID NO:8, the portion of the nucleoprotein of the Zaeoloma type Ebola virus as shown in SEQ ID NO:18 (residues 2-739; total 738aa;GenBank ID:AF272001), the portion of the nucleoprotein of the Sudan type Ebola virus as shown in SEQ ID NO:19 (residues 403-738; total 336aa;GenBank ID:AF173836), the portion of the nucleoprotein of the Bendibu Jiao Xingai Bola virus as shown in SEQ ID NO:20 (residues 403-739; total 337aa;GenBank ID:FJ217161), and the portion of the nucleoprotein of the Taaeoloma type Ebola virus as shown in SEQ ID NO:21 (residues 483-651; total 169aa;GenBank ID:FJ217162). The amino acid sequence of the ubiquitin Ebola antigen is shown as SEQ ID NO. 22, and the nucleic acid sequence of the coding ubiquitin Ebola antigen is shown as SEQ ID NO. 23.
Figure 13 shows amino acid sequence similarity as a percentage of identity between four ebola virus species. Amino acid sequences of zaire type ebola virus NP (GenBank ID: AF 272001), sudan type ebola virus NP (GenBank ID: AF 173836), bund Jiao Xingai ebola virus NP (GenBank ID: FJ 217161), tay forest type ebola virus NP (GenBank ID: FJ 217162) were compared with each other by using NCBI BlastP algorithm. Based on the sequence alignment, proteins are divided into well-conserved regions (a) and less-conserved regions (B). For region a, the amino acid sequence identity between zaire-type ebola virus NP and sudan-type ebola virus NP is 88%, while for region B, the amino acid sequence identity is 42%. For region a, the amino acid sequence identity between zaire ebola virus NP and bundi Jiao Xingai bola virus NP is 92%, while for region B, the amino acid sequence identity is 53%. For region a, the amino acid sequence identity between zaire ebola virus NP and tailin ebola virus NP is 92%, while for region B the amino acid sequence identity is 54%. For region B, the present dibuch (B) and tazicar (B) sequences have a relatively high level of sequence similarity. Proteins are divided into well conserved regions (80% and 86% similarity; no tag) and less conserved regions (40% identity; referred to herein as region C) based on sequence alignment of region B.
FIGS. 14A and 14B show the frequency of cytokine secreting cells in spleen cell samples obtained from BALB/c mice that have been immunized by a single intradermal injection with 25. Mu.g of a 100. Mu.L solution of temperature controllable self-replicating RNA (also called c-srRNA) 1ts2 as described in WO 2021/138447 A1, or placebo (PBO: buffer only) encoding pan-type ebola antigen (srRNA 1ts 2-pan-type ebola, also called G5011). Fig. 14A shows the results obtained by treating the cells with a virus selected from the group consisting of the cross-tahlamys forest type ebola virus nucleoprotein (Swiss-Prot ID: b8XCN 6) [ JPT peptide; pepnix takie forest type ebola virus (NP); JPT product code: peptide scan of PM-TEBOV-NP ] the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFC) in 1x10 x 6 spleen cells after restimulation was cultured in the case of pool of 182 peptides (15 mer with 11 amino acid overlap), and FIG. 14B shows the frequency of interleukin-4 (IL-4) SFC in the spleen cells. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Splenocytes were isolated 14 days after vaccination.
FIG. 15 depicts a schematic diagram showing an exemplary srRNA1ts2 construct encoding the Receptor Binding Domain (RBD) of the spike protein of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). G5003 is the same antigen as "srRNA1ts2-2019-nCoV-RBD1" presented in FIG. 21 of WO 2021/138447 A1; and G5003 encodes a fusion protein comprising the signal peptide of CD5 (residues 1-24) and the RBD of the spike protein of SARS-CoV-2 (original strain). G5003o encodes a fusion protein (SEQ ID NO: 25) comprising the signal peptide of CD5 (residues 1-24) and the RBD of the spike protein of SARS-CoV-2 (Ornidus strain B.1.1.529: science Brief: ornidus (B.1.1.529) variant |CDC). The nucleotide sequence of the G5003o open reading frame is shown as SEQ ID NO. 24.
FIGS. 16A and 16B show the frequency of cytokine secreting cells in spleen cell samples obtained from C57BL/6 mice that have been immunized by a single intradermal injection with 100. Mu.L of a solution containing placebo (PBO: buffer only) or 25. Mu.g of temperature controllable self-replicating RNA encoding G5003o antigen (srRNA 1ts2 as described in WO 2021/138447 A1). FIG. 16A shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFC) in 1x 10≡6 splenocytes from immunized mice re-stimulated by the following manner and FIG. 16B shows the frequency of interleukin-4 (IL-4) SFC in the splenocytes: in the presence or absence of the RBD [ JPT peptide product code from Rong Bianchong (b.1.1.529) by SARS-CoV-2 obreck: spleen cells were cultured in the presence of pools of 53 peptides (15 mer with 11 amino acid overlap) obtained by peptide scanning of PM-SARS2-RBDMUT 08-1. The assay was performed by ELISpot assay. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Splenocytes were isolated 14 days after vaccination.
Figures 17A-17C show induction of both cellular and humoral immunity in mice as a result of administration of a composition comprising C-srRNA encoding an antigen, followed by administration of a composition comprising a protein antigen. Fig. 17A depicts a schematic of the experimental procedure. On day-40, blood was drawn from female BALB/c mice for Plaque Reduction Neutralization Test (PRNT). On day-36, these mice were treated with c-srRNA encoding the G5003 antigen. The c-srRNA was injected intradermally as naked RNA into the skin of mice without any nanoparticles or transfection reagents. On day-22 (14 days after c-srRNA-G5003 vaccination), half of the mice were sacrificed to obtain spleen cells for ELISPot assay. On day 0, the remaining mice were injected intradermally with adjuvant (adavax, commercially available from invitrogen TM Adjuvant) spike protein of the mixed SARS-CoV-2 delta variant (b.1.617.2). On day 7 (7 days after spike protein injection),blood was drawn for PRNT assay. FIG. 17B shows induction of cellular immunity against RBD proteins by single intradermal vaccination with the c-srRNA-G5003 vaccine. The graph shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFC) in 1x 10≡6 splenocytes from immunized mice re-stimulated by: spleen cells were cultured in the presence or absence of a pool of 53 peptides (15 mers with 11 amino acid overlaps) covering SARS-CoV-2RBD (original strain). The assay was performed by ELISpot assay. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Spleen cells were isolated on day-22 (14 days post vaccination). FIG. 17C shows the titers of serum antibodies that can neutralize (50%) of SARS-CoV-2 virus (delta variant B.1.617.2) as measured by a plaque reduction neutralization assay (PRNT). Exposure to the spike protein of the SARS-CoV-2 virus (delta variant B.1.617.2) induced neutralizing antibodies specific for the delta variant of the SARS-CoV-2 virus only in mice vaccinated with vaccine c-srRNA-G5003 (RBD encoding SARS-CoV-2 (original strain)).
Fig. 18A-18C show induction of cellular immunity in mice as a result of administration of a composition comprising a protein antigen followed by administration of a composition comprising C-srrrna encoding the antigen. Fig. 18A depicts a schematic of the experimental procedure. On day 0 (treatment 1), female C57BL/6 mice were injected intradermally with 10. Mu.g RBD protein (Sino Biological SARS-CoV-2[2019-nCoV]) +adjuvant (AddaVax commercially available from Invivogen TM Adjuvant). On day 14 (treatment 2), mice were treated with intradermal placebo (PBO: buffer only), 25. Mu.g of c-srRNA encoding G5003 antigen, 25. Mu.g of c-srRNA encoding G5003o antigen, or 10. Mu.g of RBD protein (Sino Biological SARS-CoV-2[ 2019-nCoV)]) +adjuvant (AddaVax) TM Adjuvant). On day 28, mice were sacrificed and spleen cells and serum were collected. FIG. 18B shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFC) in 1X 10≡6 splenocytes re-stimulated by culture in the presence or absence of a pool of 53 peptides (15 mers with 11 amino acid overlaps) covering SARS-CoV-2RBD (original strain)And FIG. 18C shows the frequency of interleukin 4 (IL-4) SFC in the spleen cells. The assay was performed by ELISpot assay. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure.
FIG. 19 shows serum antibody levels (represented by OD450 measurements) of RBD against SARS-CoV-2 virus (original strain) as determined by ELISA assay. Mean and standard deviation (error bars) of five mice per group (n=5) are shown. Data on day-1 (before treatment 1) and day 28 (after treatment 2) of each group are shown.
FIGS. 20A-20D show the frequency of interferon-gamma (INF-gamma) secreting cells or interleukin 4 (IL-4) secreting cells in spleen cell samples obtained from BALB/c mice that have been immunized by a single intradermal injection of 100. Mu.L solution containing 5. Mu.g (n=1) or 25. Mu.g (n=4) of temperature controllable self-replicating RNA encoding G5006 antigen (srRNA 1ts2 as described in WO 2021/138447 A1) (FIG. 2) or placebo (PBO: buffer: n=5 only). After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. The mean and standard deviation (error bars) of one mouse (n=1) or four mice (n=4) per group are shown. Splenocytes were isolated 14 days after vaccination. FIGS. 20A and 20B show the results after restimulation by culturing spleen cells in the presence or absence of a SARS-CoV-2 nucleoprotein peptide pool. FIGS. 20C and 20D show the results after restimulation by culturing spleen cells in the presence or absence of MERS-CoV-2 nucleoprotein peptide pool.
FIG. 21 shows survival (%) of female BALB/c mice vaccinated with c-srRNA-G5006, followed by injection of tumor cells expressing the G5006 antigen.
FIG. 22 depicts a schematic diagram showing an exemplary srRNA1ts2 construct encoding a fusion protein (designated herein as G5006 d) that is a signal peptide of CD5 (residues 1-24), the Receptor Binding Domain (RBD) of the spike protein of Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), the nucleoprotein of SARS-CoV-2, the nucleoprotein of MERS-CoV, and the fusion protein of the RBD of MERS-CoV. The amino acid sequence of the pan-type coronavirus antigen (G5006 d) is shown as SEQ ID NO. 27, and the nucleotide sequence of the open reading frame is shown as SEQ ID NO. 26.
FIGS. 23A-23B show the frequency of cytokine secreting cells in spleen cell samples obtained from female C57BL/6 mice that have been immunized by a single intradermal injection with 100. Mu.L solution of placebo (PBO: buffer only), 25. Mu.g of temperature controllable self-replicating RNA encoding G5006 antigen (srRNA 1ts2 as described in WO 2021/138447 A1), or 25. Mu.g of temperature controllable self-replicating RNA encoding G5006d antigen (srRNA 1ts2 as described in WO 2021/138447 A1). FIG. 23A shows the frequency of interferon-gamma (INF-gamma) Spot Forming Cells (SFC) in 1x 10≡6 splenocytes from immunized mice re-stimulated by the following manner, and FIG. 23B shows the frequency of interleukin-4 (IL-4) SFC in the splenocytes: in the presence or absence of the RBD [ JPT peptide product code from spike protein passing through (A) SARS-CoV-2: PM-WCPV-S-RBD-2]; (B) nucleoprotein of SARS-CoV-2 [ JPT peptide product code: PM-WCPV-NCAP ]; (C) nuclear protein of MERS-CoV [ JPT peptide, custom ]; and (D) MERS-CoV spike protein [ JPT peptide product code: peptide scan of PM-MERS-CoV-S-1] spleen cells were cultured with a pool of peptides (15 mer with 11 amino acid overlap). The assay was performed by ELISpot assay. After subtracting the frequency obtained in the absence of peptide (background), the frequency obtained in the presence of peptide is plotted in the figure. The mean and standard deviation (error bars) of five mice per group for PBO (n=5), four mice for G5006 (n=4) and five mice for G5006d (n=5) are shown. Splenocytes were isolated 14 days after vaccination.
FIG. 24 depicts a schematic diagram showing an exemplary srRNA1ts2 construct encoding a fusion protein (G5012) that is a fusion protein of the signal peptide of CD5 (residues 1-24), a portion of the Hemagglutinin (HA) of influenza A (A/New Caledonia/20/1999 (H1N 1)) (residues 25-165), a nucleoprotein of influenza A (A/breeder stack/Korea/Gochang 1/2014 (H5N 8)) (residues 166-662), a nucleoprotein of influenza B (B/Florida/4/2006) (residues 663-1222), and a portion of the Hemagglutinin (HA) of influenza B (B/Florida/4/2006) (residues 1223-1365). The amino acid sequence of the pan-type influenza virus antigen (G5012) is shown as SEQ ID NO. 29, and the nucleotide sequence of the open reading frame is shown as SEQ ID NO. 28.
FIG. 25 shows the effect of chitosan oligomer on gene (luciferase) expression from srRNA1ts2 (exemplary c-srRNA) in mice. c-srRNA encoding luciferase was injected intradermally into mice under the following conditions: 1, control-c-srRNA only; 2, c-srRNA mixed with chitosan oligosaccharide (0.001. Mu.g/mL); 3, c-srRNA mixed with chitosan oligosaccharide (0.01. Mu.g/mL); 4, c-srRNA mixed with chitosan oligosaccharide (0.5. Mu.g/mL); and 5, c-srRNA mixed with chitosan oligosaccharide lactic acid (0.1. Mu.g/mL).
Detailed Description
The broader, more durable protection against SARS-CoV-1, SARS-CoV-2, MERS-CoV and variants thereof is best achieved by a vaccine that induces cellular immunity (i.e., a T cell induction vaccine that involves CD8+ killer T cells and CD4+ helper T cells). This is in contrast to the current covd-19 vaccine paradigm that focuses on neutralizing antibodies as discussed in the background section. The critical importance of cellular immunity in combating coronaviruses has been demonstrated through experimentation and is widely discussed [ Sette and Crotty 2021]. Cellular immunity alone can provide protection via cd8+ killer T cells [ Matchett et al, 2021]. Furthermore, cellular immunity is dependent on linear T cell epitopes, whereas humoral immunity is dependent on conformational (as well as linear) B cell epitopes. Thus, cellular immunity is much more robust to variation than humoral immunity. Furthermore, memory T cells last longer than memory B cells, and thus may provide life-long immunity. This requires both a suitable antigen and a cell immunity based vaccine platform.
mRNA vaccine platform based on cellular immunity
Vaccine platforms are described in the earlier patent application by Elixirgen [ PCT/US20/67506, now published as WO 2021/138447A1 ]. This vaccine platform was optimized to induce cellular immunity, which was made possible by combining prior knowledge of vaccine biology with temperature-controlled self-replicating mRNA (srRNAts) based on alphaviruses such as Venezuelan Equine Encephalitis Virus (VEEV). The terms c-srRNA and srRNA are used interchangeably throughout this disclosure, with srRNA1ts2 (described in WO 2021/138447 A1) being an exemplary embodiment. srrrnats are based on srrrna, also known as self-amplified mRNA (saRNA or SAM), providing temperature sensitivity by incorporating small amino acid changes in the alphavirus replicase. Elixirgen Therapeutic the srRNAts function at 30-35 ℃, but not at or above 37 ℃ + -0.5 ℃. It has all the advantages of the mRNA platform: no genome integration, rapid development and deployment, and simple superior manufacturing process (GMP) and other advantages of the srRNA platform compared to the mRNA platform, in particular longer expression [ Johanning et al, 1995] and higher immunogenicity at lower doses [ Brito et al, 2014]. However, this simple temperature-controllable feature makes it possible to combine many of the desirable features of the T cell-inducing vaccines described herein.
Briefly, srRNA1ts2 is a temperature-sensitive self-replicating VEEV-based RNA replicon developed for transient expression of heterologous proteins. Temperature sensitivity was conferred by inserting five amino acid residues into the nonstructural protein 2 (nsP 2) of VEEV. The nsP2 protein is a helicase/protease that, together with nsP1, nsP3 and nsP4, constitutes a VEEV replicase. srRNA1ts2 does not contain VEEV structural proteins (capsids, E1, E2 and E3). The disclosure of WO 2021/138447 A1 of Elixirgen Therapeutics, inc. In particular, example 3 of WO 2021/138447 A1, FIG. 12 and SEQ ID NO.29-49 are hereby incorporated by reference.
Overall, the convincing potential of the srrrnats platform for immunogenicity (dose throttling) and safety benefits (temperature control and naked delivery), the ability to provide long-lasting baseline cellular immunity, and to provide rapid humoral responses across variants, makes the platform a strong candidate for large-scale deployment that can meet the global needs for inexpensive, safe, vaccine to handle variants that provide long-term immunity.
General techniques and definitions
Practice of the present disclosure will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry and immunology, which are within the skill of the art.
As used herein and in the appended claims, the singular forms "a," "an," and "the" include plural referents unless the context clearly dictates otherwise. For example, "an" excipient includes one or more excipients.
The phrase "comprising" as used herein is open ended, meaning that embodiments may include additional elements. In contrast, the phrase "consisting of … …" is closed, indicating that such embodiments do not include additional elements (other than trace impurities). The phrase "consisting essentially of … …" is partially enclosed, meaning that the embodiments may further include elements that do not substantially alter the essential characteristics of the embodiments.
As used herein, the term "about" with respect to a value encompasses 90% to 110% of the value (e.g., when used with respect to chitosan oligosaccharides, a molecular weight of about 5,000 daltons refers to 4,500 daltons to 5,500 daltons).
The term "antigen" refers to a substance that is recognized and specifically bound by an antibody or T cell antigen receptor. Antigens may include peptides, polypeptides, proteins, glycoproteins, polysaccharides, complex carbohydrates, sugars, gangliosides, lipids, and phospholipids; portions thereof and combinations thereof. In the context of the present disclosure, the term "antigen" typically refers to a polypeptide or protein antigen of at least eight amino acid residues in length, which may comprise one or more post-translational modifications.
Unless otherwise specified, the terms "polypeptide" and "protein" are used interchangeably to refer to a polymer of amino acid residues, and are not limited to certain lengths. The polypeptide may comprise natural amino acid residues or a combination of natural and unnatural amino acid residues. The term also includes post-expression modifications of the polypeptide, such as glycosylation, sialylation, acetylation, phosphorylation, and the like. In some aspects, the polypeptide may contain modifications with respect to native or native sequences so long as the protein retains the desired activity (e.g., antigenicity).
As used herein, the terms "isolated" and "purified" refer to materials that are removed from at least one component with which they are naturally associated (e.g., from their original environment). The term "isolated" when used in reference to a recombinant protein refers to a protein that has been removed from the medium of the host cell that produced the protein. In some embodiments, the isolated protein (e.g., SARS-CoV-2 spike protein) is at least 75%, 90%, 95%, 96%, 97%, 98% or 99% pure as determined by HPLC.
An "effective amount" or "sufficient amount" of a substance is an amount sufficient to achieve a beneficial or desired result, including a clinical result, and thus, the "effective amount" depends on the context in which it is used. In the context of administering a composition of the present disclosure comprising an mRNA encoding an antigen, an effective amount contains sufficient mRNA to stimulate an immune response (preferably a cellular immune response to the antigen).
In the present disclosure, the terms "individual" and "subject" refer to mammals. "mammal" includes, but is not limited to, humans, non-human primates (e.g., monkeys), farm animals, sports animals, rodents (e.g., mice and rats), and pets (e.g., dogs and cats). In some preferred embodiments, the subject is a human subject.
The term "dose" as used herein with respect to a composition comprising an mRNA encoding an antigen refers to the measured portion of the subject taken (administered or received) at any one time. Administering a composition of the present disclosure to a subject in need thereof includes administering an effective amount of a composition comprising mRNA encoding an antigen to stimulate an immune response to the antigen in the subject.
"stimulation" of a response or parameter includes eliciting and/or enhancing the response or parameter when compared to a condition that is otherwise identical except for the parameter of interest, or alternatively, as compared to another condition (e.g., increased secretion of an antigen-specific cytokine upon administration of a composition comprising or encoding the antigen, as compared to administration of a control composition that does not comprise or encode the antigen). For example, "stimulation" of an immune response (e.g., a Th1 response) means an increase in the response. Depending on the measured parameters, the increase may be 2-fold to 200-fold or more, 5-fold to 500-fold or more, 10-fold to 1000-fold or more, or 2, 5, 10, 50 or 100-fold to 200, 500, 1,000, 5,000 or 10,000-fold.
Conversely, "suppressing" a response or parameter includes reducing and/or suppressing the response or parameter when compared to a condition that is otherwise identical except for the parameter of interest, or alternatively, as compared to another condition. For example, "inhibition" of an immune response (e.g., a Th2 response) means a decrease in the response. Depending on the measured parameters, the decrease may be 2-fold to 200-fold, 5-fold to 500-fold or more, 10-fold to 1000-fold or more, or 2, 5, 10, 50 or 100-fold to 200, 500, 1,000, 2,000, 5,000 or 10,000-fold.
The relative terms "higher" and "lower" refer to a measurable increase or decrease in response or parameter, respectively, when compared to a condition that is otherwise identical except for the parameter of interest, or alternatively, as compared to another condition. For example, "higher antibody titer" refers to an antigen-reactive antibody titer that is at least 2, 3, 4, 5, 6, 7, 8, 9, or 10-fold higher as a result of administering a composition of the present disclosure comprising an mRNA encoding an antigen than an antigen-reactive antibody titer as a result of a control condition (e.g., administering a comparative composition comprising no mRNA or no control mRNA encoding an antigen). Likewise, "lower antibody titer" refers to an antigen-reactive antibody titer that is at least 2, 3, 4, 5, 6, 7, 8, 9, or 10-fold lower as a result of the control conditions (e.g., administration of a comparative composition comprising no mRNA or comprising a control mRNA that does not encode an antigen) than an antigen-reactive antibody titer that is a result of the administration of a composition of the present disclosure comprising an mRNA that encodes an antigen.
As used herein, the term "immunization" refers to a process of increasing the response of a mammalian subject to an antigen, and thus improving its ability to resist or overcome an infection and/or to resist a disease.
As used herein, the term "vaccination" refers to the introduction of a vaccine into the body of a mammalian subject.
As used herein, "percent amino acid sequence identity (%)" and "percent identity" and "sequence identity" when used with respect to an amino acid sequence (reference polypeptide sequence) are defined as the percentage of amino acid residues in a candidate sequence (e.g., a subject antigen) that are identical to amino acid residues in the reference polypeptide sequence after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity and not considering any conservative substitutions as part of the sequence identity. The alignment used to determine the percent amino acid sequence identity can be accomplished in a variety of ways well known in the art, for example, using publicly available computer software such as BLAST, BLAST-2, ALIGN, or Megalign (DNASTAR) software. One skilled in the art can determine the appropriate parameters for aligning sequences, including any algorithms needed to achieve maximum alignment over the full length of the compared sequences.
Amino acid substitutions may include substitution of one amino acid in the polypeptide with another amino acid. Amino acid substitutions may be introduced into the antigen of interest and the product screened for desired activity (e.g., increased stability and/or immunogenicity).
Amino acids can generally be grouped according to the following common side chain characteristics:
(1) Hydrophobicity: norleucine, met, ala, val, leu, ile;
(2) Neutral hydrophilicity: cys, ser, thr, asn, gln;
(3) Acid: asp, glu;
(4) Alkaline: his, lys, arg;
(5) Residues that affect chain orientation: gly, pro; and
(6) Aromatic: trp, tyr, phe.
Conservative amino acid substitutions will involve the exchange of a member of one of these classes with another member of the same class. Non-conservative amino acid substitutions will involve exchanging members of one of these classes with members of the other class.
As used herein, the term "excipient" refers to a compound that is present in a composition comprising an active ingredient (e.g., mRNA encoding an antigen). Pharmaceutically acceptable excipients are inert pharmaceutical compounds and may include, for example, solvents, fillers, buffers, tonicity adjusting agents and preservatives (Pramantick et al, pharma Times,45:65-77,2013). In some embodiments, the compositions of the present disclosure include excipients that function as one or more of solvents, fillers, buffers, and tonicity adjusting agents (e.g., sodium chloride in saline may act as both an aqueous vehicle and tonicity adjusting agent).
Optimizing for intradermal delivery
Intradermal vaccination results in durable cellular immunity and increased immunogenicity [ hicking and Jones,2009]. Human skin (epidermis and dermis) is rich in Antigen Presenting Cells (APCs), including langerhans cells and dermal Dendritic Cells (DCs). Intradermal vaccination is known to be 5 to 10 times more effective than subcutaneous or intramuscular vaccination because it targets APCs present in the skin [ hicking and Jones,2009], thereby activating the T cell immune pathway to obtain durable immunity. By intradermal injection, srRNAts are taken up mainly by skin APCs, where they are replicated, antigen is produced, the antigen is digested into peptides, and the peptides are presented to T cells (fig. 1). Peptides presented by this pathway stimulate MHC-I restricted cd8+ killer T cells. In another approach, APCs also absorb antigens produced by nearby skin cells. Peptides presented by this pathway stimulate MHC-II restricted cd4+ helper T cells, which help B cells produce neutralizing antibodies (nAb) to combat viral infection.
Problems and solutions for intradermal injection
Here is the solution that we have identified as a potential problem and that provided by the srrrnats platform.
(1) One key unrecognized obstacle to the use of srRNA as an intradermal vaccine platform is that both mRNA and srRNA do not express antigen well at skin temperature [ PCT/US20/67506]. Non-intuitively, the temperature of human skin (about 30 ℃ -35 ℃) is lower than the human core body temperature (about 37 ℃); this means that the vectors and platforms developed at 37 ℃ are not optimal for intradermal injection. One innovation of the srRNAts platform is that it strongly expresses an antigen at skin temperature [ PCT/US20/67506]. In addition, such temperature control also minimizes safety risks caused by undesired systemic distribution of srRNAts, since srRNAts are deactivated once the temperature of the srRNAts rises above its allowable threshold (as it gets closer to the body core). In other words, the srrrNAts platform expresses the antigen optimally in case of intradermal injection compared to mRNA and srRNA, and it additionally has safety features: the ability of the vector to spread and produce is limited in other areas of the subject's body or the vector is inactive.
(2) Another challenge of intradermal vaccination is the lack of suitable additives. Because adjuvants (such as aluminum salts and oil-in-water) are too locally reactive for delivery by the intradermal route, no adjuvant is incorporated into clinically approved intradermal vaccines, resulting in lower immunogenicity [ Hickling and Jones,2009]. Lipid Nanoparticles (LNPs) for intramuscularly administered mRNA and srRNA vaccines are also oil-in-water, which may cause skin reactivities and increase the risk of allergic reactions to LNP components (such as PEG). The c-srRNA platform is a solution to this problem because the platform is injected with naked c-srRNA (LNP free, adjuvant free). First, the self-replication of RNA within cells (especially APCs) induces strong innate immunity, which replaces the primary function of adjuvants. Second, the data in the literature and obtained during the development of the present disclosure indicate that, particularly for intradermal injection, naked mRNA/srrrna is equally effective in producing antigen compared to electroporation of mRNA/srrrna [ Johansson et al 2012] and mRNA/srrrnas combined with LNP [ Golombek et al 2018 ].
(3) A third challenge is the limited number of precedents for intradermal vaccines. Only BCG vaccine is routinely administered intradermally, and currently available covd-19 vaccines are all administered intramuscularly. One way we have reduced the obstacles to using intradermal injections is to use specialized devices such as MicronJet600 (NanoPass) and Immucise (Terumo), which now allow for simple, consistent intradermal injections. These devices are also good candidates for mass production and deployment. However, intradermal injection by Mantoux technology using standard needles and syringes is also an option due to the relatively high cost of these special devices.
Design of suitable antigens
Methods focused on cellular immunity allow reconsidering all proteins encoded on the viral genome as antigen candidates, as humoral immunity (i.e. induction of neutralizing antibodies) is not a major consideration.
When selecting antigens that will provide broader protection against SARS-CoV-1, SARS-CoV-2, MERS-CoV and variants thereof, it is determined that nucleoprotein (N) is most suitable because (1) N is the most abundant protein, followed by membrane (M) and spike (S) in the viral particle [ Finkel et al, 2021], (2) N is generally the most conserved protein in the above mentioned beta coronaviruses [ Grifoni et al, 2020], and (3) epitopes of B and T cells are the most abundant in S and N [ Grifoni et al, 2020]. This is consistent with previous recommendations (N is the best antigen for a vaccine) [ Dutta et al 2020]. Notably, recent reports clearly indicate that vaccines using N alone as antigen can provide S-independent protective immunity in both hamsters and mice [ matrichett et al 2021]. Although disease enhancement of N-vaccine as well as S-vaccine was previously observed [ Lambert et al 2020], these data were obtained by using different vectors with unfavorable Th2> Th1 characteristics.
An exemplary vaccine candidate, srRNA1ts2-G5005, was designed to express the N protein of SARS-CoV-2 (SARS 2-N). However, MERS-N forms a distinct group and shows only 48% identity [ Tilocca et al 2020]. In this regard, another exemplary vaccine candidate, srRNA1ts2-G5006, was designed to express a fusion protein of SARS2-N and MERS-N. The G5005 and G5006 antigens are schematically shown in figure 2. srRNA1ts2-G5005 is suitable for inducing an immune response against SARS-CoV-1, SARS-CoV-2 and variants thereof. In contrast, srRNA1ts2-G5006 is suitable for inducing an immune response against the pan-type coronavirus (e.g., against SARS-CoV-1, SARS-CoV-2, MERS-CoV, and variants thereof).
To cope with the emergence of the variant (mutant) form of SARS-CoV-2 virus, C-srRNA encoding RBD of SARS-CoV-2 Ormi Rong Bianchong (G5003 o) was generated and administered intradermally to C57BL/6 mice (example 8 and FIG. 15). Cellular immunity was assessed 14 days after vaccination. The results clearly show that c-srRNA can induce an obreck Rong Bianchong-specific cellular immunity when it contains the open reading frame of the Receptor Binding Domain (RBD) of the obreck variety. Importantly, c-srRNA encoding G5003o antigen was found to induce a Th 1-biased response, as shown in FIGS. 16A-16B [ Th1 (INF-gamma) > Th2 (IL-4) ], which is advantageous for vaccines.
Inclusion of prime boost regimens
One of the unique features of the intradermally administered c-srRNA vaccine is its ability to induce cellular immunity but not significantly induce humoral immunity (i.e., antibodies). As determined during the development of the present disclosure, the c-srRNA vaccine was able to elicit a humoral immune response against the subsequently encountered protein antigen. Briefly, mice were first treated with c-srRNA encoding antigen (i.e., RBD of SARS-CoV-2 original strain), followed by adjuvanted variant RBD protein (i.e., RBD of SARS-CoV-2 delta variant), as described in example 9 and shown in FIG. 17A.
Cellular immunity (assessed by measuring the presence of antigen-specific IFN-gamma secreting T cells) has been induced on day 14 after primary vaccination (priming), as shown in FIG. 17B. No antigen-specific antibodies were detected at this time. Antibodies were induced after treatment with adjuvanted protein antigen as early as day 7 after the second vaccination (boost), as shown in figure 17C. Early induction of this antibody was consistent with a secondary immune response, indicating that c-srRNA has elicited humoral immunity. Importantly, antibodies raised against protein antigens are capable of neutralizing viral variants having RBD sequences that differ from the RBD antigens encoded by the c-srRNA vaccine. This unexpected finding suggests that the c-srRNA vaccine can induce a protective immune response against pathogens having an antigen sequence that is different from the antigen sequence encoded by the c-srRNA vaccine. Thus, the c-srRNA vaccine is expected to induce a broadly reactive immune response, which is crucial for providing protection against variant pathogens.
Subunit vaccines against pathogens typically do not provide long-lasting humoral immunity (i.e., pathogen-specific antibodies), and thus one or more booster vaccines are needed. As determined during development of the present disclosure, the c-srRNA vaccine is suitable for use as a booster vaccine when the adjuvanted protein is administered as a primary vaccine. Briefly, mice were first treated with adjuvanted protein (i.e., RBD of SARS-CoV-2 original strain), followed by placebo (PBO: buffer only), c-srRNA encoding G5003 antigen (original RBD), c-srRNA encoding G5003o antigen (obrong RBD), or adjuvanted protein antigen (original RBD), as described in example 10 and FIG. 18A.
As shown in fig. 17C, the C-srRNA vaccine alone did not induce humoral immunity in the form of a neutralizing antibody response (see, PBO day 7). However, when humoral immunity was elicited by adjuvanted proteins (as a model for primary vaccination), the C-srRNA boost vaccine was able to induce both antigen-specific cytokine responses (FIGS. 18B-18C) and antigen-specific antibody responses (FIG. 19). Notably, under current experimental conditions, shan Jizuo agent proteins do not induce RBD-specific antibodies. Clearly, cellular immunity induced by c-srRNA is able to stimulate antibody production against the protein antigens encountered earlier. This observation suggests that an important interaction between the cellular immune response and the humoral immune response occurs.
Elimination of antigen expressing cells in vivo
The c-srRNA vaccine is capable of inducing a strong cellular immune response (i.e., antigen specific CD8+ cytotoxic T lymphocytes and CD4+ helper T lymphocytes). Antigen-specific cd8+ CTLs lyse cells expressing the antigen. Antigen recognition by cd8+ CTLs is based on the presentation of short peptide fragments (T cell epitopes) by MHC class I molecules, so that the antigen need not be expressed on the surface of the target cells. For vaccines against pathogens, the vaccine is expected to lyse cells infected with the pathogen. For vaccines against cancer, the vaccine is expected to lyse cancer cells.
A c-srRNA vaccine was produced that encodes a fusion protein of SARS-CoV-2 nucleoprotein and MERS-CoV nucleoprotein (referred to as SMN protein or G5006) as antigens. To model virus-infected cells, a 4T1 breast cancer cell line was selected, which was derived from BALB/c mice and is known as a model for triple negative stage IV human breast cancer. When injected into BALB/c mice, 4T1 cells grew rapidly and formed tumors. This syngeneic mouse model was used to model the rapid increase in infected cells. 4T1 cells expressing SMN protein (designated 4T 1-SMN) were established by transfecting a plasmid vector encoding the SMN protein under the CMV promoter such that the protein was constitutively expressed in the 4T1 cells. The fusion protein is identical to G5006 except that the CD5 signal peptide is removed from the N-terminus of SMN protein expressed in 4T1 cells.
BALB/C mice were vaccinated with C-srRNA-G5006 and induction of cellular immunity was demonstrated by the presence of T cells that responded to both SARS-CoV-2 nucleoprotein (FIGS. 20A-20B) and MERS-CoV nucleoprotein (FIGS. 20C-20D). Subsequently, on day 24 (24 days after vaccination), 4T1-SMN cells were injected into BALB/c mice vaccinated with c-srRNA-G5006. As expected, 4T1-SMN cells grew rapidly in mice receiving placebo (no-vaccine group). In contrast, in c-srRNA-G5006 vaccinated mice, the growth of 4T1-SMN tumors was inhibited. In two mice receiving the 25 μ G c-srRNA-G5006 vaccine, even though the tumor initially grew, the mice eventually became tumor-free and survived for a long time after death of the placebo recipients. Furthermore, there was no tumor growth even after the second round of 4T1-SMN tumor injection at day 143 post-vaccination, and mice remained viable and tumor-free for the duration of the study (fig. 21). This result suggests that the c-srRNA vaccine encoding the G5006 antigen (i.e., SMN protein) can induce a protective immune response by eliminating cells infected with SARS-CoV-2 or MERS-CoV.
Ubiquitin-type coronavirus-enhanced vaccine
For infectious diseases, such as covd-19, world health organization (World Health Organization) guidelines require licensed vaccines capable of inducing neutralizing antibodies (nabs). This requirement is significant because nabs can prevent cells from being infected, and thus nabs can effectively control the spread of infection. However, the na b levels typically drop rapidly, so periodic (e.g., once or twice a year) boosting of the vaccine is required after the primary vaccination series (1 st and 2 nd vaccinations) are completed to maintain adequate na b levels. The high mutation rate of SARS-CoV-2 (particularly within the RBD of spike protein that is the target of nAb) is a major problem associated with the use of the first generation of COVID-19 vaccines that typically target SARS-CoV-2 spike protein.
To solve these problems, a new booster vaccine, c-srRNA-G5006d, was developed, which encodes a fusion protein comprising: CD5 signal peptide, spike-RBD of SARS-CoV-2, nucleoprotein of MERS-CoV, and spike-RBD of MERS-CoV (example 12 and FIG. 22). The amino acid sequence of the pan-type coronavirus antigen (G5006 d) is shown as SEQ ID NO. 27, and the nucleotide sequence of the open reading frame is shown as SEQ ID NO. 26. The order of each sequence segment of the fusion protein (RBD of SARS-CoV-2; nucleoprotein of MERS-CoV; RBD of MERS-CoV) can be altered and the amino acid sequence of each segment need not be 100% identical to the exemplary sequences provided herein.
After receiving a primary vaccine series (either vaccination 1 or vaccination 1 and vaccination 2) against the spike antigen or fragment thereof (RBD), the c-srRNA-G5006d vaccine is intended to be used as a booster vaccine. However, the c-srRNA-G5006d vaccine can also be used as part of the primary vaccine family.
The c-srRNA-G5006d vaccine boosts nAb levels and provides cellular immunity against human-infected beta coronaviruses. Cellular immunity is important to provide long-term protection against severe disease, hospitalization, and death.
When a c-srRNA vaccine encoding spike-RBD is used as a booster vaccine, the c-srRNA vaccine can increase the level of antibodies or nAbs against spike-RBD after administration of a vaccine that can elicit or induce humoral immunity, as described in example 10.
c-srRNA-G5006d encodes both the spike-RBD protein of SARS-CoV-2 and the spike-RBD protein of MERS-CoV. Thus, c-srRNA-G5006d can be used as a booster vaccine for both SARS-CoV-2 and MERS-CoV.
The spike proteins of SARS-CoV-2 and SARS-CoV are similar (about 76% identity) (Grifoni et al 2020). Thus, c-srRNA-G5006d is effective as a potentiator against SARS-CoV-2, SARS-CoV and its variants. In another aspect, the spike proteins of SARS-CoV-2 and MERS-CoV are different (about 35% identity) (Grifoni et al 2020). However, c-srRNA-G5006d also encodes spike-RBD of MERS-CoV. Thus, c-srRNA-G5006d is effective as a booster for MERS-CoV and its variants. In summary, c-srRNA-G5006d is effective as a potentiator against SARS-CoV-2, SARS-CoV, MERS-CoV and variants thereof.
c-srRNA-G5006d also encodes the nucleoprotein of SARS-CoV-2 and MERS-CoV. Thus, c-srRNA-G5006d is capable of inducing strong cellular immunity against SARS-CoV-2 and MERS-CoV. The nucleoproteins of SARS-CoV-2 and SARS-CoV are very similar to each other (about 90% identity) (Grifoni et al 2020). Thus, c-srRNA-G5006d provides strong cellular immunity against SARS-CoV-2, SARS-CoV and its variants. In contrast, the nucleoproteins of SARS-CoV-2 and MERS-CoV are different (about 48% identity) (Grifoni et al 2020). However, c-srRNA-G5006d also encodes the nucleoprotein of MERS-CoV. Thus, c-srRNA-G5006d is expected to provide strong cellular immunity against MERS-CoV and its variants. In summary, c-srRNA-G5006 induced potent immune responses against SARS-CoV-2, SARS-CoV, MERS-CoV and variants thereof.
As described in example 9 and example 10, the c-srRNA vaccine has a significant mode of action. That is, the encoded antigen does not appear to directly stimulate B cells, and thus the three-dimensional structure of the encoded antigen need not be considered. This is in contrast to conventional vaccines designed to directly stimulate B cells to produce antibodies directed against conformational epitopes (the three-dimensional structure of the antigen). That is why it is appropriate to use a fusion protein for the c-srRNA vaccine, whereas it is complicated to use a fusion protein for the conventional subunit vaccine because the natural three-dimensional structure of each antigen may be destroyed when expressed as a fusion protein. The c-srRNA boost vaccine stimulates antibody production by activation of CD4+ helper T cells, and thus it relies on a short peptide epitope (about 15 mer). Thus, two or more different antigens can simply be put together to form a single fusion protein to the antigen encoded by the c-srRNA vaccine, and this mechanism can be problematic for subunit vaccine design.
The fact that c-srRNA relies on short peptide epitopes for inducing cellular and humoral immune responses also provides advantages for vaccines that elicit broader reactivity against protection of variant pathogens. Many T cell epitopes are present in a single protein and thus any single mutation is unlikely to cause loss of immunogenicity. On the other hand, conventional subunit vaccines rely on the three-dimensional structure of protein antigens, and thus even a single mutation may alter the conformation of the protein, which may lead to loss of immunogenicity.
As shown in fig. 23A-23B, c-srRNA-G5006d can stimulate cellular immunity against all of the following proteins encoded by this vaccine: spike-RBD of SARS-CoV-2, nucleoprotein of MERS-CoV and spike-RBD of MERS-CoV.
Pandemic influenza booster vaccine
As determined during development of the present disclosure (see, example 6), when fusion proteins comprising nucleoproteins from representative influenza a and influenza b strains are expressed by intradermally injected temperature controllable self-replicating RNAs, the fusion proteins are capable of inducing a strong antigen-specific cellular immune response. Protection is generally thought to be mediated primarily by neutralizing antibodies directed against Hemagglutinin (HA), one of the surface proteins of influenza virus. Thus, FDA approved influenza vaccines include HA as an antigen alone or in combination with other influenza antigens. Since the c-srRNA based boost vaccine only requires CD4+ T cell epitopes on the HA protein to enhance Ab production, the three dimensional structure of the HA protein need not be considered. It is known that only some portion of the HA protein of H1N1 influenza virus can act as a CD4+ T cell epitope (Knowlden et al, pathogenens.8 (4): 220,2019). B cell epitopes and CD4+ T cell epitopes have been identified in both influenza A and influenza B (Terajima et al Virol J,10:244, 2013). The sequences of the HA proteins of representative H1N1 influenza viruses were aligned (Darricurrre et al, J Virol,92 (22): e01349-18, 2018) and regions with well conserved sequences were identified. Based on these considerations, the HA protein fragment (residues 316-456) of influenza A virus (A/New Caledonia/20/1999 (H1N 1)) [ GenBank accession number EU103824] and the HA protein fragment (residues 332-474) of influenza B virus (B/Florida/4/2006) [ GenBank accession number CY033876] were selected. Also included are nucleoproteins from influenza a and influenza b, which have been described in example 6 and are represented as G5010 antigens.
Figure 24 shows the design of a pan-type influenza booster vaccine. c-srRNA-G5012 encodes a fusion protein (G5012) comprising a signal peptide (residues 1-24) of CD5, a portion of Hemagglutinin (HA) of influenza A, a nucleoprotein of influenza B, and a portion of Hemagglutinin (HA) of influenza B. The amino acid sequence of the pan-type influenza virus antigen (G5012) is shown as SEQ ID NO. 29, and the nucleotide sequence of the open reading frame is shown as SEQ ID NO. 28. The order of each sequence segment of the fusion protein (part of HA for influenza a; nucleoprotein for influenza b; part of HA for influenza b) may be altered and the amino acid sequence of each segment need not be 100% identical to the exemplary sequences provided herein.
This c-srRNA-G5012 influenza vaccine potentiates nAb levels by enhancing HA-specific CD4+ helper T cells. It also provides cellular immunity against substantially all influenza viruses through evolutionarily conserved nucleoproteins. Cellular immunity is known to provide long-term protection against severe disease, hospitalization, and death.
Chitosan enhancement of in vivo Gene expression
Rnase inhibitors (proteins purified from human placenta) slightly enhance immunogenicity against antigens encoded on C-srRNA, most likely by enhancing expression of antigens from C-srRNA in vivo when injected intradermally into mice (see, e.g., figure 25C of WO 2021/138447 A1). RNase inhibitors can protect c-srRNA from RNase-mediated degradation in vivo. However, it is desirable to find alternative agents that can enhance the expression of a gene of interest (GOI) in vivo for therapeutic purposes, because it is difficult to use protein-based rnase inhibitors as excipients in injectable products.
Low molecular weight chitosan (molecular weight about 6 kDa) has been shown to inhibit RNase activity with an inhibition constant ranging from 30-220nM (Yakovlev et al Biochem Biophys Res Commun,357 (3): 584-8, 2007). Although this was only demonstrated in vitro and also for artificially prepared polynucleotides like poly (a)/poly (U), it was necessary to test in mice by intradermal injection of c-srRNA whether chitosan oligosaccharides could enhance expression of GOI from c-srRNA in vivo. As shown in example 14, the following two different chitosan oligomers were tested: chitosan oligomer (molecular weight. Ltoreq.5 kDa,. Gtoreq.75% deacetylation: heppe Medical Chitosan GmbH: product number 44009) and chitosan oligosaccharide lactic acid (molecular weight about 5kDa, >90% deacetylation: sigma-Aldrich: product number 523682). Unexpectedly, it was found that even very low levels of chitosan oligomer, as low as 0.001. Mu.g/mL (about 0.2nM: about 1/100 of the inhibition constant found by Yakovlev et al, supra, 2007) was able to enhance the expression of the luciferase encoded on c-srRNA by about 10-fold (FIG. 25). Similar enhancement of GOI expression was achieved by up to 0.5. Mu.g/mL of chitosan oligomer and 0.1. Mu.g/mL of chitosan oligosaccharide lactic acid.
Chitosan has been used as a nucleotide (DNA and RNA) delivery vehicle because it can form complexes or nanoparticles (reviewed in bussmann et al Adv Drug Deliv Rev,65 (9): 1234-70,2013; and Cao et al Drugs,17:381, 2019). However, it is notable that enhancement of GOI expression by chitosan oligomers is unlikely to be mediated by nanoparticle or complex formation of c-srrrna and chitosan oligomers. First, such low concentrations of chitosan oligomers do not allow for complex formation with RNA. Second, chitosan oligomer was added to the c-srRNA immediately prior to intradermal injection, so there was insufficient time to form a complex.
Since chitosan oligomers enhance GOI expression in vivo at much lower concentrations than are effective in vitro as rnase inhibitors (Yakovlev et al, supra, 2007), it is conceivable that such enhanced GOI expression by chitosan oligomers may not be mediated by its rnase inhibition mechanism. For example, chitosan oligomers may promote the incorporation of c-srRNA into cells, and thus may enhance the expression of GOI from c-srRNA. Nevertheless, this unexpected finding should provide an effective means of enhancing the in vivo therapeutic expression of the GOI encoded on c-srRNA.
Detailed description of the illustrated embodiments
1. A composition for stimulating an immune response against a coronavirus in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide; and
(ii) Nucleotide sequence encoding a nucleocapsid protein of a coronavirus.
2. The composition of embodiment 1, wherein the coronavirus is a beta coronavirus, optionally wherein the beta coronavirus is a human beta coronavirus.
3. The composition of embodiment 2, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1), middle east respiratory syndrome related coronavirus (MERS-CoV), or a combination thereof.
4. The composition of embodiment 3, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2).
5. The composition of embodiment 4, wherein the coronavirus nucleocapsid protein comprises a first nucleocapsid protein and a second nucleocapsid protein, wherein the first nucleocapsid protein is a SARS-CoV-2 nucleocapsid protein from a first variant of a first clade and the second nucleocapsid protein is a SARS-CoV-2 nucleocapsid protein from a second variant of a second clade, and wherein the first clade and the second clade are different clades defined by one or more of the world health organization, pango, GISAID, and Nextstrain.
6. A composition for stimulating an immune response against a coronavirus in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide; and
(ii) Nucleotide sequences encoding two or more coronavirus nucleocapsid proteins.
7. The composition of embodiment 6, wherein the coronavirus is a beta coronavirus, optionally wherein the beta coronavirus is a human beta coronavirus.
8. The composition of embodiment 7, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1), middle east respiratory syndrome related coronavirus (MERS-CoV), or a combination thereof.
9. The composition of embodiment 8, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2).
10. The composition of embodiment 9, wherein the two or more coronavirus nucleocapsid proteins comprise a SARS-CoV-2 nucleocapsid protein and a MERS nucleocapsid protein.
11. The composition of embodiment 9, wherein the two or more coronavirus nucleocapsid proteins comprise a SARS-CoV-2 nucleocapsid protein, a SARS-CoV-1 nucleocapsid protein, and a MERS nucleocapsid protein.
12. The composition of any of embodiments 6-11, wherein the two or more coronavirus nucleocapsid proteins are separated by a linker of one to ten residues in length.
13. The composition of any of embodiments 1-12, wherein the mammalian signal peptide is a signal peptide of a surface protein expressed in mammalian antigen presenting cells.
14. The composition of embodiment 13, wherein the mammalian signal peptide is a CD5 signal peptide and the amino acid sequence of the CD5 signal peptide comprises SEQ ID No. 8 or an amino acid sequence at least 90% or 95% identical to SEQ ID No. 8.
15. The composition of any one of embodiments 1-14, wherein the amino acid sequence of the nucleocapsid protein comprises residues 2-419 of SEQ ID No. 5 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to residues 2-419 of SEQ ID No. 5.
16. The composition of any of embodiments 1-14, wherein the amino acid sequence of the fusion protein comprises SEQ ID No. 6 or an amino acid sequence at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID No. 6.
17. The composition of any of embodiments 6-14, wherein the amino acid sequence of the fusion protein comprises SEQ ID No. 7 or an amino acid sequence at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID No. 7.
18. The composition of embodiment 16, wherein the open reading frame comprises the nucleotide sequence of SEQ ID No. 2.
19. The composition of embodiment 17, wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO. 3 or SEQ ID NO. 4.
20. The composition of any of embodiments 1-14, wherein the amino acid sequence of the fusion protein comprises residues 2-413 of SEQ ID No. 9 or an amino acid sequence at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to residues 2-413 of SEQ ID No. 9.
21. The composition of any of embodiments 1-14, wherein the amino acid sequence of the fusion protein comprises residues 2-422 of SEQ ID No. 10 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to residues 2-422 of SEQ ID No. 10.
22. The composition of any one of embodiments 1-21, wherein the composition does not comprise a liposome or a lipid nanoparticle.
23. The composition of any one of embodiments 1-22, wherein the mRNA is a self-replicating mRNA.
24. The composition of embodiment 23, wherein the self-replicating RNA comprises an alphavirus replicon lacking viral structural protein coding regions.
25. The composition of embodiment 24, wherein the alphavirus is selected from the group consisting of venezuelan equine encephalitis virus, sindbis virus, and semliki forest virus.
26. The composition of embodiment 25, wherein the alphavirus is venezuelan equine encephalitis virus.
27. The composition of any one of embodiments 23-26, wherein the alphavirus replicon comprises a non-structural protein coding region in which 12-18 nucleotides are inserted resulting in expression of nsP2 comprising 4 to 6 additional amino acids between β -sheet 4 and β -sheet 6 of non-structural protein 2 (nsP 2).
28. The composition of any one of embodiments 1-27, wherein the self-replicating mRNA is a temperature-sensitive agent (ts agent) that is capable of expressing the fusion at a permissive temperature and not expressing the fusion at an non-permissive temperature.
29. The composition of embodiment 28, wherein the permissible temperature is 31 ℃ to 35 ℃ and the impermissible temperature is at least 37 ℃ ± 0.5 ℃.
30. A method for stimulating an immune response against a coronavirus in a mammalian subject, the method comprising administering to a mammalian subject the composition of any one of embodiments 1-29 to stimulate an immune response against the coronavirus nucleocapsid protein in the mammalian subject.
31. The method of embodiment 30, wherein the composition is administered intradermally.
32. The method of embodiment 30 or embodiment 31, wherein the immune response comprises a coronavirus-reactive cellular immune response.
33. The method of embodiment 32, wherein the immune response further comprises a coronavirus-reactive humoral immune response.
34. The method of any one of embodiments 30-33, wherein the mammalian subject is a human subject.
35. A kit, the kit comprising:
the composition according to any one of embodiments 1-29 or any one of embodiments 37-62; and
a device for intradermal delivery of the composition to a mammalian subject.
36. The kit of embodiment 35, wherein the device comprises a syringe and a needle.
37. A composition for stimulating an immune response against two or more viruses in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide; and
(ii) Nucleotide sequences encoding a first nucleocapsid protein of a first virus and a second nucleocapsid protein of a second virus.
38. The composition of embodiment 37, wherein the first virus and the second virus are capable of causing a disease upon infection of a human subject.
39. The composition of embodiment 38, wherein the first virus and the second virus are different varieties, subtypes or lineages of the same species.
40. The composition of embodiment 38, wherein the first virus and the second virus are different species of the same genus.
41. The composition of embodiment 40, wherein both the first virus and the second virus are members of the genus betacoronavirus.
42. The composition of embodiment 41, wherein the first virus and the second virus comprise severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) and middle east respiratory syndrome related coronavirus (MERS-CoV).
43. The composition of embodiment 38, wherein the first virus and the second virus are members of different families, orders, classes, or gates of the same kingdom.
44. The composition of embodiment 43, wherein both the first virus and the second virus are members of the orthomyxoviridae family.
45. The composition of embodiment 44, wherein the first virus and the second virus comprise influenza a virus and influenza b virus.
46. The composition of embodiment 45, wherein the amino acid sequence of the fusion protein comprises SEQ ID NO 16 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID NO 16.
47. The composition of embodiment 38, wherein both the first virus and the second virus are members of the kingdom orthoribo virus, optionally wherein the first virus and the second virus comprise: (a) Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1) or middle east respiratory syndrome related coronavirus (MERS-CoV); and (b) influenza a virus or influenza b virus.
48. The composition of embodiment 40, wherein both the first virus and the second virus are members of the genus ebola, optionally wherein the first virus and the second virus are selected from the group consisting of zaire-type ebola virus, sudan-type ebola virus, bund Jiao Xingai ebola virus, and tay forest-type ebola virus.
49. The composition of embodiment 48, wherein the nucleotide sequence further encodes a third nucleocapsid protein of a third virus and a fourth nucleocapsid protein of a fourth virus, and the first virus, the second virus, the third virus, and the fourth virus are zaire-type ebola virus, sudan-type ebola virus, bundi Jiao Xingai bola virus, and tay forest-type ebola virus.
50. The composition of embodiment 49, wherein the amino acid sequence of the fusion protein comprises SEQ ID NO. 22 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID NO. 22.
51. The composition of embodiment 49, wherein the nucleotide sequence (ii) encodes a shared portion of a first nucleocapsid protein of the first virus for stimulating an immune response against all of the first virus, the second virus, the third virus, and the fourth virus.
52. The composition of embodiment 51, wherein the nucleotide sequence (ii) encodes a separate portion of the first nucleocapsid protein, the second nucleocapsid protein, the third nucleocapsid protein, and the fourth nucleocapsid protein each for stimulating an immune response against all of the first virus, the second virus, the third virus, and the fourth virus.
53. The composition of embodiment 52, wherein the nucleotide sequence (ii) encodes a fragment of a separate portion of a second nucleocapsid protein of the second virus for stimulating an immune response against the second virus and the third virus.
54. The composition of embodiment 37, wherein the nucleotide sequence (ii) encodes a shared portion of a first nucleocapsid protein of the first virus for stimulating an immune response against both the first virus and the second virus.
55. The composition of embodiment 54, wherein the nucleotide sequence (ii) encodes a separate portion of the first nucleocapsid protein and the second nucleocapsid protein each for stimulating an immune response against both the first virus and the second virus.
56. The composition of any one of embodiments 37-48, wherein the nucleotide sequence (ii) further encodes at least one other nucleocapsid protein of at least one other virus, and wherein the at least one other virus is different from the first virus and the second virus.
57. The composition of any of embodiments 37-56, wherein the first nucleocapsid protein and the second nucleocapsid protein, or the first nucleocapsid protein, the second nucleocapsid protein and the other nucleocapsid proteins, are separated by a linker of one to ten residues in length.
58. The composition of any one of embodiments 37-57, wherein the mammalian signal peptide is a signal peptide of a surface protein expressed in mammalian antigen presenting cells.
59. The composition of any one of embodiments 37-58, wherein the mRNA is a self-replicating mRNA.
60. The composition of embodiment 59, wherein the self-replicating mRNA is a temperature-sensitive agent (ts agent) that is capable of expressing the fusion protein at a permissive temperature and not expressing the fusion protein at an non-permissive temperature.
61. The composition of embodiment 60, wherein the permissible temperature is 31 ℃ to 35 ℃ and the impermissible temperature is at least 37 ℃ ± 0.5 ℃.
62. The composition of any one of embodiments 1-29 or any one of embodiments 37-61, wherein the composition further comprises chitosan.
63. A method for stimulating an immune response against two or more viruses in a mammalian subject, the method comprising administering to a mammalian subject a composition according to any one of embodiments 37-62 to stimulate an immune response against nucleocapsid proteins of the two or more viruses in the mammalian subject.
64. The method of embodiment 63, wherein the composition is administered intradermally.
65. The method of embodiment 63 or embodiment 64, wherein the immune response comprises a cellular immune response reactive to the two or more viruses.
66. The method of embodiment 65, wherein the cellular immune response comprises a nucleocapsid protein specific helper T lymphocyte (Th) response comprising nucleocapsid protein specific cytokine secretion.
67. The method of embodiment 66, wherein the nucleocapsid protein specific cytokine secretion comprises secretion of one or both of interferon-gamma and interleukin-4.
68. The method of embodiment 65, wherein the cellular immune response comprises a nucleocapsid protein specific Cytotoxic T Lymphocyte (CTL) response.
69. The method of any one of embodiments 65-68, wherein the immune response further comprises a humoral immune response reactive against the two or more viruses.
70. The method of any one of embodiments 63-69, wherein the mammalian subject is a human subject.
71. A composition for stimulating an immune response against a virus in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide;
(ii) A nucleotide sequence encoding a first viral antigen of a first virus or a fragment thereof; and
(iii) A nucleotide sequence encoding a second viral antigen or fragment thereof of the first virus or second virus, wherein the first viral antigen is a nucleocapsid protein and the second viral antigen is a surface protein, or the first viral antigen is a surface protein and the second viral antigen is a nucleocapsid protein.
72. A composition for stimulating an immune response against two or more viruses in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide;
(ii) A nucleotide sequence encoding a first viral antigen of a first virus or a fragment thereof;
(iii) A nucleotide sequence encoding a second viral antigen of the first virus or a fragment thereof;
(iv) A nucleotide sequence encoding a third viral antigen of a second virus or a fragment thereof;
(iii) A nucleotide sequence encoding a fourth viral antigen of said second virus or a fragment thereof,
wherein the first viral antigen is a first nucleocapsid protein and the second viral antigen is a first surface protein, or the first viral antigen is a first surface protein and the second viral antigen is a first nucleocapsid protein, and wherein the third viral antigen is a second nucleocapsid protein and the fourth viral antigen is a second surface protein, or the third viral antigen is a second surface protein and the fourth viral antigen is a second nucleocapsid protein.
73. The composition of embodiment 71 or embodiment 72, wherein the mRNA is a self-replicating mRNA.
74. The composition of embodiment 73, wherein the self-replicating RNA comprises an alphavirus replicon that lacks viral structural protein coding regions.
75. The composition of embodiment 74, wherein the alphavirus is selected from the group consisting of venezuelan equine encephalitis virus, sindbis virus, and semliki forest virus.
76. The composition of embodiment 74, wherein the alphavirus is venezuelan equine encephalitis virus.
77. The composition of any one of embodiments 73-76, wherein the self-replicating mRNA is a temperature-sensitive agent (ts agent) that is capable of expressing the fusion protein at a permissive temperature and not expressing the fusion protein at an non-permissive temperature.
78. The composition of embodiment 77, wherein said permissible temperature is 31 ℃ to 35 ℃ and said impermissible temperature is at least 37 ℃ ± 0.5 ℃.
79. The composition of any one of embodiments 74-78, wherein said alphavirus replicon comprises a non-structural protein coding region in which 12-18 nucleotides are inserted resulting in expression of nsP2 comprising 4 to 6 additional amino acids between β -sheet 4 and β -sheet 6 of non-structural protein 2 (nsP 2).
80. The composition of any one of embodiments 71-79, wherein the first virus and/or the second virus is a coronavirus, optionally wherein the coronavirus is a beta coronavirus, optionally wherein the beta coronavirus is a human beta coronavirus.
81. The composition of embodiment 80, wherein the first virus and/or the second virus is a beta coronavirus independently selected from the group consisting of: severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1), and middle east respiratory syndrome related coronavirus (MERS-CoV).
82. The composition of embodiment 80, wherein the first virus is SARS-CoV-2 and the second virus is MERS-CoV.
83. The composition of any one of embodiments 80-82, wherein each of said surface protein, said first surface protein, and/or said second surface protein comprises a Receptor Binding Domain (RBD) of a coronavirus spike protein.
84. The composition of embodiment 83, wherein the amino acid sequence of the fusion protein comprises SEQ ID NO 27 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID NO 27.
85. The composition of any one of embodiments 71-79, wherein said first virus and/or said second virus is a member of the orthomyxoviridae family.
86. The composition of embodiment 85, wherein said first virus and/or said second virus is independently selected from the group consisting of Influenza A Virus (IAV) and Influenza B Virus (IBV).
87. The composition of embodiment 86, wherein the first virus is an IAV and the second virus is an IBV.
88. The composition of any one of embodiments 85-87, wherein the surface protein, the first surface protein, and/or the second surface protein each comprise a portion of influenza hemagglutinin.
89. The composition of embodiment 88, wherein the amino acid sequence of the fusion protein comprises SEQ ID NO 29 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID NO 29.
90. The composition of any one of embodiments 71-89, wherein the composition further comprises chitosan.
91. A kit, the kit comprising:
(i) The composition of any one of embodiments 71-90; and
(ii) A device for intradermal delivery of the composition to a mammalian subject.
92. The kit of embodiment 91, wherein the device comprises a syringe and a needle.
93. The kit of embodiment 91 or embodiment 92, further comprising instructions for using the device to administer the composition to a mammalian subject to stimulate an immune response against one or more of the first viral antigen, the second viral antigen, the third viral antigen, and the fourth viral antigen.
94. A method of stimulating an immune response in a mammalian subject, the method comprising administering to a mammalian subject the composition of any one of embodiments 71-90 to stimulate an immune response in the mammalian subject against one or more of the first viral antigen, the second viral antigen, the third viral antigen, and the fourth viral antigen.
95. The method of embodiment 94, wherein the composition is administered intradermally.
96. The method of embodiment 95, wherein the immune response comprises a cellular immune response reactive against one or more of the first viral antigen, the second viral antigen, the third viral antigen, and the fourth viral antigen.
97. The method of embodiment 96, wherein the immune response further comprises a humoral immune response reactive against one or more of the first viral antigen, the second viral antigen, the third viral antigen, and the fourth viral antigen.
98. The method of any one of embodiments 94-97, wherein the mammalian subject is a human subject.
99. A method for active boosting against at least one virus, the method comprising intradermally administering the composition according to any one of embodiments 1-29, embodiments 37-62, or embodiments 71-90 to a mammalian subject in need thereof to stimulate a secondary immune response against the virus, wherein the mammalian subject has undergone a primary immunization regimen against the virus.
100. The method of embodiment 99, wherein the primary immunization regimen comprises administration of at least one dose of a different vaccine against the virus.
101. The method of embodiment 100, wherein the different vaccine comprises a protein antigen of the at least one virus or an inactivated virus, optionally wherein the protein antigen is a recombinant protein or fragment thereof.
102. A method for active boosting immunity against at least one virus, the method comprising:
(i) Administering intradermally the composition of any one of embodiments 1-29, any one of embodiments 37-62, or any one of embodiments 71-90 to a mammalian subject in need thereof to stimulate a primary immune response against the virus; and
(ii) At least one dose of a different vaccine against the virus is administered to the mammalian subject to stimulate a secondary immune response against the virus.
103. The method of embodiment 102, wherein the different vaccine comprises a protein antigen of the at least one virus or an inactivated virus, optionally wherein the protein antigen is a recombinant protein or fragment thereof.
104. A method for active primary immunization against at least one virus, the method comprising:
(i) Administering intradermally to a mammalian subject in need thereof the composition according to any one of embodiments 1-29, any one of embodiments 37-62, or any one of embodiments 71-90, to stimulate a primary immune response against the virus, wherein the mammalian subject has not undergone a primary immune regimen against the virus.
105. The method of embodiment 104, the method further comprising:
(ii) At least one dose of a different vaccine against the virus is administered to the mammalian subject to stimulate a secondary immune response against the virus.
106. The method of embodiment 105, wherein the different vaccine comprises a protein antigen of the at least one virus or an inactivated virus, optionally wherein the protein antigen is a recombinant protein or fragment thereof.
107. The method of any one of embodiments 94-106, wherein the mammalian subject is a human subject. 108. An expression vector comprising the mRNA of any one of the preceding claims in operable combination with a promoter.
109. The expression vector of embodiment 108, wherein the promoter is a T7 promoter or an SP6 promoter.
110. The expression vector of embodiment 108, wherein the vector is a plasmid.
111. The expression vector according to any one of embodiments 108-110, further comprising a selectable marker.
Examples
Abbreviations: ab (antibody); APC (antigen presenting cell); coV (coronavirus); c-srRNA (temperature controlled self-replicating RNA); CTL (cytotoxic T lymphocytes); fluA or IAV (influenza A virus); fluB or IBV (influenza B virus); IL-4 (interleukin-4); INF-gamma (interferon gamma); GOI (gene of interest); HA (hemagglutinin); MERS (middle east respiratory syndrome associated); nAb (neutralizing antibody); n or NP (nucleocapsid or nucleoprotein); nsP (non-structural protein); ORF (open reading frame); PBO (placebo); RBD (receptor binding domain); s (spike); PRNT (plaque reduction neutralization test); SARS (severe acute respiratory syndrome); SFC (spot forming cells); SFU (spot forming unit); srRNAts (temperature sensitive self-replicating RNA); th (helper T lymphocytes); and Tx (processing). The terms c-srRNA and srRNA are used interchangeably throughout the disclosure, with srRNA1ts2 (described in WO 2021/138447 A1) being an exemplary embodiment.
EXAMPLE 1 cellular immunity induced by srRNA1ts2-G5004
This example describes the findings that: when SARS-CoV-2 nucleoprotein alone (G5004 antigen, without signal peptide) was expressed from intradermally injected temperature-controllable self-replicating RNA, the protein did not induce an effective cellular immune response.
Materials and methods.
CD-1 inbred female mice.
srRNA1ts2-G5004 mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding the G5004 antigen (srRNA 1ts2 as described in PCT/US 2020/067506) (FIG. 2).
From nucleoprotein (UniProt: P0DTC 9) [ JPT peptide product code) by SARS-CoV-2: PM-WCPV-NCAP ] peptides were scanned to obtain 102 peptide (15 mer with 11 amino acid overlap) pools.
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
Recently, vaccination with nucleoprotein (N) alone has been shown to elicit cellular immunity and spike-independent protective immunity of SARS-CoV-2 in mice and hamsters (Machett et al, biorxiv.2021.04.26.441518.2021). Vaccination involves intravenous administration of a human adenovirus serotype 5 (Ad 5) vector (Ad 5-N) expressing an N sequence derived from the USA-WA1/2021 strain.
To test whether nuclear protein (N) alone (without signal peptide) could induce cellular immunity, ELISPot assays were performed 14 days after CD-1-outcrossing mice were vaccinated by a single intradermal injection of 5 μg or 25 μg of srRNA1ts2-G5004 (FIG. 2) or placebo (PBO: buffer only). Only weak induction of interferon-gamma (INF-gamma) secreting T cells (fig. 4A) and IL-4 secreting T cells (fig. 4B) was observed. Interestingly, no dose-dependent INF-gamma response was observed (5. Mu.g compared to 25. Mu.g).
It was concluded that nucleoprotein (N) alone did not induce an effective cellular immune response upon expression of temperature-controlled self-replicating RNA by intradermal injection.
EXAMPLE 2 cellular immunity induced by srRNA1ts2-G5005
This example describes the following findings: addition of the CD5 signal peptide to SARS-CoV-2 nucleoprotein induces an effective cellular immune response in CD-1 mice upon expression of temperature-controlled self-replicating RNA by intradermal injection.
Materials and methods
CD-1 inbred female mice.
srRNA1ts2-G5005 mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding the G5005 antigen (srRNA 1ts2 as disclosed in PCT/US 2020/067506) (FIG. 2).
From nucleoprotein (UniProt: P0DTC 9) [ JPT peptide product code) by SARS-CoV-2: PM-WCPV-NCAP ] peptides were scanned to obtain 102 peptide (15 mer with 11 amino acid overlap) pools.
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
Wild-type nucleoprotein contains no signal peptide or transmembrane domain and is therefore not expected to be directed against the secretory pathway of mammalian host cells. The inventors speculate that the lack of signal peptide may be responsible for the lack of induction of an effective cellular immune response by wild-type nucleoprotein (expressed by srRNA1ts2-G5004 of example 1). With this in mind, the coding region of the signal peptide sequence from the human CD5 gene was added to the nucleoprotein coding region to replace the initiation codon (ATG) of nucleoprotein in srRNA1ts2-G5005 (FIG. 2). The amino acid sequence of the CD5 signal peptide is MPMGSLQPLATLYLLGMLVASCLG (shown as SEQ ID NO: 8).
Cellular immunity was assessed by ELISpot assay 14 days after CD-1-outcrossing mice were vaccinated by single intradermal injection of 5 μg or 25 μg of srRNA1ts2-G5005 (FIG. 2) or placebo (PBO: buffer only).
As shown in fig. 5A, antigen-specific INF- γ secreting T cells were strongly induced in a dose-dependent manner (5 μg compared to 25 μg). In contrast, antigen-specific IL-4 secreting T cells were hardly induced (FIG. 5B). Th1 cells secrete INF-gamma, whereas Th2 cells secrete IL-4. Th1> Th2 immune responses are generally considered to be an advantageous feature of vaccines.
In summary, the addition of a signal peptide derived from human CD5 to the N-terminus of nucleoprotein (N) results in the induction of a strong antigen-specific cellular immune response when the protein is expressed by intradermally injected temperature-controllable self-replicating RNA. The srRNA1ts2-G5005 vaccine also showed good Th1 skewed (Th 1> Th 2) immune response.
EXAMPLE 3 cellular immunity induced by srRNA1ts2-G5005
This example describes the following findings: addition of CD5 signal peptide to SARS-CoV-2 nucleoprotein induces potent cellular immune responses in BALB/c mice upon expression of temperature-controlled self-replicating RNA by intradermal injection.
Materials and methods
BALB/c female mice.
srRNA1ts2-G5005 mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding the G5005 antigen (srRNA 1ts2 as described in PCT/US 2020/067506) (FIG. 2).
From nucleoprotein (UniProt: P0DTC 9) [ JPT peptide product code) by SARS-CoV-2: PM-WCPV-NCAP ] peptides were scanned to obtain 102 peptide (15 mer with 11 amino acid overlap) pools.
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
To test whether srRNA1ts2-G5005 could induce a strong cellular immune response in another mouse strain, immunogenicity studies were also performed in BALB/c mice. Cellular immunity was assessed by ELISpot assay 30 days after BALB/c mice were vaccinated by single intradermal injection of 5 μg or 25 μg of srRNA1ts2-G5005 (FIG. 2) or placebo (PBO: buffer only).
As shown in fig. 6A, antigen-specific INF- γ secreting T cells were strongly induced in a dose-dependent manner (5 μg compared to 25 μg). In contrast, antigen-specific IL-4 secreting T cells were not induced (FIG. 6B). Thus, favorable Th1> Th2 cellular responses were also observed in BALB/c mice.
In summary, the addition of a signal peptide derived from human CD5 to the N-terminus of nucleoprotein (N) significantly enhances the antigen-specific cellular immune response when the protein is expressed by intradermally injected temperature-controllable self-replicating RNA. As in CD-1 mice, the srRNA1ts2-G5005 vaccine showed an advantageous Th1 skewed (Th 1> Th 2) immune response in BALB/c mice.
EXAMPLE 4 humoral immunity induced by srRNA1ts2-G5005
This example describes the following findings: when SARS-CoV-2 nucleoprotein is expressed from intradermally injected temperature-controllable self-replicating RNA, the SARS-CoV-2 nucleoprotein induces an effective humoral immune response upon ligation to the human CD5 signal peptide.
Materials and methods
BALB/c female mice.
srRNA1ts2-G5005 mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding the G5005 antigen (srRNA 1ts2 as described in PCT/US 20/67506) (FIG. 2).
SARS-CoV-2 nucleocapsid IgG ELISA KIT (ENZO: ENZ-KIT 193-0001).
Results
To test whether srRNA1ts2-G5005 could induce humoral immunity, nuclear protein-specific IgG levels in serum were measured by ELISA after 30 days of BALB/c mice were vaccinated by a single intradermal injection of 5 μg or 25 μg of srRNA1ts2-G5005 (FIG. 2) or placebo (PBO: buffer only). IgG levels are represented by OD450 in ELISA. IgG levels were measured before (day-1) and after (day 30) vaccination (day 0).
As shown in fig. 7, nucleoprotein-specific serum IgG (5 μg compared to 25 μg) was strongly induced in a dose-dependent manner.
In summary, the addition of a signal peptide derived from human CD5 to the N-terminus of nucleoprotein (N) induces an antigen-specific humoral immune response when the protein is expressed by intradermally injected temperature-controllable self-replicating RNA.
EXAMPLE 5 cellular immunity induced by srRNA1ts2-G5006
This example describes the following findings: when a fusion protein comprising SARS-CoV-2 nucleoprotein and MERS-CoV nucleoprotein is expressed by intradermally injected temperature-controllable self-replicating RNA, the fusion protein can induce strong cellular immunity against SARS-CoV-2 and MERS-CoV.
Materials and methods
BALB/c female mice.
srRNA1ts2-G5006 mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding the G5006 antigen (srRNA 1ts2 as described in PCT/US 2020/067506) (FIG. C).
From nucleoprotein (UniProt: P0DTC 9) [ JPT peptide product code) by SARS-CoV-2: PM-WCPV-NCAP ] peptides the resulting pool of peptides (15 mer with 11 amino acid overlap) was scanned.
Peptide pools derived from peptide scans (15 MERS with 11 amino acid overlap) of nucleoprotein by MERS-CoV.
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
T cell epitopes are present in short linear peptides (typically in the size range of 8-11 residues for MHC class I and 10-30 residues for MHC class II). Unlike many B cell epitopes, the 3-D conformation of T cell epitopes is not important for recognition of immune cell receptors. Thus, the inventors speculate that nucleoproteins from different strains of β -coronavirus may be fused together in the absence of long linkers (greater than 10 amino acids in length) and used as vaccine antigens to elicit immune responses against different β -coronaviruses (e.g., SARS-CoV-1 and its variants, SARS-CoV-2 and its variants, and MERS-CoV and its variants).
To test this concept, a fusion protein comprising human CD5 signal peptide, SARS-CoV-2 nucleoprotein and MERS-CoV nucleoprotein was designed (see G5006 in FIG. 2C). Mice were vaccinated with srRNA1ts2-G5006 by intradermal injection and antigen-specific cellular immune responses were measured by the ELISPot assay. As expected, the srRNA1ts2-G5006 vaccine induced a strong INF-gamma secreting T cell response against both SARS-CoV-2 nucleoprotein (FIG. 8) and MERS-CoV nucleoprotein. Furthermore, cellular immune responses are expected to have a Th1> Th2 balance.
In summary, when fusion proteins comprising nucleoproteins from different beta coronaviruses are expressed by intradermally injected temperature-controllable self-replicating RNAs, the fusion proteins induce a strong antigen-specific cellular immune response.
Example 6 cellular immunity induced by srRNA1ts2-G5010 (influenza A vaccine)
This example describes an assessment of the immune response induced by a fusion protein comprising influenza a virus (FluA) nucleoprotein and influenza b virus (FluB) nucleoprotein when said fusion protein is expressed by intradermally injected temperature-controllable self-replicating RNA.
Materials and methods
BALB/c female mice.
srRNA1ts2-G5010 mRNA was produced by in vitro transcription of a temperature controllable self-replicating RNA vector (srRNA 1ts2[ PCT/US20/67506 ]) encoding the G5010 antigen (FIG. 9). The amino acid sequence of the G5010 fusion protein is shown as SEQ ID NO. 16. The nucleic acid sequence encoding the G5010 fusion protein was codon optimized for expression in human cells and is shown in SEQ ID NO. 15.
From Nucleoprotein (NP) through influenza a (H2N 2) (Swiss-Prot ID P21433) [ JPT peptide product code: peptide scan of PM-INFA-NPH2N2] a pool of 122 overlapping peptides (15 mer with 11 amino acid overlap) was obtained. The amino acid sequence of H2N2 nucleoprotein is shown in SEQ ID NO. 17.
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
Influenza a and b can infect humans and cause seasonal epidemics or pandemics (see "influenza virus type (Types of Influenza Viruses)" from CDC website www.cdc.gov/flu/abaout/viruses/types.htm). Nucleoprotein antigens are more conserved among different influenza strains than Hemagglutinin (HA) and Neuraminidase (NA) antigens conventionally included in influenza vaccines. For example, the amino acid sequences of the nucleoproteins of representative influenza a strains (H1N 1, H3N2, H5N8, H7N7, H7N9, H9N2, H10N 8) are very similar. Likewise, the amino acid sequences of the nucleoproteins of representative influenza b strains (Yamagata, victoria) are very similar. In contrast, the amino acid sequence of the nucleoprotein of influenza a is significantly different from that of influenza b.
T cell epitopes are present in short linear peptides (typically in the size range of 8-11 residues for MHC class I and 10-30 residues for MHC class II). Unlike many B cell epitopes, the conformation or 3D structure of T cell epitopes is not important for the recognition of immune cell receptors. Thus, one representative nucleoprotein from influenza a is expected to include many T-cell epitopes common to many influenza a strains. Also, one representative nucleoprotein from influenza b is expected to include a number of T-cell epitopes common to a number of influenza b strains. Thus, the inventors speculate that nucleoproteins from different influenza strains may be fused together in the absence of long linkers (greater than 10 amino acids in length) for use as vaccine antigens to elicit immune responses against different influenza viruses (e.g., different strains of influenza a and different strains of influenza b).
The amino acid sequences of the nucleoproteins of representative influenza a strains (H1N 1, H3N2, H5N8, H7N7, H7N9, H9N2 and H10N 8) were found to be similar to each other. The nucleoprotein of influenza strain H5N8 was chosen because it has minimal differences from the nucleoproteins of the other strains (H1N 1, H3N2, H7N7, H7N9, H9N2 and H10N 8). The nucleoprotein of influenza B strain (B/Florida/4/2006;GenBank CY033879.1) was selected as a representative influenza B nucleoprotein. A fusion protein comprising a human CD5 signal peptide, one FluA nucleoprotein and one FluB nucleoprotein (see G5010 in fig. 9) was designed and the coding region of the fusion protein was cloned downstream of the subgenomic promoter of srRNA1ts 2. mRNA is then produced by in vitro transcription. The amino acid sequence of FluA nucleoprotein is shown as SEQ ID NO. 13 (influenza A, subtype H5N8 [ A/b.sub.d./Korea/Gochang 1/2014], genBank number KJ413835.1, proteinID number AHL 21420.1), and the amino acid sequence of FluB nucleoprotein is shown as SEQ ID NO. 14 (influenza B [ B/Florida/4/2006], genBank number CY033879.1, proteinID number ACF 54251.1).
Mice were vaccinated with srRNA1ts2-G5010 by intradermal injection and antigen-specific cellular immune responses were measured by ELISPot assay. To recall nucleoprotein-reactive T-cell immunity, splenocytes harvested from mice 14 days post vaccination were restimulated using 122 overlapping peptide pools from peptide scans of the influenza a nucleoprotein sequence as shown in SEQ ID NO: 17. Despite the differences between the influenza a nucleoprotein sequence of G5010 and that of the peptide pool (fig. 10), the srRNA1ts2-G5010 vaccine induced a strong INF- γ secreting T cell response against the FluA nucleoprotein (fig. 11). Importantly, there was little induction of IL-4 secreting T cells against FluA nucleoprotein. These results indicate that the srRNA1ts2-G5010 vaccine induces a Th1 (INF-gamma) dominant response (Th 1> Th2 balance), which is an advantageous feature of vaccines against viral diseases.
In summary, when fusion proteins comprising nucleoproteins from representative influenza a and influenza b strains are expressed by intradermally injected temperature-controllable self-replicating RNAs, the fusion proteins induce a strong antigen-specific cellular immune response.
EXAMPLE 7 cellular immunity induced by srRNA1ts 2-ubiquitin (ubiquitin vaccine)
This example describes the following findings: when fusion proteins comprising nucleoprotein fragments from four ebola virus species (zaire-type ebola virus, sudan-type ebola virus, bundi Jiao Xingai ebola virus, tayi forest-type ebola virus) are used as vaccine antigens, the fusion proteins can induce strong cellular immunity against ebola virus. This example uses temperature-controlled self-replicating RNA as an expression vector.
Materials and methods
BALB/c female mice.
In vitro transcription of temperature controlled self-replicating RNA vectors (srRNA 1ts2[ PCT/US20/67506 ]) encoding the pan-type Ebola antigens produced srRNA1ts 2-pan-type Ebola mRNA (FIG. 12).
From the nuclear protein of Ebola virus by Ebola virus-Talin type (Swiss-Prot ID: B8XCN 6) [ JPT peptide; pepnix takie forest type ebola virus (NP); JPT product code: PM-TEBOV-NP ] was scanned for a pool of 182 peptides (15 mer with 11 amino acid overlap).
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
Ebola virus causes a highly fatal hemorrhagic fever. Four ebola virus species are known to cause disease in humans: ebola virus (zaire ebola virus), sudan virus (sudan ebola virus), bund Jiao Bingdu (bund Jiao Xingai ebola virus), and taeda forest virus (taeda forest type ebola virus, formerly known as coltadalaw ebola virus).
Currently, only one licensed vaccine (rVSV-ZEBOV) is available for ebola virus. This vaccine is an attenuated recombinant Vesicular Stomatitis Virus (VSV) that expresses a major Glycoprotein (GP) from zaire ebola virus. Although vaccines can induce neutralizing antibodies against ebola virus, the protein sequence of GP is highly different among four ebola virus species that infect humans. Thus, the rVSV-ZEBOV vaccine is effective against only zaire-type ebola virus. It is desirable to have a universal ebola virus vaccine that can provide protection against all four ebola virus species.
The Nucleoprotein (NP) sequence is more conserved among the four ebola virus species than GP. However, unlike GP, NP is not a surface protein, and thus antibodies raised against NP are not neutralizing antibodies. Importantly, it has been shown that mice vaccinated with zaire-type ebola virus NP can be protected from zaire-type ebola virus challenge, which is mediated by cellular rather than humoral immunity (Wilson and Hart, J Virol,75:2660-2664,2001). Protection was also shown to be mediated by MHC class I restricted cd8+ killer T cells (cytotoxic T lymphocytes) rather than MHC class II restricted cd4+ helper T cells (Wilson and Hart, supra, 2001).
The use of the fusion proteins of NPs of all four ebola virus species as vaccine antigens provides protection against all four ebola virus species. However, each NP is about 740 amino acids in length. Thus, fusing four intact NPs together will result in a relatively large protein of about 3,000 amino acids. Smaller sized antigens are desirable for many vaccine platforms.
The amino acid sequences of the nucleoprotein of the four ebola viruses were compared using NCBI BlastP (zaire ebola virus NP (GenBank ID: AF 272001), sudan ebola virus NP (GenBank ID: AF 173836), bunsen Jiao Xingai ebola virus NP (GenBank ID: FJ 217161), and tay forest ebola virus NP (GenBank ID: FJ 217162)). The sequences of the N-terminal half of NPs (designated region A) were found to be similar to each other (88% -92% identical), while the sequences of the C-terminal half of NPs (designated region B) were found to be diverse (42% -54%) (Table 7-1). Thus, zaire (a) was chosen as representative of zaire type (a), sudan type (a), bunsen type (a) and tazilin type (a). For region B, the present dibuca type (B) and the tazicar type (B) were found to be similar to each other (80% and 86% identity) except for the middle portion (40% identity) (referred to as region C). Thus, zaire (B), sudan (B), bendiburley (B) and taisen (C) were selected for inclusion in the pan-type ebola vaccine. Before the four nucleoproteins are assembled into a single fusion protein, an additional 8 amino acid sequences are added to both sides so that the possible T cell epitopes at the ends of the nucleoprotein fragment are not destroyed. A schematic representation of fusion proteins of the pan ebola antigen is shown in fig. 12 and comprises NP fragments of zaire type (a), zaire type (B), sudan type (B), bendio type (B) and taisen type (C), and a human CD5 signal peptide. A graph showing the percent identity of ebola virus NP sequences is shown in fig. 13. The amino acid sequence of the ubiquitin Ebola antigen is shown as SEQ ID NO. 22, and the nucleic acid sequence of the coding ubiquitin Ebola antigen is shown as SEQ ID NO. 23.
TABLE 7-1 percent identity between Ebola virus NP domain sequences
Virus (virus) | Zaire type (A) | Zaire type (B) |
Sudan type pill | 88% | 42% |
Bendi cloth coke type | 92% | 53% |
Forest type tower | 92% |
The srRNA1ts 2-ubiquitin-type ebola vaccine was generated by cloning the ubiquitin fusion protein downstream of the subgenomic promoter of srRNA1ts 2. mRNA was produced by in vitro transcription and used for intradermal vaccination of BALB/c mice in vivo. Antigen-specific cellular immune responses were measured by ELISpot assay. To recall the nucleoprotein-reactive T cell immunity, splenocytes harvested from mice 14 days post vaccination were restimulated using a pool of 182 peptides scanned from the peptide of nucleoprotein (ebola virus-talin type ebola virus nucleoprotein (Swiss-Prot ID: B8XCN 6)). The srRNA1ts 2-ubiquitin ebola vaccine induced a strong INF-gamma secreting T cell response against Taoism nucleoprotein (FIG. 14A). Surprisingly, only a small fraction (169 aa) of the taverine-type nucleoprotein was contained in the mRNA vaccine, while the peptide pool for restimulation covered the entire taverine-type nucleoprotein sequence. Importantly, there was little induction of IL-4 secreting T cells against the taisen-type nucleoprotein (FIG. 14B). These results indicate that srRNA1ts 2-pan-ebola vaccine induces a Th1 (INF-gamma) dominant response (Th 1> Th2 balance), which is an advantageous feature of vaccines against viral diseases.
In summary, when fusion proteins comprising nucleoprotein fragments from four ebola virus species are expressed from intradermally injected temperature-controllable self-replicating RNAs, the fusion proteins induce a strong antigen-specific cellular immune response. The examples show that the size of fusion proteins used as a ubiquitin ebola vaccine can be reduced by removing more well-conserved portions of one or more of the nucleoproteins that make up the vaccine. The pan-ebola antigen is also suitable for use in other vaccine platforms (e.g., adenovirus, adeno-associated virus, recombinant proteins, etc.).
Example 8 cellular immunity induced by srRNA1ts2-G5003o (Omikovia vaccine)
This example describes the following findings: intradermal delivery of c-srRNA encoding RBD of SARS-CoV-2 (amikates strain B.1.1.529) can induce strong cellular immunity in mice.
Materials and methods
C57BL/6 female mice.
srRNA1ts2-G5003o (mRNA) was produced by in vitro transcription of a temperature-controllable self-replicating RNA vector (srRNA 1ts2[ WO 2021/138447A1 ]) encoding the G5003o antigen (FIG. 15).
From RBD (S-RBD B.1.1.529) by the SARS-CoV-2 omnikon variety [ JPT Peptides: PM-SARS2-RBDMUT08-1] peptide was scanned to obtain a pool of peptides (15 mer with 11 amino acid overlap).
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
In this example, c-srRNA encoding the RBD of SARS-CoV-2 Oryctolagkis Rong Bianchong (G5003 o) was generated (FIG. 15). RNA was administered intradermally to C57BL/6 mice and spleen cells were collected after 14 days to examine cellular immunity against SARS-CoV-2RBD (Omik Rong Bianchong). Induction of INF- γ secreting T cells was clearly observed in the c-srRNA-G5003o receptor (fig. 16A), whereas induction of IL-4 secreting T cells was not observed in the c-srRNA-G5003o receptor (fig. 16B).
Conclusion(s)
By using the obreck Rong Bianchong specific RBD as an antigen, we have demonstrated that an obreck Rong Bianchong specific cellular immune response can be induced when proteins are expressed from intradermally injected temperature controlled self-replicating RNAs. A favorable Th1 (INF-. Gamma.) Th2 (IL-4) response was also observed.
EXAMPLE 9.c efficacy of srRNA priming protein boosting protocol
This example describes the following findings: administration of a c-srRNA vaccine encoding a protein antigen of a primary virus is capable of eliciting a humoral immune response against the protein antigen of a variant virus.
Materials and methods
BALB/c female mice.
srRNA1ts2-G5003 (mRNA) was produced by in vitro transcription of a temperature-controllable self-replicating RNA vector (srRNA 1ts2[ WO 2021/138447A1 ]) encoding the G5003 antigen (FIG. 15).
Recombinant SARS-CoV-2B.1.617.2 spike GCN4-IZ protein (R & D Systems, catalog number 10878-CV).
Based on AddaVax TM An oil-in-water adjuvant of squalene was obtained from InvivoGen.
From RBD [ JPT Peptides ] by SARS-CoV-2 (original strain): pepmix SARS-CoV-2 (S-RBD) PM-WCPV-S-RBD-2] peptide pool of peptides (15 mer with 11 amino acid overlap)
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Vero76 cells for plaque reduction neutralization assay (PRNT).
SARS-CoV-2 delta variant live virus for PRNT assay
For the PRNT assay, vero76 cells were first treated with serial dilutions of mouse serum and then infected with the live virus of SARS-CoV-2 (delta variant strain). In this assay, infected cells die and form plaques after fixation and staining with crystal violet. If the serum contains neutralizing antibodies, the viral infection is inhibited, resulting in a reduced number of plaques. The results are shown as serum dilution titer (PRNT) showing a 50% reduction in plaque number 50 )。
Results
A composition comprising c-srRNA encoding the G5003 antigen (RBD of SARS-CoV-2 original strain) was administered intradermally as naked mRNA into the skin of BALB/c mice (FIG. 17A). That is, the srRNA1ts2-G5003 composition does not contain any nanoparticles or transfection reagents. Subsequently, a composition comprising spike protein of SARS-CoV-2 (delta variant B.1.617.2) in admixture with an adjuvant was administered intradermally (FIG. 17A).
Cellular immunity against the SARS-CoV-2RBD protein was detected in mouse spleen cells 14 days after a single intradermal injection of the c-srRNA-G5003 composition (FIG. 17B). As early as day 7 after protein antigen exposure, subsequent exposure of immunized mice to spike protein of a different strain of SARS-CoV-2 (delta variant b.1.617.2) induced neutralizing antibodies to delta variant of SARS-CoV-2 (detected by PRNT assay) (fig. 17C). In contrast, mice that did not receive the c-srRNA-G5003 encoding the RBD of SARS-CoV-2 (original strain) did not mount a neutralizing antibody response against the delta variant of SARS-CoV-2. Early induction of neutralizing antibodies is a characteristic of secondary immune responses, suggesting that c-srRNA elicits humoral immune responses prior to exposure to adjuvanted RBD proteins.
Conclusion(s)
The results indicate that the c-srRNA immunogen can induce an effective immune response that is broadly reactive to both the antigen encoded by the c-srRNA and to different variant antigens. Thus, the c-srRNA SARS-CoV-2RBD immunogen is suitable for use in immunization protocols against a broad spectrum of SARS-CoV-2 strains.
EXAMPLE 10 efficacy of the protein priming c-srRNA boost regimen
This example describes the following findings: the c-srRNA vaccine can enhance antibody titers when used as a booster vaccine for other vaccines.
Materials and methods
C57BL/6 female mice.
RBD protein (Sino Biological SARS-CoV-2[2019-nCoV ] spike RBD-His recombinant protein, catalog number 40592-V08B)
Based on AddaVax TM An oil-in-water adjuvant of squalene was obtained from InvivoGen.
srRNA1ts2-G5003 (mRNA) was produced by in vitro transcription of a temperature-controllable self-replicating RNA vector (srRNA 1ts2[ WO 2021/138447A1 ]) encoding the G5003 antigen (FIG. 15).
srRNA1ts2-G5003o (mRNA) was produced by in vitro transcription of a temperature-controllable self-replicating RNA vector (srRNA 1ts2[ WO 2021/138447A1 ]) encoding the G5003o antigen (FIG. 15).
From RBD [ JPT Peptides ] by SARS-CoV-2 (original strain): pepmix SARS-CoV-2 (S-RBD) PM-WCPV-S-RBD-2] peptide pool of peptides (15 mer with 11 amino acid overlap)
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
ELISA assay plates (ENZO SARS-CoV-2IgG ELISA KIT [ catalog number ENZ-KIT 170-0001; plates were coated with SARS-CoV-2 (original strain) S1 antigen RBD protein ]).
Results
To test the possibility that the c-srRNA vaccine can be used as a booster vaccine, mice were first vaccinated with adjuvanted protein (in this case, RBD of SARS-CoV-2[ original strain "). Fourteen days later (day 14), mice were further treated with intradermal injections of placebo (PBO: buffer only), c-srRNA encoding the G5003 antigen, c-srRNA encoding the G5003o antigen, or adjuvanted RBD protein (FIG. 18A).
On day 28, cellular immunity was assessed by ELISpot assay. As expected, RBD (1 st) +pbo (2 nd) groups were unable to induce cellular immunity, whereas RBD (1 st) +rbd (2 nd) groups induced cellular immunity (fig. 18B, fig. 18C). Interestingly, RBD (1st) +c-srRNA-G5003 and C-srRNA-G5003o groups also induced cellular immunity (FIG. 18B, FIG. 18C). This is expected because the c-srRNA vaccine alone can induce cellular immunity.
On day 28, serum antibody levels against RBD of SARS-CoV-2 virus (original strain) were assessed by ELISA assay (fig. 19). The first vaccination with adjuvanted RBD protein alone may weakly induce antibodies. In another aspect, the c-srRNA vaccine is capable of inducing antibodies at a level similar to that of the second vaccination adjuvanted protein.
Conclusion(s)
The results indicate that the c-srRNA vaccine can function as a booster vaccine for both cellular and humoral immunity.
Example 11 potent cellular immune response induced by srRNA1ts2-G5006
This example describes the following findings: when a fusion protein comprising SARS-CoV-2 nucleoprotein and MERS-CoV nucleoprotein is expressed by intradermally injected temperature-controllable self-replicating RNA, the fusion protein can induce strong cellular immunity against SARS-CoV-2 and MERS-CoV. Vaccinated mice can eliminate implanted tumor cells that express fusion proteins comprising SARS-CoV-2 nucleoprotein and MERS-CoV nucleoprotein.
Materials and methods
BALB/c female mice.
srRNA1ts2-G5006 mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding the G5006 antigen (srRNA 1ts2 as described in WO 2021/138447 A1) (FIG. 2).
From nucleoprotein (UniProt: P0DTC 9) [ JPT peptide product code) by SARS-CoV-2: PM-WCPV-NCAP ] peptides the resulting pool of peptides (15 mer with 11 amino acid overlap) was scanned.
Pools of peptides (15 MERS with 11 amino acid overlap) were scanned from peptides of nucleoprotein by MERS-CoV (YP 009047211.1). Peptides were tailored by JPT Peptides.
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
A4T 1 breast cancer cell line (catalog number CRL-2539) derived from BALB/c mice and known as a model for triple negative stage IV human breast cancer was purchased from ATCC.
Plasmid DNA encoding the fusion protein of the nucleoprotein of SARS-CoV-2 and MERS-CoV under the CMV promoter (non-secreted form of G5006, i.e. without CD5 signal peptide) and the hygromycin resistance gene under the SV40 early promoter was transfected into 4T1 cells. Cells expressing the fusion protein of SARS-CoV-2 and MERS-CoV (referred to as 4T 1-SMN) were isolated by culturing the cells in the presence of 200 μg/mL hygromycin B.
Results
To model virus-infected cells, we used a 4T1 breast cancer cell line derived from BALB/c mice and known as a model for triple negative stage IV human breast cancer. When injected into BALB/c mice, 4T1 cells grew rapidly and formed tumors. This syngeneic mouse model was used to model the rapid increase in infected cells. To this end, we first prepared a plasmid vector encoding a fusion protein of the nucleoprotein of SARS-CoV-2 and MERS-CoV under the CMV promoter (designated SMN protein), thus allowing constitutive expression of the protein. This fusion protein is identical to G5006, but the CD5 signal peptide is removed from the N-terminus of the protein. Naturally, nucleoproteins do not have a signal peptide and reside within the cytoplasm of the cell. 4T1 cells expressing SMN protein (designated 4T 1-SMN) were established after hygromycin selection, as the plasmid vector also carries the hygromycin resistance gene.
BALB/c mice were vaccinated with c-srRNA-G5006 and induction of cellular immunity was demonstrated by the presence of T cells that responded to both SARS-CoV-2 nucleoprotein (FIG. 20A) and MERS-CoV nucleoprotein (FIG. 20B).
On day 24 (24 days after vaccination), 4T1-SMN cells were injected into BALB/c mice vaccinated with c-srRNA-G5006 (FIG. 21). As expected, 4T1-SMN cells grew rapidly in mice receiving placebo (no vaccinated group) 4T1-SMN cells. In another aspect, the growth of the 4T1-SMN tumor is inhibited in c-srRNA-G5006 vaccinated mice. Two mice receiving the 25 μ G c-srRNA-G5006 vaccine became tumor-free and survived, although the tumor initially grew. Furthermore, even after the second round of injection of 4T1-SMN tumor at day 143 post-vaccination, no tumor grew and mice were tumor-free and continued to survive (fig. 21).
Conclusion(s)
The c-srRNA vaccine can induce stronger cellular immunity, and the cellular immunity can kill and eliminate cells expressing the antigen. This result suggests that c-srRNA acts as a vaccine by eliminating infected cells.
EXAMPLE 12 cellular immunity induced by srRNA1ts 2-ubiquitin coronavirus vaccine
This example describes the following findings: when a fusion protein comprising the CD5 signal peptide, the spike-RBD of SARS-CoV-2, the nucleoprotein of MERS-CoV and the spike-RBD of MERS-CoV is expressed by intradermally injected temperature-controllable self-replicating RNA, the fusion protein can induce strong cellular immunity against all of these antigens.
Materials and methods
C57BL/6 female mice.
srRNA1ts2-G5006 mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding the G5006 antigen (srRNA 1ts2 as described in WO 2021/138447 A1) (FIG. 2).
srRNA1ts2-G5006d mRNA was produced by in vitro transcription of a temperature controlled self-replicating RNA vector encoding G5006d antigen (srRNA 1ts2 as described in WO 2021/138447 A1) (FIG. 22).
From RBD [ JPT Peptides ] by SARS-CoV-2 (original strain): pepmix SARS-CoV-2 (S-RBD) PM-WCPV-S-RBD-2] peptide pool of peptides (15 mer with 11 amino acid overlap)
From nucleoprotein (UniProt: P0DTC 9) [ JPT peptide product code) by SARS-CoV-2: PM-WCPV-NCAP ] peptides the resulting pool of peptides (15 mer with 11 amino acid overlap) was scanned.
Pools of peptides (15 MERS with 11 amino acid overlap) were scanned from peptides of nucleoprotein by MERS-CoV (YP 009047211.1). Peptides were tailored by JPT Peptides.
From the spike glycoprotein (Swiss-Prot ID: K9N5Q 8) of MERS-CoV (middle east respiratory syndrome associated coronavirus) [ JPT peptide product code: PM-MERS-CoV-S-1] peptide scan resulted in a pool of 336 (168+168) peptides (15 mer with 11 amino acid overlap).
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
Here we designed a new booster vaccine, a c-srRNA vaccine (called c-srRNA-G5006 d) encoding a fusion protein comprising the CD5 signal peptide, the spike-RBD of SARS-CoV-2, the nucleoprotein of MERS-CoV, and the spike-RBD of MERS-CoV (FIG. 22).
Mice were vaccinated intradermally with placebo (PBO: buffer only), c-srRNA encoding the G5006 antigen, and c-srRNA encoding the G5006d antigen. On day 14 post-vaccination, cellular immunity was assessed by ELISpot assay.
As shown in fig. 23A, c-srRNA-G5006d can stimulate cellular immunity against all of the following proteins encoded on this vaccine: spike-RBD of SARS-CoV-2, nucleoprotein of MERS-CoV and spike-RBD of MERS-CoV.
Conclusion(s)
The results indicate that the c-srRNA vaccine can function as a booster vaccine for both cellular and humoral immunity.
EXAMPLE 13 srRNA1 ts2-ubiquitin-type influenza Virus vaccine
This example describes the design of a pan-type influenza booster vaccine based on the unique features of the c-srRNA vaccine platform. The antigen encoded on c-srRNA (G5012) is a fusion protein of the CD5 signal peptide (residues 1-24), a portion of the hemagglutinin of influenza A (HA), the nucleoprotein of influenza A, the nucleoprotein of influenza B, and a portion of the hemagglutinin of influenza B (HA).
Materials and methods
C57BL/6 female mice.
The c-srRNA-G5012 mRNA was produced by in vitro transcription of a temperature controllable self-replicating RNA vector encoding the G5012 antigen (srRNA 1ts2 as described in WO 2021/138447 A1) (FIG. 24).
A pool of peptides (15 mers with 11 amino acid overlaps) obtained from peptide scans through a portion of Hemagglutinin (HA) of influenza a, nucleoprotein of influenza b, and a portion of Hemagglutinin (HA) of influenza b.
ELISPot assay plates and reagents for interferon gamma (INF-gamma) and interleukin-4 (IL-4) (Cellular Technology Limited, ohio).
Immunospot S6 Entry Analyzer (Cellular Technology Limited, ohio, U.S.A.).
Results
Mice were vaccinated with intradermal injections of placebo (PBO: buffer only) and c-srRNA encoding the G5012 antigen. On day 14 post-vaccination, cellular immunity was assessed by ELISpot assay.
c-srRNA-G5012 stimulated cellular immunity against all of the following antigens encoded on this vaccine: hemagglutinin of influenza a (HA), nucleoprotein of influenza a, nucleoprotein of influenza b, and hemagglutinin of influenza b (HA).
Conclusion(s)
The results indicate that the c-srRNA vaccine can function as a booster vaccine for both cellular and humoral immunity.
EXAMPLE 14 Chitosan enhances luciferase expression by srRNA1ts2-LUC2
This example describes the following findings: the chitosan oligomer is capable of enhancing in vivo expression of a gene of interest (GOI) encoded by the c-srRNA construct.
Materials and methods
C57BL/6 female mice.
srRNA1ts2-LUC2 (mRNA) is produced by in vitro transcription of a temperature-controllable self-replicating RNA vector encoding a luciferase gene(s) (e.g., srRNA1ts2 as described in WO 2021/138447 A1).
Chitosan oligomer (molecular weight: 5kDa, deacetylation: heppe Medical Chitosan GmbH: product number 44009).
Chitosan oligosaccharide lactic acid (molecular weight about 5kDa, >90% deacetylated: sigma-Aldrich: product number 523682).
Bioluminescence imaging systems, AMI HTX (Spectral Instruments Imaging, atlasen, arizona).
Results
To test whether chitosan oligomer could enhance expression of the GOI encoded on C-srRNA in vivo, 5. Mu.g of C-srRNA (also referred to as srRNA1ts 2) encoding a luciferase gene as GOI was mixed with chitosan and intradermally administered to each C57BL/6 mouse (FIG. 25). In lactate ringer's solution, c-srRNA is formulated as naked RNA in the absence of lipid nanoparticles or any other transfection reagent. Luciferase activity was visualized and quantified by using the bioluminescence imaging system AMI HTX (Spectral Instruments Imaging, atlasen, arizona).
Five mice each were tested in the following groups: 1, control-c-srRNA only; 2, c-srRNA mixed with chitosan oligosaccharide (0.001. Mu.g/mL); 3, c-srRNA mixed with chitosan oligosaccharide (0.01. Mu.g/mL); 4, c-srRNA mixed with chitosan oligosaccharide (0.5. Mu.g/mL); 5, c-srRNA mixed with chitosan oligosaccharide lactic acid (0.1. Mu.g/mL).
As shown in FIG. 27, all conditions with chitosan oligomers at concentrations of 0.001. Mu.g/mL, 0.01. Mu.g/mL and 0.5. Mu.g/mL and conditions with chitosan oligosaccharide lactic acid at a concentration of 0.1. Mu.g/mL showed about 10-fold higher levels of luciferase activity compared to the control condition (i.e., c-srRNA: no chitosan).
Conclusion(s)
When mixed with c-srRNA prior to intradermal injection into mouse skin, low molecular weight chitosans (e.g., chitosan oligomers and chitosan oligosaccharide lactic acid) can enhance expression of GOI encoded on the c-srRNA. Chitosan oligomers provide about 10-fold enhancement of gene expression even at very low concentrations (0.001. Mu.g/mL or about 0.2 nM). This surprising finding provides an effective means of enhancing the in vivo therapeutic expression of GOI encoded on c-srRNA.
Sequence(s)
SEQ ID NO:1
>G5004(SARS-COV-2_N,mRNA)
SEQ ID NO:2
G5005 (CD 5-SP_SARS-COV-2_N, mRNA Synthesis)
SEQ ID NO:3
G5006A (CD 5-SP_SARS2-N_MERS-N, mRNA synthesized)
SEQ ID NO:4
G5006B (CD 5-SP_SARS2-N_MERS-N: CODON-OPTIMIZED, synthesis of mRNA)
SEQ ID NO:5
G5004 (SARS-COV-2_N, protein)
SEQ ID NO:6
G5005 (CD 5-SP_SARS-CoV-2_N, synthetic protein)
SEQ ID NO:7
G5006a (CD 5-SP_SARS2-N_MERS-N, synthetic protein)
SEQ ID NO:8
Human CD5 Signal peptide (Homo sapiens, protein)
MPMGSLQPLATLYLLGMLVASCLG
SEQ ID NO:9
MERS-N protein
SEQ ID NO:10
SARS-COV-1-N protein
SEQ ID NO:11
Cloning site (synthetic DNA)
GGCGCGCCattggccaccGCGGCCGC
The gene of interest (e.g., the beta-coronavirus nucleoprotein open reading frame) is inserted between nucleotides 18 and 19 of SEQ ID NO: 11. GGCGCGCC is an AscI restriction site and GCGGCCGC is a NotI restriction site.
SEQ ID NO:12
Artificial work
Attggccacc (synthetic DNA)
This nucleotide sequence was added to the 5' end of the β coronavirus nucleoprotein encoding region to provide a Kozak consensus sequence for initiating mRNA translation.
SEQ ID NO:13
Influenza A virus (H5N 8) nucleocapsid protein
>
SEQ ID NO:14
Influenza b virus nucleocapsid protein
SEQ ID NO:15
G5010 (synthetic mRNA) [ codon optimized sequence for pan-influenza antigen ]
SEQ ID NO:16
G5010 (synthetic protein) [ pandemic influenza antigen ]
SEQ ID NO:17
Nucleoprotein of influenza a virus (H2N 2) (Swiss-Protein ID P21433) [ JPT peptide product code: PM-INFA-NPH2N2]
ASQGTKRSYEQMETDGERQNANEIRASVGKMIGGIGRFYIQMCTELKLSDYEGRLIQNSLTIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYKRVDGKWMRELVLYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDTTYQRTRALVRTGMDPRMCSLMQGSTLPRRSGAAGAAVKGVGTMVMELIRMIKRGINDRNFWRGENGRKTRNAYERMCNILKGKFQTAAQRAMMDQVRESRNPGNAEIEDLIFLARSALILRGSVAHKSCLPACVYGPAVASGYDFEKEGYSLVGIDPFKLLQNSQVYSLIRPNENPAHKSQLVWMACNSAAFEDLRVSSFIRGTKVIPRGKLSTRGVQIASNENMDTMGSSTLELRSRYWAIRTRSGGNTNQQRASAGQISVQPTFSVQRNLPFDKPTIMAAFTGNAEGRTSDMRAEIIRMMEGAKPEEVSFQGRGVFELSDEKATNPIVPSFDMSNEGSYFFGDNAEEYDN
SEQ ID NO:18
Nucleoprotein fragments of zaire ebola virus
SEQ ID NO:19
Nuclear protein fragment of Sudan type ebola virus
SEQ ID NO:20
Nuclear protein fragment of Bendi cloth Jiao Xingai Bola virus
SEQ ID NO:21
Nuclear protein fragment of Tayi forest type ebola virus
SEQ ID NO:22
General Ebola antigen (synthetic protein)
SEQ ID NO:23
General Ebola antigen (synthetic mRNA)
SEQ ID NO:24
G5003o (synthetic mRNA) [ CD5sp+RBD2 (Omikovin) ]
SEQ ID NO:25
G5003o (synthetic protein) [ CD5sp+RBD2 (Omikovin) ]
SEQ ID NO:26
G5006d (Synthesis of mRNA)
SEQ ID NO:27
G5006d (synthetic protein)
SEQ ID NO:28
G50012 (Synthesis of mRNA)
SEQ ID NO:29
G50012 (synthetic protein)
Sequence listing
<110> illix root treatment Co
<120> temperature-controllable self-replicating RNA for viral diseases
Vaccine
<130> 69944-20014.40
<140> not yet allocated
<141> along with the submission
<150> US 63/275,398
<151> 2021-11-03
<150> US 63/240,278
<151> 2021-09-02
<150> US 63/211,974
<151> 2021-06-17
<160> 29
<170> FastSEQ version 4.0 for Windows
<210> 1
<211> 1260
<212> DNA
<213> SARS-COV-2
<400> 1
atgtctgata atggacccca aaatcagcga aatgcacccc gcattacgtt tggtggaccc 60
tcagattcaa ctggcagtaa ccagaatgga gaacgcagtg gggcgcgatc aaaacaacgt 120
cggccccaag gtttacccaa taatactgcg tcttggttca ccgctctcac tcaacatggc 180
aaggaagacc ttaaattccc tcgaggacaa ggcgttccaa ttaacaccaa tagcagtcca 240
gatgaccaaa ttggctacta ccgaagagct accagacgaa ttcgtggtgg tgacggtaaa 300
atgaaagatc tcagtccaag atggtatttc tactacctag gaactgggcc agaagctgga 360
cttccctatg gtgctaacaa agacggcatc atatgggttg caactgaggg agccttgaat 420
acaccaaaag atcacattgg cacccgcaat cctgctaaca atgctgcaat cgtgctacaa 480
cttcctcaag gaacaacatt gccaaaaggc ttctacgcag aagggagcag aggcggcagt 540
caagcctctt ctcgttcctc atcacgtagt cgcaacagtt caagaaattc aactccaggc 600
agcagtaggg gaacttctcc tgctagaatg gctggcaatg gcggtgatgc tgctcttgct 660
ttgctgctgc ttgacagatt gaaccagctt gagagcaaaa tgtctggtaa aggccaacaa 720
caacaaggcc aaactgtcac taagaaatct gctgctgagg cttctaagaa gcctcggcaa 780
aaacgtactg ccactaaagc atacaatgta acacaagctt tcggcagacg tggtccagaa 840
caaacccaag gaaattttgg ggaccaggaa ctaatcagac aaggaactga ttacaaacat 900
tggccgcaaa ttgcacaatt tgcccccagc gcttcagcgt tcttcggaat gtcgcgcatt 960
ggcatggaag tcacaccttc gggaacgtgg ttgacctaca caggtgccat caaattggat 1020
gacaaagatc caaatttcaa agatcaagtc attttgctga ataagcatat tgacgcatac 1080
aaaacattcc caccaacaga gcctaaaaag gacaaaaaga agaaggctga tgaaactcaa 1140
gccttaccgc agagacagaa gaaacagcaa actgtgactc ttcttcctgc tgcagatttg 1200
gatgatttct ccaaacaatt gcaacaatcc atgagcagtg ctgactcaac tcaggcctaa 1260
<210> 2
<211> 1329
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 2
atgcccatgg ggtctctgca accgctggcc accttgtacc tgctggggat gctggtcgct 60
tcctgcctcg gatctgataa tggaccccaa aatcagcgaa atgcaccccg cattacgttt 120
ggtggaccct cagattcaac tggcagtaac cagaatggag aacgcagtgg ggcgcgatca 180
aaacaacgtc ggccccaagg tttacccaat aatactgcgt cttggttcac cgctctcact 240
caacatggca aggaagacct taaattccct cgaggacaag gcgttccaat taacaccaat 300
agcagtccag atgaccaaat tggctactac cgaagagcta ccagacgaat tcgtggtggt 360
gacggtaaaa tgaaagatct cagtccaaga tggtatttct actacctagg aactgggcca 420
gaagctggac ttccctatgg tgctaacaaa gacggcatca tatgggttgc aactgaggga 480
gccttgaata caccaaaaga tcacattggc acccgcaatc ctgctaacaa tgctgcaatc 540
gtgctacaac ttcctcaagg aacaacattg ccaaaaggct tctacgcaga agggagcaga 600
ggcggcagtc aagcctcttc tcgttcctca tcacgtagtc gcaacagttc aagaaattca 660
actccaggca gcagtagggg aacttctcct gctagaatgg ctggcaatgg cggtgatgct 720
gctcttgctt tgctgctgct tgacagattg aaccagcttg agagcaaaat gtctggtaaa 780
ggccaacaac aacaaggcca aactgtcact aagaaatctg ctgctgaggc ttctaagaag 840
cctcggcaaa aacgtactgc cactaaagca tacaatgtaa cacaagcttt cggcagacgt 900
ggtccagaac aaacccaagg aaattttggg gaccaggaac taatcagaca aggaactgat 960
tacaaacatt ggccgcaaat tgcacaattt gcccccagcg cttcagcgtt cttcggaatg 1020
tcgcgcattg gcatggaagt cacaccttcg ggaacgtggt tgacctacac aggtgccatc 1080
aaattggatg acaaagatcc aaatttcaaa gatcaagtca ttttgctgaa taagcatatt 1140
gacgcataca aaacattccc accaacagag cctaaaaagg acaaaaagaa gaaggctgat 1200
gaaactcaag ccttaccgca gagacagaag aaacagcaaa ctgtgactct tcttcctgct 1260
gcagatttgg atgatttctc caaacaattg caacaatcca tgagcagtgc tgactcaact 1320
caggcctaa 1329
<210> 3
<211> 2568
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 3
atgcccatgg ggtctctgca accgctggcc accttgtacc tgctggggat gctggtcgct 60
tcctgcctcg gatctgataa tggaccccaa aatcagcgaa atgcaccccg cattacgttt 120
ggtggaccct cagattcaac tggcagtaac cagaatggag aacgcagtgg ggcgcgatca 180
aaacaacgtc ggccccaagg tttacccaat aatactgcgt cttggttcac cgctctcact 240
caacatggca aggaagacct taaattccct cgaggacaag gcgttccaat taacaccaat 300
agcagtccag atgaccaaat tggctactac cgaagagcta ccagacgaat tcgtggtggt 360
gacggtaaaa tgaaagatct cagtccaaga tggtatttct actacctagg aactgggcca 420
gaagctggac ttccctatgg tgctaacaaa gacggcatca tatgggttgc aactgaggga 480
gccttgaata caccaaaaga tcacattggc acccgcaatc ctgctaacaa tgctgcaatc 540
gtgctacaac ttcctcaagg aacaacattg ccaaaaggct tctacgcaga agggagcaga 600
ggcggcagtc aagcctcttc tcgttcctca tcacgtagtc gcaacagttc aagaaattca 660
actccaggca gcagtagggg aacttctcct gctagaatgg ctggcaatgg cggtgatgct 720
gctcttgctt tgctgctgct tgacagattg aaccagcttg agagcaaaat gtctggtaaa 780
ggccaacaac aacaaggcca aactgtcact aagaaatctg ctgctgaggc ttctaagaag 840
cctcggcaaa aacgtactgc cactaaagca tacaatgtaa cacaagcttt cggcagacgt 900
ggtccagaac aaacccaagg aaattttggg gaccaggaac taatcagaca aggaactgat 960
tacaaacatt ggccgcaaat tgcacaattt gcccccagcg cttcagcgtt cttcggaatg 1020
tcgcgcattg gcatggaagt cacaccttcg ggaacgtggt tgacctacac aggtgccatc 1080
aaattggatg acaaagatcc aaatttcaaa gatcaagtca ttttgctgaa taagcatatt 1140
gacgcataca aaacattccc accaacagag cctaaaaagg acaaaaagaa gaaggctgat 1200
gaaactcaag ccttaccgca gagacagaag aaacagcaaa ctgtgactct tcttcctgct 1260
gcagatttgg atgatttctc caaacaattg caacaatcca tgagcagtgc tgactcaact 1320
caggccatgg catcccctgc tgcacctcgt gctgtttcct ttgccgataa caatgatata 1380
acaaatacaa acctatctcg aggtagagga cgtaatccaa aaccacgagc tgcaccaaat 1440
aacactgtct cttggtacac tgggcttacc caacacggga aagtccctct tacctttcca 1500
cctgggcagg gtgtacctct taatgccaat tctacccctg cgcaaaatgc tgggtattgg 1560
cggagacagg acagaaaaat taataccggg aatggaatta agcaactggc tcccaggtgg 1620
tacttctact acactggaac tggacccgaa gcagcactcc cattccgggc tgttaaggat 1680
ggcatcgttt gggtccatga agatggcgcc actgatgctc cttcaacttt tgggacgcgg 1740
aaccctaaca atgattcagc tattgttaca caattcgcgc ccggtactaa gcttcctaaa 1800
aacttccaca ttgaggggac tggaggcaat agtcaatcat cttcaagagc ctctagctta 1860
agcagaaact cttccagatc tagttcacaa ggttcaagat caggaaactc tacccgcggc 1920
acttctccag gtccatctgg aatcggagca gtaggaggtg atctacttta ccttgatctt 1980
ctgaacagac tacaagccct tgagtctggc aaagtaaagc aatcgcagcc aaaagtaatc 2040
actaagaaag atgctgctgc tgctaaaaat aagatgcgcc acaagcgcac ttccaccaaa 2100
agtttcaaca tggtgcaagc ttttggtctt cgcggaccag gagacctcca gggaaacttt 2160
ggtgatcttc aattgaataa actcggcact gaggacccac gttggcccca aattgctgag 2220
cttgctccta cagccagtgc ttttatgggt atgtcgcaat ttaaacttac ccatcagaac 2280
aatgatgatc atggcaaccc tgtgtacttc cttcggtaca gtggagccat taaacttgac 2340
ccaaagaatc ccaactacaa taagtggttg gagcttcttg agcaaaatat tgatgcctac 2400
aaaaccttcc ctaagaagga aaagaaacaa aaggcaccaa aagaagaatc aacagaccaa 2460
atgtctgaac ctccaaagga gcagcgtgtg caaggtagca tcactcagcg cactcgcacc 2520
cgtccaagtg ttcagcctgg tccaatgatt gatgttaaca ctgattag 2568
<210> 4
<211> 2568
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 4
atgcctatgg gctctctgca gcccctggcc accctgtacc tgctgggcat gctggtggcc 60
agctgcctgg gaagcgacaa cggcccccag aaccagagaa acgcccctag aatcacattt 120
ggcggcccta gtgatagcac cggatctaat caaaacggcg agagaagcgg cgctcggtct 180
aaacagagac ggccacaggg actgcctaac aacaccgcca gctggttcac cgccctgacc 240
cagcacggca aggaggacct taagttccct cggggacagg gcgtgccaat caacaccaac 300
tctagtcccg acgaccagat cggctattat agaagagcca caagacgcat cagaggtggc 360
gacggcaaga tgaaggacct gagccctcgc tggtactttt actacctggg gaccggccct 420
gaagccggcc tgccttacgg cgccaacaag gacggaatca tctgggtcgc caccgagggc 480
gccctgaata cccctaagga ccacatcggc accagaaacc ctgctaataa tgccgctatc 540
gtgctgcagc tgcctcaggg caccaccctg cctaagggct tctacgccga gggctcccgg 600
ggaggttccc aggctagcag cagatcttcc agccggagca gaaacagctc caggaacagc 660
acacctggca gcagcagagg tacgagccct gcccggatgg ccggaaacgg cggcgatgcc 720
gccctggccc tgctgctgct ggacagactg aaccagctcg agagcaagat gtctggcaag 780
ggccagcagc agcagggcca gacagtgacc aagaaatccg ccgctgaggc cagcaaaaaa 840
cccagacaga aaagaaccgc tacaaaggcc tacaacgtta cccaggcctt tggcagacgg 900
ggcccagagc agacccaggg aaacttcggc gaccaggagc tgatccggca gggcacggac 960
tacaagcact ggcctcaaat cgcccagttt gccccttccg ccagcgcttt cttcggaatg 1020
agcagaatcg gcatggaagt gacacctagt ggcacctggc tgacctacac cggtgccatt 1080
aagctggatg acaaggaccc caacttcaag gatcaggtga tcctgctgaa caagcacatt 1140
gatgcttaca agaccttccc acctaccgag ccaaaaaaag ataagaagaa aaaagccgat 1200
gagacacaag ccctgcccca gaggcagaag aagcaacaaa ccgtcaccct gctgcctgct 1260
gccgacctgg acgacttcag caaacagctg cagcagagca tgagctctgc tgatagcacc 1320
caggccatgg cctctccagc cgctcccaga gctgtgtcct tcgccgataa taacgacatc 1380
acaaacacca acctgagccg gggcagaggc agaaacccta aacctagagc cgcccccaac 1440
aacaccgtga gctggtatac aggcctcacc cagcatggca aggtgcctct gacattcccc 1500
cctgggcagg gcgtgcccct gaacgccaac agcacccctg cccagaatgc cggctactgg 1560
cggaggcagg acagaaagat caacactggt aacggcatca agcagctggc cccacggtgg 1620
tatttctact acaccggcac cggcccggaa gccgccctcc ccttcagagc cgtgaaggac 1680
ggcatcgtgt gggtgcacga ggacggcgcc acagatgccc cgtctacatt tggcactcgg 1740
aatcccaata acgacagcgc catcgtgacc cagttcgccc ctggcaccaa gctgcctaag 1800
aactttcaca tcgagggcac aggaggcaac agccagagca gcagccgggc ttcgagcctg 1860
tctcggaata gctcccggtc cagctctcag ggcagccgca gtggaaattc cacccggggc 1920
acatctcctg gccccagcgg catcggcgct gtgggcggag acctgctcta cctggacctg 1980
ctgaacagac tgcaggcact tgaaagcggc aaagttaagc aatctcaacc taaggtgatc 2040
accaaaaagg acgccgccgc cgctaagaac aagatgagac acaagagaac aagcacaaag 2100
agcttcaaca tggtgcaagc cttcggcctg cggggacctg gcgacctgca gggcaacttc 2160
ggcgacctgc agctgaacaa gctgggcaca gaggatcctc gatggcccca gatcgccgaa 2220
ctagctccaa ccgccagcgc cttcatgggc atgagccagt tcaagctgac acaccagaac 2280
aatgacgatc acggaaatcc tgtgtacttc ctgagataca gcggcgccat caagctggat 2340
cctaagaacc ccaactacaa caagtggctg gaactgctgg aacagaacat cgacgcctac 2400
aagaccttcc ccaagaagga aaagaagcag aaggccccta aagaggaaag cacagatcag 2460
atgagcgagc ctcccaagga acagagagtg cagggatcta tcacccagag aacaagaaca 2520
agacccagcg tgcagcctgg ccctatgatt gacgtgaaca ccgactag 2568
<210> 5
<211> 419
<212> PRT
<213> SARS-COV-2
<400> 5
Met Ser Asp Asn Gly Pro Gln Asn Gln Arg Asn Ala Pro Arg Ile Thr
1 5 10 15
Phe Gly Gly Pro Ser Asp Ser Thr Gly Ser Asn Gln Asn Gly Glu Arg
20 25 30
Ser Gly Ala Arg Ser Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn Asn
35 40 45
Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Asp Leu
50 55 60
Lys Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Ser Pro
65 70 75 80
Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Ile Arg Gly
85 90 95
Gly Asp Gly Lys Met Lys Asp Leu Ser Pro Arg Trp Tyr Phe Tyr Tyr
100 105 110
Leu Gly Thr Gly Pro Glu Ala Gly Leu Pro Tyr Gly Ala Asn Lys Asp
115 120 125
Gly Ile Ile Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys Asp
130 135 140
His Ile Gly Thr Arg Asn Pro Ala Asn Asn Ala Ala Ile Val Leu Gln
145 150 155 160
Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly Ser
165 170 175
Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg Asn
180 185 190
Ser Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Thr Ser Pro Ala
195 200 205
Arg Met Ala Gly Asn Gly Gly Asp Ala Ala Leu Ala Leu Leu Leu Leu
210 215 220
Asp Arg Leu Asn Gln Leu Glu Ser Lys Met Ser Gly Lys Gly Gln Gln
225 230 235 240
Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser Lys
245 250 255
Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Ala Tyr Asn Val Thr Gln
260 265 270
Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly Asp
275 280 285
Gln Glu Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln Ile
290 295 300
Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg Ile
305 310 315 320
Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr Thr Gly Ala
325 330 335
Ile Lys Leu Asp Asp Lys Asp Pro Asn Phe Lys Asp Gln Val Ile Leu
340 345 350
Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu Pro
355 360 365
Lys Lys Asp Lys Lys Lys Lys Ala Asp Glu Thr Gln Ala Leu Pro Gln
370 375 380
Arg Gln Lys Lys Gln Gln Thr Val Thr Leu Leu Pro Ala Ala Asp Leu
385 390 395 400
Asp Asp Phe Ser Lys Gln Leu Gln Gln Ser Met Ser Ser Ala Asp Ser
405 410 415
Thr Gln Ala
<210> 6
<211> 442
<212> PRT
<213> artificial sequence
<220>
<223> synthetic construct
<400> 6
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly Ser Asp Asn Gly Pro Gln Asn Gln
20 25 30
Arg Asn Ala Pro Arg Ile Thr Phe Gly Gly Pro Ser Asp Ser Thr Gly
35 40 45
Ser Asn Gln Asn Gly Glu Arg Ser Gly Ala Arg Ser Lys Gln Arg Arg
50 55 60
Pro Gln Gly Leu Pro Asn Asn Thr Ala Ser Trp Phe Thr Ala Leu Thr
65 70 75 80
Gln His Gly Lys Glu Asp Leu Lys Phe Pro Arg Gly Gln Gly Val Pro
85 90 95
Ile Asn Thr Asn Ser Ser Pro Asp Asp Gln Ile Gly Tyr Tyr Arg Arg
100 105 110
Ala Thr Arg Arg Ile Arg Gly Gly Asp Gly Lys Met Lys Asp Leu Ser
115 120 125
Pro Arg Trp Tyr Phe Tyr Tyr Leu Gly Thr Gly Pro Glu Ala Gly Leu
130 135 140
Pro Tyr Gly Ala Asn Lys Asp Gly Ile Ile Trp Val Ala Thr Glu Gly
145 150 155 160
Ala Leu Asn Thr Pro Lys Asp His Ile Gly Thr Arg Asn Pro Ala Asn
165 170 175
Asn Ala Ala Ile Val Leu Gln Leu Pro Gln Gly Thr Thr Leu Pro Lys
180 185 190
Gly Phe Tyr Ala Glu Gly Ser Arg Gly Gly Ser Gln Ala Ser Ser Arg
195 200 205
Ser Ser Ser Arg Ser Arg Asn Ser Ser Arg Asn Ser Thr Pro Gly Ser
210 215 220
Ser Arg Gly Thr Ser Pro Ala Arg Met Ala Gly Asn Gly Gly Asp Ala
225 230 235 240
Ala Leu Ala Leu Leu Leu Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys
245 250 255
Met Ser Gly Lys Gly Gln Gln Gln Gln Gly Gln Thr Val Thr Lys Lys
260 265 270
Ser Ala Ala Glu Ala Ser Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr
275 280 285
Lys Ala Tyr Asn Val Thr Gln Ala Phe Gly Arg Arg Gly Pro Glu Gln
290 295 300
Thr Gln Gly Asn Phe Gly Asp Gln Glu Leu Ile Arg Gln Gly Thr Asp
305 310 315 320
Tyr Lys His Trp Pro Gln Ile Ala Gln Phe Ala Pro Ser Ala Ser Ala
325 330 335
Phe Phe Gly Met Ser Arg Ile Gly Met Glu Val Thr Pro Ser Gly Thr
340 345 350
Trp Leu Thr Tyr Thr Gly Ala Ile Lys Leu Asp Asp Lys Asp Pro Asn
355 360 365
Phe Lys Asp Gln Val Ile Leu Leu Asn Lys His Ile Asp Ala Tyr Lys
370 375 380
Thr Phe Pro Pro Thr Glu Pro Lys Lys Asp Lys Lys Lys Lys Ala Asp
385 390 395 400
Glu Thr Gln Ala Leu Pro Gln Arg Gln Lys Lys Gln Gln Thr Val Thr
405 410 415
Leu Leu Pro Ala Ala Asp Leu Asp Asp Phe Ser Lys Gln Leu Gln Gln
420 425 430
Ser Met Ser Ser Ala Asp Ser Thr Gln Ala
435 440
<210> 7
<211> 855
<212> PRT
<213> artificial sequence
<220>
<223> synthetic construct
<400> 7
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly Ser Asp Asn Gly Pro Gln Asn Gln
20 25 30
Arg Asn Ala Pro Arg Ile Thr Phe Gly Gly Pro Ser Asp Ser Thr Gly
35 40 45
Ser Asn Gln Asn Gly Glu Arg Ser Gly Ala Arg Ser Lys Gln Arg Arg
50 55 60
Pro Gln Gly Leu Pro Asn Asn Thr Ala Ser Trp Phe Thr Ala Leu Thr
65 70 75 80
Gln His Gly Lys Glu Asp Leu Lys Phe Pro Arg Gly Gln Gly Val Pro
85 90 95
Ile Asn Thr Asn Ser Ser Pro Asp Asp Gln Ile Gly Tyr Tyr Arg Arg
100 105 110
Ala Thr Arg Arg Ile Arg Gly Gly Asp Gly Lys Met Lys Asp Leu Ser
115 120 125
Pro Arg Trp Tyr Phe Tyr Tyr Leu Gly Thr Gly Pro Glu Ala Gly Leu
130 135 140
Pro Tyr Gly Ala Asn Lys Asp Gly Ile Ile Trp Val Ala Thr Glu Gly
145 150 155 160
Ala Leu Asn Thr Pro Lys Asp His Ile Gly Thr Arg Asn Pro Ala Asn
165 170 175
Asn Ala Ala Ile Val Leu Gln Leu Pro Gln Gly Thr Thr Leu Pro Lys
180 185 190
Gly Phe Tyr Ala Glu Gly Ser Arg Gly Gly Ser Gln Ala Ser Ser Arg
195 200 205
Ser Ser Ser Arg Ser Arg Asn Ser Ser Arg Asn Ser Thr Pro Gly Ser
210 215 220
Ser Arg Gly Thr Ser Pro Ala Arg Met Ala Gly Asn Gly Gly Asp Ala
225 230 235 240
Ala Leu Ala Leu Leu Leu Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys
245 250 255
Met Ser Gly Lys Gly Gln Gln Gln Gln Gly Gln Thr Val Thr Lys Lys
260 265 270
Ser Ala Ala Glu Ala Ser Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr
275 280 285
Lys Ala Tyr Asn Val Thr Gln Ala Phe Gly Arg Arg Gly Pro Glu Gln
290 295 300
Thr Gln Gly Asn Phe Gly Asp Gln Glu Leu Ile Arg Gln Gly Thr Asp
305 310 315 320
Tyr Lys His Trp Pro Gln Ile Ala Gln Phe Ala Pro Ser Ala Ser Ala
325 330 335
Phe Phe Gly Met Ser Arg Ile Gly Met Glu Val Thr Pro Ser Gly Thr
340 345 350
Trp Leu Thr Tyr Thr Gly Ala Ile Lys Leu Asp Asp Lys Asp Pro Asn
355 360 365
Phe Lys Asp Gln Val Ile Leu Leu Asn Lys His Ile Asp Ala Tyr Lys
370 375 380
Thr Phe Pro Pro Thr Glu Pro Lys Lys Asp Lys Lys Lys Lys Ala Asp
385 390 395 400
Glu Thr Gln Ala Leu Pro Gln Arg Gln Lys Lys Gln Gln Thr Val Thr
405 410 415
Leu Leu Pro Ala Ala Asp Leu Asp Asp Phe Ser Lys Gln Leu Gln Gln
420 425 430
Ser Met Ser Ser Ala Asp Ser Thr Gln Ala Met Ala Ser Pro Ala Ala
435 440 445
Pro Arg Ala Val Ser Phe Ala Asp Asn Asn Asp Ile Thr Asn Thr Asn
450 455 460
Leu Ser Arg Gly Arg Gly Arg Asn Pro Lys Pro Arg Ala Ala Pro Asn
465 470 475 480
Asn Thr Val Ser Trp Tyr Thr Gly Leu Thr Gln His Gly Lys Val Pro
485 490 495
Leu Thr Phe Pro Pro Gly Gln Gly Val Pro Leu Asn Ala Asn Ser Thr
500 505 510
Pro Ala Gln Asn Ala Gly Tyr Trp Arg Arg Gln Asp Arg Lys Ile Asn
515 520 525
Thr Gly Asn Gly Ile Lys Gln Leu Ala Pro Arg Trp Tyr Phe Tyr Tyr
530 535 540
Thr Gly Thr Gly Pro Glu Ala Ala Leu Pro Phe Arg Ala Val Lys Asp
545 550 555 560
Gly Ile Val Trp Val His Glu Asp Gly Ala Thr Asp Ala Pro Ser Thr
565 570 575
Phe Gly Thr Arg Asn Pro Asn Asn Asp Ser Ala Ile Val Thr Gln Phe
580 585 590
Ala Pro Gly Thr Lys Leu Pro Lys Asn Phe His Ile Glu Gly Thr Gly
595 600 605
Gly Asn Ser Gln Ser Ser Ser Arg Ala Ser Ser Leu Ser Arg Asn Ser
610 615 620
Ser Arg Ser Ser Ser Gln Gly Ser Arg Ser Gly Asn Ser Thr Arg Gly
625 630 635 640
Thr Ser Pro Gly Pro Ser Gly Ile Gly Ala Val Gly Gly Asp Leu Leu
645 650 655
Tyr Leu Asp Leu Leu Asn Arg Leu Gln Ala Leu Glu Ser Gly Lys Val
660 665 670
Lys Gln Ser Gln Pro Lys Val Ile Thr Lys Lys Asp Ala Ala Ala Ala
675 680 685
Lys Asn Lys Met Arg His Lys Arg Thr Ser Thr Lys Ser Phe Asn Met
690 695 700
Val Gln Ala Phe Gly Leu Arg Gly Pro Gly Asp Leu Gln Gly Asn Phe
705 710 715 720
Gly Asp Leu Gln Leu Asn Lys Leu Gly Thr Glu Asp Pro Arg Trp Pro
725 730 735
Gln Ile Ala Glu Leu Ala Pro Thr Ala Ser Ala Phe Met Gly Met Ser
740 745 750
Gln Phe Lys Leu Thr His Gln Asn Asn Asp Asp His Gly Asn Pro Val
755 760 765
Tyr Phe Leu Arg Tyr Ser Gly Ala Ile Lys Leu Asp Pro Lys Asn Pro
770 775 780
Asn Tyr Asn Lys Trp Leu Glu Leu Leu Glu Gln Asn Ile Asp Ala Tyr
785 790 795 800
Lys Thr Phe Pro Lys Lys Glu Lys Lys Gln Lys Ala Pro Lys Glu Glu
805 810 815
Ser Thr Asp Gln Met Ser Glu Pro Pro Lys Glu Gln Arg Val Gln Gly
820 825 830
Ser Ile Thr Gln Arg Thr Arg Thr Arg Pro Ser Val Gln Pro Gly Pro
835 840 845
Met Ile Asp Val Asn Thr Asp
850 855
<210> 8
<211> 24
<212> PRT
<213> Homo sapiens (Homo sapiens)
<400> 8
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly
20
<210> 9
<211> 413
<212> PRT
<213> MERS
<400> 9
Met Ala Ser Pro Ala Ala Pro Arg Ala Val Ser Phe Ala Asp Asn Asn
1 5 10 15
Asp Ile Thr Asn Thr Asn Leu Ser Arg Gly Arg Gly Arg Asn Pro Lys
20 25 30
Pro Arg Ala Ala Pro Asn Asn Thr Val Ser Trp Tyr Thr Gly Leu Thr
35 40 45
Gln His Gly Lys Val Pro Leu Thr Phe Pro Pro Gly Gln Gly Val Pro
50 55 60
Leu Asn Ala Asn Ser Thr Pro Ala Gln Asn Ala Gly Tyr Trp Arg Arg
65 70 75 80
Gln Asp Arg Lys Ile Asn Thr Gly Asn Gly Ile Lys Gln Leu Ala Pro
85 90 95
Arg Trp Tyr Phe Tyr Tyr Thr Gly Thr Gly Pro Glu Ala Ala Leu Pro
100 105 110
Phe Arg Ala Val Lys Asp Gly Ile Val Trp Val His Glu Asp Gly Ala
115 120 125
Thr Asp Ala Pro Ser Thr Phe Gly Thr Arg Asn Pro Asn Asn Asp Ser
130 135 140
Ala Ile Val Thr Gln Phe Ala Pro Gly Thr Lys Leu Pro Lys Asn Phe
145 150 155 160
His Ile Glu Gly Thr Gly Gly Asn Ser Gln Ser Ser Ser Arg Ala Ser
165 170 175
Ser Leu Ser Arg Asn Ser Ser Arg Ser Ser Ser Gln Gly Ser Arg Ser
180 185 190
Gly Asn Ser Thr Arg Gly Thr Ser Pro Gly Pro Ser Gly Ile Gly Ala
195 200 205
Val Gly Gly Asp Leu Leu Tyr Leu Asp Leu Leu Asn Arg Leu Gln Ala
210 215 220
Leu Glu Ser Gly Lys Val Lys Gln Ser Gln Pro Lys Val Ile Thr Lys
225 230 235 240
Lys Asp Ala Ala Ala Ala Lys Asn Lys Met Arg His Lys Arg Thr Ser
245 250 255
Thr Lys Ser Phe Asn Met Val Gln Ala Phe Gly Leu Arg Gly Pro Gly
260 265 270
Asp Leu Gln Gly Asn Phe Gly Asp Leu Gln Leu Asn Lys Leu Gly Thr
275 280 285
Glu Asp Pro Arg Trp Pro Gln Ile Ala Glu Leu Ala Pro Thr Ala Ser
290 295 300
Ala Phe Met Gly Met Ser Gln Phe Lys Leu Thr His Gln Asn Asn Asp
305 310 315 320
Asp His Gly Asn Pro Val Tyr Phe Leu Arg Tyr Ser Gly Ala Ile Lys
325 330 335
Leu Asp Pro Lys Asn Pro Asn Tyr Asn Lys Trp Leu Glu Leu Leu Glu
340 345 350
Gln Asn Ile Asp Ala Tyr Lys Thr Phe Pro Lys Lys Glu Lys Lys Gln
355 360 365
Lys Ala Pro Lys Glu Glu Ser Thr Asp Gln Met Ser Glu Pro Pro Lys
370 375 380
Glu Gln Arg Val Gln Gly Ser Ile Thr Gln Arg Thr Arg Thr Arg Pro
385 390 395 400
Ser Val Gln Pro Gly Pro Met Ile Asp Val Asn Thr Asp
405 410
<210> 10
<211> 422
<212> PRT
<213> SARS-COV-1
<400> 10
Met Ser Asp Asn Gly Pro Gln Ser Asn Gln Arg Ser Ala Pro Arg Ile
1 5 10 15
Thr Phe Gly Gly Pro Thr Asp Ser Thr Asp Asn Asn Gln Asn Gly Gly
20 25 30
Arg Asn Gly Ala Arg Pro Lys Gln Arg Arg Pro Gln Gly Leu Pro Asn
35 40 45
Asn Thr Ala Ser Trp Phe Thr Ala Leu Thr Gln His Gly Lys Glu Glu
50 55 60
Leu Arg Phe Pro Arg Gly Gln Gly Val Pro Ile Asn Thr Asn Ser Gly
65 70 75 80
Pro Asp Asp Gln Ile Gly Tyr Tyr Arg Arg Ala Thr Arg Arg Val Arg
85 90 95
Gly Gly Asp Gly Lys Met Lys Glu Leu Ser Pro Arg Trp Tyr Phe Tyr
100 105 110
Tyr Leu Gly Thr Gly Pro Glu Ala Ser Leu Pro Tyr Gly Ala Asn Lys
115 120 125
Glu Gly Ile Val Trp Val Ala Thr Glu Gly Ala Leu Asn Thr Pro Lys
130 135 140
Asp His Ile Gly Thr Arg Asn Pro Asn Asn Asn Ala Ala Thr Val Leu
145 150 155 160
Gln Leu Pro Gln Gly Thr Thr Leu Pro Lys Gly Phe Tyr Ala Glu Gly
165 170 175
Ser Arg Gly Gly Ser Gln Ala Ser Ser Arg Ser Ser Ser Arg Ser Arg
180 185 190
Gly Asn Ser Arg Asn Ser Thr Pro Gly Ser Ser Arg Gly Asn Ser Pro
195 200 205
Ala Arg Met Ala Ser Gly Gly Gly Glu Thr Ala Leu Ala Leu Leu Leu
210 215 220
Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys Val Ser Gly Lys Gly Gln
225 230 235 240
Gln Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala Glu Ala Ser
245 250 255
Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Gln Tyr Asn Val Thr
260 265 270
Gln Ala Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly Asn Phe Gly
275 280 285
Asp Gln Asp Leu Ile Arg Gln Gly Thr Asp Tyr Lys His Trp Pro Gln
290 295 300
Ile Ala Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly Met Ser Arg
305 310 315 320
Ile Gly Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr Tyr His Gly
325 330 335
Ala Ile Lys Leu Asp Asp Lys Asp Pro Gln Phe Lys Asp Asn Val Ile
340 345 350
Leu Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro Pro Thr Glu
355 360 365
Pro Lys Lys Asp Lys Lys Lys Lys Thr Asp Glu Ala Gln Pro Leu Pro
370 375 380
Gln Arg Gln Lys Lys Gln Pro Thr Val Thr Leu Leu Pro Ala Ala Asp
385 390 395 400
Met Asp Asp Phe Ser Arg Gln Leu Gln Asn Ser Met Ser Gly Ala Ser
405 410 415
Ala Asp Ser Thr Gln Ala
420
<210> 11
<211> 26
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 11
ggcgcgccat tggccaccgc ggccgc 26
<210> 12
<211> 10
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 12
attggccacc 10
<210> 13
<211> 497
<212> PRT
<213> influenza A Virus
<400> 13
Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu Thr Gly Gly
1 5 10 15
Glu Arg Gln Asn Ala Thr Glu Ile Arg Ala Ser Val Gly Arg Met Val
20 25 30
Gly Gly Ile Gly Arg Phe Tyr Ile Gln Met Cys Thr Glu Leu Lys Leu
35 40 45
Ser Asp Tyr Glu Gly Arg Leu Ile Gln Asn Ser Ile Thr Ile Glu Arg
50 55 60
Met Val Leu Ser Ala Phe Asp Glu Arg Arg Asn Lys Tyr Leu Glu Glu
65 70 75 80
His Pro Ser Ala Gly Lys Asp Pro Lys Lys Thr Gly Gly Pro Ile Tyr
85 90 95
Arg Arg Arg Asp Gly Lys Trp Val Arg Glu Leu Ile Leu Tyr Asp Lys
100 105 110
Glu Glu Ile Arg Arg Ile Trp Arg Gln Ala Asn Asn Gly Glu Asp Ala
115 120 125
Thr Ala Gly Leu Thr His Leu Met Ile Trp His Ser Asn Leu Asn Asp
130 135 140
Ala Thr Tyr Gln Arg Thr Arg Ala Leu Val Arg Thr Gly Met Asp Pro
145 150 155 160
Arg Met Cys Ser Leu Met Gln Gly Ser Thr Leu Pro Arg Arg Ser Gly
165 170 175
Ala Ala Gly Ala Ala Val Lys Gly Val Gly Thr Met Val Met Glu Leu
180 185 190
Ile Arg Met Ile Lys Arg Gly Ile Asn Asp Arg Asn Phe Trp Arg Gly
195 200 205
Glu Asn Gly Arg Arg Thr Arg Ile Ala Tyr Glu Arg Met Cys Asn Ile
210 215 220
Leu Lys Gly Lys Phe Gln Thr Ala Ala Gln Arg Ala Met Met Asp Gln
225 230 235 240
Val Arg Glu Ser Arg Asn Pro Gly Asn Ala Glu Ile Glu Asp Leu Ile
245 250 255
Phe Leu Ala Arg Ser Ala Leu Ile Leu Arg Gly Ser Val Ala His Lys
260 265 270
Ser Cys Leu Pro Ala Cys Val Tyr Gly Leu Ala Val Ala Ser Gly Tyr
275 280 285
Asp Phe Glu Arg Glu Gly Tyr Ser Leu Val Gly Ile Asp Pro Phe Arg
290 295 300
Leu Leu Gln Asn Ser Gln Val Phe Ser Leu Ile Arg Pro Asn Glu Asn
305 310 315 320
Pro Ala His Lys Ser Gln Leu Val Trp Met Ala Cys His Ser Ala Ala
325 330 335
Phe Glu Asp Leu Arg Val Ser Ser Phe Ile Arg Gly Thr Arg Val Val
340 345 350
Pro Arg Gly Gln Leu Ser Thr Arg Gly Val Gln Ile Ala Ser Asn Glu
355 360 365
Asn Met Glu Thr Met Asp Ser Ser Thr Leu Glu Leu Arg Ser Arg Tyr
370 375 380
Trp Ala Ile Arg Thr Arg Ser Gly Gly Thr Thr Asn Gln Gln Arg Ala
385 390 395 400
Ser Ala Gly Gln Ile Ser Val Gln Pro Thr Phe Ser Val Gln Arg Asn
405 410 415
Leu Pro Phe Glu Arg Ala Thr Ile Met Ala Ala Phe Thr Gly Asn Thr
420 425 430
Glu Gly Arg Thr Ser Asp Met Arg Thr Glu Ile Ile Arg Met Met Glu
435 440 445
Ser Ala Lys Pro Glu Asp Val Ser Phe Gln Gly Arg Gly Val Phe Glu
450 455 460
Leu Ser Asp Glu Lys Ala Thr Asn Pro Ile Val Pro Ser Phe Asp Met
465 470 475 480
Ser Asn Glu Gly Ser Tyr Phe Phe Gly Asp Asn Ala Glu Glu Tyr Asp
485 490 495
Asn
<210> 14
<211> 560
<212> PRT
<213> influenza b virus
<400> 14
Met Ser Asn Met Asp Ile Asp Gly Ile Asn Thr Gly Thr Ile Asp Lys
1 5 10 15
Thr Pro Glu Glu Ile Thr Pro Gly Thr Ser Gly Thr Thr Arg Pro Ile
20 25 30
Ile Arg Pro Ala Thr Leu Ala Pro Pro Ser Asn Lys Arg Thr Arg Asn
35 40 45
Pro Ser Pro Glu Arg Ala Thr Thr Ser Ser Glu Asp Asp Val Gly Arg
50 55 60
Lys Thr Gln Lys Lys Gln Thr Pro Thr Glu Ile Lys Lys Ser Val Tyr
65 70 75 80
Asn Met Val Val Lys Leu Gly Glu Phe Tyr Asn Gln Met Met Val Lys
85 90 95
Ala Gly Leu Asn Asp Asp Met Glu Arg Asn Leu Ile Gln Asn Ala His
100 105 110
Ala Val Glu Arg Ile Leu Leu Ala Ala Thr Asp Asp Lys Lys Thr Glu
115 120 125
Phe Gln Lys Lys Lys Asn Ala Arg Asp Val Lys Glu Gly Lys Glu Glu
130 135 140
Ile Asp His Asn Lys Thr Gly Gly Thr Phe Tyr Lys Met Val Arg Asp
145 150 155 160
Asp Lys Thr Ile Tyr Phe Ser Pro Ile Arg Ile Thr Phe Leu Lys Glu
165 170 175
Glu Val Lys Thr Met Tyr Lys Thr Thr Met Gly Ser Asp Gly Phe Ser
180 185 190
Gly Leu Asn His Ile Met Ile Gly His Ser Gln Met Asn Asp Val Cys
195 200 205
Phe Gln Arg Ser Lys Ala Leu Lys Arg Val Gly Leu Asp Pro Ser Leu
210 215 220
Ile Ser Thr Phe Ala Gly Ser Thr Ile Pro Arg Arg Ser Gly Ala Thr
225 230 235 240
Gly Val Ala Ile Lys Gly Gly Gly Thr Leu Val Ala Glu Ala Ile Arg
245 250 255
Phe Ile Gly Arg Ala Met Ala Asp Arg Gly Leu Leu Arg Asp Ile Lys
260 265 270
Ala Lys Thr Ala Tyr Glu Lys Ile Leu Leu Asn Leu Lys Asn Lys Cys
275 280 285
Ser Ala Pro Gln Gln Lys Ala Leu Val Asp Gln Val Ile Gly Ser Arg
290 295 300
Asn Pro Gly Ile Ala Asp Ile Glu Asp Leu Thr Leu Leu Ala Arg Ser
305 310 315 320
Met Val Val Val Arg Pro Ser Val Ala Ser Lys Val Val Leu Pro Ile
325 330 335
Ser Ile Tyr Ala Lys Ile Pro Gln Leu Gly Phe Asn Val Glu Glu Tyr
340 345 350
Ser Met Val Gly Tyr Glu Ala Met Ala Leu Tyr Asn Met Ala Thr Pro
355 360 365
Val Ser Ile Leu Arg Met Gly Asp Asp Ala Lys Asp Lys Ser Gln Leu
370 375 380
Phe Phe Met Ser Cys Phe Gly Ala Ala Tyr Glu Asp Leu Arg Val Leu
385 390 395 400
Ser Ala Leu Thr Gly Thr Glu Phe Lys Pro Arg Ser Ala Leu Lys Cys
405 410 415
Lys Gly Phe His Val Pro Ala Lys Glu Gln Val Glu Gly Met Gly Ala
420 425 430
Ala Leu Met Ser Ile Lys Leu Gln Phe Trp Ala Pro Met Thr Arg Ser
435 440 445
Gly Gly Asn Glu Val Gly Gly Asp Gly Gly Ser Gly Gln Ile Ser Cys
450 455 460
Ser Pro Val Phe Ala Val Glu Arg Pro Ile Ala Leu Ser Lys Gln Ala
465 470 475 480
Val Arg Arg Met Leu Ser Met Asn Ile Glu Gly Arg Asp Ala Asp Val
485 490 495
Lys Gly Asn Leu Leu Lys Met Met Asn Asp Ser Met Ala Lys Lys Thr
500 505 510
Ser Gly Asn Ala Phe Ile Gly Lys Lys Met Phe Gln Ile Ser Asp Lys
515 520 525
Asn Lys Thr Asn Pro Val Glu Ile Pro Ile Lys Gln Thr Ile Pro Asn
530 535 540
Phe Phe Phe Gly Arg Asp Thr Ala Glu Asp Tyr Asp Asp Leu Asp Tyr
545 550 555 560
<210> 15
<211> 3246
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 15
atgcctatgg gcagcctgca gccactggct acactgtacc tgctgggcat gctggtggcc 60
tcttgtctgg gcgccagcca aggcactaag agaagctacg agcagatgga aaccggaggc 120
gaacggcaga acgccacaga gatcagagcc tctgtgggcc gtatggtcgg cggcatcggc 180
agattctaca tccagatgtg caccgaactg aagctgagcg actacgaggg ccgcctgatc 240
cagaacagca tcacaatcga gagaatggtg ctgtccgcct ttgacgagcg gagaaacaaa 300
tacctggaag agcaccctag cgccggaaaa gatcctaaga aaaccggcgg acctatctac 360
agaagaagag atggtaagtg ggtgagagag ctgattctgt acgataagga agagattcga 420
agaatctgga gacaggccaa caacggcgag gatgccaccg caggcctgac acacctgatg 480
atctggcaca gcaacctgaa cgatgcgacc taccagcgca cgcgggccct ggtcagaacc 540
ggcatggatc ctcggatgtg tagcctgatg cagggcagca cactgccaag acggagtggg 600
gccgccggcg ctgcagtgaa gggcgtcgga accatggtga tggagctgat ccggatgata 660
aagcggggca tcaacgacag aaacttctgg cgaggcgaga acggccgaag aacccggatc 720
gcctacgaga gaatgtgcaa catcctgaaa ggaaaattcc agaccgccgc ccagcgggcc 780
atgatggacc aggtgcgcga gagcagaaac cccggcaatg ccgagatcga ggacctgatc 840
ttcctggcca gaagcgccct cattcttaga ggctctgtgg cccacaagag ctgtctgcct 900
gcctgtgtgt acggcctggc agtggcctca ggctacgact tcgagcggga aggatacagt 960
ctggtgggca tcgacccttt cagactcctg cagaatagcc aggtgtttag cctgatcaga 1020
ccaaacgaaa accccgccca taagagccag ctggtgtgga tggcctgcca cagcgccgcc 1080
tttgaggatc tgagagtgag ctcttttatc agaggcaccc gggtggttcc acgaggtcaa 1140
ctgtctacaa gaggtgtgca gatcgccagc aacgagaaca tggagaccat ggatagcagc 1200
accctggaac tgagatccag atactgggcc atcaggacac ggagcggcgg caccaccaat 1260
cagcagcgcg ccagcgccgg ccagatctct gtccagccta cgtttagcgt gcagcggaat 1320
ttgcccttcg aacgcgccac aatcatggct gctttcaccg gcaatacaga gggcagaacc 1380
agcgatatga gaacagaaat tatccgtatg atggagtccg caaaacctga ggacgtgtcc 1440
ttccaaggca gaggcgtgtt cgagctgagc gacgagaagg ccaccaaccc tatcgtgcct 1500
agcttcgata tgtctaatga gggcagctac tttttcggag ataacgccga agagtacgac 1560
aacatgtcta atatggatat cgacggcatt aacaccggca ccatcgacaa aacccctgag 1620
gagatcaccc ctggcaccag cggcacaacc cggcccatca tccgccccgc tacactggct 1680
ccacctagca acaagcggac cagaaatccc tcgccagaaa gagccacaac ctccagcgag 1740
gacgacgtgg gacggaagac acaaaagaag cagaccccta cagagatcaa gaagtctgtt 1800
tacaacatgg tggtgaaact gggcgagttc tacaaccaga tgatggtgaa ggccggcctg 1860
aacgacgata tggaaagaaa tctgatccag aacgcccacg ccgtggagcg gattctgctg 1920
gccgccaccg atgataagaa gaccgaattc cagaaaaaga aaaacgccag agacgtgaag 1980
gaaggcaagg aagagatcga ccacaacaag acaggcggca cattctacaa gatggtccgg 2040
gacgacaaga ccatctactt cagccctatc cggataacat tcctgaaaga agaagtgaag 2100
accatgtaca aaaccacaat gggctctgac ggcttcagcg gcctgaatca catcatgatc 2160
ggccactctc aaatgaacga tgtgtgcttc cagagaagca aggctctgaa gcgcgtgggc 2220
ctggatccta gcctgatctc taccttcgcc ggcagcacca tccccagaag atcgggcgct 2280
accggcgtgg ctatcaaggg aggaggcaca ctggtggctg aagccatcag attcatcgga 2340
agagccatgg ccgacagagg actcctgaga gatatcaaag ccaaaaccgc ctacgaaaaa 2400
atcctgctga acctgaagaa caagtgcagc gcgcctcaac agaaggccct ggtggaccag 2460
gttatcggct ctagaaaccc tggaatcgcc gatatcgagg acctgacact gctggccaga 2520
tctatggtgg tggtgagacc ctccgtggcc agcaaggtgg tgctgcctat cagcatctac 2580
gccaagatcc ctcagctggg atttaacgtg gaagaataca gcatggttgg ttatgaggcc 2640
atggccctgt acaacatggc cacacctgtg tccatcctga gaatgggcga cgatgccaaa 2700
gacaagagcc agctgttctt catgagctgc ttcggcgctg cctatgagga cctgagagtg 2760
ctgtccgctc ttacaggaac agagttcaag cctaggagcg cactgaagtg caagggcttc 2820
cacgtgcccg ccaaggaaca ggtggaaggc atgggagctg ctctgatgtc catcaagctg 2880
caattttggg ctcctatgac ccggagcggc ggaaatgagg tgggtggcga cggaggcagc 2940
ggacagattt cttgcagccc cgtatttgcc gtggagagac caatcgccct gtccaagcag 3000
gccgtgagaa gaatgctgag catgaacatc gagggccggg acgccgacgt gaagggcaac 3060
ctgttgaaga tgatgaacga cagcatggcc aagaagacca gtggcaatgc cttcatcggc 3120
aagaagatgt tccagatctc cgacaagaac aagaccaacc ccgtggaaat ccccatcaag 3180
cagacaatcc ctaacttctt cttcggcaga gacaccgccg aagactatga cgacctggac 3240
tactga 3246
<210> 16
<211> 1081
<212> PRT
<213> artificial sequence
<220>
<223> synthetic construct
<400> 16
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly Ala Ser Gln Gly Thr Lys Arg Ser
20 25 30
Tyr Glu Gln Met Glu Thr Gly Gly Glu Arg Gln Asn Ala Thr Glu Ile
35 40 45
Arg Ala Ser Val Gly Arg Met Val Gly Gly Ile Gly Arg Phe Tyr Ile
50 55 60
Gln Met Cys Thr Glu Leu Lys Leu Ser Asp Tyr Glu Gly Arg Leu Ile
65 70 75 80
Gln Asn Ser Ile Thr Ile Glu Arg Met Val Leu Ser Ala Phe Asp Glu
85 90 95
Arg Arg Asn Lys Tyr Leu Glu Glu His Pro Ser Ala Gly Lys Asp Pro
100 105 110
Lys Lys Thr Gly Gly Pro Ile Tyr Arg Arg Arg Asp Gly Lys Trp Val
115 120 125
Arg Glu Leu Ile Leu Tyr Asp Lys Glu Glu Ile Arg Arg Ile Trp Arg
130 135 140
Gln Ala Asn Asn Gly Glu Asp Ala Thr Ala Gly Leu Thr His Leu Met
145 150 155 160
Ile Trp His Ser Asn Leu Asn Asp Ala Thr Tyr Gln Arg Thr Arg Ala
165 170 175
Leu Val Arg Thr Gly Met Asp Pro Arg Met Cys Ser Leu Met Gln Gly
180 185 190
Ser Thr Leu Pro Arg Arg Ser Gly Ala Ala Gly Ala Ala Val Lys Gly
195 200 205
Val Gly Thr Met Val Met Glu Leu Ile Arg Met Ile Lys Arg Gly Ile
210 215 220
Asn Asp Arg Asn Phe Trp Arg Gly Glu Asn Gly Arg Arg Thr Arg Ile
225 230 235 240
Ala Tyr Glu Arg Met Cys Asn Ile Leu Lys Gly Lys Phe Gln Thr Ala
245 250 255
Ala Gln Arg Ala Met Met Asp Gln Val Arg Glu Ser Arg Asn Pro Gly
260 265 270
Asn Ala Glu Ile Glu Asp Leu Ile Phe Leu Ala Arg Ser Ala Leu Ile
275 280 285
Leu Arg Gly Ser Val Ala His Lys Ser Cys Leu Pro Ala Cys Val Tyr
290 295 300
Gly Leu Ala Val Ala Ser Gly Tyr Asp Phe Glu Arg Glu Gly Tyr Ser
305 310 315 320
Leu Val Gly Ile Asp Pro Phe Arg Leu Leu Gln Asn Ser Gln Val Phe
325 330 335
Ser Leu Ile Arg Pro Asn Glu Asn Pro Ala His Lys Ser Gln Leu Val
340 345 350
Trp Met Ala Cys His Ser Ala Ala Phe Glu Asp Leu Arg Val Ser Ser
355 360 365
Phe Ile Arg Gly Thr Arg Val Val Pro Arg Gly Gln Leu Ser Thr Arg
370 375 380
Gly Val Gln Ile Ala Ser Asn Glu Asn Met Glu Thr Met Asp Ser Ser
385 390 395 400
Thr Leu Glu Leu Arg Ser Arg Tyr Trp Ala Ile Arg Thr Arg Ser Gly
405 410 415
Gly Thr Thr Asn Gln Gln Arg Ala Ser Ala Gly Gln Ile Ser Val Gln
420 425 430
Pro Thr Phe Ser Val Gln Arg Asn Leu Pro Phe Glu Arg Ala Thr Ile
435 440 445
Met Ala Ala Phe Thr Gly Asn Thr Glu Gly Arg Thr Ser Asp Met Arg
450 455 460
Thr Glu Ile Ile Arg Met Met Glu Ser Ala Lys Pro Glu Asp Val Ser
465 470 475 480
Phe Gln Gly Arg Gly Val Phe Glu Leu Ser Asp Glu Lys Ala Thr Asn
485 490 495
Pro Ile Val Pro Ser Phe Asp Met Ser Asn Glu Gly Ser Tyr Phe Phe
500 505 510
Gly Asp Asn Ala Glu Glu Tyr Asp Asn Met Ser Asn Met Asp Ile Asp
515 520 525
Gly Ile Asn Thr Gly Thr Ile Asp Lys Thr Pro Glu Glu Ile Thr Pro
530 535 540
Gly Thr Ser Gly Thr Thr Arg Pro Ile Ile Arg Pro Ala Thr Leu Ala
545 550 555 560
Pro Pro Ser Asn Lys Arg Thr Arg Asn Pro Ser Pro Glu Arg Ala Thr
565 570 575
Thr Ser Ser Glu Asp Asp Val Gly Arg Lys Thr Gln Lys Lys Gln Thr
580 585 590
Pro Thr Glu Ile Lys Lys Ser Val Tyr Asn Met Val Val Lys Leu Gly
595 600 605
Glu Phe Tyr Asn Gln Met Met Val Lys Ala Gly Leu Asn Asp Asp Met
610 615 620
Glu Arg Asn Leu Ile Gln Asn Ala His Ala Val Glu Arg Ile Leu Leu
625 630 635 640
Ala Ala Thr Asp Asp Lys Lys Thr Glu Phe Gln Lys Lys Lys Asn Ala
645 650 655
Arg Asp Val Lys Glu Gly Lys Glu Glu Ile Asp His Asn Lys Thr Gly
660 665 670
Gly Thr Phe Tyr Lys Met Val Arg Asp Asp Lys Thr Ile Tyr Phe Ser
675 680 685
Pro Ile Arg Ile Thr Phe Leu Lys Glu Glu Val Lys Thr Met Tyr Lys
690 695 700
Thr Thr Met Gly Ser Asp Gly Phe Ser Gly Leu Asn His Ile Met Ile
705 710 715 720
Gly His Ser Gln Met Asn Asp Val Cys Phe Gln Arg Ser Lys Ala Leu
725 730 735
Lys Arg Val Gly Leu Asp Pro Ser Leu Ile Ser Thr Phe Ala Gly Ser
740 745 750
Thr Ile Pro Arg Arg Ser Gly Ala Thr Gly Val Ala Ile Lys Gly Gly
755 760 765
Gly Thr Leu Val Ala Glu Ala Ile Arg Phe Ile Gly Arg Ala Met Ala
770 775 780
Asp Arg Gly Leu Leu Arg Asp Ile Lys Ala Lys Thr Ala Tyr Glu Lys
785 790 795 800
Ile Leu Leu Asn Leu Lys Asn Lys Cys Ser Ala Pro Gln Gln Lys Ala
805 810 815
Leu Val Asp Gln Val Ile Gly Ser Arg Asn Pro Gly Ile Ala Asp Ile
820 825 830
Glu Asp Leu Thr Leu Leu Ala Arg Ser Met Val Val Val Arg Pro Ser
835 840 845
Val Ala Ser Lys Val Val Leu Pro Ile Ser Ile Tyr Ala Lys Ile Pro
850 855 860
Gln Leu Gly Phe Asn Val Glu Glu Tyr Ser Met Val Gly Tyr Glu Ala
865 870 875 880
Met Ala Leu Tyr Asn Met Ala Thr Pro Val Ser Ile Leu Arg Met Gly
885 890 895
Asp Asp Ala Lys Asp Lys Ser Gln Leu Phe Phe Met Ser Cys Phe Gly
900 905 910
Ala Ala Tyr Glu Asp Leu Arg Val Leu Ser Ala Leu Thr Gly Thr Glu
915 920 925
Phe Lys Pro Arg Ser Ala Leu Lys Cys Lys Gly Phe His Val Pro Ala
930 935 940
Lys Glu Gln Val Glu Gly Met Gly Ala Ala Leu Met Ser Ile Lys Leu
945 950 955 960
Gln Phe Trp Ala Pro Met Thr Arg Ser Gly Gly Asn Glu Val Gly Gly
965 970 975
Asp Gly Gly Ser Gly Gln Ile Ser Cys Ser Pro Val Phe Ala Val Glu
980 985 990
Arg Pro Ile Ala Leu Ser Lys Gln Ala Val Arg Arg Met Leu Ser Met
995 1000 1005
Asn Ile Glu Gly Arg Asp Ala Asp Val Lys Gly Asn Leu Leu Lys Met
1010 1015 1020
Met Asn Asp Ser Met Ala Lys Lys Thr Ser Gly Asn Ala Phe Ile Gly
1025 1030 1035 1040
Lys Lys Met Phe Gln Ile Ser Asp Lys Asn Lys Thr Asn Pro Val Glu
1045 1050 1055
Ile Pro Ile Lys Gln Thr Ile Pro Asn Phe Phe Phe Gly Arg Asp Thr
1060 1065 1070
Ala Glu Asp Tyr Asp Asp Leu Asp Tyr
1075 1080
<210> 17
<211> 497
<212> PRT
<213> influenza A Virus
<400> 17
Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln Met Glu Thr Asp Gly
1 5 10 15
Glu Arg Gln Asn Ala Asn Glu Ile Arg Ala Ser Val Gly Lys Met Ile
20 25 30
Gly Gly Ile Gly Arg Phe Tyr Ile Gln Met Cys Thr Glu Leu Lys Leu
35 40 45
Ser Asp Tyr Glu Gly Arg Leu Ile Gln Asn Ser Leu Thr Ile Glu Arg
50 55 60
Met Val Leu Ser Ala Phe Asp Glu Arg Arg Asn Lys Tyr Leu Glu Glu
65 70 75 80
His Pro Ser Ala Gly Lys Asp Pro Lys Lys Thr Gly Gly Pro Ile Tyr
85 90 95
Lys Arg Val Asp Gly Lys Trp Met Arg Glu Leu Val Leu Tyr Asp Lys
100 105 110
Glu Glu Ile Arg Arg Ile Trp Arg Gln Ala Asn Asn Gly Asp Asp Ala
115 120 125
Thr Ala Gly Leu Thr His Met Met Ile Trp His Ser Asn Leu Asn Asp
130 135 140
Thr Thr Tyr Gln Arg Thr Arg Ala Leu Val Arg Thr Gly Met Asp Pro
145 150 155 160
Arg Met Cys Ser Leu Met Gln Gly Ser Thr Leu Pro Arg Arg Ser Gly
165 170 175
Ala Ala Gly Ala Ala Val Lys Gly Val Gly Thr Met Val Met Glu Leu
180 185 190
Ile Arg Met Ile Lys Arg Gly Ile Asn Asp Arg Asn Phe Trp Arg Gly
195 200 205
Glu Asn Gly Arg Lys Thr Arg Asn Ala Tyr Glu Arg Met Cys Asn Ile
210 215 220
Leu Lys Gly Lys Phe Gln Thr Ala Ala Gln Arg Ala Met Met Asp Gln
225 230 235 240
Val Arg Glu Ser Arg Asn Pro Gly Asn Ala Glu Ile Glu Asp Leu Ile
245 250 255
Phe Leu Ala Arg Ser Ala Leu Ile Leu Arg Gly Ser Val Ala His Lys
260 265 270
Ser Cys Leu Pro Ala Cys Val Tyr Gly Pro Ala Val Ala Ser Gly Tyr
275 280 285
Asp Phe Glu Lys Glu Gly Tyr Ser Leu Val Gly Ile Asp Pro Phe Lys
290 295 300
Leu Leu Gln Asn Ser Gln Val Tyr Ser Leu Ile Arg Pro Asn Glu Asn
305 310 315 320
Pro Ala His Lys Ser Gln Leu Val Trp Met Ala Cys Asn Ser Ala Ala
325 330 335
Phe Glu Asp Leu Arg Val Ser Ser Phe Ile Arg Gly Thr Lys Val Ile
340 345 350
Pro Arg Gly Lys Leu Ser Thr Arg Gly Val Gln Ile Ala Ser Asn Glu
355 360 365
Asn Met Asp Thr Met Gly Ser Ser Thr Leu Glu Leu Arg Ser Arg Tyr
370 375 380
Trp Ala Ile Arg Thr Arg Ser Gly Gly Asn Thr Asn Gln Gln Arg Ala
385 390 395 400
Ser Ala Gly Gln Ile Ser Val Gln Pro Thr Phe Ser Val Gln Arg Asn
405 410 415
Leu Pro Phe Asp Lys Pro Thr Ile Met Ala Ala Phe Thr Gly Asn Ala
420 425 430
Glu Gly Arg Thr Ser Asp Met Arg Ala Glu Ile Ile Arg Met Met Glu
435 440 445
Gly Ala Lys Pro Glu Glu Val Ser Phe Gln Gly Arg Gly Val Phe Glu
450 455 460
Leu Ser Asp Glu Lys Ala Thr Asn Pro Ile Val Pro Ser Phe Asp Met
465 470 475 480
Ser Asn Glu Gly Ser Tyr Phe Phe Gly Asp Asn Ala Glu Glu Tyr Asp
485 490 495
Asn
<210> 18
<211> 738
<212> PRT
<213> zaire Ebola Virus
<400> 18
Asp Ser Arg Pro Gln Lys Ile Trp Met Ala Pro Ser Leu Thr Glu Ser
1 5 10 15
Asp Met Asp Tyr His Lys Ile Leu Thr Ala Gly Leu Ser Val Gln Gln
20 25 30
Gly Ile Val Arg Gln Arg Val Ile Pro Val Tyr Gln Val Asn Asn Leu
35 40 45
Glu Glu Ile Cys Gln Leu Ile Ile Gln Ala Phe Glu Ala Gly Val Asp
50 55 60
Phe Gln Glu Ser Ala Asp Ser Phe Leu Leu Met Leu Cys Leu His His
65 70 75 80
Ala Tyr Gln Gly Asp Tyr Lys Leu Phe Leu Glu Ser Gly Ala Val Lys
85 90 95
Tyr Leu Glu Gly His Gly Phe Arg Phe Glu Val Lys Lys Arg Asp Gly
100 105 110
Val Lys Arg Leu Glu Glu Leu Leu Pro Ala Val Ser Ser Gly Lys Asn
115 120 125
Ile Lys Arg Thr Leu Ala Ala Met Pro Glu Glu Glu Thr Thr Glu Ala
130 135 140
Asn Ala Gly Gln Phe Leu Ser Phe Ala Ser Leu Phe Leu Pro Lys Leu
145 150 155 160
Val Val Gly Glu Lys Ala Cys Leu Glu Lys Val Gln Arg Gln Ile Gln
165 170 175
Val His Ala Glu Gln Gly Leu Ile Gln Tyr Pro Thr Ala Trp Gln Ser
180 185 190
Val Gly His Met Met Val Ile Phe Arg Leu Met Arg Thr Asn Phe Leu
195 200 205
Ile Lys Phe Leu Leu Ile His Gln Gly Met His Met Val Ala Gly His
210 215 220
Asp Ala Asn Asp Ala Val Ile Ser Asn Ser Val Ala Gln Ala Arg Phe
225 230 235 240
Ser Gly Leu Leu Ile Val Lys Thr Val Leu Asp His Ile Leu Gln Lys
245 250 255
Thr Glu Arg Gly Val Arg Leu His Pro Leu Ala Arg Thr Ala Lys Val
260 265 270
Lys Asn Glu Val Asn Ser Phe Lys Ala Ala Leu Ser Ser Leu Ala Lys
275 280 285
His Gly Glu Tyr Ala Pro Phe Ala Arg Leu Leu Asn Leu Ser Gly Val
290 295 300
Asn Asn Leu Glu His Gly Leu Phe Pro Gln Leu Ser Ala Ile Ala Leu
305 310 315 320
Gly Val Ala Thr Ala His Gly Ser Thr Leu Ala Gly Val Asn Val Gly
325 330 335
Glu Gln Tyr Gln Gln Leu Arg Glu Ala Ala Thr Glu Ala Glu Lys Gln
340 345 350
Leu Gln Gln Tyr Ala Glu Ser Arg Glu Leu Asp His Leu Gly Leu Asp
355 360 365
Asp Gln Glu Lys Lys Ile Leu Met Asn Phe His Gln Lys Lys Asn Glu
370 375 380
Ile Ser Phe Gln Gln Thr Asn Ala Met Val Thr Leu Arg Lys Glu Arg
385 390 395 400
Leu Ala Lys Leu Thr Glu Ala Ile Thr Ala Ala Ser Leu Pro Lys Thr
405 410 415
Ser Gly His Tyr Asp Asp Asp Asp Asp Ile Pro Phe Pro Gly Pro Ile
420 425 430
Asn Asp Asp Asp Asn Pro Gly His Gln Asp Asp Asp Pro Thr Asp Ser
435 440 445
Gln Asp Thr Thr Ile Pro Asp Val Val Val Asp Pro Asp Asp Gly Ser
450 455 460
Tyr Gly Glu Tyr Gln Ser Tyr Ser Glu Asn Gly Met Asn Ala Pro Asp
465 470 475 480
Asp Leu Val Leu Phe Asp Leu Asp Glu Asp Asp Glu Asp Thr Lys Pro
485 490 495
Val Pro Asn Arg Ser Thr Lys Gly Gly Gln Gln Lys Asn Ser Gln Lys
500 505 510
Gly Gln His Ile Glu Gly Arg Gln Thr Gln Phe Arg Pro Ile Gln Asn
515 520 525
Val Pro Gly Pro His Arg Thr Ile His His Ala Ser Ala Pro Leu Thr
530 535 540
Asp Asn Asp Arg Arg Asn Glu Pro Ser Gly Ser Thr Ser Pro Arg Met
545 550 555 560
Leu Thr Pro Ile Asn Glu Glu Ala Asp Pro Leu Asp Asp Ala Asp Asp
565 570 575
Glu Thr Ser Ser Leu Pro Pro Leu Glu Ser Asp Asp Glu Glu Gln Asp
580 585 590
Arg Asp Gly Thr Ser Asn Arg Thr Pro Thr Val Ala Pro Pro Ala Pro
595 600 605
Val Tyr Arg Asp His Ser Glu Lys Lys Glu Leu Pro Gln Asp Glu Gln
610 615 620
Gln Asp Gln Asp His Thr Gln Glu Ala Arg Asn Gln Asp Ser Asp Asn
625 630 635 640
Thr Gln Ser Glu His Ser Leu Glu Glu Met Tyr Arg His Ile Leu Arg
645 650 655
Ser Gln Gly Pro Phe Asp Ala Val Leu Tyr Tyr His Met Met Lys Asp
660 665 670
Glu Pro Val Val Phe Ser Thr Ser Asp Gly Lys Glu Tyr Thr Tyr Pro
675 680 685
Asp Ser Leu Glu Glu Glu Tyr Pro Pro Trp Leu Thr Glu Lys Glu Ala
690 695 700
Met Asn Glu Glu Asn Arg Phe Val Thr Leu Asp Gly Gln Gln Phe Tyr
705 710 715 720
Trp Pro Val Met Asn His Lys Asn Lys Phe Met Ala Ile Leu Gln His
725 730 735
His Gln
<210> 19
<211> 336
<212> PRT
<213> Sudan type ebola virus
<400> 19
Ala Lys Leu Thr Glu Ala Ile Thr Thr Ala Ser Lys Ile Lys Val Gly
1 5 10 15
Asp Arg Tyr Pro Asp Asp Asn Asp Ile Pro Phe Pro Gly Pro Ile Tyr
20 25 30
Asp Asp Thr His Pro Asn Pro Ser Asp Asp Asn Pro Asp Asp Ser Arg
35 40 45
Asp Thr Thr Ile Pro Gly Gly Val Val Asp Pro Tyr Asp Asp Glu Ser
50 55 60
Asn Asn Tyr Pro Asp Tyr Glu Asp Ser Ala Glu Gly Thr Thr Gly Asp
65 70 75 80
Leu Asp Leu Phe Asn Leu Asp Asp Asp Asp Asp Asp Ser Arg Pro Gly
85 90 95
Pro Pro Asp Arg Gly Gln Asn Lys Glu Arg Ala Ala Arg Thr Tyr Gly
100 105 110
Leu Gln Asp Pro Thr Leu Asp Gly Ala Lys Lys Val Pro Glu Leu Thr
115 120 125
Pro Gly Ser His Gln Pro Gly Asn Leu His Ile Thr Lys Ser Gly Ser
130 135 140
Asn Thr Asn Gln Pro Gln Gly Asn Met Ser Ser Thr Leu His Ser Met
145 150 155 160
Thr Pro Ile Gln Glu Glu Ser Glu Pro Asp Asp Gln Lys Asp Asn Asp
165 170 175
Asp Glu Ser Leu Thr Ser Leu Asp Ser Glu Gly Asp Glu Asp Gly Glu
180 185 190
Ser Ile Ser Glu Glu Asn Thr Pro Thr Val Ala Pro Pro Ala Pro Val
195 200 205
Tyr Lys Asp Thr Gly Val Asp Thr Asn Gln Gln Asn Gly Pro Ser Ser
210 215 220
Thr Val Asp Ser Gln Gly Ser Glu Ser Glu Ala Leu Pro Ile Asn Ser
225 230 235 240
Lys Lys Ser Ser Ala Leu Glu Glu Thr Tyr Tyr His Leu Leu Lys Thr
245 250 255
Gln Gly Pro Phe Glu Ala Ile Asn Tyr Tyr His Leu Met Ser Asp Glu
260 265 270
Pro Ile Ala Phe Ser Thr Glu Ser Gly Lys Glu Tyr Ile Phe Pro Asp
275 280 285
Ser Leu Glu Glu Ala Tyr Pro Pro Trp Leu Ser Glu Lys Glu Ala Leu
290 295 300
Glu Lys Glu Asn Arg Tyr Leu Val Ile Asp Gly Gln Gln Phe Leu Trp
305 310 315 320
Pro Val Met Ser Leu Arg Asp Lys Phe Leu Ala Val Leu Gln His Asp
325 330 335
<210> 20
<211> 337
<212> PRT
<213> Bendi cloth Jiao Xingai Bola virus
<400> 20
Ala Lys Leu Thr Glu Ala Ile Thr Ser Thr Ser Ile Leu Lys Thr Gly
1 5 10 15
Arg Arg Tyr Asp Asp Asp Asn Asp Ile Pro Phe Pro Gly Pro Ile Asn
20 25 30
Asp Asn Glu Asn Ser Gly Gln Asn Asp Asp Asp Pro Thr Asp Ser Gln
35 40 45
Asp Thr Thr Ile Pro Asp Val Ile Ile Asp Pro Asn Asp Gly Gly Tyr
50 55 60
Asn Asn Tyr Ser Asp Tyr Ala Asn Asp Ala Ala Ser Ala Pro Asp Asp
65 70 75 80
Leu Val Leu Phe Asp Leu Glu Asp Glu Asp Asp Ala Asp Asn Pro Ala
85 90 95
Gln Asn Thr Pro Glu Lys Asn Asp Arg Pro Ala Thr Thr Lys Leu Arg
100 105 110
Asn Gly Gln Asp Gln Asp Gly Asn Gln Gly Glu Thr Ala Ser Pro Arg
115 120 125
Val Ala Pro Asn Gln Tyr Arg Asp Lys Pro Met Pro Gln Val Gln Asp
130 135 140
Arg Ser Glu Asn His Asp Gln Thr Leu Gln Thr Gln Ser Arg Val Leu
145 150 155 160
Thr Pro Ile Ser Glu Glu Ala Asp Pro Ser Asp His Asn Asp Gly Asp
165 170 175
Asn Glu Ser Ile Pro Pro Leu Glu Ser Asp Asp Glu Gly Ser Thr Asp
180 185 190
Thr Thr Ala Ala Glu Thr Lys Pro Ala Thr Ala Pro Pro Ala Pro Val
195 200 205
Tyr Arg Ser Ile Ser Val Asp Asp Ser Val Pro Ser Glu Asn Ile Pro
210 215 220
Ala Gln Ser Asn Gln Thr Asn Asn Glu Asp Asn Val Arg Asn Asn Ala
225 230 235 240
Gln Ser Glu Gln Ser Ile Ala Glu Met Tyr Gln His Ile Leu Lys Thr
245 250 255
Gln Gly Pro Phe Asp Ala Ile Leu Tyr Tyr His Met Met Lys Glu Glu
260 265 270
Pro Ile Ile Phe Ser Thr Ser Asp Gly Lys Glu Tyr Thr Tyr Pro Asp
275 280 285
Ser Leu Glu Asp Glu Tyr Pro Pro Trp Leu Ser Glu Lys Glu Ala Met
290 295 300
Asn Glu Asp Asn Arg Phe Ile Thr Met Asp Gly Gln Gln Phe Tyr Trp
305 310 315 320
Pro Val Met Asn His Arg Asn Lys Phe Met Ala Ile Leu Gln His His
325 330 335
Arg
<210> 21
<211> 169
<212> PRT
<213> Tayi forest type ebola virus
<400> 21
Leu Val Leu Phe Asp Leu Glu Asp Gly Asp Glu Asp Asp His Arg Pro
1 5 10 15
Ser Ser Ser Ser Glu Asn Asn Asn Lys His Ser Leu Thr Gly Thr Asp
20 25 30
Ser Asn Lys Thr Ser Asn Trp Asn Arg Asn Pro Thr Asn Met Pro Lys
35 40 45
Lys Asp Ser Thr Gln Asn Asn Asp Asn Pro Ala Gln Arg Ala Gln Glu
50 55 60
Tyr Ala Arg Asp Asn Ile Gln Asp Thr Pro Thr Pro His Arg Ala Leu
65 70 75 80
Thr Pro Ile Ser Glu Glu Thr Gly Ser Asn Gly His Asn Glu Asp Asp
85 90 95
Ile Asp Ser Ile Pro Pro Leu Glu Ser Asp Glu Glu Asn Asn Thr Glu
100 105 110
Thr Thr Ile Thr Thr Thr Lys Asn Thr Thr Ala Pro Pro Ala Pro Val
115 120 125
Tyr Arg Ser Asn Ser Glu Lys Glu Pro Leu Pro Gln Glu Lys Ser Gln
130 135 140
Lys Gln Pro Asn Gln Val Ser Gly Ser Glu Asn Thr Asp Asn Lys Pro
145 150 155 160
His Ser Glu Gln Ser Val Glu Glu Met
165
<210> 22
<211> 1604
<212> PRT
<213> artificial sequence
<220>
<223> synthetic construct
<400> 22
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly Asp Ser Arg Pro Gln Lys Ile Trp
20 25 30
Met Ala Pro Ser Leu Thr Glu Ser Asp Met Asp Tyr His Lys Ile Leu
35 40 45
Thr Ala Gly Leu Ser Val Gln Gln Gly Ile Val Arg Gln Arg Val Ile
50 55 60
Pro Val Tyr Gln Val Asn Asn Leu Glu Glu Ile Cys Gln Leu Ile Ile
65 70 75 80
Gln Ala Phe Glu Ala Gly Val Asp Phe Gln Glu Ser Ala Asp Ser Phe
85 90 95
Leu Leu Met Leu Cys Leu His His Ala Tyr Gln Gly Asp Tyr Lys Leu
100 105 110
Phe Leu Glu Ser Gly Ala Val Lys Tyr Leu Glu Gly His Gly Phe Arg
115 120 125
Phe Glu Val Lys Lys Arg Asp Gly Val Lys Arg Leu Glu Glu Leu Leu
130 135 140
Pro Ala Val Ser Ser Gly Lys Asn Ile Lys Arg Thr Leu Ala Ala Met
145 150 155 160
Pro Glu Glu Glu Thr Thr Glu Ala Asn Ala Gly Gln Phe Leu Ser Phe
165 170 175
Ala Ser Leu Phe Leu Pro Lys Leu Val Val Gly Glu Lys Ala Cys Leu
180 185 190
Glu Lys Val Gln Arg Gln Ile Gln Val His Ala Glu Gln Gly Leu Ile
195 200 205
Gln Tyr Pro Thr Ala Trp Gln Ser Val Gly His Met Met Val Ile Phe
210 215 220
Arg Leu Met Arg Thr Asn Phe Leu Ile Lys Phe Leu Leu Ile His Gln
225 230 235 240
Gly Met His Met Val Ala Gly His Asp Ala Asn Asp Ala Val Ile Ser
245 250 255
Asn Ser Val Ala Gln Ala Arg Phe Ser Gly Leu Leu Ile Val Lys Thr
260 265 270
Val Leu Asp His Ile Leu Gln Lys Thr Glu Arg Gly Val Arg Leu His
275 280 285
Pro Leu Ala Arg Thr Ala Lys Val Lys Asn Glu Val Asn Ser Phe Lys
290 295 300
Ala Ala Leu Ser Ser Leu Ala Lys His Gly Glu Tyr Ala Pro Phe Ala
305 310 315 320
Arg Leu Leu Asn Leu Ser Gly Val Asn Asn Leu Glu His Gly Leu Phe
325 330 335
Pro Gln Leu Ser Ala Ile Ala Leu Gly Val Ala Thr Ala His Gly Ser
340 345 350
Thr Leu Ala Gly Val Asn Val Gly Glu Gln Tyr Gln Gln Leu Arg Glu
355 360 365
Ala Ala Thr Glu Ala Glu Lys Gln Leu Gln Gln Tyr Ala Glu Ser Arg
370 375 380
Glu Leu Asp His Leu Gly Leu Asp Asp Gln Glu Lys Lys Ile Leu Met
385 390 395 400
Asn Phe His Gln Lys Lys Asn Glu Ile Ser Phe Gln Gln Thr Asn Ala
405 410 415
Met Val Thr Leu Arg Lys Glu Arg Leu Ala Lys Leu Thr Glu Ala Ile
420 425 430
Thr Ala Ala Ser Leu Pro Lys Thr Ser Gly His Tyr Asp Asp Asp Asp
435 440 445
Asp Ile Pro Phe Pro Gly Pro Ile Asn Asp Asp Asp Asn Pro Gly His
450 455 460
Gln Asp Asp Asp Pro Thr Asp Ser Gln Asp Thr Thr Ile Pro Asp Val
465 470 475 480
Val Val Asp Pro Asp Asp Gly Ser Tyr Gly Glu Tyr Gln Ser Tyr Ser
485 490 495
Glu Asn Gly Met Asn Ala Pro Asp Asp Leu Val Leu Phe Asp Leu Asp
500 505 510
Glu Asp Asp Glu Asp Thr Lys Pro Val Pro Asn Arg Ser Thr Lys Gly
515 520 525
Gly Gln Gln Lys Asn Ser Gln Lys Gly Gln His Ile Glu Gly Arg Gln
530 535 540
Thr Gln Phe Arg Pro Ile Gln Asn Val Pro Gly Pro His Arg Thr Ile
545 550 555 560
His His Ala Ser Ala Pro Leu Thr Asp Asn Asp Arg Arg Asn Glu Pro
565 570 575
Ser Gly Ser Thr Ser Pro Arg Met Leu Thr Pro Ile Asn Glu Glu Ala
580 585 590
Asp Pro Leu Asp Asp Ala Asp Asp Glu Thr Ser Ser Leu Pro Pro Leu
595 600 605
Glu Ser Asp Asp Glu Glu Gln Asp Arg Asp Gly Thr Ser Asn Arg Thr
610 615 620
Pro Thr Val Ala Pro Pro Ala Pro Val Tyr Arg Asp His Ser Glu Lys
625 630 635 640
Lys Glu Leu Pro Gln Asp Glu Gln Gln Asp Gln Asp His Thr Gln Glu
645 650 655
Ala Arg Asn Gln Asp Ser Asp Asn Thr Gln Ser Glu His Ser Leu Glu
660 665 670
Glu Met Tyr Arg His Ile Leu Arg Ser Gln Gly Pro Phe Asp Ala Val
675 680 685
Leu Tyr Tyr His Met Met Lys Asp Glu Pro Val Val Phe Ser Thr Ser
690 695 700
Asp Gly Lys Glu Tyr Thr Tyr Pro Asp Ser Leu Glu Glu Glu Tyr Pro
705 710 715 720
Pro Trp Leu Thr Glu Lys Glu Ala Met Asn Glu Glu Asn Arg Phe Val
725 730 735
Thr Leu Asp Gly Gln Gln Phe Tyr Trp Pro Val Met Asn His Lys Asn
740 745 750
Lys Phe Met Ala Ile Leu Gln His His Gln Ala Lys Leu Thr Glu Ala
755 760 765
Ile Thr Thr Ala Ser Lys Ile Lys Val Gly Asp Arg Tyr Pro Asp Asp
770 775 780
Asn Asp Ile Pro Phe Pro Gly Pro Ile Tyr Asp Asp Thr His Pro Asn
785 790 795 800
Pro Ser Asp Asp Asn Pro Asp Asp Ser Arg Asp Thr Thr Ile Pro Gly
805 810 815
Gly Val Val Asp Pro Tyr Asp Asp Glu Ser Asn Asn Tyr Pro Asp Tyr
820 825 830
Glu Asp Ser Ala Glu Gly Thr Thr Gly Asp Leu Asp Leu Phe Asn Leu
835 840 845
Asp Asp Asp Asp Asp Asp Ser Arg Pro Gly Pro Pro Asp Arg Gly Gln
850 855 860
Asn Lys Glu Arg Ala Ala Arg Thr Tyr Gly Leu Gln Asp Pro Thr Leu
865 870 875 880
Asp Gly Ala Lys Lys Val Pro Glu Leu Thr Pro Gly Ser His Gln Pro
885 890 895
Gly Asn Leu His Ile Thr Lys Ser Gly Ser Asn Thr Asn Gln Pro Gln
900 905 910
Gly Asn Met Ser Ser Thr Leu His Ser Met Thr Pro Ile Gln Glu Glu
915 920 925
Ser Glu Pro Asp Asp Gln Lys Asp Asn Asp Asp Glu Ser Leu Thr Ser
930 935 940
Leu Asp Ser Glu Gly Asp Glu Asp Gly Glu Ser Ile Ser Glu Glu Asn
945 950 955 960
Thr Pro Thr Val Ala Pro Pro Ala Pro Val Tyr Lys Asp Thr Gly Val
965 970 975
Asp Thr Asn Gln Gln Asn Gly Pro Ser Ser Thr Val Asp Ser Gln Gly
980 985 990
Ser Glu Ser Glu Ala Leu Pro Ile Asn Ser Lys Lys Ser Ser Ala Leu
995 1000 1005
Glu Glu Thr Tyr Tyr His Leu Leu Lys Thr Gln Gly Pro Phe Glu Ala
1010 1015 1020
Ile Asn Tyr Tyr His Leu Met Ser Asp Glu Pro Ile Ala Phe Ser Thr
1025 1030 1035 1040
Glu Ser Gly Lys Glu Tyr Ile Phe Pro Asp Ser Leu Glu Glu Ala Tyr
1045 1050 1055
Pro Pro Trp Leu Ser Glu Lys Glu Ala Leu Glu Lys Glu Asn Arg Tyr
1060 1065 1070
Leu Val Ile Asp Gly Gln Gln Phe Leu Trp Pro Val Met Ser Leu Arg
1075 1080 1085
Asp Lys Phe Leu Ala Val Leu Gln His Asp Ala Lys Leu Thr Glu Ala
1090 1095 1100
Ile Thr Ser Thr Ser Ile Leu Lys Thr Gly Arg Arg Tyr Asp Asp Asp
1105 1110 1115 1120
Asn Asp Ile Pro Phe Pro Gly Pro Ile Asn Asp Asn Glu Asn Ser Gly
1125 1130 1135
Gln Asn Asp Asp Asp Pro Thr Asp Ser Gln Asp Thr Thr Ile Pro Asp
1140 1145 1150
Val Ile Ile Asp Pro Asn Asp Gly Gly Tyr Asn Asn Tyr Ser Asp Tyr
1155 1160 1165
Ala Asn Asp Ala Ala Ser Ala Pro Asp Asp Leu Val Leu Phe Asp Leu
1170 1175 1180
Glu Asp Glu Asp Asp Ala Asp Asn Pro Ala Gln Asn Thr Pro Glu Lys
1185 1190 1195 1200
Asn Asp Arg Pro Ala Thr Thr Lys Leu Arg Asn Gly Gln Asp Gln Asp
1205 1210 1215
Gly Asn Gln Gly Glu Thr Ala Ser Pro Arg Val Ala Pro Asn Gln Tyr
1220 1225 1230
Arg Asp Lys Pro Met Pro Gln Val Gln Asp Arg Ser Glu Asn His Asp
1235 1240 1245
Gln Thr Leu Gln Thr Gln Ser Arg Val Leu Thr Pro Ile Ser Glu Glu
1250 1255 1260
Ala Asp Pro Ser Asp His Asn Asp Gly Asp Asn Glu Ser Ile Pro Pro
1265 1270 1275 1280
Leu Glu Ser Asp Asp Glu Gly Ser Thr Asp Thr Thr Ala Ala Glu Thr
1285 1290 1295
Lys Pro Ala Thr Ala Pro Pro Ala Pro Val Tyr Arg Ser Ile Ser Val
1300 1305 1310
Asp Asp Ser Val Pro Ser Glu Asn Ile Pro Ala Gln Ser Asn Gln Thr
1315 1320 1325
Asn Asn Glu Asp Asn Val Arg Asn Asn Ala Gln Ser Glu Gln Ser Ile
1330 1335 1340
Ala Glu Met Tyr Gln His Ile Leu Lys Thr Gln Gly Pro Phe Asp Ala
1345 1350 1355 1360
Ile Leu Tyr Tyr His Met Met Lys Glu Glu Pro Ile Ile Phe Ser Thr
1365 1370 1375
Ser Asp Gly Lys Glu Tyr Thr Tyr Pro Asp Ser Leu Glu Asp Glu Tyr
1380 1385 1390
Pro Pro Trp Leu Ser Glu Lys Glu Ala Met Asn Glu Asp Asn Arg Phe
1395 1400 1405
Ile Thr Met Asp Gly Gln Gln Phe Tyr Trp Pro Val Met Asn His Arg
1410 1415 1420
Asn Lys Phe Met Ala Ile Leu Gln His His Arg Leu Val Leu Phe Asp
1425 1430 1435 1440
Leu Glu Asp Gly Asp Glu Asp Asp His Arg Pro Ser Ser Ser Ser Glu
1445 1450 1455
Asn Asn Asn Lys His Ser Leu Thr Gly Thr Asp Ser Asn Lys Thr Ser
1460 1465 1470
Asn Trp Asn Arg Asn Pro Thr Asn Met Pro Lys Lys Asp Ser Thr Gln
1475 1480 1485
Asn Asn Asp Asn Pro Ala Gln Arg Ala Gln Glu Tyr Ala Arg Asp Asn
1490 1495 1500
Ile Gln Asp Thr Pro Thr Pro His Arg Ala Leu Thr Pro Ile Ser Glu
1505 1510 1515 1520
Glu Thr Gly Ser Asn Gly His Asn Glu Asp Asp Ile Asp Ser Ile Pro
1525 1530 1535
Pro Leu Glu Ser Asp Glu Glu Asn Asn Thr Glu Thr Thr Ile Thr Thr
1540 1545 1550
Thr Lys Asn Thr Thr Ala Pro Pro Ala Pro Val Tyr Arg Ser Asn Ser
1555 1560 1565
Glu Lys Glu Pro Leu Pro Gln Glu Lys Ser Gln Lys Gln Pro Asn Gln
1570 1575 1580
Val Ser Gly Ser Glu Asn Thr Asp Asn Lys Pro His Ser Glu Gln Ser
1585 1590 1595 1600
Val Glu Glu Met
<210> 23
<211> 4815
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 23
atgcccatgg ggtctctgca accgctggcc accttgtacc tgctggggat gctggtcgct 60
tcctgcctcg gagattctcg tcctcagaaa atctggatgg cgccgagtct cactgaatct 120
gacatggatt accacaagat cttgacagca ggtctgtccg ttcaacaggg gattgttcgg 180
caaagagtca tcccagtgta tcaagtaaac aatcttgaag aaatttgcca acttatcata 240
caggcctttg aagcaggtgt tgattttcaa gagagtgcgg acagtttcct tctcatgctt 300
tgtcttcatc atgcgtacca gggagattac aaacttttct tggaaagtgg cgcagtcaag 360
tatttggaag ggcacgggtt ccgttttgaa gtcaagaagc gtgatggagt gaagcgcctt 420
gaggaattgc tgccagcagt atctagtgga aaaaacatta agagaacact tgctgccatg 480
ccggaagagg agacaactga agctaatgcc ggtcagtttc tctcctttgc aagtctattc 540
cttccgaaat tggtagtagg agaaaaggct tgccttgaga aggttcaaag gcaaattcaa 600
gtacatgcag agcaaggact gatacaatat ccaacagctt ggcaatcagt aggacacatg 660
atggtgattt tccgtttgat gcgaacaaat tttctgatca aatttctcct aatacaccaa 720
gggatgcaca tggttgccgg gcatgatgcc aacgatgctg tgatttcaaa ttcagtggct 780
caagctcgtt tttcaggctt attgattgtc aaaacagtac ttgatcatat cctacaaaag 840
acagaacgag gagttcgtct ccatcctctt gcaaggaccg ccaaggtaaa aaatgaggtg 900
aactccttta aggctgcact cagctccctg gccaagcatg gagagtatgc tcctttcgcc 960
cgacttttga acctttctgg agtaaataat cttgagcatg gtcttttccc tcaactatcg 1020
gcaattgcac tcggagtcgc cacagcacac gggagtaccc tcgcaggagt aaatgttgga 1080
gaacagtatc aacaactcag agaggctgcc actgaggctg agaagcaact ccaacaatat 1140
gcagagtctc gcgaacttga ccatcttgga cttgatgatc aggaaaagaa aattcttatg 1200
aacttccatc agaaaaagaa cgaaatcagc ttccagcaaa caaacgctat ggtaactcta 1260
agaaaagagc gcctggccaa gctgacagaa gctatcactg ctgcgtcact gcccaaaaca 1320
agtggacatt acgatgatga tgacgacatt ccctttccag gacccatcaa tgatgacgac 1380
aatcctggcc atcaagatga tgatccgact gactcacagg atacgaccat tcccgatgtg 1440
gtggttgatc ctgatgatgg aagctacggc gaataccaga gttactcgga aaacggcatg 1500
aatgcaccag atgacttggt cctattcgat ctagacgagg acgacgagga cactaagcca 1560
gtgcctaata gatcgaccaa gggtggacaa cagaagaaca gtcaaaaggg ccagcatata 1620
gagggcagac agacacaatt caggccaatt caaaatgtcc caggccctca cagaacaatc 1680
caccacgcca gtgcgccact cacggacaat gacagaagaa atgaaccctc cggctcaacc 1740
agccctcgca tgctgacacc aattaacgaa gaggcagacc cactggacga tgccgacgac 1800
gagacgtcta gccttccgcc cttggagtca gatgatgaag agcaggacag ggacggaact 1860
tccaaccgca cacccactgt cgccccaccg gctcccgtat acagagatca ctctgaaaag 1920
aaagaactcc cgcaagacga gcaacaagat caggaccaca ctcaagaggc caggaaccag 1980
gacagtgaca acacccagtc agaacactcc cttgaggaga tgtatcgcca cattctaaga 2040
tcacaggggc catttgatgc tgttttgtat tatcatatga tgaaggatga gcctgtagtt 2100
ttcagtacca gtgatggcaa agagtacacg tatccagact cccttgaaga ggaatatcca 2160
ccatggctca ctgaaaaaga ggctatgaat gaagagaata gatttgttac attggatggt 2220
caacaatttt attggccggt gatgaatcac aagaataaat tcatggcaat cctgcaacat 2280
catcaggcta aattgaccga agccatcacg actgcatcga agatcaaggt tggagaccgt 2340
tatcctgatg acaatgatat tccatttccc gggccgatct atgatgacac tcaccccaat 2400
ccctctgatg acaatcctga tgattcacgt gatacaacta ttccaggtgg tgttgttgac 2460
ccgtatgatg atgagagtaa taattatcct gactacgagg attcggctga aggcaccaca 2520
ggagatcttg atctcttcaa tttggacgac gacgatgatg acagccgacc aggaccacca 2580
gacagggggc agaacaagga gagggcggcc cggacatatg gcctccaaga tccgaccttg 2640
gacggagcga aaaaggtgcc ggagttgacc ccaggttccc atcaaccagg caacctccac 2700
atcaccaagt cgggttcaaa caccaaccaa ccacaaggca atatgtcatc tactctccat 2760
agtatgaccc ctatacagga agaatcagag cccgatgatc aaaaagataa tgatgacgag 2820
agtctcacat cccttgactc tgaaggtgac gaagatggtg agagcatctc tgaggagaac 2880
accccaactg tagctccacc agcaccagtc tacaaagaca ctggagtaga cactaatcag 2940
cagaatggac caagcagtac tgtagatagt caaggttctg aaagtgaagc tctcccaatc 3000
aactctaaaa agagttccgc actagaagaa acatattatc atctcctaaa aacacagggt 3060
ccatttgagg caatcaatta ttatcaccta atgagtgatg aacccattgc ttttagcact 3120
gaaagtggca aggaatatat ctttccagac tcccttgaag aagcctaccc gccgtggttg 3180
agtgagaagg aggccttaga gaaggaaaat cgttatctgg tcattgatgg ccagcaattc 3240
ctctggccgg taatgagcct acgggacaag ttccttgccg ttcttcaaca tgacgccaaa 3300
ttgaccgaag ctattacttc cacctctatc ctcaaaacag gaaggcggta tgatgatgac 3360
aatgacatac cctttccagg gccaatcaat gataacgaga actctggtca gaacgatgac 3420
gatccaacag actcccagga taccacaatc ccggatgtaa taatcgatcc aaacgatggt 3480
gggtataata attacagcga ttatgcaaat gatgctgcaa gtgctcctga tgacctagtt 3540
ctttttgacc ttgaggacga ggatgatgct gataacccgg ctcaaaacac gccagaaaaa 3600
aatgatagac cagcaacaac aaagctgaga aatggacagg accaggatgg aaaccaaggc 3660
gaaactgcat ccccacgggt agcccccaac caatacagag acaagccaat gccacaagta 3720
caggacagat ccgaaaatca tgaccaaacc cttcaaacac agtccagggt tttgactcct 3780
atcagcgagg aagcagaccc cagcgaccac aacgatggtg acaatgaaag cattcctccc 3840
ctggaatcag acgacgaggg tagcactgat actactgcag cagaaacaaa gcctgccact 3900
gcacctcccg ctcccgtcta ccgaagtatc tccgtagatg attctgtccc ctcagagaac 3960
attcccgcac agtccaatca aacgaacaat gaggacaatg tcaggaacaa tgctcagtcg 4020
gagcaatcca ttgcagaaat gtatcaacat atcttgaaaa cacaaggacc ttttgatgcc 4080
atcctttact accatatgat gaaagaagag cccatcattt tcagcactag tgatgggaag 4140
gagtatacat atccagactc tcttgaagat gagtatccac cctggctcag cgagaaggaa 4200
gccatgaacg aagacaatag attcataacc atggatggtc agcagtttta ctggcctgtg 4260
atgaatcata gaaataaatt catggcaatc ctccagcatc acaggcttgt tctttttgac 4320
cttgaagatg gtgacgagga tgatcaccga ccgtcaagtt catcagagaa caacaacaaa 4380
cacagtctta caggaactga cagtaacaaa acaagtaact ggaatcgaaa cccgactaat 4440
atgccaaaga aagactccac acaaaacaat gacaatcctg cacagcgggc tcaagaatac 4500
gccagggata acatccagga tacaccaaca ccccatcgag ctctaactcc catcagcgaa 4560
gaaaccggct ccaatggtca caatgaagat gacattgata gcatccctcc tttggaatca 4620
gacgaagaaa acaacactga gacaaccatt accaccacaa aaaataccac tgctccacca 4680
gcacctgttt atcggagtaa ttcagaaaag gagcccctcc cgcaagaaaa atcccagaag 4740
caaccaaacc aagtgagtgg tagtgagaat accgacaata aacctcactc agagcaatca 4800
gtggaagaaa tgtaa 4815
<210> 24
<211> 744
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 24
atgcccatgg ggtctctgca accgctggcc accttgtacc tgctggggat gctggtcgct 60
tcctgcctcg gaagagtcca accaacagaa tctattgtta gatttcctaa tattacaaac 120
ttgtgccctt ttgatgaagt ttttaacgcc accagatttg catctgttta tgcttggaac 180
aggaagagaa tcagcaactg tgttgctgat tattctgtcc tatataatct cgcaccattt 240
ttcactttta agtgttatgg agtgtctcct actaaattaa atgatctctg ctttactaat 300
gtctatgcag attcatttgt aattagaggt gatgaagtca gacaaatcgc tccagggcaa 360
actggaaaca ttgctgatta taattataaa ttaccagatg attttacagg ctgcgttata 420
gcttggaatt ctaacaagct tgattctaag gttagtggta attataatta cctgtataga 480
ttgtttagga agtctaatct caaacctttt gagagagata tttcaactga aatctatcag 540
gccggtaaca aaccttgtaa tggtgttgca ggttttaatt gttactttcc tttacgatca 600
tatagtttcc gacccactta tggtgttggt caccaaccat acagagtagt agtactttct 660
tttgaacttc tacatgcacc agcaactgtt tgtggaccta aaaagtctac taatttggtt 720
aaaaacaaat gtgtcaattt ctaa 744
<210> 25
<211> 247
<212> PRT
<213> artificial sequence
<220>
<223> synthetic construct
<400> 25
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly Arg Val Gln Pro Thr Glu Ser Ile
20 25 30
Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Asp Glu Val Phe
35 40 45
Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile
50 55 60
Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Leu Ala Pro Phe
65 70 75 80
Phe Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu
85 90 95
Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu
100 105 110
Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Asn Ile Ala Asp Tyr Asn
115 120 125
Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser
130 135 140
Asn Lys Leu Asp Ser Lys Val Ser Gly Asn Tyr Asn Tyr Leu Tyr Arg
145 150 155 160
Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr
165 170 175
Glu Ile Tyr Gln Ala Gly Asn Lys Pro Cys Asn Gly Val Ala Gly Phe
180 185 190
Asn Cys Tyr Phe Pro Leu Arg Ser Tyr Ser Phe Arg Pro Thr Tyr Gly
195 200 205
Val Gly His Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu
210 215 220
His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val
225 230 235 240
Lys Asn Lys Cys Val Asn Phe
245
<210> 26
<211> 4056
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 26
atgcccatgg ggtctctgca accgctggcc accttgtacc tgctggggat gctggtcgct 60
tcctgcctcg gaagagtcca accaacagaa tctattgtta gatttcctaa tattacaaac 120
ttgtgccctt ttggtgaagt ttttaacgcc accagatttg catctgttta tgcttggaac 180
aggaagagaa tcagcaactg tgttgctgat tattctgtcc tatataattc cgcatcattt 240
tccactttta agtgttatgg agtgtctcct actaaattaa atgatctctg ctttactaat 300
gtctatgcag attcatttgt aattagaggt gatgaagtca gacaaatcgc tccagggcaa 360
actggaaaga ttgctgatta taattataaa ttaccagatg attttacagg ctgcgttata 420
gcttggaatt ctaacaatct tgattctaag gttggtggta attataatta ccggtataga 480
ttgtttagga agtctaatct caaacctttt gagagagata tttcaactga aatctatcag 540
gccggtagca aaccttgtaa tggtgttgaa ggttttaatt gttactttcc tttacaatca 600
tatggtttcc aacccactaa tggtgttggt taccaaccat acagagtagt agtactttct 660
tttgaacttc tacatgcacc agcaactgtt tgtggaccta aaaagtctac taatttggtt 720
aaaaacaaat gtgtcaattt ctcaggtggt ggcggttcag gcggaggtgg ctctggcggt 780
ggcggatcga tgtctgataa tggaccccaa aatcagcgaa atgcaccccg cattacgttt 840
ggtggaccct cagattcaac tggcagtaac cagaatggag aacgcagtgg ggcgcgatca 900
aaacaacgtc ggccccaagg tttacccaat aatactgcgt cttggttcac cgctctcact 960
caacatggca aggaaggcct taaattccct cgaggacaag gcgttccaat taacaccaat 1020
agcagtccag atgaccaaat tggctactac cgaagagcta ccagacgaat tcgtggtggt 1080
gacggtaaaa tgaaagatct cagtccaaga tggtatttct actacctagg aactgggcca 1140
gaagctggac ttccctatgg tgctaacaaa gacggcatca tatgggttgc aactgaggga 1200
gccttgaata caccaaaaga tcacattggc acccgcaatc ctgctaacaa tgctgcaatc 1260
gtgctacaac ttcctcaagg aacaacattg ccaaaaggct tctacgcaga agggagcaga 1320
ggcggcagtc aagcctcttc tcgttcctca tcacgtagtc gcaacagttc aagaaattca 1380
actccaggca gcagtatggg aacttctcct gctagaatgg ctggcaatgg ctgtgatgct 1440
gctcttgctt tgctgctgct tgacagattg aaccagcttg agagcaaaat gtctggtaaa 1500
ggccaacaac aacaaggcca aactgtcact aagaaatctg ctgctgaggc ttctaagaag 1560
cctcggcaaa aacgtactgc cactaaagca tacaatgtaa cacaagcttt cggcagacgt 1620
ggtccagaac aaacccaagg aaattttggg gaccaggaac taatcagaca aggaactgat 1680
tacaaacatt ggccgcaaat tgcacaattt gcccccagcg cttcagcgtt cttcggaatg 1740
tcgcgcattg gcatggaagt cacaccttcg ggaacgtggt tgacctacac aggtgccatc 1800
aaattggatg acaaagatcc aaatttcaaa gatcaagtca ttttgctgaa taagcatatt 1860
gacgcataca aaacattccc accaacagag cctaaaaagg acaaaaagaa gaaggcttat 1920
gaaactcaag ccttaccgca gagacagaag aaacagcaaa ctgtgactct tcttcctgct 1980
gcagatttgg atgatttctc caaacaattg caacaatcca tgagcagtgc tgactcaact 2040
caggccatgg catcccctgc tgcacctcgt gctgtttcct ttgccgataa caatgatata 2100
acaaatacaa acctatctcg aggtagagga cgtaatccaa aaccacgagc tgcaccaaat 2160
aacactgtct cttggtacac tgggcttacc caacacggga aagtccctct tacctttcca 2220
cctgggcagg gtgtacctct taatgccaat tctacccctg cgcaaaatgc tgggtattgg 2280
cggagacagg acagaaaaat taataccggg aatggaatta agcaactggc tcccaggtgg 2340
tacttctact acactggaac tggacccgaa gcagcactcc cattccgggc tgttaaggat 2400
ggcatcgttt gggtccatga agatggcgcc actgatgctc cttcaacttt tgggacgcgg 2460
aaccctaaca atgattcagc tattgttaca caattcgcgc ccggtactaa gcttcctaaa 2520
aacttccaca ttgaggggac tggaggcaat agtcaatcat cttcaagagc ctctagctta 2580
agcagaaact cttccagatc tagttcacaa ggttcaagat caggaaactc tacccgcggc 2640
acttctccag gtccatctgg aatcggagca gtaggaggtg atctacttta ccttgatctt 2700
ctgaacagac tacaagccct tgagtctggc aaagtaaagc aatcgcagcc aaaagtaatc 2760
actaagaaag atgctgctgc tgctaaaaat aagatgcgcc acaagcgcac ttccaccaaa 2820
agtttcaaca tggtgcaagc ttttggtctt cgcggaccag gagacctcca gggaaacttt 2880
ggtgatcttc aattgaataa actcggcact gaggacccac gttggcccca aattgctgag 2940
cttgctccta cagccagtgc ttttatgggt atgtcgcaat ttaaacttac ccatcagaac 3000
aatgatgatc atggcaaccc tgtgtacttc cttcggtaca gtggagccat taaacttgac 3060
ccaaagaatc ccaactacaa taagtggttg gagcttcttg agcaaaatat tgatgcctac 3120
aaaaccttcc ctaagaagga aaagaaacaa aaggcaccaa aagaagaatc aacagaccaa 3180
atgtctgaac ctccaaagga gcagcgtgtg caaggtagca tcactcagcg cactcgcacc 3240
cgtccaagtg ttcagcctgg tccaatgatt gatgttaaca ctgattctgg tggcggtggc 3300
tcgggcggag gtgggtcggg tggcggcgga tcagaagcaa aaccttctgg ctcagttgtg 3360
gaacaggctg aaggtgttga atgtgatttt tcacctcttc tgtctggcac acctcctcag 3420
gtttataatt tcaagcgttt ggtttttacc aattgcaatt ataatcttac caaattgctt 3480
tcactttttt ctgtgaatga ttttacttgt agtcaaatat ctccagcagc aattgctagc 3540
aactgttatt cttcactgat tttggattac ttttcatacc cacttagtat gaaatccgat 3600
ctcagtgtta gttctgctgg tccaatatcc cagtttaatt ataaacagtc cttttctaat 3660
cccacatgtt tgattttagc gactgttcct cataacctta ctactattac taagcctctt 3720
aagtacagct atattaacaa gtgctctcgt cttctttctg atgatcgtac tgaagtacct 3780
cagttagtga acgctaatca atactcaccc tgtgtatcca ttgtcccatc cactgtgtgg 3840
gaagacggtg attattatag gaaacaacta tctccacttg aaggtggtgg ctggcttgtt 3900
gctagtggct caactgttgc catgactgag caattacaga tgggctttgg tattacagtt 3960
caatatggta cagacaccaa tagtgtttgc cccaagcttg aatttgctaa tgacacaaaa 4020
attgcctctc aattaggcaa ttgcgtggaa tattag 4056
<210> 27
<211> 1351
<212> PRT
<213> artificial sequence
<220>
<223> synthetic construct
<400> 27
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly Arg Val Gln Pro Thr Glu Ser Ile
20 25 30
Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe
35 40 45
Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile
50 55 60
Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe
65 70 75 80
Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu
85 90 95
Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu
100 105 110
Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn
115 120 125
Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser
130 135 140
Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Arg Tyr Arg
145 150 155 160
Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr
165 170 175
Glu Ile Tyr Gln Ala Gly Ser Lys Pro Cys Asn Gly Val Glu Gly Phe
180 185 190
Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly
195 200 205
Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu
210 215 220
His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val
225 230 235 240
Lys Asn Lys Cys Val Asn Phe Ser Gly Gly Gly Gly Ser Gly Gly Gly
245 250 255
Gly Ser Gly Gly Gly Gly Ser Met Ser Asp Asn Gly Pro Gln Asn Gln
260 265 270
Arg Asn Ala Pro Arg Ile Thr Phe Gly Gly Pro Ser Asp Ser Thr Gly
275 280 285
Ser Asn Gln Asn Gly Glu Arg Ser Gly Ala Arg Ser Lys Gln Arg Arg
290 295 300
Pro Gln Gly Leu Pro Asn Asn Thr Ala Ser Trp Phe Thr Ala Leu Thr
305 310 315 320
Gln His Gly Lys Glu Gly Leu Lys Phe Pro Arg Gly Gln Gly Val Pro
325 330 335
Ile Asn Thr Asn Ser Ser Pro Asp Asp Gln Ile Gly Tyr Tyr Arg Arg
340 345 350
Ala Thr Arg Arg Ile Arg Gly Gly Asp Gly Lys Met Lys Asp Leu Ser
355 360 365
Pro Arg Trp Tyr Phe Tyr Tyr Leu Gly Thr Gly Pro Glu Ala Gly Leu
370 375 380
Pro Tyr Gly Ala Asn Lys Asp Gly Ile Ile Trp Val Ala Thr Glu Gly
385 390 395 400
Ala Leu Asn Thr Pro Lys Asp His Ile Gly Thr Arg Asn Pro Ala Asn
405 410 415
Asn Ala Ala Ile Val Leu Gln Leu Pro Gln Gly Thr Thr Leu Pro Lys
420 425 430
Gly Phe Tyr Ala Glu Gly Ser Arg Gly Gly Ser Gln Ala Ser Ser Arg
435 440 445
Ser Ser Ser Arg Ser Arg Asn Ser Ser Arg Asn Ser Thr Pro Gly Ser
450 455 460
Ser Met Gly Thr Ser Pro Ala Arg Met Ala Gly Asn Gly Cys Asp Ala
465 470 475 480
Ala Leu Ala Leu Leu Leu Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys
485 490 495
Met Ser Gly Lys Gly Gln Gln Gln Gln Gly Gln Thr Val Thr Lys Lys
500 505 510
Ser Ala Ala Glu Ala Ser Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr
515 520 525
Lys Ala Tyr Asn Val Thr Gln Ala Phe Gly Arg Arg Gly Pro Glu Gln
530 535 540
Thr Gln Gly Asn Phe Gly Asp Gln Glu Leu Ile Arg Gln Gly Thr Asp
545 550 555 560
Tyr Lys His Trp Pro Gln Ile Ala Gln Phe Ala Pro Ser Ala Ser Ala
565 570 575
Phe Phe Gly Met Ser Arg Ile Gly Met Glu Val Thr Pro Ser Gly Thr
580 585 590
Trp Leu Thr Tyr Thr Gly Ala Ile Lys Leu Asp Asp Lys Asp Pro Asn
595 600 605
Phe Lys Asp Gln Val Ile Leu Leu Asn Lys His Ile Asp Ala Tyr Lys
610 615 620
Thr Phe Pro Pro Thr Glu Pro Lys Lys Asp Lys Lys Lys Lys Ala Tyr
625 630 635 640
Glu Thr Gln Ala Leu Pro Gln Arg Gln Lys Lys Gln Gln Thr Val Thr
645 650 655
Leu Leu Pro Ala Ala Asp Leu Asp Asp Phe Ser Lys Gln Leu Gln Gln
660 665 670
Ser Met Ser Ser Ala Asp Ser Thr Gln Ala Met Ala Ser Pro Ala Ala
675 680 685
Pro Arg Ala Val Ser Phe Ala Asp Asn Asn Asp Ile Thr Asn Thr Asn
690 695 700
Leu Ser Arg Gly Arg Gly Arg Asn Pro Lys Pro Arg Ala Ala Pro Asn
705 710 715 720
Asn Thr Val Ser Trp Tyr Thr Gly Leu Thr Gln His Gly Lys Val Pro
725 730 735
Leu Thr Phe Pro Pro Gly Gln Gly Val Pro Leu Asn Ala Asn Ser Thr
740 745 750
Pro Ala Gln Asn Ala Gly Tyr Trp Arg Arg Gln Asp Arg Lys Ile Asn
755 760 765
Thr Gly Asn Gly Ile Lys Gln Leu Ala Pro Arg Trp Tyr Phe Tyr Tyr
770 775 780
Thr Gly Thr Gly Pro Glu Ala Ala Leu Pro Phe Arg Ala Val Lys Asp
785 790 795 800
Gly Ile Val Trp Val His Glu Asp Gly Ala Thr Asp Ala Pro Ser Thr
805 810 815
Phe Gly Thr Arg Asn Pro Asn Asn Asp Ser Ala Ile Val Thr Gln Phe
820 825 830
Ala Pro Gly Thr Lys Leu Pro Lys Asn Phe His Ile Glu Gly Thr Gly
835 840 845
Gly Asn Ser Gln Ser Ser Ser Arg Ala Ser Ser Leu Ser Arg Asn Ser
850 855 860
Ser Arg Ser Ser Ser Gln Gly Ser Arg Ser Gly Asn Ser Thr Arg Gly
865 870 875 880
Thr Ser Pro Gly Pro Ser Gly Ile Gly Ala Val Gly Gly Asp Leu Leu
885 890 895
Tyr Leu Asp Leu Leu Asn Arg Leu Gln Ala Leu Glu Ser Gly Lys Val
900 905 910
Lys Gln Ser Gln Pro Lys Val Ile Thr Lys Lys Asp Ala Ala Ala Ala
915 920 925
Lys Asn Lys Met Arg His Lys Arg Thr Ser Thr Lys Ser Phe Asn Met
930 935 940
Val Gln Ala Phe Gly Leu Arg Gly Pro Gly Asp Leu Gln Gly Asn Phe
945 950 955 960
Gly Asp Leu Gln Leu Asn Lys Leu Gly Thr Glu Asp Pro Arg Trp Pro
965 970 975
Gln Ile Ala Glu Leu Ala Pro Thr Ala Ser Ala Phe Met Gly Met Ser
980 985 990
Gln Phe Lys Leu Thr His Gln Asn Asn Asp Asp His Gly Asn Pro Val
995 1000 1005
Tyr Phe Leu Arg Tyr Ser Gly Ala Ile Lys Leu Asp Pro Lys Asn Pro
1010 1015 1020
Asn Tyr Asn Lys Trp Leu Glu Leu Leu Glu Gln Asn Ile Asp Ala Tyr
1025 1030 1035 1040
Lys Thr Phe Pro Lys Lys Glu Lys Lys Gln Lys Ala Pro Lys Glu Glu
1045 1050 1055
Ser Thr Asp Gln Met Ser Glu Pro Pro Lys Glu Gln Arg Val Gln Gly
1060 1065 1070
Ser Ile Thr Gln Arg Thr Arg Thr Arg Pro Ser Val Gln Pro Gly Pro
1075 1080 1085
Met Ile Asp Val Asn Thr Asp Ser Gly Gly Gly Gly Ser Gly Gly Gly
1090 1095 1100
Gly Ser Gly Gly Gly Gly Ser Glu Ala Lys Pro Ser Gly Ser Val Val
1105 1110 1115 1120
Glu Gln Ala Glu Gly Val Glu Cys Asp Phe Ser Pro Leu Leu Ser Gly
1125 1130 1135
Thr Pro Pro Gln Val Tyr Asn Phe Lys Arg Leu Val Phe Thr Asn Cys
1140 1145 1150
Asn Tyr Asn Leu Thr Lys Leu Leu Ser Leu Phe Ser Val Asn Asp Phe
1155 1160 1165
Thr Cys Ser Gln Ile Ser Pro Ala Ala Ile Ala Ser Asn Cys Tyr Ser
1170 1175 1180
Ser Leu Ile Leu Asp Tyr Phe Ser Tyr Pro Leu Ser Met Lys Ser Asp
1185 1190 1195 1200
Leu Ser Val Ser Ser Ala Gly Pro Ile Ser Gln Phe Asn Tyr Lys Gln
1205 1210 1215
Ser Phe Ser Asn Pro Thr Cys Leu Ile Leu Ala Thr Val Pro His Asn
1220 1225 1230
Leu Thr Thr Ile Thr Lys Pro Leu Lys Tyr Ser Tyr Ile Asn Lys Cys
1235 1240 1245
Ser Arg Leu Leu Ser Asp Asp Arg Thr Glu Val Pro Gln Leu Val Asn
1250 1255 1260
Ala Asn Gln Tyr Ser Pro Cys Val Ser Ile Val Pro Ser Thr Val Trp
1265 1270 1275 1280
Glu Asp Gly Asp Tyr Tyr Arg Lys Gln Leu Ser Pro Leu Glu Gly Gly
1285 1290 1295
Gly Trp Leu Val Ala Ser Gly Ser Thr Val Ala Met Thr Glu Gln Leu
1300 1305 1310
Gln Met Gly Phe Gly Ile Thr Val Gln Tyr Gly Thr Asp Thr Asn Ser
1315 1320 1325
Val Cys Pro Lys Leu Glu Phe Ala Asn Asp Thr Lys Ile Ala Ser Gln
1330 1335 1340
Leu Gly Asn Cys Val Glu Tyr
1345 1350
<210> 28
<211> 4098
<212> DNA
<213> artificial sequence
<220>
<223> synthetic construct
<400> 28
atgcctatgg gcagcctgca gccactggct acactgtacc tgctgggcat gctggtggcc 60
tcttgtctgg gcataggaga gtgtccaaag tatgtcagga gtgcaaaatt aaggatggtt 120
acaggactaa ggaacatccc atccattcaa tccagaggtt tgtttggagc cattgccggt 180
ttcattgaag gggggtggac tggaatggta gatgggtggt atggttatca tcatcagaat 240
gagcaaggat ctggctatgc tgcagatcaa aaaagtacac aaaatgccat taacgggatt 300
acaaacaagg tgaattctgt aattgagaaa atgaacactc aattcacagc tgtgggcaaa 360
gaattcaaca aattggaaag aaggatggaa aacttaaata aaaaagttga tgatgggttt 420
ctagacattt ggacatataa tgcagaattg ttggttctac tggaaaatga aaggactttg 480
gatttccatg actccgccag ccaaggcact aagagaagct acgagcagat ggaaaccgga 540
ggcgaacggc agaacgccac agagatcaga gcctctgtgg gccgtatggt cggcggcatc 600
ggcagattct acatccagat gtgcaccgaa ctgaagctga gcgactacga gggccgcctg 660
atccagaaca gcatcacaat cgagagaatg gtgctgtccg cctttgacga gcggagaaac 720
aaatacctgg aagagcaccc tagcgccgga aaagatccta agaaaaccgg cggacctatc 780
tacagaagaa gagatggtaa gtgggtgaga gagctgattc tgtacgataa ggaagagatt 840
cgaagaatct ggagacaggc caacaacggc gaggatgcca ccgcaggcct gacacacctg 900
atgatctggc acagcaacct gaacgatgcg acctaccagc gcacgcgggc cctggtcaga 960
accggcatgg atcctcggat gtgtagcctg atgcagggca gcacactgcc aagacggagt 1020
ggggccgccg gcgctgcagt gaagggcgtc ggaaccatgg tgatggagct gatccggatg 1080
ataaagcggg gcatcaacga cagaaacttc tggcgaggcg agaacggccg aagaacccgg 1140
atcgcctacg agagaatgtg caacatcctg aaaggaaaat tccagaccgc cgcccagcgg 1200
gccatgatgg accaggtgcg cgagagcaga aaccccggca atgccgagat cgaggacctg 1260
atcttcctgg ccagaagcgc cctcattctt agaggctctg tggcccacaa gagctgtctg 1320
cctgcctgtg tgtacggcct ggcagtggcc tcaggctacg acttcgagcg ggaaggatac 1380
agtctggtgg gcatcgaccc tttcagactc ctgcagaata gccaggtgtt tagcctgatc 1440
agaccaaacg aaaaccccgc ccataagagc cagctggtgt ggatggcctg ccacagcgcc 1500
gcctttgagg atctgagagt gagctctttt atcagaggca cccgggtggt tccacgaggt 1560
caactgtcta caagaggtgt gcagatcgcc agcaacgaga acatggagac catggatagc 1620
agcaccctgg aactgagatc cagatactgg gccatcagga cacggagcgg cggcaccacc 1680
aatcagcagc gcgccagcgc cggccagatc tctgtccagc ctacgtttag cgtgcagcgg 1740
aatttgccct tcgaacgcgc cacaatcatg gctgctttca ccggcaatac agagggcaga 1800
accagcgata tgagaacaga aattatccgt atgatggagt ccgcaaaacc tgaggacgtg 1860
tccttccaag gcagaggcgt gttcgagctg agcgacgaga aggccaccaa ccctatcgtg 1920
cctagcttcg atatgtctaa tgagggcagc tactttttcg gagataacgc cgaagagtac 1980
gacaacatgt ctaatatgga tatcgacggc attaacaccg gcaccatcga caaaacccct 2040
gaggagatca cccctggcac cagcggcaca acccggccca tcatccgccc cgctacactg 2100
gctccaccta gcaacaagcg gaccagaaat ccctcgccag aaagagccac aacctccagc 2160
gaggacgacg tgggacggaa gacacaaaag aagcagaccc ctacagagat caagaagtct 2220
gtttacaaca tggtggtgaa actgggcgag ttctacaacc agatgatggt gaaggccggc 2280
ctgaacgacg atatggaaag aaatctgatc cagaacgccc acgccgtgga gcggattctg 2340
ctggccgcca ccgatgataa gaagaccgaa ttccagaaaa agaaaaacgc cagagacgtg 2400
aaggaaggca aggaagagat cgaccacaac aagacaggcg gcacattcta caagatggtc 2460
cgggacgaca agaccatcta cttcagccct atccggataa cattcctgaa agaagaagtg 2520
aagaccatgt acaaaaccac aatgggctct gacggcttca gcggcctgaa tcacatcatg 2580
atcggccact ctcaaatgaa cgatgtgtgc ttccagagaa gcaaggctct gaagcgcgtg 2640
ggcctggatc ctagcctgat ctctaccttc gccggcagca ccatccccag aagatcgggc 2700
gctaccggcg tggctatcaa gggaggaggc acactggtgg ctgaagccat cagattcatc 2760
ggaagagcca tggccgacag aggactcctg agagatatca aagccaaaac cgcctacgaa 2820
aaaatcctgc tgaacctgaa gaacaagtgc agcgcgcctc aacagaaggc cctggtggac 2880
caggttatcg gctctagaaa ccctggaatc gccgatatcg aggacctgac actgctggcc 2940
agatctatgg tggtggtgag accctccgtg gccagcaagg tggtgctgcc tatcagcatc 3000
tacgccaaga tccctcagct gggatttaac gtggaagaat acagcatggt tggttatgag 3060
gccatggccc tgtacaacat ggccacacct gtgtccatcc tgagaatggg cgacgatgcc 3120
aaagacaaga gccagctgtt cttcatgagc tgcttcggcg ctgcctatga ggacctgaga 3180
gtgctgtccg ctcttacagg aacagagttc aagcctagga gcgcactgaa gtgcaagggc 3240
ttccacgtgc ccgccaagga acaggtggaa ggcatgggag ctgctctgat gtccatcaag 3300
ctgcaatttt gggctcctat gacccggagc ggcggaaatg aggtgggtgg cgacggaggc 3360
agcggacaga tttcttgcag ccccgtattt gccgtggaga gaccaatcgc cctgtccaag 3420
caggccgtga gaagaatgct gagcatgaac atcgagggcc gggacgccga cgtgaagggc 3480
aacctgttga agatgatgaa cgacagcatg gccaagaaga ccagtggcaa tgccttcatc 3540
ggcaagaaga tgttccagat ctccgacaag aacaagacca accccgtgga aatccccatc 3600
aagcagacaa tccctaactt cttcttcggc agagacaccg ccgaagacta tgacgacctg 3660
gactacatag gaaattgccc aatatgggtg aaaacacctt tgaagcttgc caatggaacc 3720
aaatatagac ctcctgcaaa actattaaag gaaaggggtt tcttcggagc tattgctggt 3780
ttcctagaag gaggatggga aggaatgatt gcaggctggc acggatacac atctcacgga 3840
gcacatggag tggcagtggc ggcggacctt aagagtacgc aagaagctat aaacaagata 3900
acaaaaaatc tcaattcttt gagtgagcta gaagtaaaga atcttcaaag actaagtggt 3960
gccatggatg aactccacaa cgaaatactc gagctggatg agaaagtgga tgatctcaga 4020
gctgacacta taagctcgca aatagaactt gcagtcttgc tttccaacga aggaataata 4080
aacagtgaag atgagtga 4098
<210> 29
<211> 1365
<212> PRT
<213> artificial sequence
<220>
<223> synthetic construct
<400> 29
Met Pro Met Gly Ser Leu Gln Pro Leu Ala Thr Leu Tyr Leu Leu Gly
1 5 10 15
Met Leu Val Ala Ser Cys Leu Gly Ile Gly Glu Cys Pro Lys Tyr Val
20 25 30
Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
35 40 45
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
50 55 60
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
65 70 75 80
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
85 90 95
Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn
100 105 110
Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Arg Arg
115 120 125
Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
130 135 140
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
145 150 155 160
Asp Phe His Asp Ser Ala Ser Gln Gly Thr Lys Arg Ser Tyr Glu Gln
165 170 175
Met Glu Thr Gly Gly Glu Arg Gln Asn Ala Thr Glu Ile Arg Ala Ser
180 185 190
Val Gly Arg Met Val Gly Gly Ile Gly Arg Phe Tyr Ile Gln Met Cys
195 200 205
Thr Glu Leu Lys Leu Ser Asp Tyr Glu Gly Arg Leu Ile Gln Asn Ser
210 215 220
Ile Thr Ile Glu Arg Met Val Leu Ser Ala Phe Asp Glu Arg Arg Asn
225 230 235 240
Lys Tyr Leu Glu Glu His Pro Ser Ala Gly Lys Asp Pro Lys Lys Thr
245 250 255
Gly Gly Pro Ile Tyr Arg Arg Arg Asp Gly Lys Trp Val Arg Glu Leu
260 265 270
Ile Leu Tyr Asp Lys Glu Glu Ile Arg Arg Ile Trp Arg Gln Ala Asn
275 280 285
Asn Gly Glu Asp Ala Thr Ala Gly Leu Thr His Leu Met Ile Trp His
290 295 300
Ser Asn Leu Asn Asp Ala Thr Tyr Gln Arg Thr Arg Ala Leu Val Arg
305 310 315 320
Thr Gly Met Asp Pro Arg Met Cys Ser Leu Met Gln Gly Ser Thr Leu
325 330 335
Pro Arg Arg Ser Gly Ala Ala Gly Ala Ala Val Lys Gly Val Gly Thr
340 345 350
Met Val Met Glu Leu Ile Arg Met Ile Lys Arg Gly Ile Asn Asp Arg
355 360 365
Asn Phe Trp Arg Gly Glu Asn Gly Arg Arg Thr Arg Ile Ala Tyr Glu
370 375 380
Arg Met Cys Asn Ile Leu Lys Gly Lys Phe Gln Thr Ala Ala Gln Arg
385 390 395 400
Ala Met Met Asp Gln Val Arg Glu Ser Arg Asn Pro Gly Asn Ala Glu
405 410 415
Ile Glu Asp Leu Ile Phe Leu Ala Arg Ser Ala Leu Ile Leu Arg Gly
420 425 430
Ser Val Ala His Lys Ser Cys Leu Pro Ala Cys Val Tyr Gly Leu Ala
435 440 445
Val Ala Ser Gly Tyr Asp Phe Glu Arg Glu Gly Tyr Ser Leu Val Gly
450 455 460
Ile Asp Pro Phe Arg Leu Leu Gln Asn Ser Gln Val Phe Ser Leu Ile
465 470 475 480
Arg Pro Asn Glu Asn Pro Ala His Lys Ser Gln Leu Val Trp Met Ala
485 490 495
Cys His Ser Ala Ala Phe Glu Asp Leu Arg Val Ser Ser Phe Ile Arg
500 505 510
Gly Thr Arg Val Val Pro Arg Gly Gln Leu Ser Thr Arg Gly Val Gln
515 520 525
Ile Ala Ser Asn Glu Asn Met Glu Thr Met Asp Ser Ser Thr Leu Glu
530 535 540
Leu Arg Ser Arg Tyr Trp Ala Ile Arg Thr Arg Ser Gly Gly Thr Thr
545 550 555 560
Asn Gln Gln Arg Ala Ser Ala Gly Gln Ile Ser Val Gln Pro Thr Phe
565 570 575
Ser Val Gln Arg Asn Leu Pro Phe Glu Arg Ala Thr Ile Met Ala Ala
580 585 590
Phe Thr Gly Asn Thr Glu Gly Arg Thr Ser Asp Met Arg Thr Glu Ile
595 600 605
Ile Arg Met Met Glu Ser Ala Lys Pro Glu Asp Val Ser Phe Gln Gly
610 615 620
Arg Gly Val Phe Glu Leu Ser Asp Glu Lys Ala Thr Asn Pro Ile Val
625 630 635 640
Pro Ser Phe Asp Met Ser Asn Glu Gly Ser Tyr Phe Phe Gly Asp Asn
645 650 655
Ala Glu Glu Tyr Asp Asn Met Ser Asn Met Asp Ile Asp Gly Ile Asn
660 665 670
Thr Gly Thr Ile Asp Lys Thr Pro Glu Glu Ile Thr Pro Gly Thr Ser
675 680 685
Gly Thr Thr Arg Pro Ile Ile Arg Pro Ala Thr Leu Ala Pro Pro Ser
690 695 700
Asn Lys Arg Thr Arg Asn Pro Ser Pro Glu Arg Ala Thr Thr Ser Ser
705 710 715 720
Glu Asp Asp Val Gly Arg Lys Thr Gln Lys Lys Gln Thr Pro Thr Glu
725 730 735
Ile Lys Lys Ser Val Tyr Asn Met Val Val Lys Leu Gly Glu Phe Tyr
740 745 750
Asn Gln Met Met Val Lys Ala Gly Leu Asn Asp Asp Met Glu Arg Asn
755 760 765
Leu Ile Gln Asn Ala His Ala Val Glu Arg Ile Leu Leu Ala Ala Thr
770 775 780
Asp Asp Lys Lys Thr Glu Phe Gln Lys Lys Lys Asn Ala Arg Asp Val
785 790 795 800
Lys Glu Gly Lys Glu Glu Ile Asp His Asn Lys Thr Gly Gly Thr Phe
805 810 815
Tyr Lys Met Val Arg Asp Asp Lys Thr Ile Tyr Phe Ser Pro Ile Arg
820 825 830
Ile Thr Phe Leu Lys Glu Glu Val Lys Thr Met Tyr Lys Thr Thr Met
835 840 845
Gly Ser Asp Gly Phe Ser Gly Leu Asn His Ile Met Ile Gly His Ser
850 855 860
Gln Met Asn Asp Val Cys Phe Gln Arg Ser Lys Ala Leu Lys Arg Val
865 870 875 880
Gly Leu Asp Pro Ser Leu Ile Ser Thr Phe Ala Gly Ser Thr Ile Pro
885 890 895
Arg Arg Ser Gly Ala Thr Gly Val Ala Ile Lys Gly Gly Gly Thr Leu
900 905 910
Val Ala Glu Ala Ile Arg Phe Ile Gly Arg Ala Met Ala Asp Arg Gly
915 920 925
Leu Leu Arg Asp Ile Lys Ala Lys Thr Ala Tyr Glu Lys Ile Leu Leu
930 935 940
Asn Leu Lys Asn Lys Cys Ser Ala Pro Gln Gln Lys Ala Leu Val Asp
945 950 955 960
Gln Val Ile Gly Ser Arg Asn Pro Gly Ile Ala Asp Ile Glu Asp Leu
965 970 975
Thr Leu Leu Ala Arg Ser Met Val Val Val Arg Pro Ser Val Ala Ser
980 985 990
Lys Val Val Leu Pro Ile Ser Ile Tyr Ala Lys Ile Pro Gln Leu Gly
995 1000 1005
Phe Asn Val Glu Glu Tyr Ser Met Val Gly Tyr Glu Ala Met Ala Leu
1010 1015 1020
Tyr Asn Met Ala Thr Pro Val Ser Ile Leu Arg Met Gly Asp Asp Ala
1025 1030 1035 1040
Lys Asp Lys Ser Gln Leu Phe Phe Met Ser Cys Phe Gly Ala Ala Tyr
1045 1050 1055
Glu Asp Leu Arg Val Leu Ser Ala Leu Thr Gly Thr Glu Phe Lys Pro
1060 1065 1070
Arg Ser Ala Leu Lys Cys Lys Gly Phe His Val Pro Ala Lys Glu Gln
1075 1080 1085
Val Glu Gly Met Gly Ala Ala Leu Met Ser Ile Lys Leu Gln Phe Trp
1090 1095 1100
Ala Pro Met Thr Arg Ser Gly Gly Asn Glu Val Gly Gly Asp Gly Gly
1105 1110 1115 1120
Ser Gly Gln Ile Ser Cys Ser Pro Val Phe Ala Val Glu Arg Pro Ile
1125 1130 1135
Ala Leu Ser Lys Gln Ala Val Arg Arg Met Leu Ser Met Asn Ile Glu
1140 1145 1150
Gly Arg Asp Ala Asp Val Lys Gly Asn Leu Leu Lys Met Met Asn Asp
1155 1160 1165
Ser Met Ala Lys Lys Thr Ser Gly Asn Ala Phe Ile Gly Lys Lys Met
1170 1175 1180
Phe Gln Ile Ser Asp Lys Asn Lys Thr Asn Pro Val Glu Ile Pro Ile
1185 1190 1195 1200
Lys Gln Thr Ile Pro Asn Phe Phe Phe Gly Arg Asp Thr Ala Glu Asp
1205 1210 1215
Tyr Asp Asp Leu Asp Tyr Ile Gly Asn Cys Pro Ile Trp Val Lys Thr
1220 1225 1230
Pro Leu Lys Leu Ala Asn Gly Thr Lys Tyr Arg Pro Pro Ala Lys Leu
1235 1240 1245
Leu Lys Glu Arg Gly Phe Phe Gly Ala Ile Ala Gly Phe Leu Glu Gly
1250 1255 1260
Gly Trp Glu Gly Met Ile Ala Gly Trp His Gly Tyr Thr Ser His Gly
1265 1270 1275 1280
Ala His Gly Val Ala Val Ala Ala Asp Leu Lys Ser Thr Gln Glu Ala
1285 1290 1295
Ile Asn Lys Ile Thr Lys Asn Leu Asn Ser Leu Ser Glu Leu Glu Val
1300 1305 1310
Lys Asn Leu Gln Arg Leu Ser Gly Ala Met Asp Glu Leu His Asn Glu
1315 1320 1325
Ile Leu Glu Leu Asp Glu Lys Val Asp Asp Leu Arg Ala Asp Thr Ile
1330 1335 1340
Ser Ser Gln Ile Glu Leu Ala Val Leu Leu Ser Asn Glu Gly Ile Ile
1345 1350 1355 1360
Asn Ser Glu Asp Glu
1365
Claims (107)
1. A composition for stimulating an immune response against a coronavirus in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide; and
(ii) Nucleotide sequence encoding a nucleocapsid protein of a coronavirus.
2. The composition of claim 1, wherein the coronavirus is a beta coronavirus, optionally wherein the beta coronavirus is a human beta coronavirus.
3. The composition of claim 2, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1), middle east respiratory syndrome related coronavirus (MERS-CoV), or a combination thereof.
4. The composition of claim 3, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2).
5. The composition of claim 4, wherein the coronavirus nucleocapsid protein comprises a first nucleocapsid protein and a second nucleocapsid protein, wherein the first nucleocapsid protein is a SARS-CoV-2 nucleocapsid protein from a first variant of a first clade and the second nucleocapsid protein is a SARS-CoV-2 nucleocapsid protein from a second variant of a second clade, and wherein the first clade and the second clade are different clades defined by one or more of the world health organization, pango, GISAID, and Nextstrain.
6. A composition for stimulating an immune response against a coronavirus in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide; and
(ii) Nucleotide sequences encoding two or more coronavirus nucleocapsid proteins.
7. The composition of claim 6, wherein the coronavirus is a beta coronavirus, optionally wherein the beta coronavirus is a human beta coronavirus.
8. The composition of claim 7, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1), middle east respiratory syndrome related coronavirus (MERS-CoV), or a combination thereof.
9. The composition of claim 8, wherein the beta coronavirus comprises severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2).
10. The composition of claim 9, wherein the two or more coronavirus nucleocapsid proteins comprise a SARS-CoV-2 nucleocapsid protein and a MERS nucleocapsid protein.
11. The composition of claim 9, wherein the two or more coronavirus nucleocapsid proteins comprise a SARS-CoV-2 nucleocapsid protein, a SARS-CoV-1 nucleocapsid protein, and MERS nucleocapsid protein.
12. The composition of any one of claims 6-11, wherein the two or more coronavirus nucleocapsid proteins are separated by a linker of one to ten residues in length.
13. The composition of any one of claims 1-12, wherein the mammalian signal peptide is a signal peptide of a surface protein expressed in mammalian antigen presenting cells.
14. The composition of claim 13, wherein the mammalian signal peptide is a CD5 signal peptide and the amino acid sequence of the CD5 signal peptide comprises SEQ ID No. 8 or an amino acid sequence at least 90% or 95% identical to SEQ ID No. 8.
15. The composition of any one of claims 1-14, wherein the amino acid sequence of the nucleocapsid protein comprises residues 2-419 of SEQ ID No. 5 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to residues 2-419 of SEQ ID No. 5.
16. The composition of any one of claims 1-14, wherein the amino acid sequence of the fusion protein comprises SEQ ID No. 6 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID No. 6.
17. The composition of any one of claims 6-14, wherein the amino acid sequence of the fusion protein comprises SEQ ID No. 7 or an amino acid sequence at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID No. 7.
18. The composition of claim 16, wherein the open reading frame comprises the nucleotide sequence of SEQ ID No. 2.
19. The composition of claim 17, wherein the open reading frame comprises the nucleotide sequence of SEQ ID No. 3 or SEQ ID No. 4.
20. The composition of any one of claims 1-14, wherein the amino acid sequence of the fusion protein comprises residues 2-413 of SEQ ID No. 9 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to residues 2-413 of SEQ ID No. 9.
21. The composition of any one of claims 1-14, wherein the amino acid sequence of the fusion protein comprises residues 2-422 of SEQ ID No. 10 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to residues 2-422 of SEQ ID No. 10.
22. The composition of any one of claims 1-21, wherein the composition does not comprise a liposome or a lipid nanoparticle.
23. The composition of any one of claims 1-22, wherein the mRNA is a self-replicating mRNA.
24. The composition of claim 23, wherein the self-replicating RNA comprises an alphavirus replicon lacking viral structural protein coding regions.
25. The composition of claim 24, wherein the alphavirus is selected from the group consisting of venezuelan equine encephalitis virus, sindbis virus, and semliki forest virus.
26. The composition of claim 25, wherein the alphavirus is venezuelan equine encephalitis virus.
27. The composition of any one of claims 23-26, wherein the alphavirus replicon comprises a non-structural protein coding region in which 12-18 nucleotides are inserted resulting in expression of nsP2 comprising 4 to 6 additional amino acids between β -sheet 4 and β -sheet 6 of non-structural protein 2 (nsP 2).
28. The composition of any one of claims 1-27, wherein the self-replicating mRNA is a temperature-sensitive agent (ts agent) that is capable of expressing the fusion at a permissive temperature and not expressing the fusion at an non-permissive temperature.
29. The composition of claim 28, wherein the permissible temperature is 31 ℃ to 35 ℃ and the impermissible temperature is at least 37 ℃ ± 0.5 ℃.
30. A method for stimulating an immune response against a coronavirus in a mammalian subject, the method comprising administering the composition of any one of claims 1-29 to a mammalian subject to stimulate an immune response against the coronavirus nucleocapsid protein in the mammalian subject.
31. The method of claim 30, wherein the composition is administered intradermally.
32. The method of claim 30 or claim 31, wherein the immune response comprises a coronavirus-reactive cellular immune response.
33. The method of claim 32, wherein the immune response further comprises a coronavirus-reactive humoral immune response.
34. The method of any one of claims 30-33, wherein the mammalian subject is a human subject.
35. A kit, the kit comprising:
the composition of any one of claims 1-29 or any one of claims 37-62; and
a device for intradermal delivery of the composition to a mammalian subject.
36. The kit of claim 35, wherein the device comprises a syringe and a needle.
37. A composition for stimulating an immune response against two or more viruses in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide; and
(ii) Nucleotide sequences encoding a first nucleocapsid protein of a first virus and a second nucleocapsid protein of a second virus.
38. The composition of claim 37, wherein the first virus and the second virus are capable of causing a disease upon infection of a human subject.
39. The composition of claim 38, wherein the first virus and the second virus are different varieties, subtypes or lineages of the same species.
40. The composition of claim 38, wherein the first virus and the second virus are different species of the same genus.
41. The composition of claim 40, wherein both the first virus and the second virus are members of the genus beta coronavirus.
42. The composition of claim 41, wherein the first virus and the second virus comprise Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) and middle east respiratory syndrome associated coronavirus (MERS-CoV).
43. The composition of claim 38, wherein the first virus and the second virus are members of different families, orders, classes, or gates of the same kingdom.
44. The composition of claim 43, wherein both the first virus and the second virus are members of the orthomyxoviridae family.
45. The composition of claim 44, wherein the first virus and the second virus comprise influenza A virus and influenza B virus.
46. The composition of claim 45, wherein the amino acid sequence of the fusion protein comprises SEQ ID NO 16 or an amino acid sequence at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID NO 16.
47. The composition of claim 38, wherein both the first virus and the second virus are members of the orthoriboviridae kingdom, optionally wherein the first virus and the second virus comprise: (a) Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1) or middle east respiratory syndrome related coronavirus (MERS-CoV); and (b) influenza a virus or influenza b virus.
48. The composition of claim 40, wherein both the first virus and the second virus are members of the genus ebola, optionally wherein the first virus and the second virus are selected from the group consisting of zaire-type ebola virus, sudan-type ebola virus, bund Jiao Xingai ebola virus, and tay forest-type ebola virus.
49. The composition of claim 48, wherein the nucleotide sequence further encodes a third nucleocapsid protein of a third virus and a fourth nucleocapsid protein of a fourth virus, and the first virus, the second virus, the third virus, and the fourth virus are zaire-type ebola virus, sudan-type ebola virus, bundi Jiao Xingai bola virus, and tay forest-type ebola virus.
50. The composition of claim 49, wherein the amino acid sequence of the fusion protein comprises SEQ ID NO. 22 or an amino acid sequence that is at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID NO. 22.
51. The composition of claim 49, wherein the nucleotide sequence (ii) encodes a shared portion of a first nucleocapsid protein of the first virus for stimulating an immune response against all of the first virus, the second virus, the third virus, and the fourth virus.
52. The composition of claim 51, wherein the nucleotide sequence (ii) encodes a separate portion of the first nucleocapsid protein, the second nucleocapsid protein, the third nucleocapsid protein, and the fourth nucleocapsid protein each for stimulating an immune response against all of the first virus, the second virus, the third virus, and the fourth virus.
53. The composition of claim 52, wherein the nucleotide sequence (ii) encodes a fragment of a separate portion of a second nucleocapsid protein of the second virus for stimulating an immune response against the second virus and the third virus.
54. The composition of claim 37, wherein the nucleotide sequence (ii) encodes a shared portion of a first nucleocapsid protein of the first virus for stimulating an immune response against both the first virus and the second virus.
55. The composition of claim 54, wherein the nucleotide sequence (ii) encodes separate portions of the first nucleocapsid protein and the second nucleocapsid protein each for stimulating an immune response against both the first virus and the second virus.
56. The composition of any one of claims 37-48, wherein the nucleotide sequence (ii) further encodes at least one other nucleocapsid protein of at least one other virus, and wherein the at least one other virus is different from the first virus and the second virus.
57. The composition of any one of claims 37-56, wherein the first nucleocapsid protein and the second nucleocapsid protein, or the first nucleocapsid protein, the second nucleocapsid protein and the other nucleocapsid proteins are separated by a linker of one to ten residues in length.
58. The composition of any one of claims 37-57, wherein the mammalian signal peptide is a signal peptide of a surface protein expressed in mammalian antigen presenting cells.
59. The composition of any one of claims 37-58, wherein the mRNA is a self-replicating mRNA.
60. The composition of claim 59, wherein the self-replicating mRNA is a temperature-sensitive agent (ts agent) that is capable of expressing the fusion protein at a permissive temperature and not expressing the fusion protein at an non-permissive temperature.
61. The composition of claim 60, wherein the permissible temperature is 31 ℃ to 35 ℃ and the impermissible temperature is at least 37 ℃ ± 0.5 ℃.
62. The composition of any one of claims 1-29 or any one of claims 37-61, wherein the composition further comprises chitosan.
63. A method for stimulating an immune response against two or more viruses in a mammalian subject, the method comprising administering to a mammalian subject the composition of any one of claims 37-62 to stimulate an immune response against nucleocapsid proteins of the two or more viruses in the mammalian subject.
64. The method of claim 63, wherein the composition is administered intradermally.
65. The method of claim 63 or claim 64, wherein the immune response comprises a cellular immune response reactive to the two or more viruses.
66. The method of claim 65, wherein the cellular immune response comprises a nucleocapsid protein specific helper T lymphocyte (Th) response comprising nucleocapsid protein specific cytokine secretion.
67. The method of claim 66, wherein the nucleocapsid protein specific cytokine secretion comprises secretion of one or both of interferon-gamma and interleukin-4.
68. The method of claim 65, wherein the cellular immune response comprises a nucleocapsid protein specific Cytotoxic T Lymphocyte (CTL) response.
69. The method of any one of claims 65-68, wherein the immune response further comprises a humoral immune response reactive against the two or more viruses.
70. The method of any one of claims 63-69, wherein the mammalian subject is a human subject.
71. A composition for stimulating an immune response against a virus in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide;
(ii) A nucleotide sequence encoding a first viral antigen of a first virus or a fragment thereof; and
(iii) A nucleotide sequence encoding a second viral antigen of said first virus or second virus or a fragment thereof,
wherein the first viral antigen is a nucleocapsid protein and the second viral antigen is a surface protein, or the first viral antigen is a surface protein and the second viral antigen is a nucleocapsid protein.
72. A composition for stimulating an immune response against two or more viruses in a mammalian subject, the composition comprising an excipient and a messenger RNA (mRNA) comprising an Open Reading Frame (ORF) encoding a fusion protein, wherein the ORF comprises from 5 'to 3':
(i) A nucleotide sequence encoding a mammalian signal peptide;
(ii) A nucleotide sequence encoding a first viral antigen of a first virus or a fragment thereof;
(iii) A nucleotide sequence encoding a second viral antigen of the first virus or a fragment thereof;
(iv) A nucleotide sequence encoding a third viral antigen of a second virus or a fragment thereof;
(iii) A nucleotide sequence encoding a fourth viral antigen of said second virus or a fragment thereof,
wherein the first viral antigen is a first nucleocapsid protein and the second viral antigen is a first surface protein, or the first viral antigen is a first surface protein and the second viral antigen is a first nucleocapsid protein, and
wherein the third viral antigen is a second nucleocapsid protein and the fourth viral antigen is a second surface protein, or the third viral antigen is a second surface protein and the fourth viral antigen is a second nucleocapsid protein.
73. The composition of claim 71 or claim 72, wherein the mRNA is a self-replicating mRNA.
74. The composition of claim 73, wherein the self-replicating RNA comprises an alphavirus replicon lacking viral structural protein coding regions.
75. The composition of claim 74, wherein said alphavirus is selected from the group consisting of venezuelan equine encephalitis virus, sindbis virus, and semliki forest virus.
76. The composition of claim 74, wherein said alphavirus is venezuelan equine encephalitis virus.
77. The composition of any one of claims 73-76, wherein the self-replicating mRNA is a temperature-sensitive agent (ts agent) that is capable of expressing the fusion protein at a permissive temperature and not expressing the fusion protein at an non-permissive temperature.
78. The composition of claim 77, wherein said permissible temperature is 31 ℃ to 35 ℃ and said impermissible temperature is at least 37 ℃ ± 0.5 ℃.
79. The composition of any one of claims 74-78, wherein said alphavirus replicon comprises a non-structural protein coding region in which 12-18 nucleotides are inserted resulting in expression of nsP2 comprising 4 to 6 additional amino acids between β -sheet 4 and β -sheet 6 of non-structural protein 2 (nsP 2).
80. The composition of any one of claims 71-79, wherein the first virus and/or the second virus is a coronavirus, optionally wherein the coronavirus is a beta coronavirus, optionally wherein the beta coronavirus is a human beta coronavirus.
81. The composition of claim 80, wherein said first virus and/or said second virus is a beta coronavirus independently selected from the group consisting of: severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), severe acute respiratory syndrome coronavirus-1 (SARS-CoV-1), and middle east respiratory syndrome related coronavirus (MERS-CoV).
82. The composition of claim 80, wherein the first virus is SARS-CoV-2 and the second virus is MERS-CoV.
83. The composition of any one of claims 80-82, wherein each of the surface protein, the first surface protein, and/or the second surface protein comprises a Receptor Binding Domain (RBD) of a coronavirus spike protein.
84. The composition of claim 83, wherein the amino acid sequence of the fusion protein comprises SEQ ID No. 27 or an amino acid sequence at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID No. 27.
85. The composition of any one of claims 71-79, wherein said first virus and/or said second virus is a member of the orthomyxoviridae family.
86. The composition of claim 85, wherein said first virus and/or said second virus are independently selected from the group consisting of Influenza A Virus (IAV) and Influenza B Virus (IBV).
87. The composition of claim 86, wherein the first virus is an IAV and the second virus is an IBV.
88. The composition of any one of claims 85-87, wherein the surface protein, the first surface protein, and/or the second surface protein each comprise a portion of influenza hemagglutinin.
89. The composition of claim 88, wherein the amino acid sequence of the fusion protein comprises SEQ ID No. 29 or an amino acid sequence at least 75%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to SEQ ID No. 29.
90. The composition of any one of claims 71-89, wherein the composition further comprises chitosan.
91. A kit, the kit comprising:
(i) The composition of any one of claims 71-90; and
(ii) A device for intradermal delivery of the composition to a mammalian subject.
92. The kit of claim 91, wherein the device comprises a syringe and a needle.
93. The kit of claim 91 or claim 92, further comprising instructions for using the device to administer the composition to a mammalian subject to stimulate an immune response against one or more of the first viral antigen, the second viral antigen, the third viral antigen, and the fourth viral antigen.
94. A method of stimulating an immune response in a mammalian subject, the method comprising administering to a mammalian subject the composition of any one of claims 71-90 to stimulate an immune response in the mammalian subject against one or more of the first viral antigen, the second viral antigen, the third viral antigen, and the fourth viral antigen.
95. The method of claim 94, wherein the composition is administered intradermally.
96. The method of claim 95, wherein the immune response comprises a cellular immune response reactive against one or more of the first viral antigen, the second viral antigen, the third viral antigen, and the fourth viral antigen.
97. The method of claim 96, wherein the immune response further comprises a reactive humoral immune response against one or more of the first, second, third, and fourth viral antigens.
98. The method of any one of claims 94-97, wherein the mammalian subject is a human subject.
99. A method for active boosting against at least one virus, the method comprising intradermally administering to a mammalian subject in need thereof a composition according to any one of claims 1-29, any one of claims 37-62, or any one of claims 71-90 to stimulate a secondary immune response against the virus, wherein the mammalian subject has undergone a primary immunization regimen against the virus.
100. The method of claim 99, wherein the primary immunization regimen comprises administration of at least one dose of a different vaccine against the virus.
101. The method of claim 100, wherein the different vaccine comprises a protein antigen of the at least one virus or an inactivated virus, optionally wherein the protein antigen is a recombinant protein or fragment thereof.
102. A method for active boosting immunity against at least one virus, the method comprising:
(i) Administering intradermally the composition of any one of claims 1-29, any one of claims 37-62, or any one of claims 71-90 to a mammalian subject in need thereof to stimulate a primary immune response against the virus; and
(ii) At least one dose of a different vaccine against the virus is administered to the mammalian subject to stimulate a secondary immune response against the virus.
103. The method of claim 102, wherein the different vaccine comprises a protein antigen of the at least one virus or an inactivated virus, optionally wherein the protein antigen is a recombinant protein or fragment thereof.
104. A method for active primary immunization against at least one virus, the method comprising:
(i) Administering intradermally the composition of any one of claims 1-29, any one of claims 37-62, or any one of claims 71-90 to a mammalian subject in need thereof to stimulate a primary immune response against the virus; wherein the mammalian subject has not undergone a primary immunization regimen against the virus.
105. The method of claim 104, the method further comprising:
(ii) At least one dose of a different vaccine against the virus is administered to the mammalian subject to stimulate a secondary immune response against the virus.
106. The method of claim 105, wherein the different vaccine comprises a protein antigen of the at least one virus or an inactivated virus, optionally wherein the protein antigen is a recombinant protein or fragment thereof.
107. The method of any one of claims 94-106, wherein the mammalian subject is a human subject.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US63/211,974 | 2021-06-17 | ||
US63/240,278 | 2021-09-02 | ||
US202163275398P | 2021-11-03 | 2021-11-03 | |
US63/275,398 | 2021-11-03 | ||
PCT/US2022/034104 WO2022266511A1 (en) | 2021-06-17 | 2022-06-17 | Temperature-controllable, self-replicating rna vaccines for viral diseases |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117716036A true CN117716036A (en) | 2024-03-15 |
Family
ID=90157510
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202280043353.2A Pending CN117716036A (en) | 2021-06-17 | 2022-06-17 | Temperature controllable self-replicating RNA vaccine for viral diseases |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117716036A (en) |
-
2022
- 2022-06-17 CN CN202280043353.2A patent/CN117716036A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kaur et al. | COVID-19 Vaccine: A comprehensive status report | |
US11759516B2 (en) | Nucleic acid vaccine against the SARS-CoV-2 coronavirus | |
US20230270841A1 (en) | Coronavirus vaccine | |
EP2961846B1 (en) | Crimean-congo haemorrhagic fever virus antigenic composition | |
US11975067B2 (en) | Coronavirus disease (COVID-19) vaccine | |
JP2021509107A (en) | Hepatitis B virus (HBV) vaccine and its use | |
CN113151184A (en) | Method for cell membrane-based display of coronavirus immunogens to induce neutralizing antibodies | |
US20230190917A1 (en) | Viral vaccine vector for immunization against a betacoronavirus | |
KR102027758B1 (en) | Attenuated swine influenza vaccines and methods of making and use thereof | |
US20240269268A1 (en) | Temperature-controllable, self-replicating rna vaccines for viral diseases | |
WO2021198701A1 (en) | Influenza vaccines | |
US20220275346A1 (en) | Hantavirus antigenic composition | |
CN116075319A (en) | Vaccine against SARS-COV-2 and its preparation | |
EP3511417B1 (en) | Rift valley fever virus glycoproteins, gn and gc, and their use | |
CN116200403A (en) | Novel coronavirus mRNA vaccine for preventing mutant strain | |
CN113801206B (en) | Method for inducing neutralizing antibodies against novel coronaviruses using receptor recognition domains | |
CN117716036A (en) | Temperature controllable self-replicating RNA vaccine for viral diseases | |
KR20230008707A (en) | Vaccine composition for treatment of coronavirus | |
JP4797149B2 (en) | Vector vaccine against influenza virus | |
JP2024537250A (en) | Influenza vaccine | |
Reza et al. | WAYS OF MAKING EFFECTIVE AND SAFE VACCINES AGAINST SARS-CoV-2 | |
AU2022289462A1 (en) | Virus-like particle vaccine for coronavirus | |
CN116457011A (en) | Vaccine composition for treating coronavirus | |
Zhenga et al. | Construction and immunogenicity of a recombinant fowlpox virus containing the capsid and 3C protease coding regions of foot-and-mouth disease virus | |
JP2003310271A (en) | Aids vaccine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |