CN113388041B - 具有融合前早期构象的SARS-CoV-2 S三聚体蛋白及其应用 - Google Patents
具有融合前早期构象的SARS-CoV-2 S三聚体蛋白及其应用 Download PDFInfo
- Publication number
- CN113388041B CN113388041B CN202110268273.4A CN202110268273A CN113388041B CN 113388041 B CN113388041 B CN 113388041B CN 202110268273 A CN202110268273 A CN 202110268273A CN 113388041 B CN113388041 B CN 113388041B
- Authority
- CN
- China
- Prior art keywords
- ser
- leu
- val
- gly
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 239000013638 trimer Substances 0.000 title claims abstract description 55
- 108090000623 proteins and genes Proteins 0.000 title abstract description 159
- 102000004169 proteins and genes Human genes 0.000 title abstract description 157
- 102100031673 Corneodesmosin Human genes 0.000 claims abstract description 122
- 101710139375 Corneodesmosin Proteins 0.000 claims abstract description 111
- 230000004927 fusion Effects 0.000 claims abstract description 109
- 201000010099 disease Diseases 0.000 claims abstract description 16
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 16
- 238000002360 preparation method Methods 0.000 claims abstract description 10
- 241001678559 COVID-19 virus Species 0.000 claims description 102
- 210000004027 cell Anatomy 0.000 claims description 86
- 108020001507 fusion proteins Proteins 0.000 claims description 71
- 102000037865 fusion proteins Human genes 0.000 claims description 71
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 43
- 238000000034 method Methods 0.000 claims description 39
- 239000013639 protein trimer Substances 0.000 claims description 38
- 239000013598 vector Substances 0.000 claims description 28
- 208000025721 COVID-19 Diseases 0.000 claims description 23
- 241000238631 Hexapoda Species 0.000 claims description 23
- 210000002966 serum Anatomy 0.000 claims description 23
- 208000037847 SARS-CoV-2-infection Diseases 0.000 claims description 19
- 108020004707 nucleic acids Proteins 0.000 claims description 19
- 102000039446 nucleic acids Human genes 0.000 claims description 19
- 150000007523 nucleic acids Chemical class 0.000 claims description 19
- 241000701447 unidentified baculovirus Species 0.000 claims description 18
- 125000000539 amino acid group Chemical group 0.000 claims description 17
- 239000003153 chemical reaction reagent Substances 0.000 claims description 17
- 239000002671 adjuvant Substances 0.000 claims description 15
- 229940096437 Protein S Drugs 0.000 claims description 14
- 102000014914 Carrier Proteins Human genes 0.000 claims description 13
- 108010078791 Carrier Proteins Proteins 0.000 claims description 13
- -1 CRM389 Proteins 0.000 claims description 12
- 239000000203 mixture Substances 0.000 claims description 12
- 102000004190 Enzymes Human genes 0.000 claims description 11
- 108090000790 Enzymes Proteins 0.000 claims description 11
- 108010031318 Vitronectin Proteins 0.000 claims description 11
- 239000013604 expression vector Substances 0.000 claims description 11
- 239000002773 nucleotide Substances 0.000 claims description 11
- 125000003729 nucleotide group Chemical group 0.000 claims description 11
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 11
- 238000005829 trimerization reaction Methods 0.000 claims description 11
- 239000008194 pharmaceutical composition Substances 0.000 claims description 10
- 238000006467 substitution reaction Methods 0.000 claims description 9
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 8
- 108091005804 Peptidases Proteins 0.000 claims description 8
- 238000000746 purification Methods 0.000 claims description 8
- 239000004365 Protease Substances 0.000 claims description 7
- 239000002808 molecular sieve Substances 0.000 claims description 7
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 claims description 7
- 229960005486 vaccine Drugs 0.000 claims description 7
- 108010071134 CRM197 (non-toxic variant of diphtheria toxin) Proteins 0.000 claims description 6
- 238000001042 affinity chromatography Methods 0.000 claims description 6
- 210000004369 blood Anatomy 0.000 claims description 6
- 239000008280 blood Substances 0.000 claims description 6
- 210000004899 c-terminal region Anatomy 0.000 claims description 6
- 238000004113 cell culture Methods 0.000 claims description 6
- 239000003795 chemical substances by application Substances 0.000 claims description 6
- 238000003776 cleavage reaction Methods 0.000 claims description 6
- 230000028993 immune response Effects 0.000 claims description 6
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 6
- 230000007017 scission Effects 0.000 claims description 6
- 101100534087 Cowpox virus (strain Brighton Red) CRMA gene Proteins 0.000 claims description 5
- 239000003814 drug Substances 0.000 claims description 5
- 239000003937 drug carrier Substances 0.000 claims description 5
- 230000001939 inductive effect Effects 0.000 claims description 5
- 239000007787 solid Substances 0.000 claims description 5
- 102000002260 Alkaline Phosphatase Human genes 0.000 claims description 4
- 108020004774 Alkaline Phosphatase Proteins 0.000 claims description 4
- 239000011616 biotin Substances 0.000 claims description 4
- 229960002685 biotin Drugs 0.000 claims description 4
- 235000020958 biotin Nutrition 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 239000007850 fluorescent dye Substances 0.000 claims description 4
- 102000004961 Furin Human genes 0.000 claims description 3
- 108090001126 Furin Proteins 0.000 claims description 3
- DZBUGLKDJFMEHC-UHFFFAOYSA-N benzoquinolinylidene Natural products C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 claims description 3
- 230000002163 immunogen Effects 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 241000894007 species Species 0.000 claims description 3
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 claims description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 claims description 2
- 102000044437 S1 domains Human genes 0.000 claims description 2
- 108700036684 S1 domains Proteins 0.000 claims description 2
- 210000002381 plasma Anatomy 0.000 claims description 2
- 229920002704 polyhistidine Polymers 0.000 claims description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims 2
- 241000711573 Coronaviridae Species 0.000 abstract description 23
- 230000002265 prevention Effects 0.000 abstract description 6
- 238000003745 diagnosis Methods 0.000 abstract description 5
- 208000001528 Coronaviridae Infections Diseases 0.000 abstract description 2
- 230000001747 exhibiting effect Effects 0.000 abstract description 2
- 235000018102 proteins Nutrition 0.000 description 145
- 239000000523 sample Substances 0.000 description 46
- 108010050848 glycylleucine Proteins 0.000 description 45
- 241000880493 Leptailurus serval Species 0.000 description 41
- 235000001014 amino acid Nutrition 0.000 description 33
- 108010047495 alanylglycine Proteins 0.000 description 31
- 108010057821 leucylproline Proteins 0.000 description 31
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 28
- 108010044940 alanylglutamine Proteins 0.000 description 26
- 229940024606 amino acid Drugs 0.000 description 25
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 24
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 23
- 150000001413 amino acids Chemical class 0.000 description 22
- 241000700605 Viruses Species 0.000 description 21
- 108010005233 alanylglutamic acid Proteins 0.000 description 21
- 108010078144 glutaminyl-glycine Proteins 0.000 description 20
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 19
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 19
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 19
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 18
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 18
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 18
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 18
- 108010009962 valyltyrosine Proteins 0.000 description 18
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 17
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 17
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 17
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 17
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 17
- 108010092854 aspartyllysine Proteins 0.000 description 17
- 238000001514 detection method Methods 0.000 description 17
- 108010084389 glycyltryptophan Proteins 0.000 description 17
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 16
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 16
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 16
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 16
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 16
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 16
- 230000003472 neutralizing effect Effects 0.000 description 16
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 15
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 15
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 15
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 15
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 15
- 238000006386 neutralization reaction Methods 0.000 description 15
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 14
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 14
- 108010009298 lysylglutamic acid Proteins 0.000 description 14
- 108010054155 lysyllysine Proteins 0.000 description 14
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 13
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 13
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 13
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 13
- 108010036413 histidylglycine Proteins 0.000 description 13
- 230000003053 immunization Effects 0.000 description 13
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 12
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 12
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 12
- 108020003175 receptors Proteins 0.000 description 12
- 102000005962 receptors Human genes 0.000 description 12
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 11
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 11
- 241000282693 Cercopithecidae Species 0.000 description 11
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 11
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 11
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 11
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 11
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 11
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 11
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 11
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 11
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 11
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 11
- 239000000872 buffer Substances 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- 108010089804 glycyl-threonine Proteins 0.000 description 11
- 238000002649 immunization Methods 0.000 description 11
- 238000003018 immunoassay Methods 0.000 description 11
- 108010034529 leucyl-lysine Proteins 0.000 description 11
- 239000002609 medium Substances 0.000 description 11
- 239000002245 particle Substances 0.000 description 11
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 11
- 108010029020 prolylglycine Proteins 0.000 description 11
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 10
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 10
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 10
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 10
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 10
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 10
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 10
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 10
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 10
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 10
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 10
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 10
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 10
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 10
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 10
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 10
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 10
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 10
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 10
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 10
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 10
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 10
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 10
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 10
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 10
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 10
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 10
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 10
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 10
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 10
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 10
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 10
- 108010054812 diprotin A Proteins 0.000 description 10
- 229940088598 enzyme Drugs 0.000 description 10
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 10
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 10
- 108010040030 histidinoalanine Proteins 0.000 description 10
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 10
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 10
- 108010015796 prolylisoleucine Proteins 0.000 description 10
- 108010090894 prolylleucine Proteins 0.000 description 10
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 9
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 9
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 9
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 9
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 9
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 9
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 9
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 9
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 9
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 9
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 9
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 9
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 9
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 9
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 9
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 9
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 9
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 9
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 9
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 9
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 9
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 9
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 9
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 9
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 9
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 9
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 9
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 9
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 9
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 9
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 9
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 9
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 9
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 9
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 9
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 9
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 9
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 9
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 9
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 9
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 9
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 9
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 9
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 9
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 9
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 9
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 9
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 9
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 9
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 9
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 9
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 9
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 9
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 9
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 9
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 9
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 9
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 9
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 9
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 9
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 9
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 9
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 9
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 9
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 9
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 9
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 9
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 9
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 9
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 9
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 9
- FHGVHXCQMJWQPK-SRVKXCTJSA-N His-Lys-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O FHGVHXCQMJWQPK-SRVKXCTJSA-N 0.000 description 9
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 9
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 9
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 9
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 9
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 9
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 9
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 9
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 9
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 9
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 9
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 9
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 9
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 9
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 9
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 9
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 9
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 9
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 9
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 9
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 9
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 9
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 9
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 9
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 9
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 9
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 9
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 9
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 9
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 9
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 9
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 9
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 9
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 9
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 9
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 9
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 9
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 9
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 9
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 9
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 9
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 9
- 241000315672 SARS coronavirus Species 0.000 description 9
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 9
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 9
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 9
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 9
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 9
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 9
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 9
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 9
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 9
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 9
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 9
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 9
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 9
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 9
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 9
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 9
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 9
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 9
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 9
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 9
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 9
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 9
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 9
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 9
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 9
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 9
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 9
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 9
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 239000007853 buffer solution Substances 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 9
- 108010016616 cysteinylglycine Proteins 0.000 description 9
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 9
- 108010037850 glycylvaline Proteins 0.000 description 9
- 230000005847 immunogenicity Effects 0.000 description 9
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 9
- 239000007788 liquid Substances 0.000 description 9
- 108010071207 serylmethionine Proteins 0.000 description 9
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 9
- 108010051110 tyrosyl-lysine Proteins 0.000 description 9
- 238000005406 washing Methods 0.000 description 9
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 8
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 8
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 8
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 8
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 8
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 8
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 8
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 8
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 8
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 8
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 8
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 8
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 8
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 8
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 8
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 8
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 8
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 8
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 8
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 8
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 8
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 8
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 8
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 8
- GFAPBMCRSMSGDZ-XGEHTFHBSA-N Cys-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N)O GFAPBMCRSMSGDZ-XGEHTFHBSA-N 0.000 description 8
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 8
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 8
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 8
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 8
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 8
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 8
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 8
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 8
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 8
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 8
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 8
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 8
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 8
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 8
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 8
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 8
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 8
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 8
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 8
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 8
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 8
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 8
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 8
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 8
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 8
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 8
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 8
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 8
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 8
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 8
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 8
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 8
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 8
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 8
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 8
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 8
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 8
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 8
- CJAHQEZWDZNSJO-KKUMJFAQSA-N Phe-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CJAHQEZWDZNSJO-KKUMJFAQSA-N 0.000 description 8
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 8
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 8
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 8
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 8
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 8
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 8
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 8
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 8
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 8
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 8
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 8
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 8
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 8
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 8
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 8
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 8
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 8
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 8
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 8
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 8
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 8
- UMFLBPIPAJMNIM-LYARXQMPSA-N Thr-Trp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N)O UMFLBPIPAJMNIM-LYARXQMPSA-N 0.000 description 8
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 8
- BYSKNUASOAGJSS-NQCBNZPSSA-N Trp-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BYSKNUASOAGJSS-NQCBNZPSSA-N 0.000 description 8
- TUUXFNQXSFNFLX-XIRDDKMYSA-N Trp-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N TUUXFNQXSFNFLX-XIRDDKMYSA-N 0.000 description 8
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 8
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 8
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 8
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 8
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 8
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 8
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 8
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 8
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 8
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 8
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 8
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 108010069495 cysteinyltyrosine Proteins 0.000 description 8
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 8
- 108010092114 histidylphenylalanine Proteins 0.000 description 8
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 8
- 108010084572 phenylalanyl-valine Proteins 0.000 description 8
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 108010073969 valyllysine Proteins 0.000 description 8
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 7
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 7
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 7
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 7
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 7
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 7
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 7
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 7
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 7
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 7
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 7
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 7
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 7
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 7
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 7
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 7
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 7
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 7
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 7
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 7
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 7
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 7
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 7
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 7
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 7
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 7
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 7
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 7
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 7
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 7
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 7
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 7
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 7
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 7
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 7
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 7
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 7
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 7
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 7
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 7
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 7
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 7
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 7
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 7
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 7
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 7
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 7
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 7
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 7
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 7
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 7
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 7
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 7
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 7
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 7
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 7
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 7
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 7
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 7
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 7
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 7
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 7
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 7
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 7
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 7
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 7
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 7
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 7
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 7
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 7
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 7
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 7
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 7
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 7
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 7
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 7
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 7
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 7
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 7
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 7
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 7
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 229910052802 copper Inorganic materials 0.000 description 7
- 239000010949 copper Substances 0.000 description 7
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 7
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 7
- 108010015792 glycyllysine Proteins 0.000 description 7
- 108010018006 histidylserine Proteins 0.000 description 7
- 108010056582 methionylglutamic acid Proteins 0.000 description 7
- 108010004914 prolylarginine Proteins 0.000 description 7
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 6
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 6
- JRVABKHPWDRUJF-UBHSHLNASA-N Asn-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N JRVABKHPWDRUJF-UBHSHLNASA-N 0.000 description 6
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 6
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 6
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 6
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 6
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 6
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 6
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 6
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 6
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 6
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 6
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 6
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 6
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 6
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 6
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 6
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 6
- 238000011993 High Performance Size Exclusion Chromatography Methods 0.000 description 6
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 6
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 6
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 6
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 6
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 6
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 6
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 6
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 6
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 6
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 6
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 6
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 6
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 6
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 6
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- 241000127282 Middle East respiratory syndrome-related coronavirus Species 0.000 description 6
- 241000699670 Mus sp. Species 0.000 description 6
- 108010003201 RGH 0205 Proteins 0.000 description 6
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 6
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 6
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 6
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 6
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 6
- 101710167605 Spike glycoprotein Proteins 0.000 description 6
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 6
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 6
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 6
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 6
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 6
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 6
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 238000001962 electrophoresis Methods 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010020688 glycylhistidine Proteins 0.000 description 6
- 108010028295 histidylhistidine Proteins 0.000 description 6
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 6
- 239000000178 monomer Substances 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 238000004062 sedimentation Methods 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 6
- 239000006228 supernatant Substances 0.000 description 6
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 5
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 5
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 5
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 5
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 5
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 5
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 5
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 5
- 108020004414 DNA Proteins 0.000 description 5
- 238000002965 ELISA Methods 0.000 description 5
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 5
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 5
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 5
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 5
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 5
- 108090000190 Thrombin Proteins 0.000 description 5
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 5
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 5
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 5
- 229910052782 aluminium Inorganic materials 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 239000011324 bead Substances 0.000 description 5
- 238000004587 chromatography analysis Methods 0.000 description 5
- 238000000113 differential scanning calorimetry Methods 0.000 description 5
- 238000001493 electron microscopy Methods 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 230000001900 immune effect Effects 0.000 description 5
- 108010027338 isoleucylcysteine Proteins 0.000 description 5
- 239000003550 marker Substances 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 230000009257 reactivity Effects 0.000 description 5
- 229960004072 thrombin Drugs 0.000 description 5
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 4
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 4
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 4
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 4
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 4
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 4
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 4
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 4
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 4
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 4
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 4
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 4
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 4
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 4
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 4
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 4
- 108090000288 Glycoproteins Proteins 0.000 description 4
- 102000003886 Glycoproteins Human genes 0.000 description 4
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 4
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 4
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 4
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 4
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 4
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 4
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 4
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 4
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 4
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 4
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 4
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 4
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 4
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 4
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 4
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 4
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 4
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 4
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 4
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 4
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 4
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 4
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 4
- 238000010521 absorption reaction Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 238000010828 elution Methods 0.000 description 4
- 238000010324 immunological assay Methods 0.000 description 4
- 230000014759 maintenance of location Effects 0.000 description 4
- 230000034217 membrane fusion Effects 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 238000003127 radioimmunoassay Methods 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 230000004083 survival effect Effects 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 3
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 3
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 3
- 102100035765 Angiotensin-converting enzyme 2 Human genes 0.000 description 3
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 3
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 3
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 3
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 3
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 3
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 3
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 3
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 3
- 208000034657 Convalescence Diseases 0.000 description 3
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 3
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- OTMSDBZUPAUEDD-UHFFFAOYSA-N Ethane Chemical compound CC OTMSDBZUPAUEDD-UHFFFAOYSA-N 0.000 description 3
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 3
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 3
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 3
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 3
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 3
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 3
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 3
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 3
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 3
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 3
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 3
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 3
- 241000482741 Human coronavirus NL63 Species 0.000 description 3
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 3
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 3
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 3
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 3
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 3
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 3
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 3
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 3
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 3
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 3
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 3
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 3
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 3
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 3
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 3
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 3
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 3
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 3
- 239000012505 Superdex™ Substances 0.000 description 3
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 3
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 3
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 3
- NJLQMKZSXYQRTO-FHWLQOOXSA-N Tyr-Glu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NJLQMKZSXYQRTO-FHWLQOOXSA-N 0.000 description 3
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 3
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 3
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 3
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 3
- 230000005875 antibody response Effects 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 230000004069 differentiation Effects 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 3
- 239000012520 frozen sample Substances 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 239000012160 loading buffer Substances 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 238000002844 melting Methods 0.000 description 3
- 230000008018 melting Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 239000002953 phosphate buffered saline Substances 0.000 description 3
- 229920002401 polyacrylamide Polymers 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 239000000277 virosome Substances 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- OBYNJKLOYWCXEP-UHFFFAOYSA-N 2-[3-(dimethylamino)-6-dimethylazaniumylidenexanthen-9-yl]-4-isothiocyanatobenzoate Chemical compound C=12C=CC(=[N+](C)C)C=C2OC2=CC(N(C)C)=CC=C2C=1C1=CC(N=C=S)=CC=C1C([O-])=O OBYNJKLOYWCXEP-UHFFFAOYSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 2
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 2
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 2
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 2
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 2
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 2
- 108090001008 Avidin Proteins 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- XMTDCXXLDZKAGI-ACZMJKKPSA-N Cys-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XMTDCXXLDZKAGI-ACZMJKKPSA-N 0.000 description 2
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 2
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 2
- ANRWXLYGJRSQEQ-CIUDSAMLSA-N Cys-His-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ANRWXLYGJRSQEQ-CIUDSAMLSA-N 0.000 description 2
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 2
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 2
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 2
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 2
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 2
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 2
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 2
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 2
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 2
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 2
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 2
- JWLWNCVBBSBCEM-NKIYYHGXSA-N His-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O JWLWNCVBBSBCEM-NKIYYHGXSA-N 0.000 description 2
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 2
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 2
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 2
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- HQXSFFSLXFHWOX-IXOXFDKPSA-N Lys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N)O HQXSFFSLXFHWOX-IXOXFDKPSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 2
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- AWMMBHDKERMOID-YTQUADARSA-N Lys-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCCCN)N)C(=O)O AWMMBHDKERMOID-YTQUADARSA-N 0.000 description 2
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 2
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- 241000282567 Macaca fascicularis Species 0.000 description 2
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 239000000020 Nitrocellulose Substances 0.000 description 2
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 2
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 2
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 2
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 2
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 2
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 108010004729 Phycoerythrin Proteins 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 101710198474 Spike protein Proteins 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 2
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- NCGUQWSJUKYCIT-SZZJOZGLSA-N Thr-His-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NCGUQWSJUKYCIT-SZZJOZGLSA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 2
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 2
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 2
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 2
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 2
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 2
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 2
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 2
- 241000711975 Vesicular stomatitis virus Species 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 230000003281 allosteric effect Effects 0.000 description 2
- 230000003698 anagen phase Effects 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 210000004507 artificial chromosome Anatomy 0.000 description 2
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000007865 diluting Methods 0.000 description 2
- 231100000673 dose–response relationship Toxicity 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 239000004816 latex Substances 0.000 description 2
- 229920000126 latex Polymers 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 210000004779 membrane envelope Anatomy 0.000 description 2
- 229920001220 nitrocellulos Polymers 0.000 description 2
- 230000003204 osmotic effect Effects 0.000 description 2
- 230000007918 pathogenicity Effects 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 239000008363 phosphate buffer Substances 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 239000012521 purified sample Substances 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000007789 sealing Methods 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 238000012916 structural analysis Methods 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 238000004448 titration Methods 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- QCDWFXQBSFUVSP-UHFFFAOYSA-N 2-phenoxyethanol Chemical compound OCCOC1=CC=CC=C1 QCDWFXQBSFUVSP-UHFFFAOYSA-N 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical group C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 102100030988 Angiotensin-converting enzyme Human genes 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- 235000011330 Armoracia rusticana Nutrition 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 241000008904 Betacoronavirus Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 102000011632 Caseins Human genes 0.000 description 1
- 108010076119 Caseins Proteins 0.000 description 1
- 102100023321 Ceruloplasmin Human genes 0.000 description 1
- 238000011537 Coomassie blue staining Methods 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 102000047174 Disks Large Homolog 4 Human genes 0.000 description 1
- 108700019745 Disks Large Homolog 4 Proteins 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 241001115402 Ebolavirus Species 0.000 description 1
- 238000011510 Elispot assay Methods 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 101710204837 Envelope small membrane protein Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241001524679 Escherichia virus M13 Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- 108010044091 Globulins Proteins 0.000 description 1
- 102000006395 Globulins Human genes 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108060003393 Granulin Proteins 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 241000711467 Human coronavirus 229E Species 0.000 description 1
- 241001109669 Human coronavirus HKU1 Species 0.000 description 1
- 241001428935 Human coronavirus OC43 Species 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 102000004407 Lactalbumin Human genes 0.000 description 1
- 108090000942 Lactalbumin Proteins 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- ZVXSESPJMKNIQA-YXMSTPNBSA-N Lys-Thr-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZVXSESPJMKNIQA-YXMSTPNBSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101710145006 Lysis protein Proteins 0.000 description 1
- 101710175243 Major antigen Proteins 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 101001028244 Onchocerca volvulus Fatty-acid and retinol-binding protein 1 Proteins 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- NAOVYENZCWFBDG-BZSNNMDCSA-N Phe-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 NAOVYENZCWFBDG-BZSNNMDCSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 101000933967 Pseudomonas phage KPP25 Major capsid protein Proteins 0.000 description 1
- 241001112090 Pseudovirus Species 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108020005091 Replication Origin Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- FRPNVPKQVFHSQY-BPUTZDHNSA-N Ser-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FRPNVPKQVFHSQY-BPUTZDHNSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 241000580858 Simian-Human immunodeficiency virus Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- IBBBOLAPFHRDHW-BPUTZDHNSA-N Trp-Asn-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N IBBBOLAPFHRDHW-BPUTZDHNSA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 1
- 108010046334 Urease Proteins 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 108010003533 Viral Envelope Proteins Proteins 0.000 description 1
- 230000010530 Virus Neutralization Effects 0.000 description 1
- 239000005862 Whey Substances 0.000 description 1
- 108010046377 Whey Proteins Proteins 0.000 description 1
- 102000007544 Whey Proteins Human genes 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 125000000129 anionic group Chemical group 0.000 description 1
- 239000003945 anionic surfactant Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000003429 antifungal agent Substances 0.000 description 1
- 239000012062 aqueous buffer Substances 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 201000009310 astigmatism Diseases 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 239000007975 buffered saline Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 238000005277 cation exchange chromatography Methods 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000003093 cationic surfactant Substances 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000012767 chemiluminescent enzyme immunoassay Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 239000012501 chromatography medium Substances 0.000 description 1
- 238000011097 chromatography purification Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000000120 cytopathologic effect Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- PXEDJBXQKAGXNJ-QTNFYWBSSA-L disodium L-glutamate Chemical compound [Na+].[Na+].[O-]C(=O)[C@@H](N)CCC([O-])=O PXEDJBXQKAGXNJ-QTNFYWBSSA-L 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000004043 dyeing Methods 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000003114 enzyme-linked immunosorbent spot assay Methods 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 239000006260 foam Substances 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000005534 hematocrit Methods 0.000 description 1
- 239000000413 hydrolysate Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- 229910052588 hydroxylapatite Inorganic materials 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 108010027775 interleukin-1beta-converting enzyme inhibitor Proteins 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 238000000752 ionisation method Methods 0.000 description 1
- FZWBNHMXJMCXLU-BLAUPYHCSA-N isomaltotriose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O)O1 FZWBNHMXJMCXLU-BLAUPYHCSA-N 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 210000003141 lower extremity Anatomy 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 235000013923 monosodium glutamate Nutrition 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- XYJRXVWERLGGKC-UHFFFAOYSA-D pentacalcium;hydroxide;triphosphate Chemical compound [OH-].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O XYJRXVWERLGGKC-UHFFFAOYSA-D 0.000 description 1
- 229960003742 phenol Drugs 0.000 description 1
- 229960005323 phenoxyethanol Drugs 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 239000002861 polymer material Substances 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 239000004800 polyvinyl chloride Substances 0.000 description 1
- 229920000915 polyvinyl chloride Polymers 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000004262 preparative liquid chromatography Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 229940024999 proteolytic enzymes for treatment of wounds and ulcers Drugs 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 230000000284 resting effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 238000001338 self-assembly Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 229940073490 sodium glutamate Drugs 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 235000010356 sorbitol Nutrition 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 229940031626 subunit vaccine Drugs 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 238000002198 surface plasmon resonance spectroscopy Methods 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 229940033663 thimerosal Drugs 0.000 description 1
- ANRHNWWPFJCPAZ-UHFFFAOYSA-M thionine Chemical compound [Cl-].C1=CC(N)=CC2=[S+]C3=CC(N)=CC=C3N=C21 ANRHNWWPFJCPAZ-UHFFFAOYSA-M 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000004627 transmission electron microscopy Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 230000002477 vacuolizing effect Effects 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56983—Viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/005—Assays involving biological materials from specific organisms or of a specific nature from viruses
- G01N2333/08—RNA viruses
- G01N2333/165—Coronaviridae, e.g. avian infectious bronchitis virus
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- General Health & Medical Sciences (AREA)
- Immunology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Urology & Nephrology (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Hematology (AREA)
- Animal Behavior & Ethology (AREA)
- Biomedical Technology (AREA)
- Cell Biology (AREA)
- Food Science & Technology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Biotechnology (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明提供了具有融合前早期构象的SARS‑CoV‑2S三聚体蛋白,及其制备方法。本发明还涉及呈现该种构象的冠状病毒S蛋白在诊断、预防和治疗新型冠状病毒感染和导致疾病的用途。
Description
技术领域
本发明涉及病毒学领域。具体而言,本发明提供了具有融合前早期构象的SARS-CoV-2 S三聚体蛋白,及其制备方法。本发明还涉及呈现该种构象的冠状病毒S蛋白在诊断、预防和治疗新型冠状病毒感染和导致疾病的用途。
背景技术
冠状病毒,英文简写CoV,全称Coronavirus:Corona(冕、冠)+Virus(病毒)=冠状病毒。“冠状病毒”因在电子显微镜下呈现的标志性冠状结构而得名,该结构为S糖蛋白在病毒包膜表面形成的刺突。
冠状病毒颗粒呈圆形或椭圆形,亦为多形性,直径50~200nm,属于尺寸较大的病毒。冠状病毒是包膜病毒,病毒衣壳外面包裹着脂质包膜,其上排列较宽的刺突(S蛋白),形状如太阳光环。S蛋白位于病毒表面,形成棒状结构,作为病毒的主要抗原蛋白之一,介导受体结合和细胞融合的主要抗原。负责识别宿主细胞表面的受体,可刺激机体产生中和抗体。
已知冠状病毒中,有7种可引发人类疾病,包括:HCoV-229E、HCoV-OC43、HCoV-NL63、HCoV-HKU1、SARS-CoV、MERS-CoV和SARS-CoV-2。其中,前4种为局部流行性疾病,主要引起轻度自限性疾病,而SARS-CoV和MERS-CoV可引发重症。2002年和2012年发现的SARS-CoV和MERS-CoV属于β-冠状病毒,并由于其对人类的高威胁性被列入WHO高威胁清单。SARS-CoV-2(sever acute respiratory syndrome coronaviruses 2,严重急性呼吸综合征冠状病毒2)是第七个能够引发人类疾病的离散冠状病毒种属,表征为β-冠状病毒。冠状病毒引发的高患病率对人类健康构成持续威胁。
SARS-CoV-2基因组全长约29,800个碱基对(base pair,bp),基因组一般含有13个基因,编码14个蛋白,主要包括多聚蛋白(polyprotein)、刺突蛋白(spike protein,S)、基质蛋白(matrix protein,M)、囊膜蛋白(envelop protein,E)和衣壳蛋白(nucelocapsidprotein,N)及其他此类病毒特有蛋白。其中主要的膜蛋白S,含S1和S2两个亚基,S1上含有N端结构域NTD,受体结合域RBD,及两个连接功能域SD1和SD2。
冠状病毒属于囊膜病毒,囊膜病毒通过与靶细胞膜的融合入侵宿主细胞。其中,冠状病毒属于I类囊膜病毒,包含HIV、流感病毒、埃博拉病毒等。I类囊膜病毒的病毒膜融合由病毒表面主要糖蛋白介导,冠状病毒S糖蛋白前体具有蛋白酶切割位点,成熟后会被宿主切割成S1和S2两个亚基,S1/S2裂解位点位于残基660-675的灵活环中,该残基在预融合S1-S2三聚体峰中完全暴露。要成功感染SARS-CoV,需要宿主细胞蛋白酶切割S1/S2位点。冠状病毒成熟过程中,该类病毒在膜融合前后会发生显著构象变化,融合前构象为prefusion,S1和S2维持完整的三聚体构象,融合后的构象为postfusion,S1蛋白与受体结合,触发一系列构象变化,且在蛋白水解酶和细胞内吞泡的酸性条件下,导致S2构象延长,S2高度疏水的N端融合肽FP锚定在宿主细胞上。S2三聚体重折叠成高度稳定的构象,诱导六螺旋束的形成,完成膜融合。
特别的,在β冠状病毒中,在融合前构象中,S蛋白中的S1也会经历特殊的构象变化,该构象变化为β冠状病毒所特有,其他型目前还没有发现。
目前已证实,新型冠状病毒的受体与SARS-CoV一样都是血管紧张素转化酶2(ACE2)。基于利用冷冻透射电子显微镜单颗粒分析(cryo-EM,single particle analysis,SPA)解析的SARS-CoV的S糖蛋白与其宿主细胞受体ACE2的复合结构,早期研究表明,SARS-CoV病毒与受体结合前,三聚体S糖蛋白中的受体结合域RBD为“向下”构象,当病毒与受体结合时,其中一个RBD采用突出的“向上”构象与ACE2结合,从而诱导其他两个RBD也与受体结合。同时在MERS-CoV S糖蛋白中也有类似的观察,显示RBD“向上”突出。最新利用冷冻电镜解析的新型冠状病毒SARS-CoV-2的S蛋白结构也有一个RBD“向上”的构象变化。由于S三聚体糖蛋白是冠状病毒与细胞受体结合介导病毒入胞和感染并致病的重要蛋白,因此成为诊断、预防和治疗病毒感染和致病的主要靶标,然而,S蛋白的变构现象及其呈现多种的不同构象与机体产生的免疫反应相关,给研究带了极大的挑战,截至目前鉴定的S三聚体蛋白呈现为融合前(prefusion)和融合后(postfusion)两种构象,融合前构象呈现为亚稳态(metastable)构象,在受体刺激或其他条件作用下自发变构成为融合后构象,病毒S变成融合后构象时已经处于感染入胞状态,因此,针对融合前构象S蛋白开发疫苗或药物将具有更好的病毒中和作用和抑制病毒复制效果,而使得稳定S蛋白融合前构象及其表位研究成为冠状病毒的研究重点。
发明内容
本申请的发明人经过大量实验和反复摸索,出人意料地获得了具有早期融合前构象的SARS-CoV-2的S三聚体,该构象区别于已知的融合前构象,并且对SARS-CoV-2的诊断、预防及治疗具有更好的效果,由此完成了本发明。
因此,在第一方面,本发明提供了一种融合蛋白,其包含严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的S蛋白或其变体序列、以及三聚化结构域序列;
其中,所述S蛋白变体与野生型S蛋白(例如SEQ ID NO:1所示的S蛋白)相比具有一个或几个氨基酸的置换、缺失或添加(例如1个,2个,3个,4个,5个,6个,7个,8个,9个或10个氨基酸的置换、缺失或添加),或者与野生型S蛋白相比具有至少80%、至少85%、至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少98%、或至少99%的序列同一性,并且所述S蛋白变体具有野生型S蛋白的免疫学活性(例如,在受试者中诱导针对SARS-CoV-2的特异性免疫应答如中和抗体应答的活性)。
在本发明中,术语“三聚化结构域”是指,能够使本文所述的融合蛋白的若干拷贝三聚化(即形成生物分子复合物)的任何多肽或蛋白质。在某些实施方案中,所述三聚化结构域是同源三聚化结构域,其将相同的融合蛋白聚集并形成三聚体。在某些实施方案中,所述三聚化结构域序列包含如SEQ ID NO:3所示的序列。
在某些实施方案中,所述S蛋白变体相比于野生型S蛋白具有更优的构象稳定性,其不易转变为融合后构象。
在某些实施方案中,所述S蛋白变体在其S1结构域和S2结构域之间不包含弗林蛋白酶识别位点。在某些实施方案中,所述S蛋白变体不包含弗林蛋白酶识别位点。
在某些实施方案中,所述S蛋白变体在相应于如下位置的氨基酸位置中的一处或多处(例如1处,2处或3处或4处)包含氨基酸置换,所述如下位置参考野生型S蛋白所示的氨基酸位置:668、669、671。
在某些实施方案中,与野生型S蛋白相比,所述S蛋白变体包含选自下列的1个或几个(例如1个,2个或3个)氨基酸置换:在相应于野生型S蛋白的位置668的位置上的氨基酸置换为A,在相应于野生型S蛋白的位置669的位置上的氨基酸置换为G,在相应于野生型S蛋白的位置671的位置上的氨基酸置换为G。
在某些实施方案中,与野生型S蛋白相比,所述S蛋白变体在与野生型S蛋白的第668-671位相对应的位置上的氨基酸残基被置换为AGAG。
在某些实施方案中,与野生型S蛋白相比,所述S蛋白变体在与野生型S蛋白的第972位和/或第973位相对应的位置上包含氨基酸置换。具体地,所述氨基酸置换为:所述S蛋白变体在与野生型S蛋白的第972位相对应的位置上的氨基酸被置换为P,和/或所述S蛋白变体在与野生型S蛋白的第973位相对应的位置上的氨基酸被置换为P。
在某些实施方案中,所述S蛋白变体包含:在相应于野生型S蛋白的位置972的位置上的氨基酸置换为P,和/或在相应于野生型S蛋白的位置973的位置上的氨基酸置换为P。
在某些实施方案中,所述S蛋白包含如SEQ ID NO:1所示的氨基酸序列。
在某些实施方案中,所述S蛋白变体包含如SEQ ID NO:2所示的氨基酸序列。
在某些实施方案中,所述融合蛋白从N端至C端包含所述S蛋白或其变体序列和三聚化结构域序列。
在某些实施方案中,所述融合蛋白在所述S蛋白或其变体序列和三聚化结构域序列之间进一步包含蛋白酶识别位点序列。
在某些实施方案中,所述蛋白酶是凝血酶。
在某些实施方案中,所述蛋白酶识别位点序列包含如SEQ ID NO:4或5所示的序列。
在某些实施方案中,所述融合蛋白包含选自下列的氨基酸序列:
(1)由SEQ ID NO:6所示序列的第1位至第1238位氨基酸残基构成的氨基酸序列;
(2)由SEQ ID NO:8所示序列的第1位至第1238位氨基酸残基构成的氨基酸序列。
在某些实施方案中,所述融合蛋白进一步包含载体蛋白序列。
在某些实施方案中,所述载体蛋白序列位于所述S蛋白或其变体序列的N端。在某些实施方案中,所述载体蛋白位于所述三聚化结构域序列的C端。
在某些实施方案中,所述载体蛋白选自CRMA(例如SEQ ID NO:10所示的CRMA)、CRM389(例如SEQ ID NO:11所示的CRM389)或CRM197(例如SEQ ID NO:12所示的CRM197)。
在某些实施方案中,所述载体蛋白序列与其相邻结构域之间任选地通过肽接头连接。在某些实施方案中,所述肽接头是(GmS)n,其中m选自1-4的整数(例如1,2,3,4),n选自1-3的整数(例如1,2,3)。
在某些实施方案中,所述融合蛋白包含选自下列的氨基酸序列:
(1)由SEQ ID NO:13所示序列的第1位至第1443位氨基酸残基构成的氨基酸序列;
(2)由SEQ ID NO:14所示序列的第1位至第1443位氨基酸残基构成的氨基酸序列;
(3)由SEQ ID NO:15所示序列的第1位至第1642位氨基酸残基构成的氨基酸序列;
(4)由SEQ ID NO:16所示序列的第1位至第1642位氨基酸残基构成的氨基酸序列;
(5)由SEQ ID NO:17所示序列的第1位至第1788位氨基酸残基构成的氨基酸序列;
(6)由SEQ ID NO:18所示序列的第1位至第1788位氨基酸残基构成的氨基酸序列。
在某些实施方案中,所述融合蛋白包含标签序列。在某些实施方案中,所述标签序列是纯化标签,例如多组氨酸标签、myc标签或HA标签。在某些实施方案中,所述标签序列位于所述融合蛋白的N端或C端。在某些实施方案中,所述标签序列与其相邻结构域之间任选地通过肽接头或酶切位点序列连接。在某些实施方案中,所述融合蛋白包含SEQ ID NOs:13-18任一项所示的序列。
在第二方面,本发明提供了一种多聚体,其包含第一方面所述的融合蛋白。
在某些实施方案中,所述多聚体是同源多聚体。
在某些实施方案中,所述多聚体是三聚体。
在某些实施方案中,所述多聚体是由相同的融合蛋白所形成的三聚体。
在第三方面,本发明提供了一种分离的核酸分子,其包含编码第一方面所述的融合蛋白的核苷酸序列。
在第四方面,本发明提供了一种载体,其包含如上所述的分离的核酸分子。在某些实施方案中,所述载体是例如质粒,粘粒,噬菌体等。优选地,所述载体是杆状病毒表达载体。
在第五方面,本发明提供了一种宿主细胞,其包含如上所述的分离的核酸分子或载体。此类宿主细胞包括但不限于,原核细胞例如大肠杆菌细胞,以及真核细胞例如酵母细胞,昆虫细胞,植物细胞和动物细胞(如哺乳动物细胞,例如小鼠细胞、人细胞等)。优选地,所述宿主细胞是昆虫细胞。
在第六方面,本发明提供了制备第一方面所述的融合蛋白或第二方面所述的多聚体的方法,其包括,培养第五方面的宿主细胞,和从细胞培养物中回收所述融合蛋白。在某些实施方案中,所述融合蛋白以三聚体形式存在。在某些实施方案中,所述宿主细胞是昆虫细胞。
在第七方面,本发明提供了制备严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的S蛋白三聚体的方法,其包括:在杆状病毒昆虫表达系统中表达第一方面所述的融合蛋白,从而产生所述S蛋白三聚体。
在某些实施方案中,所述杆状病毒昆虫表达系统中的昆虫宿主细胞选自sf9或sf21。
在某些实施方案中,所述杆状病毒昆虫表达系统中的杆状病毒表达载体选自pAcgp67(A-C),pFastbac,pEx/bac。
在某些实施方案中,所述方法包括以下步骤:
(1)将包含编码第一方面所述的融合蛋白的核苷酸序列的杆状病毒表达载体引入昆虫宿主细胞中;
(2)培养所述昆虫宿主细胞;
(3)从细胞培养物(例如培养上清)中回收S蛋白三聚体。
在某些实施方案中,所述方法还包括:对所产生的S蛋白三聚体进行纯化。优选地,通过亲和层析和/或分子筛对S蛋白三聚体进行纯化。
在某些实施方案中,所述纯化包括:将所述细胞培养上清交换缓冲液到PBS或TB缓冲液;将交换缓冲液的细胞培养上清通过Ni亲和层析纯化,洗脱杂蛋白。
在第八方面,本发明提供了一种严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的S蛋白三聚体,其具备早期融合前构象,所述早期融合前构象包含以下特征中的1项、2项或3项:
(1)NTD结构域质量中心与S2结构域质量中心的连线,和RBD结构域质量中心与S2结构域质量中心的连线的夹角为约81.4°;
(2)NTD结构域质量中心与S2结构域质量中心的垂直距离为约
(3)RBD结构域质量中心与S2结构域质量中心的垂直距离为约
在某些实施方案中,上述构象特征通过冷冻电镜三维结构重建测定。
在某些实施方案中,所述S蛋白三聚体是由第一方面所述的融合蛋白所形成的三聚体。在某些实施方案中,所述三聚体是由相同的融合蛋白形成的同源三聚体。
在某些实施方案中,所述S蛋白三聚体通过第七方面所述的方法制备。
在第九方面,本发明提供了一种药物组合物,其包含本发明的融合蛋白、多聚体、分离的核酸分子、载体、宿主细胞、或S蛋白三聚体,以及药学上可接受的载体和/或赋形剂。
在某些实施方案中,所述药物组合物是免疫原性组合物或疫苗。在某些实施方案中,所述药学上可接受的载体和/或赋形剂是佐剂。
在某些实施方案中,所述融合蛋白、多聚体、分离的核酸分子、载体、宿主细胞、或S蛋白三聚体以预防和/或治疗预防SARS-CoV-2感染或由SARS-CoV-2感染所致的疾病COVID-19的有效量存在。
在另一方面,本发明提供了本发明的融合蛋白、多聚体、分离的核酸分子、载体、宿主细胞、S蛋白三聚体、或药物组合物,在制备用于在受试者中诱导针对SARS-CoV-2的特异性免疫应答和/或用于在受试者中预防和/或治疗SARS-CoV-2感染的药剂中的应用。
在另一方面,本发明提供了本发明的融合蛋白、多聚体、分离的核酸分子、载体、宿主细胞、S蛋白三聚体、或药物组合物,在制备用于在受试者中预防和/或治疗由SARS-CoV-2感染所致的疾病COVID-19的药剂中的应用。
在某些实施方案中,所述药剂为疫苗。
在某些实施方案中,所述受试者是人。
在另一方面,本发明提供了用于在受试者中诱导针对SARS-CoV-2的特异性免疫应答或用于在受试者中预防和/或治疗SARS-CoV-2感染的方法,其包括向有此需要的受试者施用免疫学有效量的本发明的多聚体、S蛋白三聚体或药物组合物。
在另一方面,本发明提供了一种试剂盒,其包含本发明的融合蛋白、多聚体、分离的核酸分子、载体、宿主细胞、或S蛋白三聚体。
在某些实施方案中,所述试剂盒包含捕获试剂,所述捕获试剂选自本发明的融合蛋白、多聚体、或S蛋白三聚体。在某些实施方案中,所述试剂盒还包含使用本发明的融合蛋白、多聚体、或S蛋白三聚体作为捕获试剂来检测样品中的对SARS-CoV-2特异性的抗体的说明书,或者使用本发明的融合蛋白、多聚体、或S蛋白三聚体作为捕获试剂来测定来自受试者的样品中对SARS-CoV-2特异性的抗体的水平,从而确定所述受试者是否患有SARS-CoV-2感染的说明书。
在某些实施方案中,所述受试者是人。
在某些实施方案中,所述捕获试剂连接于固相载体表面。在本文中,所述的固相载体包括由聚合物材料(例如聚氯乙烯、聚苯乙烯、聚丙酰胺或纤维素)制成或包被的凹孔平板、试管、珠粒(例如乳胶颗粒)或薄膜(例如硝酸纤维素膜),或由功能基团(例如氨基、羧基、生物素或亲和素)预包被的磁珠。在某些实施方案中,所述固相载体选自磁珠或微量滴定板(例如微孔板或酶标板)。
在某些实施方案中,所述试剂盒还包含检测试剂,所述检测试剂可以测定“捕获试剂/对SARS-CoV-2特异性的抗体”免疫复合物的形成。
在某些实施方案中,所述检测试剂选自带有可检测的标记的二级抗体。
在某些实施方案中,所述可检测的标记选自酶(例如辣根过氧化物酶或碱性磷酸酶)、化学发光试剂(例如吖啶酯类化合物)、荧光染料或生物素。
在某些实施方案中,所述二级抗体对待测抗体所来自的物种(例如人)的抗体是特异的。在某些实施方案中,所述二级抗体是抗-免疫球蛋白抗体。在某些实施方案中,所述抗-免疫球蛋白抗体选自抗IgG抗体、抗IgM抗体或抗IgA抗体;优选地,所述抗-免疫球蛋白抗体是抗IgG抗体。
在另一方面,本发明提供了本发明的融合蛋白、多聚体、分离的核酸分子、载体、宿主细胞、或S蛋白三聚体,在制备用于测定来自受试者的样品中的对SARS-CoV-2特异性的抗体的存在或其水平和/或用于确定受试者是否患有SARS-CoV-2感染的试剂盒中的用途。
在某些实施方案中,所述样品是血液样品,例如全血、血浆或血清。
在某些实施方案中,所述受试者是人。
在另一个方面,本发明提供了测定来自受试者的样品中的对SARS-CoV-2特异性的抗体的存在或其水平的方法,所述方法包括在允许抗原-抗体复合物形成的条件下,使所述样品与本发明的融合蛋白、多聚体、或S蛋白三聚体接触,并且检测抗原-抗体复合物的形成。在某些实施方案中,所述对SARS-CoV-2特异性的抗体可以是IgG,IgM,IgA,或总抗体。所述总抗体是指所述样品中对SARS-CoV-2特异性的所有抗体,而不限于其中的任何一类,因此包括IgM、IgG、IgA等。
在某些实施方案中,所述方法包括以下步骤:
(1)将所述样品与捕获试剂接触,以获得抗原-抗体免疫复合物;所述捕获试剂选自本发明的融合蛋白、多聚体或S蛋白三聚体;
(2)测定步骤(1)获得的抗原-抗体免疫复合物的形成或其量。
在某些实施方案中,在步骤(2)中,通过免疫学检测来测定所述免疫复合物的形成或其量。
在某些实施方案中,所述免疫学检测选自酶免疫测定法(例如ELISA)、化学发光免疫分析法、荧光免疫分析法或放射免疫测定法。
在某些实施方案中,所述捕获试剂连接与固相载体表面。
在某些实施方案中,在步骤(2)中,使用检测试剂来检测所述免疫复合物的量,所述检测试剂选自带有可检测的标记的二级抗体,例如抗-免疫球蛋白抗体,例如抗IgG抗体、抗IgM抗体或抗IgA抗体。
在另一个方面,本发明提供了确定受试者是否患有SARS-CoV-2感染的方法,所述方法包括:
(i)通过上一方面所述的方法来测定来自所述受试者的样品中对SARS-CoV-2特异性的抗体的水平;和,
(ii)将所述水平与参考值比较,以判断所述受试者是否患有SARS-CoV-2感染。
术语定义
在本发明中,除非另有说明,否则本文中使用的科学和技术名词具有本领域技术人员所通常理解的含义。并且,本文中所用的病毒学、生物化学、核酸化学、免疫学等实验室操作步骤均为相应领域内广泛使用的常规步骤。同时,为了更好地理解本发明,下面提供相关术语的定义和解释。
如本文中所使用的,术语“S蛋白”也称为刺突蛋白,其是冠状病毒的主要结构蛋白之一。SARS-CoV-2的S蛋白的序列是本领域技术人员已知的,例如可参见NCBI数据库中如下序列:NC_045512.2。
如本文中所使用的,术语“S蛋白三聚体”、“S三聚体蛋白”、“S三聚体”可互换使用,其是指由三个S蛋白单体自组装所形成的三聚体。对于SARS-CoV-2,S蛋白三聚体在膜融合前后会发生显著构象变化,融合前构象为prefusion,融合后的构象为postfusion。在本发明中,意外发现了不同于已报道的融合前构象的一种前前构象,将其命名为“早期融合前构象”或“融合前早期构象”,该构象的S三聚体蛋白的NTD和RBD结构域与融合前S蛋白具有明显构象差异。通过冷冻电镜三维结构重建测定,所述早期融合前构象可以包含以下特征中的1项、2项或3项:(1)NTD结构域质量中心与S2结构域质量中心的连线,和RBD结构域质量中心与S2结构域质量中心的连线的夹角为约81.4°;(2)NTD结构域质量中心与S2结构域质量中心的垂直距离为约(3)RBD结构域质量中心与S2结构域质量中心的垂直距离为约此外,本领域人员知晓,对于蛋白质不同构象的区分,除了通过结构解析方法,如冷冻电镜三维结构重建、X-射线晶体学或核磁共振波谱法进行测定,通过对两种空间结构的几何测量进行定量区分,也可以通过免疫筛选特异识别本发明的融合前早期构象而不识别其他构象的单克隆抗体,再通过单克隆抗体与S蛋白进行反应的免疫分析方法进行判断是否具有本发明构象的蛋白。
如本文中所使用的,术语“杆状病毒昆虫表达系统”是指以昆虫细胞为宿主细胞、以杆状病毒表达载体为表达载体的重组蛋白表达系统。该表达系统是本领域技术人员熟知的,杆状病毒表达载体(例如pAcgp67(A-C),pFastbac,pEx/bac等)、昆虫宿主细胞(例如sf9或sf21等)均市售可得。
如本文中所使用的,术语“载体(vector)”是指,可将多聚核苷酸插入其中的一种核酸运载工具。当载体能使插入的多核苷酸编码的蛋白获得表达时,载体称为表达载体。载体可以通过转化,转导或者转染导入宿主细胞,使其携带的遗传物质元件在宿主细胞中获得表达。载体是本领域技术人员公知的,包括但不限于:质粒;噬菌粒;柯斯质粒;人工染色体,例如酵母人工染色体(YAC)、细菌人工染色体(BAC)或P1来源的人工染色体(PAC);噬菌体如λ噬菌体或M13噬菌体及动物病毒等。可用作载体的动物病毒包括但不限于,逆转录酶病毒(包括慢病毒)、腺病毒、腺相关病毒、疱疹病毒(如单纯疱疹病毒)、痘病毒、杆状病毒、乳头瘤病毒、乳头多瘤空泡病毒(如SV40)。一种载体可以含有多种控制表达的元件,包括但不限于,启动子序列、转录起始序列、增强子序列、选择元件及报告基因。另外,载体还可含有复制起始位点。
如本文中所使用的,术语“宿主细胞”是指,可用于导入载体的细胞,其包括但不限于,如大肠杆菌或枯草菌等的原核细胞,如酵母细胞或曲霉菌等的真菌细胞,如S2果蝇细胞或Sf9等的昆虫细胞,或者如纤维原细胞,CHO细胞,COS细胞,NSO细胞,HeLa细胞,BHK细胞,HEK 293细胞或人细胞等的动物细胞。
如本文中所使用的,术语“同一性”用于指两个多肽之间或两个核酸之间序列的匹配情况。当两个进行比较的序列中的某个位置都被相同的碱基或氨基酸单体亚单元占据时(例如,两个DNA分子的每一个中的某个位置都被腺嘌呤占据,或两个多肽的每一个中的某个位置都被赖氨酸占据),那么各分子在该位置上是同一的。两个序列之间的“百分数同一性”是由这两个序列共有的匹配位置数目除以进行比较的位置数目×100的函数。例如,如果两个序列的10个位置中有6个匹配,那么这两个序列具有60%的同一性。例如,DNA序列CTGACT和CAGGTT共有50%的同一性(总共6个位置中有3个位置匹配)。通常,在将两个序列比对以产生最大同一性时进行比较。这样的比对可通过使用,例如,可通过计算机程序例如Align程序(DNAstar,Inc.)方便地进行的Needleman等人(1970)J.Mol.Biol.48:443-453的方法来实现。还可使用已整合入ALIGN程序(版本2.0)的E.Meyers和W.Miller(Comput.ApplBiosci.,4:11-17(1988))的算法,使用PAM120权重残基表(weight residue table)、12的缺口长度罚分和4的缺口罚分来测定两个氨基酸序列之间的百分数同一性。此外,可使用已整合入GCG软件包(可在www.gcg.com上获得)的GAP程序中的Needleman和Wunsch(JMoIBiol.48:444-453(1970))算法,使用Blossum 62矩阵或PAM250矩阵以及16、14、12、10、8、6或4的缺口权重(gap weight)和1、2、3、4、5或6的长度权重来测定两个氨基酸序列之间的百分数同一性。
本文涉及的二十个常规氨基酸的编写遵循常规用法。参见例如,Immunology-ASynthesis(2nd Edition,E.S.Golub and D.R.Gren,Eds.,Sinauer Associates,Sunderland,Mass.(1991)),其以引用的方式并入本文中。在本发明中,术语“多肽”和“蛋白质”具有相同的含义且可互换使用。并且在本发明中,氨基酸通常用本领域公知的单字母和三字母缩写来表示。例如,丙氨酸可用A或Ala表示。
如本文中所使用的,术语“药学上可接受的载体和/或赋形剂”是指,在药理学和/或生理学上与受试者和活性成分相容的载体和/或赋形剂,其是本领域公知的(参见例如Remington's Pharmaceutical Sciences.Edited by Gennaro AR,19thed.Pennsylvania:Mack Publishing Company,1995),并且包括但不限于:pH调节剂,表面活性剂,离子强度增强剂,维持渗透压的试剂,延迟吸收的试剂,稀释剂,佐剂,防腐剂,稳定剂等。例如,pH调节剂包括但不限于磷酸盐缓冲液。表面活性剂包括但不限于阳离子,阴离子或者非离子型表面活性剂,例如Tween-80。离子强度增强剂包括但不限于氯化钠。维持渗透压的试剂包括但不限于糖、NaCl及其类似物。延迟吸收的试剂包括但不限于单硬脂酸盐和明胶。稀释剂包括但不限于水,水性缓冲液(如缓冲盐水),醇和多元醇(如甘油)等。佐剂包括但不限于铝佐剂(例如氢氧化铝),弗氏佐剂(例如完全弗氏佐剂)等。防腐剂包括但不限于各种抗细菌试剂和抗真菌试剂,例如硫柳汞,2-苯氧乙醇,对羟苯甲酸酯,三氯叔丁醇,苯酚,山梨酸等。稳定剂具有本领域技术人员通常理解的含义,其能够稳定药物中的活性成分的期望活性(例如对PSD-95泛素化的抑制活性),包括但不限于谷氨酸钠,明胶,SPGA,糖类(如山梨醇,甘露醇,淀粉,蔗糖,乳糖,葡聚糖,或葡萄糖),氨基酸(如谷氨酸,甘氨酸),蛋白质(如干燥乳清,白蛋白或酪蛋白)或其降解产物(如乳白蛋白水解物)等。
如本文中所使用的,术语,术语“色谱层析“包括但不限于:离子交换色谱(例如阳离子交换色谱)、疏水相互作用色谱、吸附层析法(例如羟基磷灰石色谱)、凝胶过滤(凝胶排阻)层析、亲和层析法。
如本文中所使用的,术语“载体蛋白”是指这样的蛋白,其可以充当抗原性蛋白的载体,即,其可以在特定位置处(例如蛋白内部,N端或C端)插入抗原性蛋白,以便该抗原性蛋白能够呈现出来,从而该抗原性蛋白能够被抗体或免疫系统识别。此类载体蛋白是本领域技术人员熟知的,包括例如,CRMA蛋白、CRM389蛋白、CRM197蛋白等待。任选地,可以在抗原性蛋白与载体蛋白之间使用连接体(例如柔性或刚性连接体),以促进二者各自的折叠。
如本文中所使用的,术语“免疫学测定”是指,利用抗原-抗体之间的特异性相互作用/结合亲和力来进行的测定,其一般可用于检测特定抗原或者抗体在样品中的存在或水平。此类免疫学检测是本领域技术人员公知的,包括但不限于,酶免疫测定法(EIA)、化学发光免疫分析法(CLIA)、放射免疫测定法(RIA)、荧光免疫测定法(FIA)、Western印迹法、免疫比浊法、表面等离子共振法等。在某些实施方式中,所述免疫学检测为酶免疫测定法(EIA),例如ELISA检测法、Elispot检测法或CLEIA检测法。关于免疫学检测的详细描述,可参见例如,Fundamental Immunology,Ch.7Paul,W.,ed.,第2版,Raven Press,N.Y.(1989)。
如本文中所使用的,术语“可检测的标记”可以是可通过荧光、光谱、光化学、生物化学、免疫学、电学、光学或化学手段检测的任何物质。特别优选的是,此类标记能够适用于免疫学检测(例如,酶联免疫测定法、放射免疫测定法、荧光免疫测定法、化学发光免疫测定法等)。这类标记是本领域熟知的,包括但不限于,酶(例如,辣根过氧化物酶、碱性磷酸酶、β-半乳糖苷酶、脲酶、葡萄糖氧化酶,等)、放射性核素(例如,3H、125I、35S、14C或32P)、荧光染料(例如,异硫氰酸荧光素(FITC)、荧光素、异硫氰酸四甲基罗丹明(TRITC)、藻红蛋白(PE)、德克萨斯红、罗丹明、量子点或花菁染料衍生物(例如Cy7、Alexa 750))、发光物质(例如化学发光物质,如吖啶酯类化合物)、磁珠(例如,)、测热标记物例如胶体金或有色玻璃或塑料(例如,聚苯乙烯、聚丙烯、乳胶,等)珠、以及用于结合上述标记物修饰的亲和素(例如,链霉亲和素)的生物素。本发明中涵盖的标记物可通过本领域已知的方法检测。例如,放射性标记可使用摄影胶片或闪烁计算器检测,荧光标记物可使用光检测器检测,以检测发射的光。酶标记物一般通过给酶提供底物及检测通过酶对底物的作用产生的反应产物来检测,及测热标记物通过简单可视化着色标记物来检测。在某些实施方案中,可通过不同长度的接头将如上所述的可检测的标记连接至检测试剂,以降低潜在的位阻。
如本文中所使用的,术语“约”是指本领域的普通技术人员认为的在所修饰的数值的可接受的标准误差内,例如所修饰的数值的±10%、±9%、±8%、±7%、±6%、±5%、±4%、±3%、±2%、±1%、±0.5%、±0.1%、±0.05%或±0.01%以内。
如本文中所使用的,术语“和/或”,例如“A和/或B”应理解为是指“A和B”或“A或B”,并且应被理解为对两种含义或任一含义提供明确的支持。具体地,“在相应于野生型S蛋白的位置972的位置上的氨基酸置换为P,和/或在相应于野生型S蛋白的位置973的位置上的氨基酸置换为P”,应理解为“在相应于野生型S蛋白的位置972的位置上的氨基酸置换为P,或在相应于野生型S蛋白的位置973的位置上的氨基酸置换为P”,或“在相应于野生型S蛋白的位置972的位置上的氨基酸置换为P,并且在相应于野生型S蛋白的位置973的位置上的氨基酸置换为P”。
如本文中所使用的,术语“有效量”是指能够有效实现预期目的的量。例如,预防疾病(例如SARS-CoV-2感染,或由SARS-CoV-2感染所致的疾病COVID-19)有效量是指,能够有效预防,阻止,或延迟疾病(例如SARS-CoV-2感染,或由SARS-CoV-2感染所致的疾病COVID-19)的发生的量。测定这样的有效量在本领域技术人员的能力范围之内。治疗疾病(例如SARS-CoV-2感染,或由SARS-CoV-2感染所致的疾病COVID-19)有效量是指,能够实现所需治疗效果的量,例如实现减轻与待治疗疾病相关的症状的量。
发明的有益效果
冠状病毒S蛋白的变构现象及其呈现多种的不同构象与机体产生的免疫反应相关,对于针对SARS-CoV-2的亚单位疫苗开发策略至关重要。本发明出人意料地获得了呈现早期融合前构象的SARS-CoV-2的S三聚体,该构象区别于已报道的融合前构象,与病人恢复期血清具有良好的反应活性,在小鼠和非人灵长类动物中显示出较高的免疫原性,可作为诊断试剂的检测抗原和疫苗的免疫原,同时,早期融合前构象的结构信息为治疗性药物设计和筛选提供了重要依据,显示了其在SARS-CoV-2诊断、预防和治疗的应用前景和价值。
下面将结合附图和实施例对本发明的实施方案进行详细描述,但是本领域技术人员将理解,下列附图和实施例仅用于说明本发明,而不是对本发明的范围的限定。根据附图和优选实施方案的下列详细描述,本发明的各种目的和有利方面对于本领域技术人员来说将变得显然。
附图说明
图1显示了在本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的SDS聚丙烯酰胺凝胶电泳结果。M:分子量Marker;1:S-WT三聚体蛋白。结果显示,S-WT三聚体蛋白经过亲和层析一步纯化后,纯度约为60%左右。
图2显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的过Superdex 200(高效液相色谱分子筛)纯化并通过SDS聚丙烯酰胺凝胶电泳显示纯度结果。M:分子量Marker;1:S-WT三聚体蛋白过柱前;2:S-WT三聚体蛋白的洗脱峰1。结果显示经过分子筛色谱纯化的S-WT三聚体蛋白纯度达到90%以上。
图3显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白的过Superdex 200(高效液相色谱分子筛)纯化并通过SDS聚丙烯酰胺凝胶电泳显示纯度结果。Marker:分子量Marker;S-2P:S-2P三聚体蛋白过柱前;纯化峰1:S-2P三聚体蛋白洗脱峰1。结果显示经过分子筛色谱纯化的S-2P三聚体蛋白纯度达到90%以上。
图4显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的高效体积排阻色谱(HPSEC)的结果,融合前构象S三聚体蛋白类病毒颗粒的保留体积为5.3mL。
图5显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白的高效体积排阻色谱(HPSEC)的结果,融合前构象S三聚体蛋白类病毒颗粒的保留体积为5.3mL。
图6显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的分析超离的结果,结果显示该蛋白沉降系数约为15.2S,分子量约为577kDa,为三聚体。
图7显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白的分析超离的结果,结果显示该蛋白沉降系数约为14.1S,分子量约为544kDa,为三聚体。
图8显示了本发明实施例的融合前早期构象的S-WT三聚体蛋白的差示扫描量热法(DSC)获得的熔解温度Tm值,约为45.5°和64.5°。
图9显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白的差示扫描量热法(DSC)获得的熔解温度Tm值,约为64.1°。
图10显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT和S-2P三聚体蛋白与恢复期人血清的反应,其中A-F为SARS-CoV-2诊断阳性血清,G和H为健康人血清对照。结果显示恢复期血清对融合前早期构象的S-WT和S-2P三聚体蛋白具有良好的反应活性。
图11显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的冷冻电镜结构,以及其与现有技术中的融合前构象的结构比较,其中A显示本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的侧向视野和顶部视野下的的冻电镜结构照片,B为现有技术中的融合前构象的SARS-CoV-2的S蛋白三聚体的冻电镜结构照片。结果显示本发明实施的融合前早期构象的SARS-CoV-2的S三聚体蛋白呈三倍对称结构,最大高度约为最大宽度约为
图12显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT体蛋白与现有技术中的融合前构象的SARS-CoV-2的S蛋白在结构上的差异。A显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白(蓝色)与现有技术中的融合前构象的SARS-CoV-2的S三聚体蛋白在结构上的差异(灰白色),前者蛋白结构在高度上比后者低约B显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT单体蛋白与现有技术中的融合前构象的SARS-CoV-2的S单体蛋白在结构上的差异,其中现有技术中的融合前构象的S单体蛋白中NTD结构域和RBD结构域分别标记为绿色和淡紫色,本发明实施例的融合前早期构象的S单体蛋白的NTD结构域(红色)和RBD结构域分别标记为红色和黄色,结果显示融合前早期构象中S单体蛋白的NTD结构域和RBD结构域较融合前构象明显不同。
图13显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT单体蛋白的构象信息(A),和现有技术中的融合前构象的SARS-CoV-2的S单体蛋白的构象信息(B)。结果显示,本发明实施例的SARS-CoV-2的S单体蛋白融合前早期构象以及S蛋白的NTD和RBD结构域与现有技术中的S蛋白融合前构象相比具有构象差异,其中NTD结构域质量中心及S2结构域质量中心的连线,和RBD结构域质量中心及及S2结构域质量中心的连线的夹角约为81.4°。NTD结构域质量中心与S2结构域质量中心的垂直距离约为RBD结构域质量中心与S2结构域质量中心的垂直距离约为
图14显示本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的构象与已报道的代表性融合前构象的SARS-CoV-2的S三聚体蛋白的结构差异。结果显示,本发明实施例的融合前早期构象的S-WT三聚体蛋白结构(A)矮于6种列举的冠状病毒融合前构象的S三聚体蛋白:SARS-CoV-2(B)、SARS-CoV(C)、MERS-CoV(D)、HCoV-NL63(E)、IBV(F)及PdCoV(G)。
图15显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT和S-2P三聚体蛋白免疫小鼠的结果。结果显示,S1组的中和滴度低于RBD和S-WT组(高达1.5log)。S-WT三聚体蛋白在两种佐剂下都能产生3.5log的中和抗体滴度(图5(E,F)),表明S-WT三聚体蛋白在两种佐剂中都能产生高免疫原性的中和抗体。低剂量(10μg和1μg)SARS-CV-2的S蛋白的RBD以及本发明实施例的S-WT三聚体蛋白、S-2P三聚体蛋白结合铝佐剂的免疫原性实验的结果表明,所有动物在第2周均出现IgG血清转换,而S-WT和S-SP的滴度随时间增加而增加,RBD滴度维持在较低水平(<2.5log;图15(G))。在第5周,所有小鼠都显示出高的IgG抗体滴度(高达4.5log)。S-WT和S-2P在第3周时的中和滴度则呈剂量依赖性。在第5周、10μg S-WT、S-2P或RBD诱导的中和滴度(约为3.5log)相当,而1μg S-2P诱导的中和滴度显著高于S-WT和RBD(约为1log,图15(H))。这表明三聚体稳定的本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白可以在较低的免疫剂量下提供较高的免疫原性。
图16显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白免疫猴子的结果。结果显示,两只实验猴对SARS-CoV-2 s型特异性IgG和中和抗体均产生了类似的动力学反应。IgG抗体在第2周(第3周)发生血清转换,在第2周(第3周)后一周达到约3.5log,滴度值维持到第5周。第三次接种后,IgG抗体最高达到3.8log(第7周),并在第8周维持水平(图16(A))。中和性抗体反应在第2周可检测到,在第7周达到峰值,平均ID50=29,798(图16(B,D)),与IgG的趋势一致(图16(A))。第8周时平均中和滴度略有下降,至ID50=13856,第三次免疫后公猴的中和滴度明显高于母猴(图16(C,D))。在对照组的猴子身上既没有检测到IgG也没有检测到中和抗体。对COVID-19恢复期的人类血清(n=18)的中和滴度检测的平均结果是ID50=706,表明本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白在猴子中的中和抗体滴定高于恢复期血清的40倍。
序列信息
本发明涉及的部分序列的信息提供于下面的表1中。
表1:序列的描述
具体实施方式
现参照下列意在举例说明本发明(而非限定本发明)的实施例来描述本发明。除非特别指明,本发明中所使用的分子生物学实验方法和免疫检测法,基本上参照J.Sambrook等人,分子克隆:实验室手册,第2版,冷泉港实验室出版社,1989,以及F.M.Ausubel等人,精编分子生物学实验指南,第3版,John Wiley&Sons,Inc.,1995中所述的方法进行;限制性内切酶的使用依照产品制造商推荐的条件。本领域技术人员知晓,实施例以举例方式描述本发明,且不意欲限制本发明所要求保护的范围。
实施例1:融合前早期构象的野生型和突变型S三聚体蛋白的制备
本实施例制备单体序列为SEQ ID NO:6和8的野生型和突变型S三聚体蛋白。
用做模板S三聚体蛋白片段的制备
送生工公司合成编码SEQ ID NO:1所示的野生型S蛋白的核苷酸序列。
早期融合前构象的S三聚体蛋白表达载体的构建
以前一步骤所获得3.6kb大小的编码SEQ ID NO:1所示的野生型S蛋白的核苷酸序列作为PCR反应的模板,以pAc-S-F:5'-GCCTTTGCGGCGGATCTTGGATCCTGTGTTAATCTTACAACCAGA-3'为正向引物,pAc-S-R1:5'-CGTCCTTACGCACGTAAGCCTGACCGTCACGGGGAGCCTCGGGGATGTAACCGGAACCGGGGGAACCGCTTCCACGAGGGACCAATGGCCATTTTATATACTGCTCA-3'为反向引物,通过PCR反应来获得中间片段。
再以中间片段为模板,以pAc-S-F为正向引物,pAc-S-R2:5'-AAGGATCAGATCTGCAGCGGCCGCTTAATGGTGGTGGTGGTGGTGATGGTGATGATGGCCCAGGAAGGTGGACAGCAGCACCCACTCACCGTCCTTACGCACGTAAGC-3'为反向引物,再次进行PCR反应,引入凝血酶酶切位点(即,SEQ ID NO:4所示的凝血酶识别序列-1)、三聚化结构域序列(SEQ ID NO:3)和纯化标签10×His的基因序列,酶切位点为BamHI和NotI。
上述PCR反应在PCR热循环仪(Biometra T3)上按照如下条件进行:
通过上述PCR反应扩增得到3.8kb左右大小特异的DNA片段,将该PCR产物与商售的pAcgp67B载体(BD公司生产)利用Gibson反应液(NEB公司)连接装配,得到pAc-S质粒。经BamHI/NotI酶切鉴定,所得pAc-S质粒中插入的目的片段的核苷酸序列为SEQ ID NO:7所示的核苷酸序列,其编码的氨基酸序列为SEQ ID NO:6所示的氨基酸序列(所对应的蛋白命名为S-WT,其为融合前早期构象的野生型S三聚体蛋白)。取1μL的pAc-S质粒(0.15mg/ml)转化40μL的氯化钙法制备的感受态大肠杆菌ER2566(购自新英格兰生物实验室公司),涂布于卡那霉素(终浓度25mg/mL,下同)抗性的固体LB培养基,37℃静置培养10-12小时至单菌落清晰可辨。挑取单个菌落,挑取单菌落至含4mL卡那霉素抗性的液体LB培养基之试管,37℃220转/分振荡培养10小时,从中取1mL菌液于-70℃冻干保存。采用试剂盒(购自TianGEN,DP214)提取回收pAc-S质粒。
采用相同的方法,制备获得插入了SEQ ID NO:9所示的核苷酸序列(其编码的氨基酸序列为SEQ ID NO:8所示的氨基酸序列,该氨基酸所对应的蛋白命名为S-2P,其为融合前早期构象的突变型S三聚体蛋白)的质粒。所用引物包括:
pAc-S0-F:
5'-CAGACTAATTCTCCTGCAGGCGCAGGAAGTGTAGCTAGTCAATCC-3'
pAc-S0-R:
5'-GGATTGACTAGCTACACTTCCTGCGCCTGCAGGAGAATTAGTCTG-3'
S-2P-F:
5'-ATCCTTTCACGTCTTGACCCACCGGAGGCTGAAGTGCAAATTGA-3'
S-2P-R:
5'-TCAATTTGCACTTCAGCCTCCGGTGGGTCAAGACGTGAAAGGAT-3'。
S三聚体蛋白的大量表达
昆虫细胞的转染
(1)确定sf9细胞(购自Invitrogen,11496-015)或sf21细胞(购自Invitrogen,11497-013)处于对数增长期(1.5-2.5×106/mL),生存率维持在90%以上。在24孔板中加入200μL培养基ESF 921(购自Expression systems,96-001-01)含2%FBS),0.1μgBaculovirus DNA(购自Expression systems,91-002)和1ug pAc-S质粒,混合均匀。在50μLESF921培养基(购自Expression systems,96-001-01)中加入1μL转染试剂(购自Expression systems,95-055-075),混合均匀。将上述两者合成一管,混合均匀,在室温中静置30min。静置过程中清洗细胞(时间快到前进行):待细胞完全贴壁后,用枪吸走培养基,再加入300μL ESF921培养基,动作迅速,以免使细胞失水,轻轻摇摆后,去掉培养基,再加入300μL ESF921培养基。时间到后,将约100μL的上述混合物滴加到每孔细胞中,均匀滴加。然后置于27℃孵育6h,弃去上清,补加500μL完全培养基(50%CCM3+50%TNM-FH(SIGMA-ALDRICH,T1032)+10%FBS)。
(2)收集步骤(1)中所得细胞上清,500g离心5min,去除细胞残骸和碎片,将上清于4℃避光存放,此为P1病毒种液。
(3)杆状病毒的扩增
确定sf9细胞或sf21细胞处于对数增长期(1.5-2.5×106/mL),生存率维持在90%以上。在10cm板上铺8-10mL密度为6×105/mL个细胞。静置15min使细胞贴壁。加入P1病毒液约600μL,均匀滴入。置于27℃孵育3-4d。观察细胞病变现象。收集细胞上清,1000rpm离心5min,去除细胞残骸和碎片,并用0.22um滤膜过滤。将上清于4℃避光存放,此为P2病毒种液。P2病毒滴度约为106-107pfu/mL。P3可依照此方法在250mL摇瓶中按体积放大扩增病毒。
昆虫细胞蛋白表达操作
1L摇瓶中加入250mL ESF921培养基培养的密度为2×106/mL,生存率90%以上的H5细胞(购自Invitrogen,B855-02)。按相应MOI加入病毒量,用封口膜封住瓶口,置于27℃摇床中120rpm培养。每天取出摇瓶中细胞观察并计数,记录相关数据。合适的MOI能保证第一天细胞70%以上病变。第二天细胞全部病变,且生存率80%左右。第三天细胞出现破裂,生存率降低至30-50%。此时可考虑收集细胞。采用10000rpm,离心10min来收集细胞,然后分离上清并进行纯化,得到单体序列为SEQ ID NO:6的野生型S三聚体蛋白S-WT,以及单体序列为SEQ ID NO:8的突变型S三聚体蛋白S-2P。
S三聚体蛋白纯化
利用AKTA系统进行Ni亲和层析纯化;
仪器系统:AKTA Pure型制备型液相色谱仪;
纯化介质:Ni Sepharose 6Fast Flow亲和介质;缓冲液:分为A泵、B泵缓冲液,一般A泵为1×PBS缓冲液(160g/L NaCL,8.1mmol/LNa2HPO4,1.5mmol/L KH2PO4,2.7mmol/LKCL,pH7.4),B泵为1×PBS+250mmol/L咪唑缓冲液;
系统流速:5mL/min;检测波长:UV@280nm
洗脱条件:用50mM咪唑缓冲液(250mmol/L咪唑缓冲液经1×PBS缓冲液稀释获得)洗脱杂蛋白,接着用1×PBS清洗,再用250mM咪唑缓冲液洗脱目的蛋白(S三聚体蛋白);
收集250mM咪唑洗脱的产物,获得纯化的样品10mL。取洗脱产物各50μL,加入6XLoading Buffer 30μL混匀,于80℃水浴10min后取10μl于10%SDS-聚丙烯酰胺凝胶中以120V电压电泳120min。随后以考马斯亮兰染色显示电泳条带,本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的SDS聚丙烯酰胺凝胶电泳结果见图1,其中M:分子量Marker;1:S-WT三聚体蛋白。由电泳结果可知,S-WT三聚体蛋白浓度约为0.2mg/ml,SDS-PAGE考染纯度约为60%左右。
实施例2:S三聚体蛋白的分子筛色谱纯化
仪器系统:GE Healthcare(原Amershan Pharmacia)公司生产的AKTA explorer100型制备型液相色谱系统。
层析介质:Superdex 200。
柱体积:5.5cm×60cm。
缓冲液:20mM磷酸缓冲液pH7.4
流速:1mL/min
检测器波长:280nm。
样品为实施例2中250mM洗脱的样品。
洗脱程序为:收集穿透峰,每个峰都收集,S三聚体蛋白在第一个穿透峰中。
收集Supedex 200穿透的产物,获得纯化的样品5mL。取洗脱产物各50μL,加入6XLoading Buffer 30μL混匀,于80℃水浴10min后取10μl于10%SDS-聚丙烯酰胺凝胶中以120V电压电泳120min。随后以考马斯亮兰染色显示电泳条带,电泳结果见图2和图3,结果显示本发明实施例的SARS-CoV-2的野生型S-WT三聚体蛋白和突变型S-2P三聚体蛋白的纯度均达到90%以上。
实施例3:S三聚体蛋白的高效体积排阻色谱(HPSEC)分析
仪器:Waters。系统流速:G3000PWXL为0.5mL/min。波长190~600nm,柱波长为280nm和254nm。
缓冲液:PBS
操作流程:预先平衡层析柱30~60min至280nm吸收值无明显的变化,将检测器的吸收值归零。编辑层析运行方法,样品先离心,将待分析样品注入100μL样品环中,设置自动上样,运行30min,观察S三聚体的保留体积约为5.3mL。
结果如图4和图5所示,本发明实施例的融合前构象的SARS-CoV-2的野生型S-WT三聚体蛋白和突变型S-2P三聚体蛋白类病毒颗粒的保留体积均为5.3mL。
实施例4:分析超离法计算样品沉降系数
所使用的仪器为Beckman XL-A分析型超速离心机,其配有光学检测系统及An-60Ti转头。
按照操作说明安装样品池,对照池加入400μL的样品缓冲液(与样品统一缓冲液),样品池加入380μL的样品(OD280为0.8左右),配平样品池,重量差在0.1g以内。
将样品池放入An-60Ti转头中,在将转头置于Beckman XL-A型分析超速离心机腔内,装上光路检测器。
参数设定:温度(20℃)、Rmin(6.0cm)、Rmax(7.2cm)、波长(280nm)、步进速度(0.003cm)、扫描模式(continuous)、数据间隔(30sec)和数据的数目(150scans)。设置离心转速为30000rpm。
实验结束后使用SENDTERP软件计算缓冲液的密度和粘度,以及已知蛋白的偏微比容。使用Origin版的Nonlin和SEDFIT软件进行沉降系数的分析,预设球蛋白的摩擦比f/f0=1.2,根据样品蛋白分子量和基本性质设定分析范围,计算分辨率设为100,一般要求RMSD值不大于0.01且残基图波动在0.05以内。结果见图6和图7。
图6和图7显示了本发明实施例的融合前早期构象的SARS-CoV-2的野生型S-WT三聚体蛋白和突变型S-2P三聚体蛋白的分析超离的结果,结果显示S-WT蛋白沉降系数约为15.2S,分子量约为577kDa,为三聚体,S-2P蛋白沉降系数为14.1S,分子量约为554kDa,为三聚体。
实施例5:差式量热扫描法(DSC)检测
所使用的仪器为差示扫描量热仪(VP-Capillary DSC),购自METTLER TOLEDO。
(1)用酸性或弱碱性洗液对上样槽清洗1次,再用去离子水清洗3次;
(2)将样品及其对应缓冲液各吸取500μL于EP管中,离心除气泡;
(3)分别吸取350μL缓冲液于样品槽及对照槽中,润洗3次,再各自加入350μL缓冲液,加样过程动作要轻以免产生气泡;
(4)启动DSC软件和仪器,设置扫描循环次数,当扫描循环中DP值稳定在±0.2之间时即可让其循环扫描,基线平衡不少于3次;
(5)当最后一次循环扫描结束,温度降至30℃-10℃之间时将样品槽中的缓冲液吸出,迅速加入待测蛋白样品,并继续扫描测试;
(6)扫描完成后,电脑会自然生成图表,将样品测试曲线扣除缓冲液曲线后即可获得S蛋白样品的Tm值和曲线。
图8和图9显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的差示扫描量热法(DSC)获得的熔解温度Tm值,约为45.5°和64.5°;S-2P三聚体蛋白的溶解温度Tm约为64.1°,为单峰,稳定性高于S-WT三聚体蛋白。
实施例6:恢复期血清检测反应活性
Western Blot检测(WB)
将等量的蛋白质样品与载入缓冲液混合,煮沸10分钟,按照实验室标准方案载入SDS-PAGE凝胶用于western blotting(BioRad),蛋白在BioRad迷你protean Tetra系统80V下电泳70分钟,凝胶在室温下用考马斯亮蓝R-250(Bio-Rad)染色30分钟。分离的蛋白质使用反式印迹涡轮转移系统(Bio-Rad)转移到硝化纤维素膜上,封闭后,与人血清(1:500稀释)孵育1小时。清洗5次,每次清洗5分钟,以去除未结合的抗体,用碱性磷酸酶结合的山羊抗小鼠二抗或山羊抗人IgG二抗(Invitrogen)孵育膜。再次清洗膜,然后使用化学发光底物试剂盒检测。
ELISA检测
(1)将S-WT或S-2P三聚体蛋白稀释至1μg/mL包被96孔板,每孔100μL,室温静置2h;
(2)洗板1次,用牛血清蛋白稀释液(ED,200μL/孔)室温封闭2h;
(3)洗板1次,将ED将血清稀释100倍加入首孔,2倍系列稀释11个梯度,双孔重复,室温放置1h;
(4)洗板5次,将二抗GAH-HRP(1:5000)加入96孔板,100μL/孔,室温放置1h;
(5)洗板5次,室温显色10min,终止,酶标仪450nm波长处检测;使用GraphPadPrism 5(GraphPad,USA)软件进行数据分析。结果如图10所示。
图10显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT和S-2P三聚体蛋白与恢复期人血清的WB和ELISA反应,其中A-F为SARS-CoV-2诊断阳性血清,G和H为健康人血清对照。结果显示恢复期血清对融合前早期构象的S-WT和S-2P三聚体蛋白具有良好的反应活性。
实施例7:冷冻电镜结构解析及分析
冷冻样品制备
(1)在冷冻样品制样前,先对铜网(R2/2,200目,ThermoFisher Scientific)进行辉光放电亲水处理:
(2)将FEI Mark IV Vitrobot(ThermoFisher Scientific)制样仪内腔设置为低温(6-8℃)、高湿度(100%)的环境;
(3)将制样泡沫盒装满液氮,待中间铜碗冷却后,缓慢注入气态乙烷,通过低温加压使其变成液态;
(4)将2.5-3μL纯化的SARS-CoV-2的S-WT三聚体蛋白样品滴加至已处理的铜网中,吸附5s,滤纸吸水6s后将铜网迅速浸入液态乙烷内,使铜网上样品的水冷冻成一层略高于样品的玻璃态的薄冰层;
(5)将制备好的样品(铜网)保存于液氮中备用。
低温电镜观察病毒样颗粒及数据收集
(1)在液氮的环境中,用镊子将铜网放置于冷冻样品杆的探头上,并快速转移至电镜的样品室;
(2)待电镜(ThermoFisher Scientific,FEI Tecnai F30)腔室真空压力稳定后,随机选取数个视野观察,确认样品的制备质量;
(3)利用FEI的EPU软件自动收集样品数据,在低放大倍数下,寻找并选取颗粒平均分布的样品区域,设置不同水平的欠焦值(-1.5至-3.0μm),接着在放大倍数为93,000×和电子剂量为的条件下进行拍照;
(4)利用FEI Falcon-3电子直接检测器采集movie形式的冷冻电镜数据。
冷冻电镜单颗粒重构
(1)将采集到的数字化图像进行图片筛选,剔除漂移、像散和污染严重的低质量图片;
(2)利用MotionCor2软件对收集的movie形式的电镜图片进行漂移校正;
(3)将高质量的校正后图片用衬度传递函数(contrast transfer function,CTF)校正;
(4)运用EMAN2软件中的Particle Boxer程序挑选图片中的每一个完整的颗粒,确定颗粒在图片中的坐标位置;
(5)运用CryoSPARC v2软件对挑取的蛋白颗粒投影进行reference-free 2Dclassification、3D classification、初始模型构建、3D refine的归类和分析等,最终获得蛋白的三维结构密度图;
三维结构的分析
运用Chimera软件显示本发明实施例的SARS-CoV-2的S-WT蛋白结构进行分析及作图,结果如下:
图11显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的冷冻电镜结构,以及其与现有技术中的融合前构象的SARS-CoV-2的S蛋白三聚体结构比较。A显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的侧向视野和顶部视野下的的冻电镜结构照片。B显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白(灰色)与现有技术中的融合前构象的SARS-CoV-2的S单体蛋白的结构(彩色)比较。结果显示本发明实施的融合前早期构象的SARS-CoV-2的S三聚体蛋白呈三倍对称结构,最大高度约为最大宽度约为
图12显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT体蛋白与现有技术中的融合前构象的SARS-CoV-2的S蛋白在结构上的差异。A显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白(蓝色)与现有技术中的融合前构象的SARS-CoV-2的S三聚体蛋白在结构上的差异(灰白色),前者蛋白结构在高度上比后者低约B显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT单体蛋白与现有技术中的融合前构象的SARS-CoV-2的S单体蛋白在结构上的差异,其中现有技术中的融合前构象的S单体蛋白中NTD结构域和RBD结构域分别标记为绿色和淡紫色,本发明实施例的融合前早期构象的S单体蛋白的NTD结构域(红色)和RBD结构域分别标记为红色和黄色,结果显示融合前早期构象中S单体蛋白的NTD结构域和RBD结构域较融合前构象明显不同。
图13显示了本发明实施例的融合前早期构象的SARS-CoV-2的S-WT单体蛋白的构象信息(左),和现有技术中的融合前构象的SARS-CoV-2的S单体蛋白的构象信息(右)。结果显示,本发明实施例的SARS-CoV-2的S单体蛋白融合前早期构象以及S蛋白的NTD和RBD结构域与现有技术中的S蛋白融合前构象相比具有构象差异,其中NTD结构域质量中心及S2结构域质量中心的连线,和RBD结构域质量中心及及S2结构域质量中心的连线的夹角约为81.4°。NTD结构域质量中心与S2结构域质量中心的垂直距离约为RBD结构域质量中心与S2结构域质量中心的垂直距离约为
图14显示本发明实施例的融合前早期构象的SARS-CoV-2的S-WT三聚体蛋白的构象与已报道的代表性融合前构象的S三聚体蛋白的结构差异,本发明实施例的融合前早期构象的S-WT三聚体蛋白(A)结构矮于6种列举的冠状病毒融合前构象的SARS-CoV-2的S三聚体蛋白:SARS-CoV-2(B)、SARS-CoV(C)、MERS-CoV(D)、HCoV-NL63(E)、IBV(F)及PdCoV(G)。
实施例8:免疫原性评价
本实验方案经厦门大学实验动物管理伦理委员会批准。所有操作都严格按照动物伦理准则和批准的规程进行。
选用6周龄的Balb/C小鼠,将其分为三组,每组5只,每组分别使用SARS-CV-2的S蛋白的RBD(免疫剂量为100μg)、S1(免疫剂量为35μg)以及S-WT三聚体蛋白(免疫剂量为35μg)并结合铝或50%弗氏完全佐剂(Sigma-Aldrich)进行免疫,分别于0周、1周和4周肌肉注射(150μL)小鼠的左或右后肢。分别于第0周、1周、2周、3周、4周和5周采集眼球静脉血,其中第0周、1周和4周在注射前采血。血液样本13000g离心10分钟,得到血清样本在-20℃保存。用终点酶联免疫吸附法和水泡性口炎病毒(VSV)假病毒中和法分别测定抗原特异性IgG和中和抗体滴度,结果如图15所示。
结果显示,S1组的中和滴度低于RBD和S-WT组(高达1.5log)。S-WT在两种佐剂下都能产生3.5log的中和抗体滴度(图15(E,F)),表明S-WT在两种佐剂中都能产生高免疫原性的中和抗体。
此外,我们比较了低剂量(10μg和1μg)SARS-CV-2的S蛋白的RBD以及本发明实施例的S-WT三聚体蛋白、S-2P三聚体蛋白结合铝佐剂的免疫原性,免疫方案同上。结果如图15所示。三针免疫后结果表明,所有动物在第2周均出现IgG血清转换,而S-WT和S-SP的滴度随时间增加而增加,RBD滴度维持在较低水平(<2.5log;图15(G))。在第5周,所有小鼠都显示出高的IgG抗体滴度(高达4.5log)。S-WT和S-2P在第3周时的中和滴度则呈剂量依赖性。在第5周,10μg S-WT、S-2P或RBD诱导的中和滴度(约为3.5log)相当,而1μg S-2P诱导的中和滴度显著高于S-WT和RBD(约为1log,图15(H))。这表明本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白可以在较低的免疫剂量下提供较高的免疫原性。
为评估本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白在非人类灵长类动物中的免疫原性,将未检出抗SARS-CoV-2感染中和抗体的2只食蟹猴(MacacaFascicularis)采用肌肉注射S-2P三聚体蛋白(20μg/dose,铝佐剂)3次(0、2、6周)。每只猴分别采集免疫前血清,免疫后每隔1周采集血清样本,结果如图16所示。
结果显示,两只实验猴对SARS-CoV-2 s型特异性IgG和中和抗体均产生了类似的动力学反应。IgG抗体在第2周(第3周)发生血清转换,在第2周(第3周)后一周达到约3.5log,滴度值维持到第5周。第三次接种后,IgG抗体最高达到3.8log(第7周),并在第8周维持水平(图16(A))。中和性抗体反应在第2周可检测到,在第7周达到峰值,平均ID50=29,798(图16(B,D)),与IgG的趋势一致(图16(A))。第8周时平均中和滴度略有下降,至ID50=13856,第三次免疫后公猴的中和滴度明显高于母猴(图16(C,D))。在对照组的猴子身上既没有检测到IgG也没有检测到中和抗体。对COVID-19恢复期的人类血清(n=18)的中和滴度检测的平均结果是ID50=706,表明本发明实施例的融合前早期构象的SARS-CoV-2的S-2P三聚体蛋白在猴子中的中和抗体滴定高于恢复期血清的40倍。
实施例9:其他具有融合前早期构象的S三聚体蛋白的制备及性能评价
除实施例1获得的S三聚体蛋白(SEQ ID NO:6和SEQ ID NO:8)之外,发明人还通过相同的方法制备获得了下表中所列的多种具有融合前早期构象的S三聚体蛋白,并通过实施例2-8所述的方法对其进行性能评价。这些具有融合前早期构象的S三聚体蛋白与病人恢复期血清具有良好的反应活性,可作为诊断试剂的检测抗原和疫苗的免疫原,显示了其在SARS-CoV-2诊断、预防和治疗的应用前景和价值。
表2:S三聚体蛋白单体所包含的组分
尽管本发明的具体实施方式已经得到详细的描述,但本领域技术人员将理解:根据已经公布的所有教导,可以对细节进行各种修改和变动,并且这些改变均在本发明的保护范围之内。本发明的全部分为由所附权利要求及其任何等同物给出。
SEQUENCE LISTING
<110> 厦门大学
厦门万泰沧海生物技术有限公司
<120> 具有融合前早期构象的SARS-CoV-2 S三聚体蛋白及其应用
<130> IDC210072
<150> CN202010171146.8
<151> 2020-03-12
<160> 18
<170> PatentIn version 3.5
<210> 1
<211> 1199
<212> PRT
<213> Artificial Sequence
<220>
<223> SARS-CoV-2野生型S蛋白氨基酸序列
<400> 1
Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn
1 5 10 15
Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser
20 25 30
Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val
35 40 45
Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg
50 55 60
Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser
65 70 75 80
Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu
85 90 95
Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val
100 105 110
Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly
115 120 125
Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg
130 135 140
Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro
145 150 155 160
Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg
165 170 175
Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys
180 185 190
His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala
195 200 205
Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe
210 215 220
Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser
225 230 235 240
Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu
245 250 255
Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr
260 265 270
Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr
275 280 285
Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe
290 295 300
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
305 310 315 320
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
325 330 335
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
340 345 350
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
355 360 365
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
370 375 380
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
385 390 395 400
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
405 410 415
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
420 425 430
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
435 440 445
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
450 455 460
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
465 470 475 480
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
530 535 540
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
545 550 555 560
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
580 585 590
Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
610 615 620
Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser
660 665 670
Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
675 680 685
Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
690 695 700
Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
705 710 715 720
Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
725 730 735
Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
740 745 750
Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
755 760 765
Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
770 775 780
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
785 790 795 800
Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
805 810 815
Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
820 825 830
Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
835 840 845
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
850 855 860
Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
865 870 875 880
Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
885 890 895
Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
900 905 910
Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala
915 920 925
Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala
930 935 940
Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser
945 950 955 960
Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu
965 970 975
Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr
980 985 990
Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala
995 1000 1005
Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser
1010 1015 1020
Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe
1025 1030 1035
Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
1040 1045 1050
Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys
1055 1060 1065
His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser
1070 1075 1080
Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro
1085 1090 1095
Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp
1100 1105 1110
Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
1115 1120 1125
Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys
1130 1135 1140
Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile
1145 1150 1155
Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1160 1165 1170
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1175 1180 1185
Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro
1190 1195
<210> 2
<211> 1199
<212> PRT
<213> Artificial Sequence
<220>
<223> SARS-CoV-2突变型S蛋白氨基酸序列
<400> 2
Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn
1 5 10 15
Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser
20 25 30
Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val
35 40 45
Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg
50 55 60
Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser
65 70 75 80
Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu
85 90 95
Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val
100 105 110
Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly
115 120 125
Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg
130 135 140
Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro
145 150 155 160
Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg
165 170 175
Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys
180 185 190
His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala
195 200 205
Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe
210 215 220
Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser
225 230 235 240
Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu
245 250 255
Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr
260 265 270
Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr
275 280 285
Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe
290 295 300
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
305 310 315 320
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
325 330 335
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
340 345 350
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
355 360 365
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
370 375 380
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
385 390 395 400
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
405 410 415
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
420 425 430
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
435 440 445
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
450 455 460
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
465 470 475 480
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
530 535 540
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
545 550 555 560
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
580 585 590
Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
610 615 620
Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Ala Gly Ala Gly Ser
660 665 670
Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
675 680 685
Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
690 695 700
Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
705 710 715 720
Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
725 730 735
Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
740 745 750
Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
755 760 765
Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
770 775 780
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
785 790 795 800
Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
805 810 815
Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
820 825 830
Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
835 840 845
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
850 855 860
Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
865 870 875 880
Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
885 890 895
Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
900 905 910
Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala
915 920 925
Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala
930 935 940
Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser
945 950 955 960
Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro Pro Glu Ala Glu
965 970 975
Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr
980 985 990
Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala
995 1000 1005
Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser
1010 1015 1020
Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe
1025 1030 1035
Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
1040 1045 1050
Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys
1055 1060 1065
His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser
1070 1075 1080
Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro
1085 1090 1095
Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp
1100 1105 1110
Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
1115 1120 1125
Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys
1130 1135 1140
Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile
1145 1150 1155
Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1160 1165 1170
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1175 1180 1185
Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro
1190 1195
<210> 3
<211> 33
<212> PRT
<213> Artificial Sequence
<220>
<223> 三聚化结构域序列
<400> 3
Gly Ser Pro Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln
1 5 10 15
Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu
20 25 30
Gly
<210> 4
<211> 6
<212> PRT
<213> Artificial Sequence
<220>
<223> 凝血酶识别序列-1
<400> 4
Leu Val Pro Arg Gly Ser
1 5
<210> 5
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> 凝血酶识别序列-2
<400> 5
Ser Gly Arg Leu Val Pro Arg
1 5
<210> 6
<211> 1248
<212> PRT
<213> Artificial Sequence
<220>
<223> 融合前早期构象的野生型S三聚体蛋白的氨基酸序列,S-WT
<400> 6
Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn
1 5 10 15
Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser
20 25 30
Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val
35 40 45
Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg
50 55 60
Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser
65 70 75 80
Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu
85 90 95
Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val
100 105 110
Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly
115 120 125
Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg
130 135 140
Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro
145 150 155 160
Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg
165 170 175
Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys
180 185 190
His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala
195 200 205
Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe
210 215 220
Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser
225 230 235 240
Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu
245 250 255
Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr
260 265 270
Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr
275 280 285
Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe
290 295 300
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
305 310 315 320
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
325 330 335
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
340 345 350
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
355 360 365
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
370 375 380
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
385 390 395 400
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
405 410 415
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
420 425 430
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
435 440 445
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
450 455 460
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
465 470 475 480
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
530 535 540
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
545 550 555 560
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
580 585 590
Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
610 615 620
Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser
660 665 670
Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
675 680 685
Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
690 695 700
Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
705 710 715 720
Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
725 730 735
Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
740 745 750
Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
755 760 765
Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
770 775 780
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
785 790 795 800
Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
805 810 815
Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
820 825 830
Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
835 840 845
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
850 855 860
Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
865 870 875 880
Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
885 890 895
Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
900 905 910
Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala
915 920 925
Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala
930 935 940
Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser
945 950 955 960
Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu
965 970 975
Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr
980 985 990
Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala
995 1000 1005
Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser
1010 1015 1020
Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe
1025 1030 1035
Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
1040 1045 1050
Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys
1055 1060 1065
His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser
1070 1075 1080
Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro
1085 1090 1095
Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp
1100 1105 1110
Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
1115 1120 1125
Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys
1130 1135 1140
Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile
1145 1150 1155
Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1160 1165 1170
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1175 1180 1185
Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg
1190 1195 1200
Gly Ser Gly Ser Pro Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg
1205 1210 1215
Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu
1220 1225 1230
Ser Thr Phe Leu Gly His His His His His His His His His His
1235 1240 1245
<210> 7
<211> 3747
<212> DNA
<213> Artificial Sequence
<220>
<223> 融合前早期构象的野生型S三聚体蛋白的核苷酸序列
<400> 7
tgtgttaatc ttacaaccag aactcaatta ccccctgcat acactaattc tttcacacgt 60
ggtgtttatt accctgacaa agttttcaga tcctcagttt tacattcaac tcaggacttg 120
ttcttacctt tcttttccaa tgttacttgg ttccatgcta tacatgtctc tgggaccaat 180
ggtactaaga ggtttgataa ccctgtccta ccatttaatg atggtgttta ttttgcttcc 240
actgagaagt ctaacataat aagaggctgg atttttggta ctactttaga ttcgaagacc 300
cagtccctac ttattgttaa taacgctact aatgttgtta ttaaagtctg tgaatttcaa 360
ttttgtaatg atccattttt gggtgtttat taccacaaaa acaacaaaag ttggatggaa 420
agtgagttca gagtttattc tagtgcgaat aattgcactt ttgaatatgt ctctcagcct 480
tttcttatgg accttgaagg aaaacagggt aatttcaaaa atcttaggga atttgtgttt 540
aagaatattg atggttattt taaaatatat tctaagcaca cgcctattaa tttagtgcgt 600
gatctccctc agggtttttc ggctttagaa ccattggtag atttgccaat aggtattaac 660
atcactaggt ttcaaacttt acttgcttta catagaagtt atttgactcc tggtgattct 720
tcttcaggtt ggacagctgg tgctgcagct tattatgtgg gttatcttca acctaggact 780
tttctattaa aatataatga aaatggaacc attacagatg ctgtagactg tgcacttgac 840
cctctctcag aaacaaagtg tacgttgaaa tccttcactg tagaaaaagg aatctatcaa 900
acttctaact ttagagtcca accaacagaa tctattgtta gatttcctaa tattacaaac 960
ttgtgccctt ttggtgaagt ttttaacgcc accagatttg catctgttta tgcttggaac 1020
aggaagagaa tcagcaactg tgttgctgat tattctgtcc tatataattc cgcatcattt 1080
tccactttta agtgttatgg agtgtctcct actaaattaa atgatctctg ctttactaat 1140
gtctatgcag attcatttgt aattagaggt gatgaagtca gacaaatcgc tccagggcaa 1200
actggaaaga ttgctgatta taattataaa ttaccagatg attttacagg ctgcgttata 1260
gcttggaatt ctaacaatct tgattctaag gttggtggta attataatta cctgtataga 1320
ttgtttagga agtctaatct caaacctttt gagagagata tttcaactga aatctatcag 1380
gccggtagca caccttgtaa tggtgttgaa ggttttaatt gttactttcc tttacaatca 1440
tatggtttcc aacccactaa tggtgttggt taccaaccat acagagtagt agtactttct 1500
tttgaacttc tacatgcacc agcaactgtt tgtggaccta aaaagtctac taatttggtt 1560
aaaaacaaat gtgtcaattt caacttcaat ggtttaacag gcacaggtgt tcttactgag 1620
tctaacaaaa agtttctgcc tttccaacaa tttggcagag acattgctga cactactgat 1680
gctgtccgtg atccacagac acttgagatt cttgacatta caccatgttc ttttggtggt 1740
gtcagtgtta taacaccagg aacaaatact tctaaccagg ttgctgttct ttatcaggat 1800
gttaactgca cagaagtccc tgttgctatt catgcagatc aacttactcc tacttggcgt 1860
gtttattcta caggttctaa tgtttttcaa acacgtgcag gctgtttaat aggggctgaa 1920
catgtcaaca actcatatga gtgtgacata cccattggtg caggtatatg cgctagttat 1980
cagactcaga ctaattctcc tcggcgggca cgtagtgtag ctagtcaatc catcattgcc 2040
tacactatgt cacttggtgc agaaaattca gttgcttact ctaataactc tattgccata 2100
cccacaaatt ttactattag tgttaccaca gaaattctac cagtgtctat gaccaagaca 2160
tcagtagatt gtacaatgta catttgtggt gattcaactg aatgcagcaa tcttttgttg 2220
caatatggca gtttttgtac acaattaaac cgtgctttaa ctggaatagc tgttgaacaa 2280
gacaaaaaca cccaagaagt ttttgcacaa gtcaaacaaa tttacaaaac accaccaatt 2340
aaagattttg gtggttttaa tttttcacaa atattaccag atccatcaaa accaagcaag 2400
aggtcattta ttgaagatct acttttcaac aaagtgacac ttgcagatgc tggcttcatc 2460
aaacaatatg gtgattgcct tggtgatatt gctgctagag acctcatttg tgcacaaaag 2520
tttaacggcc ttactgtttt gccacctttg ctcacagatg aaatgattgc tcaatacact 2580
tctgcactgt tagcgggtac aatcacttct ggttggacct ttggtgcagg tgctgcatta 2640
caaataccat ttgctatgca aatggcttat aggtttaatg gtattggagt tacacagaat 2700
gttctctatg agaaccaaaa attgattgcc aaccaattta atagtgctat tggcaaaatt 2760
caagactcac tttcttccac agcaagtgca cttggaaaac ttcaagatgt ggtcaaccaa 2820
aatgcacaag ctttaaacac gcttgttaaa caacttagct ccaattttgg tgcaatttca 2880
agtgttttaa atgatatcct ttcacgtctt gacaaagttg aggctgaagt gcaaattgat 2940
aggttgatca caggcagact tcaaagtttg cagacatatg tgactcaaca attaattaga 3000
gctgcagaaa tcagagcttc tgctaatctt gctgctacta aaatgtcaga gtgtgtactt 3060
ggacaatcaa aaagagttga tttttgtgga aagggctatc atcttatgtc cttccctcag 3120
tcagcacctc atggtgtagt cttcttgcat gtgacttatg tccctgcaca agaaaagaac 3180
ttcacaactg ctcctgccat ttgtcatgat ggaaaagcac actttcctcg tgaaggtgtc 3240
tttgtttcaa atggcacaca ctggtttgta acacaaagga atttttatga accacaaatc 3300
attactacag acaacacatt tgtgtctggt aactgtgatg ttgtaatagg aattgtcaac 3360
aacacagttt atgatccttt gcaacctgaa ttagactcat tcaaggagga gttagataaa 3420
tattttaaga atcatacatc accagatgtt gatttaggtg acatctctgg cattaatgct 3480
tcagttgtaa acattcaaaa agaaattgac cgcctcaatg aggttgccaa gaatttaaat 3540
gaatctctca tcgatctcca agaacttgga aagtatgagc agtatataaa atggccattg 3600
gtccctcgtg gaagcggttc ccccggttcc ggttacatcc ccgaggctcc ccgtgacggt 3660
caggcttacg tgcgtaagga cggtgagtgg gtgctgctgt ccaccttcct gggccatcat 3720
caccatcacc accaccacca ccattaa 3747
<210> 8
<211> 1248
<212> PRT
<213> Artificial Sequence
<220>
<223> 融合前早期构象的突变型S三聚体蛋白的氨基酸序列,S-2P
<400> 8
Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn
1 5 10 15
Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser
20 25 30
Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val
35 40 45
Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg
50 55 60
Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser
65 70 75 80
Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu
85 90 95
Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val
100 105 110
Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly
115 120 125
Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg
130 135 140
Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro
145 150 155 160
Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg
165 170 175
Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys
180 185 190
His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala
195 200 205
Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe
210 215 220
Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser
225 230 235 240
Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu
245 250 255
Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr
260 265 270
Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr
275 280 285
Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe
290 295 300
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
305 310 315 320
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
325 330 335
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
340 345 350
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
355 360 365
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
370 375 380
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
385 390 395 400
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
405 410 415
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
420 425 430
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
435 440 445
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
450 455 460
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
465 470 475 480
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
530 535 540
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
545 550 555 560
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
580 585 590
Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
610 615 620
Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Ala Gly Ala Gly Ser
660 665 670
Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
675 680 685
Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
690 695 700
Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
705 710 715 720
Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
725 730 735
Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
740 745 750
Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
755 760 765
Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
770 775 780
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
785 790 795 800
Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
805 810 815
Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
820 825 830
Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
835 840 845
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
850 855 860
Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
865 870 875 880
Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
885 890 895
Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
900 905 910
Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala
915 920 925
Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala
930 935 940
Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser
945 950 955 960
Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro Pro Glu Ala Glu
965 970 975
Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr
980 985 990
Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala
995 1000 1005
Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser
1010 1015 1020
Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe
1025 1030 1035
Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
1040 1045 1050
Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys
1055 1060 1065
His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser
1070 1075 1080
Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro
1085 1090 1095
Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp
1100 1105 1110
Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
1115 1120 1125
Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys
1130 1135 1140
Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile
1145 1150 1155
Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1160 1165 1170
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1175 1180 1185
Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg
1190 1195 1200
Gly Ser Gly Ser Pro Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg
1205 1210 1215
Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu
1220 1225 1230
Ser Thr Phe Leu Gly His His His His His His His His His His
1235 1240 1245
<210> 9
<211> 3747
<212> DNA
<213> Artificial Sequence
<220>
<223> 融合前早期构象的突变型S三聚体蛋白的核苷酸序列
<400> 9
tgtgttaatc ttacaaccag aactcaatta ccccctgcat acactaattc tttcacacgt 60
ggtgtttatt accctgacaa agttttcaga tcctcagttt tacattcaac tcaggacttg 120
ttcttacctt tcttttccaa tgttacttgg ttccatgcta tacatgtctc tgggaccaat 180
ggtactaaga ggtttgataa ccctgtccta ccatttaatg atggtgttta ttttgcttcc 240
actgagaagt ctaacataat aagaggctgg atttttggta ctactttaga ttcgaagacc 300
cagtccctac ttattgttaa taacgctact aatgttgtta ttaaagtctg tgaatttcaa 360
ttttgtaatg atccattttt gggtgtttat taccacaaaa acaacaaaag ttggatggaa 420
agtgagttca gagtttattc tagtgcgaat aattgcactt ttgaatatgt ctctcagcct 480
tttcttatgg accttgaagg aaaacagggt aatttcaaaa atcttaggga atttgtgttt 540
aagaatattg atggttattt taaaatatat tctaagcaca cgcctattaa tttagtgcgt 600
gatctccctc agggtttttc ggctttagaa ccattggtag atttgccaat aggtattaac 660
atcactaggt ttcaaacttt acttgcttta catagaagtt atttgactcc tggtgattct 720
tcttcaggtt ggacagctgg tgctgcagct tattatgtgg gttatcttca acctaggact 780
tttctattaa aatataatga aaatggaacc attacagatg ctgtagactg tgcacttgac 840
cctctctcag aaacaaagtg tacgttgaaa tccttcactg tagaaaaagg aatctatcaa 900
acttctaact ttagagtcca accaacagaa tctattgtta gatttcctaa tattacaaac 960
ttgtgccctt ttggtgaagt ttttaacgcc accagatttg catctgttta tgcttggaac 1020
aggaagagaa tcagcaactg tgttgctgat tattctgtcc tatataattc cgcatcattt 1080
tccactttta agtgttatgg agtgtctcct actaaattaa atgatctctg ctttactaat 1140
gtctatgcag attcatttgt aattagaggt gatgaagtca gacaaatcgc tccagggcaa 1200
actggaaaga ttgctgatta taattataaa ttaccagatg attttacagg ctgcgttata 1260
gcttggaatt ctaacaatct tgattctaag gttggtggta attataatta cctgtataga 1320
ttgtttagga agtctaatct caaacctttt gagagagata tttcaactga aatctatcag 1380
gccggtagca caccttgtaa tggtgttgaa ggttttaatt gttactttcc tttacaatca 1440
tatggtttcc aacccactaa tggtgttggt taccaaccat acagagtagt agtactttct 1500
tttgaacttc tacatgcacc agcaactgtt tgtggaccta aaaagtctac taatttggtt 1560
aaaaacaaat gtgtcaattt caacttcaat ggtttaacag gcacaggtgt tcttactgag 1620
tctaacaaaa agtttctgcc tttccaacaa tttggcagag acattgctga cactactgat 1680
gctgtccgtg atccacagac acttgagatt cttgacatta caccatgttc ttttggtggt 1740
gtcagtgtta taacaccagg aacaaatact tctaaccagg ttgctgttct ttatcaggat 1800
gttaactgca cagaagtccc tgttgctatt catgcagatc aacttactcc tacttggcgt 1860
gtttattcta caggttctaa tgtttttcaa acacgtgcag gctgtttaat aggggctgaa 1920
catgtcaaca actcatatga gtgtgacata cccattggtg caggtatatg cgctagttat 1980
cagactcaga ctaattctcc tgcaggcgca ggaagtgtag ctagtcaatc catcattgcc 2040
tacactatgt cacttggtgc agaaaattca gttgcttact ctaataactc tattgccata 2100
cccacaaatt ttactattag tgttaccaca gaaattctac cagtgtctat gaccaagaca 2160
tcagtagatt gtacaatgta catttgtggt gattcaactg aatgcagcaa tcttttgttg 2220
caatatggca gtttttgtac acaattaaac cgtgctttaa ctggaatagc tgttgaacaa 2280
gacaaaaaca cccaagaagt ttttgcacaa gtcaaacaaa tttacaaaac accaccaatt 2340
aaagattttg gtggttttaa tttttcacaa atattaccag atccatcaaa accaagcaag 2400
aggtcattta ttgaagatct acttttcaac aaagtgacac ttgcagatgc tggcttcatc 2460
aaacaatatg gtgattgcct tggtgatatt gctgctagag acctcatttg tgcacaaaag 2520
tttaacggcc ttactgtttt gccacctttg ctcacagatg aaatgattgc tcaatacact 2580
tctgcactgt tagcgggtac aatcacttct ggttggacct ttggtgcagg tgctgcatta 2640
caaataccat ttgctatgca aatggcttat aggtttaatg gtattggagt tacacagaat 2700
gttctctatg agaaccaaaa attgattgcc aaccaattta atagtgctat tggcaaaatt 2760
caagactcac tttcttccac agcaagtgca cttggaaaac ttcaagatgt ggtcaaccaa 2820
aatgcacaag ctttaaacac gcttgttaaa caacttagct ccaattttgg tgcaatttca 2880
agtgttttaa atgatatcct ttcacgtctt gacccaccgg aggctgaagt gcaaattgat 2940
aggttgatca caggcagact tcaaagtttg cagacatatg tgactcaaca attaattaga 3000
gctgcagaaa tcagagcttc tgctaatctt gctgctacta aaatgtcaga gtgtgtactt 3060
ggacaatcaa aaagagttga tttttgtgga aagggctatc atcttatgtc cttccctcag 3120
tcagcacctc atggtgtagt cttcttgcat gtgacttatg tccctgcaca agaaaagaac 3180
ttcacaactg ctcctgccat ttgtcatgat ggaaaagcac actttcctcg tgaaggtgtc 3240
tttgtttcaa atggcacaca ctggtttgta acacaaagga atttttatga accacaaatc 3300
attactacag acaacacatt tgtgtctggt aactgtgatg ttgtaatagg aattgtcaac 3360
aacacagttt atgatccttt gcaacctgaa ttagactcat tcaaggagga gttagataaa 3420
tattttaaga atcatacatc accagatgtt gatttaggtg acatctctgg cattaatgct 3480
tcagttgtaa acattcaaaa agaaattgac cgcctcaatg aggttgccaa gaatttaaat 3540
gaatctctca tcgatctcca agaacttgga aagtatgagc agtatataaa atggccattg 3600
gtccctcgtg gaagcggttc ccccggttcc ggttacatcc ccgaggctcc ccgtgacggt 3660
caggcttacg tgcgtaagga cggtgagtgg gtgctgctgt ccaccttcct gggccatcat 3720
caccatcacc accaccacca ccattaa 3747
<210> 10
<211> 190
<212> PRT
<213> Artificial Sequence
<220>
<223> CRMA
<400> 10
Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn
1 5 10 15
Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln
20 25 30
Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp
35 40 45
Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly
50 55 60
Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val
65 70 75 80
Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val
85 90 95
Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu
100 105 110
Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly
115 120 125
Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser
130 135 140
Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
145 150 155 160
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp
165 170 175
Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg
180 185 190
<210> 11
<211> 389
<212> PRT
<213> Artificial Sequence
<220>
<223> CRM389
<400> 11
Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn
1 5 10 15
Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln
20 25 30
Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp
35 40 45
Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly
50 55 60
Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val
65 70 75 80
Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val
85 90 95
Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu
100 105 110
Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly
115 120 125
Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser
130 135 140
Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
145 150 155 160
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp
165 170 175
Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg
180 185 190
Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp Trp Asp Val
195 200 205
Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu Lys Glu His Gly
210 215 220
Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn Lys Thr Val Ser Glu
225 230 235 240
Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe His Gln Thr Ala Leu Glu
245 250 255
His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly Thr Asn Pro Val
260 265 270
Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn Val Ala Gln Val
275 280 285
Ile Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr Thr Ala Ala Leu
290 295 300
Ser Ile Leu Pro Gly Ile Gly Ser Val Met Gly Ile Ala Asp Gly Ala
305 310 315 320
Val His His Asn Thr Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser
325 330 335
Ser Leu Met Val Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp
340 345 350
Ile Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe
355 360 365
Gln Val Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His
370 375 380
Lys Thr Gln Pro Phe
385
<210> 12
<211> 535
<212> PRT
<213> Artificial Sequence
<220>
<223> CRM197
<400> 12
Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn
1 5 10 15
Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln
20 25 30
Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp
35 40 45
Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly
50 55 60
Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val
65 70 75 80
Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val
85 90 95
Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu
100 105 110
Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly
115 120 125
Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser
130 135 140
Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
145 150 155 160
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp
165 170 175
Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg
180 185 190
Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp Trp Asp Val
195 200 205
Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu Lys Glu His Gly
210 215 220
Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn Lys Thr Val Ser Glu
225 230 235 240
Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe His Gln Thr Ala Leu Glu
245 250 255
His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly Thr Asn Pro Val
260 265 270
Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn Val Ala Gln Val
275 280 285
Ile Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr Thr Ala Ala Leu
290 295 300
Ser Ile Leu Pro Gly Ile Gly Ser Val Met Gly Ile Ala Asp Gly Ala
305 310 315 320
Val His His Asn Thr Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser
325 330 335
Ser Leu Met Val Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp
340 345 350
Ile Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe
355 360 365
Gln Val Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His
370 375 380
Lys Thr Gln Pro Phe Leu His Asp Gly Tyr Ala Val Ser Trp Asn Thr
385 390 395 400
Val Glu Asp Ser Ile Ile Arg Thr Gly Phe Gln Gly Glu Ser Gly His
405 410 415
Asp Ile Lys Ile Thr Ala Glu Asn Thr Pro Leu Pro Ile Ala Gly Val
420 425 430
Leu Leu Pro Thr Ile Pro Gly Lys Leu Asp Val Asn Lys Ser Lys Thr
435 440 445
His Ile Ser Val Asn Gly Arg Lys Ile Arg Met Arg Cys Arg Ala Ile
450 455 460
Asp Gly Asp Val Thr Phe Cys Arg Pro Lys Ser Pro Val Tyr Val Gly
465 470 475 480
Asn Gly Val His Ala Asn Leu His Val Ala Phe His Arg Ser Ser Ser
485 490 495
Glu Lys Ile His Ser Asn Glu Ile Ser Ser Asp Ser Ile Gly Val Leu
500 505 510
Gly Tyr Gln Lys Thr Val Asp His Thr Lys Val Asn Ser Lys Leu Ser
515 520 525
Leu Phe Phe Glu Ile Lys Ser
530 535
<210> 13
<211> 1453
<212> PRT
<213> Artificial Sequence
<220>
<223> CRMA-S(N端融合)氨基酸序列
<400> 13
Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn
1 5 10 15
Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln
20 25 30
Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp
35 40 45
Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly
50 55 60
Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val
65 70 75 80
Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val
85 90 95
Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu
100 105 110
Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly
115 120 125
Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser
130 135 140
Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
145 150 155 160
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp
165 170 175
Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Gly Gly
180 185 190
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Cys Val Asn
195 200 205
Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe Thr
210 215 220
Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu His
225 230 235 240
Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp Phe
245 250 255
His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp Asn
260 265 270
Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu Lys
275 280 285
Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser Lys
290 295 300
Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile Lys
305 310 315 320
Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr Tyr
325 330 335
His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr Ser
340 345 350
Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu Met
355 360 365
Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe Val
370 375 380
Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr Pro
385 390 395 400
Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu Pro
405 410 415
Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr Leu
420 425 430
Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser Gly
435 440 445
Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro Arg
450 455 460
Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala Val
465 470 475 480
Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser
485 490 495
Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln
500 505 510
Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro
515 520 525
Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp
530 535 540
Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr
545 550 555 560
Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr
565 570 575
Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val
580 585 590
Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys
595 600 605
Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val
610 615 620
Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr
625 630 635 640
Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu
645 650 655
Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn
660 665 670
Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe
675 680 685
Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu
690 695 700
Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys
705 710 715 720
Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly
725 730 735
Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro
740 745 750
Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg
755 760 765
Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe Gly
770 775 780
Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val Ala
785 790 795 800
Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile His
805 810 815
Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser Asn
820 825 830
Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val Asn
835 840 845
Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser
850 855 860
Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala Ser
865 870 875 880
Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser Val
885 890 895
Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser
900 905 910
Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp
915 920 925
Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu
930 935 940
Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly
945 950 955 960
Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val
965 970 975
Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn
980 985 990
Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe
995 1000 1005
Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
1010 1015 1020
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg
1025 1030 1035
Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
1040 1045 1050
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu
1055 1060 1065
Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala
1070 1075 1080
Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn
1085 1090 1095
Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu
1100 1105 1110
Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser
1115 1120 1125
Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val
1130 1135 1140
Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser
1145 1150 1155
Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser
1160 1165 1170
Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile
1175 1180 1185
Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu
1190 1195 1200
Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr
1205 1210 1215
Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe
1220 1225 1230
Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro
1235 1240 1245
His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1250 1255 1260
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala
1265 1270 1275
His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp
1280 1285 1290
Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr
1295 1300 1305
Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile
1310 1315 1320
Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser
1325 1330 1335
Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro
1340 1345 1350
Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val
1355 1360 1365
Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn
1370 1375 1380
Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu
1385 1390 1395
Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg Gly Ser Gly Ser Pro
1400 1405 1410
Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr
1415 1420 1425
Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly
1430 1435 1440
His His His His His His His His His His
1445 1450
<210> 14
<211> 1453
<212> PRT
<213> Artificial Sequence
<220>
<223> S-CRMA(C端融合)氨基酸序列
<400> 14
Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn
1 5 10 15
Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser
20 25 30
Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val
35 40 45
Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg
50 55 60
Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser
65 70 75 80
Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu
85 90 95
Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val
100 105 110
Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly
115 120 125
Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg
130 135 140
Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro
145 150 155 160
Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg
165 170 175
Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys
180 185 190
His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala
195 200 205
Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe
210 215 220
Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser
225 230 235 240
Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu
245 250 255
Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr
260 265 270
Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr
275 280 285
Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe
290 295 300
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
305 310 315 320
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
325 330 335
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
340 345 350
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
355 360 365
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
370 375 380
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
385 390 395 400
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
405 410 415
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
420 425 430
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
435 440 445
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
450 455 460
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
465 470 475 480
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
530 535 540
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
545 550 555 560
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
580 585 590
Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
610 615 620
Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser
660 665 670
Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
675 680 685
Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
690 695 700
Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
705 710 715 720
Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
725 730 735
Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
740 745 750
Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
755 760 765
Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
770 775 780
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
785 790 795 800
Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
805 810 815
Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
820 825 830
Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
835 840 845
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
850 855 860
Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
865 870 875 880
Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
885 890 895
Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
900 905 910
Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala
915 920 925
Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala
930 935 940
Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser
945 950 955 960
Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu
965 970 975
Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr
980 985 990
Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala
995 1000 1005
Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser
1010 1015 1020
Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe
1025 1030 1035
Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
1040 1045 1050
Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys
1055 1060 1065
His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser
1070 1075 1080
Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro
1085 1090 1095
Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp
1100 1105 1110
Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
1115 1120 1125
Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys
1130 1135 1140
Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile
1145 1150 1155
Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1160 1165 1170
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1175 1180 1185
Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg
1190 1195 1200
Gly Ser Gly Ser Pro Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg
1205 1210 1215
Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu
1220 1225 1230
Ser Thr Phe Leu Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1235 1240 1245
Gly Gly Gly Gly Ser Gly Ala Asp Asp Val Val Asp Ser Ser Lys
1250 1255 1260
Ser Phe Val Met Glu Asn Phe Ser Ser Tyr His Gly Thr Lys Pro
1265 1270 1275
Gly Tyr Val Asp Ser Ile Gln Lys Gly Ile Gln Lys Pro Lys Ser
1280 1285 1290
Gly Thr Gln Gly Asn Tyr Asp Asp Asp Trp Lys Glu Phe Tyr Ser
1295 1300 1305
Thr Asp Asn Lys Tyr Asp Ala Ala Gly Tyr Ser Val Asp Asn Glu
1310 1315 1320
Asn Pro Leu Ser Gly Lys Ala Gly Gly Val Val Lys Val Thr Tyr
1325 1330 1335
Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val Asp Asn Ala Glu
1340 1345 1350
Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu Pro Leu Met
1355 1360 1365
Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly Asp Gly
1370 1375 1380
Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser Ser
1385 1390 1395
Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
1400 1405 1410
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln
1415 1420 1425
Asp Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg
1430 1435 1440
His His His His His His His His His His
1445 1450
<210> 15
<211> 1652
<212> PRT
<213> Artificial Sequence
<220>
<223> CRM389-S(N端融合)氨基酸序列
<400> 15
Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn
1 5 10 15
Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln
20 25 30
Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp
35 40 45
Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly
50 55 60
Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val
65 70 75 80
Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val
85 90 95
Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu
100 105 110
Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly
115 120 125
Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser
130 135 140
Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
145 150 155 160
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp
165 170 175
Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg
180 185 190
Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp Trp Asp Val
195 200 205
Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu Lys Glu His Gly
210 215 220
Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn Lys Thr Val Ser Glu
225 230 235 240
Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe His Gln Thr Ala Leu Glu
245 250 255
His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly Thr Asn Pro Val
260 265 270
Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn Val Ala Gln Val
275 280 285
Ile Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr Thr Ala Ala Leu
290 295 300
Ser Ile Leu Pro Gly Ile Gly Ser Val Met Gly Ile Ala Asp Gly Ala
305 310 315 320
Val His His Asn Thr Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser
325 330 335
Ser Leu Met Val Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp
340 345 350
Ile Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe
355 360 365
Gln Val Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His
370 375 380
Lys Thr Gln Pro Phe Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
385 390 395 400
Gly Gly Gly Ser Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro
405 410 415
Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val
420 425 430
Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe
435 440 445
Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn
450 455 460
Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val
465 470 475 480
Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe
485 490 495
Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn
500 505 510
Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp
515 520 525
Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu
530 535 540
Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr
545 550 555 560
Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe
565 570 575
Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys
580 585 590
Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln
595 600 605
Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn
610 615 620
Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr
625 630 635 640
Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr
645 650 655
Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn
660 665 670
Gly Thr Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu
675 680 685
Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln
690 695 700
Thr Ser Asn Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro
705 710 715 720
Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg
725 730 735
Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val
740 745 750
Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys
755 760 765
Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn
770 775 780
Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile
785 790 795 800
Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro
805 810 815
Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp
820 825 830
Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys
835 840 845
Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln
850 855 860
Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe
865 870 875 880
Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln
885 890 895
Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala
900 905 910
Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys
915 920 925
Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu
930 935 940
Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala
945 950 955 960
Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp
965 970 975
Ile Thr Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr
980 985 990
Asn Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr
995 1000 1005
Glu Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp
1010 1015 1020
Arg Val Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly
1025 1030 1035
Cys Leu Ile Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp
1040 1045 1050
Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr
1055 1060 1065
Asn Ser Pro Arg Arg Ala Arg Ser Val Ala Ser Gln Ser Ile Ile
1070 1075 1080
Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser Val Ala Tyr Ser
1085 1090 1095
Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser Val Thr
1100 1105 1110
Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys
1115 1120 1125
Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu
1130 1135 1140
Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
1145 1150 1155
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala
1160 1165 1170
Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
1175 1180 1185
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser
1190 1195 1200
Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu
1205 1210 1215
Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp
1220 1225 1230
Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu
1235 1240 1245
Thr Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr
1250 1255 1260
Thr Ser Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe
1265 1270 1275
Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala
1280 1285 1290
Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu
1295 1300 1305
Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys
1310 1315 1320
Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu
1325 1330 1335
Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val
1340 1345 1350
Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
1355 1360 1365
Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile
1370 1375 1380
Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
1385 1390 1395
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn
1400 1405 1410
Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys
1415 1420 1425
Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro
1430 1435 1440
Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val
1445 1450 1455
Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His
1460 1465 1470
Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn
1475 1480 1485
Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln
1490 1495 1500
Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val
1505 1510 1515
Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro
1520 1525 1530
Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn
1535 1540 1545
His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn
1550 1555 1560
Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu
1565 1570 1575
Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu
1580 1585 1590
Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg Gly
1595 1600 1605
Ser Gly Ser Pro Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp
1610 1615 1620
Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser
1625 1630 1635
Thr Phe Leu Gly His His His His His His His His His His
1640 1645 1650
<210> 16
<211> 1652
<212> PRT
<213> Artificial Sequence
<220>
<223> S-CRM389(C端融合)氨基酸序列
<400> 16
Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn
1 5 10 15
Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser
20 25 30
Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val
35 40 45
Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg
50 55 60
Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser
65 70 75 80
Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu
85 90 95
Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val
100 105 110
Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly
115 120 125
Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg
130 135 140
Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro
145 150 155 160
Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg
165 170 175
Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys
180 185 190
His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala
195 200 205
Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe
210 215 220
Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser
225 230 235 240
Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu
245 250 255
Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr
260 265 270
Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr
275 280 285
Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe
290 295 300
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
305 310 315 320
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
325 330 335
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
340 345 350
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
355 360 365
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
370 375 380
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
385 390 395 400
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
405 410 415
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
420 425 430
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
435 440 445
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
450 455 460
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
465 470 475 480
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
530 535 540
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
545 550 555 560
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
580 585 590
Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
610 615 620
Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser
660 665 670
Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
675 680 685
Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
690 695 700
Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
705 710 715 720
Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
725 730 735
Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
740 745 750
Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
755 760 765
Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
770 775 780
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
785 790 795 800
Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
805 810 815
Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
820 825 830
Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
835 840 845
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
850 855 860
Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
865 870 875 880
Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
885 890 895
Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
900 905 910
Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala
915 920 925
Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala
930 935 940
Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser
945 950 955 960
Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu
965 970 975
Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr
980 985 990
Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala
995 1000 1005
Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser
1010 1015 1020
Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe
1025 1030 1035
Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
1040 1045 1050
Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys
1055 1060 1065
His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser
1070 1075 1080
Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro
1085 1090 1095
Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp
1100 1105 1110
Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
1115 1120 1125
Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys
1130 1135 1140
Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile
1145 1150 1155
Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1160 1165 1170
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1175 1180 1185
Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg
1190 1195 1200
Gly Ser Gly Ser Pro Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg
1205 1210 1215
Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu
1220 1225 1230
Ser Thr Phe Leu Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1235 1240 1245
Gly Gly Gly Gly Ser Gly Ala Asp Asp Val Val Asp Ser Ser Lys
1250 1255 1260
Ser Phe Val Met Glu Asn Phe Ser Ser Tyr His Gly Thr Lys Pro
1265 1270 1275
Gly Tyr Val Asp Ser Ile Gln Lys Gly Ile Gln Lys Pro Lys Ser
1280 1285 1290
Gly Thr Gln Gly Asn Tyr Asp Asp Asp Trp Lys Glu Phe Tyr Ser
1295 1300 1305
Thr Asp Asn Lys Tyr Asp Ala Ala Gly Tyr Ser Val Asp Asn Glu
1310 1315 1320
Asn Pro Leu Ser Gly Lys Ala Gly Gly Val Val Lys Val Thr Tyr
1325 1330 1335
Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val Asp Asn Ala Glu
1340 1345 1350
Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu Pro Leu Met
1355 1360 1365
Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly Asp Gly
1370 1375 1380
Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser Ser
1385 1390 1395
Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
1400 1405 1410
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln
1415 1420 1425
Asp Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg
1430 1435 1440
Val Arg Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp
1445 1450 1455
Trp Asp Val Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu
1460 1465 1470
Lys Glu His Gly Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn
1475 1480 1485
Lys Thr Val Ser Glu Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe
1490 1495 1500
His Gln Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu Lys Thr
1505 1510 1515
Val Thr Gly Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala Ala
1520 1525 1530
Trp Ala Val Asn Val Ala Gln Val Ile Asp Ser Glu Thr Ala Asp
1535 1540 1545
Asn Leu Glu Lys Thr Thr Ala Ala Leu Ser Ile Leu Pro Gly Ile
1550 1555 1560
Gly Ser Val Met Gly Ile Ala Asp Gly Ala Val His His Asn Thr
1565 1570 1575
Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser Ser Leu Met Val
1580 1585 1590
Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp Ile Gly Phe
1595 1600 1605
Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe Gln Val
1610 1615 1620
Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His Lys
1625 1630 1635
Thr Gln Pro Phe His His His His His His His His His His
1640 1645 1650
<210> 17
<211> 1798
<212> PRT
<213> Artificial Sequence
<220>
<223> CRM197-S(N端融合)氨基酸序列
<400> 17
Gly Ala Asp Asp Val Val Asp Ser Ser Lys Ser Phe Val Met Glu Asn
1 5 10 15
Phe Ser Ser Tyr His Gly Thr Lys Pro Gly Tyr Val Asp Ser Ile Gln
20 25 30
Lys Gly Ile Gln Lys Pro Lys Ser Gly Thr Gln Gly Asn Tyr Asp Asp
35 40 45
Asp Trp Lys Glu Phe Tyr Ser Thr Asp Asn Lys Tyr Asp Ala Ala Gly
50 55 60
Tyr Ser Val Asp Asn Glu Asn Pro Leu Ser Gly Lys Ala Gly Gly Val
65 70 75 80
Val Lys Val Thr Tyr Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val
85 90 95
Asp Asn Ala Glu Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu
100 105 110
Pro Leu Met Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly
115 120 125
Asp Gly Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser
130 135 140
Ser Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
145 150 155 160
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln Asp
165 170 175
Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg Val Arg
180 185 190
Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp Trp Asp Val
195 200 205
Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu Lys Glu His Gly
210 215 220
Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn Lys Thr Val Ser Glu
225 230 235 240
Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe His Gln Thr Ala Leu Glu
245 250 255
His Pro Glu Leu Ser Glu Leu Lys Thr Val Thr Gly Thr Asn Pro Val
260 265 270
Phe Ala Gly Ala Asn Tyr Ala Ala Trp Ala Val Asn Val Ala Gln Val
275 280 285
Ile Asp Ser Glu Thr Ala Asp Asn Leu Glu Lys Thr Thr Ala Ala Leu
290 295 300
Ser Ile Leu Pro Gly Ile Gly Ser Val Met Gly Ile Ala Asp Gly Ala
305 310 315 320
Val His His Asn Thr Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser
325 330 335
Ser Leu Met Val Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp
340 345 350
Ile Gly Phe Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe
355 360 365
Gln Val Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His
370 375 380
Lys Thr Gln Pro Phe Leu His Asp Gly Tyr Ala Val Ser Trp Asn Thr
385 390 395 400
Val Glu Asp Ser Ile Ile Arg Thr Gly Phe Gln Gly Glu Ser Gly His
405 410 415
Asp Ile Lys Ile Thr Ala Glu Asn Thr Pro Leu Pro Ile Ala Gly Val
420 425 430
Leu Leu Pro Thr Ile Pro Gly Lys Leu Asp Val Asn Lys Ser Lys Thr
435 440 445
His Ile Ser Val Asn Gly Arg Lys Ile Arg Met Arg Cys Arg Ala Ile
450 455 460
Asp Gly Asp Val Thr Phe Cys Arg Pro Lys Ser Pro Val Tyr Val Gly
465 470 475 480
Asn Gly Val His Ala Asn Leu His Val Ala Phe His Arg Ser Ser Ser
485 490 495
Glu Lys Ile His Ser Asn Glu Ile Ser Ser Asp Ser Ile Gly Val Leu
500 505 510
Gly Tyr Gln Lys Thr Val Asp His Thr Lys Val Asn Ser Lys Leu Ser
515 520 525
Leu Phe Phe Glu Ile Lys Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
530 535 540
Ser Gly Gly Gly Gly Ser Cys Val Asn Leu Thr Thr Arg Thr Gln Leu
545 550 555 560
Pro Pro Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp
565 570 575
Lys Val Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu
580 585 590
Pro Phe Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly
595 600 605
Thr Asn Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp
610 615 620
Gly Val Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp
625 630 635 640
Ile Phe Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val
645 650 655
Asn Asn Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys
660 665 670
Asn Asp Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp
675 680 685
Met Glu Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe
690 695 700
Glu Tyr Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly
705 710 715 720
Asn Phe Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr
725 730 735
Phe Lys Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu
740 745 750
Pro Gln Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly
755 760 765
Ile Asn Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr
770 775 780
Leu Thr Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala
785 790 795 800
Tyr Tyr Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn
805 810 815
Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu
820 825 830
Ser Glu Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile
835 840 845
Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg
850 855 860
Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala
865 870 875 880
Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn
885 890 895
Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr
900 905 910
Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe
915 920 925
Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg
930 935 940
Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys
945 950 955 960
Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn
965 970 975
Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe
980 985 990
Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile
995 1000 1005
Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn
1010 1015 1020
Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly
1025 1030 1035
Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu
1040 1045 1050
Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn
1055 1060 1065
Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr
1070 1075 1080
Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe
1085 1090 1095
Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg
1100 1105 1110
Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
1115 1120 1125
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln
1130 1135 1140
Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
1145 1150 1155
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser
1160 1165 1170
Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly
1175 1180 1185
Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly
1190 1195 1200
Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg
1205 1210 1215
Arg Ala Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met
1220 1225 1230
Ser Leu Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile
1235 1240 1245
Ala Ile Pro Thr Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu
1250 1255 1260
Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile
1265 1270 1275
Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu Leu Gln Tyr Gly
1280 1285 1290
Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly Ile Ala Val
1295 1300 1305
Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val Lys Gln
1310 1315 1320
Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn Phe
1325 1330 1335
Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe
1340 1345 1350
Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
1355 1360 1365
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg
1370 1375 1380
Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
1385 1390 1395
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu
1400 1405 1410
Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala
1415 1420 1425
Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn
1430 1435 1440
Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu
1445 1450 1455
Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser
1460 1465 1470
Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val
1475 1480 1485
Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser
1490 1495 1500
Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser
1505 1510 1515
Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile
1520 1525 1530
Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu
1535 1540 1545
Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr
1550 1555 1560
Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe
1565 1570 1575
Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro
1580 1585 1590
His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu
1595 1600 1605
Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala
1610 1615 1620
His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp
1625 1630 1635
Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr
1640 1645 1650
Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile
1655 1660 1665
Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser
1670 1675 1680
Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro
1685 1690 1695
Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val
1700 1705 1710
Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn
1715 1720 1725
Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu
1730 1735 1740
Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg Gly Ser Gly Ser Pro
1745 1750 1755
Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr
1760 1765 1770
Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly
1775 1780 1785
His His His His His His His His His His
1790 1795
<210> 18
<211> 1798
<212> PRT
<213> Artificial Sequence
<220>
<223> S-CRM197(C端融合)氨基酸序列
<400> 18
Cys Val Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn
1 5 10 15
Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser
20 25 30
Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val
35 40 45
Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg
50 55 60
Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser
65 70 75 80
Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu
85 90 95
Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val
100 105 110
Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly
115 120 125
Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg
130 135 140
Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro
145 150 155 160
Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg
165 170 175
Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys
180 185 190
His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala
195 200 205
Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe
210 215 220
Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser
225 230 235 240
Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu
245 250 255
Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr
260 265 270
Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr
275 280 285
Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe
290 295 300
Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn
305 310 315 320
Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val
325 330 335
Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser
340 345 350
Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val
355 360 365
Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp
370 375 380
Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln
385 390 395 400
Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr
405 410 415
Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly
420 425 430
Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys
435 440 445
Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr
450 455 460
Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser
465 470 475 480
Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val
485 490 495
Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly
500 505 510
Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn
515 520 525
Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys
530 535 540
Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp
545 550 555 560
Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys
565 570 575
Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
580 585 590
Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
595 600 605
Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
610 615 620
Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640
His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655
Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser
660 665 670
Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
675 680 685
Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
690 695 700
Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
705 710 715 720
Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
725 730 735
Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
740 745 750
Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
755 760 765
Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
770 775 780
Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
785 790 795 800
Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
805 810 815
Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
820 825 830
Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
835 840 845
Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
850 855 860
Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
865 870 875 880
Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
885 890 895
Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
900 905 910
Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala
915 920 925
Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala
930 935 940
Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser
945 950 955 960
Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu
965 970 975
Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr
980 985 990
Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala
995 1000 1005
Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser
1010 1015 1020
Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe
1025 1030 1035
Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
1040 1045 1050
Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys
1055 1060 1065
His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser
1070 1075 1080
Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro
1085 1090 1095
Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp
1100 1105 1110
Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
1115 1120 1125
Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys
1130 1135 1140
Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile
1145 1150 1155
Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn
1160 1165 1170
Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1175 1180 1185
Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Leu Val Pro Arg
1190 1195 1200
Gly Ser Gly Ser Pro Gly Ser Gly Tyr Ile Pro Glu Ala Pro Arg
1205 1210 1215
Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu
1220 1225 1230
Ser Thr Phe Leu Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1235 1240 1245
Gly Gly Gly Gly Ser Gly Ala Asp Asp Val Val Asp Ser Ser Lys
1250 1255 1260
Ser Phe Val Met Glu Asn Phe Ser Ser Tyr His Gly Thr Lys Pro
1265 1270 1275
Gly Tyr Val Asp Ser Ile Gln Lys Gly Ile Gln Lys Pro Lys Ser
1280 1285 1290
Gly Thr Gln Gly Asn Tyr Asp Asp Asp Trp Lys Glu Phe Tyr Ser
1295 1300 1305
Thr Asp Asn Lys Tyr Asp Ala Ala Gly Tyr Ser Val Asp Asn Glu
1310 1315 1320
Asn Pro Leu Ser Gly Lys Ala Gly Gly Val Val Lys Val Thr Tyr
1325 1330 1335
Pro Gly Leu Thr Lys Val Leu Ala Leu Lys Val Asp Asn Ala Glu
1340 1345 1350
Thr Ile Lys Lys Glu Leu Gly Leu Ser Leu Thr Glu Pro Leu Met
1355 1360 1365
Glu Gln Val Gly Thr Glu Glu Phe Ile Lys Arg Phe Gly Asp Gly
1370 1375 1380
Ala Ser Arg Val Val Leu Ser Leu Pro Phe Ala Glu Gly Ser Ser
1385 1390 1395
Ser Val Glu Tyr Ile Asn Asn Trp Glu Gln Ala Lys Ala Leu Ser
1400 1405 1410
Val Glu Leu Glu Ile Asn Phe Glu Thr Arg Gly Lys Arg Gly Gln
1415 1420 1425
Asp Ala Met Tyr Glu Tyr Met Ala Gln Ala Cys Ala Gly Asn Arg
1430 1435 1440
Val Arg Arg Ser Val Gly Ser Ser Leu Ser Cys Ile Asn Leu Asp
1445 1450 1455
Trp Asp Val Ile Arg Asp Lys Thr Lys Thr Lys Ile Glu Ser Leu
1460 1465 1470
Lys Glu His Gly Pro Ile Lys Asn Lys Met Ser Glu Ser Pro Asn
1475 1480 1485
Lys Thr Val Ser Glu Glu Lys Ala Lys Gln Tyr Leu Glu Glu Phe
1490 1495 1500
His Gln Thr Ala Leu Glu His Pro Glu Leu Ser Glu Leu Lys Thr
1505 1510 1515
Val Thr Gly Thr Asn Pro Val Phe Ala Gly Ala Asn Tyr Ala Ala
1520 1525 1530
Trp Ala Val Asn Val Ala Gln Val Ile Asp Ser Glu Thr Ala Asp
1535 1540 1545
Asn Leu Glu Lys Thr Thr Ala Ala Leu Ser Ile Leu Pro Gly Ile
1550 1555 1560
Gly Ser Val Met Gly Ile Ala Asp Gly Ala Val His His Asn Thr
1565 1570 1575
Glu Glu Ile Val Ala Gln Ser Ile Ala Leu Ser Ser Leu Met Val
1580 1585 1590
Ala Gln Ala Ile Pro Leu Val Gly Glu Leu Val Asp Ile Gly Phe
1595 1600 1605
Ala Ala Tyr Asn Phe Val Glu Ser Ile Ile Asn Leu Phe Gln Val
1610 1615 1620
Val His Asn Ser Tyr Asn Arg Pro Ala Tyr Ser Pro Gly His Lys
1625 1630 1635
Thr Gln Pro Phe Leu His Asp Gly Tyr Ala Val Ser Trp Asn Thr
1640 1645 1650
Val Glu Asp Ser Ile Ile Arg Thr Gly Phe Gln Gly Glu Ser Gly
1655 1660 1665
His Asp Ile Lys Ile Thr Ala Glu Asn Thr Pro Leu Pro Ile Ala
1670 1675 1680
Gly Val Leu Leu Pro Thr Ile Pro Gly Lys Leu Asp Val Asn Lys
1685 1690 1695
Ser Lys Thr His Ile Ser Val Asn Gly Arg Lys Ile Arg Met Arg
1700 1705 1710
Cys Arg Ala Ile Asp Gly Asp Val Thr Phe Cys Arg Pro Lys Ser
1715 1720 1725
Pro Val Tyr Val Gly Asn Gly Val His Ala Asn Leu His Val Ala
1730 1735 1740
Phe His Arg Ser Ser Ser Glu Lys Ile His Ser Asn Glu Ile Ser
1745 1750 1755
Ser Asp Ser Ile Gly Val Leu Gly Tyr Gln Lys Thr Val Asp His
1760 1765 1770
Thr Lys Val Asn Ser Lys Leu Ser Leu Phe Phe Glu Ile Lys Ser
1775 1780 1785
His His His His His His His His His His
1790 1795
Claims (58)
1.融合蛋白,其从N端至C端包含严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的S蛋白或其变体序列,蛋白酶识别位点序列以及三聚化结构域序列;
其中,所述三聚化结构域序列如SEQ ID NO:3所示,所述蛋白酶识别位点序列如SEQ IDNO:4或5所示;
所述S蛋白变体在其S1结构域和S2结构域之间不包含弗林蛋白酶识别位点,并且与野生型S蛋白相比,所述S蛋白变体在相应于SEQ ID NO:1所示野生型S蛋白的位置668的位置上的氨基酸置换为A,在相应于SEQ ID NO:1所示野生型S蛋白的位置669的位置上的氨基酸置换为G,和在相应于SEQ ID NO:1所示野生型S蛋白的位置671的位置上的氨基酸置换为G;并且
与野生型S蛋白相比,所述S蛋白变体还在相应于SEQ ID NO:1所示野生型S蛋白的位置972的位置上的氨基酸置换为P,和在相应于SEQ ID NO:1所示野生型S蛋白的位置973的位置上的氨基酸置换为P。
2.权利要求1所述的融合蛋白,其中,所述S蛋白的氨基酸序列如SEQ ID NO:1所示。
3.权利要求1所述的融合蛋白,其中,所述S蛋白变体的氨基酸序列如SEQ ID NO:2所示。
4.权利要求1所述的融合蛋白,其中,所述融合蛋白包含选自下列的氨基酸序列:
(1)由SEQ ID NO:6所示序列的第1位至第1238位氨基酸残基构成的氨基酸序列;
(2)由SEQ ID NO:8所示序列的第1位至第1238位氨基酸残基构成的氨基酸序列。
5.权利要求1所述的融合蛋白,其中,所述融合蛋白进一步包含载体蛋白序列。
6.权利要求5所述的融合蛋白,其中,所述载体蛋白序列位于所述S蛋白或其变体序列的N端,或者所述三聚化结构域序列的C端。
7.权利要求6所述的融合蛋白,其中,所述载体蛋白选自CRMA、CRM389和CRM197。
8.权利要求7所述的融合蛋白,其中,所述载体蛋白选自SEQ ID NO:10所示的CRMA、SEQID NO:11所示的CRM389和SEQ ID NO:12所示的CRM197。
9.权利要求5-8任一项所述的融合蛋白,其中,所述载体蛋白序列与其相邻结构域之间任选地通过肽接头连接。
10.权利要求9所述的融合蛋白,其中,所述肽接头是(GmS)n,其中m选自1-4的整数,n选自1-3的整数。
11.权利要求10所述的融合蛋白,其中,m选自1,2,3和4,n选自1,2和3。
12.权利要求5所述的融合蛋白,其中,所述融合蛋白包含选自下列的氨基酸序列:
(1)由SEQ ID NO:13所示序列的第1位至第1443位氨基酸残基构成的氨基酸序列;
(2)由SEQ ID NO:14所示序列的第1位至第1443位氨基酸残基构成的氨基酸序列;
(3)由SEQ ID NO:15所示序列的第1位至第1642位氨基酸残基构成的氨基酸序列;
(4)由SEQ ID NO:16所示序列的第1位至第1642位氨基酸残基构成的氨基酸序列;
(5)由SEQ ID NO:17所示序列的第1位至第1788位氨基酸残基构成的氨基酸序列;
(6)由SEQ ID NO:18所示序列的第1位至第1788位氨基酸残基构成的氨基酸序列。
13.权利要求1-8、10-12任一项所述的融合蛋白,其中,所述融合蛋白包含标签序列。
14.权利要求13所述的融合蛋白,其中,所述标签序列是纯化标签。
15.权利要求14所述的融合蛋白,其中,所述标签序列是多组氨酸标签、myc标签或HA标签。
16.权利要求13所述的融合蛋白,其中,所述标签序列位于所述融合蛋白的N端或C端。
17.权利要求13所述的融合蛋白,其中,所述标签序列与其相邻结构域之间任选地通过肽接头或酶切位点序列连接。
18.权利要求13所述的融合蛋白,其中,所述融合蛋白的序列如SEQ ID NOs:6、8、13-18任一项所示。
19.多聚体,其包含权利要求1-18任一项所述的融合蛋白。
20.权利要求19所述的多聚体,其中,所述多聚体是同源多聚体。
21.权利要求19所述的多聚体,其中,所述多聚体是三聚体。
22.权利要求21所述的多聚体,其中,所述多聚体是由相同的所述融合蛋白所形成的三聚体。
23.分离的核酸分子,其包含编码权利要求1-18任一项所述的融合蛋白的核苷酸序列。
24.载体,其包含权利要求23所述的分离的核酸分子。
25.权利要求24所述的载体,其中,所述载体是杆状病毒表达载体。
26.宿主细胞,其包含权利要求23所述的分离的核酸分子或权利要求24或25所述的载体。
27.权利要求26所述的宿主细胞,其中,所述宿主细胞是昆虫细胞。
28.制备权利要求1-18任一项所述的融合蛋白或权利要求19所述的多聚体的方法,其包括,培养权利要求26的宿主细胞,和从细胞培养物中回收所述融合蛋白。
29.权利要求28所述的方法,其中,所述融合蛋白以三聚体形式存在。
30.权利要求28所述的方法,其中,所述宿主细胞是昆虫细胞。
31.制备严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的S蛋白三聚体的方法,其包括:在杆状病毒昆虫表达系统中表达权利要求1-18任一项所述的融合蛋白,从而产生所述S蛋白三聚体。
32.权利要求31所述的方法,其中,所述杆状病毒昆虫表达系统中的昆虫宿主细胞选自sf9或sf21。
33.权利要求31所述的方法,其中,所述杆状病毒昆虫表达系统中的杆状病毒表达载体选自pAcgp67 A,pAcgp67 B,pAcgp67 C,pFastbac,pIEx/bac。
34.权利要求31所述的方法,其中,所述方法包括以下步骤:
(1)将包含编码权利要求1-18任一项所述的融合蛋白的核苷酸序列的杆状病毒表达载体引入昆虫宿主细胞中;
(2)培养所述昆虫宿主细胞;
(3)从细胞培养物中回收S蛋白三聚体。
35.权利要求31所述的方法,其中,所述方法还包括:对所产生的S蛋白三聚体进行纯化。
36.权利要求35所述的方法,其中,通过亲和层析和/或分子筛对S蛋白三聚体进行纯化。
37.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的S蛋白三聚体,其具备早期融合前构象,所述早期融合前构象包含以下特征中的1项、2项或3项:
(1)NTD结构域质量中心与S2结构域质量中心的连线,和RBD结构域质量中心与S2结构域质量中心的连线的夹角为约81.4°;
(2)NTD结构域质量中心与S2结构域质量中心的垂直距离为约
(3)RBD结构域质量中心与S2结构域质量中心的垂直距离为约
所述S蛋白三聚体是由权利要求1-18任一项所述的融合蛋白所形成的三聚体。
38.权利要求37所述的S蛋白三聚体,其中,所述三聚体是由相同的所述融合蛋白形成的同源三聚体。
39.权利要求37所述的S蛋白三聚体,其中,所述S蛋白三聚体通过权利要求31-36任一项所述的方法制备。
40.药物组合物,其包含权利要求1-18任一项所述的融合蛋白、权利要求19-22任一项所述的多聚体、权利要求23所述的分离的核酸分子、权利要求24或25所述的载体、权利要求26或27所述的宿主细胞、或权利要求37-39任一项所述的S蛋白三聚体,以及药学上可接受的载体和/或赋形剂。
41.权利要求40所述的药物组合物,其中,所述药物组合物是免疫原性组合物或疫苗。
42.权利要求40所述的药物组合物,其中,所述药学上可接受的载体和/或赋形剂是佐剂。
43.权利要求1-18任一项所述的融合蛋白、权利要求19-22任一项所述的多聚体、权利要求23所述的分离的核酸分子、权利要求24或25所述的载体、权利要求26或27所述的宿主细胞、权利要求37-39任一项所述的S蛋白三聚体、或权利要求40-42任一项所述的药物组合物在制备用于在受试者中诱导针对SARS-CoV-2的特异性免疫应答和/或用于在受试者中预防和/或治疗SARS-CoV-2感染和/或用于在受试者中预防和/或治疗由SARS-CoV-2感染所致的疾病COVID-19的药剂中的应用。
44.权利要求43所述的应用,其中,所述药剂为疫苗。
45.权利要求43所述的应用,其中,所述受试者是人。
46.试剂盒,其包含权利要求1-18任一项所述的融合蛋白、权利要求19-22任一项所述的多聚体、权利要求23所述的分离的核酸分子、权利要求24或25所述的载体、权利要求26或27所述的宿主细胞、或权利要求37-39任一项所述的S蛋白三聚体。
47.权利要求46所述的试剂盒,其中,所述试剂盒还包含带有可检测的标记的二级抗体。
48.权利要求47所述的试剂盒,其中,所述可检测的标记选自酶、化学发光试剂、荧光染料或生物素。
49.权利要求48所述的试剂盒,其中,所述酶为辣根过氧化物酶或碱性磷酸酶,所述化学发光试剂为吖啶酯类化合物。
50.权利要求47所述的试剂盒,其中,所述二级抗体对待测抗体所来自的物种的抗体是特异的。
51.权利要求50所述的试剂盒,其中,所述物种为人。
52.权利要求47所述的试剂盒,其中,所述二级抗体是抗-免疫球蛋白抗体。
53.权利要求52所述的试剂盒,其中,所述抗-免疫球蛋白抗体选自抗IgG抗体、抗IgM抗体和抗IgA抗体。
54.权利要求46所述的试剂盒,其中所述融合蛋白、多聚体、或S蛋白三聚体连接于固相载体表面。
55.权利要求1-18任一项所述的融合蛋白、权利要求19-22任一项所述的多聚体、权利要求23所述的分离的核酸分子、权利要求24或25所述的载体、权利要求26或27所述的宿主细胞、或权利要求37-39任一项所述的S蛋白三聚体在制备用于测定来自受试者的样品中的对SARS-CoV-2特异性的抗体的存在或其水平和/或用于确定受试者是否患有SARS-CoV-2感染的试剂盒中的用途。
56.权利要求55所述的用途,其中,所述样品是血液样品。
57.权利要求56所述的用途,其中所述样品为全血、血浆或血清。
58.权利要求55所述的用途,其中,所述受试者是人。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010171146 | 2020-03-12 | ||
CN2020101711468 | 2020-03-12 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113388041A CN113388041A (zh) | 2021-09-14 |
CN113388041B true CN113388041B (zh) | 2024-02-06 |
Family
ID=77617502
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110268273.4A Active CN113388041B (zh) | 2020-03-12 | 2021-03-12 | 具有融合前早期构象的SARS-CoV-2 S三聚体蛋白及其应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113388041B (zh) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113150085B (zh) * | 2021-04-27 | 2023-04-18 | 成都威斯克生物医药有限公司 | 抗SARS-CoV-2感染的组合物 |
CN113801209B (zh) * | 2021-11-16 | 2022-02-11 | 浙江普康生物技术股份有限公司 | 一种具有广谱中和活性的新型冠状病毒重组蛋白及其制备方法 |
CN118076646A (zh) * | 2022-02-25 | 2024-05-24 | 神州细胞工程有限公司 | 一种可诱导广谱中和活性重组多组分新冠病毒三聚体蛋白疫苗的制备及应用 |
CN116731192A (zh) * | 2022-03-01 | 2023-09-12 | 上海泽润生物科技有限公司 | 重组刺突蛋白及其制备方法和用途 |
CN116836293A (zh) * | 2022-03-24 | 2023-10-03 | 四川三叶草生物制药有限公司 | 冠状病毒疫苗组合物、方法及其使用 |
WO2023179513A1 (zh) * | 2022-03-24 | 2023-09-28 | 四川三叶草生物制药有限公司 | 冠状病毒疫苗组合物、方法及其使用 |
CN115073565B (zh) * | 2022-06-13 | 2023-02-21 | 华素生物科技(北京)有限公司 | 重组新型冠状病毒s蛋白三聚体及其制备方法与应用 |
CN118119646A (zh) * | 2022-08-08 | 2024-05-31 | 神州细胞工程有限公司 | 一种可诱导广谱中和活性重组五组分新冠病毒三聚体蛋白疫苗的制备及应用 |
CN116063411A (zh) * | 2022-09-16 | 2023-05-05 | 广东珩达生物医药科技有限公司 | 新型冠状病毒抗原多肽及其重组腺相关病毒和制备疫苗的应用 |
CN115819523B (zh) * | 2022-10-05 | 2023-12-05 | 复旦大学 | 靶向新冠病毒s蛋白受体结合域的三价蛋白的优化设计方法 |
CN116162173B (zh) * | 2022-10-28 | 2024-06-04 | 安徽农业大学 | 一种GnRH6-CRM197重组蛋白去势疫苗及制备方法 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000041678A (ja) * | 1998-07-28 | 2000-02-15 | Denki Kagaku Kogyo Kk | 可溶性融合蛋白質とそれをコードする遺伝子、並びにその融合蛋白質の製造方法、及びその利用 |
CN1603346A (zh) * | 2003-09-29 | 2005-04-06 | 厦门大学 | 蛋白质,含有其的产品及其在诊断、治疗和/或预防sars相关疾病中的用途 |
CN1749277A (zh) * | 2004-09-14 | 2006-03-22 | 北京迪威华宇生物技术有限公司 | 热休克蛋白65-人sars冠状病毒表位抗原重组融合蛋白(hsp65-sars/3cl161-264) |
CN101022827A (zh) * | 2004-06-30 | 2007-08-22 | 魁北克益得生物医学公司 | 用于治疗冠状病毒感染的疫苗组合物 |
EP2072527A1 (en) * | 2007-12-21 | 2009-06-24 | Altonabiotec AG | Fusion polypeptides comprising a SHBG dimerization component and uses thereof |
WO2018074884A1 (ko) * | 2016-10-20 | 2018-04-26 | 연세대학교 원주산학협력단 | 재조합 단백질 및 그의 용도 |
-
2021
- 2021-03-12 CN CN202110268273.4A patent/CN113388041B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000041678A (ja) * | 1998-07-28 | 2000-02-15 | Denki Kagaku Kogyo Kk | 可溶性融合蛋白質とそれをコードする遺伝子、並びにその融合蛋白質の製造方法、及びその利用 |
CN1603346A (zh) * | 2003-09-29 | 2005-04-06 | 厦门大学 | 蛋白质,含有其的产品及其在诊断、治疗和/或预防sars相关疾病中的用途 |
CN101022827A (zh) * | 2004-06-30 | 2007-08-22 | 魁北克益得生物医学公司 | 用于治疗冠状病毒感染的疫苗组合物 |
CN1749277A (zh) * | 2004-09-14 | 2006-03-22 | 北京迪威华宇生物技术有限公司 | 热休克蛋白65-人sars冠状病毒表位抗原重组融合蛋白(hsp65-sars/3cl161-264) |
EP2072527A1 (en) * | 2007-12-21 | 2009-06-24 | Altonabiotec AG | Fusion polypeptides comprising a SHBG dimerization component and uses thereof |
WO2018074884A1 (ko) * | 2016-10-20 | 2018-04-26 | 연세대학교 원주산학협력단 | 재조합 단백질 및 그의 용도 |
Also Published As
Publication number | Publication date |
---|---|
CN113388041A (zh) | 2021-09-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113388041B (zh) | 具有融合前早期构象的SARS-CoV-2 S三聚体蛋白及其应用 | |
CN111991556B (zh) | SARS-CoV-2 RBD共轭纳米颗粒疫苗 | |
US11969466B2 (en) | Stabilized influenza hemagglutinin stem region trimers and uses thereof | |
CN105175537B (zh) | 抗hpv l1蛋白的广谱单克隆抗体或其抗原结合片段及它们的用途 | |
CN112707968B (zh) | 一种用于检测新冠病毒中和抗体的重组的受体结合蛋白、重组的受体蛋白 | |
CN105669838B (zh) | 来自水痘-带状疱疹病毒gE蛋白的中和表位及针对其的抗体 | |
KR101675185B1 (ko) | 디자이너(designer) 펩티드-기반 pcv2 백신 | |
JP2016527253A (ja) | 立体構造的に安定化されたrsv融合前fタンパク質 | |
CN112724209B (zh) | 可形成纳米颗粒的冠状病毒重组蛋白及其载体和应用 | |
JP2002532387A (ja) | ペプチドリガンドを介して結合された複数の免疫原性成分を有するhbvコア抗原粒子 | |
IL265121B1 (en) | Stable group 2 trimers from the stem region of influenza hemagglutinin and their uses | |
CN105669859B (zh) | 人乳头瘤病毒18型单克隆抗体及其应用 | |
CN117304280B (zh) | 一种重组rsv f蛋白及其应用 | |
JP5187883B2 (ja) | 抗原ペプチドおよびその利用 | |
CN114276422A (zh) | 新型冠状病毒s蛋白多肽抗原及其应用 | |
CN117320745A (zh) | Sars-cov-2亚单位疫苗 | |
DK1981904T3 (en) | Peptide domain from the enclosure of a HERV-W virus, which is required for reaction with a hASCT receptor | |
CN111094323A (zh) | 修饰的巨细胞病毒蛋白和稳定的复合物 | |
JP2002020399A (ja) | ノルウォークウイルス(nv)を認識するモノクローナル抗体 | |
RU2807742C1 (ru) | Мутантный белок f rsv и его применение | |
WO2024199469A1 (zh) | 具有稳定融合前构象的呼吸道合胞病毒f蛋白 | |
CN118772250A (zh) | 一种rsv f蛋白突变体及其制法与应用 | |
Manivel et al. | A self-associating hepatitis B surface antigen-derived peptide that is immunogenic in alum | |
JP2024511356A (ja) | ワクチン組成物及びその使用方法 | |
WO2020264011A1 (en) | Resurfaced dengue virus and ziki virus glycoprotein e diii variants and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |