CN113646328B - 一种免疫细胞因子及其制备与用途 - Google Patents
一种免疫细胞因子及其制备与用途 Download PDFInfo
- Publication number
- CN113646328B CN113646328B CN202080009757.0A CN202080009757A CN113646328B CN 113646328 B CN113646328 B CN 113646328B CN 202080009757 A CN202080009757 A CN 202080009757A CN 113646328 B CN113646328 B CN 113646328B
- Authority
- CN
- China
- Prior art keywords
- ser
- val
- thr
- leu
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 229940127130 immunocytokine Drugs 0.000 title claims abstract description 51
- 238000002360 preparation method Methods 0.000 title description 5
- 102000003812 Interleukin-15 Human genes 0.000 claims abstract description 103
- 108090000172 Interleukin-15 Proteins 0.000 claims abstract description 103
- 206010028980 Neoplasm Diseases 0.000 claims abstract description 40
- 101710160107 Outer membrane protein A Proteins 0.000 claims abstract description 23
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 111
- 239000000427 antigen Substances 0.000 claims description 37
- 102000036639 antigens Human genes 0.000 claims description 37
- 108091007433 antigens Proteins 0.000 claims description 37
- -1 HER 4) Proteins 0.000 claims description 34
- 150000007523 nucleic acids Chemical class 0.000 claims description 25
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 24
- 230000008685 targeting Effects 0.000 claims description 24
- 108010075254 C-Peptide Proteins 0.000 claims description 18
- 210000004027 cell Anatomy 0.000 claims description 17
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 claims description 14
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 claims description 14
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 claims description 14
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 claims description 14
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 claims description 14
- 108010074708 B7-H1 Antigen Proteins 0.000 claims description 10
- 102100040678 Programmed cell death protein 1 Human genes 0.000 claims description 8
- 101710089372 Programmed cell death protein 1 Proteins 0.000 claims description 8
- 102100022718 Atypical chemokine receptor 2 Human genes 0.000 claims description 7
- 101000678892 Homo sapiens Atypical chemokine receptor 2 Proteins 0.000 claims description 7
- 101000934338 Homo sapiens Myeloid cell surface antigen CD33 Proteins 0.000 claims description 7
- 102100025243 Myeloid cell surface antigen CD33 Human genes 0.000 claims description 7
- 102100024216 Programmed cell death 1 ligand 1 Human genes 0.000 claims description 7
- 108090000623 proteins and genes Proteins 0.000 claims description 7
- 238000002560 therapeutic procedure Methods 0.000 claims description 7
- 101100504181 Arabidopsis thaliana GCS1 gene Proteins 0.000 claims description 6
- 108091008794 FGF receptors Proteins 0.000 claims description 6
- 102000044168 Fibroblast Growth Factor Receptor Human genes 0.000 claims description 6
- 108010053099 Vascular Endothelial Growth Factor Receptor-2 Proteins 0.000 claims description 6
- 102000004169 proteins and genes Human genes 0.000 claims description 6
- 102100024222 B-lymphocyte antigen CD19 Human genes 0.000 claims description 5
- 102100036301 C-C chemokine receptor type 7 Human genes 0.000 claims description 5
- 101000980825 Homo sapiens B-lymphocyte antigen CD19 Proteins 0.000 claims description 5
- 101000914324 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 5 Proteins 0.000 claims description 5
- 102100033177 Vascular endothelial growth factor receptor 2 Human genes 0.000 claims description 5
- BGFTWECWAICPDG-UHFFFAOYSA-N 2-[bis(4-chlorophenyl)methyl]-4-n-[3-[bis(4-chlorophenyl)methyl]-4-(dimethylamino)phenyl]-1-n,1-n-dimethylbenzene-1,4-diamine Chemical compound C1=C(C(C=2C=CC(Cl)=CC=2)C=2C=CC(Cl)=CC=2)C(N(C)C)=CC=C1NC(C=1)=CC=C(N(C)C)C=1C(C=1C=CC(Cl)=CC=1)C1=CC=C(Cl)C=C1 BGFTWECWAICPDG-UHFFFAOYSA-N 0.000 claims description 4
- 108010068327 4-hydroxyphenylpyruvate dioxygenase Proteins 0.000 claims description 4
- 108010008014 B-Cell Maturation Antigen Proteins 0.000 claims description 4
- 102000006942 B-Cell Maturation Antigen Human genes 0.000 claims description 4
- 102100038080 B-cell receptor CD22 Human genes 0.000 claims description 4
- 102100021663 Baculoviral IAP repeat-containing protein 5 Human genes 0.000 claims description 4
- 102100031151 C-C chemokine receptor type 2 Human genes 0.000 claims description 4
- 101710149815 C-C chemokine receptor type 2 Proteins 0.000 claims description 4
- 101710149863 C-C chemokine receptor type 4 Proteins 0.000 claims description 4
- 102100028989 C-X-C chemokine receptor type 2 Human genes 0.000 claims description 4
- 102100028990 C-X-C chemokine receptor type 3 Human genes 0.000 claims description 4
- 102100031650 C-X-C chemokine receptor type 4 Human genes 0.000 claims description 4
- 102100031658 C-X-C chemokine receptor type 5 Human genes 0.000 claims description 4
- 102100026094 C-type lectin domain family 12 member A Human genes 0.000 claims description 4
- 101710188619 C-type lectin domain family 12 member A Proteins 0.000 claims description 4
- 102100024217 CAMPATH-1 antigen Human genes 0.000 claims description 4
- 101150013553 CD40 gene Proteins 0.000 claims description 4
- 108010065524 CD52 Antigen Proteins 0.000 claims description 4
- 102100025221 CD70 antigen Human genes 0.000 claims description 4
- 108090000835 CX3C Chemokine Receptor 1 Proteins 0.000 claims description 4
- 102100039196 CX3C chemokine receptor 1 Human genes 0.000 claims description 4
- 102100025570 Cancer/testis antigen 1 Human genes 0.000 claims description 4
- 101150084967 EPCAM gene Proteins 0.000 claims description 4
- 102100041003 Glutamate carboxypeptidase 2 Human genes 0.000 claims description 4
- 101000884305 Homo sapiens B-cell receptor CD22 Proteins 0.000 claims description 4
- 101000716065 Homo sapiens C-C chemokine receptor type 7 Proteins 0.000 claims description 4
- 101000716070 Homo sapiens C-C chemokine receptor type 9 Proteins 0.000 claims description 4
- 101000916050 Homo sapiens C-X-C chemokine receptor type 3 Proteins 0.000 claims description 4
- 101000922348 Homo sapiens C-X-C chemokine receptor type 4 Proteins 0.000 claims description 4
- 101000922405 Homo sapiens C-X-C chemokine receptor type 5 Proteins 0.000 claims description 4
- 101000934356 Homo sapiens CD70 antigen Proteins 0.000 claims description 4
- 101000856237 Homo sapiens Cancer/testis antigen 1 Proteins 0.000 claims description 4
- 101000914321 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 7 Proteins 0.000 claims description 4
- 101000892862 Homo sapiens Glutamate carboxypeptidase 2 Proteins 0.000 claims description 4
- 101001037256 Homo sapiens Indoleamine 2,3-dioxygenase 1 Proteins 0.000 claims description 4
- 101000868279 Homo sapiens Leukocyte surface antigen CD47 Proteins 0.000 claims description 4
- 101000623901 Homo sapiens Mucin-16 Proteins 0.000 claims description 4
- 101000617725 Homo sapiens Pregnancy-specific beta-1-glycoprotein 2 Proteins 0.000 claims description 4
- 101000914514 Homo sapiens T-cell-specific surface glycoprotein CD28 Proteins 0.000 claims description 4
- 101000914484 Homo sapiens T-lymphocyte activation antigen CD80 Proteins 0.000 claims description 4
- 101000669447 Homo sapiens Toll-like receptor 4 Proteins 0.000 claims description 4
- 101000801234 Homo sapiens Tumor necrosis factor receptor superfamily member 18 Proteins 0.000 claims description 4
- 101000851376 Homo sapiens Tumor necrosis factor receptor superfamily member 8 Proteins 0.000 claims description 4
- 102100040061 Indoleamine 2,3-dioxygenase 1 Human genes 0.000 claims description 4
- 108010018951 Interleukin-8B Receptors Proteins 0.000 claims description 4
- 102100032913 Leukocyte surface antigen CD47 Human genes 0.000 claims description 4
- 102100034216 Melanocyte-stimulating hormone receptor Human genes 0.000 claims description 4
- 102000003735 Mesothelin Human genes 0.000 claims description 4
- 108090000015 Mesothelin Proteins 0.000 claims description 4
- 102100023123 Mucin-16 Human genes 0.000 claims description 4
- 108010032605 Nerve Growth Factor Receptors Proteins 0.000 claims description 4
- 108091008606 PDGF receptors Proteins 0.000 claims description 4
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 claims description 4
- 108010089836 Proto-Oncogene Proteins c-met Proteins 0.000 claims description 4
- 102000004584 Somatomedin Receptors Human genes 0.000 claims description 4
- 108010017622 Somatomedin Receptors Proteins 0.000 claims description 4
- 108010002687 Survivin Proteins 0.000 claims description 4
- 102100027213 T-cell-specific surface glycoprotein CD28 Human genes 0.000 claims description 4
- 102100027222 T-lymphocyte activation antigen CD80 Human genes 0.000 claims description 4
- 108010060818 Toll-Like Receptor 9 Proteins 0.000 claims description 4
- 102000002689 Toll-like receptor Human genes 0.000 claims description 4
- 108020000411 Toll-like receptor Proteins 0.000 claims description 4
- 102100039360 Toll-like receptor 4 Human genes 0.000 claims description 4
- 102100033117 Toll-like receptor 9 Human genes 0.000 claims description 4
- 102100033728 Tumor necrosis factor receptor superfamily member 18 Human genes 0.000 claims description 4
- 102100040245 Tumor necrosis factor receptor superfamily member 5 Human genes 0.000 claims description 4
- 102100036857 Tumor necrosis factor receptor superfamily member 8 Human genes 0.000 claims description 4
- 102000039446 nucleic acids Human genes 0.000 claims description 4
- 108020004707 nucleic acids Proteins 0.000 claims description 4
- 229920001481 poly(stearyl methacrylate) Polymers 0.000 claims description 4
- 239000013598 vector Substances 0.000 claims description 4
- 102100022464 5'-nucleotidase Human genes 0.000 claims description 3
- 102100022005 B-lymphocyte antigen CD20 Human genes 0.000 claims description 3
- 101000840545 Bacillus thuringiensis L-isoleucine-4-hydroxylase Proteins 0.000 claims description 3
- 102100024167 C-C chemokine receptor type 3 Human genes 0.000 claims description 3
- 101710149862 C-C chemokine receptor type 3 Proteins 0.000 claims description 3
- 102100035875 C-C chemokine receptor type 5 Human genes 0.000 claims description 3
- 101710149870 C-C chemokine receptor type 5 Proteins 0.000 claims description 3
- 102100036305 C-C chemokine receptor type 8 Human genes 0.000 claims description 3
- 102100025074 C-C chemokine receptor-like 2 Human genes 0.000 claims description 3
- 102100021936 C-C motif chemokine 27 Human genes 0.000 claims description 3
- 102100021942 C-C motif chemokine 28 Human genes 0.000 claims description 3
- 102100038078 CD276 antigen Human genes 0.000 claims description 3
- 101710185679 CD276 antigen Proteins 0.000 claims description 3
- 108010021064 CTLA-4 Antigen Proteins 0.000 claims description 3
- 229940045513 CTLA4 antagonist Drugs 0.000 claims description 3
- 101100381481 Caenorhabditis elegans baz-2 gene Proteins 0.000 claims description 3
- 102100039498 Cytotoxic T-lymphocyte protein 4 Human genes 0.000 claims description 3
- 102100034458 Hepatitis A virus cellular receptor 2 Human genes 0.000 claims description 3
- 102100022537 Histone deacetylase 6 Human genes 0.000 claims description 3
- 101000678236 Homo sapiens 5'-nucleotidase Proteins 0.000 claims description 3
- 101000897405 Homo sapiens B-lymphocyte antigen CD20 Proteins 0.000 claims description 3
- 101000777558 Homo sapiens C-C chemokine receptor type 10 Proteins 0.000 claims description 3
- 101000716068 Homo sapiens C-C chemokine receptor type 6 Proteins 0.000 claims description 3
- 101000716063 Homo sapiens C-C chemokine receptor type 8 Proteins 0.000 claims description 3
- 101000897494 Homo sapiens C-C motif chemokine 27 Proteins 0.000 claims description 3
- 101000897477 Homo sapiens C-C motif chemokine 28 Proteins 0.000 claims description 3
- 101000899330 Homo sapiens Histone deacetylase 6 Proteins 0.000 claims description 3
- 101001055145 Homo sapiens Interleukin-2 receptor subunit beta Proteins 0.000 claims description 3
- 101000998120 Homo sapiens Interleukin-3 receptor subunit alpha Proteins 0.000 claims description 3
- 101000623900 Homo sapiens Mucin-13 Proteins 0.000 claims description 3
- 101000623905 Homo sapiens Mucin-15 Proteins 0.000 claims description 3
- 101000623904 Homo sapiens Mucin-17 Proteins 0.000 claims description 3
- 101001133059 Homo sapiens Mucin-19 Proteins 0.000 claims description 3
- 101001133081 Homo sapiens Mucin-2 Proteins 0.000 claims description 3
- 101000972284 Homo sapiens Mucin-3A Proteins 0.000 claims description 3
- 101000972282 Homo sapiens Mucin-5AC Proteins 0.000 claims description 3
- 101000972276 Homo sapiens Mucin-5B Proteins 0.000 claims description 3
- 101000972273 Homo sapiens Mucin-7 Proteins 0.000 claims description 3
- 101000851370 Homo sapiens Tumor necrosis factor receptor superfamily member 9 Proteins 0.000 claims description 3
- 102100026879 Interleukin-2 receptor subunit beta Human genes 0.000 claims description 3
- 102100033493 Interleukin-3 receptor subunit alpha Human genes 0.000 claims description 3
- 102100023124 Mucin-13 Human genes 0.000 claims description 3
- 102100023128 Mucin-15 Human genes 0.000 claims description 3
- 102100023125 Mucin-17 Human genes 0.000 claims description 3
- 102100034257 Mucin-19 Human genes 0.000 claims description 3
- 102100034263 Mucin-2 Human genes 0.000 claims description 3
- 102100022497 Mucin-3A Human genes 0.000 claims description 3
- 102100022496 Mucin-5AC Human genes 0.000 claims description 3
- 102100022494 Mucin-5B Human genes 0.000 claims description 3
- 102100022492 Mucin-7 Human genes 0.000 claims description 3
- 108010063954 Mucins Proteins 0.000 claims description 3
- 101100346932 Mus musculus Muc1 gene Proteins 0.000 claims description 3
- WWGBHDIHIVGYLZ-UHFFFAOYSA-N N-[4-[3-[[[7-(hydroxyamino)-7-oxoheptyl]amino]-oxomethyl]-5-isoxazolyl]phenyl]carbamic acid tert-butyl ester Chemical compound C1=CC(NC(=O)OC(C)(C)C)=CC=C1C1=CC(C(=O)NCCCCCCC(=O)NO)=NO1 WWGBHDIHIVGYLZ-UHFFFAOYSA-N 0.000 claims description 3
- 101100372762 Rattus norvegicus Flt1 gene Proteins 0.000 claims description 3
- 101710100969 Receptor tyrosine-protein kinase erbB-3 Proteins 0.000 claims description 3
- 102100029986 Receptor tyrosine-protein kinase erbB-3 Human genes 0.000 claims description 3
- 101001037255 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Indoleamine 2,3-dioxygenase Proteins 0.000 claims description 3
- 102100022153 Tumor necrosis factor receptor superfamily member 4 Human genes 0.000 claims description 3
- 101710165473 Tumor necrosis factor receptor superfamily member 4 Proteins 0.000 claims description 3
- 102100036856 Tumor necrosis factor receptor superfamily member 9 Human genes 0.000 claims description 3
- 108010079206 V-Set Domain-Containing T-Cell Activation Inhibitor 1 Proteins 0.000 claims description 3
- 102100038929 V-set domain-containing T-cell activation inhibitor 1 Human genes 0.000 claims description 3
- 108091008605 VEGF receptors Proteins 0.000 claims description 3
- 102000009484 Vascular Endothelial Growth Factor Receptors Human genes 0.000 claims description 3
- 229940124676 vascular endothelial growth factor receptor Drugs 0.000 claims description 3
- 102000009410 Chemokine receptor Human genes 0.000 claims description 2
- 108050000299 Chemokine receptor Proteins 0.000 claims description 2
- 102100032530 Glypican-3 Human genes 0.000 claims description 2
- 101001014668 Homo sapiens Glypican-3 Proteins 0.000 claims description 2
- 101001068133 Homo sapiens Hepatitis A virus cellular receptor 2 Proteins 0.000 claims description 2
- 101000628547 Homo sapiens Metalloreductase STEAP1 Proteins 0.000 claims description 2
- 101000623897 Homo sapiens Mucin-12 Proteins 0.000 claims description 2
- 101000972285 Homo sapiens Mucin-3B Proteins 0.000 claims description 2
- 101000831007 Homo sapiens T-cell immunoreceptor with Ig and ITIM domains Proteins 0.000 claims description 2
- 108010017535 Interleukin-15 Receptors Proteins 0.000 claims description 2
- 102000004556 Interleukin-15 Receptors Human genes 0.000 claims description 2
- 108010043610 KIR Receptors Proteins 0.000 claims description 2
- 102000002698 KIR Receptors Human genes 0.000 claims description 2
- 101150015860 MC1R gene Proteins 0.000 claims description 2
- 102100026712 Metalloreductase STEAP1 Human genes 0.000 claims description 2
- 102100023143 Mucin-12 Human genes 0.000 claims description 2
- 102100022702 Mucin-3B Human genes 0.000 claims description 2
- 102100024834 T-cell immunoreceptor with Ig and ITIM domains Human genes 0.000 claims description 2
- 108010021428 Type 1 Melanocortin Receptor Proteins 0.000 claims description 2
- 102000016549 Vascular Endothelial Growth Factor Receptor-2 Human genes 0.000 claims description 2
- 108010053100 Vascular Endothelial Growth Factor Receptor-3 Proteins 0.000 claims description 2
- 108091008039 hormone receptors Proteins 0.000 claims description 2
- 238000011502 immune monitoring Methods 0.000 claims description 2
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 claims 2
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 claims 2
- 102000007339 Nerve Growth Factor Receptors Human genes 0.000 claims 2
- 102000008022 Proto-Oncogene Proteins c-met Human genes 0.000 claims 2
- 102100037853 C-C chemokine receptor type 4 Human genes 0.000 claims 1
- 102000001301 EGF receptor Human genes 0.000 claims 1
- 108060006698 EGF receptor Proteins 0.000 claims 1
- 101710083479 Hepatitis A virus cellular receptor 2 homolog Proteins 0.000 claims 1
- 102100022019 Pregnancy-specific beta-1-glycoprotein 2 Human genes 0.000 claims 1
- 229940126547 T-cell immunoglobulin mucin-3 Drugs 0.000 claims 1
- 102000016663 Vascular Endothelial Growth Factor Receptor-3 Human genes 0.000 claims 1
- 210000000822 natural killer cell Anatomy 0.000 abstract description 12
- 210000002865 immune cell Anatomy 0.000 abstract description 7
- 150000001875 compounds Chemical class 0.000 abstract description 3
- 230000002147 killing effect Effects 0.000 abstract description 3
- 230000001988 toxicity Effects 0.000 abstract description 3
- 231100000419 toxicity Toxicity 0.000 abstract description 3
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 86
- 241000880493 Leptailurus serval Species 0.000 description 82
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 68
- 108020004414 DNA Proteins 0.000 description 48
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 43
- 108010017391 lysylvaline Proteins 0.000 description 39
- 108010031719 prolyl-serine Proteins 0.000 description 37
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 36
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 35
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 35
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 34
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 32
- 108010060199 cysteinylproline Proteins 0.000 description 32
- 108010015792 glycyllysine Proteins 0.000 description 32
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 30
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 27
- 108010089804 glycyl-threonine Proteins 0.000 description 27
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 26
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 26
- 108010087924 alanylproline Proteins 0.000 description 25
- 108010027338 isoleucylcysteine Proteins 0.000 description 25
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 24
- 108010034529 leucyl-lysine Proteins 0.000 description 24
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 24
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 23
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 23
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 23
- 108010073969 valyllysine Proteins 0.000 description 23
- JDMKQHSHKJHAHR-UHFFFAOYSA-N Phe-Phe-Leu-Tyr Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)CC1=CC=CC=C1 JDMKQHSHKJHAHR-UHFFFAOYSA-N 0.000 description 22
- 108010064235 lysylglycine Proteins 0.000 description 22
- 108010070643 prolylglutamic acid Proteins 0.000 description 22
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 22
- 108010052774 valyl-lysyl-glycyl-phenylalanyl-tyrosine Proteins 0.000 description 22
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 21
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 description 21
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 21
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 21
- 108010078144 glutaminyl-glycine Proteins 0.000 description 21
- 108010050848 glycylleucine Proteins 0.000 description 21
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 21
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 20
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 20
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 20
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 20
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 20
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 20
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 20
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 20
- 108010044292 tryptophyltyrosine Proteins 0.000 description 20
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 19
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 19
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 19
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 18
- 108010065920 Insulin Lispro Proteins 0.000 description 18
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 18
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 18
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 18
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 18
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 18
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 18
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 18
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 18
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 18
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 18
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 18
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 18
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 17
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 17
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 17
- YQPSDMUGFKJZHR-QRTARXTBSA-N Asn-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N YQPSDMUGFKJZHR-QRTARXTBSA-N 0.000 description 17
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 17
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 17
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 17
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 17
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 17
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 17
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 17
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 17
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 17
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 17
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 17
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 17
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 17
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 17
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 17
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 17
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 17
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 17
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 17
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 17
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 17
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 17
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 17
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 17
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 17
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 17
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 17
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 17
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 17
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 17
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 17
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 17
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 17
- 108010047857 aspartylglycine Proteins 0.000 description 17
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 17
- 108010009298 lysylglutamic acid Proteins 0.000 description 17
- 108010077112 prolyl-proline Proteins 0.000 description 17
- 108010026333 seryl-proline Proteins 0.000 description 17
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 16
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 16
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 16
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 16
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 16
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 16
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 16
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 16
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 16
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 16
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 16
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 16
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 16
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 16
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 16
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 16
- 108010070944 alanylhistidine Proteins 0.000 description 16
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 16
- 108010062796 arginyllysine Proteins 0.000 description 16
- 108010092854 aspartyllysine Proteins 0.000 description 16
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 15
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 15
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 15
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 15
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 15
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 15
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 15
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 15
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 15
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 15
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 15
- 108010080629 tryptophan-leucine Proteins 0.000 description 15
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 15
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 14
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 14
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 14
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 14
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 14
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 14
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 14
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 14
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 14
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 14
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 14
- 235000001014 amino acid Nutrition 0.000 description 14
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 14
- 108010051242 phenylalanylserine Proteins 0.000 description 14
- 108010051110 tyrosyl-lysine Proteins 0.000 description 14
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 13
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 13
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 13
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 13
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 13
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 13
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 13
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 13
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 13
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 13
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 12
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 12
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 12
- LGCVSPFCFXWUEY-IHPCNDPISA-N Asn-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N LGCVSPFCFXWUEY-IHPCNDPISA-N 0.000 description 12
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 12
- 102000004127 Cytokines Human genes 0.000 description 12
- 108090000695 Cytokines Proteins 0.000 description 12
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 12
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 12
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 12
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 12
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 12
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 12
- 150000001413 amino acids Chemical class 0.000 description 12
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 11
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 11
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 11
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 11
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 11
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 11
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 11
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 11
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 11
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 11
- 230000027455 binding Effects 0.000 description 11
- 108010010147 glycylglutamine Proteins 0.000 description 11
- 108010057821 leucylproline Proteins 0.000 description 11
- 102000004196 processed proteins & peptides Human genes 0.000 description 11
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 10
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 10
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 10
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 10
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 10
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 10
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 10
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 10
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 10
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 10
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 10
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 10
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 10
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 10
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 10
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 10
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 10
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 10
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 10
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 10
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 10
- 108010068265 aspartyltyrosine Proteins 0.000 description 10
- 108010049041 glutamylalanine Proteins 0.000 description 10
- 108010037850 glycylvaline Proteins 0.000 description 10
- 108010003700 lysyl aspartic acid Proteins 0.000 description 10
- 229920001184 polypeptide Polymers 0.000 description 10
- 108010053725 prolylvaline Proteins 0.000 description 10
- 230000002483 superagonistic effect Effects 0.000 description 10
- 108010027345 wheylin-1 peptide Proteins 0.000 description 10
- IESDGNYHXIOKRW-YXMSTPNBSA-N (2s)-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s,3r)-2-amino-3-hydroxybutanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IESDGNYHXIOKRW-YXMSTPNBSA-N 0.000 description 9
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 9
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 9
- ZUVMUOOHJYNJPP-XIRDDKMYSA-N Arg-Trp-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZUVMUOOHJYNJPP-XIRDDKMYSA-N 0.000 description 9
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 9
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 9
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 9
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 9
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 9
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 9
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 9
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 9
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 9
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 9
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 9
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 9
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 9
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 9
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 9
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 9
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 9
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 9
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 9
- WMKXFMUJRCEGRP-SRVKXCTJSA-N His-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N WMKXFMUJRCEGRP-SRVKXCTJSA-N 0.000 description 9
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 9
- 101001103036 Homo sapiens Nuclear receptor ROR-alpha Proteins 0.000 description 9
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 9
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 9
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 9
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 9
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 9
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 9
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 9
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 9
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 9
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 9
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 9
- 102100039614 Nuclear receptor ROR-alpha Human genes 0.000 description 9
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 9
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 9
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 9
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 9
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 9
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 9
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 9
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 9
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 9
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 9
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 9
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 9
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 9
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 9
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 9
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 9
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 9
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 9
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 9
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 9
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 9
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 9
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 9
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 9
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 9
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 9
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 9
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 9
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 9
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 9
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 9
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 9
- 108010008355 arginyl-glutamine Proteins 0.000 description 9
- DIBLBAURNYJYBF-XLXZRNDBSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2-amino-3-methylbutanoyl]amino]hexanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 DIBLBAURNYJYBF-XLXZRNDBSA-N 0.000 description 8
- NFGXHKASABOEEW-UHFFFAOYSA-N 1-methylethyl 11-methoxy-3,7,11-trimethyl-2,4-dodecadienoate Chemical compound COC(C)(C)CCCC(C)CC=CC(C)=CC(=O)OC(C)C NFGXHKASABOEEW-UHFFFAOYSA-N 0.000 description 8
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 8
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 8
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 8
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 8
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 8
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 8
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 8
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 8
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 8
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 8
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 8
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 8
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 8
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 8
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 8
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 8
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 8
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 8
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 8
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 8
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 8
- HZWWOGWOBQBETJ-CUJWVEQBSA-N His-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O HZWWOGWOBQBETJ-CUJWVEQBSA-N 0.000 description 8
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 8
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 8
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 8
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 8
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 8
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 8
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 8
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 8
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 8
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 8
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 8
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 8
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 8
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 8
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 8
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 8
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 8
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 8
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 8
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 8
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 8
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 8
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 8
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 8
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 8
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 8
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 8
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 8
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 8
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 8
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 8
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 108010073101 phenylalanylleucine Proteins 0.000 description 8
- 108010090894 prolylleucine Proteins 0.000 description 8
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 7
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 7
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 7
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 7
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 7
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 7
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 7
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 7
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 7
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 7
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 7
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 7
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 7
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 7
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 7
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 7
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 7
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 7
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 7
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 7
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 7
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 7
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 7
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 7
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 7
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 7
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 7
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 7
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 7
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 7
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 7
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 7
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 7
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 7
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 7
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 7
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 7
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 7
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 7
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 7
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 7
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 7
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 7
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 7
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 7
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 7
- 108010047495 alanylglycine Proteins 0.000 description 7
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 7
- OZFAFGSSMRRTDW-UHFFFAOYSA-N (2,4-dichlorophenyl) benzenesulfonate Chemical group ClC1=CC(Cl)=CC=C1OS(=O)(=O)C1=CC=CC=C1 OZFAFGSSMRRTDW-UHFFFAOYSA-N 0.000 description 6
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 6
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 6
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 6
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 6
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 6
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 6
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 6
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 6
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 6
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 6
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 6
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 6
- 239000012591 Dulbecco’s Phosphate Buffered Saline Substances 0.000 description 6
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 6
- BETSEXMYBWCDAE-SZMVWBNQSA-N Gln-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BETSEXMYBWCDAE-SZMVWBNQSA-N 0.000 description 6
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 6
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 6
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 6
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 6
- 101001103033 Homo sapiens Tyrosine-protein kinase transmembrane receptor ROR2 Proteins 0.000 description 6
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 6
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 6
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 6
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 6
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 6
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 6
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 6
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 6
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 6
- UEJYSALTSUZXFV-SRVKXCTJSA-N Rigin Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UEJYSALTSUZXFV-SRVKXCTJSA-N 0.000 description 6
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 6
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 6
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 6
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 6
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 6
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 6
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 6
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 6
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 6
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 6
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 6
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 6
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 6
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 6
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 6
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 6
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 6
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 6
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 6
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 238000011161 development Methods 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 108010077515 glycylproline Proteins 0.000 description 6
- 238000009169 immunotherapy Methods 0.000 description 6
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 6
- 230000035755 proliferation Effects 0.000 description 6
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 6
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 5
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 5
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 5
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 5
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 5
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 5
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 5
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 5
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 5
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 5
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 5
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 5
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 5
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 5
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 5
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 5
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 5
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 5
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 5
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 5
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 5
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 5
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 5
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 5
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 5
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 5
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 5
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 5
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 5
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 5
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 5
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 5
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 5
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 5
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 5
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 5
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 5
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 5
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 5
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 5
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 5
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 5
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 5
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 5
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 5
- QQFSKBMCAKWHLG-UHFFFAOYSA-N Ile-Phe-Pro-Pro Chemical compound C1CCC(C(=O)N2C(CCC2)C(O)=O)N1C(=O)C(NC(=O)C(N)C(C)CC)CC1=CC=CC=C1 QQFSKBMCAKWHLG-UHFFFAOYSA-N 0.000 description 5
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 5
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 5
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 5
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 5
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 5
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 5
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 5
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 5
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 5
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 5
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 5
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 5
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 5
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 5
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 5
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 5
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 5
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 5
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 5
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 5
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 5
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 5
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 5
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 5
- 210000001744 T-lymphocyte Anatomy 0.000 description 5
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 5
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 5
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 5
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 5
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 5
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 5
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 5
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 5
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 5
- SMLCYZYQFRTLCO-UWJYBYFXSA-N Tyr-Cys-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O SMLCYZYQFRTLCO-UWJYBYFXSA-N 0.000 description 5
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 5
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 5
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 5
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 5
- OXVPMZVGCAPFIG-BQFCYCMXSA-N Val-Gln-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N OXVPMZVGCAPFIG-BQFCYCMXSA-N 0.000 description 5
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 5
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 5
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 5
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 5
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 5
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 108010081404 acein-2 Proteins 0.000 description 5
- 239000000556 agonist Substances 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 108010038633 aspartylglutamate Proteins 0.000 description 5
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 235000018102 proteins Nutrition 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 108020003175 receptors Proteins 0.000 description 5
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 5
- 108010003137 tyrosyltyrosine Proteins 0.000 description 5
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 4
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 4
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 4
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 4
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 4
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 4
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 4
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 4
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 4
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 4
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 4
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 4
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 4
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 4
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 4
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 4
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 4
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 4
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 4
- 102000017420 CD3 protein, epsilon/gamma/delta subunit Human genes 0.000 description 4
- 108050005493 CD3 protein, epsilon/gamma/delta subunit Proteins 0.000 description 4
- 102100025475 Carcinoembryonic antigen-related cell adhesion molecule 5 Human genes 0.000 description 4
- ROHVCXBMIAAASL-HJGDQZAQSA-N Gln-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)N)N)O ROHVCXBMIAAASL-HJGDQZAQSA-N 0.000 description 4
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 4
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 4
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 4
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 4
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 4
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 4
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 4
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 4
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 4
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 4
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 4
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 4
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 4
- 101001055144 Homo sapiens Interleukin-2 receptor subunit alpha Proteins 0.000 description 4
- 101000917858 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor III-A Proteins 0.000 description 4
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 4
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 4
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 4
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 4
- 102100026878 Interleukin-2 receptor subunit alpha Human genes 0.000 description 4
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 4
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 4
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 4
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 4
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 4
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 4
- 102100029193 Low affinity immunoglobulin gamma Fc region receptor III-A Human genes 0.000 description 4
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 4
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 4
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 4
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 4
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 4
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 4
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 4
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 4
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 4
- 241000288906 Primates Species 0.000 description 4
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 4
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 4
- 108091008680 RAR-related orphan receptors Proteins 0.000 description 4
- 102100020718 Receptor-type tyrosine-protein kinase FLT3 Human genes 0.000 description 4
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 4
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 4
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 4
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 4
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 4
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 4
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 4
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 4
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 4
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 4
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 4
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 4
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 4
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 4
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 4
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 4
- GQHAIUPYZPTADF-FDARSICLSA-N Trp-Ile-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 GQHAIUPYZPTADF-FDARSICLSA-N 0.000 description 4
- SUGLEXVWEJOCGN-ONUFPDRFSA-N Trp-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)O SUGLEXVWEJOCGN-ONUFPDRFSA-N 0.000 description 4
- PKZIWSHDJYIPRH-JBACZVJFSA-N Trp-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKZIWSHDJYIPRH-JBACZVJFSA-N 0.000 description 4
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 4
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 4
- QOIKZODVIPOPDD-AVGNSLFASA-N Tyr-Cys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOIKZODVIPOPDD-AVGNSLFASA-N 0.000 description 4
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 4
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 4
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 4
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 4
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 4
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 4
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 4
- SBJCTAZFSZXWSR-AVGNSLFASA-N Val-Met-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SBJCTAZFSZXWSR-AVGNSLFASA-N 0.000 description 4
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 4
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 4
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 108010011559 alanylphenylalanine Proteins 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 230000003993 interaction Effects 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 230000002101 lytic effect Effects 0.000 description 4
- 238000000034 method Methods 0.000 description 4
- 239000000843 powder Substances 0.000 description 4
- 108010057840 ALT-803 Proteins 0.000 description 3
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 3
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 3
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 3
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 3
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 3
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 3
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 description 3
- 102100032976 CCR4-NOT transcription complex subunit 6 Human genes 0.000 description 3
- 102100027207 CD27 antigen Human genes 0.000 description 3
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 3
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 3
- BPHKULHWEIUDOB-FXQIFTODSA-N Cys-Gln-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BPHKULHWEIUDOB-FXQIFTODSA-N 0.000 description 3
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 3
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 3
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 3
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 3
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 3
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 3
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 3
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 3
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 3
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 3
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 3
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 3
- ZNTSGDNUITWTRA-WDSOQIARSA-N His-Trp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O ZNTSGDNUITWTRA-WDSOQIARSA-N 0.000 description 3
- 101000914511 Homo sapiens CD27 antigen Proteins 0.000 description 3
- 101001103039 Homo sapiens Inactive tyrosine-protein kinase transmembrane receptor ROR1 Proteins 0.000 description 3
- 101001055157 Homo sapiens Interleukin-15 Proteins 0.000 description 3
- 101001003140 Homo sapiens Interleukin-15 receptor subunit alpha Proteins 0.000 description 3
- 101000932478 Homo sapiens Receptor-type tyrosine-protein kinase FLT3 Proteins 0.000 description 3
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 3
- 108010002350 Interleukin-2 Proteins 0.000 description 3
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 3
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 3
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 3
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 3
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- 102100024964 Neural cell adhesion molecule L1 Human genes 0.000 description 3
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 3
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 3
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 3
- 229920001213 Polysorbate 20 Polymers 0.000 description 3
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 3
- 239000012980 RPMI-1640 medium Substances 0.000 description 3
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 3
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 3
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 3
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 3
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 3
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 3
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 3
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 3
- 102100029215 Signaling lymphocytic activation molecule Human genes 0.000 description 3
- 102100034922 T-cell surface glycoprotein CD8 alpha chain Human genes 0.000 description 3
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 3
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 3
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 3
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 3
- KIMOCKLJBXHFIN-YLVFBTJISA-N Trp-Ile-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O)=CNC2=C1 KIMOCKLJBXHFIN-YLVFBTJISA-N 0.000 description 3
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 3
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 3
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 3
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 3
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 3
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 3
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 3
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 3
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 3
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 3
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 3
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 3
- 238000002270 exclusion chromatography Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 235000013861 fat-free Nutrition 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 239000000833 heterodimer Substances 0.000 description 3
- 102000056003 human IL15 Human genes 0.000 description 3
- 108010091871 leucylmethionine Proteins 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 235000013336 milk Nutrition 0.000 description 3
- 239000008267 milk Substances 0.000 description 3
- 210000004080 milk Anatomy 0.000 description 3
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 3
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 3
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 3
- 230000001737 promoting effect Effects 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 3
- 102100040842 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase FUT3 Human genes 0.000 description 2
- 102100031585 ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Human genes 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 2
- 102100022749 Aminopeptidase N Human genes 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 2
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 2
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- ANRZCQXIXGDXLR-CWRNSKLLSA-N Asn-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)N)N)C(=O)O ANRZCQXIXGDXLR-CWRNSKLLSA-N 0.000 description 2
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 2
- 102100032412 Basigin Human genes 0.000 description 2
- 102100032957 C5a anaphylatoxin chemotactic receptor 1 Human genes 0.000 description 2
- 102100024263 CD160 antigen Human genes 0.000 description 2
- 102100032912 CD44 antigen Human genes 0.000 description 2
- 210000001266 CD8-positive T-lymphocyte Anatomy 0.000 description 2
- 102100032378 Carboxypeptidase E Human genes 0.000 description 2
- 108010058255 Carboxypeptidase H Proteins 0.000 description 2
- 102100021396 Cell surface glycoprotein CD200 receptor 1 Human genes 0.000 description 2
- 102100028757 Chondroitin sulfate proteoglycan 4 Human genes 0.000 description 2
- 102100032768 Complement receptor type 2 Human genes 0.000 description 2
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 2
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- 102100029722 Ectonucleoside triphosphate diphosphohydrolase 1 Human genes 0.000 description 2
- 102000005593 Endopeptidases Human genes 0.000 description 2
- 108010059378 Endopeptidases Proteins 0.000 description 2
- 102000018389 Exopeptidases Human genes 0.000 description 2
- 108010091443 Exopeptidases Proteins 0.000 description 2
- 102100035139 Folate receptor alpha Human genes 0.000 description 2
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 2
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 2
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 2
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 2
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 2
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 2
- 102100022623 Hepatocyte growth factor receptor Human genes 0.000 description 2
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 2
- 101000893701 Homo sapiens 3-galactosyl-N-acetylglucosaminide 4-alpha-L-fucosyltransferase FUT3 Proteins 0.000 description 2
- 101000777636 Homo sapiens ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Proteins 0.000 description 2
- 101000757160 Homo sapiens Aminopeptidase N Proteins 0.000 description 2
- 101000798441 Homo sapiens Basigin Proteins 0.000 description 2
- 101000867983 Homo sapiens C5a anaphylatoxin chemotactic receptor 1 Proteins 0.000 description 2
- 101000761938 Homo sapiens CD160 antigen Proteins 0.000 description 2
- 101000868273 Homo sapiens CD44 antigen Proteins 0.000 description 2
- 101000969553 Homo sapiens Cell surface glycoprotein CD200 receptor 1 Proteins 0.000 description 2
- 101000916489 Homo sapiens Chondroitin sulfate proteoglycan 4 Proteins 0.000 description 2
- 101000941929 Homo sapiens Complement receptor type 2 Proteins 0.000 description 2
- 101001012447 Homo sapiens Ectonucleoside triphosphate diphosphohydrolase 1 Proteins 0.000 description 2
- 101001023230 Homo sapiens Folate receptor alpha Proteins 0.000 description 2
- 101000746373 Homo sapiens Granulocyte-macrophage colony-stimulating factor Proteins 0.000 description 2
- 101000599951 Homo sapiens Insulin-like growth factor I Proteins 0.000 description 2
- 101001046686 Homo sapiens Integrin alpha-M Proteins 0.000 description 2
- 101001046677 Homo sapiens Integrin alpha-V Proteins 0.000 description 2
- 101000935043 Homo sapiens Integrin beta-1 Proteins 0.000 description 2
- 101000599852 Homo sapiens Intercellular adhesion molecule 1 Proteins 0.000 description 2
- 101001057504 Homo sapiens Interferon-stimulated gene 20 kDa protein Proteins 0.000 description 2
- 101000998146 Homo sapiens Interleukin-17A Proteins 0.000 description 2
- 101000599056 Homo sapiens Interleukin-6 receptor subunit beta Proteins 0.000 description 2
- 101000917826 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor II-a Proteins 0.000 description 2
- 101000917824 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor II-b Proteins 0.000 description 2
- 101000917839 Homo sapiens Low affinity immunoglobulin gamma Fc region receptor III-B Proteins 0.000 description 2
- 101000961414 Homo sapiens Membrane cofactor protein Proteins 0.000 description 2
- 101000972286 Homo sapiens Mucin-4 Proteins 0.000 description 2
- 101000972278 Homo sapiens Mucin-6 Proteins 0.000 description 2
- 101000581981 Homo sapiens Neural cell adhesion molecule 1 Proteins 0.000 description 2
- 101001051490 Homo sapiens Neural cell adhesion molecule L1 Proteins 0.000 description 2
- 101001098352 Homo sapiens OX-2 membrane glycoprotein Proteins 0.000 description 2
- 101000873418 Homo sapiens P-selectin glycoprotein ligand 1 Proteins 0.000 description 2
- 101000633780 Homo sapiens Signaling lymphocytic activation molecule Proteins 0.000 description 2
- 101000874179 Homo sapiens Syndecan-1 Proteins 0.000 description 2
- 101000914496 Homo sapiens T-cell antigen CD7 Proteins 0.000 description 2
- 101000716102 Homo sapiens T-cell surface glycoprotein CD4 Proteins 0.000 description 2
- 101000795167 Homo sapiens Tumor necrosis factor receptor superfamily member 13B Proteins 0.000 description 2
- 241000341655 Human papillomavirus type 16 Species 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 2
- GTSAALPQZASLPW-KJYZGMDISA-N Ile-His-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N GTSAALPQZASLPW-KJYZGMDISA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 2
- XDVKZSJODLMNLJ-GGQYPGDFSA-N Ile-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 XDVKZSJODLMNLJ-GGQYPGDFSA-N 0.000 description 2
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- 102100037852 Insulin-like growth factor I Human genes 0.000 description 2
- 102100022338 Integrin alpha-M Human genes 0.000 description 2
- 102100022337 Integrin alpha-V Human genes 0.000 description 2
- 102100025304 Integrin beta-1 Human genes 0.000 description 2
- 102100037877 Intercellular adhesion molecule 1 Human genes 0.000 description 2
- 102100020790 Interleukin-12 receptor subunit beta-1 Human genes 0.000 description 2
- 102100033461 Interleukin-17A Human genes 0.000 description 2
- 102100030704 Interleukin-21 Human genes 0.000 description 2
- 102100037795 Interleukin-6 receptor subunit beta Human genes 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 102000017578 LAG3 Human genes 0.000 description 2
- 101150030213 Lag3 gene Proteins 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- 102100029204 Low affinity immunoglobulin gamma Fc region receptor II-a Human genes 0.000 description 2
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 2
- 102000000424 Matrix Metalloproteinase 2 Human genes 0.000 description 2
- 108010016165 Matrix Metalloproteinase 2 Proteins 0.000 description 2
- 102100039373 Membrane cofactor protein Human genes 0.000 description 2
- ABHVWYPPHDYFNY-WDSOQIARSA-N Met-His-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 ABHVWYPPHDYFNY-WDSOQIARSA-N 0.000 description 2
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 2
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- 102100022693 Mucin-4 Human genes 0.000 description 2
- 102100022493 Mucin-6 Human genes 0.000 description 2
- PCLIMKBDDGJMGD-UHFFFAOYSA-N N-bromosuccinimide Chemical compound BrN1C(=O)CCC1=O PCLIMKBDDGJMGD-UHFFFAOYSA-N 0.000 description 2
- JRNVZBWKYDBUCA-UHFFFAOYSA-N N-chlorosuccinimide Chemical compound ClN1C(=O)CCC1=O JRNVZBWKYDBUCA-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 102100021850 Nardilysin Human genes 0.000 description 2
- 108090000970 Nardilysin Proteins 0.000 description 2
- 102100032870 Natural cytotoxicity triggering receptor 1 Human genes 0.000 description 2
- 102100027347 Neural cell adhesion molecule 1 Human genes 0.000 description 2
- 102100037589 OX-2 membrane glycoprotein Human genes 0.000 description 2
- 239000012124 Opti-MEM Substances 0.000 description 2
- 102100034925 P-selectin glycoprotein ligand 1 Human genes 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- 102100029740 Poliovirus receptor Human genes 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- 102100038280 Prostaglandin G/H synthase 2 Human genes 0.000 description 2
- 102100035703 Prostatic acid phosphatase Human genes 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 101710100963 Receptor tyrosine-protein kinase erbB-4 Proteins 0.000 description 2
- 102100029981 Receptor tyrosine-protein kinase erbB-4 Human genes 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 2
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102100035721 Syndecan-1 Human genes 0.000 description 2
- 102100027208 T-cell antigen CD7 Human genes 0.000 description 2
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 2
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 2
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 2
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 2
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- GJOBRAHDRIDAPT-NGTWOADLSA-N Thr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H]([C@@H](C)O)N GJOBRAHDRIDAPT-NGTWOADLSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 2
- COLXBVRHSKPKIE-NYVOZVTQSA-N Trp-Trp-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O COLXBVRHSKPKIE-NYVOZVTQSA-N 0.000 description 2
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 2
- 102100040653 Tryptophan 2,3-dioxygenase Human genes 0.000 description 2
- 102100029675 Tumor necrosis factor receptor superfamily member 13B Human genes 0.000 description 2
- 102100033725 Tumor necrosis factor receptor superfamily member 16 Human genes 0.000 description 2
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 2
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 2
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 2
- STTVVMWQKDOKAM-YESZJQIVSA-N Tyr-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O STTVVMWQKDOKAM-YESZJQIVSA-N 0.000 description 2
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 2
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 2
- MQUYPYFPHIPVHJ-MNSWYVGCSA-N Tyr-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O MQUYPYFPHIPVHJ-MNSWYVGCSA-N 0.000 description 2
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 2
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 2
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- 230000000259 anti-tumor effect Effects 0.000 description 2
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 230000003833 cell viability Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 229940066758 endopeptidases Drugs 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 108010074108 interleukin-21 Proteins 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 230000003211 malignant effect Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 229960000402 palivizumab Drugs 0.000 description 2
- 108010048507 poliovirus receptor Proteins 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 230000000770 proinflammatory effect Effects 0.000 description 2
- 239000012562 protein A resin Substances 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- ZQJHYRVSKHGGJY-YPKJBDGSSA-N (2s,3r)-2-[[(2s)-2-[[(2s)-1-[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]-3-phenylpropanoyl]amino]-3-hydroxybutanoic acid Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 ZQJHYRVSKHGGJY-YPKJBDGSSA-N 0.000 description 1
- SSOORFWOBGFTHL-OTEJMHTDSA-N (4S)-5-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[2-[(2S)-2-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S,3S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-5-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-5-amino-1-[[(2S)-5-carbamimidamido-1-[[(2S)-5-carbamimidamido-1-[[(1S)-4-carbamimidamido-1-carboxybutyl]amino]-1-oxopentan-2-yl]amino]-1-oxopentan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-1-oxohexan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-methyl-1-oxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]carbamoyl]pyrrolidin-1-yl]-2-oxoethyl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-(1H-imidazol-4-yl)-1-oxopropan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-4-[[(2S)-2-[[(2S)-2-[[(2S)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-5-oxopentanoic acid Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SSOORFWOBGFTHL-OTEJMHTDSA-N 0.000 description 1
- FSPQCTGGIANIJZ-UHFFFAOYSA-N 2-[[(3,4-dimethoxyphenyl)-oxomethyl]amino]-4,5,6,7-tetrahydro-1-benzothiophene-3-carboxamide Chemical compound C1=C(OC)C(OC)=CC=C1C(=O)NC1=C(C(N)=O)C(CCCC2)=C2S1 FSPQCTGGIANIJZ-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- 108091007505 ADAM17 Proteins 0.000 description 1
- 108060000255 AIM2 Proteins 0.000 description 1
- 102100021501 ATP-binding cassette sub-family B member 5 Human genes 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 102100035990 Adenosine receptor A2a Human genes 0.000 description 1
- 102100036006 Adenosine receptor A3 Human genes 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- MLNSNVLOEIYJIU-ZUDIRPEPSA-N Ala-Leu-Thr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLNSNVLOEIYJIU-ZUDIRPEPSA-N 0.000 description 1
- REAQAWSENITKJL-DDWPSWQVSA-N Ala-Met-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O REAQAWSENITKJL-DDWPSWQVSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 102100034594 Angiopoietin-1 Human genes 0.000 description 1
- 102100022014 Angiopoietin-1 receptor Human genes 0.000 description 1
- 102100034608 Angiopoietin-2 Human genes 0.000 description 1
- 101001005269 Arabidopsis thaliana Ceramide synthase 1 LOH3 Proteins 0.000 description 1
- 101001005312 Arabidopsis thaliana Ceramide synthase LOH1 Proteins 0.000 description 1
- 101100243447 Arabidopsis thaliana PER53 gene Proteins 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- PCQXGEUALSFGIA-WDSOQIARSA-N Arg-His-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PCQXGEUALSFGIA-WDSOQIARSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- CGYKCTPUGXFPMG-IHPCNDPISA-N Asn-Tyr-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CGYKCTPUGXFPMG-IHPCNDPISA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- 102100022716 Atypical chemokine receptor 3 Human genes 0.000 description 1
- 201000008271 Atypical teratoid rhabdoid tumor Diseases 0.000 description 1
- 101700002522 BARD1 Proteins 0.000 description 1
- 102100028048 BRCA1-associated RING domain protein 1 Human genes 0.000 description 1
- 102000015735 Beta-catenin Human genes 0.000 description 1
- 108060000903 Beta-catenin Proteins 0.000 description 1
- 101001042041 Bos taurus Isocitrate dehydrogenase [NAD] subunit beta, mitochondrial Proteins 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 102100036848 C-C motif chemokine 20 Human genes 0.000 description 1
- 102100036166 C-X-C chemokine receptor type 1 Human genes 0.000 description 1
- 238000011740 C57BL/6 mouse Methods 0.000 description 1
- 108010062802 CD66 antigens Proteins 0.000 description 1
- 102000015367 CRBN Human genes 0.000 description 1
- 108091011896 CSF1 Proteins 0.000 description 1
- 102100024153 Cadherin-15 Human genes 0.000 description 1
- 108010052495 Calgranulin B Proteins 0.000 description 1
- 102100028801 Calsyntenin-1 Human genes 0.000 description 1
- 102100036214 Cannabinoid receptor 2 Human genes 0.000 description 1
- 101710187022 Cannabinoid receptor 2 Proteins 0.000 description 1
- 102100024423 Carbonic anhydrase 9 Human genes 0.000 description 1
- 108090000087 Carboxypeptidase B Proteins 0.000 description 1
- 102000003670 Carboxypeptidase B Human genes 0.000 description 1
- 108090000018 Carboxypeptidase D Proteins 0.000 description 1
- 102100032407 Carboxypeptidase D Human genes 0.000 description 1
- 108090000007 Carboxypeptidase M Proteins 0.000 description 1
- 102100032936 Carboxypeptidase M Human genes 0.000 description 1
- 102100021953 Carboxypeptidase Z Human genes 0.000 description 1
- 108010080937 Carboxypeptidases A Proteins 0.000 description 1
- 102000000496 Carboxypeptidases A Human genes 0.000 description 1
- 102100024533 Carcinoembryonic antigen-related cell adhesion molecule 1 Human genes 0.000 description 1
- 102100025473 Carcinoembryonic antigen-related cell adhesion molecule 6 Human genes 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 1
- 101710163595 Chaperone protein DnaK Proteins 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 102100040901 Circadian clock protein PASD1 Human genes 0.000 description 1
- 102100040835 Claudin-18 Human genes 0.000 description 1
- 108050009324 Claudin-18 Proteins 0.000 description 1
- 102100038449 Claudin-6 Human genes 0.000 description 1
- 108090000229 Claudin-6 Proteins 0.000 description 1
- 102220579739 Cohesin subunit SA-1_S51D_mutation Human genes 0.000 description 1
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- MWZSCEAYQCMROW-GUBZILKMSA-N Cys-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N MWZSCEAYQCMROW-GUBZILKMSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- BBQIWFFTTQTNOC-AVGNSLFASA-N Cys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N BBQIWFFTTQTNOC-AVGNSLFASA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- 108010005843 Cysteine Proteases Proteins 0.000 description 1
- 102000005927 Cysteine Proteases Human genes 0.000 description 1
- 102100038497 Cytokine receptor-like factor 2 Human genes 0.000 description 1
- 102100039315 Cytoplasmic polyadenylation element-binding protein 4 Human genes 0.000 description 1
- 102100037700 DNA mismatch repair protein Msh3 Human genes 0.000 description 1
- 102100024607 DNA topoisomerase 1 Human genes 0.000 description 1
- 102100036466 Delta-like protein 3 Human genes 0.000 description 1
- 102100029588 Deoxycytidine kinase Human genes 0.000 description 1
- 102100030074 Dickkopf-related protein 1 Human genes 0.000 description 1
- 102100031111 Disintegrin and metalloproteinase domain-containing protein 17 Human genes 0.000 description 1
- 102100024361 Disintegrin and metalloproteinase domain-containing protein 9 Human genes 0.000 description 1
- 102100035273 E3 ubiquitin-protein ligase CBL-B Human genes 0.000 description 1
- 102100026245 E3 ubiquitin-protein ligase RNF43 Human genes 0.000 description 1
- 101150049307 EEF1A2 gene Proteins 0.000 description 1
- 102000012804 EPCAM Human genes 0.000 description 1
- 101150076616 EPHA2 gene Proteins 0.000 description 1
- 101150016325 EPHA3 gene Proteins 0.000 description 1
- 108091016436 EPS8 Proteins 0.000 description 1
- 102000020045 EPS8 Human genes 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 102100030340 Ephrin type-A receptor 2 Human genes 0.000 description 1
- 102100030324 Ephrin type-A receptor 3 Human genes 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 102100035290 Fibroblast growth factor 13 Human genes 0.000 description 1
- 108090000379 Fibroblast growth factor 2 Proteins 0.000 description 1
- 108010008599 Forkhead Box Protein M1 Proteins 0.000 description 1
- 102100023374 Forkhead box protein M1 Human genes 0.000 description 1
- 102100035233 Furin Human genes 0.000 description 1
- 108090001126 Furin Proteins 0.000 description 1
- 102100032340 G2/mitotic-specific cyclin-B1 Human genes 0.000 description 1
- 102100022086 GRB2-related adapter protein 2 Human genes 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- 101710088083 Glomulin Proteins 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- MIQCYAJSDGNCNK-BPUTZDHNSA-N Glu-Gln-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MIQCYAJSDGNCNK-BPUTZDHNSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 102100021184 Golgi membrane protein 1 Human genes 0.000 description 1
- 102100033851 Gonadotropin-releasing hormone receptor Human genes 0.000 description 1
- 102100034221 Growth-regulated alpha protein Human genes 0.000 description 1
- 108010074032 HLA-A2 Antigen Proteins 0.000 description 1
- 102000025850 HLA-A2 Antigen Human genes 0.000 description 1
- 102000006354 HLA-DR Antigens Human genes 0.000 description 1
- 108010058597 HLA-DR Antigens Proteins 0.000 description 1
- 101150051208 HSPH1 gene Proteins 0.000 description 1
- 101710178376 Heat shock 70 kDa protein Proteins 0.000 description 1
- 101710152018 Heat shock cognate 70 kDa protein Proteins 0.000 description 1
- 102100031624 Heat shock protein 105 kDa Human genes 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- FOCSWPCHUDVNLP-PMVMPFDFSA-N His-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC4=CN=CN4)N FOCSWPCHUDVNLP-PMVMPFDFSA-N 0.000 description 1
- 108090000353 Histone deacetylase Proteins 0.000 description 1
- 102000003964 Histone deacetylase Human genes 0.000 description 1
- 102100039996 Histone deacetylase 1 Human genes 0.000 description 1
- 102100039999 Histone deacetylase 2 Human genes 0.000 description 1
- 102100038715 Histone deacetylase 8 Human genes 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000677872 Homo sapiens ATP-binding cassette sub-family B member 5 Proteins 0.000 description 1
- 101000783751 Homo sapiens Adenosine receptor A2a Proteins 0.000 description 1
- 101000783645 Homo sapiens Adenosine receptor A3 Proteins 0.000 description 1
- 101000924552 Homo sapiens Angiopoietin-1 Proteins 0.000 description 1
- 101000753291 Homo sapiens Angiopoietin-1 receptor Proteins 0.000 description 1
- 101000924533 Homo sapiens Angiopoietin-2 Proteins 0.000 description 1
- 101000678890 Homo sapiens Atypical chemokine receptor 3 Proteins 0.000 description 1
- 101000713099 Homo sapiens C-C motif chemokine 20 Proteins 0.000 description 1
- 101000947174 Homo sapiens C-X-C chemokine receptor type 1 Proteins 0.000 description 1
- 101000762242 Homo sapiens Cadherin-15 Proteins 0.000 description 1
- 101000714553 Homo sapiens Cadherin-3 Proteins 0.000 description 1
- 101000910338 Homo sapiens Carbonic anhydrase 9 Proteins 0.000 description 1
- 101000914326 Homo sapiens Carcinoembryonic antigen-related cell adhesion molecule 6 Proteins 0.000 description 1
- 101000613559 Homo sapiens Circadian clock protein PASD1 Proteins 0.000 description 1
- 101000725401 Homo sapiens Cytochrome c oxidase subunit 2 Proteins 0.000 description 1
- 101000956427 Homo sapiens Cytokine receptor-like factor 2 Proteins 0.000 description 1
- 101000745636 Homo sapiens Cytoplasmic polyadenylation element-binding protein 4 Proteins 0.000 description 1
- 101000830681 Homo sapiens DNA topoisomerase 1 Proteins 0.000 description 1
- 101001056901 Homo sapiens Delta(14)-sterol reductase TM7SF2 Proteins 0.000 description 1
- 101000928513 Homo sapiens Delta-like protein 3 Proteins 0.000 description 1
- 101000864646 Homo sapiens Dickkopf-related protein 1 Proteins 0.000 description 1
- 101000832769 Homo sapiens Disintegrin and metalloproteinase domain-containing protein 9 Proteins 0.000 description 1
- 101000737265 Homo sapiens E3 ubiquitin-protein ligase CBL-B Proteins 0.000 description 1
- 101000692702 Homo sapiens E3 ubiquitin-protein ligase RNF43 Proteins 0.000 description 1
- 101000868643 Homo sapiens G2/mitotic-specific cyclin-B1 Proteins 0.000 description 1
- 101001040742 Homo sapiens Golgi membrane protein 1 Proteins 0.000 description 1
- 101000996727 Homo sapiens Gonadotropin-releasing hormone receptor Proteins 0.000 description 1
- 101001069921 Homo sapiens Growth-regulated alpha protein Proteins 0.000 description 1
- 101001035024 Homo sapiens Histone deacetylase 1 Proteins 0.000 description 1
- 101001035011 Homo sapiens Histone deacetylase 2 Proteins 0.000 description 1
- 101001032118 Homo sapiens Histone deacetylase 8 Proteins 0.000 description 1
- 101001033728 Homo sapiens Histone-lysine N-methyltransferase MECOM Proteins 0.000 description 1
- 101000839066 Homo sapiens Hypoxia-inducible lipid droplet-associated protein Proteins 0.000 description 1
- 101100232351 Homo sapiens IL12RB1 gene Proteins 0.000 description 1
- 101001015059 Homo sapiens Integrin beta-5 Proteins 0.000 description 1
- 101000852870 Homo sapiens Interferon alpha/beta receptor 1 Proteins 0.000 description 1
- 101000852865 Homo sapiens Interferon alpha/beta receptor 2 Proteins 0.000 description 1
- 101000599940 Homo sapiens Interferon gamma Proteins 0.000 description 1
- 101001001420 Homo sapiens Interferon gamma receptor 1 Proteins 0.000 description 1
- 101001082073 Homo sapiens Interferon-induced helicase C domain-containing protein 1 Proteins 0.000 description 1
- 101000960952 Homo sapiens Interleukin-1 receptor accessory protein Proteins 0.000 description 1
- 101000852483 Homo sapiens Interleukin-1 receptor-associated kinase 1 Proteins 0.000 description 1
- 101001083151 Homo sapiens Interleukin-10 receptor subunit alpha Proteins 0.000 description 1
- 101001003142 Homo sapiens Interleukin-12 receptor subunit beta-1 Proteins 0.000 description 1
- 101001019598 Homo sapiens Interleukin-17 receptor A Proteins 0.000 description 1
- 101001010626 Homo sapiens Interleukin-22 Proteins 0.000 description 1
- 101000599048 Homo sapiens Interleukin-6 receptor subunit alpha Proteins 0.000 description 1
- 101000960234 Homo sapiens Isocitrate dehydrogenase [NADP] cytoplasmic Proteins 0.000 description 1
- 101000945333 Homo sapiens Killer cell immunoglobulin-like receptor 2DL3 Proteins 0.000 description 1
- 101100182737 Homo sapiens MTDH gene Proteins 0.000 description 1
- 101000916644 Homo sapiens Macrophage colony-stimulating factor 1 receptor Proteins 0.000 description 1
- 101001106413 Homo sapiens Macrophage-stimulating protein receptor Proteins 0.000 description 1
- 101000620359 Homo sapiens Melanocyte protein PMEL Proteins 0.000 description 1
- 101001057156 Homo sapiens Melanoma-associated antigen C2 Proteins 0.000 description 1
- 101001133056 Homo sapiens Mucin-1 Proteins 0.000 description 1
- 101000593405 Homo sapiens Myb-related protein B Proteins 0.000 description 1
- 101000588302 Homo sapiens Nuclear factor erythroid 2-related factor 2 Proteins 0.000 description 1
- 101000686034 Homo sapiens Nuclear receptor ROR-gamma Proteins 0.000 description 1
- 101000633516 Homo sapiens Nuclear receptor subfamily 2 group F member 6 Proteins 0.000 description 1
- 101001098175 Homo sapiens P2X purinoceptor 7 Proteins 0.000 description 1
- 101000741896 Homo sapiens POTE ankyrin domain family member D Proteins 0.000 description 1
- 101000874141 Homo sapiens Probable ATP-dependent RNA helicase DDX43 Proteins 0.000 description 1
- 101001117317 Homo sapiens Programmed cell death 1 ligand 1 Proteins 0.000 description 1
- 101000610551 Homo sapiens Prominin-1 Proteins 0.000 description 1
- 101001117519 Homo sapiens Prostaglandin E2 receptor EP2 subtype Proteins 0.000 description 1
- 101000605127 Homo sapiens Prostaglandin G/H synthase 2 Proteins 0.000 description 1
- 101001001272 Homo sapiens Prostatic acid phosphatase Proteins 0.000 description 1
- 101000880770 Homo sapiens Protein SSX2 Proteins 0.000 description 1
- 101000941994 Homo sapiens Protein cereblon Proteins 0.000 description 1
- 101001072227 Homo sapiens Protocadherin-18 Proteins 0.000 description 1
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 description 1
- 101001090901 Homo sapiens Retroelement silencing factor 1 Proteins 0.000 description 1
- 101000633784 Homo sapiens SLAM family member 7 Proteins 0.000 description 1
- 101000829127 Homo sapiens Somatostatin receptor type 2 Proteins 0.000 description 1
- 101000617830 Homo sapiens Sterol O-acyltransferase 1 Proteins 0.000 description 1
- 101000617130 Homo sapiens Stromal cell-derived factor 1 Proteins 0.000 description 1
- 101000662902 Homo sapiens T cell receptor beta constant 2 Proteins 0.000 description 1
- 101000946860 Homo sapiens T-cell surface glycoprotein CD3 epsilon chain Proteins 0.000 description 1
- 101000946843 Homo sapiens T-cell surface glycoprotein CD8 alpha chain Proteins 0.000 description 1
- 101000595548 Homo sapiens TIR domain-containing adapter molecule 1 Proteins 0.000 description 1
- 101000772267 Homo sapiens Thyrotropin receptor Proteins 0.000 description 1
- 101000831567 Homo sapiens Toll-like receptor 2 Proteins 0.000 description 1
- 101000831496 Homo sapiens Toll-like receptor 3 Proteins 0.000 description 1
- 101000669460 Homo sapiens Toll-like receptor 5 Proteins 0.000 description 1
- 101000669402 Homo sapiens Toll-like receptor 7 Proteins 0.000 description 1
- 101000800483 Homo sapiens Toll-like receptor 8 Proteins 0.000 description 1
- 101000666379 Homo sapiens Transcription factor Dp family member 3 Proteins 0.000 description 1
- 101000894428 Homo sapiens Transcriptional repressor CTCFL Proteins 0.000 description 1
- 101000635938 Homo sapiens Transforming growth factor beta-1 proprotein Proteins 0.000 description 1
- 101000635958 Homo sapiens Transforming growth factor beta-2 proprotein Proteins 0.000 description 1
- 101000658584 Homo sapiens Transmembrane 4 L6 family member 5 Proteins 0.000 description 1
- 101000801433 Homo sapiens Trophoblast glycoprotein Proteins 0.000 description 1
- 101000892398 Homo sapiens Tryptophan 2,3-dioxygenase Proteins 0.000 description 1
- 101000611183 Homo sapiens Tumor necrosis factor Proteins 0.000 description 1
- 101000610604 Homo sapiens Tumor necrosis factor receptor superfamily member 10B Proteins 0.000 description 1
- 101000997835 Homo sapiens Tyrosine-protein kinase JAK1 Proteins 0.000 description 1
- 101000934996 Homo sapiens Tyrosine-protein kinase JAK3 Proteins 0.000 description 1
- 101000955962 Homo sapiens Vacuolar protein sorting-associated protein 51 homolog Proteins 0.000 description 1
- 101000851018 Homo sapiens Vascular endothelial growth factor receptor 1 Proteins 0.000 description 1
- 101000851007 Homo sapiens Vascular endothelial growth factor receptor 2 Proteins 0.000 description 1
- 101000956004 Homo sapiens Vitamin D-binding protein Proteins 0.000 description 1
- 101000621371 Homo sapiens WD and tetratricopeptide repeats protein 1 Proteins 0.000 description 1
- 101000814512 Homo sapiens X antigen family member 1 Proteins 0.000 description 1
- 101000892274 Human adenovirus C serotype 2 Adenovirus death protein Proteins 0.000 description 1
- 241001354547 Human rhinovirus C3 Species 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- 102100028891 Hypoxia-inducible lipid droplet-associated protein Human genes 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- AYLAAGNJNVZDPY-CYDGBPFRSA-N Ile-Met-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N AYLAAGNJNVZDPY-CYDGBPFRSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 102100033010 Integrin beta-5 Human genes 0.000 description 1
- 102100036714 Interferon alpha/beta receptor 1 Human genes 0.000 description 1
- 102100036718 Interferon alpha/beta receptor 2 Human genes 0.000 description 1
- 102100037850 Interferon gamma Human genes 0.000 description 1
- 102100035678 Interferon gamma receptor 1 Human genes 0.000 description 1
- 102100036157 Interferon gamma receptor 2 Human genes 0.000 description 1
- 102100027353 Interferon-induced helicase C domain-containing protein 1 Human genes 0.000 description 1
- 102100024064 Interferon-inducible protein AIM2 Human genes 0.000 description 1
- 102100039880 Interleukin-1 receptor accessory protein Human genes 0.000 description 1
- 102100036342 Interleukin-1 receptor-associated kinase 1 Human genes 0.000 description 1
- 102000003814 Interleukin-10 Human genes 0.000 description 1
- 108090000174 Interleukin-10 Proteins 0.000 description 1
- 102100030236 Interleukin-10 receptor subunit alpha Human genes 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 102000013462 Interleukin-12 Human genes 0.000 description 1
- 108010085418 Interleukin-13 Receptor alpha2 Subunit Proteins 0.000 description 1
- 102000007482 Interleukin-13 Receptor alpha2 Subunit Human genes 0.000 description 1
- 102100020789 Interleukin-15 receptor subunit alpha Human genes 0.000 description 1
- 102100035018 Interleukin-17 receptor A Human genes 0.000 description 1
- 102100030703 Interleukin-22 Human genes 0.000 description 1
- 102000004889 Interleukin-6 Human genes 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 102100037792 Interleukin-6 receptor subunit alpha Human genes 0.000 description 1
- 102100039905 Isocitrate dehydrogenase [NADP] cytoplasmic Human genes 0.000 description 1
- 102100033634 Killer cell immunoglobulin-like receptor 2DL3 Human genes 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- 206010067125 Liver injury Diseases 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- MLLKLNYPZRDIQG-GUBZILKMSA-N Lys-Cys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N MLLKLNYPZRDIQG-GUBZILKMSA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 108010085169 Lysine carboxypeptidase Proteins 0.000 description 1
- 101001018085 Lysobacter enzymogenes Lysyl endopeptidase Proteins 0.000 description 1
- 102000043136 MAP kinase family Human genes 0.000 description 1
- 108091054455 MAP kinase family Proteins 0.000 description 1
- 102100028123 Macrophage colony-stimulating factor 1 Human genes 0.000 description 1
- 102100028198 Macrophage colony-stimulating factor 1 receptor Human genes 0.000 description 1
- 102100021435 Macrophage-stimulating protein receptor Human genes 0.000 description 1
- 102100022430 Melanocyte protein PMEL Human genes 0.000 description 1
- 102100027252 Melanoma-associated antigen C2 Human genes 0.000 description 1
- AFVOKRHYSSFPHC-STECZYCISA-N Met-Ile-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFVOKRHYSSFPHC-STECZYCISA-N 0.000 description 1
- RIWWCXKWIUQIAY-SZMVWBNQSA-N Met-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RIWWCXKWIUQIAY-SZMVWBNQSA-N 0.000 description 1
- UXJHNUBJSQQIOC-SZMVWBNQSA-N Met-Trp-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O UXJHNUBJSQQIOC-SZMVWBNQSA-N 0.000 description 1
- 102100026261 Metalloproteinase inhibitor 3 Human genes 0.000 description 1
- 102100023482 Mitogen-activated protein kinase 14 Human genes 0.000 description 1
- 102100034256 Mucin-1 Human genes 0.000 description 1
- 101100007718 Mus musculus Crisp1 gene Proteins 0.000 description 1
- 101100407308 Mus musculus Pdcd1lg2 gene Proteins 0.000 description 1
- 102100034670 Myb-related protein B Human genes 0.000 description 1
- 101000944608 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) Chaperonin GroEL 2 Proteins 0.000 description 1
- 101001055320 Myxine glutinosa Insulin-like growth factor Proteins 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 102100022691 NACHT, LRR and PYD domains-containing protein 3 Human genes 0.000 description 1
- 108010004217 Natural Cytotoxicity Triggering Receptor 1 Proteins 0.000 description 1
- 102100029527 Natural cytotoxicity triggering receptor 3 ligand 1 Human genes 0.000 description 1
- 101710201161 Natural cytotoxicity triggering receptor 3 ligand 1 Proteins 0.000 description 1
- 108010012255 Neural Cell Adhesion Molecule L1 Proteins 0.000 description 1
- 102100031701 Nuclear factor erythroid 2-related factor 2 Human genes 0.000 description 1
- 102100023421 Nuclear receptor ROR-gamma Human genes 0.000 description 1
- 102100029528 Nuclear receptor subfamily 2 group F member 6 Human genes 0.000 description 1
- 102220490907 Olfactomedin-like protein 2A_N72A_mutation Human genes 0.000 description 1
- 102000016979 Other receptors Human genes 0.000 description 1
- 102100037602 P2X purinoceptor 7 Human genes 0.000 description 1
- 102100038762 POTE ankyrin domain family member D Human genes 0.000 description 1
- 102100030476 POU domain class 2-associating factor 1 Human genes 0.000 description 1
- 101710114665 POU domain class 2-associating factor 1 Proteins 0.000 description 1
- 102000036673 PRAME Human genes 0.000 description 1
- 108060006580 PRAME Proteins 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- 101710181935 Phosphate-binding protein PstS 1 Proteins 0.000 description 1
- 102100036056 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit delta isoform Human genes 0.000 description 1
- 101710204747 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit delta isoform Proteins 0.000 description 1
- 102100021768 Phosphoserine aminotransferase Human genes 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- 102100035724 Probable ATP-dependent RNA helicase DDX43 Human genes 0.000 description 1
- 108700030875 Programmed Cell Death 1 Ligand 2 Proteins 0.000 description 1
- 102100024213 Programmed cell death 1 ligand 2 Human genes 0.000 description 1
- 102100023832 Prolyl endopeptidase FAP Human genes 0.000 description 1
- 102100040120 Prominin-1 Human genes 0.000 description 1
- 108010044159 Proprotein Convertases Proteins 0.000 description 1
- 102000006437 Proprotein Convertases Human genes 0.000 description 1
- 102100024448 Prostaglandin E2 receptor EP2 subtype Human genes 0.000 description 1
- 108050003267 Prostaglandin G/H synthase 2 Proteins 0.000 description 1
- 108010072866 Prostate-Specific Antigen Proteins 0.000 description 1
- 101710119219 Protachykinin-1 Proteins 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 102100032133 Protein LYRIC Human genes 0.000 description 1
- 102100032420 Protein S100-A9 Human genes 0.000 description 1
- 102100037686 Protein SSX2 Human genes 0.000 description 1
- 102220586251 Protein yippee-like 4_L45D_mutation Human genes 0.000 description 1
- 102100036397 Protocadherin-18 Human genes 0.000 description 1
- 108010001946 Pyrin Domain-Containing 3 Protein NLR Family Proteins 0.000 description 1
- 108010025832 RANK Ligand Proteins 0.000 description 1
- 101000820656 Rattus norvegicus Seminal vesicle secretory protein 4 Proteins 0.000 description 1
- 101710151245 Receptor-type tyrosine-protein kinase FLT3 Proteins 0.000 description 1
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 description 1
- 102100034981 Retroelement silencing factor 1 Human genes 0.000 description 1
- 108010044012 STAT1 Transcription Factor Proteins 0.000 description 1
- 108010017324 STAT3 Transcription Factor Proteins 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- 102100022346 Serine/threonine-protein phosphatase 5 Human genes 0.000 description 1
- 101710129069 Serine/threonine-protein phosphatase 5 Proteins 0.000 description 1
- 101710199542 Serine/threonine-protein phosphatase T Proteins 0.000 description 1
- 102100029904 Signal transducer and activator of transcription 1-alpha/beta Human genes 0.000 description 1
- 102100024040 Signal transducer and activator of transcription 3 Human genes 0.000 description 1
- 108010074687 Signaling Lymphocytic Activation Molecule Family Member 1 Proteins 0.000 description 1
- 102100023802 Somatostatin receptor type 2 Human genes 0.000 description 1
- 101000668858 Spinacia oleracea 30S ribosomal protein S1, chloroplastic Proteins 0.000 description 1
- 102100021993 Sterol O-acyltransferase 1 Human genes 0.000 description 1
- 101710196623 Stimulator of interferon genes protein Proteins 0.000 description 1
- 101000898746 Streptomyces clavuligerus Clavaminate synthase 1 Proteins 0.000 description 1
- 101000697584 Streptomyces lavendulae Streptothricin acetyltransferase Proteins 0.000 description 1
- 102100021669 Stromal cell-derived factor 1 Human genes 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 102100037298 T cell receptor beta constant 2 Human genes 0.000 description 1
- 102100035794 T-cell surface glycoprotein CD3 epsilon chain Human genes 0.000 description 1
- 101150057140 TACSTD1 gene Proteins 0.000 description 1
- 108700012457 TACSTD2 Proteins 0.000 description 1
- 102100030302 TBC1 domain family member 8 Human genes 0.000 description 1
- 102100036073 TIR domain-containing adapter molecule 1 Human genes 0.000 description 1
- 101150080074 TP53 gene Proteins 0.000 description 1
- 101000874827 Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8) Dephospho-CoA kinase Proteins 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- VBPDMBAFBRDZSK-HOUAVDHOSA-N Thr-Asn-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VBPDMBAFBRDZSK-HOUAVDHOSA-N 0.000 description 1
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 102000005497 Thymidylate Synthase Human genes 0.000 description 1
- 102100029337 Thyrotropin receptor Human genes 0.000 description 1
- 108010031429 Tissue Inhibitor of Metalloproteinase-3 Proteins 0.000 description 1
- 241000723792 Tobacco etch virus Species 0.000 description 1
- 102100024333 Toll-like receptor 2 Human genes 0.000 description 1
- 102100024324 Toll-like receptor 3 Human genes 0.000 description 1
- 102100039357 Toll-like receptor 5 Human genes 0.000 description 1
- 102100039390 Toll-like receptor 7 Human genes 0.000 description 1
- 102100033110 Toll-like receptor 8 Human genes 0.000 description 1
- 102100038129 Transcription factor Dp family member 3 Human genes 0.000 description 1
- 102100021393 Transcriptional repressor CTCFL Human genes 0.000 description 1
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 1
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 1
- 102000004060 Transforming Growth Factor-beta Type II Receptor Human genes 0.000 description 1
- 108010082684 Transforming Growth Factor-beta Type II Receptor Proteins 0.000 description 1
- 102100030742 Transforming growth factor beta-1 proprotein Human genes 0.000 description 1
- 102100030737 Transforming growth factor beta-2 proprotein Human genes 0.000 description 1
- 102100034898 Transmembrane 4 L6 family member 5 Human genes 0.000 description 1
- 102100033579 Trophoblast glycoprotein Human genes 0.000 description 1
- VEYXZZGMIBKXCN-UBHSHLNASA-N Trp-Asp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VEYXZZGMIBKXCN-UBHSHLNASA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 1
- MKDXQPMIQPTTAW-SIXJUCDHSA-N Trp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N MKDXQPMIQPTTAW-SIXJUCDHSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 1
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 101710136122 Tryptophan 2,3-dioxygenase Proteins 0.000 description 1
- 102100040247 Tumor necrosis factor Human genes 0.000 description 1
- 102100024568 Tumor necrosis factor ligand superfamily member 11 Human genes 0.000 description 1
- 102100024584 Tumor necrosis factor ligand superfamily member 12 Human genes 0.000 description 1
- 101710097155 Tumor necrosis factor ligand superfamily member 12 Proteins 0.000 description 1
- 102100040112 Tumor necrosis factor receptor superfamily member 10B Human genes 0.000 description 1
- 102100027212 Tumor-associated calcium signal transducer 2 Human genes 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 1
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- XYNFFTNEQDWZNY-ULQDDVLXSA-N Tyr-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N XYNFFTNEQDWZNY-ULQDDVLXSA-N 0.000 description 1
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 1
- UPODKYBYUBTWSV-BZSNNMDCSA-N Tyr-Phe-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 UPODKYBYUBTWSV-BZSNNMDCSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- 102100033438 Tyrosine-protein kinase JAK1 Human genes 0.000 description 1
- 102100025387 Tyrosine-protein kinase JAK3 Human genes 0.000 description 1
- 101150020913 USP7 gene Proteins 0.000 description 1
- 102100021013 Ubiquitin carboxyl-terminal hydrolase 7 Human genes 0.000 description 1
- 108700011958 Ubiquitin-Specific Peptidase 7 Proteins 0.000 description 1
- 229940126752 Ubiquitin-specific protease 7 inhibitor Drugs 0.000 description 1
- 102000003990 Urokinase-type plasminogen activator Human genes 0.000 description 1
- 108090000435 Urokinase-type plasminogen activator Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- 108010073923 Vascular Endothelial Growth Factor C Proteins 0.000 description 1
- 102100033178 Vascular endothelial growth factor receptor 1 Human genes 0.000 description 1
- 102100033179 Vascular endothelial growth factor receptor 3 Human genes 0.000 description 1
- 102100038611 Vitamin D-binding protein Human genes 0.000 description 1
- 102100023038 WD and tetratricopeptide repeats protein 1 Human genes 0.000 description 1
- 102000040856 WT1 Human genes 0.000 description 1
- 108700020467 WT1 Proteins 0.000 description 1
- 101150084041 WT1 gene Proteins 0.000 description 1
- 102100039490 X antigen family member 1 Human genes 0.000 description 1
- 210000000683 abdominal cavity Anatomy 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000005975 antitumor immune response Effects 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- FUHMZYWBSHTEDZ-UHFFFAOYSA-M bispyribac-sodium Chemical compound [Na+].COC1=CC(OC)=NC(OC=2C(=C(OC=3N=C(OC)C=C(OC)N=3)C=CC=2)C([O-])=O)=N1 FUHMZYWBSHTEDZ-UHFFFAOYSA-M 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 230000017531 blood circulation Effects 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 108010053786 carboxypeptidase Z Proteins 0.000 description 1
- 230000022534 cell killing Effects 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 238000012054 celltiter-glo Methods 0.000 description 1
- 239000012916 chromogenic reagent Substances 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 229940127276 delta-like ligand 3 Drugs 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 238000005227 gel permeation chromatography Methods 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 231100000753 hepatic injury Toxicity 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000002631 hypothermal effect Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000037451 immune surveillance Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 239000002955 immunomodulating agent Substances 0.000 description 1
- 230000002625 immunotoxic effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 208000027866 inflammatory disease Diseases 0.000 description 1
- 108010085650 interferon gamma receptor Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000005210 lymphoid organ Anatomy 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 108010066052 multidrug resistance-associated protein 1 Proteins 0.000 description 1
- 108010066416 multidrug resistance-associated protein 3 Proteins 0.000 description 1
- AEMBWNDIEFEPTH-UHFFFAOYSA-N n-tert-butyl-n-ethylnitrous amide Chemical compound CCN(N=O)C(C)(C)C AEMBWNDIEFEPTH-UHFFFAOYSA-N 0.000 description 1
- 229960003301 nivolumab Drugs 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 108010068338 p38 Mitogen-Activated Protein Kinases Proteins 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 108040000983 polyphosphate:AMP phosphotransferase activity proteins Proteins 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 239000012474 protein marker Substances 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 210000003289 regulatory T cell Anatomy 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 108010091078 rigin Proteins 0.000 description 1
- 229960004641 rituximab Drugs 0.000 description 1
- 102200147816 rs80356634 Human genes 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 101150050955 stn gene Proteins 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 238000007910 systemic administration Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 101150047061 tag-72 gene Proteins 0.000 description 1
- 238000002626 targeted therapy Methods 0.000 description 1
- ZRKFYGHZFMAOKI-QMGMOQQFSA-N tgfbeta Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRKFYGHZFMAOKI-QMGMOQQFSA-N 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 229960001322 trypsin Drugs 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 230000005909 tumor killing Effects 0.000 description 1
- 229960005356 urokinase Drugs 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000004580 weight loss Effects 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/39—Medicinal preparations containing antigens or antibodies characterised by the immunostimulating additives, e.g. chemical adjuvants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/52—Cytokines; Lymphokines; Interferons
- C07K14/54—Interleukins [IL]
- C07K14/5443—IL-15
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/715—Receptors; Cell surface antigens; Cell surface determinants for cytokines; for lymphokines; for interferons
- C07K14/7155—Receptors; Cell surface antigens; Cell surface determinants for cytokines; for lymphokines; for interferons for interleukins [IL]
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1027—Paramyxoviridae, e.g. respiratory syncytial virus
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
- C07K16/2803—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily
- C07K16/2818—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily against CD28 or CD152
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
- C07K16/2803—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily
- C07K16/2827—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily against B7 molecules, e.g. CD80, CD86
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
- C07K16/2863—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against receptors for growth factors, growth regulators
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/32—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against translation products of oncogenes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/505—Medicinal preparations containing antigens or antibodies comprising antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Immunology (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Engineering & Computer Science (AREA)
- Pharmacology & Pharmacy (AREA)
- General Chemical & Material Sciences (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Veterinary Medicine (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Cell Biology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Virology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Oncology (AREA)
- Pulmonology (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Peptides Or Proteins (AREA)
Abstract
本发明提供了一种IL‑15与IL‑15Ra分别融合在靶向细胞表面抗原的抗体的免疫细胞因子,可有效地将IL‑15与IL‑15Ra的复合物特异性靶向肿瘤微环境,激活肿瘤内或附近的相关免疫细胞,达到特异性杀伤肿瘤的目标,同时可以避免因全身性过度激活NK细胞诱导的免疫毒性。
Description
技术领域
本发明属于生物制药领域,具体涉及一种IL-15与IL-15Ra分别融合于抗体的免疫细胞因子或者IL-15单独与抗体融合的免疫细胞因子及其制备和用途。
背景技术
在过去几年中,免疫疗法已被用于治疗人类的各种肿瘤。免疫疗法利用人体的免疫系统去攻击或杀死恶性肿瘤细胞,同时对健康组织无影响。免疫系统具有识别和清除恶性肿瘤细胞的能力,但是,肿瘤进化出多种逃避免疫监视的机制。因此,免疫疗法的挑战在于开发出能有效、安全地增强机体抗肿瘤免疫反应的策略。目前正在进行和开发的肿瘤免疫治疗策略主要有细胞因子治疗、过继细胞转移、肿瘤疫苗、单克隆抗体等。其中刺激或活化肿瘤特异性的T细胞或NK的免疫反应,发挥T细胞或NK细胞对肿瘤的杀伤作用是特别令人感兴趣的[The Potential and Promise of IL-15in Immuno-Oncogenic Therapies]。
IL-15是一种具有与IL-2类似的结构的细胞因子,其可有效调节效应天然杀伤细胞(NK)、记忆CD8+T细胞的发育、增殖和活化。IL-15Ra是与IL-15具有高亲和力的跨膜蛋白,其与IL-15结合后通过反式递呈作用将IL-15递呈至表达IL2R beta和gamma链的NK细胞和T细胞表面,促进这些细胞的增殖和活化,增强这些细胞的肿瘤杀伤活性。另外,与IL-2不同,IL-15不会导致Treg细胞的激活,目前作为肿瘤免疫治疗制剂正在进行广泛的临床前和临床研究。
IL-15与IL-15Ra形成的复合物(IL-15超激动剂)进一步加快了以IL-15为基础的肿瘤免疫治疗的步伐。与单体IL-15相比,IL-15超激动剂具有突出的优势:如更高的血液循环浓度、更长的体内半衰期以及增强的刺激NK和CD8T细胞的能力。目前正在开发的IL-15超激动剂主要有两大类:IL-15通过linker与IL-15Ra融合形成的RLI(如Cytune的CYP0150)、以及IL-15与IL-15Ra形成的分子内复合物(如诺华的NIZ985、Altor的ALT-803、Xencer的XmAb24306)。CYP0150,由于不含有Fc,在体内易被清除,半衰期很短,需频繁给药;NIZ985和ALT-803分子中,IL-15和IL-15Ra通过分子间的作用力形成复合物,不仅分子本身不稳定,而且制备工艺较繁琐。XmAb24306通过将IL-15和IL-15Ra分别融合在Fc的N端,形成的分子内复合物较为稳定,而且具有较长的体内半衰期。大量的临床前研究表明,与天然IL-15相比,IL-15超激动剂的生理活性及抗肿瘤活性大大增强。一些研究表明,其他免疫疗法如Nivolumab(anti-PD-1单抗)、Rituximab(anti-CD20单抗)可有效增强ALT-803对于进展性肿瘤患者的抗肿瘤效应。另外,全身性施用IL-15超激动剂可偏向性地扩增淋巴器官中促炎症(CD11bhighCD27high)NK细胞亚群,这些细胞亚群分泌的大量IFNγ等促炎症因子可能会导致诸如体温过低、体重减少、肝损伤等免疫毒性作用。
本发明人发现,将IL-15与IL-15Ra分别融合在靶向肿瘤抗原的抗体形成的免疫细胞因子,可以特异性靶向肿瘤微环境。在肿瘤微环境内,该免疫细胞因子中的IL-15/IL15Ra形成的复合物,可以激活肿瘤内或附近的相关免疫细胞,发挥肿瘤微环境内T细胞或者NK细胞对肿瘤的特异性杀伤作用,同时可以避免全身性过度激活T细胞或者NK细胞诱导的免疫毒性。
本发明简述
本发明人发现,将IL15与IL-15Ra分别融合在靶向治疗相关细胞表面抗原的抗体形成的免疫细胞因子,可以有效地将免疫细胞因子特异性靶向肿瘤微环境,激活肿瘤内或附近的相关免疫细胞,达到特异性杀伤肿瘤的目标,同时可以避免因全身性过度激活NK细胞诱导的免疫毒性。
本发明的一个方面,提供了一种免疫细胞因子,所述免疫细胞因子包括:
(A)白介素-15(IL-15);
(B)白介素-15受体a亚基(IL-15Ra);
(C)靶向治疗相关细胞表面抗原的抗体;
其中,所述IL-15直接或通过连接肽与所述抗体的重链可变区N端连接,所述IL-15Ra直接或通过连接肽与所述抗体的轻链可变区N端连接;或者,所述IL-15直接或通过连接肽与所述抗体的轻链可变区N端连接,所述IL-15Ra直接或通过连接肽与所述抗体的重链可变区N端连接;或者,所述IL-15直接或通过连接肽与所述抗体的轻链恒定区CL的C端连接,所述IL-15Ra直接或通过连接肽与所述抗体的重链恒定区CH1的C端连接;或者所述IL-15直接或通过连接肽与所述抗体的重链恒定区CH1的C端连接,所述IL-15Ra直接或通过连接肽与所述抗体的轻链恒定区CL的C端连接。在一些实施方案中,所述连接肽为裂解性连接肽或非裂解性连接肽。在一些实施方案中,其中所述连接肽独立地选自但不限于GGGGSGGGGSGGGGSG、GSPLGVRGS、GSPLGVR、PLGVR、GGGGSGPLGVRGGGGSG或GGGGSGPLGVR等。
在一些实施方案中,所述IL-15为哺乳动物细胞IL-15,优选灵长类IL-15,更优选人类IL-15。在一些实施方案中,所述IL-15为野生型IL-15或IL-15衍生物。在一些实施方案中,所述IL-15具有如SEQ ID NO:71所示的核酸序列和SEQ ID NO:72所示的氨基酸序列;在一些实施方案中,所述IL-15具有如SEQ ID NO:111所示的核酸序列和SEQ ID NO:112所示的氨基酸序列。在另外的一些实施方案中,所述IL-15具有如与SEQ ID NO:72或SEQ ID NO:11285%-100%同源的氨基酸序列。
在一些实施方案中,所述IL-15Ra为IL-15Ra。在一些实施方案中,所述IL-15Ra为IL-15Ra的Sushi结构域。在一些实施方案中,所述IL-15Ra为IL-15RaSushi结构域的变异体或者衍生物。在一些实施方案中,所述IL-15Ra具有如SEQ ID NO:73所示的核酸序列和SEQID NO:74所示的氨基酸序列。在一些实施方案中,所述IL-15Ra具有如SEQ ID NO:87所示的核酸序列或SEQ ID NO:88所示的氨基酸序列。在另外的一些实施方案中,所述IL-15Ra具有与SEQ ID NO:74或SEQ ID NO:8885%-100%同源的氨基酸序列。
在一些实施方案中,所述治疗相关细胞表面抗原为CD抗原。在一些实施方案中,所述CD抗原选自但不限于CD19、CD20、CD47、CD40、CD73、CD33、CD38、CD123、CD30、CD3、CD22、CD25、CD133、CD27、CD39、CD138、CD46、CD56、CD70、CD32、CD11b、CD135、CD171、CD174、CD147、CD155、CD16、CD162、CD16a、CD200、CD21、CD28、CD44、CD52、CD54、CD7、CD80、CD88、CD13、CD130、CD150、CD160、CD200R1、CD267、CD29、CD3E、CD4、CD51或CD8等。
在一些实施方案中,所述治疗相关细胞表面抗原为免疫监测点蛋白。在一些实施方案中,所述免疫监测点蛋白选自但不限于PD-1、PD-L1、CTLA-4、LAG-3、OX40、CD28、CD40、CD47、CD70、CD80、CD122、GTIR、A2AR、B7-H3(CD276)、B7-H4、IDO、KIR、Tim-3或4-1BB(CD137)等。在一些实施方案中,所述免疫监测点蛋白为PD-1。在一些实施方案中,靶向免疫监测点蛋白PD-1的抗体具有如SEQ ID NO:76所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:78所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列,或者具有如SEQ ID NO:76所示的重链氨基酸序列中所含的vH序列,以及如SEQ ID NO:78所示的轻链氨基酸序列中所含的VL序列。在一些实施方案中,所述靶向免疫监测点蛋白PD-1的抗体具有如SEQ ID NO:75所示的重链核酸序列和SEQ ID NO:77所示的轻链核酸序列,或者具有如SEQ ID NO:76所示的重链氨基酸序列和SEQ ID NO:78所示的轻链氨基酸序列。
在一些实施方案中,所述免疫监测点蛋白为PD-L1。在一些实施方案中,所述靶向免疫监测点蛋白PD-L1的抗体具有SEQ ID NO:36所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:38所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列;或者具有SEQ ID NO:80所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:82所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列。在一些实施方案中,所述靶向免疫监测点蛋白PD-L1的抗体具有SEQ ID NO:36所示的氨基酸序列中所含的VH序列,以及如SEQ ID NO:38所示的氨基酸序列中所含的VL序列;或者具有SEQ ID NO:80所示的重链氨基酸序列中所含的VH序列,以及如SEQ ID NO:82所示的轻链氨基酸序列中所含的VL序列。在一些实施方案中,所述靶向免疫监测点蛋白PD-L1的抗体具有如SEQ ID NO:35所示的重链核酸序列和SEQ ID NO:37所示的轻链核酸序列,或者具有如SEQ ID NO:36所示的重链氨基酸序列和SEQ ID NO:38所示的轻链氨基酸序列。在一些实施方案中,所述靶向免疫监测点蛋白PD-L1的抗体具有如SEQ ID NO:79所示的重链核酸序列和SEQ ID NO:81所示的轻链核酸序列,具有如SEQ ID NO:80所示的重链氨基酸序列和SEQID NO:82所示的轻链氨基酸序列。
在一些实施方案中,所述治疗相关细胞表面抗原选自肿瘤表面抗原。在一些实施方案中,所述肿瘤表面抗原选自但不限于EGFR、HER2、HER3、HER4、NY-ESO-1、GPC-3、CLL-1、BCMA、GD2、EpCAM、化学趋化因子受体家族(CCRl、CCR2、CCR3、CCR4、CCR5、CCR6、CCR7、CCR8、CCR9、CCR10、CCL27、CCL28、CX3CR1、CXCR1、CXCR2、CXCR3、CXCR4、CXCR5、CXCR6)、mucin家族(MUC1、MUC2、MUC3A、MUC3B、MUC4、MUC5AC、MUC5B、MUC6、MUC7、MUC8、MUC12、MUC13、MUC15、MUC16、MUC17、MUC19、MUC20)、PSMA、CEA、HDAC6、Mesothelin、TERT、TLR、TLR9、TLR4、CD33、GITR、Survivin、CD123、TIGIT、成纤维细胞生长因子受体(FGFR)、血管内皮生长因子受体(FLT1、KDR/Flk-1、VEGFR-3)、肝细胞生长因子受体(HGFR)、神经生长因子受体(NGFR)、胰岛素样生长因子受体(IGFR)、血小板衍生生长因子受体(PDGFR)或激素受体(黑皮质素1受体(MC1R,MSHR)等。
在一些实施方案中,所述肿瘤表面抗原为EGFR。在一些实施方案中,所述靶向肿瘤表面抗原EGFR的抗体具有如SEQ ID NO:32所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:34所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列,或者具有如SEQ ID NO:32所示的重链氨基酸序列中所含的VH序列,以及如SEQ ID NO:34所示的轻链氨基酸序列中所含的VL序列。在一些实施方案中,所述靶向肿瘤表面抗原EGFR的抗体具有如SEQ ID NO:29所示的重链核酸序列和SEQ ID NO:33所示的轻链核酸序列,或者具有如SEQ ID NO:30所示的重链氨基酸序列和SEQ ID NO:34所示的轻链氨基酸序列。在一些实施方案中,所述靶向肿瘤表面抗原EGFR的抗体具有如SEQ ID NO:31所示的重链核酸序列和SEQ ID NO:33所示的轻链核酸序列,或者具有如SEQ ID NO:32所示的重链氨基酸序列和SEQ ID NO:34所示的轻链氨基酸序列。
在一些实施方案中,所述肿瘤表面抗原为HER2。在一些实施方案中,靶向HER2的抗体具有如SEQ ID NO:18所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:20所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列,或者靶向HER2的抗体具有如SEQ ID NO:18所示的重链氨基酸序列中所含的VH序列,以及如SEQ ID NO:20所示的轻链氨基酸序列中所含的VL序列。在一些实施方案中,靶向HER2的抗体具有如SEQ IDNO:17所示的重链核酸序列和SEQ ID NO:19所示的轻链核酸序列,或者具有如SEQ ID NO:18所示的重链氨基酸序列和SEQ ID NO:20所示的轻链氨基酸序列。
在一些实施方案中,所述细胞表面抗原选自病毒相关抗原。在一些实施方案中,所述病毒相关抗原选自但不限于RSV F、HPV E6、HPV E7、HPV L2、HPV 16、HPV E6/7、HPV L1、HPV16 E6、HPV16 E7等。在一些具体的实施方案中,所述病毒相关抗原为RSV F。在一些实施方案中,靶向RSV病毒F蛋白的抗体具有SEQ ID NO:54所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:56所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列;或者具有SEQ ID NO:54所示的重链氨基酸序列中所含的VH序列,以及如SEQ ID NO:56所示的轻链氨基酸序列中所含的VL序列。在一些实施方案中,靶向病毒相关抗原RSV F蛋白的抗体具有如SEQ ID NO:53所示的重链核酸序列和SEQ ID NO:55所示的轻链核酸序列,或者具有如SEQ ID NO:54所示的重链氨基酸序列和SEQ ID NO:56所示的轻链氨基酸序列。
在一些实施方案中,所述免疫细胞因子由分别具有如下氨基酸序列的重链和轻链组成:SEQ ID NO:2和SEQ ID NO:8;SEQ ID NO:6和SEQ ID NO:4;SEQ ID NO:68和SEQ IDNO:8;SEQ ID NO:70和SEQ ID NO:4;SEQ ID NO:10和SEQ ID NO:12;SEQ ID NO:14和SEQID NO:16;SEQ ID NO:22和SEQ ID NO:24;SEQ ID NO:26和SEQ ID NO:28;SEQ ID NO:40和SEQ ID NO:24;SEQ ID NO:44和SEQ ID NO:28;SEQ ID NO:46和SEQ ID NO:48;SEQ ID NO:42和SEQ ID NO:58;SEQ ID NO:84和SEQ ID NO:86;SEQ ID NO:60和SEQ ID NO:62;SEQ IDNO:64和SEQ ID NO:66;SEQ ID NO:115和SEQ ID NO:116;SEQ ID NO:117和SEQ ID NO:118;SEQ ID NO:119和SEQ ID NO:12;SEQ ID NO:120和SEQ ID NO:16。本领域技术人员容易理解,本发明的免疫细胞因子的“重链”指包含靶向治疗相关细胞表面抗原的抗体的重链可变区及与之连接的IL-15或IL-15Ra的多肽链,免疫细胞因子的“轻链”指包含靶向治疗相关细胞表面抗原的抗体的轻链可变区及与之连接的IL-15Ra或IL-15的多肽链。
在一些实施方案中,所述免疫细胞因子由分别具有如下氨基酸序列的重链和轻链组成:SEQ ID NO:1和SEQ ID NO:7;SEQ ID NO:5和SEQ ID NO:3;SEQ ID NO:67和SEQ IDNO:7;SEQ ID NO:69和SEQ ID NO:3;SEQ ID NO:9和SEQ ID NO:11;SEQ ID NO:13和SEQ IDNO:15;SEQ ID NO:21和SEQ ID NO:23;SEQ ID NO:25和SEQ ID NO:27;SEQ ID NO:39和SEQID NO:23;SEQ ID NO:43和SEQ ID NO:27;SEQ ID NO:45和SEQ ID NO:47;SEQ ID NO:41和SEQ ID NO:57;SEQ ID NO:83和SEQ ID NO:85;SEQ ID NO:59和SEQ ID NO:61;SEQ ID NO:63和SEQ ID NO:65。
本发明的一个方面,提供了一种免疫细胞因子,所述免疫细胞因子包括:
(A)白介素-15(IL-15);
(B)靶向治疗相关细胞表面抗原的抗体;
其中,所述IL-15直接或者通过连接肽与所述抗体重链可变区的N端和/或轻链可变区的N端连接;或者,所述IL-15直接或者通过连接肽与所述抗体重链恒定区的C端或轻链恒定区的C端连接。
在一些实施方案中,所述连接肽为裂解性连接肽或非裂解性连接肽。在一些实施方案中,其中所述连接肽独立地选自但不限于GGGGSGGGGSGGGGSG、GSPLGVRGS、GSPLGVR、PLGVR、GGGGSGPLGVRGGGGSG或GGGGSGPLGVR等。
在一些实施方案中,所述IL-15为哺乳动物细胞IL-15,优选灵长类IL-15,更优选人类IL-15。在一些实施方案中,所述IL-15为野生型IL-15或IL-15衍生物。在一些实施方案中,所述IL-15具有如SEQ ID NO:71所示的核酸序列和SEQ ID NO:72所示的氨基酸序列;在一些实施方案中,所述IL-15具有如SEQ ID NO:111所示的核酸序列和SEQ ID NO:112所示的氨基酸序列。在另外的一些实施方案中,所述IL-15具有如与SEQ ID NO:72或SEQ ID NO:11285%-100%同源的氨基酸序列。
在一些实施方案中,所述治疗相关细胞表面抗原选自肿瘤表面抗原。在一些实施方案中,所述肿瘤表面抗原选自但不限于EGFR、HER2、HER3、HER4、NY-ESO-1、GPC-3、CLL-1、BCMA、GD2、EpCAM、化学趋化因子受体家族(CCR1、CCR2、CCR3、CCR4、CCR5、CCR6、CCR7、CCR8、CCR9、CCR10、CCL27、CCL28、CX3CR1、CXCR1、CXCR2、CXCR3、CXCR4、CXCR5、CXCR6)、mucin家族(MUC1、MUC2、MUC3A、MUC3B、MUC4、MUC5AC、MUC5B、MUC6、MUC7、MUC8、MUC12、MUC13、MUC15、MUC16、MUC17、MUC19、MUC20)、PSMA、CEA、HDAC6、Mesothelin、TERT、TLR、TLR9、TLR4、CD33、GITR、Survivin、CD123、TIGIT、成纤维细胞生长因子受体(FGFR)、血管内皮生长因子受体(FLT1、KDR/Flk-1、VEGFR-3)、肝细胞生长因子受体(HGFR)、神经生长因子受体(NGFR)、胰岛素样生长因子受体(IGFR)、血小板衍生生长因子受体(PDGFR)或激素受体(黑皮质素1受体(MC1R,MSHR)等。
在一些实施方案中,所述肿瘤表面抗原为EGFR。在一些实施方案中,所述靶向肿瘤表面抗原EGFR的抗体具有如SEQ ID NO:32所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:34所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列,或者具有如SEQ ID NO:32所示的重链氨基酸序列中所含的VH序列,以及如SEQ ID NO:34所示的轻链氨基酸序列中所含的vL序列。在一些实施方案中,所述靶向肿瘤表面抗原EGFR的抗体具有如SEQ ID NO:29所示的重链核酸序列和SEQ ID NO:33所示的轻链核酸序列,或者具有如SEQ ID NO:30所示的重链氨基酸序列和SEQ ID NO:34所示的轻链氨基酸序列。在一些实施方案中,所述靶向肿瘤表面抗原EGFR的抗体具有如SEQ ID NO:31所示的重链核酸序列和SEQ ID NO:33所示的轻链核酸序列,或者具有如SEQ ID NO:32所示的重链氨基酸序列和SEQ ID NO:34所示的轻链氨基酸序列。
在一些实施方案中,所述肿瘤表面抗原为HER2。在一些实施方案中,靶向HER2的抗体具有如SEQ ID NO:18所示的重链氨基酸序列中所含的HCD R1、HCDR2和HCDR3序列,以及如SEQ ID NO:20所示的轻链氨基酸序列中所含的LCD R1、LCDR2和LCDR3序列,或者靶向HER2的抗体具有如SEQ ID NO:18所示的重链氨基酸序列中所含的VH序列,以及如SEQ IDNO:20所示的轻链氨基酸序列中所含的VL序列。在一些实施方案中,靶向HER2的抗体具有如SEQ ID NO:17所示的重链核酸序列和SEQ ID NO:19所示的轻链核酸序列,或者具有如SEQID NO:18所示的重链氨基酸序列和SEQ ID NO:20所示的轻链氨基酸序列。
在一些实施方案中,所述免疫细胞因子由分别具有如下氨基酸序列的重链和轻链组成:SEQ ID NO:90和SEQ ID NO:34;SEQ ID NO:92和SEQ ID NO:34;SEQ ID NO:94和SEQID NO:34;SEQ ID NO:96和SEQ ID NO:34;SEQ ID NO:98和SEQ ID NO:20;SEQ ID NO:100和SEQ ID NO:20;SEQ ID NO:110和SEQ ID NO:102;SEQ ID NO:110和SEQ ID NO:104;SEQID NO:98和SEQ ID NO:102;SEQ ID NO:98和SEQ ID NO:104;SEQ ID NO:100和SEQ ID NO:102;SEQ ID NO:100和SEQ ID NO:104;SEQ ID NO:106和SEQ ID NO:20;SEQ ID NO:110和SEQ ID NO:108;SEQ ID NO:106和SEQ ID NO:108;SEQ ID NO:113和SEQ ID NO:34;SEQ IDNO:114和SEQ ID NO:34。
在一些实施方案中,所述免疫细胞因子由分别具有如下氨基酸序列的重链和轻链组成:SEQ ID NO:89和SEQ ID NO:33;SEQ ID NO:91和SEQ ID NO:33;SEQ ID NO:93和SEQID NO:33;SEQ ID NO:95和SEQ ID NO:33;SEQ ID NO:97和SEQ ID NO:19;SEQ ID NO:99和SEQ ID NO:19;SEQ ID NO:109和SEQ ID NO:101;SEQ ID NO:109和SEQ ID NO:103;SEQ IDNO:97和SEQ ID NO:101;SEQ ID NO:97和SEQ ID NO:103;SEQ ID NO:99和SEQ ID NO:101;SEQ ID NO:99和SEQ ID NO:103;SEQ ID NO:105和SEQ ID NO:19;SEQ ID NO:109和SEQ IDNO:107;SEQ ID NO:105和SEQ ID NO:107。
一方面,本发明提供了编码如上所述免疫细胞因子的核酸。
一方面,本发明提供了一种含有如上所述免疫细胞因子核酸序列的载体。
一方面,本发明提供了一种宿主细胞,其包含如上所述的载体。
一方面,本发明提供了一种药物组合物,其包含药学上可接受的载体或制剂以及如上所述的免疫细胞因子。
另一方面,本发明提供了一种治疗有需要的受试者炎症性疾病或癌症的方法,所述方法包括对所述受试者施用治疗有效量的组合物,所述组合物包含药学可接受形式的如上所述的免疫细胞因子。
附图说明
构成本申请的附图用来提供对本发明的进一步理解,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定,在附图中:
图1为基于抗体的免疫细胞因子构建体(名称参见表1)的SDS-PAGE图。各小图中M为蛋白marker,图1A和1B中泳道上方标示的A代表BSIC-01,B代表BSIC-10,C代表BSIC-09,D代表BSIC-05,E代表BSIC-06,F代表BSIC-03。“-”表示不加beta-巯基乙醇后上样,“+”表示加beta-巯基乙醇后上样。
图2为基于抗体的免疫细胞因子凝胶层析结果,其中2A为BSIC-03、2B为BSIC-05、2C为BSIC-06、2D为BSIC-09、2E为BSIC-10、2F为BSIC-24。
图3为基于抗体的免疫细胞因子与IL-15结合(图3A)或IL-15RaSushi结合(图3B)的ELISA检测结果,其中阳性1为IL-15RaSushi-Fc融合蛋白(Novoprotein),阳性2为IL-15-Fc融合蛋白(金斯瑞生物)
图4为基于抗体的免疫细胞因子促进Mo7e细胞增殖的检测结果,其中阳性3为IL-15与IL-15RaSushi-Fc通过非共价相互作用形成的异二聚体。
图5显示了aEGFR(5A)、aHER2(5B)、aPD1(5C)、aPDL1(5D和5E)、synagis(5F)抗体重链和轻链可变区,互补决定区(CDRs)为下划线标出的部分。
图6为基于抗体的免疫细胞因子刺激脾脏(6A)和PBMC(6B)中淋巴细胞FACS检测结果,其中阳性3为IL-15与IL-15RaSushi-Fc通过非共价相互作用形成的异二聚体,阴性为DPBS。
图7为基于抗体的免疫细胞因子体内与抗原ELISA检测结果,其中7A为免疫细胞因子与抗原hEGFR的结合;7B为免疫细胞细胞因子与抗原hPD-L1的结合;7C为免疫细胞因子与抗原hHER2的结合。
本发明的详述
本发明在此通过对使用下述定义和实施例的引用进行详细描述。所有在本文中提及的专利和公开文献的内容,包括在这些专利和公开中披露的所有序列,明确地通过提述并入本文。
本文所述的“免疫细胞因子”(immunocytokines)是指细胞因子与抗体的融合物。
本文的术语“抗体”以最广泛的含义使用,并且包括各种抗体结构,包括但不限于单克隆抗体及其衍生物和工程化抗体,包括如多特异性抗体(例如DVD-Ig、CrossMab、ART-Ig、FIT-Ig、Duobody等双特异性抗体,三特异性抗体)和抗体片段,包括但不限于Fab、Fab’、(Fab’)2、Fv、Diabody等等,只要它们表现出所期望的治疗相关细胞表面抗原结合活性,且具有能够在N端与IL-15和IL-15Ra连接的重链可变区和轻链可变区。在某些情形下,例如在IL-15或IL-15Ra与轻链CL或重链CH1的C端连接的场合,抗体亦可包含重链和轻链恒定区或其部分。在这样的情形中,抗体可以是,例如,Fab的形式。
术语“CDR”是指在免疫球蛋白可变区序列内的互补决定区。对于各重链和轻可变区,在重链和轻链的各可变区中存在三个CDR,其被命名为CDR1、CDR2和CDR3。术语“CDR组”是指在能够结合抗原的单一可变区中出现的三个CDR的组。这些CDR的确切边界已根据不同系统不同地定义。由Kabat(Kabat等(1987)和(1991))描述的系统,不仅提供了可适用于抗体或结合蛋白的任何可变区的明确残基编号系统,而且还提供了定义各重链或轻链序列中的三个CDR的精确残基边界。这些CDR可以被称为Kabat CDR。Chothia和同事(Chothia和Lesk(1987)J.Mol.Biol.196:901-917;Chothia等(1989)Nature342:877-883)发现KabatCDR内的某些亚部分采取几乎相同的肽骨架构象,尽管在氨基酸序列水平上具有大的多样性。这些亚部分被命名为L1、L2和L3或H1、H2和H3,其中“L”和“H”分别指轻链和重链区域。这些区域可以被称为Chothia CDR,所述Chothia CDR具有与Kabat CDR重叠的边界。定义与Kabat CDR重叠的CDR的其它边界已由Padlan(1995)FASEB J.9:133-139和MacCallum(1996)J.Mol.Biol.262(5):732-45)描述。还有其它CDR边界定义可以不严格遵循本文系统之一,但仍将与Kabat CDR重叠,尽管鉴于特定残基或残基组或甚至整个CDR不显著影响抗原结合的预测或实验发现,它们可以缩短或加长。本文综合了Kabat和Contact定义CDR的方法(详见http://www.bioinf,org.uk/abs/)。
术语“同源性”或“同一性”指两个聚合物分子之间(例如,在两个核酸分子(如,两个DNA分子或两个RNA分子之间)或在两个多肽分子之间)的次级单位序列同一性。当这两个分子中的次级单位位置被相同单体性次级单位占据时,例如,若两个DNA分子中每个分子中的某位置被腺嘌呤占据,则它们在该位置是同源或相同的。两个序列之间的同一性直接随匹配位置或同源位置的数目而变化,例如,如果两个序列中一半位置(例如长度为十个次级单位的聚合物中的五个位置)是同源的,则这两个序列是同源的(亦可称具有50%同源性或50%同一性);如果90%的位置(例如10个位置中9个位置)是匹配或同源的,则这两个序列是90%同源的(亦可称具有90%同源性或90%同一性)。
本发明的“IL-15”为哺乳动物细胞白介素15(IL-15),优选灵长类IL-15,更优选IL-15。本发明的“IL-1 5”指具有与选自由SEQ ID NO:72氨基酸序列至少85%(即,相当于约20个氨基酸改变)的同一性百分数的氨基酸序列,优选地至少99%(相当于约1个氨基酸改变),但仍保持与野生型IL-15相当的生理活性的IL-15衍生物或变体。本领域技术人员可基于其知识和本专利申请的教导鉴定这样的衍生物。也应当理解天然氨基酸可以被化学修饰的氨基酸所替代。通常,这样的化学修饰的氨基酸提高多肽半衰期(见国际专利申请WO/2015/131994)。如前述使用的两个氨基酸序列之间“同一性百分数”的定义参加国际专利申请WO/2015/131994。
优选地,IL-15衍生物或变体为IL-15激动剂或超激动剂。本领域技术人员可以基于现有技术的方法鉴定IL-15激动剂或超激动剂。作为IL-15超激动剂或超激动剂的实例,可以引用在国际申请WO2005/085282或Zhu等的(J.Immunol,vol.183(6),p:3598-607,2009)中公开的那些。
更优先地,所述IL-15激动剂或超激动剂包括选自下组的取代:L45D、L45E、S51D、L52D、N72D、N72E、N72A、N72S、N72Y和N72P(上述氨基酸位点参照人IL-15的序列,SEQ IDNO.:72)。
本发明的“IL-15Ra的Sushi结构域”具有本领域中一般公知的含义。所述Sushi结构域为哺乳动物IL-15Ra的Sushi结构域,优选灵长类IL-15Ra的Sushi结构域,更优选人IL-15Ra的Sushi结构域。优选地,包括人IL-15Ra的sushi结构域具有如SEQ ID NO.:74所示的氨基酸序列。
如本文使用,术语“IL-15Ra的sushi结构域的衍生物”是指与SEQ ID NO:74所示氨基酸序列至少85%(即相当于约10个氨基酸改变)的同一性百分数的氨基酸序列,优选地至少99%(相当于约1个氨基酸改变)的IL-15Ra的sushi结构域的衍生物。本领域技术人员可基于其个人的知识和本专利申请的教导鉴定这样的衍生物。也应当理解天然氨基酸可以被化学修饰的氨基酸所替代。通常,这样的化学修饰的氨基酸可提高多肽半衰期(见国际专利申请WO/2015/131994)。如前述使用的两个氨基酸序列之间“同一性百分数”的定义,参见例如WO/2015/131994。
本发明的“治疗相关细胞表面抗原”指表达在细胞表面的、可以被靶向以进行IL-15介导的疾病治疗的抗原。本发明的“治疗相关细胞表面抗原”包括CD抗原、免疫监测点抗原、肿瘤相关抗原、肿瘤特异性抗原、病毒诱发的肿瘤抗原,且可选自但不限于下面所述抗原:CD19、PD-1、PD-L1、HER2、STAT3、STEAP1、CTLA-4、IDO、NY-ESO-1、CD40、CSF1R、BCMA、MUC1、ADORA2A、CD20、GD2、TLR7、WT1、IFNAR1、CD47;Neoantigen、EGFR、LAG-3、OX40、PSMA、Mesothelin、TERT、TLR、TLR9、4-1BB、IL2R、TLR4、CD33、GITR、HPV E6、Survivin、CD123、TIGITTIM-3、CD73、HPVE7、TLR3、CD38、EBV、STING、CD22、GPC3、HDAC1、CXCR4、GMCSFR、CD30、CEACAM5、HDAC6、HPV、CD3、MAGE-A3、TNF、PSA、CD25、CEA、EPCAM、CMV、IL12、PRAME、IL12R、5T4、Beta catenin、CCR2、PMEL、CXCL12、IGF1、CD46、CXCR1、GMCSF、IL15R、ROR1、TGFBR2、CCR4、FLT-3、FOLR1、GCSFR、ICOS、JAK2、KRAS、VISTA、CD133、CD27、CD39、CEACAM6、NKG2D、STAT5、TGFB1、TLR2、USP7、ANG1、ANG2、B7-H3、CLEC12A、IL13RA2、RIG-I、TRP2、VEGF、AFP、Alpha-Gal、COX-2、EPHA2、gp96、MUC16、p53、TGF-β、CD138、CDw136、CS1、CXCR2、EGFRvIII、Gelactin-3、Globo H、GR、IFNAR2、IFNGR1、IL6、JAK1、MLANA、RAS、SLAMF7、TDO、TGFB2、TLR8、ALK、Arginase、CCR1、CD56、CD70、FAP、GD3、IDH1、IL6R、IRAK4、MAGE-A4、MERTK、MIF、PSCA、PTGER4、SIRPA、TGFB、TGFBR1、ACPP、ADORA2B、AR、Brachyury、CA19-9、CD32、CEACAM1、Gastrin、HDAC、HPVL2、IFNAR、IFNGR、IGF1R、IGF2、IL15、IL17R、IL1B、IL7R、JAK、MAGE-A、MAGE-A1、MAGE-A6、P38、、RORC、TLR5、VEGFR2、ADORA3、ATRT、B7-H4、c-KIT、CCR7、CD11b、CD135、CD171、CD174、CDH3、CX3CR1、Gelactin-1、GM3、HLA-A2、HSP70、IL10、IL17、IL2RB、JAK3、MDA5、NKG2A、PBF、PVRIG、SPAM1、URLC10、VEGFR1、ABCB5、ADABP、ADAM17、ADP、AEG1、Alpha-lactalbumin、AMHR2、ASPH、AXL、BCL2、BTE6-LX-8b、BTE6-X-15-7、Carbohydrateantigens、CCL20、CCL3、CCNB1、CD147、CD155、CD16、CD162、CD16a、CD200、CD21、CD28、CD44、CD52、CD54、CD7、CD80、CD88、Claudin 18、cMET、COX2、CSF1、CTCFL、CXCR5、CXCR7、E1A、EIF2AK3、ERG、FGF2、FN1、GC、GM2、gPA33、HBV、Hemagglutinin、HER3、HILPDA、HLA-DR、HMW-MAA、HP59、HPV 16、HPV E6/7、HPV L1、HSP105、HSP65、HVEM、Hyaluronan、IL13RA1、IL2、IL21R、IL8、KIF20A、KIR2DL1、KIR2DL3、LXR、MAGE-A10、MAGE-C2、MammaglobinA、MAPK、MICA、MiHA、MMP-11、MVP、Myeloblastin、N-Myc、NKp46、NLRP3、NR2F6、Oncofetal Antigen、P2RX7、RhoC、SIM-2、SSTR2、SSX2、STAT1、STn、TAG72、TAMA、TFDP3、TGFBR、TSA、TYK2、Tyrosinase、VEGFA、5′Nucleotidase(Ecto 5′Nucleotidase or CD73 or NT5E or EC 3.1.3.5)、ADAM9、AIM2、B7-H6、BAFF-R、BAI1、BARD1、BOB-1、CA9、Cancertestis antigen、CB2、CBLB、CCR9、CD13、CD130、CD150、CD160、CD200R1、CD267、CD29、CD3E、CD4、CD51、CD8、CGEN-XXXX、Claudin 6、CLEC2D、COX、COX-1、CPEB4、CPEG4、CRBN、CRLF2、CSPG4、CTA、CXCL1、CXCR3、Cytosine deaminase、DCK、DKK1、DLL3、DR3、DR5、EBNA3C、EGF、EGFR5、ELVAL4、EPHA3、EPS8、EVI1、FAIM-3、FasR、FCU1、FLT3、FOLR、FOXM1、FSHR、Galectin-3、GalNAc、GARP、Gelactin-9、Gelatcin-1/3/9、GLD18、GNRHR、GP160、GP73、H3.3K27M、HAGE、HDAC2、HDAC8、HPV16 E6、HPV16 E7、HSP、Hypoxia、ICAM、ICAM7、IDO1、IFNG、IFNGR2、IGF2R、IGFBP2、IGK、IL10RA、IL12RB1、IL13、IL13R、IL13Ralpha2、IL15RA、IL17A、IL17B、IL1A、IL1R1、IL1R3、IL21、IL22R、IL27R、IL2RA、IL35、IL9R、Integrinbeta-7、IRAK1、ITGB5、Kappamyelomaantigen、KIR2DL2、Kynurenine、L1CAM、Lambda Myeloma Antigen、LAMP、LLO、LXRA、LXRB、Masreceptor、MG7、MHCI、MHCII、MIC、MOSPD2、MRP-3、MRP1、MRP3765、muGNTP01、MYB、MYBL2、NFAT、NGcGM3、Nrf2、p38 MAP Kinase(EC 2.7.11.24)、P55、PAM4、PAP、PASD1、PCDH18、PD-L2、PI3K-delta、POTE、PPT、Protein tolemerase、PTGER2、RANKL、RBL001、RNF43、ROR2、S100A9、SEREX、SLAMF1、STAT、TACSTD2、TASTD2、TDO2、TEM、thymidine kinase、Thymidylatesynthase、TIE2、TIMP3、TM4SF5、TOP1、TRBC1、TRBC2、TRIF、Tryptophan、TSHR、TWEAK、UTA2-1、VDBP、VRP、VSIG-4、XAGE1、XAGE1A、ZP1、ZP3。
本发明的术语“连接肽”是指用于连接2个多肽的氨基酸残基或包含2个或更多个通过肽键连接的氨基酸残基的多肽。这样的接头多肽是本领域众所周知的(参见,例如,Holliger等人(1993)Proc.Natl.Acad.Sci.USA 90:6444-6448;Poljak等人(1994)Structure2:1121-1123)。合适的,非免疫原性的连接肽包括例如GS,(G4S)n,(SG4)n,(G4S)n或G4(SG4)n肽接头。“n”一般是1至10,通常是2至4的整数。在某些实施方案中,所述连接肽是可裂解的连接肽,例如,可自断裂、可酶促断裂或可化学断裂的连接肽。参见WO2011/034605,将其全部按引用并入本文中。例如,但不限于,连接肽的酶断裂可以包括使用内肽酶或外肽酶。内肽酶的非限制性实例包括MMP2(基质金属蛋白酶2)、尿激酶、Lys-C、Asp-N、Arg-C、V8、Glu-C、胰凝乳蛋白酶、胰蛋白酶、胃蛋白酶、木瓜蛋白酶、凝血酶、Genenase、因子Xa、TEv(烟草蚀刻病毒半胱氨酸蛋白酶)、肠激酶、HRV C3(人鼻病毒C3蛋白酶)、ininogenase、枯草芽孢杆菌蛋白酶样前蛋白转化酶(例如,Furin(PC1)、PC2或PC3)和N-精氨酸二碱基转化酶(N-arginine dibasic convertase)。外肽酶的非限制性实例包括羧基肽酶A、羧基肽酶B、羧基肽酶D、羧基肽酶E(也称为羧基肽酶H)、羧基肽酶M、羧基肽酶N或羧基肽酶Z。在某些实施方案中,可以通过使用羟胺、N-氯代琥珀酰亚胺、N-溴代琥珀酰亚胺或溴化氰,进行化学断裂。
实施例
需要说明的是,在不冲突的情况下,本申请中的实施例仅为举例说明,不旨在对本发明造成任何方式上的限制。
实施例
实施例1基于抗体的免疫细胞因子的构建
分别合成编码抗EGFR抗体(aEGFR)、Palivizumab(Syn)、抗PD-1抗体(aPD-1)、抗PD-L1抗体(aPD-L1)、抗HER2抗体(aHER2)的重链(H)和轻链(L),以及IL-15、IL-15Ra、IL-15RaSushi的基因片段,采用标准分子生物学技术通过连接肽将IL-15融合在前述抗体重链的可变区的N端或CH1的C端,将IL-15Ra融合在前述抗体轻链的可变区N端或恒定区的C端,或者将IL-15融合在前述抗体轻链的可变区的N端或恒定区的C端,将IL-15Ra融合在前述抗体重链的可变区N端或CH1的C端,所有的序列都通过测序验证(见表1的序列)。
表1基于抗体的免疫细胞因子的序列
/>
/>
/>
注:null表示Fc无ADCC和CDC功能或者具有减弱的ADCC和CDC功能.
实施例2基于抗体的免疫细胞因子的表达、纯化和凝胶排阻层析
将实施例1构建好的含有免疫细胞因子的抗体重链和轻链的表达载体瞬时转染Expi293细胞(ThermoFisher)分别共转染,转染时重链的质粒和轻链的质粒用量为摩尔比1∶1):将40ml Expi293(3-4×106细胞/ml)接种至125ml细胞培养瓶,80ug质粒用2mlOpti-MEM(Invitrogen)稀释后加至2ml含120μl PEI(Polysciences)的Opti-MEM中,室温静置30min,将质粒-PEI mixture加至细胞培养液中125rpm,37℃,5%CO2培养。于转染后96h收集细胞培养上清,使用Protein A Resin(Genscript)纯化,SDS-PAGE检测。
将获得的Protein A resin纯化后的免疫细胞因子用GE的AKTA chromatography过柱分析,所用的层析柱为:Superdex 200Increase 10/300GL凝胶排阻层析柱。凝胶排阻层析所用的溶液为PBS缓冲液(0.010M phosphate buffer,0.0027M KCl,0.14M NaCl,pH7.4)。从图1的SDS胶图和图2的色谱图说明免疫细胞因子复合物具有相当的纯度,可通过层析柱有效分离。
实施例3基于抗体的免疫细胞因子体外活性验证
3.1 IL-15或IL-15RaSushi结合活性
包被hIL-15(SinoBiological)或IL-15RaSushi(Miltenyi)(100ng/孔)(DPBSbuffer,pH7.4)于96孔板,4℃孵育过夜;含2%脱脂奶粉的DPBST室温封闭1小时,含0.05%Tween-20的DPBS洗3次后,分别加入梯度稀释的免疫细胞因子室温孵育2h,含0.05%Tween-20的DPBS洗4-5次后,加入HRP conjugated anti-human kappa light chain(Cat.A18853,Thermo Fisher Scientific,1∶2000)二抗室温孵育2h,含0.05%Tween-20的DPBS洗4-5次后,TMB(BioLegend)显色后于OD450处读数。Prizm Graphpad软件用log(agonist)vs.response模型对数据进行非线性回归。
结果如图3所示。与positive-1(IL-15RaSushi-Fc)相比,BSIC-01(IL15-SynH(null):IL15RaSushi-SynL)和BSIC-02(IL15RaShi-SynH(null):IL15-SynL)对IL-15的亲和力相对较低(3A),说明BSIC-01(IL15-SynH(null):IL15RaSushi-SynL)与BSIC-02(IL15RaShi-SynH(null):IL15-SynL)分子内的IL-15和IL-15RaSushi形成稳定的复合物。与positive-2(IL-15-Fc)相比,BSIC-01(IL15-SynH(null):IL15RaSushi-SynL)对IL-15的亲和力较低(3B),与骨架抗体BSIC-13Syn(null)基本一致,说明BSIC-01(IL15-SynH(null):IL15RaSushi-SynL)分子内的IL-15和IL-15RaSushi形成非常稳定的复合物;BSIC-27(IL-15RaSushi-SynH(null))对IL-15RaSushi的亲和力较高。
3.2抗原结合活性
包被hEGFR-his(SinoBiological)、hHER2(ACRO)或hPDL1(100ng/孔)于96孔板,4℃孵育过夜;含2%脱脂奶粉的PBST(0.5%Tween-20in PBS)室温封闭1小时,分别加入梯度稀释的免疫细胞因子室温孵育2h,含2%脱脂奶粉的PBST洗4-5次后,加入HRP anti-humankappa light chain(Invitorgen)二抗室温孵育1h,含2%脱脂奶粉的PBST洗4-5次后,TMB显色试剂(BioLegend,Cat.421101)显色后于450nm处读数,或者QuantaBlu荧光过氧化物酶底物(Life technologies,Cat.15169)显色后于325nm和420nm处读数。Prizm Graphpad软件用specific binding model对数据进行非线性回归。结果如图7所示,基于抗体的免疫细胞因子对抗原具有较好的亲和力。
实施例4 Mo7e促增殖实验
培养Mo7e细胞(协和细胞资源中心)(培养条件:RPMI 1640培养基+10%胎牛血清+双抗+10ng/ml GMCSF(Peprotech)),实验之前台盼蓝检测细胞活率,确保Mo7e细胞活率超过95%方可进行后续的增殖实验。用RPMI 1640(10%FBS+双抗)培养基洗2次后将细胞重悬至密度为40,000/100ul,加入到96孔板(全黑色底部不透明)中,每孔50ul。配制含有免疫细胞因子的RPMI 1640(10%FBS+双抗)培养基,按照预先设定好的浓度梯度进行配制。将配制好的含免疫细胞因子的培养基加入到96孔板中,每孔50ul,每个梯度做三个复孔;完成后,轻轻震荡96孔板,让含有细胞因子的培养基与预先铺好的Mo7e细胞混合均匀,放入细胞培养箱培养,条件同上。72h后,根据厂家提供的说明书,通过MTS(Promega)显色后于OD490处读数或者通过Cell TiterGLO(Promega)显色后读取荧光值。Prizm Graphpad软件用log(agonist)vs.response模型对数据进行非线性回归。
结果如图4所示。BSIC-01(IL15-SynH(null):IL15RaSushi-SynL)促进Mo7e增殖的效果弱于positive-3(IL-15与IL-15RaSushi-Fc通过非共价相互作用形成的异二聚体)和rhIL-15,但是强于BSIC-27(图4A),提示IL-15和IL-15Rasushi通过非共价相互作用形成的分子内复合物可有效促进Mo7e细胞的增殖。BSIC-07(IL15-aEGFRH(null):IL15RaSushi-aEGFRL)和BSIC-08IL15RaSushi-aEGFRH(null):IL15-aEGFRL促进Mo7e增殖的效果类似,但是稍弱于rhIL-15。BSIC-03(IL15-aHER2H(WT):IL15RaSushi-aHER2L)和BSIC-04(IL15RaSushi-aHER2(WT):IL15-aHER2L)促进Mo7e增殖的效果强于BSIC-24(IL-15-aHER2H(WT))或IL-15-aHER2L(WT)(IL-15-aHER2H(WT)),与Palivizumab或aEGFR融合的趋势一致。
实施例5淋巴细胞激活实验
将DPBS或免疫细胞因子融合蛋白腹腔注射雌性C57BL/6小鼠(6-7周),4天后后,后眼窝采集静脉血分离PBMC,同时处死小鼠取小鼠脾脏,制备脾脏细胞单细胞悬液,用流式抗体染色后检测PBMC及脾脏中不同免疫细胞的阳性率。所用的流式抗体如下:PE rat anti-mouse CD19、APC rat anti-mouse CD45、FITC rat anti-mouse CD335、FITC rat anti-mouse CD3、PE rat anti-mouse CD4、PE rat anti-mouse CD8、TruStain FcX plus(anti-mouse CD16/32),均购自Biolegend公司。结果如图6所示,基于抗体的免疫细胞因子能够有效激活小鼠体内的CD8+T细胞和NK细胞等免疫细胞。
SEQUENCE LISTING
<110> 南通壹宸生物医药科技有限公司
<120> 一种免疫细胞因子及其制备与用途
<160> 120
<170> PatentIn version 3.5
<210> 1
<211> 1737
<212> DNA
<213> 人工序列
<400> 1
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg caggtgaccc tgcgcgagtc cggccctgca 420
ctggtgaagc ccacccagac cctgaccctg acctgcacct tctccggctt ctccctgtcc 480
acctccggca tgtccgtggg ctggatccgg cagcctcccg gcaaggccct ggagtggctg 540
gctgacatct ggtgggacga caagaaggac tacaacccct ccctgaagtc ccgcctgacc 600
atctccaagg acacctccaa gaaccaggtg gtgctgaagg tgaccaacat ggaccccgcc 660
gacaccgcca cctactactg cgcccgctca atgattacca actggtactt cgacgtgtgg 720
ggagccggta ccaccgtgac cgtgtcttcc gcctccacca agggcccatc ggtcttcccc 780
ctggcaccct cctccaagag cacctctggg ggcacagcgg ccctgggctg cctggtcaag 840
gactacttcc ccgaaccggt gacggtgtcg tggaactcag gcgccctgac cagcggcgtg 900
cacaccttcc cggctgtcct acagtcctca ggactctact ccctcagcag cgtggtgact 960
gtgccctcta gcagcttggg cacccagacc tacatctgca acgtgaatca caagcccagc 1020
aacaccaagg tggacaagaa agttgaaccc aaatcttgcg acaaaactca cacatgccca 1080
ccgtgcccag cacctccagt cgccggaccg tcagtcttcc tcttccctcc aaaacccaag 1140
gacaccctca tgatctcccg gacccctgag gtcacatgcg tggtggtgga cgtgagccac 1200
gaagaccctg aggtcaagtt caactggtac gtggacggcg tggaggtgca taatgccaag 1260
acaaagccgc gggaggagca gtacaacagc acgtaccgtg tggtcagcgt cctcaccgtc 1320
ctgcaccagg actggctgaa tggcaaggag tacaagtgca aggtctccaa caaaggcctc 1380
ccaagctcca tcgagaaaac catctccaaa gccaaagggc agccccgaga accacaggtg 1440
tacaccctgc ctccatcccg ggatgagctg accaagaacc aggtcagcct gacctgcctg 1500
gtcaaaggct tctatcccag cgacatcgcc gtggagtggg agagcaatgg gcagccggag 1560
aacaactaca agaccacgcc tcccgtgctg gactccgacg gctccttctt cctctacagc 1620
aagctcaccg tggacaagag caggtggcag caggggaacg tcttctcatg ctccgtgatg 1680
catgaggctc tgcacaacca ctacacgcag aagagcctct ccctgtctcc gggtaaa 1737
<210> 2
<211> 579
<212> PRT
<213> 人工序列
<400> 2
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gln Val Thr Leu Arg Glu Ser Gly Pro Ala Leu Val Lys Pro
130 135 140
Thr Gln Thr Leu Thr Leu Thr Cys Thr Phe Ser Gly Phe Ser Leu Ser
145 150 155 160
Thr Ser Gly Met Ser Val Gly Trp Ile Arg Gln Pro Pro Gly Lys Ala
165 170 175
Leu Glu Trp Leu Ala Asp Ile Trp Trp Asp Asp Lys Lys Asp Tyr Asn
180 185 190
Pro Ser Leu Lys Ser Arg Leu Thr Ile Ser Lys Asp Thr Ser Lys Asn
195 200 205
Gln Val Val Leu Lys Val Thr Asn Met Asp Pro Ala Asp Thr Ala Thr
210 215 220
Tyr Tyr Cys Ala Arg Ser Met Ile Thr Asn Trp Tyr Phe Asp Val Trp
225 230 235 240
Gly Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
245 250 255
Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
260 265 270
Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
275 280 285
Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro
290 295 300
Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
305 310 315 320
Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
325 330 335
His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser
340 345 350
Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala
355 360 365
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
370 375 380
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
385 390 395 400
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
405 410 415
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
420 425 430
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
435 440 445
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
450 455 460
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
465 470 475 480
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
485 490 495
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
500 505 510
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
515 520 525
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
530 535 540
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
545 550 555 560
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
565 570 575
Pro Gly Lys
<210> 3
<211> 1029
<212> DNA
<213> 人工序列
<400> 3
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gacatccaga tgacccagtc cccctccacc 420
ctgtccgcct ccgtgggcga ccgcgtgacc atcacctgca agtgccagct gtccgtgggc 480
tacatgcact ggtaccagca gaagcccggc aaggccccca agctgctgat ctacgacacc 540
tccaagctgg cctccggcgt gccctcccgc ttctccggct ccggctccgg caccgagttc 600
accctgacca tctcctccct gcagcccgac gacttcgcca cctactactg cttccagggc 660
tccggctacc ccttcacctt cggcggcggc accaagctgg agatcaaacg aactgtggct 720
gcaccatctg tcttcatctt cccgccatct gatgagcagt tgaaatctgg aactgcctct 780
gtcgtgtgcc tgctgaataa cttctatccc agagaggcca aagtacagtg gaaggtggat 840
aacgccctcc aatcgggtaa ctcccaggag agtgtcacag agcaggacag caaggacagc 900
acctacagcc tcagcagcac cctgacgctg agcaaagcag actacgagaa acacaaagtc 960
tacgcctgcg aagtcaccca tcagggcctg tcctcgcccg tcacaaagag cttcaacagg 1020
ggagagtgt 1029
<210> 4
<211> 343
<212> PRT
<213> 人工序列
<400> 4
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Thr Leu Ser Ala Ser
130 135 140
Val Gly Asp Arg Val Thr Ile Thr Cys Lys Cys Gln Leu Ser Val Gly
145 150 155 160
Tyr Met His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
165 170 175
Ile Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser
180 185 190
Gly Ser Gly Ser Gly Thr Glu Phe Thr Leu Thr Ile Ser Ser Leu Gln
195 200 205
Pro Asp Asp Phe Ala Thr Tyr Tyr Cys Phe Gln Gly Ser Gly Tyr Pro
210 215 220
Phe Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala
225 230 235 240
Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
245 250 255
Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu
260 265 270
Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser
275 280 285
Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu
290 295 300
Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val
305 310 315 320
Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys
325 330 335
Ser Phe Asn Arg Gly Glu Cys
340
<210> 5
<211> 1590
<212> DNA
<213> 人工序列
<400> 5
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
gggcaggtga ccctgcgcga gtccggccct gcactggtga agcccaccca gaccctgacc 300
ctgacctgca ccttctccgg cttctccctg tccacctccg gcatgtccgt gggctggatc 360
cggcagcctc ccggcaaggc cctggagtgg ctggctgaca tctggtggga cgacaagaag 420
gactacaacc cctccctgaa gtcccgcctg accatctcca aggacacctc caagaaccag 480
gtggtgctga aggtgaccaa catggacccc gccgacaccg ccacctacta ctgcgcccgc 540
tcaatgatta ccaactggta cttcgacgtg tggggagccg gtaccaccgt gaccgtgtct 600
tccgcctcca ccaagggccc atcggtcttc cccctggcac cctcctccaa gagcacctct 660
gggggcacag cggccctggg ctgcctggtc aaggactact tccccgaacc ggtgacggtg 720
tcgtggaact caggcgccct gaccagcggc gtgcacacct tcccggctgt cctacagtcc 780
tcaggactct actccctcag cagcgtggtg actgtgccct ctagcagctt gggcacccag 840
acctacatct gcaacgtgaa tcacaagccc agcaacacca aggtggacaa gaaagttgaa 900
cccaaatctt gcgacaaaac tcacacatgc ccaccgtgcc cagcacctcc agtcgccgga 960
ccgtcagtct tcctcttccc tccaaaaccc aaggacaccc tcatgatctc ccggacccct 1020
gaggtcacat gcgtggtggt ggacgtgagc cacgaagacc ctgaggtcaa gttcaactgg 1080
tacgtggacg gcgtggaggt gcataatgcc aagacaaagc cgcgggagga gcagtacaac 1140
agcacgtacc gtgtggtcag cgtcctcacc gtcctgcacc aggactggct gaatggcaag 1200
gagtacaagt gcaaggtctc caacaaaggc ctcccaagct ccatcgagaa aaccatctcc 1260
aaagccaaag ggcagccccg agaaccacag gtgtacaccc tgcctccatc ccgggatgag 1320
ctgaccaaga accaggtcag cctgacctgc ctggtcaaag gcttctatcc cagcgacatc 1380
gccgtggagt gggagagcaa tgggcagccg gagaacaact acaagaccac gcctcccgtg 1440
ctggactccg acggctcctt cttcctctac agcaagctca ccgtggacaa gagcaggtgg 1500
cagcagggga acgtcttctc atgctccgtg atgcatgagg ctctgcacaa ccactacacg 1560
cagaagagcc tctccctgtc tccgggtaaa 1590
<210> 6
<211> 530
<212> PRT
<213> 人工序列
<400> 6
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Gln Val Thr Leu Arg Glu Ser Gly Pro Ala Leu Val Lys Pro Thr
85 90 95
Gln Thr Leu Thr Leu Thr Cys Thr Phe Ser Gly Phe Ser Leu Ser Thr
100 105 110
Ser Gly Met Ser Val Gly Trp Ile Arg Gln Pro Pro Gly Lys Ala Leu
115 120 125
Glu Trp Leu Ala Asp Ile Trp Trp Asp Asp Lys Lys Asp Tyr Asn Pro
130 135 140
Ser Leu Lys Ser Arg Leu Thr Ile Ser Lys Asp Thr Ser Lys Asn Gln
145 150 155 160
Val Val Leu Lys Val Thr Asn Met Asp Pro Ala Asp Thr Ala Thr Tyr
165 170 175
Tyr Cys Ala Arg Ser Met Ile Thr Asn Trp Tyr Phe Asp Val Trp Gly
180 185 190
Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
195 200 205
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
210 215 220
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
225 230 235 240
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
245 250 255
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
260 265 270
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
275 280 285
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
290 295 300
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly
305 310 315 320
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
325 330 335
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
340 345 350
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
355 360 365
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
370 375 380
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
385 390 395 400
Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu
405 410 415
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
420 425 430
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
435 440 445
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
450 455 460
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
465 470 475 480
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
485 490 495
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
500 505 510
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
515 520 525
Gly Lys
530
<210> 7
<211> 882
<212> DNA
<213> 人工序列
<400> 7
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggacatcc agatgaccca gtccccctcc accctgtccg cctccgtggg cgaccgcgtg 300
accatcacct gcaagtgcca gctgtccgtg ggctacatgc actggtacca gcagaagccc 360
ggcaaggccc ccaagctgct gatctacgac acctccaagc tggcctccgg cgtgccctcc 420
cgcttctccg gctccggctc cggcaccgag ttcaccctga ccatctcctc cctgcagccc 480
gacgacttcg ccacctacta ctgcttccag ggctccggct accccttcac cttcggcggc 540
ggcaccaagc tggagatcaa acgaactgtg gctgcaccat ctgtcttcat cttcccgcca 600
tctgatgagc agttgaaatc tggaactgcc tctgtcgtgt gcctgctgaa taacttctat 660
cccagagagg ccaaagtaca gtggaaggtg gataacgccc tccaatcggg taactcccag 720
gagagtgtca cagagcagga cagcaaggac agcacctaca gcctcagcag caccctgacg 780
ctgagcaaag cagactacga gaaacacaaa gtctacgcct gcgaagtcac ccatcagggc 840
ctgtcctcgc ccgtcacaaa gagcttcaac aggggagagt gt 882
<210> 8
<211> 294
<212> PRT
<213> 人工序列
<400> 8
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Thr Leu Ser Ala Ser Val
85 90 95
Gly Asp Arg Val Thr Ile Thr Cys Lys Cys Gln Leu Ser Val Gly Tyr
100 105 110
Met His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
115 120 125
Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly
130 135 140
Ser Gly Ser Gly Thr Glu Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro
145 150 155 160
Asp Asp Phe Ala Thr Tyr Tyr Cys Phe Gln Gly Ser Gly Tyr Pro Phe
165 170 175
Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala
180 185 190
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
195 200 205
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
210 215 220
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
225 230 235 240
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
245 250 255
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
260 265 270
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
275 280 285
Phe Asn Arg Gly Glu Cys
290
<210> 9
<211> 1737
<212> DNA
<213> 人工序列
<400> 9
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gaggtgcagc tggtggagtc tggaggaggc 420
ttggtccagc ctggggggtc cctgagactc tcctgtgcag cctctgggtt caatattaag 480
gacacttaca tccactgggt ccgccaggct ccagggaagg ggctggagtg ggtcgcacgt 540
atttatccta ccaatggtta cacacgctac gcagactccg tgaagggccg attcaccatc 600
tccgcagaca cttccaagaa cacggcgtat cttcaaatga acagcctgag agccgaggac 660
acggccgtgt attactgttc gagatggggc ggtgacggct tctatgccat ggactactgg 720
ggccaaggaa ccctggtcac cgtctcctca gcctccacca agggcccatc ggtcttcccc 780
ctggcaccct cctccaagag cacctctggg ggcacagcgg ccctgggctg cctggtcaag 840
gactacttcc ccgaaccggt gacggtgtcg tggaactcag gcgccctgac cagcggcgtg 900
cacaccttcc cggctgtcct acagtcctca ggactctact ccctcagcag cgtggtgact 960
gtgccctcta gcagcttggg cacccagacc tacatctgca acgtgaatca caagcccagc 1020
aacaccaagg tggacaagaa agttgagccc aaatcttgcg acaaaactca cacatgccca 1080
ccgtgcccag cacctccagt cgccggaccg tcagtcttcc tcttccctcc aaaacccaag 1140
gacaccctca tgatctcccg gacccctgag gtcacatgcg tggtggtgga cgtgagccac 1200
gaagaccctg aggtcaagtt caactggtac gtggacggcg tggaggtgca taatgccaag 1260
acaaagccgc gggaggagca gtacaacagc acgtaccgtg tggtcagcgt cctcaccgtc 1320
ctgcaccagg actggctgaa tggcaaggag tacaagtgca aggtctccaa caaaggcctc 1380
ccaagctcca tcgagaaaac catctccaaa gccaaagggc agccccgaga accacaggtg 1440
tacaccctgc ctccatcccg ggatgagctg accaagaacc aggtcagcct gacctgcctg 1500
gtcaaaggct tctatcccag cgacatcgcc gtggagtggg agagcaatgg gcagccggag 1560
aacaactaca agaccacgcc tcccgtgctg gactccgacg gctccttctt cctctacagc 1620
aagctcaccg tggacaagag caggtggcag caggggaacg tcttctcatg ctccgtgatg 1680
catgaggctc tgcacaacca ctacacgcag aagagcctct ccctgtctcc gggtaaa 1737
<210> 10
<211> 579
<212> PRT
<213> 人工序列
<400> 10
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro
130 135 140
Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys
145 150 155 160
Asp Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu
165 170 175
Trp Val Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp
180 185 190
Ser Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr
195 200 205
Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
210 215 220
Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp
225 230 235 240
Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
245 250 255
Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
260 265 270
Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
275 280 285
Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro
290 295 300
Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
305 310 315 320
Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
325 330 335
His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser
340 345 350
Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala
355 360 365
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
370 375 380
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
385 390 395 400
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
405 410 415
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
420 425 430
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
435 440 445
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
450 455 460
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
465 470 475 480
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
485 490 495
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
500 505 510
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
515 520 525
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
530 535 540
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
545 550 555 560
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
565 570 575
Pro Gly Lys
<210> 11
<211> 885
<212> DNA
<213> 人工序列
<400> 11
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggacatcc agatgaccca gtctccatcc tccctgtctg catctgtagg agacagagtc 300
accatcactt gccgggcaag tcaggatgtg aataccgcgg tcgcatggta tcagcagaaa 360
ccagggaaag cccctaagct cctgatctat tctgcatcct tcttgtatag tggggtccca 420
tcaaggttca gtggcagtag atctgggaca gatttcactc tcaccatcag cagtctgcaa 480
cctgaagatt ttgcaactta ctactgtcaa cagcattaca ctacccctcc gacgttcggc 540
caaggtacca aggtggagat caaacgaact gtggctgcac catctgtctt catcttcccg 600
ccatctgatg agcagttgaa atctggaact gcctctgtcg tgtgcctgct gaataacttc 660
tatcccagag aggccaaagt acagtggaag gtggataacg ccctccaatc gggtaactcc 720
caggagagtg tcacagagca ggacagcaag gacagcacct acagcctcag cagcaccctg 780
acgctgagca aagcagacta cgagaaacac aaagtctacg cctgcgaagt cacccatcag 840
ggcctgtcct cgcccgtcac aaagagcttc aacaggggag agtgt 885
<210> 12
<211> 295
<212> PRT
<213> 人工序列
<400> 12
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
85 90 95
Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Asn Thr
100 105 110
Ala Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
115 120 125
Ile Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser
130 135 140
Gly Ser Arg Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln
145 150 155 160
Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr Pro
165 170 175
Pro Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala
180 185 190
Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
195 200 205
Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu
210 215 220
Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser
225 230 235 240
Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu
245 250 255
Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val
260 265 270
Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys
275 280 285
Ser Phe Asn Arg Gly Glu Cys
290 295
<210> 13
<211> 1590
<212> DNA
<213> 人工序列
<400> 13
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggaggtgc agctggtgga gtctggagga ggcttggtcc agcctggggg gtccctgaga 300
ctctcctgtg cagcctctgg gttcaatatt aaggacactt acatccactg ggtccgccag 360
gctccaggga aggggctgga gtgggtcgca cgtatttatc ctaccaatgg ttacacacgc 420
tacgcagact ccgtgaaggg ccgattcacc atctccgcag acacttccaa gaacacggcg 480
tatcttcaaa tgaacagcct gagagccgag gacacggccg tgtattactg ttcgagatgg 540
ggcggtgacg gcttctatgc catggactac tggggccaag gaaccctggt caccgtctcc 600
tcagcctcca ccaagggccc atcggtcttc cccctggcac cctcctccaa gagcacctct 660
gggggcacag cggccctggg ctgcctggtc aaggactact tccccgaacc ggtgacggtg 720
tcgtggaact caggcgccct gaccagcggc gtgcacacct tcccggctgt cctacagtcc 780
tcaggactct actccctcag cagcgtggtg actgtgccct ctagcagctt gggcacccag 840
acctacatct gcaacgtgaa tcacaagccc agcaacacca aggtggacaa gaaagttgag 900
cccaaatctt gcgacaaaac tcacacatgc ccaccgtgcc cagcacctcc agtcgccgga 960
ccgtcagtct tcctcttccc tccaaaaccc aaggacaccc tcatgatctc ccggacccct 1020
gaggtcacat gcgtggtggt ggacgtgagc cacgaagacc ctgaggtcaa gttcaactgg 1080
tacgtggacg gcgtggaggt gcataatgcc aagacaaagc cgcgggagga gcagtacaac 1140
agcacgtacc gtgtggtcag cgtcctcacc gtcctgcacc aggactggct gaatggcaag 1200
gagtacaagt gcaaggtctc caacaaaggc ctcccaagct ccatcgagaa aaccatctcc 1260
aaagccaaag ggcagccccg agaaccacag gtgtacaccc tgcctccatc ccgggatgag 1320
ctgaccaaga accaggtcag cctgacctgc ctggtcaaag gcttctatcc cagcgacatc 1380
gccgtggagt gggagagcaa tgggcagccg gagaacaact acaagaccac gcctcccgtg 1440
ctggactccg acggctcctt cttcctctac agcaagctca ccgtggacaa gagcaggtgg 1500
cagcagggga acgtcttctc atgctccgtg atgcatgagg ctctgcacaa ccactacacg 1560
cagaagagcc tctccctgtc tccgggtaaa 1590
<210> 14
<211> 530
<212> PRT
<213> 人工序列
<400> 14
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
85 90 95
Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys Asp
100 105 110
Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp
115 120 125
Val Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser
130 135 140
Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala
145 150 155 160
Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr
165 170 175
Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly
180 185 190
Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
195 200 205
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
210 215 220
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
225 230 235 240
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
245 250 255
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
260 265 270
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
275 280 285
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
290 295 300
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly
305 310 315 320
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
325 330 335
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
340 345 350
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
355 360 365
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
370 375 380
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
385 390 395 400
Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu
405 410 415
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
420 425 430
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
435 440 445
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
450 455 460
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
465 470 475 480
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
485 490 495
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
500 505 510
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
515 520 525
Gly Lys
530
<210> 15
<211> 1032
<212> DNA
<213> 人工序列
<400> 15
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gacatccaga tgacccagtc tccatcctcc 420
ctgtctgcat ctgtaggaga cagagtcacc atcacttgcc gggcaagtca ggatgtgaat 480
accgcggtcg catggtatca gcagaaacca gggaaagccc ctaagctcct gatctattct 540
gcatccttct tgtatagtgg ggtcccatca aggttcagtg gcagtagatc tgggacagat 600
ttcactctca ccatcagcag tctgcaacct gaagattttg caacttacta ctgtcaacag 660
cattacacta cccctccgac gttcggccaa ggtaccaagg tggagatcaa acgaactgtg 720
gctgcaccat ctgtcttcat cttcccgcca tctgatgagc agttgaaatc tggaactgcc 780
tctgtcgtgt gcctgctgaa taacttctat cccagagagg ccaaagtaca gtggaaggtg 840
gataacgccc tccaatcggg taactcccag gagagtgtca cagagcagga cagcaaggac 900
agcacctaca gcctcagcag caccctgacg ctgagcaaag cagactacga gaaacacaaa 960
gtctacgcct gcgaagtcac ccatcagggc ctgtcctcgc ccgtcacaaa gagcttcaac 1020
aggggagagt gt 1032
<210> 16
<211> 344
<212> PRT
<213> 人工序列
<400> 16
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser
130 135 140
Val Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Asn
145 150 155 160
Thr Ala Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu
165 170 175
Leu Ile Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe
180 185 190
Ser Gly Ser Arg Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu
195 200 205
Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr
210 215 220
Pro Pro Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val
225 230 235 240
Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys
245 250 255
Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
260 265 270
Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn
275 280 285
Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser
290 295 300
Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
305 310 315 320
Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr
325 330 335
Lys Ser Phe Asn Arg Gly Glu Cys
340
<210> 17
<211> 642
<212> DNA
<213> 人工序列
<400> 17
gacatccaga tgacccagtc tccatcctcc ctgtctgcat ctgtaggaga cagagtcacc 60
atcacttgcc gggcaagtca ggatgtgaat accgcggtcg catggtatca gcagaaacca 120
gggaaagccc ctaagctcct gatctattct gcatccttct tgtatagtgg ggtcccatca 180
aggttcagtg gcagtagatc tgggacagat ttcactctca ccatcagcag tctgcaacct 240
gaagattttg caacttacta ctgtcaacag cattacacta cccctccgac gttcggccaa 300
ggtaccaagg tggagatcaa acgaactgtg gctgcaccat ctgtcttcat cttcccgcca 360
tctgatgagc agttgaaatc tggaactgcc tctgtcgtgt gcctgctgaa taacttctat 420
cccagagagg ccaaagtaca gtggaaggtg gataacgccc tccaatcggg taactcccag 480
gagagtgtca cagagcagga cagcaaggac agcacctaca gcctcagcag caccctgacg 540
ctgagcaaag cagactacga gaaacacaaa gtctacgcct gcgaagtcac ccatcagggc 600
ctgtcctcgc ccgtcacaaa gagcttcaac aggggagagt gt 642
<210> 18
<211> 450
<212> PRT
<213> 人工序列
<400> 18
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys Asp Thr
20 25 30
Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gln
100 105 110
Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val
115 120 125
Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala
130 135 140
Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser
145 150 155 160
Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
165 170 175
Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro
180 185 190
Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
195 200 205
Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
210 215 220
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
225 230 235 240
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
245 250 255
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
260 265 270
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
275 280 285
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
290 295 300
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
305 310 315 320
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
325 330 335
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
340 345 350
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
355 360 365
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
370 375 380
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
385 390 395 400
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
405 410 415
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
420 425 430
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
435 440 445
Gly Lys
450
<210> 19
<211> 642
<212> DNA
<213> 人工序列
<400> 19
gacatccaga tgacccagtc tccatcctcc ctgtctgcat ctgtaggaga cagagtcacc 60
atcacttgcc gggcaagtca ggatgtgaat accgcggtcg catggtatca gcagaaacca 120
gggaaagccc ctaagctcct gatctattct gcatccttct tgtatagtgg ggtcccatca 180
aggttcagtg gcagtagatc tgggacagat ttcactctca ccatcagcag tctgcaacct 240
gaagattttg caacttacta ctgtcaacag cattacacta cccctccgac gttcggccaa 300
ggtaccaagg tggagatcaa acgaactgtg gctgcaccat ctgtcttcat cttcccgcca 360
tctgatgagc agttgaaatc tggaactgcc tctgtcgtgt gcctgctgaa taacttctat 420
cccagagagg ccaaagtaca gtggaaggtg gataacgccc tccaatcggg taactcccag 480
gagagtgtca cagagcagga cagcaaggac agcacctaca gcctcagcag caccctgacg 540
ctgagcaaag cagactacga gaaacacaaa gtctacgcct gcgaagtcac ccatcagggc 600
ctgtcctcgc ccgtcacaaa gagcttcaac aggggagagt gt 642
<210> 20
<211> 214
<212> PRT
<213> 人工序列
<400> 20
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Asn Thr Ala
20 25 30
Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Arg Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr Pro Pro
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
100 105 110
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
115 120 125
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
130 135 140
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
145 150 155 160
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
165 170 175
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
180 185 190
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
195 200 205
Phe Asn Arg Gly Glu Cys
210
<210> 21
<211> 1737
<212> DNA
<213> 人工序列
<400> 21
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg caggtgcagc tgcaggagag cggccccggc 420
ctggtgaagc ccagcgagac cctgagcctg acctgcaccg tgagcggcgg cagcgtgagc 480
agcggcgact actactggac ctggatccgc cagagccccg gcaagggcct ggagtggatc 540
ggccacatct actacagcgg caacaccaac tacaacccca gcctgaagag ccgcctgacc 600
atcagcatcg acaccagcaa gacccagttc agcctgaagc tgagcagcgt gaccgccgcc 660
gacaccgcca tctactactg cgtgcgcgac cgcgtgaccg gcgccttcga catctggggc 720
cagggcacca tggtgactgt gtctagcgcc tccaccaagg gcccatcggt cttccccctg 780
gcaccctcct ccaagagcac ctctgggggc acagcggccc tgggctgcct ggtcaaggac 840
tacttccccg aaccggtgac ggtgtcgtgg aactcaggcg ccctgaccag cggcgtgcac 900
accttcccgg ctgtcctaca gtcctcagga ctctactccc tcagcagcgt ggtgactgtg 960
ccctctagca gcttgggcac ccagacctac atctgcaacg tgaatcacaa gcccagcaac 1020
accaaggtgg acaagaaagt tgaacccaaa tcttgcgaca aaactcacac atgcccaccg 1080
tgcccagcac ctgaactcct ggggggaccg tcagtcttcc tcttcccccc aaaacccaag 1140
gacaccctca tgatctcccg gacccctgag gtcacatgcg tggtggtgga cgtgagccac 1200
gaagaccctg aggtcaagtt caactggtac gtggacggcg tggaggtgca taatgccaag 1260
acaaagccgc gggaggagca gtacaacagc acgtaccgtg tggtcagcgt cctcaccgtc 1320
ctgcaccagg actggctgaa tggcaaggag tacaagtgca aggtctccaa caaagccctc 1380
ccagccccca tagagaaaac catctccaaa gccaaagggc agccccgaga accacaggtg 1440
tacaccctgc ccccatcccg ggaggagatg accaagaacc aggtcagcct gacctgcctg 1500
gtcaaaggct tctatcccag cgacatcgcc gtggagtggg agagcaatgg gcagccggag 1560
aacaactaca agaccacgcc tcccgtgctg gactccgacg gctccttctt cctctacagc 1620
aagctcaccg tggacaagag caggtggcag caggggaacg tcttctcatg ctccgtgatg 1680
catgaggctc tgcacaacca ctacacgcag aagagcctct ccctgtctcc gggtaaa 1737
<210> 22
<211> 579
<212> PRT
<213> 人工序列
<400> 22
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro
130 135 140
Ser Glu Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser
145 150 155 160
Ser Gly Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly
165 170 175
Leu Glu Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn
180 185 190
Pro Ser Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr
195 200 205
Gln Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile
210 215 220
Tyr Tyr Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly
225 230 235 240
Gln Gly Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
245 250 255
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
260 265 270
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
275 280 285
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
290 295 300
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
305 310 315 320
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
325 330 335
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
340 345 350
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly
355 360 365
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
370 375 380
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
385 390 395 400
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
405 410 415
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
420 425 430
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
435 440 445
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile
450 455 460
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
465 470 475 480
Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser
485 490 495
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
500 505 510
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
515 520 525
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
530 535 540
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
545 550 555 560
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
565 570 575
Pro Gly Lys
<210> 23
<211> 885
<212> DNA
<213> 人工序列
<400> 23
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggacatcc agatgaccca gagccccagc agcctgagcg ccagcgtggg cgaccgcgtg 300
accatcacct gccaggccag ccaggacatc agcaactacc tgaactggta ccagcagaag 360
cccggcaagg cccccaagct gctgatctac gacgccagca acctggagac cggcgtgccc 420
agccgcttca gcggcagcgg cagcggcacc gacttcacct tcaccatcag cagcctgcag 480
cccgaggaca tcgccaccta cttctgccag cacttcgacc acctgcccct ggccttcggc 540
ggcggcacca aggtggagat caagcgcaca gtggcagccc ccagcgtctt catttttccc 600
ccttccgatg aacagctgaa gtccggcact gcttctgtgg tctgtctgct gaacaatttc 660
tatcccagag aggccaaggt gcagtggaaa gtggacaacg ctctgcagtc cggcaacagc 720
caggagagtg tgaccgaaca ggatagtaag gacagcacat attctctgtc tagtaccctg 780
acactgagta aggcagatta cgagaagcac aaagtgtatg cctgcgaagt cactcatcag 840
ggactgtcaa gccccgtgac caagagcttc aaccggggcg agtgt 885
<210> 24
<211> 295
<212> PRT
<213> 人工序列
<400> 24
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
85 90 95
Gly Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn
100 105 110
Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
115 120 125
Ile Tyr Asp Ala Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser
130 135 140
Gly Ser Gly Ser Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln
145 150 155 160
Pro Glu Asp Ile Ala Thr Tyr Phe Cys Gln His Phe Asp His Leu Pro
165 170 175
Leu Ala Phe Gly Gly Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala
180 185 190
Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
195 200 205
Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu
210 215 220
Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser
225 230 235 240
Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu
245 250 255
Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val
260 265 270
Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys
275 280 285
Ser Phe Asn Arg Gly Glu Cys
290 295
<210> 25
<211> 1590
<212> DNA
<213> 人工序列
<400> 25
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
gggcaggtgc agctgcagga gagcggcccc ggcctggtga agcccagcga gaccctgagc 300
ctgacctgca ccgtgagcgg cggcagcgtg agcagcggcg actactactg gacctggatc 360
cgccagagcc ccggcaaggg cctggagtgg atcggccaca tctactacag cggcaacacc 420
aactacaacc ccagcctgaa gagccgcctg accatcagca tcgacaccag caagacccag 480
ttcagcctga agctgagcag cgtgaccgcc gccgacaccg ccatctacta ctgcgtgcgc 540
gaccgcgtga ccggcgcctt cgacatctgg ggccagggca ccatggtgac tgtgtctagc 600
gcctccacca agggcccatc ggtcttcccc ctggcaccct cctccaagag cacctctggg 660
ggcacagcgg ccctgggctg cctggtcaag gactacttcc ccgaaccggt gacggtgtcg 720
tggaactcag gcgccctgac cagcggcgtg cacaccttcc cggctgtcct acagtcctca 780
ggactctact ccctcagcag cgtggtgact gtgccctcta gcagcttggg cacccagacc 840
tacatctgca acgtgaatca caagcccagc aacaccaagg tggacaagaa agttgaaccc 900
aaatcttgcg acaaaactca cacatgccca ccgtgcccag cacctgaact cctgggggga 960
ccgtcagtct tcctcttccc cccaaaaccc aaggacaccc tcatgatctc ccggacccct 1020
gaggtcacat gcgtggtggt ggacgtgagc cacgaagacc ctgaggtcaa gttcaactgg 1080
tacgtggacg gcgtggaggt gcataatgcc aagacaaagc cgcgggagga gcagtacaac 1140
agcacgtacc gtgtggtcag cgtcctcacc gtcctgcacc aggactggct gaatggcaag 1200
gagtacaagt gcaaggtctc caacaaagcc ctcccagccc ccatagagaa aaccatctcc 1260
aaagccaaag ggcagccccg agaaccacag gtgtacaccc tgcccccatc ccgggaggag 1320
atgaccaaga accaggtcag cctgacctgc ctggtcaaag gcttctatcc cagcgacatc 1380
gccgtggagt gggagagcaa tgggcagccg gagaacaact acaagaccac gcctcccgtg 1440
ctggactccg acggctcctt cttcctctac agcaagctca ccgtggacaa gagcaggtgg 1500
cagcagggga acgtcttctc atgctccgtg atgcatgagg ctctgcacaa ccactacacg 1560
cagaagagcc tctccctgtc tccgggtaaa 1590
<210> 26
<211> 530
<212> PRT
<213> 人工序列
<400> 26
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser
85 90 95
Glu Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser
100 105 110
Gly Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu
115 120 125
Glu Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro
130 135 140
Ser Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln
145 150 155 160
Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr
165 170 175
Tyr Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln
180 185 190
Gly Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val
195 200 205
Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala
210 215 220
Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser
225 230 235 240
Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
245 250 255
Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro
260 265 270
Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
275 280 285
Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
290 295 300
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
305 310 315 320
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
325 330 335
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
340 345 350
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
355 360 365
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
370 375 380
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
385 390 395 400
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
405 410 415
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
420 425 430
Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu
435 440 445
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
450 455 460
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
465 470 475 480
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
485 490 495
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
500 505 510
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
515 520 525
Gly Lys
530
<210> 27
<211> 1032
<212> DNA
<213> 人工序列
<400> 27
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gacatccaga tgacccagag ccccagcagc 420
ctgagcgcca gcgtgggcga ccgcgtgacc atcacctgcc aggccagcca ggacatcagc 480
aactacctga actggtacca gcagaagccc ggcaaggccc ccaagctgct gatctacgac 540
gccagcaacc tggagaccgg cgtgcccagc cgcttcagcg gcagcggcag cggcaccgac 600
ttcaccttca ccatcagcag cctgcagccc gaggacatcg ccacctactt ctgccagcac 660
ttcgaccacc tgcccctggc cttcggcggc ggcaccaagg tggagatcaa gcgcacagtg 720
gcagccccca gcgtcttcat ttttccccct tccgatgaac agctgaagtc cggcactgct 780
tctgtggtct gtctgctgaa caatttctat cccagagagg ccaaggtgca gtggaaagtg 840
gacaacgctc tgcagtccgg caacagccag gagagtgtga ccgaacagga tagtaaggac 900
agcacatatt ctctgtctag taccctgaca ctgagtaagg cagattacga gaagcacaaa 960
gtgtatgcct gcgaagtcac tcatcaggga ctgtcaagcc ccgtgaccaa gagcttcaac 1020
cggggcgagt gt 1032
<210> 28
<211> 344
<212> PRT
<213> 人工序列
<400> 28
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser
130 135 140
Val Gly Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser
145 150 155 160
Asn Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu
165 170 175
Leu Ile Tyr Asp Ala Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe
180 185 190
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu
195 200 205
Gln Pro Glu Asp Ile Ala Thr Tyr Phe Cys Gln His Phe Asp His Leu
210 215 220
Pro Leu Ala Phe Gly Gly Gly Thr Lys Val Glu Ile Lys Arg Thr Val
225 230 235 240
Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys
245 250 255
Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
260 265 270
Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn
275 280 285
Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser
290 295 300
Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
305 310 315 320
Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr
325 330 335
Lys Ser Phe Asn Arg Gly Glu Cys
340
<210> 29
<211> 1347
<212> DNA
<213> 人工序列
<400> 29
caggtgcagc tgcaggagag cggccccggc ctggtgaagc ccagcgagac cctgagcctg 60
acctgcaccg tgagcggcgg cagcgtgagc agcggcgact actactggac ctggatccgc 120
cagagccccg gcaagggcct ggagtggatc ggccacatct actacagcgg caacaccaac 180
tacaacccca gcctgaagag ccgcctgacc atcagcatcg acaccagcaa gacccagttc 240
agcctgaagc tgagcagcgt gaccgccgcc gacaccgcca tctactactg cgtgcgcgac 300
cgcgtgaccg gcgccttcga catctggggc cagggcacca tggtgactgt gtctagcgcc 360
tccaccaagg gcccatcggt cttccccctg gcaccctcct ccaagagcac ctctgggggc 420
acagcggccc tgggctgcct ggtcaaggac tacttccccg aaccggtgac ggtgtcgtgg 480
aactcaggcg ccctgaccag cggcgtgcac accttcccgg ctgtcctaca gtcctcagga 540
ctctactccc tcagcagcgt ggtgactgtg ccctctagca gcttgggcac ccagacctac 600
atctgcaacg tgaatcacaa gcccagcaac accaaggtgg acaagaaagt tgaacccaaa 660
tcttgcgaca aaactcacac atgcccaccg tgcccagcac ctgaactcct ggggggaccg 720
tcagtcttcc tcttcccccc aaaacccaag gacaccctca tgatctcccg gacccctgag 780
gtcacatgcg tggtggtgga cgtgagccac gaagaccctg aggtcaagtt caactggtac 840
gtggacggcg tggaggtgca taatgccaag acaaagccgc gggaggagca gtacaacagc 900
acgtaccgtg tggtcagcgt cctcaccgtc ctgcaccagg actggctgaa tggcaaggag 960
tacaagtgca aggtctccaa caaagccctc ccagccccca tagagaaaac catctccaaa 1020
gccaaagggc agccccgaga accacaggtg tacaccctgc ccccatcccg ggaggagatg 1080
accaagaacc aggtcagcct gacctgcctg gtcaaaggct tctatcccag cgacatcgcc 1140
gtggagtggg agagcaatgg gcagccggag aacaactaca agaccacgcc tcccgtgctg 1200
gactccgacg gctccttctt cctctacagc aagctcaccg tggacaagag caggtggcag 1260
caggggaacg tcttctcatg ctccgtgatg catgaggctc tgcacaacca ctacacgcag 1320
aagagcctct ccctgtctcc gggtaaa 1347
<210> 30
<211> 449
<212> PRT
<213> 人工序列
<400> 30
Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu
1 5 10 15
Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly
20 25 30
Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu
35 40 45
Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe
65 70 75 80
Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr
85 90 95
Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly
100 105 110
Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
210 215 220
Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Pro
225 230 235 240
Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser
245 250 255
Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp
260 265 270
Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn
275 280 285
Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val
290 295 300
Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu
305 310 315 320
Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu Lys
325 330 335
Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr
340 345 350
Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr
355 360 365
Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu
370 375 380
Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu
385 390 395 400
Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys
405 410 415
Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
420 425 430
Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly
435 440 445
Lys
<210> 31
<211> 1344
<212> DNA
<213> 人工序列
<400> 31
caggtgcagc tgcaggagag cggccccggc ctggtgaagc ccagcgagac cctgagcctg 60
acctgcaccg tgagcggcgg cagcgtgagc agcggcgact actactggac ctggatccgc 120
cagagccccg gcaagggcct ggagtggatc ggccacatct actacagcgg caacaccaac 180
tacaacccca gcctgaagag ccgcctgacc atcagcatcg acaccagcaa gacccagttc 240
agcctgaagc tgagcagcgt gaccgccgcc gacaccgcca tctactactg cgtgcgcgac 300
cgcgtgaccg gcgccttcga catctggggc cagggcacca tggtgactgt gtctagcgcc 360
tccaccaagg gcccatcggt cttccccctg gcaccctcct ccaagagcac ctctgggggc 420
acagcggccc tgggctgcct ggtcaaggac tacttccccg aaccggtgac ggtgtcgtgg 480
aactcaggcg ccctgaccag cggcgtgcac accttcccgg ctgtcctaca gtcctcagga 540
ctctactccc tcagcagcgt ggtgactgtg ccctctagca gcttgggcac ccagacctac 600
atctgcaacg tgaatcacaa gcccagcaac accaaggtgg acaagaaagt tgaacccaaa 660
tcttgcgaca aaactcacac atgcccaccg tgcccagcac ctccagtcgc cggaccgtca 720
gtcttcctct tccctccaaa acccaaggac accctcatga tctcccggac ccctgaggtc 780
acatgcgtgg tggtggacgt gagccacgaa gaccctgagg tcaagttcaa ctggtacgtg 840
gacggcgtgg aggtgcataa tgccaagaca aagccgcggg aggagcagta caacagcacg 900
taccgtgtgg tcagcgtcct caccgtcctg caccaggact ggctgaatgg caaggagtac 960
aagtgcaagg tctccaacaa aggcctccca agctccatcg agaaaaccat ctccaaagcc 1020
aaagggcagc cccgagaacc acaggtgtac accctgcctc catcccggga tgagctgacc 1080
aagaaccagg tcagcctgac ctgcctggtc aaaggcttct atcccagcga catcgccgtg 1140
gagtgggaga gcaatgggca gccggagaac aactacaaga ccacgcctcc cgtgctggac 1200
tccgacggct ccttcttcct ctacagcaag ctcaccgtgg acaagagcag gtggcagcag 1260
gggaacgtct tctcatgctc cgtgatgcat gaggctctgc acaaccacta cacgcagaag 1320
agcctctccc tgtctccggg taaa 1344
<210> 32
<211> 448
<212> PRT
<213> 人工序列
<400> 32
Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu
1 5 10 15
Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly
20 25 30
Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu
35 40 45
Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe
65 70 75 80
Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr
85 90 95
Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly
100 105 110
Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
210 215 220
Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser
225 230 235 240
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
245 250 255
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
260 265 270
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
275 280 285
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
290 295 300
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
305 310 315 320
Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr
325 330 335
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
340 345 350
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys
355 360 365
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
370 375 380
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
385 390 395 400
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
405 410 415
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
420 425 430
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
<210> 33
<211> 642
<212> DNA
<213> 人工序列
<400> 33
gacatccaga tgacccagag ccccagcagc ctgagcgcca gcgtgggcga ccgcgtgacc 60
atcacctgcc aggccagcca ggacatcagc aactacctga actggtacca gcagaagccc 120
ggcaaggccc ccaagctgct gatctacgac gccagcaacc tggagaccgg cgtgcccagc 180
cgcttcagcg gcagcggcag cggcaccgac ttcaccttca ccatcagcag cctgcagccc 240
gaggacatcg ccacctactt ctgccagcac ttcgaccacc tgcccctggc cttcggcggc 300
ggcaccaagg tggagatcaa gcgcacagtg gcagccccca gcgtcttcat ttttccccct 360
tccgatgaac agctgaagtc cggcactgct tctgtggtct gtctgctgaa caatttctat 420
cccagagagg ccaaggtgca gtggaaagtg gacaacgctc tgcagtccgg caacagccag 480
gagagtgtga ccgaacagga tagtaaggac agcacatatt ctctgtctag taccctgaca 540
ctgagtaagg cagattacga gaagcacaaa gtgtatgcct gcgaagtcac tcatcaggga 600
ctgtcaagcc ccgtgaccaa gagcttcaac cggggcgagt gt 642
<210> 34
<211> 214
<212> PRT
<213> 人工序列
<400> 34
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Asp Ala Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Ile Ala Thr Tyr Phe Cys Gln His Phe Asp His Leu Pro Leu
85 90 95
Ala Phe Gly Gly Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
100 105 110
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
115 120 125
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
130 135 140
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
145 150 155 160
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
165 170 175
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
180 185 190
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
195 200 205
Phe Asn Arg Gly Glu Cys
210
<210> 35
<211> 1341
<212> DNA
<213> 人工序列
<400> 35
gaggtgcagc tggtggagag cggaggtgga ctagtacagc ctggtggcag cctacgactg 60
agttgcgccg ccagcggctt caccttcagc gacagctgga tccactgggt gcgccaggcc 120
cccggcaagg gcctggagtg ggtggcctgg atcagcccct acggcggcag cacctactac 180
gccgacagcg tgaagggccg cttcaccatc agcgccgaca ccagcaagaa caccgcctac 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcgc ccgccgccac 300
tggcccggcg gcttcgacta ctggggccag ggcaccctgg tgaccgtgag cagcgcctcc 360
accaagggcc catcggtctt ccccctggca ccctcctcca agagcacctc tgggggcaca 420
gcggccctgg gctgcctggt caaggactac ttccccgaac cggtgacggt gtcgtggaac 480
tcaggcgccc tgaccagcgg cgtgcacacc ttcccggctg tcctacagtc ctcaggactc 540
tactccctca gcagcgtggt gactgtgccc tctagcagct tgggcaccca gacctacatc 600
tgcaacgtga atcacaagcc cagcaacacc aaggtggaca agaaagttga acccaaatct 660
tgcgacaaaa ctcacacatg cccaccgtgc ccagcacctc cagtcgccgg accgtcagtc 720
ttcctcttcc ctccaaaacc caaggacacc ctcatgatct cccggacccc tgaggtcaca 780
tgcgtggtgg tggacgtgag ccacgaagac cctgaggtca agttcaactg gtacgtggac 840
ggcgtggagg tgcataatgc caagacaaag ccgcgggagg agcagtacaa cagcacgtac 900
cgtgtggtca gcgtcctcac cgtcctgcac caggactggc tgaatggcaa ggagtacaag 960
tgcaaggtct ccaacaaagg cctcccaagc tccatcgaga aaaccatctc caaagccaaa 1020
gggcagcccc gagaaccaca ggtgtacacc ctgcctccat cccgggatga gctgaccaag 1080
aaccaggtca gcctgacctg cctggtcaaa ggcttctatc ccagcgacat cgccgtggag 1140
tgggagagca atgggcagcc ggagaacaac tacaagacca cgcctcccgt gctggactcc 1200
gacggctcct tcttcctcta cagcaagctc accgtggaca agagcaggtg gcagcagggg 1260
aacgtcttct catgctccgt gatgcatgag gctctgcaca accactacac gcagaagagc 1320
ctctccctgt ctccgggtaa a 1341
<210> 36
<211> 447
<212> PRT
<213> 人工序列
<400> 36
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asp Ser
20 25 30
Trp Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Trp Ile Ser Pro Tyr Gly Gly Ser Thr Tyr Tyr Ala Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Arg His Trp Pro Gly Gly Phe Asp Tyr Trp Gly Gln Gly Thr
100 105 110
Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro
115 120 125
Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly
130 135 140
Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn
145 150 155 160
Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln
165 170 175
Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser
180 185 190
Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser
195 200 205
Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr
210 215 220
His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser Val
225 230 235 240
Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr
245 250 255
Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu
260 265 270
Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys
275 280 285
Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser
290 295 300
Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys
305 310 315 320
Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile
325 330 335
Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro
340 345 350
Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu
355 360 365
Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn
370 375 380
Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
385 390 395 400
Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg
405 410 415
Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu
420 425 430
His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
<210> 37
<211> 642
<212> DNA
<213> 人工序列
<400> 37
gacatccaga tgacccagag ccccagcagc ctgagcgcca gcgtgggcga ccgcgtgacc 60
atcacctgcc gcgccagcca ggacgtgagc accgccgtgg cctggtacca gcagaagccc 120
ggcaaggccc ccaagctgct gatctacagc gccagcttcc tgtacagcgg cgtgcccagc 180
cgcttcagcg gcagcggcag cggcaccgac ttcaccctga ccatcagcag cctgcagccc 240
gaggacttcg ccacctacta ctgccagcag tacctgtacc accccgccac cttcggccag 300
ggcaccaagg tggagatcaa gcgcacagtg gcagccccca gcgtcttcat ttttccccct 360
tccgatgaac agctgaagtc cggcactgct tctgtggtct gtctgctgaa caatttctat 420
cccagagagg ccaaggtgca gtggaaagtg gacaacgctc tgcagtccgg caacagccag 480
gagagtgtga ccgaacagga tagtaaggac agcacatatt ctctgtctag taccctgaca 540
ctgagtaagg cagattacga gaagcacaaa gtgtatgcct gcgaagtcac tcatcaggga 600
ctgtcaagcc ccgtgaccaa gagcttcaac cggggcgagt gt 642
<210> 38
<211> 214
<212> PRT
<213> 人工序列
<400> 38
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Ser Thr Ala
20 25 30
Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Tyr Leu Tyr His Pro Ala
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
100 105 110
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
115 120 125
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
130 135 140
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
145 150 155 160
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
165 170 175
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
180 185 190
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
195 200 205
Phe Asn Arg Gly Glu Cys
210
<210> 39
<211> 1734
<212> DNA
<213> 人工序列
<400> 39
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg caggtgcagc tgcaggagag cggccccggc 420
ctggtgaagc ccagcgagac cctgagcctg acctgcaccg tgagcggcgg cagcgtgagc 480
agcggcgact actactggac ctggatccgc cagagccccg gcaagggcct ggagtggatc 540
ggccacatct actacagcgg caacaccaac tacaacccca gcctgaagag ccgcctgacc 600
atcagcatcg acaccagcaa gacccagttc agcctgaagc tgagcagcgt gaccgccgcc 660
gacaccgcca tctactactg cgtgcgcgac cgcgtgaccg gcgccttcga catctggggc 720
cagggcacca tggtgactgt gtctagcgcc tccaccaagg gcccatcggt cttccccctg 780
gcaccctcct ccaagagcac ctctgggggc acagcggccc tgggctgcct ggtcaaggac 840
tacttccccg aaccggtgac ggtgtcgtgg aactcaggcg ccctgaccag cggcgtgcac 900
accttcccgg ctgtcctaca gtcctcagga ctctactccc tcagcagcgt ggtgactgtg 960
ccctctagca gcttgggcac ccagacctac atctgcaacg tgaatcacaa gcccagcaac 1020
accaaggtgg acaagaaagt tgaacccaaa tcttgcgaca aaactcacac atgcccaccg 1080
tgcccagcac ctccagtcgc cggaccgtca gtcttcctct tccctccaaa acccaaggac 1140
accctcatga tctcccggac ccctgaggtc acatgcgtgg tggtggacgt gagccacgaa 1200
gaccctgagg tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca 1260
aagccgcggg aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg 1320
caccaggact ggctgaatgg caaggagtac aagtgcaagg tctccaacaa aggcctccca 1380
agctccatcg agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac 1440
accctgcctc catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc 1500
aaaggcttct atcccagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac 1560
aactacaaga ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag 1620
ctcaccgtgg acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat 1680
gaggctctgc acaaccacta cacgcagaag agcctctccc tgtctccggg taaa 1734
<210> 40
<211> 578
<212> PRT
<213> 人工序列
<400> 40
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro
130 135 140
Ser Glu Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser
145 150 155 160
Ser Gly Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly
165 170 175
Leu Glu Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn
180 185 190
Pro Ser Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr
195 200 205
Gln Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile
210 215 220
Tyr Tyr Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly
225 230 235 240
Gln Gly Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
245 250 255
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
260 265 270
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
275 280 285
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
290 295 300
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
305 310 315 320
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
325 330 335
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
340 345 350
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly
355 360 365
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
370 375 380
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
385 390 395 400
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
405 410 415
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
420 425 430
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
435 440 445
Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu
450 455 460
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
465 470 475 480
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
485 490 495
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
500 505 510
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
515 520 525
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
530 535 540
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
545 550 555 560
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
565 570 575
Gly Lys
<210> 41
<211> 1569
<212> DNA
<213> 人工序列
<400> 41
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
gggcaggtgc agctggtgga gagtggaggt ggcgtggtac agcccggccg cagcctgcgc 300
ctggactgca aggccagcgg catcaccttc agcaacagcg gcatgcactg ggtgcgccag 360
gcccccggca agggcctgga gtgggtggcc gtgatctggt acgacggcag caagcgctac 420
tacgccgaca gcgtgaaggg ccgcttcacc atcagccgcg acaacagcaa gaacaccctg 480
ttcctgcaga tgaacagcct gcgcgccgag gacaccgccg tgtactactg cgccaccaac 540
gacgactact ggggccaggg caccctggtg accgtgagca gcgcctccac caagggccca 600
tcggtcttcc ccctggcacc ctcctccaag agcacctctg ggggcacagc ggccctgggc 660
tgcctggtca aggactactt ccccgaaccg gtgacggtgt cgtggaactc aggcgccctg 720
accagcggcg tgcacacctt cccggctgtc ctacagtcct caggactcta ctccctcagc 780
agcgtggtga ctgtgccctc tagcagcttg ggcacccaga cctacatctg caacgtgaat 840
cacaagccca gcaacaccaa ggtggacaag aaagttgaac ccaaatcttg cgacaaaact 900
cacacatgcc caccgtgccc agcacctcca gtcgccggac cgtcagtctt cctcttccct 960
ccaaaaccca aggacaccct catgatctcc cggacccctg aggtcacatg cgtggtggtg 1020
gacgtgagcc acgaagaccc tgaggtcaag ttcaactggt acgtggacgg cgtggaggtg 1080
cataatgcca agacaaagcc gcgggaggag cagtacaaca gcacgtaccg tgtggtcagc 1140
gtcctcaccg tcctgcacca ggactggctg aatggcaagg agtacaagtg caaggtctcc 1200
aacaaaggcc tcccaagctc catcgagaaa accatctcca aagccaaagg gcagccccga 1260
gaaccacagg tgtacaccct gcctccatcc cgggatgagc tgaccaagaa ccaggtcagc 1320
ctgacctgcc tggtcaaagg cttctatccc agcgacatcg ccgtggagtg ggagagcaat 1380
gggcagccgg agaacaacta caagaccacg cctcccgtgc tggactccga cggctccttc 1440
ttcctctaca gcaagctcac cgtggacaag agcaggtggc agcaggggaa cgtcttctca 1500
tgctccgtga tgcatgaggc tctgcacaac cactacacgc agaagagcct ctccctgtct 1560
ccgggtaaa 1569
<210> 42
<211> 523
<212> PRT
<213> 人工序列
<400> 42
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly
85 90 95
Arg Ser Leu Arg Leu Asp Cys Lys Ala Ser Gly Ile Thr Phe Ser Asn
100 105 110
Ser Gly Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp
115 120 125
Val Ala Val Ile Trp Tyr Asp Gly Ser Lys Arg Tyr Tyr Ala Asp Ser
130 135 140
Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu
145 150 155 160
Phe Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr
165 170 175
Cys Ala Thr Asn Asp Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr Val
180 185 190
Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser
195 200 205
Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys
210 215 220
Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu
225 230 235 240
Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu
245 250 255
Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr
260 265 270
Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val
275 280 285
Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
290 295 300
Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser Val Phe Leu Phe Pro
305 310 315 320
Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr
325 330 335
Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn
340 345 350
Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg
355 360 365
Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val
370 375 380
Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser
385 390 395 400
Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys
405 410 415
Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp
420 425 430
Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe
435 440 445
Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu
450 455 460
Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe
465 470 475 480
Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly
485 490 495
Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr
500 505 510
Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
515 520
<210> 43
<211> 1587
<212> DNA
<213> 人工序列
<400> 43
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
gggcaggtgc agctgcagga gagcggcccc ggcctggtga agcccagcga gaccctgagc 300
ctgacctgca ccgtgagcgg cggcagcgtg agcagcggcg actactactg gacctggatc 360
cgccagagcc ccggcaaggg cctggagtgg atcggccaca tctactacag cggcaacacc 420
aactacaacc ccagcctgaa gagccgcctg accatcagca tcgacaccag caagacccag 480
ttcagcctga agctgagcag cgtgaccgcc gccgacaccg ccatctacta ctgcgtgcgc 540
gaccgcgtga ccggcgcctt cgacatctgg ggccagggca ccatggtgac tgtgtctagc 600
gcctccacca agggcccatc ggtcttcccc ctggcaccct cctccaagag cacctctggg 660
ggcacagcgg ccctgggctg cctggtcaag gactacttcc ccgaaccggt gacggtgtcg 720
tggaactcag gcgccctgac cagcggcgtg cacaccttcc cggctgtcct acagtcctca 780
ggactctact ccctcagcag cgtggtgact gtgccctcta gcagcttggg cacccagacc 840
tacatctgca acgtgaatca caagcccagc aacaccaagg tggacaagaa agttgaaccc 900
aaatcttgcg acaaaactca cacatgccca ccgtgcccag cacctccagt cgccggaccg 960
tcagtcttcc tcttccctcc aaaacccaag gacaccctca tgatctcccg gacccctgag 1020
gtcacatgcg tggtggtgga cgtgagccac gaagaccctg aggtcaagtt caactggtac 1080
gtggacggcg tggaggtgca taatgccaag acaaagccgc gggaggagca gtacaacagc 1140
acgtaccgtg tggtcagcgt cctcaccgtc ctgcaccagg actggctgaa tggcaaggag 1200
tacaagtgca aggtctccaa caaaggcctc ccaagctcca tcgagaaaac catctccaaa 1260
gccaaagggc agccccgaga accacaggtg tacaccctgc ctccatcccg ggatgagctg 1320
accaagaacc aggtcagcct gacctgcctg gtcaaaggct tctatcccag cgacatcgcc 1380
gtggagtggg agagcaatgg gcagccggag aacaactaca agaccacgcc tcccgtgctg 1440
gactccgacg gctccttctt cctctacagc aagctcaccg tggacaagag caggtggcag 1500
caggggaacg tcttctcatg ctccgtgatg catgaggctc tgcacaacca ctacacgcag 1560
aagagcctct ccctgtctcc gggtaaa 1587
<210> 44
<211> 529
<212> PRT
<213> 人工序列
<400> 44
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser
85 90 95
Glu Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser
100 105 110
Gly Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu
115 120 125
Glu Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro
130 135 140
Ser Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln
145 150 155 160
Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr
165 170 175
Tyr Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln
180 185 190
Gly Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val
195 200 205
Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala
210 215 220
Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser
225 230 235 240
Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
245 250 255
Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro
260 265 270
Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
275 280 285
Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
290 295 300
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro
305 310 315 320
Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser
325 330 335
Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp
340 345 350
Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn
355 360 365
Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val
370 375 380
Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu
385 390 395 400
Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys
405 410 415
Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr
420 425 430
Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr
435 440 445
Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu
450 455 460
Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu
465 470 475 480
Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys
485 490 495
Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
500 505 510
Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly
515 520 525
Lys
<210> 45
<211> 1731
<212> DNA
<213> 人工序列
<400> 45
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gaggtgcagc tggtggagag cggaggtgga 420
ctagtacagc ctggtggcag cctacgactg agttgcgccg ccagcggctt caccttcagc 480
gacagctgga tccactgggt gcgccaggcc cccggcaagg gcctggagtg ggtggcctgg 540
atcagcccct acggcggcag cacctactac gccgacagcg tgaagggccg cttcaccatc 600
agcgccgaca ccagcaagaa caccgcctac ctgcagatga acagcctgcg cgccgaggac 660
accgccgtgt actactgcgc ccgccgccac tggcccggcg gcttcgacta ctggggccag 720
ggcaccctgg tgaccgtgag cagcgcctcc accaagggcc catcggtctt ccccctggca 780
ccctcctcca agagcacctc tgggggcaca gcggccctgg gctgcctggt caaggactac 840
ttccccgaac cggtgacggt gtcgtggaac tcaggcgccc tgaccagcgg cgtgcacacc 900
ttcccggctg tcctacagtc ctcaggactc tactccctca gcagcgtggt gactgtgccc 960
tctagcagct tgggcaccca gacctacatc tgcaacgtga atcacaagcc cagcaacacc 1020
aaggtggaca agaaagttga acccaaatct tgcgacaaaa ctcacacatg cccaccgtgc 1080
ccagcacctc cagtcgccgg accgtcagtc ttcctcttcc ctccaaaacc caaggacacc 1140
ctcatgatct cccggacccc tgaggtcaca tgcgtggtgg tggacgtgag ccacgaagac 1200
cctgaggtca agttcaactg gtacgtggac ggcgtggagg tgcataatgc caagacaaag 1260
ccgcgggagg agcagtacaa cagcacgtac cgtgtggtca gcgtcctcac cgtcctgcac 1320
caggactggc tgaatggcaa ggagtacaag tgcaaggtct ccaacaaagg cctcccaagc 1380
tccatcgaga aaaccatctc caaagccaaa gggcagcccc gagaaccaca ggtgtacacc 1440
ctgcctccat cccgggatga gctgaccaag aaccaggtca gcctgacctg cctggtcaaa 1500
ggcttctatc ccagcgacat cgccgtggag tgggagagca atgggcagcc ggagaacaac 1560
tacaagacca cgcctcccgt gctggactcc gacggctcct tcttcctcta cagcaagctc 1620
accgtggaca agagcaggtg gcagcagggg aacgtcttct catgctccgt gatgcatgag 1680
gctctgcaca accactacac gcagaagagc ctctccctgt ctccgggtaa a 1731
<210> 46
<211> 577
<212> PRT
<213> 人工序列
<400> 46
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro
130 135 140
Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser
145 150 155 160
Asp Ser Trp Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu
165 170 175
Trp Val Ala Trp Ile Ser Pro Tyr Gly Gly Ser Thr Tyr Tyr Ala Asp
180 185 190
Ser Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr
195 200 205
Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
210 215 220
Tyr Cys Ala Arg Arg His Trp Pro Gly Gly Phe Asp Tyr Trp Gly Gln
225 230 235 240
Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val
245 250 255
Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala
260 265 270
Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser
275 280 285
Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
290 295 300
Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro
305 310 315 320
Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
325 330 335
Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
340 345 350
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro
355 360 365
Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser
370 375 380
Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp
385 390 395 400
Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn
405 410 415
Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val
420 425 430
Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu
435 440 445
Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys
450 455 460
Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr
465 470 475 480
Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr
485 490 495
Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu
500 505 510
Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu
515 520 525
Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys
530 535 540
Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
545 550 555 560
Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly
565 570 575
Lys
<210> 47
<211> 885
<212> DNA
<213> 人工序列
<400> 47
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggacatcc agatgaccca gagccccagc agcctgagcg ccagcgtggg cgaccgcgtg 300
accatcacct gccgcgccag ccaggacgtg agcaccgccg tggcctggta ccagcagaag 360
cccggcaagg cccccaagct gctgatctac agcgccagct tcctgtacag cggcgtgccc 420
agccgcttca gcggcagcgg cagcggcacc gacttcaccc tgaccatcag cagcctgcag 480
cccgaggact tcgccaccta ctactgccag cagtacctgt accaccccgc caccttcggc 540
cagggcacca aggtggagat caagcgcaca gtggcagccc ccagcgtctt catttttccc 600
ccttccgatg aacagctgaa gtccggcact gcttctgtgg tctgtctgct gaacaatttc 660
tatcccagag aggccaaggt gcagtggaaa gtggacaacg ctctgcagtc cggcaacagc 720
caggagagtg tgaccgaaca ggatagtaag gacagcacat attctctgtc tagtaccctg 780
acactgagta aggcagatta cgagaagcac aaagtgtatg cctgcgaagt cactcatcag 840
ggactgtcaa gccccgtgac caagagcttc aaccggggcg agtgt 885
<210> 48
<211> 295
<212> PRT
<213> 人工序列
<400> 48
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
85 90 95
Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Ser Thr
100 105 110
Ala Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
115 120 125
Ile Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser
130 135 140
Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln
145 150 155 160
Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Tyr Leu Tyr His Pro
165 170 175
Ala Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala
180 185 190
Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
195 200 205
Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu
210 215 220
Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser
225 230 235 240
Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu
245 250 255
Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val
260 265 270
Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys
275 280 285
Ser Phe Asn Arg Gly Glu Cys
290 295
<210> 49
<211> 1584
<212> DNA
<213> 人工序列
<400> 49
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggaggtgc agctggtgga gagcggaggt ggactagtac agcctggtgg cagcctacga 300
ctgagttgcg ccgccagcgg cttcaccttc agcgacagct ggatccactg ggtgcgccag 360
gcccccggca agggcctgga gtgggtggcc tggatcagcc cctacggcgg cagcacctac 420
tacgccgaca gcgtgaaggg ccgcttcacc atcagcgccg acaccagcaa gaacaccgcc 480
tacctgcaga tgaacagcct gcgcgccgag gacaccgccg tgtactactg cgcccgccgc 540
cactggcccg gcggcttcga ctactggggc cagggcaccc tggtgaccgt gagcagcgcc 600
tccaccaagg gcccatcggt cttccccctg gcaccctcct ccaagagcac ctctgggggc 660
acagcggccc tgggctgcct ggtcaaggac tacttccccg aaccggtgac ggtgtcgtgg 720
aactcaggcg ccctgaccag cggcgtgcac accttcccgg ctgtcctaca gtcctcagga 780
ctctactccc tcagcagcgt ggtgactgtg ccctctagca gcttgggcac ccagacctac 840
atctgcaacg tgaatcacaa gcccagcaac accaaggtgg acaagaaagt tgaacccaaa 900
tcttgcgaca aaactcacac atgcccaccg tgcccagcac ctccagtcgc cggaccgtca 960
gtcttcctct tccctccaaa acccaaggac accctcatga tctcccggac ccctgaggtc 1020
acatgcgtgg tggtggacgt gagccacgaa gaccctgagg tcaagttcaa ctggtacgtg 1080
gacggcgtgg aggtgcataa tgccaagaca aagccgcggg aggagcagta caacagcacg 1140
taccgtgtgg tcagcgtcct caccgtcctg caccaggact ggctgaatgg caaggagtac 1200
aagtgcaagg tctccaacaa aggcctccca agctccatcg agaaaaccat ctccaaagcc 1260
aaagggcagc cccgagaacc acaggtgtac accctgcctc catcccggga tgagctgacc 1320
aagaaccagg tcagcctgac ctgcctggtc aaaggcttct atcccagcga catcgccgtg 1380
gagtgggaga gcaatgggca gccggagaac aactacaaga ccacgcctcc cgtgctggac 1440
tccgacggct ccttcttcct ctacagcaag ctcaccgtgg acaagagcag gtggcagcag 1500
gggaacgtct tctcatgctc cgtgatgcat gaggctctgc acaaccacta cacgcagaag 1560
agcctctccc tgtctccggg taaa 1584
<210> 50
<211> 528
<212> PRT
<213> 人工序列
<400> 50
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
85 90 95
Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asp
100 105 110
Ser Trp Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp
115 120 125
Val Ala Trp Ile Ser Pro Tyr Gly Gly Ser Thr Tyr Tyr Ala Asp Ser
130 135 140
Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala
145 150 155 160
Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr
165 170 175
Cys Ala Arg Arg His Trp Pro Gly Gly Phe Asp Tyr Trp Gly Gln Gly
180 185 190
Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
195 200 205
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
210 215 220
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
225 230 235 240
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
245 250 255
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
260 265 270
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
275 280 285
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
290 295 300
Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser
305 310 315 320
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
325 330 335
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
340 345 350
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
355 360 365
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
370 375 380
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
385 390 395 400
Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr
405 410 415
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
420 425 430
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys
435 440 445
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
450 455 460
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
465 470 475 480
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
485 490 495
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
500 505 510
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
515 520 525
<210> 51
<211> 1032
<212> DNA
<213> 人工序列
<400> 51
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gacatccaga tgacccagag ccccagcagc 420
ctgagcgcca gcgtgggcga ccgcgtgacc atcacctgcc gcgccagcca ggacgtgagc 480
accgccgtgg cctggtacca gcagaagccc ggcaaggccc ccaagctgct gatctacagc 540
gccagcttcc tgtacagcgg cgtgcccagc cgcttcagcg gcagcggcag cggcaccgac 600
ttcaccctga ccatcagcag cctgcagccc gaggacttcg ccacctacta ctgccagcag 660
tacctgtacc accccgccac cttcggccag ggcaccaagg tggagatcaa gcgcacagtg 720
gcagccccca gcgtcttcat ttttccccct tccgatgaac agctgaagtc cggcactgct 780
tctgtggtct gtctgctgaa caatttctat cccagagagg ccaaggtgca gtggaaagtg 840
gacaacgctc tgcagtccgg caacagccag gagagtgtga ccgaacagga tagtaaggac 900
agcacatatt ctctgtctag taccctgaca ctgagtaagg cagattacga gaagcacaaa 960
gtgtatgcct gcgaagtcac tcatcaggga ctgtcaagcc ccgtgaccaa gagcttcaac 1020
cggggcgagt gt 1032
<210> 52
<211> 344
<212> PRT
<213> 人工序列
<400> 52
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser
130 135 140
Val Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Ser
145 150 155 160
Thr Ala Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu
165 170 175
Leu Ile Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe
180 185 190
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu
195 200 205
Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Tyr Leu Tyr His
210 215 220
Pro Ala Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val
225 230 235 240
Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys
245 250 255
Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
260 265 270
Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn
275 280 285
Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser
290 295 300
Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
305 310 315 320
Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr
325 330 335
Lys Ser Phe Asn Arg Gly Glu Cys
340
<210> 53
<211> 1350
<212> DNA
<213> 人工序列
<400> 53
caggtgaccc tgcgcgagtc cggccctgca ctggtgaagc ccacccagac cctgaccctg 60
acctgcacct tctccggctt ctccctgtcc acctccggca tgtccgtggg ctggatccgg 120
cagcctcccg gcaaggccct ggagtggctg gctgacatct ggtgggacga caagaaggac 180
tacaacccct ccctgaagtc ccgcctgacc atctccaagg acacctccaa gaaccaggtg 240
gtgctgaagg tgaccaacat ggaccccgcc gacaccgcca cctactactg cgcccgctca 300
atgattacca actggtactt cgacgtgtgg ggagccggta ccaccgtgac cgtgtcttcc 360
gcctccacca agggcccatc ggtcttcccc ctggcaccct cctccaagag cacctctggg 420
ggcacagcgg ccctgggctg cctggtcaag gactacttcc ccgaaccggt gacggtgtcg 480
tggaactcag gcgccctgac cagcggcgtg cacaccttcc cggctgtcct acagtcctca 540
ggactctact ccctcagcag cgtggtgact gtgccctcta gcagcttggg cacccagacc 600
tacatctgca acgtgaatca caagcccagc aacaccaagg tggacaagaa agttgaaccc 660
aaatcttgcg acaaaactca cacatgccca ccgtgcccag cacctgaact cctgggggga 720
ccgtcagtct tcctcttccc cccaaaaccc aaggacaccc tcatgatctc ccggacccct 780
gaggtcacat gcgtggtggt ggacgtgagc cacgaagacc ctgaggtcaa gttcaactgg 840
tacgtggacg gcgtggaggt gcataatgcc aagacaaagc cgcgggagga gcagtacaac 900
agcacgtacc gtgtggtcag cgtcctcacc gtcctgcacc aggactggct gaatggcaag 960
gagtacaagt gcaaggtctc caacaaagcc ctcccagccc ccatcgagaa aaccatctcc 1020
aaagccaaag ggcagccccg agaaccacag gtgtacaccc tgcccccatc ccgggatgag 1080
ctgaccaaga accaggtcag cctgacctgc ctggtcaaag gcttctatcc cagcgacatc 1140
gccgtggagt gggagagcaa tgggcagccg gagaacaact acaagaccac gcctcccgtg 1200
ctggactccg acggctcctt cttcctctac agcaagctca ccgtggacaa gagcaggtgg 1260
cagcagggga acgtcttctc atgctccgtg atgcatgagg ctctgcacaa ccactacacg 1320
cagaagagcc tctccctgtc tccgggtaaa 1350
<210> 54
<211> 450
<212> PRT
<213> 人工序列
<400> 54
Gln Val Thr Leu Arg Glu Ser Gly Pro Ala Leu Val Lys Pro Thr Gln
1 5 10 15
Thr Leu Thr Leu Thr Cys Thr Phe Ser Gly Phe Ser Leu Ser Thr Ser
20 25 30
Gly Met Ser Val Gly Trp Ile Arg Gln Pro Pro Gly Lys Ala Leu Glu
35 40 45
Trp Leu Ala Asp Ile Trp Trp Asp Asp Lys Lys Asp Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Lys Asp Thr Ser Lys Asn Gln Val
65 70 75 80
Val Leu Lys Val Thr Asn Met Asp Pro Ala Asp Thr Ala Thr Tyr Tyr
85 90 95
Cys Ala Arg Ser Met Ile Thr Asn Trp Tyr Phe Asp Val Trp Gly Ala
100 105 110
Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val
115 120 125
Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala
130 135 140
Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser
145 150 155 160
Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
165 170 175
Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro
180 185 190
Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
195 200 205
Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
210 215 220
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
225 230 235 240
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
245 250 255
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
260 265 270
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
275 280 285
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
290 295 300
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
305 310 315 320
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
325 330 335
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
340 345 350
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
355 360 365
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
370 375 380
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
385 390 395 400
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
405 410 415
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
420 425 430
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
435 440 445
Gly Lys
450
<210> 55
<211> 639
<212> DNA
<213> 人工序列
<400> 55
gacatccaga tgacccagtc cccctccacc ctgtccgcct ccgtgggcga ccgcgtgacc 60
atcacctgca agtgccagct gtccgtgggc tacatgcact ggtaccagca gaagcccggc 120
aaggccccca agctgctgat ctacgacacc tccaagctgg cctccggcgt gccctcccgc 180
ttctccggct ccggctccgg caccgagttc accctgacca tctcctccct gcagcccgac 240
gacttcgcca cctactactg cttccagggc tccggctacc ccttcacctt cggcggcggc 300
accaagctgg agatcaaacg aactgtggct gcaccatctg tcttcatctt cccgccatct 360
gatgagcagt tgaaatctgg aactgcctct gtcgtgtgcc tgctgaataa cttctatccc 420
agagaggcca aagtacagtg gaaggtggat aacgccctcc aatcgggtaa ctcccaggag 480
agtgtcacag agcaggacag caaggacagc acctacagcc tcagcagcac cctgacgctg 540
agcaaagcag actacgagaa acacaaagtc tacgcctgcg aagtcaccca tcagggcctg 600
tcctcgcccg tcacaaagag cttcaacagg ggagagtgt 639
<210> 56
<211> 213
<212> PRT
<213> 人工序列
<400> 56
Asp Ile Gln Met Thr Gln Ser Pro Ser Thr Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Lys Cys Gln Leu Ser Val Gly Tyr Met
20 25 30
His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr
35 40 45
Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser
50 55 60
Gly Ser Gly Thr Glu Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Asp
65 70 75 80
Asp Phe Ala Thr Tyr Tyr Cys Phe Gln Gly Ser Gly Tyr Pro Phe Thr
85 90 95
Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro
100 105 110
Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly Thr
115 120 125
Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys
130 135 140
Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln Glu
145 150 155 160
Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser
165 170 175
Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala
180 185 190
Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe
195 200 205
Asn Arg Gly Glu Cys
210
<210> 57
<211> 1032
<212> DNA
<213> 人工序列
<400> 57
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gagatcgtgc tgacccagag ccccgccacc 420
ctgagcctga gccccggcga gcgcgccacc ctgagctgcc gcgccagcca gagcgtgagc 480
agctacctgg cctggtacca gcagaagcca ggacaggctc cacgactgct aatctatgac 540
gccagcaacc gcgccaccgg catccccgcc cgcttcagcg gcagcggcag cggcaccgac 600
ttcaccctga ccatcagcag cctggagccc gaggacttcg ccgtgtacta ctgccagcag 660
agcagcaact ggccccgcac cttcggccag ggcaccaagg tggagatcaa gcgcacagtg 720
gcagccccca gcgtcttcat ttttccccct tccgatgaac agctgaagtc cggcactgct 780
tctgtggtct gtctgctgaa caatttctat cccagagagg ccaaggtgca gtggaaagtg 840
gacaacgctc tgcagtccgg caacagccag gagagtgtga ccgaacagga tagtaaggac 900
agcacatatt ctctgtctag taccctgaca ctgagtaagg cagattacga gaagcacaaa 960
gtgtatgcct gcgaagtcac tcatcaggga ctgtcaagcc ccgtgaccaa gagcttcaac 1020
cggggcgagt gt 1032
<210> 58
<211> 344
<212> PRT
<213> 人工序列
<400> 58
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Glu Ile Val Leu Thr Gln Ser Pro Ala Thr Leu Ser Leu Ser
130 135 140
Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gln Ser Val Ser
145 150 155 160
Ser Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu
165 170 175
Leu Ile Tyr Asp Ala Ser Asn Arg Ala Thr Gly Ile Pro Ala Arg Phe
180 185 190
Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu
195 200 205
Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gln Gln Ser Ser Asn Trp
210 215 220
Pro Arg Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val
225 230 235 240
Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys
245 250 255
Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
260 265 270
Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn
275 280 285
Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser
290 295 300
Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
305 310 315 320
Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr
325 330 335
Lys Ser Phe Asn Arg Gly Glu Cys
340
<210> 59
<211> 1740
<212> DNA
<213> 人工序列
<400> 59
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gaagtacagc tgcttgagag tggaggaggt 420
ttggtacagc ccggcggatc cctccgcctg tcctgtgcgg ctagtggctt tacattctca 480
tcctatatca tgatgtgggt aagacaggcc ccaggaaagg gcctggagtg ggttagttct 540
atctacccct caggcgggat taccttctac gcagatactg tgaagggcag gtttaccata 600
tcccgagaca acagtaagaa taccctttac cttcaaatga actcccttcg ggccgaggac 660
actgcggtgt actattgcgc tcgcattaag cttggcaccg tgacaaccgt gaactattgg 720
ggtcaaggca cgctggtgac tgtctcttcc gcctccacca agggcccatc ggtcttcccc 780
ctggcaccct cctccaagag cacctctggg ggcacagcgg ccctgggctg cctggtcaag 840
gactacttcc ccgaaccggt gacggtgtcg tggaactcag gcgccctgac cagcggcgtg 900
cacaccttcc cggctgtcct acagtcctca ggactctact ccctcagcag cgtggtgact 960
gtgccctcta gcagcttggg cacccagacc tacatctgca acgtgaatca caagcccagc 1020
aacaccaagg tggacaagaa agttgaaccc aaatcttgcg acaaaactca cacatgccca 1080
ccgtgcccag cacctgaact cctgggggga ccgtcagtct tcctcttccc cccaaaaccc 1140
aaggacaccc tcatgatctc ccggacccct gaggtcacat gcgtggtggt ggacgtgagc 1200
cacgaagacc ctgaggtcaa gttcaactgg tacgtggacg gcgtggaggt gcataatgcc 1260
aagacaaagc cgcgggagga gcagtacaac agcacgtacc gtgtggtcag cgtcctcacc 1320
gtcctgcacc aggactggct gaatggcaag gagtacaagt gcaaggtctc caacaaagcc 1380
ctcccagccc ccatagagaa aaccatctcc aaagccaaag ggcagccccg agaaccacag 1440
gtgtacaccc tgcccccatc ccgggatgag ctgaccaaga accaggtcag cctgacctgc 1500
ctggtcaaag gcttctatcc cagcgacatc gccgtggagt gggagagcaa tgggcagccg 1560
gagaacaact acaagaccac gcctcccgtg ctggactccg acggctcctt cttcctctac 1620
agcaagctca ccgtggacaa gagcaggtgg cagcagggga acgtcttctc atgctccgtg 1680
atgcatgagg ctctgcacaa ccactacacg cagaagagcc tctccctgtc tccgggtaaa 1740
<210> 60
<211> 580
<212> PRT
<213> 人工序列
<400> 60
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Glu Val Gln Leu Leu Glu Ser Gly Gly Gly Leu Val Gln Pro
130 135 140
Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser
145 150 155 160
Ser Tyr Ile Met Met Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu
165 170 175
Trp Val Ser Ser Ile Tyr Pro Ser Gly Gly Ile Thr Phe Tyr Ala Asp
180 185 190
Thr Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr
195 200 205
Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
210 215 220
Tyr Cys Ala Arg Ile Lys Leu Gly Thr Val Thr Thr Val Asn Tyr Trp
225 230 235 240
Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
245 250 255
Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
260 265 270
Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
275 280 285
Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro
290 295 300
Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
305 310 315 320
Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
325 330 335
His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser
340 345 350
Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu
355 360 365
Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
370 375 380
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
385 390 395 400
His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu
405 410 415
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr
420 425 430
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
435 440 445
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro
450 455 460
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
465 470 475 480
Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val
485 490 495
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
500 505 510
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
515 520 525
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
530 535 540
Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val
545 550 555 560
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
565 570 575
Ser Pro Gly Lys
580
<210> 61
<211> 891
<212> DNA
<213> 人工序列
<400> 61
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
gggcaatccg cactgactca accagccagc gttagcggct cccctggtca atctatcacc 300
atcagctgta ccgggaccag ctcagacgtt ggcggttaca actacgtcag ctggtaccag 360
cagcacccgg gtaaagctcc aaagctgatg atttatgatg tgtctaatcg accttctggt 420
gtatctaacc gattttcagg ctctaaaagt ggaaatactg cttccctcac gatctcaggg 480
ctgcaagccg aagacgaagc cgattattat tgttctagct atacatccag cagcacccgc 540
gtgtttggaa cgggaaccaa ggtcacggtt ctgggacagc ccaaagccaa tcctaccgtc 600
actctgttcc cacccagtag tgaggagctg caggcaaata aggctaccct ggtctgtctt 660
atatccgatt tctatcccgg ggcagtcaca gtcgcttgga aggcagatgg ctctccagtg 720
aaggccggcg tcgaaacaac taaaccttcc aagcagtcta ataacaagta cgctgcttct 780
tcttaccttt cacttactcc tgaacaatgg aagagccaca ggagttactc ttgtcaggta 840
acccacgagg ggtccactgt ggagaaaacc gtcgctccca cagagtgttc t 891
<210> 62
<211> 297
<212> PRT
<213> 人工序列
<400> 62
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Gln Ser Ala Leu Thr Gln Pro Ala Ser Val Ser Gly Ser Pro Gly
85 90 95
Gln Ser Ile Thr Ile Ser Cys Thr Gly Thr Ser Ser Asp Val Gly Gly
100 105 110
Tyr Asn Tyr Val Ser Trp Tyr Gln Gln His Pro Gly Lys Ala Pro Lys
115 120 125
Leu Met Ile Tyr Asp Val Ser Asn Arg Pro Ser Gly Val Ser Asn Arg
130 135 140
Phe Ser Gly Ser Lys Ser Gly Asn Thr Ala Ser Leu Thr Ile Ser Gly
145 150 155 160
Leu Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Ser Ser Tyr Thr Ser
165 170 175
Ser Ser Thr Arg Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly
180 185 190
Gln Pro Lys Ala Asn Pro Thr Val Thr Leu Phe Pro Pro Ser Ser Glu
195 200 205
Glu Leu Gln Ala Asn Lys Ala Thr Leu Val Cys Leu Ile Ser Asp Phe
210 215 220
Tyr Pro Gly Ala Val Thr Val Ala Trp Lys Ala Asp Gly Ser Pro Val
225 230 235 240
Lys Ala Gly Val Glu Thr Thr Lys Pro Ser Lys Gln Ser Asn Asn Lys
245 250 255
Tyr Ala Ala Ser Ser Tyr Leu Ser Leu Thr Pro Glu Gln Trp Lys Ser
260 265 270
His Arg Ser Tyr Ser Cys Gln Val Thr His Glu Gly Ser Thr Val Glu
275 280 285
Lys Thr Val Ala Pro Thr Glu Cys Ser
290 295
<210> 63
<211> 1593
<212> DNA
<213> 人工序列
<400> 63
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggaagtac agctgcttga gagtggagga ggtttggtac agcccggcgg atccctccgc 300
ctgtcctgtg cggctagtgg ctttacattc tcatcctata tcatgatgtg ggtaagacag 360
gccccaggaa agggcctgga gtgggttagt tctatctacc cctcaggcgg gattaccttc 420
tacgcagata ctgtgaaggg caggtttacc atatcccgag acaacagtaa gaataccctt 480
taccttcaaa tgaactccct tcgggccgag gacactgcgg tgtactattg cgctcgcatt 540
aagcttggca ccgtgacaac cgtgaactat tggggtcaag gcacgctggt gactgtctct 600
tccgcctcca ccaagggccc atcggtcttc cccctggcac cctcctccaa gagcacctct 660
gggggcacag cggccctggg ctgcctggtc aaggactact tccccgaacc ggtgacggtg 720
tcgtggaact caggcgccct gaccagcggc gtgcacacct tcccggctgt cctacagtcc 780
tcaggactct actccctcag cagcgtggtg actgtgccct ctagcagctt gggcacccag 840
acctacatct gcaacgtgaa tcacaagccc agcaacacca aggtggacaa gaaagttgaa 900
cccaaatctt gcgacaaaac tcacacatgc ccaccgtgcc cagcacctga actcctgggg 960
ggaccgtcag tcttcctctt ccccccaaaa cccaaggaca ccctcatgat ctcccggacc 1020
cctgaggtca catgcgtggt ggtggacgtg agccacgaag accctgaggt caagttcaac 1080
tggtacgtgg acggcgtgga ggtgcataat gccaagacaa agccgcggga ggagcagtac 1140
aacagcacgt accgtgtggt cagcgtcctc accgtcctgc accaggactg gctgaatggc 1200
aaggagtaca agtgcaaggt ctccaacaaa gccctcccag cccccataga gaaaaccatc 1260
tccaaagcca aagggcagcc ccgagaacca caggtgtaca ccctgccccc atcccgggat 1320
gagctgacca agaaccaggt cagcctgacc tgcctggtca aaggcttcta tcccagcgac 1380
atcgccgtgg agtgggagag caatgggcag ccggagaaca actacaagac cacgcctccc 1440
gtgctggact ccgacggctc cttcttcctc tacagcaagc tcaccgtgga caagagcagg 1500
tggcagcagg ggaacgtctt ctcatgctcc gtgatgcatg aggctctgca caaccactac 1560
acgcagaaga gcctctccct gtctccgggt aaa 1593
<210> 64
<211> 531
<212> PRT
<213> 人工序列
<400> 64
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Glu Val Gln Leu Leu Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
85 90 95
Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser
100 105 110
Tyr Ile Met Met Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp
115 120 125
Val Ser Ser Ile Tyr Pro Ser Gly Gly Ile Thr Phe Tyr Ala Asp Thr
130 135 140
Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu
145 150 155 160
Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr
165 170 175
Cys Ala Arg Ile Lys Leu Gly Thr Val Thr Thr Val Asn Tyr Trp Gly
180 185 190
Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
195 200 205
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
210 215 220
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
225 230 235 240
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
245 250 255
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
260 265 270
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
275 280 285
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
290 295 300
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly
305 310 315 320
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
325 330 335
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
340 345 350
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
355 360 365
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
370 375 380
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
385 390 395 400
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile
405 410 415
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
420 425 430
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
435 440 445
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
450 455 460
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
465 470 475 480
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
485 490 495
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
500 505 510
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
515 520 525
Pro Gly Lys
530
<210> 65
<211> 1038
<212> DNA
<213> 人工序列
<400> 65
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg caatccgcac tgactcaacc agccagcgtt 420
agcggctccc ctggtcaatc tatcaccatc agctgtaccg ggaccagctc agacgttggc 480
ggttacaact acgtcagctg gtaccagcag cacccgggta aagctccaaa gctgatgatt 540
tatgatgtgt ctaatcgacc ttctggtgta tctaaccgat tttcaggctc taaaagtgga 600
aatactgctt ccctcacgat ctcagggctg caagccgaag acgaagccga ttattattgt 660
tctagctata catccagcag cacccgcgtg tttggaacgg gaaccaaggt cacggttctg 720
ggacagccca aagccaatcc taccgtcact ctgttcccac ccagtagtga ggagctgcag 780
gcaaataagg ctaccctggt ctgtcttata tccgatttct atcccggggc agtcacagtc 840
gcttggaagg cagatggctc tccagtgaag gccggcgtcg aaacaactaa accttccaag 900
cagtctaata acaagtacgc tgcttcttct tacctttcac ttactcctga acaatggaag 960
agccacagga gttactcttg tcaggtaacc cacgaggggt ccactgtgga gaaaaccgtc 1020
gctcccacag agtgttct 1038
<210> 66
<211> 346
<212> PRT
<213> 人工序列
<400> 66
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gln Ser Ala Leu Thr Gln Pro Ala Ser Val Ser Gly Ser Pro
130 135 140
Gly Gln Ser Ile Thr Ile Ser Cys Thr Gly Thr Ser Ser Asp Val Gly
145 150 155 160
Gly Tyr Asn Tyr Val Ser Trp Tyr Gln Gln His Pro Gly Lys Ala Pro
165 170 175
Lys Leu Met Ile Tyr Asp Val Ser Asn Arg Pro Ser Gly Val Ser Asn
180 185 190
Arg Phe Ser Gly Ser Lys Ser Gly Asn Thr Ala Ser Leu Thr Ile Ser
195 200 205
Gly Leu Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Ser Ser Tyr Thr
210 215 220
Ser Ser Ser Thr Arg Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu
225 230 235 240
Gly Gln Pro Lys Ala Asn Pro Thr Val Thr Leu Phe Pro Pro Ser Ser
245 250 255
Glu Glu Leu Gln Ala Asn Lys Ala Thr Leu Val Cys Leu Ile Ser Asp
260 265 270
Phe Tyr Pro Gly Ala Val Thr Val Ala Trp Lys Ala Asp Gly Ser Pro
275 280 285
Val Lys Ala Gly Val Glu Thr Thr Lys Pro Ser Lys Gln Ser Asn Asn
290 295 300
Lys Tyr Ala Ala Ser Ser Tyr Leu Ser Leu Thr Pro Glu Gln Trp Lys
305 310 315 320
Ser His Arg Ser Tyr Ser Cys Gln Val Thr His Glu Gly Ser Thr Val
325 330 335
Glu Lys Thr Val Ala Pro Thr Glu Cys Ser
340 345
<210> 67
<211> 1740
<212> DNA
<213> 人工序列
<400> 67
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg caggtgaccc tgcgcgagtc cggccctgca 420
ctggtgaagc ccacccagac cctgaccctg acctgcacct tctccggctt ctccctgtcc 480
acctccggca tgtccgtggg ctggatccgg cagcctcccg gcaaggccct ggagtggctg 540
gctgacatct ggtgggacga caagaaggac tacaacccct ccctgaagtc ccgcctgacc 600
atctccaagg acacctccaa gaaccaggtg gtgctgaagg tgaccaacat ggaccccgcc 660
gacaccgcca cctactactg cgcccgctca atgattacca actggtactt cgacgtgtgg 720
ggagccggta ccaccgtgac cgtgtcttcc gcctccacca agggcccatc ggtcttcccc 780
ctggcaccct cctccaagag cacctctggg ggcacagcgg ccctgggctg cctggtcaag 840
gactacttcc ccgaaccggt gacggtgtcg tggaactcag gcgccctgac cagcggcgtg 900
cacaccttcc cggctgtcct acagtcctca ggactctact ccctcagcag cgtggtgact 960
gtgccctcta gcagcttggg cacccagacc tacatctgca acgtgaatca caagcccagc 1020
aacaccaagg tggacaagaa agttgaaccc aaatcttgcg acaaaactca cacatgccca 1080
ccgtgcccag cacctgaact cctgggggga ccgtcagtct tcctcttccc cccaaaaccc 1140
aaggacaccc tcatgatctc ccggacccct gaggtcacat gcgtggtggt ggacgtgagc 1200
cacgaagacc ctgaggtcaa gttcaactgg tacgtggacg gcgtggaggt gcataatgcc 1260
aagacaaagc cgcgggagga gcagtacaac agcacgtacc gtgtggtcag cgtcctcacc 1320
gtcctgcacc aggactggct gaatggcaag gagtacaagt gcaaggtctc caacaaagcc 1380
ctcccagccc ccatagagaa aaccatctcc aaagccaaag ggcagccccg agaaccacag 1440
gtgtacaccc tgcccccatc ccgggaggag atgaccaaga accaggtcag cctgacctgc 1500
ctggtcaaag gcttctatcc cagcgacatc gccgtggagt gggagagcaa tgggcagccg 1560
gagaacaact acaagaccac gcctcccgtg ctggactccg acggctcctt cttcctctac 1620
agcaagctca ccgtggacaa gagcaggtgg cagcagggga acgtcttctc atgctccgtg 1680
atgcatgagg ctctgcacaa ccactacacg cagaagagcc tctccctgtc tccgggtaaa 1740
<210> 68
<211> 563
<212> PRT
<213> 人工序列
<400> 68
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gln Val Thr Leu Arg Glu Ser Gly Pro Ala Leu Val Lys Pro
130 135 140
Thr Gln Thr Leu Thr Leu Thr Cys Thr Phe Ser Gly Phe Ser Leu Ser
145 150 155 160
Thr Ser Gly Met Ser Val Gly Trp Ile Arg Gln Pro Pro Gly Lys Ala
165 170 175
Leu Glu Trp Leu Ala Asp Ile Trp Trp Asp Asp Lys Lys Asp Tyr Asn
180 185 190
Pro Ser Leu Lys Ser Arg Leu Thr Ile Ser Lys Asp Thr Ser Lys Asn
195 200 205
Gln Val Val Leu Lys Val Thr Asn Met Asp Pro Ala Asp Thr Ala Thr
210 215 220
Tyr Tyr Cys Ala Arg Ser Met Ile Thr Asn Trp Tyr Phe Asp Val Trp
225 230 235 240
Gly Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
245 250 255
Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
260 265 270
Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
275 280 285
Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro
290 295 300
Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
305 310 315 320
Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
325 330 335
His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser
340 345 350
Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu
355 360 365
Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
370 375 380
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
385 390 395 400
His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu
405 410 415
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr
420 425 430
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
435 440 445
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro
450 455 460
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
465 470 475 480
Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val
485 490 495
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
500 505 510
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
515 520 525
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
530 535 540
Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val
545 550 555 560
Met His Glu
<210> 69
<211> 1593
<212> DNA
<213> 人工序列
<400> 69
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
gggcaggtga ccctgcgcga gtccggccct gcactggtga agcccaccca gaccctgacc 300
ctgacctgca ccttctccgg cttctccctg tccacctccg gcatgtccgt gggctggatc 360
cggcagcctc ccggcaaggc cctggagtgg ctggctgaca tctggtggga cgacaagaag 420
gactacaacc cctccctgaa gtcccgcctg accatctcca aggacacctc caagaaccag 480
gtggtgctga aggtgaccaa catggacccc gccgacaccg ccacctacta ctgcgcccgc 540
tcaatgatta ccaactggta cttcgacgtg tggggagccg gtaccaccgt gaccgtgtct 600
tccgcctcca ccaagggccc atcggtcttc cccctggcac cctcctccaa gagcacctct 660
gggggcacag cggccctggg ctgcctggtc aaggactact tccccgaacc ggtgacggtg 720
tcgtggaact caggcgccct gaccagcggc gtgcacacct tcccggctgt cctacagtcc 780
tcaggactct actccctcag cagcgtggtg actgtgccct ctagcagctt gggcacccag 840
acctacatct gcaacgtgaa tcacaagccc agcaacacca aggtggacaa gaaagttgaa 900
cccaaatctt gcgacaaaac tcacacatgc ccaccgtgcc cagcacctga actcctgggg 960
ggaccgtcag tcttcctctt ccccccaaaa cccaaggaca ccctcatgat ctcccggacc 1020
cctgaggtca catgcgtggt ggtggacgtg agccacgaag accctgaggt caagttcaac 1080
tggtacgtgg acggcgtgga ggtgcataat gccaagacaa agccgcggga ggagcagtac 1140
aacagcacgt accgtgtggt cagcgtcctc accgtcctgc accaggactg gctgaatggc 1200
aaggagtaca agtgcaaggt ctccaacaaa gccctcccag cccccataga gaaaaccatc 1260
tccaaagcca aagggcagcc ccgagaacca caggtgtaca ccctgccccc atcccgggag 1320
gagatgacca agaaccaggt cagcctgacc tgcctggtca aaggcttcta tcccagcgac 1380
atcgccgtgg agtgggagag caatgggcag ccggagaaca actacaagac cacgcctccc 1440
gtgctggact ccgacggctc cttcttcctc tacagcaagc tcaccgtgga caagagcagg 1500
tggcagcagg ggaacgtctt ctcatgctcc gtgatgcatg aggctctgca caaccactac 1560
acgcagaaga gcctctccct gtctccgggt aaa 1593
<210> 70
<211> 531
<212> PRT
<213> 人工序列
<400> 70
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Gln Val Thr Leu Arg Glu Ser Gly Pro Ala Leu Val Lys Pro Thr
85 90 95
Gln Thr Leu Thr Leu Thr Cys Thr Phe Ser Gly Phe Ser Leu Ser Thr
100 105 110
Ser Gly Met Ser Val Gly Trp Ile Arg Gln Pro Pro Gly Lys Ala Leu
115 120 125
Glu Trp Leu Ala Asp Ile Trp Trp Asp Asp Lys Lys Asp Tyr Asn Pro
130 135 140
Ser Leu Lys Ser Arg Leu Thr Ile Ser Lys Asp Thr Ser Lys Asn Gln
145 150 155 160
Val Val Leu Lys Val Thr Asn Met Asp Pro Ala Asp Thr Ala Thr Tyr
165 170 175
Tyr Cys Ala Arg Ser Met Ile Thr Asn Trp Tyr Phe Asp Val Trp Gly
180 185 190
Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
195 200 205
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
210 215 220
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
225 230 235 240
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
245 250 255
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
260 265 270
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
275 280 285
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
290 295 300
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly
305 310 315 320
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
325 330 335
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
340 345 350
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
355 360 365
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
370 375 380
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
385 390 395 400
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile
405 410 415
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
420 425 430
Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gln Val Ser
435 440 445
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
450 455 460
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
465 470 475 480
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
485 490 495
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
500 505 510
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
515 520 525
Pro Gly Lys
530
<210> 71
<211> 342
<212> DNA
<213> 人工序列
<400> 71
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gc 342
<210> 72
<211> 114
<212> PRT
<213> 人工序列
<400> 72
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser
<210> 73
<211> 195
<212> DNA
<213> 人工序列
<400> 73
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgc 195
<210> 74
<211> 65
<212> PRT
<213> 人工序列
<400> 74
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg
65
<210> 75
<211> 1326
<212> DNA
<213> 人工序列
<400> 75
caggtgcagc tggtggagag tggaggtggc gtggtacagc ccggccgcag cctgcgcctg 60
gactgcaagg ccagcggcat caccttcagc aacagcggca tgcactgggt gcgccaggcc 120
cccggcaagg gcctggagtg ggtggccgtg atctggtacg acggcagcaa gcgctactac 180
gccgacagcg tgaagggccg cttcaccatc agccgcgaca acagcaagaa caccctgttc 240
ctgcagatga acagcctgcg cgccgaggac accgccgtgt actactgcgc caccaacgac 300
gactactggg gccagggcac cctggtgacc gtgagcagcg cctccaccaa gggcccatcg 360
gtcttccccc tggcaccctc ctccaagagc acctctgggg gcacagcggc cctgggctgc 420
ctggtcaagg actacttccc cgaaccggtg acggtgtcgt ggaactcagg cgccctgacc 480
agcggcgtgc acaccttccc ggctgtccta cagtcctcag gactctactc cctcagcagc 540
gtggtgactg tgccctctag cagcttgggc acccagacct acatctgcaa cgtgaatcac 600
aagcccagca acaccaaggt ggacaagaaa gttgaaccca aatcttgcga caaaactcac 660
acatgcccac cgtgcccagc acctccagtc gccggaccgt cagtcttcct cttccctcca 720
aaacccaagg acaccctcat gatctcccgg acccctgagg tcacatgcgt ggtggtggac 780
gtgagccacg aagaccctga ggtcaagttc aactggtacg tggacggcgt ggaggtgcat 840
aatgccaaga caaagccgcg ggaggagcag tacaacagca cgtaccgtgt ggtcagcgtc 900
ctcaccgtcc tgcaccagga ctggctgaat ggcaaggagt acaagtgcaa ggtctccaac 960
aaaggcctcc caagctccat cgagaaaacc atctccaaag ccaaagggca gccccgagaa 1020
ccacaggtgt acaccctgcc tccatcccgg gatgagctga ccaagaacca ggtcagcctg 1080
acctgcctgg tcaaaggctt ctatcccagc gacatcgccg tggagtggga gagcaatggg 1140
cagccggaga acaactacaa gaccacgcct cccgtgctgg actccgacgg ctccttcttc 1200
ctctacagca agctcaccgt ggacaagagc aggtggcagc aggggaacgt cttctcatgc 1260
tccgtgatgc atgaggctct gcacaaccac tacacgcaga agagcctctc cctgtctccg 1320
ggtaaa 1326
<210> 76
<211> 442
<212> PRT
<213> 人工序列
<400> 76
Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15
Ser Leu Arg Leu Asp Cys Lys Ala Ser Gly Ile Thr Phe Ser Asn Ser
20 25 30
Gly Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Val Ile Trp Tyr Asp Gly Ser Lys Arg Tyr Tyr Ala Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Thr Asn Asp Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser
100 105 110
Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser
115 120 125
Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp
130 135 140
Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr
145 150 155 160
Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr
165 170 175
Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln
180 185 190
Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
195 200 205
Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro
210 215 220
Cys Pro Ala Pro Pro Val Ala Gly Pro Ser Val Phe Leu Phe Pro Pro
225 230 235 240
Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys
245 250 255
Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp
260 265 270
Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu
275 280 285
Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu
290 295 300
His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
305 310 315 320
Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly
325 330 335
Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu
340 345 350
Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr
355 360 365
Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn
370 375 380
Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe
385 390 395 400
Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn
405 410 415
Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr
420 425 430
Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440
<210> 77
<211> 642
<212> DNA
<213> 人工序列
<400> 77
gagatcgtgc tgacccagag ccccgccacc ctgagcctga gccccggcga gcgcgccacc 60
ctgagctgcc gcgccagcca gagcgtgagc agctacctgg cctggtacca gcagaagcca 120
ggacaggctc cacgactgct aatctatgac gccagcaacc gcgccaccgg catccccgcc 180
cgcttcagcg gcagcggcag cggcaccgac ttcaccctga ccatcagcag cctggagccc 240
gaggacttcg ccgtgtacta ctgccagcag agcagcaact ggccccgcac cttcggccag 300
ggcaccaagg tggagatcaa gcgcacagtg gcagccccca gcgtcttcat ttttccccct 360
tccgatgaac agctgaagtc cggcactgct tctgtggtct gtctgctgaa caatttctat 420
cccagagagg ccaaggtgca gtggaaagtg gacaacgctc tgcagtccgg caacagccag 480
gagagtgtga ccgaacagga tagtaaggac agcacatatt ctctgtctag taccctgaca 540
ctgagtaagg cagattacga gaagcacaaa gtgtatgcct gcgaagtcac tcatcaggga 600
ctgtcaagcc ccgtgaccaa gagcttcaac cggggcgagt gt 642
<210> 78
<211> 214
<212> PRT
<213> 人工序列
<400> 78
Glu Ile Val Leu Thr Gln Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly
1 5 10 15
Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gln Ser Val Ser Ser Tyr
20 25 30
Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Leu Ile
35 40 45
Tyr Asp Ala Ser Asn Arg Ala Thr Gly Ile Pro Ala Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Glu Pro
65 70 75 80
Glu Asp Phe Ala Val Tyr Tyr Cys Gln Gln Ser Ser Asn Trp Pro Arg
85 90 95
Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
100 105 110
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
115 120 125
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
130 135 140
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
145 150 155 160
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
165 170 175
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
180 185 190
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
195 200 205
Phe Asn Arg Gly Glu Cys
210
<210> 79
<211> 1350
<212> DNA
<213> 人工序列
<400> 79
gaagtacagc tgcttgagag tggaggaggt ttggtacagc ccggcggatc cctccgcctg 60
tcctgtgcgg ctagtggctt tacattctca tcctatatca tgatgtgggt aagacaggcc 120
ccaggaaagg gcctggagtg ggttagttct atctacccct caggcgggat taccttctac 180
gcagatactg tgaagggcag gtttaccata tcccgagaca acagtaagaa taccctttac 240
cttcaaatga actcccttcg ggccgaggac actgcggtgt actattgcgc tcgcattaag 300
cttggcaccg tgacaaccgt gaactattgg ggtcaaggca cgctggtgac tgtctcttcc 360
gcctccacca agggcccatc ggtcttcccc ctggcaccct cctccaagag cacctctggg 420
ggcacagcgg ccctgggctg cctggtcaag gactacttcc ccgaaccggt gacggtgtcg 480
tggaactcag gcgccctgac cagcggcgtg cacaccttcc cggctgtcct acagtcctca 540
ggactctact ccctcagcag cgtggtgact gtgccctcta gcagcttggg cacccagacc 600
tacatctgca acgtgaatca caagcccagc aacaccaagg tggacaagaa agttgaaccc 660
aaatcttgcg acaaaactca cacatgccca ccgtgcccag cacctgaact cctgggggga 720
ccgtcagtct tcctcttccc cccaaaaccc aaggacaccc tcatgatctc ccggacccct 780
gaggtcacat gcgtggtggt ggacgtgagc cacgaagacc ctgaggtcaa gttcaactgg 840
tacgtggacg gcgtggaggt gcataatgcc aagacaaagc cgcgggagga gcagtacaac 900
agcacgtacc gtgtggtcag cgtcctcacc gtcctgcacc aggactggct gaatggcaag 960
gagtacaagt gcaaggtctc caacaaagcc ctcccagccc ccatagagaa aaccatctcc 1020
aaagccaaag ggcagccccg agaaccacag gtgtacaccc tgcccccatc ccgggatgag 1080
ctgaccaaga accaggtcag cctgacctgc ctggtcaaag gcttctatcc cagcgacatc 1140
gccgtggagt gggagagcaa tgggcagccg gagaacaact acaagaccac gcctcccgtg 1200
ctggactccg acggctcctt cttcctctac agcaagctca ccgtggacaa gagcaggtgg 1260
cagcagggga acgtcttctc atgctccgtg atgcatgagg ctctgcacaa ccactacacg 1320
cagaagagcc tctccctgtc tccgggtaaa 1350
<210> 80
<211> 450
<212> PRT
<213> 人工序列
<400> 80
Glu Val Gln Leu Leu Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser Tyr
20 25 30
Ile Met Met Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ser Ser Ile Tyr Pro Ser Gly Gly Ile Thr Phe Tyr Ala Asp Thr Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ala Arg Ile Lys Leu Gly Thr Val Thr Thr Val Asn Tyr Trp Gly Gln
100 105 110
Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val
115 120 125
Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala
130 135 140
Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser
145 150 155 160
Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
165 170 175
Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro
180 185 190
Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
195 200 205
Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
210 215 220
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly
225 230 235 240
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
245 250 255
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
260 265 270
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
275 280 285
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
290 295 300
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
305 310 315 320
Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile Glu
325 330 335
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
340 345 350
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
355 360 365
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
370 375 380
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
385 390 395 400
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
405 410 415
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
420 425 430
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
435 440 445
Gly Lys
450
<210> 81
<211> 648
<212> DNA
<213> 人工序列
<400> 81
caatccgcac tgactcaacc agccagcgtt agcggctccc ctggtcaatc tatcaccatc 60
agctgtaccg ggaccagctc agacgttggc ggttacaact acgtcagctg gtaccagcag 120
cacccgggta aagctccaaa gctgatgatt tatgatgtgt ctaatcgacc ttctggtgta 180
tctaaccgat tttcaggctc taaaagtgga aatactgctt ccctcacgat ctcagggctg 240
caagccgaag acgaagccga ttattattgt tctagctata catccagcag cacccgcgtg 300
tttggaacgg gaaccaaggt cacggttctg ggacagccca aagccaatcc taccgtcact 360
ctgttcccac ccagtagtga ggagctgcag gcaaataagg ctaccctggt ctgtcttata 420
tccgatttct atcccggggc agtcacagtc gcttggaagg cagatggctc tccagtgaag 480
gccggcgtcg aaacaactaa accttccaag cagtctaata acaagtacgc tgcttcttct 540
tacctttcac ttactcctga acaatggaag agccacagga gttactcttg tcaggtaacc 600
cacgaggggt ccactgtgga gaaaaccgtc gctcccacag agtgttct 648
<210> 82
<211> 216
<212> PRT
<213> 人工序列
<400> 82
Gln Ser Ala Leu Thr Gln Pro Ala Ser Val Ser Gly Ser Pro Gly Gln
1 5 10 15
Ser Ile Thr Ile Ser Cys Thr Gly Thr Ser Ser Asp Val Gly Gly Tyr
20 25 30
Asn Tyr Val Ser Trp Tyr Gln Gln His Pro Gly Lys Ala Pro Lys Leu
35 40 45
Met Ile Tyr Asp Val Ser Asn Arg Pro Ser Gly Val Ser Asn Arg Phe
50 55 60
Ser Gly Ser Lys Ser Gly Asn Thr Ala Ser Leu Thr Ile Ser Gly Leu
65 70 75 80
Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Ser Ser Tyr Thr Ser Ser
85 90 95
Ser Thr Arg Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly Gln
100 105 110
Pro Lys Ala Asn Pro Thr Val Thr Leu Phe Pro Pro Ser Ser Glu Glu
115 120 125
Leu Gln Ala Asn Lys Ala Thr Leu Val Cys Leu Ile Ser Asp Phe Tyr
130 135 140
Pro Gly Ala Val Thr Val Ala Trp Lys Ala Asp Gly Ser Pro Val Lys
145 150 155 160
Ala Gly Val Glu Thr Thr Lys Pro Ser Lys Gln Ser Asn Asn Lys Tyr
165 170 175
Ala Ala Ser Ser Tyr Leu Ser Leu Thr Pro Glu Gln Trp Lys Ser His
180 185 190
Arg Ser Tyr Ser Cys Gln Val Thr His Glu Gly Ser Thr Val Glu Lys
195 200 205
Thr Val Ala Pro Thr Glu Cys Ser
210 215
<210> 83
<211> 1716
<212> DNA
<213> 人工序列
<400> 83
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacgacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg caggtgcagc tggtggagag tggaggtggc 420
gtggtacagc ccggccgcag cctgcgcctg gactgcaagg ccagcggcat caccttcagc 480
aacagcggca tgcactgggt gcgccaggcc cccggcaagg gcctggagtg ggtggccgtg 540
atctggtacg acggcagcaa gcgctactac gccgacagcg tgaagggccg cttcaccatc 600
agccgcgaca acagcaagaa caccctgttc ctgcagatga acagcctgcg cgccgaggac 660
accgccgtgt actactgcgc caccaacgac gactactggg gccagggcac cctggtgacc 720
gtgagcagcg cctccaccaa gggcccatcg gtcttccccc tggcaccctc ctccaagagc 780
acctctgggg gcacagcggc cctgggctgc ctggtcaagg actacttccc cgaaccggtg 840
acggtgtcgt ggaactcagg cgccctgacc agcggcgtgc acaccttccc ggctgtccta 900
cagtcctcag gactctactc cctcagcagc gtggtgactg tgccctctag cagcttgggc 960
acccagacct acatctgcaa cgtgaatcac aagcccagca acaccaaggt ggacaagaaa 1020
gttgaaccca aatcttgcga caaaactcac acatgcccac cgtgcccagc acctccagtc 1080
gccggaccgt cagtcttcct cttccctcca aaacccaagg acaccctcat gatctcccgg 1140
acccctgagg tcacatgcgt ggtggtggac gtgagccacg aagaccctga ggtcaagttc 1200
aactggtacg tggacggcgt ggaggtgcat aatgccaaga caaagccgcg ggaggagcag 1260
tacaacagca cgtaccgtgt ggtcagcgtc ctcaccgtcc tgcaccagga ctggctgaat 1320
ggcaaggagt acaagtgcaa ggtctccaac aaaggcctcc caagctccat cgagaaaacc 1380
atctccaaag ccaaagggca gccccgagaa ccacaggtgt acaccctgcc tccatcccgg 1440
gatgagctga ccaagaacca ggtcagcctg acctgcctgg tcaaaggctt ctatcccagc 1500
gacatcgccg tggagtggga gagcaatggg cagccggaga acaactacaa gaccacgcct 1560
cccgtgctgg actccgacgg ctccttcttc ctctacagca agctcaccgt ggacaagagc 1620
aggtggcagc aggggaacgt cttctcatgc tccgtgatgc atgaggctct gcacaaccac 1680
tacacgcaga agagcctctc cctgtctccg ggtaaa 1716
<210> 84
<211> 572
<212> PRT
<213> 人工序列
<400> 84
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro
130 135 140
Gly Arg Ser Leu Arg Leu Asp Cys Lys Ala Ser Gly Ile Thr Phe Ser
145 150 155 160
Asn Ser Gly Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu
165 170 175
Trp Val Ala Val Ile Trp Tyr Asp Gly Ser Lys Arg Tyr Tyr Ala Asp
180 185 190
Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr
195 200 205
Leu Phe Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
210 215 220
Tyr Cys Ala Thr Asn Asp Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr
225 230 235 240
Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro
245 250 255
Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val
260 265 270
Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala
275 280 285
Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly
290 295 300
Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly
305 310 315 320
Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys
325 330 335
Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys
340 345 350
Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser Val Phe Leu Phe
355 360 365
Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val
370 375 380
Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe
385 390 395 400
Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro
405 410 415
Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr
420 425 430
Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val
435 440 445
Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys Ala
450 455 460
Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg
465 470 475 480
Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly
485 490 495
Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro
500 505 510
Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser
515 520 525
Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln
530 535 540
Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His
545 550 555 560
Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
565 570
<210> 85
<211> 885
<212> DNA
<213> 人工序列
<400> 85
atcacctgcc cacctcccat gagcgtggag cacgccgaca tctgggtgaa gagctacagc 60
ctgtacagcc gcgagcgcta catctgcaac agcggcttca agcgcaaggc cggcaccagc 120
agcctgaccg agtgcgtgct gaacaaggcc accaacgtgg cccactggac cacccccagc 180
ctgaagtgca tccgcggcgg aggcggatcc ggaggcggag gttccggcgg gggtgggagc 240
ggggagatcg tgctgaccca gagccccgcc accctgagcc tgagccccgg cgagcgcgcc 300
accctgagct gccgcgccag ccagagcgtg agcagctacc tggcctggta ccagcagaag 360
ccaggacagg ctccacgact gctaatctat gacgccagca accgcgccac cggcatcccc 420
gcccgcttca gcggcagcgg cagcggcacc gacttcaccc tgaccatcag cagcctggag 480
cccgaggact tcgccgtgta ctactgccag cagagcagca actggccccg caccttcggc 540
cagggcacca aggtggagat caagcgcaca gtggcagccc ccagcgtctt catttttccc 600
ccttccgatg aacagctgaa gtccggcact gcttctgtgg tctgtctgct gaacaatttc 660
tatcccagag aggccaaggt gcagtggaaa gtggacaacg ctctgcagtc cggcaacagc 720
caggagagtg tgaccgaaca ggatagtaag gacagcacat attctctgtc tagtaccctg 780
acactgagta aggcagatta cgagaagcac aaagtgtatg cctgcgaagt cactcatcag 840
ggactgtcaa gccccgtgac caagagcttc aaccggggcg agtgt 885
<210> 86
<211> 295
<212> PRT
<213> 人工序列
<400> 86
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Glu Ile Val Leu Thr Gln Ser Pro Ala Thr Leu Ser Leu Ser Pro
85 90 95
Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gln Ser Val Ser Ser
100 105 110
Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ala Pro Arg Leu Leu
115 120 125
Ile Tyr Asp Ala Ser Asn Arg Ala Thr Gly Ile Pro Ala Arg Phe Ser
130 135 140
Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Glu
145 150 155 160
Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gln Gln Ser Ser Asn Trp Pro
165 170 175
Arg Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala
180 185 190
Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
195 200 205
Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu
210 215 220
Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser
225 230 235 240
Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu
245 250 255
Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val
260 265 270
Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys
275 280 285
Ser Phe Asn Arg Gly Glu Cys
290 295
<210> 87
<211> 711
<212> DNA
<213> 人工序列
<400> 87
atcacgtgcc ctccccccat gtccgtggaa cacgcagaca tctgggtcaa gagctacagc 60
ttgtactcca gggagcggta catttgtaac tctggtttca agcgtaaagc cggcacgtcc 120
agcctgacgg agtgcgtgtt gaacaaggcc acgaatgtcg cccactggac aacccccagt 180
ctcaaatgca ttagagaccc tgccctggtt caccaaaggc cagcgccacc ctccacagta 240
acgacggcag gggtgacccc acagccagag agcctctccc cttctggaaa agagcccgca 300
gcttcatctc ccagctcaaa caacacagcg gccacaacag cagctattgt cccgggctcc 360
cagctgatgc cttcaaaatc accttccaca ggaaccacag agataagcag tcatgagtcc 420
tcccacggca ccccctctca gacaacagcc aagaactggg aactcacagc atccgcctcc 480
caccagccgc caggtgtgta tccacagggc cacagcgaca ccactgtggc tatctccacg 540
tccactgtcc tgctgtgtgg gctgagcgct gtgtctctcc tggcatgcta cctcaagtca 600
aggcaaactc ccccgctggc cagcgttgaa atggaagcca tggaggctct gccggtgact 660
tgggggacca gcagcagaga tgaagacttg gaaaactgct ctcaccacct a 711
<210> 88
<211> 237
<212> PRT
<213> 人工序列
<400> 88
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Asp Pro Ala Leu Val His Gln Arg Pro Ala Pro Pro Ser Thr Val
65 70 75 80
Thr Thr Ala Gly Val Thr Pro Gln Pro Glu Ser Leu Ser Pro Ser Gly
85 90 95
Lys Glu Pro Ala Ala Ser Ser Pro Ser Ser Asn Asn Thr Ala Ala Thr
100 105 110
Thr Ala Ala Ile Val Pro Gly Ser Gln Leu Met Pro Ser Lys Ser Pro
115 120 125
Ser Thr Gly Thr Thr Glu Ile Ser Ser His Glu Ser Ser His Gly Thr
130 135 140
Pro Ser Gln Thr Thr Ala Lys Asn Trp Glu Leu Thr Ala Ser Ala Ser
145 150 155 160
His Gln Pro Pro Gly Val Tyr Pro Gln Gly His Ser Asp Thr Thr Val
165 170 175
Ala Ile Ser Thr Ser Thr Val Leu Leu Cys Gly Leu Ser Ala Val Ser
180 185 190
Leu Leu Ala Cys Tyr Leu Lys Ser Arg Gln Thr Pro Pro Leu Ala Ser
195 200 205
Val Glu Met Glu Ala Met Glu Ala Leu Pro Val Thr Trp Gly Thr Ser
210 215 220
Ser Arg Asp Glu Asp Leu Glu Asn Cys Ser His His Leu
225 230 235
<210> 89
<211> 1737
<212> DNA
<213> 人工序列
<400> 89
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccggt 360
cctctgggag tacgaggcgg gggtgggagc gggcaggtgc agctgcagga gagcggcccc 420
ggcctggtga agcccagcga gaccctgagc ctgacctgca ccgtgagcgg cggcagcgtg 480
agcagcggcg actactactg gacctggatt cgccagagcc ccggcaaggg cctggagtgg 540
atcggccaca tctactacag cggcaacacc aactacaacc ccagcctgaa gagccgcctg 600
accatcagca tcgacaccag caagacccag ttcagcctga agctgagcag cgtgaccgcc 660
gccgacaccg ccatctacta ctgcgtgcgc gaccgcgtga ccggcgcctt cgacatctgg 720
ggccagggca ccatggtgac tgtgtctagc gcctccacca agggcccatc ggtcttcccc 780
ctggcaccct cctccaagag cacctctggg ggcacagcgg ccctgggctg cctggtcaag 840
gactacttcc ccgaaccggt gacggtgtcg tggaactcag gcgccctgac cagcggcgtg 900
cacaccttcc cggctgtcct acagtcctca ggactctact ccctcagcag cgtggtgact 960
gtgccctcta gcagcttggg cacccagacc tacatctgca acgtgaatca caagcccagc 1020
aacaccaagg tggacaagaa agttgaaccc aaatcttgcg acaaaactca cacatgccca 1080
ccgtgcccag cacctccagt cgccggaccg tcagtcttcc tcttccctcc aaaacccaag 1140
gacaccctca tgatctcccg gacccctgag gtcacatgcg tggtggtgga cgtgagccac 1200
gaagaccctg aggtcaagtt caactggtac gtggacggcg tggaggtgca taatgccaag 1260
acaaagccgc gggaggagca gtacaacagc acgtaccgtg tggtcagcgt cctcaccgtc 1320
ctgcaccagg actggctgaa tggcaaggag tacaagtgca aggtctccaa caaaggcctc 1380
ccaagctcca tcgagaaaac catctccaaa gccaaagggc agccccgaga accacaggtg 1440
tacaccctgc ctccatcccg ggatgagctg accaagaacc aggtcagcct gacctgcctg 1500
gtcaaaggct tctatcccag cgacatcgcc gtggagtggg agagcaatgg gcagccggag 1560
aacaactaca agaccacgcc tcccgtgctg gactccgacg gctccttctt cctctacagc 1620
aagctcaccg tggacaagag caggtggcag caggggaacg tcttctcatg ctccgtgatg 1680
catgaggctc tgcacaacca ctacacgcag aagagcctct ccctgtctcc gggtaaa 1737
<210> 90
<211> 579
<212> PRT
<213> 人工序列
<400> 90
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Pro Leu Gly Val Arg Gly Gly Gly
115 120 125
Gly Ser Gly Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys
130 135 140
Pro Ser Glu Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val
145 150 155 160
Ser Ser Gly Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys
165 170 175
Gly Leu Glu Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr
180 185 190
Asn Pro Ser Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys
195 200 205
Thr Gln Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala
210 215 220
Ile Tyr Tyr Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp
225 230 235 240
Gly Gln Gly Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
245 250 255
Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
260 265 270
Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
275 280 285
Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro
290 295 300
Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
305 310 315 320
Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
325 330 335
His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser
340 345 350
Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala
355 360 365
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
370 375 380
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
385 390 395 400
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
405 410 415
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
420 425 430
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
435 440 445
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
450 455 460
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
465 470 475 480
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
485 490 495
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
500 505 510
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
515 520 525
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
530 535 540
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
545 550 555 560
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
565 570 575
Pro Gly Lys
<210> 91
<211> 1719
<212> DNA
<213> 人工序列
<400> 91
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccggt 360
cctctgggag tacgacaggt gcagctgcag gagagcggcc ccggcctggt gaagcccagc 420
gagaccctga gcctgacctg caccgtgagc ggcggcagcg tgagcagcgg cgactactac 480
tggacctgga ttcgccagag ccccggcaag ggcctggagt ggatcggcca catctactac 540
agcggcaaca ccaactacaa ccccagcctg aagagccgcc tgaccatcag catcgacacc 600
agcaagaccc agttcagcct gaagctgagc agcgtgaccg ccgccgacac cgccatctac 660
tactgcgtgc gcgaccgcgt gaccggcgcc ttcgacatct ggggccaggg caccatggtg 720
actgtgtcta gcgcctccac caagggccca tcggtcttcc ccctggcacc ctcctccaag 780
agcacctctg ggggcacagc ggccctgggc tgcctggtca aggactactt ccccgaaccg 840
gtgacggtgt cgtggaactc aggcgccctg accagcggcg tgcacacctt cccggctgtc 900
ctacagtcct caggactcta ctccctcagc agcgtggtga ctgtgccctc tagcagcttg 960
ggcacccaga cctacatctg caacgtgaat cacaagccca gcaacaccaa ggtggacaag 1020
aaagttgaac ccaaatcttg cgacaaaact cacacatgcc caccgtgccc agcacctcca 1080
gtcgccggac cgtcagtctt cctcttccct ccaaaaccca aggacaccct catgatctcc 1140
cggacccctg aggtcacatg cgtggtggtg gacgtgagcc acgaagaccc tgaggtcaag 1200
ttcaactggt acgtggacgg cgtggaggtg cataatgcca agacaaagcc gcgggaggag 1260
cagtacaaca gcacgtaccg tgtggtcagc gtcctcaccg tcctgcacca ggactggctg 1320
aatggcaagg agtacaagtg caaggtctcc aacaaaggcc tcccaagctc catcgagaaa 1380
accatctcca aagccaaagg gcagccccga gaaccacagg tgtacaccct gcctccatcc 1440
cgggatgagc tgaccaagaa ccaggtcagc ctgacctgcc tggtcaaagg cttctatccc 1500
agcgacatcg ccgtggagtg ggagagcaat gggcagccgg agaacaacta caagaccacg 1560
cctcccgtgc tggactccga cggctccttc ttcctctaca gcaagctcac cgtggacaag 1620
agcaggtggc agcaggggaa cgtcttctca tgctccgtga tgcatgaggc tctgcacaac 1680
cactacacgc agaagagcct ctccctgtct ccgggtaaa 1719
<210> 92
<211> 573
<212> PRT
<213> 人工序列
<400> 92
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Pro Leu Gly Val Arg Gln Val Gln
115 120 125
Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu Thr Leu Ser
130 135 140
Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly Asp Tyr Tyr
145 150 155 160
Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu Trp Ile Gly
165 170 175
His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser Leu Lys Ser
180 185 190
Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe Ser Leu Lys
195 200 205
Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr Cys Val Arg
210 215 220
Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly Thr Met Val
225 230 235 240
Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala
245 250 255
Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu
260 265 270
Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly
275 280 285
Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser Ser
290 295 300
Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu
305 310 315 320
Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr
325 330 335
Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
340 345 350
Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser Val Phe Leu
355 360 365
Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu
370 375 380
Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys
385 390 395 400
Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys
405 410 415
Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu
420 425 430
Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys
435 440 445
Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser Lys
450 455 460
Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser
465 470 475 480
Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
485 490 495
Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln
500 505 510
Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly
515 520 525
Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln
530 535 540
Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn
545 550 555 560
His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
565 570
<210> 93
<211> 1737
<212> DNA
<213> 人工序列
<400> 93
caggtgcagc tgcaggagag cggccccggc ctggtgaagc ccagcgagac cctgagcctg 60
acctgcaccg tgagcggcgg cagcgtgagc agcggcgact actactggac ctggattcgc 120
cagagccccg gcaagggcct ggagtggatc ggccacatct actacagcgg caacaccaac 180
tacaacccca gcctgaagag ccgcctgacc atcagcatcg acaccagcaa gacccagttc 240
agcctgaagc tgagcagcgt gaccgccgcc gacaccgcca tctactactg cgtgcgcgac 300
cgcgtgaccg gcgccttcga catctggggc cagggcacca tggtgactgt gtctagcgcc 360
tccaccaagg gcccatcggt cttccccctg gcaccctcct ccaagagcac ctctgggggc 420
acagcggccc tgggctgcct ggtcaaggac tacttccccg aaccggtgac ggtgtcgtgg 480
aactcaggcg ccctgaccag cggcgtgcac accttcccgg ctgtcctaca gtcctcagga 540
ctctactccc tcagcagcgt ggtgactgtg ccctctagca gcttgggcac ccagacctac 600
atctgcaacg tgaatcacaa gcccagcaac accaaggtgg acaagaaagt tgaacccaaa 660
tcttgcgaca aaactcacac atgcccaccg tgcccagcac ctccagtcgc cggaccgtca 720
gtcttcctct tccctccaaa acccaaggac accctcatga tctcccggac ccctgaggtc 780
acatgcgtgg tggtggacgt gagccacgaa gaccctgagg tcaagttcaa ctggtacgtg 840
gacggcgtgg aggtgcataa tgccaagaca aagccgcggg aggagcagta caacagcacg 900
taccgtgtgg tcagcgtcct caccgtcctg caccaggact ggctgaatgg caaggagtac 960
aagtgcaagg tctccaacaa aggcctccca agctccatcg agaaaaccat ctccaaagcc 1020
aaagggcagc cccgagaacc acaggtgtac accctgcctc catcccggga tgagctgacc 1080
aagaaccagg tcagcctgac ctgcctggtc aaaggcttct atcccagcga catcgccgtg 1140
gagtgggaga gcaatgggca gccggagaac aactacaaga ccacgcctcc cgtgctggac 1200
tccgacggct ccttcttcct ctacagcaag ctcaccgtgg acaagagcag gtggcagcag 1260
gggaacgtct tctcatgctc cgtgatgcat gaggctctgc acaaccacta cacgcagaag 1320
agcctctccc tgtctccggg taaaggcgga ggcggatccg gtcctctggg agtacgaggc 1380
gggggtggga gcgggaactg ggtgaacgtg atcagcgacc tgaagaagat cgaggacctg 1440
atccagagca tgcacatcga cgccaccctg tacaccgaga gcgacgtgca ccccagctgc 1500
aaggtgaccg ccatgaagtg cttcctgctg gagctgcagg tgatcagcct ggagagcggc 1560
gacgccagca tccacgacac cgtggagaac ctgatcatcc tggccaacaa cagcctgagc 1620
agcaacggca acgtgaccga gagcggctgc aaggagtgcg aggagctgga ggagaagaac 1680
atcaaggagt tcctgcagag cttcgtgcac atcgtgcaga tgttcatcaa caccagc 1737
<210> 94
<211> 579
<212> PRT
<213> 人工序列
<400> 94
Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu
1 5 10 15
Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly
20 25 30
Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu
35 40 45
Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe
65 70 75 80
Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr
85 90 95
Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly
100 105 110
Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
210 215 220
Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser
225 230 235 240
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
245 250 255
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
260 265 270
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
275 280 285
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
290 295 300
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
305 310 315 320
Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr
325 330 335
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
340 345 350
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys
355 360 365
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
370 375 380
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
385 390 395 400
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
405 410 415
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
420 425 430
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
Gly Gly Gly Gly Ser Gly Pro Leu Gly Val Arg Gly Gly Gly Gly Ser
450 455 460
Gly Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu
465 470 475 480
Ile Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val
485 490 495
His Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu
500 505 510
Gln Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val
515 520 525
Glu Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn
530 535 540
Val Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn
545 550 555 560
Ile Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile
565 570 575
Asn Thr Ser
<210> 95
<211> 1722
<212> DNA
<213> 人工序列
<400> 95
caggtgcagc tgcaggagag cggccccggc ctggtgaagc ccagcgagac cctgagcctg 60
acctgcaccg tgagcggcgg cagcgtgagc agcggcgact actactggac ctggattcgc 120
cagagccccg gcaagggcct ggagtggatc ggccacatct actacagcgg caacaccaac 180
tacaacccca gcctgaagag ccgcctgacc atcagcatcg acaccagcaa gacccagttc 240
agcctgaagc tgagcagcgt gaccgccgcc gacaccgcca tctactactg cgtgcgcgac 300
cgcgtgaccg gcgccttcga catctggggc cagggcacca tggtgactgt gtctagcgcc 360
tccaccaagg gcccatcggt cttccccctg gcaccctcct ccaagagcac ctctgggggc 420
acagcggccc tgggctgcct ggtcaaggac tacttccccg aaccggtgac ggtgtcgtgg 480
aactcaggcg ccctgaccag cggcgtgcac accttcccgg ctgtcctaca gtcctcagga 540
ctctactccc tcagcagcgt ggtgactgtg ccctctagca gcttgggcac ccagacctac 600
atctgcaacg tgaatcacaa gcccagcaac accaaggtgg acaagaaagt tgaacccaaa 660
tcttgcgaca aaactcacac atgcccaccg tgcccagcac ctccagtcgc cggaccgtca 720
gtcttcctct tccctccaaa acccaaggac accctcatga tctcccggac ccctgaggtc 780
acatgcgtgg tggtggacgt gagccacgaa gaccctgagg tcaagttcaa ctggtacgtg 840
gacggcgtgg aggtgcataa tgccaagaca aagccgcggg aggagcagta caacagcacg 900
taccgtgtgg tcagcgtcct caccgtcctg caccaggact ggctgaatgg caaggagtac 960
aagtgcaagg tctccaacaa aggcctccca agctccatcg agaaaaccat ctccaaagcc 1020
aaagggcagc cccgagaacc acaggtgtac accctgcctc catcccggga tgagctgacc 1080
aagaaccagg tcagcctgac ctgcctggtc aaaggcttct atcccagcga catcgccgtg 1140
gagtgggaga gcaatgggca gccggagaac aactacaaga ccacgcctcc cgtgctggac 1200
tccgacggct ccttcttcct ctacagcaag ctcaccgtgg acaagagcag gtggcagcag 1260
gggaacgtct tctcatgctc cgtgatgcat gaggctctgc acaaccacta cacgcagaag 1320
agcctctccc tgtctccggg taaaggtcct ctgggagtac gaggcggggg tgggagcggg 1380
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 1440
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 1500
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 1560
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 1620
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 1680
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gc 1722
<210> 96
<211> 574
<212> PRT
<213> 人工序列
<400> 96
Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu
1 5 10 15
Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly
20 25 30
Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu
35 40 45
Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe
65 70 75 80
Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr
85 90 95
Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly
100 105 110
Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
210 215 220
Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser
225 230 235 240
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
245 250 255
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
260 265 270
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
275 280 285
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
290 295 300
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
305 310 315 320
Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr
325 330 335
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
340 345 350
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys
355 360 365
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
370 375 380
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
385 390 395 400
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
405 410 415
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
420 425 430
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
Gly Pro Leu Gly Val Arg Gly Gly Gly Gly Ser Gly Asn Trp Val Asn
450 455 460
Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile Gln Ser Met His
465 470 475 480
Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His Pro Ser Cys Lys
485 490 495
Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln Val Ile Ser Leu
500 505 510
Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu Asn Leu Ile Ile
515 520 525
Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val Thr Glu Ser Gly
530 535 540
Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile Lys Glu Phe Leu
545 550 555 560
Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn Thr Ser
565 570
<210> 97
<211> 1740
<212> DNA
<213> 人工序列
<400> 97
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccggt 360
cctctgggag tacgaggcgg gggtgggagc ggggaggtgc agctggtgga gtctggagga 420
ggcttggtcc agcctggggg gtccctgaga ctctcctgtg cagcctctgg gttcaatatt 480
aaggacactt acatccactg ggtccgccag gctccaggga aggggctgga gtgggtcgca 540
cgtatttatc ctaccaatgg ttacacacgc tacgcagact ccgtgaaggg ccgattcacc 600
atctccgcag acacttccaa gaacacggcg tatcttcaaa tgaacagcct gagagccgag 660
gacacggccg tgtattactg ttcgagatgg ggcggtgacg gcttctatgc catggactac 720
tggggccaag gaaccctggt caccgtctcc tcagcctcca ccaagggccc atcggtcttc 780
cccctggcac cctcctccaa gagcacctct gggggcacag cggccctggg ctgcctggtc 840
aaggactact tccccgaacc ggtgacggtg tcgtggaact caggcgccct gaccagcggc 900
gtgcacacct tcccggctgt cctacagtcc tcaggactct actccctcag cagcgtggtg 960
actgtgccct ctagcagctt gggcacccag acctacatct gcaacgtgaa tcacaagccc 1020
agcaacacca aggtggacaa gaaagttgaa cccaaatctt gcgacaaaac tcacacatgc 1080
ccaccgtgcc cagcacctcc agtcgccgga ccgtcagtct tcctcttccc tccaaaaccc 1140
aaggacaccc tcatgatctc ccggacccct gaggtcacat gcgtggtggt ggacgtgagc 1200
cacgaagacc ctgaggtcaa gttcaactgg tacgtggacg gcgtggaggt gcataatgcc 1260
aagacaaagc cgcgggagga gcagtacaac agcacgtacc gtgtggtcag cgtcctcacc 1320
gtcctgcacc aggactggct gaatggcaag gagtacaagt gcaaggtctc caacaaaggc 1380
ctcccaagct ccatcgagaa aaccatctcc aaagccaaag ggcagccccg agaaccacag 1440
gtgtacaccc tgcctccatc ccgggatgag ctgaccaaga accaggtcag cctgacctgc 1500
ctggtcaaag gcttctatcc cagcgacatc gccgtggagt gggagagcaa tgggcagccg 1560
gagaacaact acaagaccac gcctcccgtg ctggactccg acggctcctt cttcctctac 1620
agcaagctca ccgtggacaa gagcaggtgg cagcagggga acgtcttctc atgctccgtg 1680
atgcatgagg ctctgcacaa ccactacacg cagaagagcc tctccctgtc tccgggtaaa 1740
<210> 98
<211> 580
<212> PRT
<213> 人工序列
<400> 98
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Pro Leu Gly Val Arg Gly Gly Gly
115 120 125
Gly Ser Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln
130 135 140
Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile
145 150 155 160
Lys Asp Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
165 170 175
Glu Trp Val Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala
180 185 190
Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn
195 200 205
Thr Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val
210 215 220
Tyr Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr
225 230 235 240
Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
245 250 255
Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly
260 265 270
Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val
275 280 285
Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe
290 295 300
Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val
305 310 315 320
Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val
325 330 335
Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys
340 345 350
Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val
355 360 365
Ala Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
370 375 380
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
385 390 395 400
His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu
405 410 415
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr
420 425 430
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
435 440 445
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser
450 455 460
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
465 470 475 480
Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val
485 490 495
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
500 505 510
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
515 520 525
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
530 535 540
Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val
545 550 555 560
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
565 570 575
Ser Pro Gly Lys
580
<210> 99
<211> 1722
<212> DNA
<213> 人工序列
<400> 99
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccggt 360
cctctgggag tacgagaggt gcagctggtg gagtctggag gaggcttggt ccagcctggg 420
gggtccctga gactctcctg tgcagcctct gggttcaata ttaaggacac ttacatccac 480
tgggtccgcc aggctccagg gaaggggctg gagtgggtcg cacgtattta tcctaccaat 540
ggttacacac gctacgcaga ctccgtgaag ggccgattca ccatctccgc agacacttcc 600
aagaacacgg cgtatcttca aatgaacagc ctgagagccg aggacacggc cgtgtattac 660
tgttcgagat ggggcggtga cggcttctat gccatggact actggggcca aggaaccctg 720
gtcaccgtct cctcagcctc caccaagggc ccatcggtct tccccctggc accctcctcc 780
aagagcacct ctgggggcac agcggccctg ggctgcctgg tcaaggacta cttccccgaa 840
ccggtgacgg tgtcgtggaa ctcaggcgcc ctgaccagcg gcgtgcacac cttcccggct 900
gtcctacagt cctcaggact ctactccctc agcagcgtgg tgactgtgcc ctctagcagc 960
ttgggcaccc agacctacat ctgcaacgtg aatcacaagc ccagcaacac caaggtggac 1020
aagaaagttg aacccaaatc ttgcgacaaa actcacacat gcccaccgtg cccagcacct 1080
ccagtcgccg gaccgtcagt cttcctcttc cctccaaaac ccaaggacac cctcatgatc 1140
tcccggaccc ctgaggtcac atgcgtggtg gtggacgtga gccacgaaga ccctgaggtc 1200
aagttcaact ggtacgtgga cggcgtggag gtgcataatg ccaagacaaa gccgcgggag 1260
gagcagtaca acagcacgta ccgtgtggtc agcgtcctca ccgtcctgca ccaggactgg 1320
ctgaatggca aggagtacaa gtgcaaggtc tccaacaaag gcctcccaag ctccatcgag 1380
aaaaccatct ccaaagccaa agggcagccc cgagaaccac aggtgtacac cctgcctcca 1440
tcccgggatg agctgaccaa gaaccaggtc agcctgacct gcctggtcaa aggcttctat 1500
cccagcgaca tcgccgtgga gtgggagagc aatgggcagc cggagaacaa ctacaagacc 1560
acgcctcccg tgctggactc cgacggctcc ttcttcctct acagcaagct caccgtggac 1620
aagagcaggt ggcagcaggg gaacgtcttc tcatgctccg tgatgcatga ggctctgcac 1680
aaccactaca cgcagaagag cctctccctg tctccgggta aa 1722
<210> 100
<211> 574
<212> PRT
<213> 人工序列
<400> 100
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Pro Leu Gly Val Arg Glu Val Gln
115 120 125
Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg
130 135 140
Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys Asp Thr Tyr Ile His
145 150 155 160
Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg Ile
165 170 175
Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val Lys Gly Arg
180 185 190
Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr Leu Gln Met
195 200 205
Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ser Arg Trp
210 215 220
Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gln Gly Thr Leu
225 230 235 240
Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
245 250 255
Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys
260 265 270
Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
275 280 285
Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln Ser
290 295 300
Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser
305 310 315 320
Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn
325 330 335
Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His
340 345 350
Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser Val Phe
355 360 365
Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro
370 375 380
Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro Glu Val
385 390 395 400
Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr
405 410 415
Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val
420 425 430
Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys
435 440 445
Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr Ile Ser
450 455 460
Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro
465 470 475 480
Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val
485 490 495
Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly
500 505 510
Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp
515 520 525
Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp
530 535 540
Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala Leu His
545 550 555 560
Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
565 570
<210> 101
<211> 1035
<212> DNA
<213> 人工序列
<400> 101
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccggt 360
cctctgggag tacgaggcgg gggtgggagc ggggacatcc agatgaccca gtctccatcc 420
tccctgtctg catctgtagg agacagagtc accatcactt gccgggcaag tcaggatgtg 480
aataccgcgg tcgcatggta tcagcagaaa ccagggaaag cccctaagct cctgatctat 540
tctgcatcct tcttgtatag tggggtccca tcaaggttca gtggcagtag atctgggaca 600
gatttcactc tcaccatcag cagtctgcaa cctgaagatt ttgcaactta ctactgtcaa 660
cagcattaca ctacccctcc gacgttcggc caaggtacca aggttgagat caaacgcaca 720
gtggcagccc ccagcgtctt catttttccc ccttccgatg aacagctgaa gtccggcact 780
gcttctgtgg tctgtctgct gaacaatttc tatcccagag aggccaaggt gcagtggaaa 840
gtggacaacg ctctgcagtc cggcaacagc caggagagtg tgaccgaaca ggatagtaag 900
gacagcacat attctctgtc tagtaccctg acactgagta aggcagatta cgagaagcac 960
aaagtgtatg cctgcgaagt cactcatcag ggactgtcaa gccccgtgac caagagcttc 1020
aaccggggcg agtgt 1035
<210> 102
<211> 345
<212> PRT
<213> 人工序列
<400> 102
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Pro Leu Gly Val Arg Gly Gly Gly
115 120 125
Gly Ser Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala
130 135 140
Ser Val Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val
145 150 155 160
Asn Thr Ala Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys
165 170 175
Leu Leu Ile Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg
180 185 190
Phe Ser Gly Ser Arg Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
195 200 205
Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr
210 215 220
Thr Pro Pro Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr
225 230 235 240
Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu
245 250 255
Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro
260 265 270
Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly
275 280 285
Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr
290 295 300
Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His
305 310 315 320
Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val
325 330 335
Thr Lys Ser Phe Asn Arg Gly Glu Cys
340 345
<210> 103
<211> 1017
<212> DNA
<213> 人工序列
<400> 103
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccggt 360
cctctgggag tacgagacat ccagatgacc cagtctccat cctccctgtc tgcatctgta 420
ggagacagag tcaccatcac ttgccgggca agtcaggatg tgaataccgc ggtcgcatgg 480
tatcagcaga aaccagggaa agcccctaag ctcctgatct attctgcatc cttcttgtat 540
agtggggtcc catcaaggtt cagtggcagt agatctggga cagatttcac tctcaccatc 600
agcagtctgc aacctgaaga ttttgcaact tactactgtc aacagcatta cactacccct 660
ccgacgttcg gccaaggtac caaggttgag atcaaacgca cagtggcagc ccccagcgtc 720
ttcatttttc ccccttccga tgaacagctg aagtccggca ctgcttctgt ggtctgtctg 780
ctgaacaatt tctatcccag agaggccaag gtgcagtgga aagtggacaa cgctctgcag 840
tccggcaaca gccaggagag tgtgaccgaa caggatagta aggacagcac atattctctg 900
tctagtaccc tgacactgag taaggcagat tacgagaagc acaaagtgta tgcctgcgaa 960
gtcactcatc agggactgtc aagccccgtg accaagagct tcaaccgggg cgagtgt 1017
<210> 104
<211> 339
<212> PRT
<213> 人工序列
<400> 104
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Pro Leu Gly Val Arg Asp Ile Gln
115 120 125
Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val
130 135 140
Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Asn Thr Ala Val Ala Trp
145 150 155 160
Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Ser Ala
165 170 175
Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Arg Ser
180 185 190
Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe
195 200 205
Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr Pro Pro Thr Phe Gly
210 215 220
Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val
225 230 235 240
Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly Thr Ala Ser
245 250 255
Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln
260 265 270
Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val
275 280 285
Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu
290 295 300
Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu
305 310 315 320
Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg
325 330 335
Gly Glu Cys
<210> 105
<211> 1737
<212> DNA
<213> 人工序列
<400> 105
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gaggtgcagc tggtggagtc tggaggaggc 420
ttggtccagc ctggggggtc cctgagactc tcctgtgcag cctctgggtt caatattaag 480
gacacttaca tccactgggt ccgccaggct ccagggaagg ggctggagtg ggtcgcacgt 540
atttatccta ccaatggtta cacacgctac gcagactccg tgaagggccg attcaccatc 600
tccgcagaca cttccaagaa cacggcgtat cttcaaatga acagcctgag agccgaggac 660
acggccgtgt attactgttc gagatggggc ggtgacggct tctatgccat ggactactgg 720
ggccaaggaa ccctggtcac cgtctcctca gcctccacca agggcccatc ggtcttcccc 780
ctggcaccct cctccaagag cacctctggg ggcacagcgg ccctgggctg cctggtcaag 840
gactacttcc ccgaaccggt gacggtgtcg tggaactcag gcgccctgac cagcggcgtg 900
cacaccttcc cggctgtcct acagtcctca ggactctact ccctcagcag cgtggtgact 960
gtgccctcta gcagcttggg cacccagacc tacatctgca acgtgaatca caagcccagc 1020
aacaccaagg tggacaagaa agttgaaccc aaatcttgcg acaaaactca cacatgccca 1080
ccgtgcccag cacctccagt cgccggaccg tcagtcttcc tcttccctcc aaaacccaag 1140
gacaccctca tgatctcccg gacccctgag gtcacatgcg tggtggtgga cgtgagccac 1200
gaagaccctg aggtcaagtt caactggtac gtggacggcg tggaggtgca taatgccaag 1260
acaaagccgc gggaggagca gtacaacagc acgtaccgtg tggtcagcgt cctcaccgtc 1320
ctgcaccagg actggctgaa tggcaaggag tacaagtgca aggtctccaa caaaggcctc 1380
ccaagctcca tcgagaaaac catctccaaa gccaaagggc agccccgaga accacaggtg 1440
tacaccctgc ctccatcccg ggatgagctg accaagaacc aggtcagcct gacctgcctg 1500
gtcaaaggct tctatcccag cgacatcgcc gtggagtggg agagcaatgg gcagccggag 1560
aacaactaca agaccacgcc tcccgtgctg gactccgacg gctccttctt cctctacagc 1620
aagctcaccg tggacaagag caggtggcag caggggaacg tcttctcatg ctccgtgatg 1680
catgaggctc tgcacaacca ctacacgcag aagagcctct ccctgtctcc gggtaaa 1737
<210> 106
<211> 579
<212> PRT
<213> 人工序列
<400> 106
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro
130 135 140
Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys
145 150 155 160
Asp Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu
165 170 175
Trp Val Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp
180 185 190
Ser Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr
195 200 205
Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
210 215 220
Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp
225 230 235 240
Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
245 250 255
Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
260 265 270
Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
275 280 285
Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro
290 295 300
Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
305 310 315 320
Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
325 330 335
His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser
340 345 350
Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala
355 360 365
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
370 375 380
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
385 390 395 400
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
405 410 415
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
420 425 430
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
435 440 445
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile
450 455 460
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
465 470 475 480
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
485 490 495
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
500 505 510
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
515 520 525
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
530 535 540
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
545 550 555 560
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
565 570 575
Pro Gly Lys
<210> 107
<211> 1032
<212> DNA
<213> 人工序列
<400> 107
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gcggcggagg cggatccgga 360
ggcggaggtt ccggcggggg tgggagcggg gacatccaga tgacccagtc tccatcctcc 420
ctgtctgcat ctgtaggaga cagagtcacc atcacttgcc gggcaagtca ggatgtgaat 480
accgcggtcg catggtatca gcagaaacca gggaaagccc ctaagctcct gatctattct 540
gcatccttct tgtatagtgg ggtcccatca aggttcagtg gcagtagatc tgggacagat 600
ttcactctca ccatcagcag tctgcaacct gaagattttg caacttacta ctgtcaacag 660
cattacacta cccctccgac gttcggccaa ggtaccaagg ttgagatcaa acgcacagtg 720
gcagccccca gcgtcttcat ttttccccct tccgatgaac agctgaagtc cggcactgct 780
tctgtggtct gtctgctgaa caatttctat cccagagagg ccaaggtgca gtggaaagtg 840
gacaacgctc tgcagtccgg caacagccag gagagtgtga ccgaacagga tagtaaggac 900
agcacatatt ctctgtctag taccctgaca ctgagtaagg cagattacga gaagcacaaa 960
gtgtatgcct gcgaagtcac tcatcaggga ctgtcaagcc ccgtgaccaa gagcttcaac 1020
cggggcgagt gt 1032
<210> 108
<211> 344
<212> PRT
<213> 人工序列
<400> 108
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser
130 135 140
Val Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Val Asn
145 150 155 160
Thr Ala Val Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu
165 170 175
Leu Ile Tyr Ser Ala Ser Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe
180 185 190
Ser Gly Ser Arg Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu
195 200 205
Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln His Tyr Thr Thr
210 215 220
Pro Pro Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val
225 230 235 240
Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys
245 250 255
Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
260 265 270
Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn
275 280 285
Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser
290 295 300
Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
305 310 315 320
Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr
325 330 335
Lys Ser Phe Asn Arg Gly Glu Cys
340
<210> 109
<211> 1347
<212> DNA
<213> 人工序列
<400> 109
gaggtgcagc tggtggagtc tggaggaggc ttggtccagc ctggggggtc cctgagactc 60
tcctgtgcag cctctgggtt caatattaag gacacttaca tccactgggt ccgccaggct 120
ccagggaagg ggctggagtg ggtcgcacgt atttatccta ccaatggtta cacacgctac 180
gcagactccg tgaagggccg attcaccatc tccgcagaca cttccaagaa cacggcgtat 240
cttcaaatga acagcctgag agccgaggac acggccgtgt attactgttc gagatggggc 300
ggtgacggct tctatgccat ggactactgg ggccaaggaa ccctggtcac cgtctcctca 360
gcctccacca agggcccatc ggtcttcccc ctggcaccct cctccaagag cacctctggg 420
ggcacagcgg ccctgggctg cctggtcaag gactacttcc ccgaaccggt gacggtgtcg 480
tggaactcag gcgccctgac cagcggcgtg cacaccttcc cggctgtcct acagtcctca 540
ggactctact ccctcagcag cgtggtgact gtgccctcta gcagcttggg cacccagacc 600
tacatctgca acgtgaatca caagcccagc aacaccaagg tggacaagaa agttgaaccc 660
aaatcttgcg acaaaactca cacatgccca ccgtgcccag cacctccagt cgccggaccg 720
tcagtcttcc tcttccctcc aaaacccaag gacaccctca tgatctcccg gacccctgag 780
gtcacatgcg tggtggtgga cgtgagccac gaagaccctg aggtcaagtt caactggtac 840
gtggacggcg tggaggtgca taatgccaag acaaagccgc gggaggagca gtacaacagc 900
acgtaccgtg tggtcagcgt cctcaccgtc ctgcaccagg actggctgaa tggcaaggag 960
tacaagtgca aggtctccaa caaaggcctc ccaagctcca tcgagaaaac catctccaaa 1020
gccaaagggc agccccgaga accacaggtg tacaccctgc ctccatcccg ggatgagctg 1080
accaagaacc aggtcagcct gacctgcctg gtcaaaggct tctatcccag cgacatcgcc 1140
gtggagtggg agagcaatgg gcagccggag aacaactaca agaccacgcc tcccgtgctg 1200
gactccgacg gctccttctt cctctacagc aagctcaccg tggacaagag caggtggcag 1260
caggggaacg tcttctcatg ctccgtgatg catgaggctc tgcacaacca ctacacgcag 1320
aagagcctct ccctgtctcc gggtaaa 1347
<210> 110
<211> 449
<212> PRT
<213> 人工序列
<400> 110
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys Asp Thr
20 25 30
Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45
Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val
50 55 60
Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr
65 70 75 80
Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95
Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gln
100 105 110
Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val
115 120 125
Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala
130 135 140
Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser
145 150 155 160
Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
165 170 175
Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro
180 185 190
Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
195 200 205
Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp
210 215 220
Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro
225 230 235 240
Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser
245 250 255
Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp
260 265 270
Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn
275 280 285
Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val
290 295 300
Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu
305 310 315 320
Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys
325 330 335
Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr
340 345 350
Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr
355 360 365
Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu
370 375 380
Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu
385 390 395 400
Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys
405 410 415
Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
420 425 430
Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly
435 440 445
Lys
<210> 111
<211> 342
<212> DNA
<213> 人工序列
<400> 111
aactgggtga acgtgatcag cgacctgaag aagatcgagg acctgatcca gagcatgcac 60
atcgacgcca ccctgtacac cgagagcgac gtgcacccca gctgcaaggt gaccgccatg 120
aagtgcttcc tgctggagct gcaggtgatc agcctggaga gcggcgacgc cagcatccac 180
gacaccgtgg agaacctgat catcctggcc aacaacagcc tgagcagcaa cggcaacgtg 240
accgagagcg gctgcaagga gtgcgaggag ctggaggaga agaacatcaa ggagttcctg 300
cagagcttcg tgcacatcgt gcagatgttc atcaacacca gc 342
<210> 112
<211> 114
<212> PRT
<213> 人工序列
<400> 112
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser
<210> 113
<211> 578
<212> PRT
<213> 人工序列
<400> 113
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro
130 135 140
Ser Glu Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser
145 150 155 160
Ser Gly Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly
165 170 175
Leu Glu Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn
180 185 190
Pro Ser Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr
195 200 205
Gln Phe Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile
210 215 220
Tyr Tyr Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly
225 230 235 240
Gln Gly Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
245 250 255
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
260 265 270
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
275 280 285
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
290 295 300
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
305 310 315 320
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
325 330 335
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
340 345 350
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly
355 360 365
Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile
370 375 380
Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu
385 390 395 400
Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His
405 410 415
Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg
420 425 430
Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys
435 440 445
Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu
450 455 460
Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr
465 470 475 480
Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu
485 490 495
Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp
500 505 510
Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val
515 520 525
Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp
530 535 540
Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His
545 550 555 560
Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
565 570 575
Gly Lys
<210> 114
<211> 578
<212> PRT
<213> 人工序列
<400> 114
Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu
1 5 10 15
Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly
20 25 30
Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu
35 40 45
Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe
65 70 75 80
Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr
85 90 95
Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly
100 105 110
Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
210 215 220
Thr His Thr Cys Pro Pro Cys Pro Ala Pro Pro Val Ala Gly Pro Ser
225 230 235 240
Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met Ile Ser Arg
245 250 255
Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His Glu Asp Pro
260 265 270
Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val His Asn Ala
275 280 285
Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val
290 295 300
Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly Lys Glu Tyr
305 310 315 320
Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser Ile Glu Lys Thr
325 330 335
Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu
340 345 350
Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser Leu Thr Cys
355 360 365
Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser
370 375 380
Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp
385 390 395 400
Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
405 410 415
Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu Ala
420 425 430
Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys
435 440 445
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
450 455 460
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
465 470 475 480
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
485 490 495
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
500 505 510
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
515 520 525
Asn Leu Ile Ile Leu Ala Asn Asn Ser Leu Ser Ser Asn Gly Asn Val
530 535 540
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
545 550 555 560
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
565 570 575
Thr Ser
<210> 115
<211> 344
<212> PRT
<213> 人工序列
<400> 115
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Asp Ala Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Ile Ala Thr Tyr Phe Cys Gln His Phe Asp His Leu Pro Leu
85 90 95
Ala Phe Gly Gly Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
100 105 110
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
115 120 125
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
130 135 140
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
145 150 155 160
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
165 170 175
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
180 185 190
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
195 200 205
Phe Asn Arg Gly Glu Cys Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
210 215 220
Gly Gly Gly Gly Ser Gly Asn Trp Val Asn Val Ile Ser Asp Leu Lys
225 230 235 240
Lys Ile Glu Asp Leu Ile Gln Ser Met His Ile Asp Ala Thr Leu Tyr
245 250 255
Thr Glu Ser Asp Val His Pro Ser Cys Lys Val Thr Ala Met Lys Cys
260 265 270
Phe Leu Leu Glu Leu Gln Val Ile Ser Leu Glu Ser Gly Asp Ala Ser
275 280 285
Ile His Asp Thr Val Glu Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu
290 295 300
Ser Ser Asn Gly Asn Val Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu
305 310 315 320
Leu Glu Glu Lys Asn Ile Lys Glu Phe Leu Gln Ser Phe Val His Ile
325 330 335
Val Gln Met Phe Ile Asn Thr Ser
340
<210> 116
<211> 308
<212> PRT
<213> 人工序列
<400> 116
Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu
1 5 10 15
Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly
20 25 30
Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu
35 40 45
Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe
65 70 75 80
Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr
85 90 95
Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly
100 105 110
Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
210 215 220
Thr His Thr Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly
225 230 235 240
Gly Ser Gly Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp
245 250 255
Ile Trp Val Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys
260 265 270
Asn Ser Gly Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys
275 280 285
Val Leu Asn Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu
290 295 300
Lys Cys Ile Arg
305
<210> 117
<211> 357
<212> PRT
<213> 人工序列
<400> 117
Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu
1 5 10 15
Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser Ser Gly
20 25 30
Asp Tyr Tyr Trp Thr Trp Ile Arg Gln Ser Pro Gly Lys Gly Leu Glu
35 40 45
Trp Ile Gly His Ile Tyr Tyr Ser Gly Asn Thr Asn Tyr Asn Pro Ser
50 55 60
Leu Lys Ser Arg Leu Thr Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe
65 70 75 80
Ser Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Ile Tyr Tyr
85 90 95
Cys Val Arg Asp Arg Val Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly
100 105 110
Thr Met Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe
115 120 125
Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu
130 135 140
Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
145 150 155 160
Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu
165 170 175
Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
180 185 190
Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys Pro
195 200 205
Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys
210 215 220
Thr His Thr Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly
225 230 235 240
Gly Ser Gly Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu
245 250 255
Asp Leu Ile Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser
260 265 270
Asp Val His Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu
275 280 285
Glu Leu Gln Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp
290 295 300
Thr Val Glu Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn
305 310 315 320
Gly Asn Val Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu
325 330 335
Lys Asn Ile Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met
340 345 350
Phe Ile Asn Thr Ser
355
<210> 118
<211> 295
<212> PRT
<213> 人工序列
<400> 118
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15
Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr
20 25 30
Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45
Tyr Asp Ala Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly
50 55 60
Ser Gly Ser Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80
Glu Asp Ile Ala Thr Tyr Phe Cys Gln His Phe Asp His Leu Pro Leu
85 90 95
Ala Phe Gly Gly Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
100 105 110
Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly
115 120 125
Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala
130 135 140
Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
145 150 155 160
Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser
165 170 175
Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr
180 185 190
Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser
195 200 205
Phe Asn Arg Gly Glu Cys Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
210 215 220
Gly Gly Gly Gly Ser Gly Ile Thr Cys Pro Pro Pro Met Ser Val Glu
225 230 235 240
His Ala Asp Ile Trp Val Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg
245 250 255
Tyr Ile Cys Asn Ser Gly Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu
260 265 270
Thr Glu Cys Val Leu Asn Lys Ala Thr Asn Val Ala His Trp Thr Thr
275 280 285
Pro Ser Leu Lys Cys Ile Arg
290 295
<210> 119
<211> 580
<212> PRT
<213> 人工序列
<400> 119
Asn Trp Val Asn Val Ile Ser Asp Leu Lys Lys Ile Glu Asp Leu Ile
1 5 10 15
Gln Ser Met His Ile Asp Ala Thr Leu Tyr Thr Glu Ser Asp Val His
20 25 30
Pro Ser Cys Lys Val Thr Ala Met Lys Cys Phe Leu Leu Glu Leu Gln
35 40 45
Val Ile Ser Leu Glu Ser Gly Asp Ala Ser Ile His Asp Thr Val Glu
50 55 60
Asn Leu Ile Ile Leu Ala Asn Asp Ser Leu Ser Ser Asn Gly Asn Val
65 70 75 80
Thr Glu Ser Gly Cys Lys Glu Cys Glu Glu Leu Glu Glu Lys Asn Ile
85 90 95
Lys Glu Phe Leu Gln Ser Phe Val His Ile Val Gln Met Phe Ile Asn
100 105 110
Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125
Ser Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro
130 135 140
Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys
145 150 155 160
Asp Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu
165 170 175
Trp Val Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp
180 185 190
Ser Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr
195 200 205
Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr
210 215 220
Tyr Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp
225 230 235 240
Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro
245 250 255
Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
260 265 270
Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr
275 280 285
Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro
290 295 300
Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr
305 310 315 320
Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
325 330 335
His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser
340 345 350
Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu
355 360 365
Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu
370 375 380
Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser
385 390 395 400
His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu
405 410 415
Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr
420 425 430
Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn
435 440 445
Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro
450 455 460
Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln
465 470 475 480
Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val
485 490 495
Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val
500 505 510
Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro
515 520 525
Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr
530 535 540
Val Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val
545 550 555 560
Met His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu
565 570 575
Ser Pro Gly Lys
580
<210> 120
<211> 531
<212> PRT
<213> 人工序列
<400> 120
Ile Thr Cys Pro Pro Pro Met Ser Val Glu His Ala Asp Ile Trp Val
1 5 10 15
Lys Ser Tyr Ser Leu Tyr Ser Arg Glu Arg Tyr Ile Cys Asn Ser Gly
20 25 30
Phe Lys Arg Lys Ala Gly Thr Ser Ser Leu Thr Glu Cys Val Leu Asn
35 40 45
Lys Ala Thr Asn Val Ala His Trp Thr Thr Pro Ser Leu Lys Cys Ile
50 55 60
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
65 70 75 80
Gly Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
85 90 95
Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Asn Ile Lys Asp
100 105 110
Thr Tyr Ile His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp
115 120 125
Val Ala Arg Ile Tyr Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser
130 135 140
Val Lys Gly Arg Phe Thr Ile Ser Ala Asp Thr Ser Lys Asn Thr Ala
145 150 155 160
Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr
165 170 175
Cys Ser Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly
180 185 190
Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser
195 200 205
Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala
210 215 220
Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val
225 230 235 240
Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala
245 250 255
Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
260 265 270
Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His
275 280 285
Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
290 295 300
Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly
305 310 315 320
Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met
325 330 335
Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser His
340 345 350
Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val Glu Val
355 360 365
His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln Tyr Asn Ser Thr Tyr
370 375 380
Arg Val Val Ser Val Leu Thr Val Leu His Gln Asp Trp Leu Asn Gly
385 390 395 400
Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala Pro Ile
405 410 415
Glu Lys Thr Ile Ser Lys Ala Lys Gly Gln Pro Arg Glu Pro Gln Val
420 425 430
Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gln Val Ser
435 440 445
Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu
450 455 460
Trp Glu Ser Asn Gly Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro
465 470 475 480
Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val
485 490 495
Asp Lys Ser Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met
500 505 510
His Glu Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser
515 520 525
Pro Gly Lys
530
Claims (15)
1.一种免疫细胞因子,其包括:
(A)白介素-15(IL-15),其中所述IL-15由SEQ ID NO:72或SEQ ID NO:112的氨基酸序列组成;
(B)白介素-15受体a亚基(IL-15Ra),其中所述IL-15Ra由SEQ ID NO:74或SEQ ID NO:88的氨基酸序列组成;
(C)靶向治疗相关细胞表面抗原的抗体,其中所述治疗相关细胞表面抗原为免疫监测点蛋白或肿瘤抗原;
其中,所述IL-15通过连接肽与所述抗体的重链可变区N端连接,所述IL-15Ra通过连接肽与所述抗体的轻链可变区N端连接;或者,所述IL-15通过连接肽与所述抗体的轻链可变区N端连接,所述IL-15Ra通过连接肽与所述抗体的重链可变区N端连接。
2.根据权利要求1所述的免疫细胞因子,其中所述治疗相关细胞表面抗原选自:表皮生长因子受体家族(EGFR、HER2、HER3、HER4)、PD-1、PD-L1、STEAP1、CTLA-4、4-1BB(CD137)、OX40、CD28、CD40、CD47、CD70、CD80、CD122、GTIR、A2AR、B7-H3(CD276)、B7-H4、IDO、KIR、Tim-3、NY-ESO-1、GPC3、CLL-1、BCMA、mucin家族(MUC1、MUC2、MUC3A、MUC3B、MUC4、MUC5AC、MUC5B、MUC6、MUC7、MUC8、MUC12、MUC13、MUC15、MUC16、MUC17、MUC19、MUC20)、CD19、CD20、CD22、CD30、CD33、CD52、化学趋化因子受体家族(CCR1、CCR2、CCR3、CCR4、CCR5、CCR6、CCR7、CCR8、CCR9、CCR10、CCL27、CCL28、CX3CR1、CXCR1、CXCR2、CXCR3、CXCR4、CXCR5、CXCR6)、PSMA、CEA、HDAC6、EpCAM、Mesothelin、TERT、TLR、TLR9、TLR4、CD33、GITR、Survivin、CD123、TIGIT、TIM-3、CD73、成纤维细胞生长因子受体(FGFR)、血管内皮生长因子受体(FLT1、KDR/Flk-1、VEGFR-
3)、肝细胞生长因子受体(HGFR)、神经生长因子受体(NGFR)、胰岛素样生长因子受体(IGFR)、血小板衍生生长因子受体(PDGFR)、激素受体(黑皮质素1受体(MC1R,MSHR)。
3.根据权利要求2所述的免疫细胞因子,其中所述治疗相关细胞表面抗原选自EGFR、HER2、PD-1、PD-L1、CLL-1、GPC-3、RSV F蛋白、CD19、CD20、CD22、CD30、CD33或CD52。
4.根据权利要求3所述的免疫细胞因子,其中所述靶向治疗相关细胞表面抗原的抗体为靶向EGFR的抗体,该抗体具有如SEQ ID NO:32所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:34所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列;或者具有如SEQ ID NO:32所示的重链氨基酸序列中所含的VH序列,以及如SEQID NO:34所示的轻链氨基酸序列中所含的VL序列。
5.根据权利要求3所述的免疫细胞因子,其中所述靶向治疗相关细胞表面抗原的抗体为靶向HER2的抗体,该抗体具有如SEQ ID NO:18所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:20所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列;或者具有如SEQ ID NO:18所示的重链氨基酸序列中所含的VH序列,以及如SEQID NO:20所示的轻链氨基酸序列中所含的VL序列。
6.根据权利要求3所述的免疫细胞因子,其中所述靶向治疗相关细胞表面抗原的抗体为靶向PD-1的抗体,该抗体具有如SEQ ID NO:76所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:78所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列;或者具有如SEQ ID NO:76所示的重链氨基酸序列中所含的VH序列,以及如SEQID NO:78所示的轻链氨基酸序列中所含的VL序列。
7.根据权利要求3所述的免疫细胞因子,其中所述靶向治疗相关细胞表面抗原的抗体为靶向PD-L1的抗体,该抗体具有SEQ ID NO:36所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:38所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列;或者具有SEQ ID NO:80所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:82所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列,或者具有SEQ ID NO:36所示的氨基酸序列中所含的VH序列,以及如SEQ ID NO:38所示的氨基酸序列中所含的VL序列;或者具有SEQ ID NO:80所示的重链氨基酸序列中所含的VH,以及如SEQ ID NO:82所示的轻链氨基酸序列中所含的VL序列。
8.根据权利要求3所述的免疫细胞因子,其中所述靶向治疗相关细胞表面抗原的抗体为靶向RSV病毒F蛋白的抗体,该抗体具有SEQ ID NO:54所示的重链氨基酸序列中所含的HCDR1、HCDR2和HCDR3序列,以及如SEQ ID NO:56所示的轻链氨基酸序列中所含的LCDR1、LCDR2和LCDR3序列;或者具有SEQ ID NO:54所示的重链氨基酸序列中所含的VH序列,以及如SEQ ID NO:56所示的轻链氨基酸序列中所含的VL序列。
9.根据权利要求1-8中任一项所述的免疫细胞因子,其中所述靶向治疗相关细胞表面抗原的抗体进一步含有Fc片段。
10.根据权利要求9所述的免疫细胞因子,其中所述Fc片段选自人IgG1、IgG2、IgG3或IgG4。
11.根据权利要求1-8中任一项所述的免疫细胞因子,其中所述连接肽独立地选自GGGGSGGGGSGGGGSG、GSPLGVRGS、GSPLGVR、PLGVR、GGGGSGPLGVRGGGGSG或GGGGSGPLGVR。
12.根据权利要求1-8中任一项所述的免疫细胞因子,其中所述免疫细胞因子包含分别具有如下氨基酸序列的重链和轻链:SEQ ID NO:2和SEQ ID NO:8;SEQ ID NO:6和SEQ IDNO:4;SEQ ID NO:68和SEQ ID NO:8;SEQ ID NO:70和SEQ ID NO:4;SEQ ID NO:10和SEQ IDNO:12;SEQ ID NO:14和SEQ ID NO:16;SEQ ID NO:22和SEQ ID NO:24;SEQ ID NO:26和SEQID NO:28;SEQ ID NO:40和SEQ ID NO:24;SEQ ID NO:44和SEQ ID NO:28;SEQ ID NO:46和SEQ ID NO:48;SEQ ID NO:42和SEQ ID NO:58;SEQ ID NO:84和SEQ ID NO:86;SEQ ID NO:60和SEQ ID NO:62;SEQ ID NO:64和SEQ ID NO:66;SEQ ID NO:115和SEQ ID NO:116;SEQID NO:117和SEQ ID NO:118;SEQ ID NO:119和SEQ ID NO:12;SEQ ID NO:120和SEQ IDNO:16;SEQ ID NO:90和SEQ ID NO:34;SEQ ID NO:92和SEQ ID NO:34;SEQ ID NO:94和SEQID NO:34;SEQ ID NO:96和SEQ ID NO:34;SEQ ID NO:98和SEQ ID NO:20;SEQ ID NO:100和SEQ ID NO:20;SEQ ID NO:110和SEQ ID NO:102;SEQ ID NO:110和SEQ ID NO:104;SEQID NO:98和SEQ ID NO:102;SEQ ID NO:98和SEQ ID NO:104;SEQ ID NO:100和SEQ ID NO:102;SEQ ID NO:100和SEQ ID NO:104;SEQ ID NO:106和SEQ ID NO:20;SEQ ID NO:110和SEQ ID NO:108;SEQ ID NO:106和SEQ ID NO:108;SEQ ID NO:113和SEQ ID NO:34;SEQ IDNO:114和SEQ ID NO:34。
13.一种核酸,编码如权利要求1-12中任一项所述的免疫细胞因子。
14.一种载体,其包含如权利要求13所述的核酸。
15.一种宿主细胞,其包含如权利要求14所述的载体。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910764762 | 2019-08-19 | ||
CN2019107647621 | 2019-08-19 | ||
PCT/CN2020/109986 WO2021032116A1 (zh) | 2019-08-19 | 2020-08-19 | 一种免疫细胞因子及其制备与用途 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113646328A CN113646328A (zh) | 2021-11-12 |
CN113646328B true CN113646328B (zh) | 2024-03-26 |
Family
ID=74659948
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080009757.0A Active CN113646328B (zh) | 2019-08-19 | 2020-08-19 | 一种免疫细胞因子及其制备与用途 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20220289838A1 (zh) |
EP (1) | EP4019536A4 (zh) |
CN (1) | CN113646328B (zh) |
WO (1) | WO2021032116A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SG11201705844SA (en) * | 2015-02-06 | 2017-08-30 | Heat Biologics Inc | Vector co-expressing vaccine and costimulatory molecules |
WO2023222886A1 (en) * | 2022-05-20 | 2023-11-23 | Depth Charge Ltd | Antibody-cytokine fusion proteins |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA3039930A1 (en) * | 2016-10-14 | 2018-04-19 | Xencor, Inc. | Bispecific heterodimeric fusion proteins containing il-15/il-15ralpha fc-fusion proteins and pd-1 antibody fragments |
CN108948177A (zh) * | 2007-05-11 | 2018-12-07 | 阿尔托生物科学有限公司 | 融合分子与il-15变异体 |
CN109715196A (zh) * | 2016-06-13 | 2019-05-03 | 转矩医疗股份有限公司 | 用于促进免疫细胞功能的组合物和方法 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2367027T3 (es) | 2004-02-27 | 2011-10-27 | Inserm (Institut National De La Santé Et De La Recherche Medicale) | Sitio de unión de la il-15 para il-15ralfa y mutantes específicos de il-15 que tienen actividad agonista/antagonista. |
NZ701769A (en) | 2009-09-16 | 2016-06-24 | Genentech Inc | Coiled coil and/or tether containing protein complexes and uses thereof |
EP2915569A1 (en) | 2014-03-03 | 2015-09-09 | Cytune Pharma | IL-15/IL-15Ralpha based conjugates purification method |
KR102392142B1 (ko) * | 2016-10-21 | 2022-04-28 | 알토 바이오사이언스 코포레이션 | 다량체 il-15 기반 분자 |
AU2018297248A1 (en) * | 2017-07-03 | 2020-02-20 | Torque Therapeutics, Inc. | Fusion molecules targeting immune regulatory cells and uses thereof |
-
2020
- 2020-08-19 EP EP20855549.0A patent/EP4019536A4/en active Pending
- 2020-08-19 CN CN202080009757.0A patent/CN113646328B/zh active Active
- 2020-08-19 WO PCT/CN2020/109986 patent/WO2021032116A1/zh unknown
- 2020-08-19 US US17/636,352 patent/US20220289838A1/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108948177A (zh) * | 2007-05-11 | 2018-12-07 | 阿尔托生物科学有限公司 | 融合分子与il-15变异体 |
CN109715196A (zh) * | 2016-06-13 | 2019-05-03 | 转矩医疗股份有限公司 | 用于促进免疫细胞功能的组合物和方法 |
CA3039930A1 (en) * | 2016-10-14 | 2018-04-19 | Xencor, Inc. | Bispecific heterodimeric fusion proteins containing il-15/il-15ralpha fc-fusion proteins and pd-1 antibody fragments |
Also Published As
Publication number | Publication date |
---|---|
US20220289838A1 (en) | 2022-09-15 |
EP4019536A1 (en) | 2022-06-29 |
EP4019536A4 (en) | 2023-09-06 |
WO2021032116A1 (zh) | 2021-02-25 |
CN113646328A (zh) | 2021-11-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230340160A1 (en) | Bispecific t cell activating antigen binding molecules | |
TWI811982B (zh) | 對共刺激tnf受體具特異性之雙特異性抗體 | |
KR102648966B1 (ko) | Folr1 및 cd3에 대한 t 세포 활성화 이중특이적 항원 결합 분자 | |
DK2519543T3 (en) | HETERODIMER BINDING PROTEINS AND USE THEREOF | |
KR20210134300A (ko) | 항-sars-cov-2 스파이크 당단백질 항체 및 항원-결합 단편 | |
CN106399276B (zh) | 治疗性核酸酶组合物和方法 | |
KR101901458B1 (ko) | Tcr 복합체 면역치료제 | |
KR102361237B1 (ko) | 코일드 코일 면역글로불린 융합 단백질 및 이것의 조성물 | |
KR101900953B1 (ko) | Cd86 길항제 다중-표적 결합 단백질 | |
KR101715445B1 (ko) | 면역접합체 | |
KR20180081532A (ko) | 암 치료용 조성물 및 방법 | |
KR20180054877A (ko) | 공자극 tnf 수용체에 대해 4가를 갖는 이중특이적 항체 | |
KR20210013156A (ko) | 항-cd33 항체, 항-cd33/항-cd3 이중특이성 항체, 및 이의 용도 | |
KR20150122761A (ko) | T 세포 활성화 항원 결합 분자 | |
CN111954680B (zh) | IL2Rβ/共同γ链抗体 | |
CN109311973A (zh) | 含有c端融合的tnf家族配体三聚体的抗原结合分子 | |
CN110719920A (zh) | 蛋白质异二聚体及其用途 | |
CN107206072A (zh) | T细胞活化性双特异性抗原结合分子CD3 ABD叶酸受体1(FolR1)和PD‑1轴结合拮抗剂的组合疗法 | |
KR20150122203A (ko) | T 세포 활성화 이중특이적 항원 결합 분자 | |
KR20140101744A (ko) | 폴리펩티드 구축물 및 이의 용도 | |
KR20200003367A (ko) | 암 치료용 조성물 및 방법 | |
CN113646328B (zh) | 一种免疫细胞因子及其制备与用途 | |
KR20230017815A (ko) | 항-sars-cov-2 스파이크 당단백질 항체 및 항원-결합 단편 | |
TW202216743A (zh) | Il-10突變蛋白及其融合蛋白 | |
CN113316587B (zh) | 一种双特异性分子及其制备与用途 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |