KR20180101537A - 암 백신 - Google Patents
암 백신 Download PDFInfo
- Publication number
- KR20180101537A KR20180101537A KR1020187023390A KR20187023390A KR20180101537A KR 20180101537 A KR20180101537 A KR 20180101537A KR 1020187023390 A KR1020187023390 A KR 1020187023390A KR 20187023390 A KR20187023390 A KR 20187023390A KR 20180101537 A KR20180101537 A KR 20180101537A
- Authority
- KR
- South Korea
- Prior art keywords
- amino acid
- seq
- immunogenic
- leu
- polypeptide
- Prior art date
Links
- 229940022399 cancer vaccine Drugs 0.000 title description 5
- 238000009566 cancer vaccine Methods 0.000 title description 3
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 469
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 354
- 230000002163 immunogen Effects 0.000 claims abstract description 332
- 229920001184 polypeptide Polymers 0.000 claims abstract description 329
- 101001133056 Homo sapiens Mucin-1 Proteins 0.000 claims abstract description 217
- 102100034256 Mucin-1 Human genes 0.000 claims abstract description 193
- 239000000203 mixture Substances 0.000 claims abstract description 130
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 126
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 124
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 124
- 238000000034 method Methods 0.000 claims abstract description 84
- 101000576802 Homo sapiens Mesothelin Proteins 0.000 claims abstract description 24
- 102100025096 Mesothelin Human genes 0.000 claims abstract 9
- 239000000427 antigen Substances 0.000 claims description 212
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 177
- 239000002773 nucleotide Substances 0.000 claims description 154
- 125000003729 nucleotide group Chemical group 0.000 claims description 154
- 150000001413 amino acids Chemical class 0.000 claims description 150
- 108091007433 antigens Proteins 0.000 claims description 92
- 102000036639 antigens Human genes 0.000 claims description 92
- 206010028980 Neoplasm Diseases 0.000 claims description 81
- 239000013598 vector Substances 0.000 claims description 68
- 201000011510 cancer Diseases 0.000 claims description 52
- 239000003795 chemical substances by application Substances 0.000 claims description 43
- 230000002519 immonomodulatory effect Effects 0.000 claims description 29
- 239000003112 inhibitor Substances 0.000 claims description 22
- 229940043367 IDO1 inhibitor Drugs 0.000 claims description 16
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 14
- 229940121354 immunomodulator Drugs 0.000 claims description 13
- 239000002955 immunomodulating agent Substances 0.000 claims description 12
- 229940045513 CTLA4 antagonist Drugs 0.000 claims description 10
- 206010061902 Pancreatic neoplasm Diseases 0.000 claims description 10
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 claims description 10
- 201000002528 pancreatic cancer Diseases 0.000 claims description 10
- 208000008443 pancreatic carcinoma Diseases 0.000 claims description 10
- 239000008194 pharmaceutical composition Substances 0.000 claims description 10
- 206010006187 Breast cancer Diseases 0.000 claims description 9
- 208000026310 Breast neoplasm Diseases 0.000 claims description 9
- 206010033128 Ovarian cancer Diseases 0.000 claims description 8
- 206010061535 Ovarian neoplasm Diseases 0.000 claims description 8
- 230000002584 immunomodulator Effects 0.000 claims description 7
- 230000007935 neutral effect Effects 0.000 claims description 5
- 239000012275 CTLA-4 inhibitor Substances 0.000 claims description 3
- 208000003721 Triple Negative Breast Neoplasms Diseases 0.000 claims description 3
- 208000022679 triple-negative breast carcinoma Diseases 0.000 claims description 3
- 239000003937 drug carrier Substances 0.000 claims 1
- 229960005486 vaccine Drugs 0.000 abstract description 63
- 239000013612 plasmid Substances 0.000 description 288
- 108090000015 Mesothelin Proteins 0.000 description 169
- 102000003735 Mesothelin Human genes 0.000 description 163
- 235000001014 amino acid Nutrition 0.000 description 140
- 229940024606 amino acid Drugs 0.000 description 140
- 241000282414 Homo sapiens Species 0.000 description 117
- 108090000623 proteins and genes Proteins 0.000 description 106
- 238000010367 cloning Methods 0.000 description 62
- 210000004027 cell Anatomy 0.000 description 61
- 108020004414 DNA Proteins 0.000 description 58
- 210000001744 T-lymphocyte Anatomy 0.000 description 57
- 108010029485 Protein Isoforms Proteins 0.000 description 53
- 102000001708 Protein Isoforms Human genes 0.000 description 53
- 108700026244 Open Reading Frames Proteins 0.000 description 42
- 241001465754 Metazoa Species 0.000 description 39
- 239000002243 precursor Substances 0.000 description 39
- 241000699670 Mus sp. Species 0.000 description 38
- 235000018102 proteins Nutrition 0.000 description 37
- 102000004169 proteins and genes Human genes 0.000 description 37
- 230000028993 immune response Effects 0.000 description 35
- 206010057190 Respiratory tract infections Diseases 0.000 description 33
- 241000701161 unidentified adenovirus Species 0.000 description 33
- 108010087924 alanylproline Proteins 0.000 description 32
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 31
- 230000005867 T cell response Effects 0.000 description 30
- 108010017842 Telomerase Proteins 0.000 description 29
- 239000012528 membrane Substances 0.000 description 26
- 210000004379 membrane Anatomy 0.000 description 26
- 230000004044 response Effects 0.000 description 25
- 210000004989 spleen cell Anatomy 0.000 description 25
- 230000036755 cellular response Effects 0.000 description 24
- -1 haptens Chemical class 0.000 description 23
- 239000002245 particle Substances 0.000 description 23
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 22
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 22
- 108700001237 Nucleic Acid-Based Vaccines Proteins 0.000 description 22
- 229940023146 nucleic acid vaccine Drugs 0.000 description 22
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 21
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 20
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 20
- 101100346929 Homo sapiens MUC1 gene Proteins 0.000 description 20
- 108010074328 Interferon-gamma Proteins 0.000 description 20
- 239000002671 adjuvant Substances 0.000 description 20
- 102000057860 human MUC1 Human genes 0.000 description 20
- 101100346932 Mus musculus Muc1 gene Proteins 0.000 description 19
- 239000013592 cell lysate Substances 0.000 description 19
- 241000124008 Mammalia Species 0.000 description 18
- 210000003719 b-lymphocyte Anatomy 0.000 description 18
- 238000012217 deletion Methods 0.000 description 18
- 230000037430 deletion Effects 0.000 description 18
- 238000013461 design Methods 0.000 description 18
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 18
- 238000004520 electroporation Methods 0.000 description 18
- 102100037850 Interferon gamma Human genes 0.000 description 17
- 108010013476 HLA-A24 Antigen Proteins 0.000 description 16
- 230000035772 mutation Effects 0.000 description 16
- 238000011160 research Methods 0.000 description 16
- 239000000126 substance Substances 0.000 description 16
- 108091093088 Amplicon Proteins 0.000 description 15
- 108010067902 Peptide Library Proteins 0.000 description 15
- 241000700605 Viruses Species 0.000 description 15
- CTMZLDSMFCVUNX-VMIOUTBZSA-N cytidylyl-(3'->5')-guanosine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=C(C(N=C(N)N3)=O)N=C2)O)[C@@H](CO)O1 CTMZLDSMFCVUNX-VMIOUTBZSA-N 0.000 description 15
- 208000035475 disorder Diseases 0.000 description 15
- 108010037850 glycylvaline Proteins 0.000 description 15
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 15
- 125000006850 spacer group Chemical group 0.000 description 15
- 230000003612 virological effect Effects 0.000 description 15
- 241000282553 Macaca Species 0.000 description 14
- 108010060035 arginylproline Proteins 0.000 description 14
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 13
- 238000007920 subcutaneous administration Methods 0.000 description 13
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 12
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 12
- 108010070944 alanylhistidine Proteins 0.000 description 12
- 239000003623 enhancer Substances 0.000 description 12
- 230000003053 immunization Effects 0.000 description 12
- 230000005847 immunogenicity Effects 0.000 description 12
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 12
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 11
- 101100259716 Arabidopsis thaliana TAA1 gene Proteins 0.000 description 11
- 241000710188 Encephalomyocarditis virus Species 0.000 description 11
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 101100259832 Oryza sativa subsp. japonica TAR2 gene Proteins 0.000 description 11
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 11
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 11
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 11
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 11
- 238000002649 immunization Methods 0.000 description 11
- 238000010255 intramuscular injection Methods 0.000 description 11
- 239000007927 intramuscular injection Substances 0.000 description 11
- 230000002101 lytic effect Effects 0.000 description 11
- 239000000546 pharmaceutical excipient Substances 0.000 description 11
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 11
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 10
- 108010074032 HLA-A2 Antigen Proteins 0.000 description 10
- 102000025850 HLA-A2 Antigen Human genes 0.000 description 10
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 10
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 10
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 10
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 10
- 108700019146 Transgenes Proteins 0.000 description 10
- 230000001413 cellular effect Effects 0.000 description 10
- 230000014509 gene expression Effects 0.000 description 10
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 10
- 229910052737 gold Inorganic materials 0.000 description 10
- 239000010931 gold Substances 0.000 description 10
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 9
- 241000880493 Leptailurus serval Species 0.000 description 9
- 241000699666 Mus <mouse, genus> Species 0.000 description 9
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 9
- 108010076504 Protein Sorting Signals Proteins 0.000 description 9
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 9
- 230000002159 abnormal effect Effects 0.000 description 9
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 9
- 230000001086 cytosolic effect Effects 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 108010050848 glycylleucine Proteins 0.000 description 9
- 230000001404 mediated effect Effects 0.000 description 9
- 238000011282 treatment Methods 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 8
- 201000009030 Carcinoma Diseases 0.000 description 8
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 8
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 8
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 8
- 108010008355 arginyl-glutamine Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 210000003205 muscle Anatomy 0.000 description 8
- 238000006467 substitution reaction Methods 0.000 description 8
- 241000282693 Cercopithecidae Species 0.000 description 7
- 230000004663 cell proliferation Effects 0.000 description 7
- 230000009977 dual effect Effects 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 239000000839 emulsion Substances 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 210000005260 human cell Anatomy 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N squalane Chemical compound CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- 206010009944 Colon cancer Diseases 0.000 description 6
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 6
- 239000000556 agonist Substances 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 230000034994 death Effects 0.000 description 6
- 231100000517 death Toxicity 0.000 description 6
- 238000010790 dilution Methods 0.000 description 6
- 239000012895 dilution Substances 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 230000008348 humoral response Effects 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 208000020816 lung neoplasm Diseases 0.000 description 6
- 239000013642 negative control Substances 0.000 description 6
- 230000001613 neoplastic effect Effects 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 6
- 210000002966 serum Anatomy 0.000 description 6
- OSBSFAARYOCBHB-UHFFFAOYSA-N tetrapropylammonium Chemical compound CCC[N+](CCC)(CCC)CCC OSBSFAARYOCBHB-UHFFFAOYSA-N 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 210000004881 tumor cell Anatomy 0.000 description 6
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 5
- 102000004127 Cytokines Human genes 0.000 description 5
- 108090000695 Cytokines Proteins 0.000 description 5
- 229940021995 DNA vaccine Drugs 0.000 description 5
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 5
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 5
- 229960000723 ampicillin Drugs 0.000 description 5
- 210000000612 antigen-presenting cell Anatomy 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 231100000504 carcinogenesis Toxicity 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 210000000987 immune system Anatomy 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 230000002401 inhibitory effect Effects 0.000 description 5
- 238000002347 injection Methods 0.000 description 5
- 239000007924 injection Substances 0.000 description 5
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 5
- 239000002502 liposome Substances 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 201000005202 lung cancer Diseases 0.000 description 5
- 210000001165 lymph node Anatomy 0.000 description 5
- 239000013600 plasmid vector Substances 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 108010077112 prolyl-proline Proteins 0.000 description 5
- 230000028327 secretion Effects 0.000 description 5
- 239000007787 solid Substances 0.000 description 5
- 210000004988 splenocyte Anatomy 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 238000011144 upstream manufacturing Methods 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- 239000003981 vehicle Substances 0.000 description 5
- DRHZYJAUECRAJM-DWSYSWFDSA-N (2s,3s,4s,5r,6r)-6-[[(3s,4s,4ar,6ar,6bs,8r,8ar,12as,14ar,14br)-8a-[(2s,3r,4s,5r,6r)-3-[(2s,3r,4s,5r,6s)-5-[(2s,3r,4s,5r)-4-[(2s,3r,4r)-3,4-dihydroxy-4-(hydroxymethyl)oxolan-2-yl]oxy-3,5-dihydroxyoxan-2-yl]oxy-3,4-dihydroxy-6-methyloxan-2-yl]oxy-5-[(3s,5s, Chemical compound O([C@H]1[C@H](O)[C@H](O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O1)O)O[C@H]1CC[C@]2(C)[C@H]3CC=C4[C@@H]5CC(C)(C)CC[C@@]5([C@@H](C[C@@]4(C)[C@]3(C)CC[C@H]2[C@@]1(C=O)C)O)C(=O)O[C@@H]1O[C@H](C)[C@@H]([C@@H]([C@H]1O[C@H]1[C@@H]([C@H](O)[C@@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@](O)(CO)CO3)O)[C@H](O)CO2)O)[C@H](C)O1)O)O)OC(=O)C[C@@H](O)C[C@H](OC(=O)C[C@@H](O)C[C@@H]([C@@H](C)CC)O[C@H]1[C@@H]([C@@H](O)[C@H](CO)O1)O)[C@@H](C)CC)C(O)=O)[C@@H]1OC[C@@H](O)[C@H](O)[C@H]1O DRHZYJAUECRAJM-DWSYSWFDSA-N 0.000 description 4
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- 241000282472 Canis lupus familiaris Species 0.000 description 4
- 108010041986 DNA Vaccines Proteins 0.000 description 4
- 101000655352 Homo sapiens Telomerase reverse transcriptase Proteins 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 208000005718 Stomach Neoplasms Diseases 0.000 description 4
- 239000004480 active ingredient Substances 0.000 description 4
- 208000009956 adenocarcinoma Diseases 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 230000002146 bilateral effect Effects 0.000 description 4
- 210000000481 breast Anatomy 0.000 description 4
- 229910002091 carbon monoxide Inorganic materials 0.000 description 4
- 230000003197 catalytic effect Effects 0.000 description 4
- 125000002091 cationic group Chemical group 0.000 description 4
- 210000001072 colon Anatomy 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 230000001461 cytolytic effect Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 102000015694 estrogen receptors Human genes 0.000 description 4
- 108010038795 estrogen receptors Proteins 0.000 description 4
- 150000002244 furazanes Chemical class 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 239000001307 helium Substances 0.000 description 4
- 229910052734 helium Inorganic materials 0.000 description 4
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 230000003834 intracellular effect Effects 0.000 description 4
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 4
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 210000000496 pancreas Anatomy 0.000 description 4
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 4
- 102000003998 progesterone receptors Human genes 0.000 description 4
- 108090000468 progesterone receptors Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 102000005962 receptors Human genes 0.000 description 4
- 108020003175 receptors Proteins 0.000 description 4
- 238000003118 sandwich ELISA Methods 0.000 description 4
- 229930182490 saponin Natural products 0.000 description 4
- 235000017709 saponins Nutrition 0.000 description 4
- 150000007949 saponins Chemical class 0.000 description 4
- KZNICNPSHKQLFF-UHFFFAOYSA-N succinimide Chemical class O=C1CCC(=O)N1 KZNICNPSHKQLFF-UHFFFAOYSA-N 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 238000011200 topical administration Methods 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 230000004614 tumor growth Effects 0.000 description 4
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 4
- 241000990167 unclassified Simian adenoviruses Species 0.000 description 4
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 3
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- 102000008096 B7-H1 Antigen Human genes 0.000 description 3
- 108010074708 B7-H1 Antigen Proteins 0.000 description 3
- 238000011746 C57BL/6J (JAX™ mouse strain) Methods 0.000 description 3
- 101150013553 CD40 gene Proteins 0.000 description 3
- 206010051066 Gastrointestinal stromal tumour Diseases 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 102000008070 Interferon-gamma Human genes 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 3
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- 206010041067 Small cell lung cancer Diseases 0.000 description 3
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 3
- 108020000411 Toll-like receptor Proteins 0.000 description 3
- 102000002689 Toll-like receptor Human genes 0.000 description 3
- 102100040245 Tumor necrosis factor receptor superfamily member 5 Human genes 0.000 description 3
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000000259 anti-tumor effect Effects 0.000 description 3
- 230000000890 antigenic effect Effects 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000011260 co-administration Methods 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 210000002615 epidermis Anatomy 0.000 description 3
- 206010017758 gastric cancer Diseases 0.000 description 3
- 201000011243 gastrointestinal stromal tumor Diseases 0.000 description 3
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 238000007918 intramuscular administration Methods 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 210000004072 lung Anatomy 0.000 description 3
- 230000002611 ovarian Effects 0.000 description 3
- 210000001672 ovary Anatomy 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 229920000053 polysorbate 80 Polymers 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 230000001681 protective effect Effects 0.000 description 3
- 210000000664 rectum Anatomy 0.000 description 3
- 210000003289 regulatory T cell Anatomy 0.000 description 3
- 206010039083 rhinitis Diseases 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 208000000587 small cell lung carcinoma Diseases 0.000 description 3
- 229940031439 squalene Drugs 0.000 description 3
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 3
- 201000011549 stomach cancer Diseases 0.000 description 3
- 239000004094 surface-active agent Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000002255 vaccination Methods 0.000 description 3
- UAIUNKRWKOVEES-UHFFFAOYSA-N 3,3',5,5'-tetramethylbenzidine Chemical compound CC1=C(N)C(C)=CC(C=2C=C(C)C(N)=C(C)C=2)=C1 UAIUNKRWKOVEES-UHFFFAOYSA-N 0.000 description 2
- MXKLDYKORJEOPR-UHFFFAOYSA-N 3-(5-fluoro-1h-indol-3-yl)pyrrolidine-2,5-dione Chemical compound C12=CC(F)=CC=C2NC=C1C1CC(=O)NC1=O MXKLDYKORJEOPR-UHFFFAOYSA-N 0.000 description 2
- 208000036832 Adenocarcinoma of ovary Diseases 0.000 description 2
- 206010052747 Adenocarcinoma pancreas Diseases 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 2
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- 241000710929 Alphavirus Species 0.000 description 2
- 241000272814 Anser sp. Species 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 229940123189 CD40 agonist Drugs 0.000 description 2
- 108010021064 CTLA-4 Antigen Proteins 0.000 description 2
- 102000008203 CTLA-4 Antigen Human genes 0.000 description 2
- 102000009016 Cholera Toxin Human genes 0.000 description 2
- 108010049048 Cholera Toxin Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 206010010356 Congenital anomaly Diseases 0.000 description 2
- 108010037462 Cyclooxygenase 2 Proteins 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- 102000000588 Interleukin-2 Human genes 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- 206010025323 Lymphomas Diseases 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- 241000701076 Macacine alphaherpesvirus 1 Species 0.000 description 2
- 102000007651 Macrophage Colony-Stimulating Factor Human genes 0.000 description 2
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 108010008707 Mucin-1 Proteins 0.000 description 2
- 102000007298 Mucin-1 Human genes 0.000 description 2
- 208000009525 Myocarditis Diseases 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 206010061328 Ovarian epithelial cancer Diseases 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 2
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 2
- 108010076039 Polyproteins Proteins 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 2
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 2
- 102100040678 Programmed cell death protein 1 Human genes 0.000 description 2
- 101710089372 Programmed cell death protein 1 Proteins 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Chemical group OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 102100038280 Prostaglandin G/H synthase 2 Human genes 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 206010039491 Sarcoma Diseases 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 2
- 102000011016 Type 5 Cyclic Nucleotide Phosphodiesterases Human genes 0.000 description 2
- 108010037581 Type 5 Cyclic Nucleotide Phosphodiesterases Proteins 0.000 description 2
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 2
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 230000001270 agonistic effect Effects 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 230000030741 antigen processing and presentation Effects 0.000 description 2
- 239000008135 aqueous vehicle Substances 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical class N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 238000002619 cancer immunotherapy Methods 0.000 description 2
- 230000005773 cancer-related death Effects 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 210000002421 cell wall Anatomy 0.000 description 2
- 230000004700 cellular uptake Effects 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 239000013601 cosmid vector Substances 0.000 description 2
- 210000005220 cytoplasmic tail Anatomy 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 210000003162 effector t lymphocyte Anatomy 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 108700020302 erbB-2 Genes Proteins 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000013230 female C57BL/6J mice Methods 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 239000007789 gas Substances 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108091008039 hormone receptors Proteins 0.000 description 2
- KTUFNOKKBVMGRW-UHFFFAOYSA-N imatinib Chemical compound C1CN(C)CCN1CC1=CC=C(C(=O)NC=2C=C(NC=3N=C(C=CN=3)C=3C=NC=CC=3)C(C)=CC=2)C=C1 KTUFNOKKBVMGRW-UHFFFAOYSA-N 0.000 description 2
- 230000016784 immunoglobulin production Effects 0.000 description 2
- 230000003308 immunostimulating effect Effects 0.000 description 2
- 210000005008 immunosuppressive cell Anatomy 0.000 description 2
- 238000009169 immunotherapy Methods 0.000 description 2
- 238000000099 in vitro assay Methods 0.000 description 2
- 108020004201 indoleamine 2,3-dioxygenase Proteins 0.000 description 2
- 102000006639 indoleamine 2,3-dioxygenase Human genes 0.000 description 2
- 229960003130 interferon gamma Drugs 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000002207 metabolite Substances 0.000 description 2
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 2
- JXTPJDDICSTXJX-UHFFFAOYSA-N n-Triacontane Natural products CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC JXTPJDDICSTXJX-UHFFFAOYSA-N 0.000 description 2
- 230000009871 nonspecific binding Effects 0.000 description 2
- 229940066429 octoxynol Drugs 0.000 description 2
- 229920002113 octoxynol Polymers 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 235000019198 oils Nutrition 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 208000013371 ovarian adenocarcinoma Diseases 0.000 description 2
- 201000006588 ovary adenocarcinoma Diseases 0.000 description 2
- 201000002094 pancreatic adenocarcinoma Diseases 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 230000001376 precipitating effect Effects 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 230000037452 priming Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 238000011321 prophylaxis Methods 0.000 description 2
- 239000003909 protein kinase inhibitor Substances 0.000 description 2
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 238000013207 serial dilution Methods 0.000 description 2
- BNRNXUUZRGQAQC-UHFFFAOYSA-N sildenafil Chemical compound CCCC1=NN(C)C(C(N2)=O)=C1N=C2C(C(=CC=1)OCC)=CC=1S(=O)(=O)N1CCN(C)CC1 BNRNXUUZRGQAQC-UHFFFAOYSA-N 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 229940032094 squalane Drugs 0.000 description 2
- 206010041823 squamous cell carcinoma Diseases 0.000 description 2
- 210000002784 stomach Anatomy 0.000 description 2
- 238000010254 subcutaneous injection Methods 0.000 description 2
- 239000007929 subcutaneous injection Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000007910 systemic administration Methods 0.000 description 2
- 239000003826 tablet Substances 0.000 description 2
- 229960000814 tetanus toxoid Drugs 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- 229950007217 tremelimumab Drugs 0.000 description 2
- 102000003390 tumor necrosis factor Human genes 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 208000010576 undifferentiated carcinoma Diseases 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 150000003679 valine derivatives Chemical class 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- COWAQTUWATVJAN-MRVPVSSYSA-N (3R)-3-(5-chloro-1H-indol-3-yl)pyrrolidine-2,5-dione Chemical compound ClC=1C=C2C(=CNC2=CC1)[C@@H]1C(NC(C1)=O)=O COWAQTUWATVJAN-MRVPVSSYSA-N 0.000 description 1
- MXKLDYKORJEOPR-MRVPVSSYSA-N (3R)-3-(5-fluoro-1H-indol-3-yl)pyrrolidine-2,5-dione Chemical compound FC=1C=C2C(=CNC2=CC1)[C@@H]1C(NC(C1)=O)=O MXKLDYKORJEOPR-MRVPVSSYSA-N 0.000 description 1
- VNQNEQQYAIPCAQ-MRVPVSSYSA-N (3r)-3-(1h-indol-3-yl)pyrrolidine-2,5-dione Chemical compound O=C1NC(=O)C[C@@H]1C1=CNC2=CC=CC=C12 VNQNEQQYAIPCAQ-MRVPVSSYSA-N 0.000 description 1
- LEBVLXFERQHONN-UHFFFAOYSA-N 1-butyl-N-(2,6-dimethylphenyl)piperidine-2-carboxamide Chemical compound CCCCN1CCCCC1C(=O)NC1=C(C)C=CC=C1C LEBVLXFERQHONN-UHFFFAOYSA-N 0.000 description 1
- LDGWQMRUWMSZIU-LQDDAWAPSA-M 2,3-bis[(z)-octadec-9-enoxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)C)OCCCCCCCC\C=C/CCCCCCCC LDGWQMRUWMSZIU-LQDDAWAPSA-M 0.000 description 1
- KSXTUUUQYQYKCR-LQDDAWAPSA-M 2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC(=O)OCC(C[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC KSXTUUUQYQYKCR-LQDDAWAPSA-M 0.000 description 1
- JUEUYDRZJNQZGR-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JUEUYDRZJNQZGR-UHFFFAOYSA-N 0.000 description 1
- XLPHMKQBBCKEFO-UHFFFAOYSA-N 2-azaniumylethyl 2,3-bis(3,7,11,15-tetramethylhexadecanoyloxy)propyl phosphate Chemical compound CC(C)CCCC(C)CCCC(C)CCCC(C)CC(=O)OCC(COP(O)(=O)OCCN)OC(=O)CC(C)CCCC(C)CCCC(C)CCCC(C)C XLPHMKQBBCKEFO-UHFFFAOYSA-N 0.000 description 1
- VNQNEQQYAIPCAQ-UHFFFAOYSA-N 3-(1h-indol-3-yl)pyrrolidine-2,5-dione Chemical compound O=C1NC(=O)CC1C1=CNC2=CC=CC=C12 VNQNEQQYAIPCAQ-UHFFFAOYSA-N 0.000 description 1
- FGTCROZDHDSNIO-UHFFFAOYSA-N 3-(4-quinolinylmethylamino)-N-[4-(trifluoromethoxy)phenyl]-2-thiophenecarboxamide Chemical compound C1=CC(OC(F)(F)F)=CC=C1NC(=O)C1=C(NCC=2C3=CC=CC=C3N=CC=2)C=CS1 FGTCROZDHDSNIO-UHFFFAOYSA-N 0.000 description 1
- ASSGNCUADIFTFP-UHFFFAOYSA-N 3-(5,6-difluoro-1h-indol-3-yl)pyrrolidine-2,5-dione Chemical compound C1=2C=C(F)C(F)=CC=2NC=C1C1CC(=O)NC1=O ASSGNCUADIFTFP-UHFFFAOYSA-N 0.000 description 1
- WBVXHYJYKXXPET-UHFFFAOYSA-N 3-(5-bromo-1h-indol-3-yl)pyrrolidine-2,5-dione Chemical compound C12=CC(Br)=CC=C2NC=C1C1CC(=O)NC1=O WBVXHYJYKXXPET-UHFFFAOYSA-N 0.000 description 1
- COWAQTUWATVJAN-UHFFFAOYSA-N 3-(5-chloro-1h-indol-3-yl)pyrrolidine-2,5-dione Chemical compound C12=CC(Cl)=CC=C2NC=C1C1CC(=O)NC1=O COWAQTUWATVJAN-UHFFFAOYSA-N 0.000 description 1
- AHVPBZIGWCANAK-UHFFFAOYSA-N 3-(6-chloro-1h-indol-3-yl)pyrrolidine-2,5-dione Chemical compound C=1NC2=CC(Cl)=CC=C2C=1C1CC(=O)NC1=O AHVPBZIGWCANAK-UHFFFAOYSA-N 0.000 description 1
- DOFKTGJHTOAAJF-UHFFFAOYSA-N 5-fluoro-3-piperidin-3-yl-1h-indole Chemical group C12=CC(F)=CC=C2NC=C1C1CCCNC1 DOFKTGJHTOAAJF-UHFFFAOYSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- OONFNUWBHFSNBT-HXUWFJFHSA-N AEE788 Chemical compound C1CN(CC)CCN1CC1=CC=C(C=2NC3=NC=NC(N[C@H](C)C=4C=CC=CC=4)=C3C=2)C=C1 OONFNUWBHFSNBT-HXUWFJFHSA-N 0.000 description 1
- GBJVVSCPOBPEIT-UHFFFAOYSA-N AZT-1152 Chemical compound N=1C=NC2=CC(OCCCN(CC)CCOP(O)(O)=O)=CC=C2C=1NC(=NN1)C=C1CC(=O)NC1=CC=CC(F)=C1 GBJVVSCPOBPEIT-UHFFFAOYSA-N 0.000 description 1
- 102100034540 Adenomatous polyposis coli protein Human genes 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- RTDZQOFEGPWSJD-AVGNSLFASA-N Arg-Leu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O RTDZQOFEGPWSJD-AVGNSLFASA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- XKDYWGLNSCNRGW-WDSOQIARSA-N Arg-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)CCCCN)C(O)=O)=CNC2=C1 XKDYWGLNSCNRGW-WDSOQIARSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- JYHIVHINLJUIEG-BVSLBCMMSA-N Arg-Tyr-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JYHIVHINLJUIEG-BVSLBCMMSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- QJMCHPGWFZZRID-BQBZGAKWSA-N Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O QJMCHPGWFZZRID-BQBZGAKWSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- IQCJOIHDVFJQFV-LKXGYXEUSA-N Asp-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O IQCJOIHDVFJQFV-LKXGYXEUSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- MLDQJTXFUGDVEO-UHFFFAOYSA-N BAY-43-9006 Chemical compound C1=NC(C(=O)NC)=CC(OC=2C=CC(NC(=O)NC=3C=C(C(Cl)=CC=3)C(F)(F)F)=CC=2)=C1 MLDQJTXFUGDVEO-UHFFFAOYSA-N 0.000 description 1
- 206010060999 Benign neoplasm Diseases 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101710150190 Beta-secretase 2 Proteins 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 206010007279 Carcinoid tumour of the gastrointestinal tract Diseases 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 108010039939 Cell Wall Skeleton Proteins 0.000 description 1
- 241001217856 Chimpanzee adenovirus Species 0.000 description 1
- CMSMOCZEIVJLDB-UHFFFAOYSA-N Cyclophosphamide Chemical compound ClCCN(CCCl)P1(=O)NCCCO1 CMSMOCZEIVJLDB-UHFFFAOYSA-N 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 1
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- OWAFTBLVZNSIFO-SRVKXCTJSA-N Cys-His-His Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OWAFTBLVZNSIFO-SRVKXCTJSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 1
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- 102100039498 Cytotoxic T-lymphocyte protein 4 Human genes 0.000 description 1
- 230000000970 DNA cross-linking effect Effects 0.000 description 1
- 238000011238 DNA vaccination Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 102100031788 E3 ubiquitin-protein ligase MYLIP Human genes 0.000 description 1
- 101710190174 E3 ubiquitin-protein ligase MYLIP Proteins 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 238000011510 Elispot assay Methods 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000214054 Equine rhinitis A virus Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- PCKOTDPDHIBGRW-CIUDSAMLSA-N Gln-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N PCKOTDPDHIBGRW-CIUDSAMLSA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- LUGUNEGJNDEBLU-DCAQKATOSA-N Gln-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LUGUNEGJNDEBLU-DCAQKATOSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- FTMLQFPULNGION-ZVZYQTTQSA-N Gln-Val-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FTMLQFPULNGION-ZVZYQTTQSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- QVDGHDFFYHKJPN-QWRGUYRKSA-N Gly-Phe-Cys Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(O)=O QVDGHDFFYHKJPN-QWRGUYRKSA-N 0.000 description 1
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 206010060980 Granular cell tumour Diseases 0.000 description 1
- 102000009465 Growth Factor Receptors Human genes 0.000 description 1
- 108010009202 Growth Factor Receptors Proteins 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 101150054472 HER2 gene Proteins 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- ABCCKUZDWMERKT-AVGNSLFASA-N His-Pro-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O ABCCKUZDWMERKT-AVGNSLFASA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- IXQGOKWTQPCIQM-YJRXYDGGSA-N His-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O IXQGOKWTQPCIQM-YJRXYDGGSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- 101000889276 Homo sapiens Cytotoxic T-lymphocyte protein 4 Proteins 0.000 description 1
- 101100313319 Homo sapiens TERT gene Proteins 0.000 description 1
- 101000801234 Homo sapiens Tumor necrosis factor receptor superfamily member 18 Proteins 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- 239000004890 Hydrophobing Agent Substances 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108010002616 Interleukin-5 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical group OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 239000005517 L01XE01 - Imatinib Substances 0.000 description 1
- 239000005411 L01XE02 - Gefitinib Substances 0.000 description 1
- 239000005551 L01XE03 - Erlotinib Substances 0.000 description 1
- 239000002147 L01XE04 - Sunitinib Substances 0.000 description 1
- 239000005511 L01XE05 - Sorafenib Substances 0.000 description 1
- 239000002136 L01XE07 - Lapatinib Substances 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 240000000759 Lepidium meyenii Species 0.000 description 1
- 235000000421 Lepidium meyenii Nutrition 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 1
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- BVRNWWHJYNPJDG-XIRDDKMYSA-N Lys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N BVRNWWHJYNPJDG-XIRDDKMYSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 101150114927 MUC1 gene Proteins 0.000 description 1
- 241000282567 Macaca fascicularis Species 0.000 description 1
- 101710141452 Major surface glycoprotein G Proteins 0.000 description 1
- 206010027406 Mesothelioma Diseases 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 101150064776 Msln gene Proteins 0.000 description 1
- 101800001322 Mucin-1 subunit beta Proteins 0.000 description 1
- 102400001154 Mucin-1 subunit beta Human genes 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- FXHOOIRPVKKKFG-UHFFFAOYSA-N N,N-Dimethylacetamide Chemical compound CN(C)C(C)=O FXHOOIRPVKKKFG-UHFFFAOYSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- 108700020354 N-acetylmuramyl-threonyl-isoglutamine Proteins 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- FVUHARPMQIRXTJ-UHFFFAOYSA-N NS(=O)(=O)NCCNC1=CN(ON1O)C1=CC(=CC=C1)C(F)(F)F Chemical compound NS(=O)(=O)NCCNC1=CN(ON1O)C1=CC(=CC=C1)C(F)(F)F FVUHARPMQIRXTJ-UHFFFAOYSA-N 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- GSBKRFGXEJLVMI-UHFFFAOYSA-N Nervonyl carnitine Chemical compound CCC[N+](C)(C)C GSBKRFGXEJLVMI-UHFFFAOYSA-N 0.000 description 1
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 1
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 208000007571 Ovarian Epithelial Carcinoma Diseases 0.000 description 1
- 241000282320 Panthera leo Species 0.000 description 1
- 241000282376 Panthera tigris Species 0.000 description 1
- 235000019483 Peanut oil Nutrition 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- 229940123333 Phosphodiesterase 5 inhibitor Drugs 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241001672814 Porcine teschovirus 1 Species 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- JRBWMRUPXWPEID-JYJNAYRXSA-N Pro-Trp-Cys Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(=O)O)C(=O)[C@@H]1CCCN1 JRBWMRUPXWPEID-JYJNAYRXSA-N 0.000 description 1
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 241000283080 Proboscidea <mammal> Species 0.000 description 1
- 235000019484 Rapeseed oil Nutrition 0.000 description 1
- 108010081734 Ribonucleoproteins Proteins 0.000 description 1
- 102000004389 Ribonucleoproteins Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- FKZSXTKZLPPHQU-GQGQLFGLSA-N Ser-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N FKZSXTKZLPPHQU-GQGQLFGLSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000287219 Serinus canaria Species 0.000 description 1
- 239000004147 Sorbitan trioleate Substances 0.000 description 1
- 229910000831 Steel Inorganic materials 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 235000019486 Sunflower oil Nutrition 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 102100032938 Telomerase reverse transcriptase Human genes 0.000 description 1
- 206010043276 Teratoma Diseases 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 1
- HEJJDUDEHLPDAW-CUJWVEQBSA-N Thr-His-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N)O HEJJDUDEHLPDAW-CUJWVEQBSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- BDYBHQWMHYDRKJ-UNQGMJICSA-N Thr-Phe-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N)O BDYBHQWMHYDRKJ-UNQGMJICSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 102100023935 Transmembrane glycoprotein NMB Human genes 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 1
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 1
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- CMXACOZDEJYZSK-XIRDDKMYSA-N Trp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CMXACOZDEJYZSK-XIRDDKMYSA-N 0.000 description 1
- 108060008683 Tumor Necrosis Factor Receptor Proteins 0.000 description 1
- 102100033728 Tumor necrosis factor receptor superfamily member 18 Human genes 0.000 description 1
- 101710165473 Tumor necrosis factor receptor superfamily member 4 Proteins 0.000 description 1
- 102100022153 Tumor necrosis factor receptor superfamily member 4 Human genes 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- LTOCXIVQWDANEX-UXCYUTBZSA-M [Br-].CCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)CCCN)OCCCCCCCC\C=C/CCCC.CC(C)CCCC(C)CCCC(C)CCCC(C)CC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CC(C)CCCC(C)CCCC(C)CCCC(C)C Chemical group [Br-].CCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)CCCN)OCCCCCCCC\C=C/CCCC.CC(C)CCCC(C)CCCC(C)CCCC(C)CC(=O)OC[C@H](COP(O)(=O)OCCN)OC(=O)CC(C)CCCC(C)CCCC(C)CCCC(C)C LTOCXIVQWDANEX-UXCYUTBZSA-M 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 229940021704 adenovirus vaccine Drugs 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 230000001476 alcoholic effect Effects 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- AZDRQVAHHNSJOQ-UHFFFAOYSA-N alumane Chemical class [AlH3] AZDRQVAHHNSJOQ-UHFFFAOYSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 125000000129 anionic group Chemical group 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 230000005911 anti-cytotoxic effect Effects 0.000 description 1
- 238000009175 antibody therapy Methods 0.000 description 1
- 229940045988 antineoplastic drug protein kinase inhibitors Drugs 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 229950002916 avelumab Drugs 0.000 description 1
- 206010064097 avian influenza Diseases 0.000 description 1
- 229960003005 axitinib Drugs 0.000 description 1
- 229960001212 bacterial vaccine Drugs 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229920001400 block copolymer Polymers 0.000 description 1
- 229960003150 bupivacaine Drugs 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 208000002458 carcinoid tumor Diseases 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 239000012159 carrier gas Substances 0.000 description 1
- 229960000590 celecoxib Drugs 0.000 description 1
- RZEKVGVHFLEQIL-UHFFFAOYSA-N celecoxib Chemical compound C1=CC(C)=CC=C1C1=CC(C(F)(F)F)=NN1C1=CC=C(S(N)(=O)=O)C=C1 RZEKVGVHFLEQIL-UHFFFAOYSA-N 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000003855 cell nucleus Anatomy 0.000 description 1
- 210000004520 cell wall skeleton Anatomy 0.000 description 1
- 230000007969 cellular immunity Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 229940099352 cholate Drugs 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 150000001841 cholesterols Chemical class 0.000 description 1
- BHQCQFFYRZLCQQ-OELDTZBJSA-N cholic acid Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 BHQCQFFYRZLCQQ-OELDTZBJSA-N 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 235000012343 cottonseed oil Nutrition 0.000 description 1
- 239000002385 cottonseed oil Substances 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 230000037029 cross reaction Effects 0.000 description 1
- 229960004397 cyclophosphamide Drugs 0.000 description 1
- 208000002445 cystadenocarcinoma Diseases 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 230000000120 cytopathologic effect Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 210000004443 dendritic cell Anatomy 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 210000004207 dermis Anatomy 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 150000002009 diols Chemical class 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- MVCOAUNKQVWQHZ-UHFFFAOYSA-N doramapimod Chemical compound C1=CC(C)=CC=C1N1C(NC(=O)NC=2C3=CC=CC=C3C(OCCN3CCOCC3)=CC=2)=CC(C(C)(C)C)=N1 MVCOAUNKQVWQHZ-UHFFFAOYSA-N 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 210000004954 endothelial membrane Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 229940125532 enzyme inhibitor Drugs 0.000 description 1
- 238000003114 enzyme-linked immunosorbent spot assay Methods 0.000 description 1
- 229960001433 erlotinib Drugs 0.000 description 1
- AAKJLRGGTJKAMG-UHFFFAOYSA-N erlotinib Chemical compound C=12C=C(OCCOC)C(OCCOC)=CC2=NC=NC=1NC1=CC=CC(C#C)=C1 AAKJLRGGTJKAMG-UHFFFAOYSA-N 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 229940011871 estrogen Drugs 0.000 description 1
- 239000000262 estrogen Substances 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 210000004996 female reproductive system Anatomy 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 229940044627 gamma-interferon Drugs 0.000 description 1
- 150000002270 gangliosides Chemical class 0.000 description 1
- 229960002584 gefitinib Drugs 0.000 description 1
- XGALLCVXEZPNRQ-UHFFFAOYSA-N gefitinib Chemical compound C=12C=C(OCCCN3CCOCC3)C(OC)=CC2=NC=NC=1NC1=CC=C(F)C(Cl)=C1 XGALLCVXEZPNRQ-UHFFFAOYSA-N 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 201000006604 granular cell tumor Diseases 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 108060003552 hemocyanin Proteins 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 229960002411 imatinib Drugs 0.000 description 1
- 208000023525 immature teratoma Diseases 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 239000012642 immune effector Substances 0.000 description 1
- 230000037451 immune surveillance Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000002434 immunopotentiative effect Effects 0.000 description 1
- 229960001438 immunostimulant agent Drugs 0.000 description 1
- 239000003022 immunostimulating agent Substances 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 238000000126 in silico method Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 229960005386 ipilimumab Drugs 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 229940119170 jojoba wax Drugs 0.000 description 1
- 210000002510 keratinocyte Anatomy 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 229940043355 kinase inhibitor Drugs 0.000 description 1
- 229960004891 lapatinib Drugs 0.000 description 1
- BCFGMOOMADDAQU-UHFFFAOYSA-N lapatinib Chemical compound O1C(CNCCS(=O)(=O)C)=CC=C1C1=CC=C(N=CN=C2NC=3C=C(Cl)C(OCC=4C=C(F)C=CC=4)=CC=3)C2=C1 BCFGMOOMADDAQU-UHFFFAOYSA-N 0.000 description 1
- 150000002605 large molecules Chemical class 0.000 description 1
- 235000012902 lepidium meyenii Nutrition 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 230000021633 leukocyte mediated immunity Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 230000001926 lymphatic effect Effects 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- VZCYOOQTPOCHFL-UPHRSURJSA-N maleic acid Chemical compound OC(=O)\C=C/C(O)=O VZCYOOQTPOCHFL-UPHRSURJSA-N 0.000 description 1
- 201000000289 malignant teratoma Diseases 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 208000037819 metastatic cancer Diseases 0.000 description 1
- 208000011575 metastatic malignant neoplasm Diseases 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- BMGQWWVMWDBQGC-IIFHNQTCSA-N midostaurin Chemical compound CN([C@H]1[C@H]([C@]2(C)O[C@@H](N3C4=CC=CC=C4C4=C5C(=O)NCC5=C5C6=CC=CC=C6N2C5=C43)C1)OC)C(=O)C1=CC=CC=C1 BMGQWWVMWDBQGC-IIFHNQTCSA-N 0.000 description 1
- 229950010895 midostaurin Drugs 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- RAHBGWKEPAQNFF-UHFFFAOYSA-N motesanib Chemical compound C=1C=C2C(C)(C)CNC2=CC=1NC(=O)C1=CC=CN=C1NCC1=CC=NC=C1 RAHBGWKEPAQNFF-UHFFFAOYSA-N 0.000 description 1
- 201000010879 mucinous adenocarcinoma Diseases 0.000 description 1
- 125000001446 muramyl group Chemical group N[C@@H](C=O)[C@@H](O[C@@H](C(=O)*)C)[C@H](O)[C@H](O)CO 0.000 description 1
- 210000004985 myeloid-derived suppressor cell Anatomy 0.000 description 1
- QMDUPVPMPVZZGK-UHFFFAOYSA-N n,n-dimethyloctadecan-1-amine;hydrobromide Chemical compound [Br-].CCCCCCCCCCCCCCCCCC[NH+](C)C QMDUPVPMPVZZGK-UHFFFAOYSA-N 0.000 description 1
- LBWFXVZLPYTWQI-IPOVEDGCSA-N n-[2-(diethylamino)ethyl]-5-[(z)-(5-fluoro-2-oxo-1h-indol-3-ylidene)methyl]-2,4-dimethyl-1h-pyrrole-3-carboxamide;(2s)-2-hydroxybutanedioic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O.CCN(CC)CCNC(=O)C1=C(C)NC(\C=C/2C3=CC(F)=CC=C3NC\2=O)=C1C LBWFXVZLPYTWQI-IPOVEDGCSA-N 0.000 description 1
- ZAJXXUDARPGGOC-UHFFFAOYSA-N n-[4-(3-chloro-4-fluoroanilino)-7-[3-methyl-3-(4-methylpiperazin-1-yl)but-1-ynyl]quinazolin-6-yl]prop-2-enamide Chemical compound C1CN(C)CCN1C(C)(C)C#CC1=CC2=NC=NC(NC=3C=C(Cl)C(F)=CC=3)=C2C=C1NC(=O)C=C ZAJXXUDARPGGOC-UHFFFAOYSA-N 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- HHZIURLSWUIHRB-UHFFFAOYSA-N nilotinib Chemical compound C1=NC(C)=CN1C1=CC(NC(=O)C=2C=C(NC=3N=C(C=CN=3)C=3C=NC=CC=3)C(C)=CC=2)=CC(C(F)(F)F)=C1 HHZIURLSWUIHRB-UHFFFAOYSA-N 0.000 description 1
- 210000004882 non-tumor cell Anatomy 0.000 description 1
- 239000012457 nonaqueous media Substances 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 239000007764 o/w emulsion Substances 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 150000002895 organic esters Chemical class 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 239000003002 pH adjusting agent Substances 0.000 description 1
- 201000008129 pancreatic ductal adenocarcinoma Diseases 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000000312 peanut oil Substances 0.000 description 1
- 229960002621 pembrolizumab Drugs 0.000 description 1
- 229940023041 peptide vaccine Drugs 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010047079 phenylalanyl-leucyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 239000002590 phosphodiesterase V inhibitor Substances 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000056 polyoxyethylene ether Polymers 0.000 description 1
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 229940023143 protein vaccine Drugs 0.000 description 1
- 230000036647 reaction Effects 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- RZJQGNCSTQAWON-UHFFFAOYSA-N rofecoxib Chemical compound C1=CC(S(=O)(=O)C)=CC=C1C1=C(C=2C=CC=CC=2)C(=O)OC1 RZJQGNCSTQAWON-UHFFFAOYSA-N 0.000 description 1
- 229960000371 rofecoxib Drugs 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 229960003310 sildenafil Drugs 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 229960003787 sorafenib Drugs 0.000 description 1
- IVDHYUQIDRJSTI-UHFFFAOYSA-N sorafenib tosylate Chemical compound [H+].CC1=CC=C(S([O-])(=O)=O)C=C1.C1=NC(C(=O)NC)=CC(OC=2C=CC(NC(=O)NC=3C=C(C(Cl)=CC=3)C(F)(F)F)=CC=2)=C1 IVDHYUQIDRJSTI-UHFFFAOYSA-N 0.000 description 1
- 235000019337 sorbitan trioleate Nutrition 0.000 description 1
- 229960000391 sorbitan trioleate Drugs 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000010959 steel Substances 0.000 description 1
- 239000002600 sunflower oil Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 229940034785 sutent Drugs 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 229960000835 tadalafil Drugs 0.000 description 1
- IEHKWSGCTWLXFU-IIBYNOLFSA-N tadalafil Chemical compound C1=C2OCOC2=CC([C@@H]2C3=C([C]4C=CC=CC4=N3)C[C@H]3N2C(=O)CN(C3=O)C)=C1 IEHKWSGCTWLXFU-IIBYNOLFSA-N 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 229940021747 therapeutic vaccine Drugs 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- JOXIMZWYDAKGHI-UHFFFAOYSA-N toluene-4-sulfonic acid Chemical compound CC1=CC=C(S(O)(=O)=O)C=C1 JOXIMZWYDAKGHI-UHFFFAOYSA-N 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 108091007466 transmembrane glycoproteins Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 102000003298 tumor necrosis factor receptor Human genes 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 210000004981 tumor-associated macrophage Anatomy 0.000 description 1
- WFKWXMTUELFFGS-UHFFFAOYSA-N tungsten Chemical compound [W] WFKWXMTUELFFGS-UHFFFAOYSA-N 0.000 description 1
- 229910052721 tungsten Inorganic materials 0.000 description 1
- 239000010937 tungsten Substances 0.000 description 1
- 229940121358 tyrosine kinase inhibitor Drugs 0.000 description 1
- 239000005483 tyrosine kinase inhibitor Substances 0.000 description 1
- 150000004917 tyrosine kinase inhibitor derivatives Chemical class 0.000 description 1
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- IYFNEFQTYQPVOC-UHFFFAOYSA-N udenafil Chemical compound C1=C(C=2NC=3C(CCC)=NN(C)C=3C(=O)N=2)C(OCCC)=CC=C1S(=O)(=O)NCCC1CCCN1C IYFNEFQTYQPVOC-UHFFFAOYSA-N 0.000 description 1
- 229960000438 udenafil Drugs 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 229950005972 urelumab Drugs 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 229960000241 vandetanib Drugs 0.000 description 1
- UHTHHESEBZOYNR-UHFFFAOYSA-N vandetanib Chemical compound COC1=CC(C(/N=CN2)=N/C=3C(=CC(Br)=CC=3)F)=C2C=C1OCC1CCN(C)CC1 UHTHHESEBZOYNR-UHFFFAOYSA-N 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 229940023148 viral-based vaccine Drugs 0.000 description 1
- 239000007762 w/o emulsion Substances 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 229940055760 yervoy Drugs 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/0005—Vertebrate antigens
- A61K39/0011—Cancer antigens
- A61K39/001169—Tumor associated carbohydrates
- A61K39/00117—Mucins, e.g. MUC-1
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/0005—Vertebrate antigens
- A61K39/0011—Cancer antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/0005—Vertebrate antigens
- A61K39/0011—Cancer antigens
- A61K39/001154—Enzymes
- A61K39/001157—Telomerase or TERT [telomerase reverse transcriptase]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K45/00—Medicinal preparations containing active ingredients not provided for in groups A61K31/00 - A61K41/00
- A61K45/06—Mixtures of active ingredients without chemical characterisation, e.g. antiphlogistics and cardiaca
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P1/00—Drugs for disorders of the alimentary tract or the digestive system
- A61P1/18—Drugs for disorders of the alimentary tract or the digestive system for pancreatic disorders, e.g. pancreatic enzymes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P15/00—Drugs for genital or sexual disorders; Contraceptives
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
- A61P37/04—Immunostimulants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4727—Mucins, e.g. human intestinal mucin
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/57—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/57—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2
- A61K2039/575—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2 humoral response
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/58—Medicinal preparations containing antigens or antibodies raising an immune response against a target which is not the antigen used for immunisation
- A61K2039/585—Medicinal preparations containing antigens or antibodies raising an immune response against a target which is not the antigen used for immunisation wherein the target is cancer
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/70—Multivalent vaccine
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/80—Vaccine for a specifically defined cancer
- A61K2039/812—Breast
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/80—Vaccine for a specifically defined cancer
- A61K2039/852—Pancreas
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/80—Vaccine for a specifically defined cancer
- A61K2039/892—Reproductive system [uterus, ovaries, cervix, testes]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K31/00—Medicinal preparations containing organic active ingredients
- A61K31/70—Carbohydrates; Sugars; Derivatives thereof
- A61K31/7088—Compounds having three or more nucleosides or nucleotides
- A61K31/711—Natural deoxyribonucleic acids, i.e. containing only 2'-deoxyriboses attached to adenine, guanine, cytosine or thymine and having 3'-5' phosphodiester links
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Epidemiology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Engineering & Computer Science (AREA)
- Mycology (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Oncology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Biophysics (AREA)
- Endocrinology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Reproductive Health (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
본 개시내용은 (a) 단리된 면역원성 TAA 폴리펩티드 (즉, 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 및 면역원성 TERT 폴리펩티드), (b) 1종 이상의 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자, (c) 면역원성 TAA 폴리펩티드 또는 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 백신 조성물, 및 (d) 폴리펩티드, 핵산 분자, 및 조성물의 용도에 관련된 방법을 제공한다.
Description
관련 출원에 대한 참조
본 출원은 2016년 1월 19일에 출원된 미국 가출원 번호 62/280,636 및 2016년 11월 8일에 출원된 미국 가출원 번호 62/419,190에 대해 우선권 주장한다. 상기 출원의 각각의 전체 내용은 본원에 참조로 포함된다.
발명의 분야
본 발명은 일반적으로 면역요법 및 구체적으로 신생물성 장애를 치료 또는 예방하기 위한 백신 및 방법에 관한 것이다.
암은 전세계적으로 사망의 주된 원인이다. 이는 다양한 기관, 예컨대 췌장, 난소, 유방, 폐, 결장 및 직장에서 발생할 수 있다. 췌장암은 미국에서 암 사망의 네번째로 가장 통상적인 원인이다. 췌장암은 췌장의 외분비 또는 내분비 구성요소에서 발생할 수 있다. 외분비암은 (1) 월등히 가장 통상적인 유형인 췌장 선암종, (2) 외분비 췌장암의 5%를 나타내는 세엽 세포 암종, (3) 췌장암의 1%를 차지하는 낭선암종, 및 (4) 다른 희귀한 형태의 암, 예컨대 췌모세포종, 선편평상피암종, 반지 세포 암종, 간세포양 암종, 콜로이드 암종, 미분화 암종, 및 파골세포-유사 거대 세포를 갖는 미분화 암종을 포함한다.
난소암은 여성 중에서의 암의 약 3%를 차지하지만, 이는 여성 생식계의 임의의 다른 암보다 더 많은 사망을 유발한다. 난소암은 (1) 상피암, 예컨대 상피 난소 암종; (2) 배세포암, 예컨대 미숙 기형종; 및 (3) 기질암, 예컨대 과립층 세포 종양을 포함한다.
유방암은 미국 여성 중에서 두번째로 가장 통상적인 암이며, 여성에서의 암 사망의 두번째로 주된 원인이다. 유방암은 호르몬 수용체 및 HER2/neu 스테이터스에 기초하여, 예컨대 (1) 호르몬 수용체-양성 암 (여기서 암 세포는 에스트로겐 수용체 또는 프로게스테론 수용체 중 어느 하나를 함유함); (2) 호르몬 수용체-음성 암 (여기서 암 세포는 에스트로겐 또는 프로게스테론 수용체 중 어느 하나를 갖지 않음); (3) HER2/neu 양성 (과량의 HER2/neu 단백질 또는 HER2/neu 유전자의 여분의 카피를 갖는 암); (4) HER2/neu 음성 암 (여기서 암은 과량의 HER2/neu를 갖지 않음); (5) 삼중-음성 암 (여기서 유방암 세포는 에스트로겐 수용체도, 프로게스테론 수용체도, 과량의 HER2도 갖지 않음); 및 (6) 삼중-양성 암 (여기서 암은 에스트로겐 수용체-양성, 프로게스테론 수용체-양성이고, 지나치게 많은 HER2를 가짐)으로 분류된다.
폐암은 모든 암 사망의 1/4 초과를 차지하며, 남성 및 여성 둘 다 중에서 암 사망의 월등히 주된 원인이다. 폐암의 가장 통상적인 유형은 비소세포 폐암 (NSCLC)이며, 이는 폐암의 약 85% 내지 90%를 차지한다. NSCLC는 여러 하위유형, 예컨대 편평 세포 (표피양) 암종, 선암종, 대세포 (미분화) 암종, 선편평상피암종, 및 육종양 암종으로 추가로 분류될 수 있다. 폐암의 두번째로 통상적인 유형은 소세포 폐암 (SCLC)이며, 이는 모든 폐암의 약 10% 내지 15%를 차지한다.
결장직장암 (CRC)은 남성 및 여성 둘 다를 결합할 경우 미국에서 암-관련된 사망의 두번째로 주된 원인이다. 선암종은 CRC의 가장 통상적인 유형이며, 결장직장암의 95% 초과를 차지한다. CRC의 다른 덜 통상적인 유형으로는 카르시노이드 종양, 위장관 기질 종양 (GIST), 림프종, 및 육종을 들 수 있다.
위암은 세계에서 암-관련된 사망의 세번째로 가장 통상적인 원인이다. 이는, 일차적으로 대부분의 환자가 진행된 질환을 나타내기 때문에 치료하기 곤란하게 남아 있다. 미국에서, 위암은 현재 열다섯번째로 가장 통상적인 암이다. 위암의 약 90-95%는 선암종이고; 다른 덜 통상적인 유형으로는 림프종 (4%), GIST, 및 카르시노이드 종양 (3%)을 들 수 있다.
암 관리의 전통적인 처방은 순환형 및 고형 암의 선택적 군의 관리에서 성공적이었다. 그러나, 많은 유형의 암은 전통적인 접근법에 대해 내성이다. 최근에, 암에 대한 면역요법, 특히 암 백신 및 항체 요법이 탐구되었다. 암 면역요법의 한 접근법은 면역원을 투여하여 표적 암 세포 상의 종양-연관된 항원 (TAA)에 대한 활성 전신 면역 반응을 생성하는 것을 포함한다. 다수의 종양-연관된 항원이 확인되었고, 이들 항원 중 많은 것이 암의 치료 또는 예방을 위한 바이러스-, 박테리아-, 단백질-, 펩티드-, 또는 DNA-기반 백신으로서 탐구되었지만, 지금까지의 대부분의 임상적 시도는 치료 생성물을 생성하는데 실패하였다. 따라서, 암의 치료 또는 예방에 사용될 수 있는 면역원에 대한 필요가 존재한다.
본 개시내용은 종양-연관된 항원 MUC1, 메소텔린, 및 TERT로부터 유래된 면역원, 뿐만 아니라 암 면역요법에 사용하기 위한 이러한 면역원을 포함하는 조성물에 관한 것이다.
인간 뮤신 1 (MUC1; 에피시알린, PEM, H23Ag, EMA, CA15-3, 및 MCA로도 알려짐)은 단순 및 선상 상피의 첨단 표면 상에 발현되는 다형성 막횡단 당단백질이다. MUC1 유전자는 신호 펩티드 서열을 포함하는 단일 폴리펩티드 쇄 전구체를 코딩한다. 번역 직후 신호 펩티드 서열은 제거되고, MUC1 전구체의 나머지 부분은 2개의 펩티드 단편으로 추가로 절단된다: 보다 긴 N-말단 서브유닛 (MUC1-N 또는 MUC1α) 및 보다 짧은 C-말단 서브유닛 (MUC1-C 또는 MUC1β). 성숙 MUC1은 안정한 수소 결합을 통해 회합된 MUC1-N 및 MUC1-C를 포함한다. 세포외 도메인인 MUC1-N은 20개 아미노산 잔기의 25 내지 125개 가변 수 탠덤 반복 (VNTR)을 함유한다. MUC1-C는 짧은 세포외 영역 (대략 53개 아미노산), 막횡단 도메인 (대략 28개 아미노산), 및 세포질 꼬리 (대략 72개 아미노산)를 함유한다. MUC1 (MUC1-CT)의 세포질 꼬리는 성장 인자 수용체 및 세포내 키나제에 의해 인산화되는 고도로 보존된 세린 및 티로신 잔기를 함유한다. 인간 MUC1은 상이한 유형의 MUC1 RNA 선택적 스플라이싱으로부터 초래되는 다중 이소형으로 존재한다. 전장 인간 MUC1 이소형 1 단백질 전구체 (이소형 1, 유니프롯(Uniprot) P15941-1)의 아미노산 서열은 서열식별번호(SEQ ID NO): 1 ("MUC1 이소형 1 참조 폴리펩티드")에 제공된다. 인간 MUC-1의 적어도 16가지의 다른 이소형이 지금까지 보고되었으며 (유니프롯 P15941-2 내지 P15941-17), 이는 이소형 1의 서열에 비해 다양한 삽입, 결실, 또는 치환을 포함한다. 이들 이소형은 이소형 2, 3, 4, 5, 6, Y, 8, 9, F, Y-LSP, S2, M6, ZD, T10, E2, 및 J13 (각각 유니프롯 P15941-2 내지 P15941-17)으로 알려져 있다. 전장 인간 MUC1 이소형 1 전구체 단백질은 1255개의 아미노산으로 이루어지며, 이는 아미노산 1-23에 신호 펩티드 서열을 포함한다. 성숙 MUC1 단백질의 MUC1-N 및 MUC1-C 도메인은 각각 아미노산 24-1097 및 1098-1255로 이루어진다.
메소텔린 (MSLN으로도 알려짐)은 흉막, 복막 및 심장막에 라이닝된 세포의 표면 상에 존재하는 막-결합된 당단백질이며, 중피종, 난소, 및 췌장 선암종을 포함한 여러 인간 종양에서 과다발현된다. 메소텔린 유전자는 40-kDa 메소텔린 단백질 및 분비된 거대핵세포 강화 인자 (MPF) 단백질로 프로세싱되는 71-킬로달톤 (kDa) 전구체 단백질을 코딩한다 (Chang, et al., Proc Natl Acad Sci USA (1996) 93:136-40). MSLN 유전자의 선택적 스플라이싱은 적어도 4가지의 메소텔린 이소형을 초래한다. 이소형 1 (유니프롯 Q13421-1), 이소형 2 (유니프롯 Q13421-3), 이소형 3 (유니프롯 Q13421-2), 및 이소형 4 (유니프롯 Q13421-4)의 아미노산 서열은 유니프롯 (www.uniprot.org)에서 이용가능하다. 622개의 아미노산으로 이루어진 전장 인간 MSLN 이소형 2 전구체 단백질 (유니프롯 식별자 Q13421-3)의 아미노산 서열은 서열식별번호: 2 ("메소텔린 전구체 이소형 2 참조 폴리펩티드")에 제공된다. MSLN의 세포질 부분은 서열식별번호: 2의 아미노산 잔기 37 내지 597을 포함한다. 이소형 2는 MSLN의 주요 형태이다. 630개의 아미노산으로 이루어진 이소형 1은 이소형 2 서열의 위치 409에 8개의 아미노산 (PQAPRRPL)의 삽입을 갖는 것에 의해 이소형 2와 상이하다. 이소형 3은 대안적 C 말단 (이소형 2의 위치 593-622에)을 갖는 반면, 이소형 4는 이소형 2에 비해 아미노산 44의 결실을 갖는다. 이소형 2는 초기에 622개-아미노산 전구체로서 번역되며, 이는 N-말단에 신호 펩티드 서열 (아미노산 1-36) 및 C-말단에 GPI-앵커 서열을 포함한다. 신호 펩티드 서열 및 GPI-앵커 서열은 성숙 메소텔린에서 절단되어질 수 있다.
텔로머라제 역전사효소 (또는 TERT)는 텔로미어 반복 TTAGGG의 첨가에 의해 텔로미어 말단의 유지를 담당하는 리보핵단백질 폴리머라제인 텔로머라제의 촉매적 구성요소이다. TERT 외에도, 텔로머라제는 또한 텔로미어 반복에 대한 주형으로서 기능하는 RNA 구성요소를 포함한다. 인간 TERT 유전자는 1132개의 아미노산 단백질을 코딩한다. 인간 TERT의 여러 이소형이 존재하며, 이는 선택적 스플라이싱으로부터 초래된다. 이소형 1, 이소형 2, 이소형 3, 및 이소형 4의 아미노산 서열은 유니프롯 (<www.uniprot.org>; 각각 유니프롯 식별자 O14746-1, O14746-2, O14746-3, 및 O14746-4)에서 이용가능하다. 인간 전장 TERT 이소형 1 단백질 (이소형 1, 진뱅크(Genbank) AAD30037, 유니프롯 O14746-1)의 아미노산 서열은 또한 본원에서 서열식별번호: 3 ("TERT 이소형 1 참조 폴리펩티드")에 제공된다. TERT 이소형 1 (O14746-1)에 비해, 이소형 2 (O14746-2)는 아미노산 764-807의 대체 (STLTDLQPYM...LNEASSGLFD → LRPVPGDPAG...AGRAAPAFGG) 및 C-말단 아미노산 808-1132의 결실을 갖고, 이소형 3 (O14746-3)은 아미노산 885-947의 결실을 갖고, 이소형 4 (O14746-4)는 아미노산 711-722 및 808-1132의 결실, 및 아미노산 764-807의 대체 (STLTDLQPYM...LNEASSGLFD → LRPVPGDPAG...AGRAAPAFGG)를 갖는다.
일부 측면에서, 본 개시내용은 예를 들어, 생체내에서 (예를 들어 인간을 포함한 동물에서) 면역 반응을 도출하는데, 또는 암의 치료용 백신에서의 구성요소로서 사용하는데 유용한 MUC1, MSLN, 및 TERT로부터 유래된 단리된 면역원성 폴리펩티드를 제공한다.
다른 측면에서, 본 개시내용은 본 개시내용에 의해 제공된 면역원성 폴리펩티드를 코딩하는 핵산 분자를 제공한다. 일부 실시양태에서, 본 개시내용은 각각 2종, 3종, 또는 그 초과의 면역원성 폴리펩티드를 코딩하는 다중-항원 핵산 구축물을 제공한다.
본 개시내용은 또한 본 발명의 1종 이상의 핵산 분자를 함유하는 벡터를 제공한다. 벡터는 핵산 분자에 의해 코딩되는 면역원성 TAA 폴리펩티드를 클로닝하거나 발현하는데, 또는 조성물, 예컨대 백신 중의 핵산 분자를 숙주 세포에 또는 숙주 동물 또는 인간에게 전달하는데 유용하다.
일부 추가의 측면에서, 본 개시내용은 1종 이상의 면역원성 TAA 폴리펩티드, 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자, 또는 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자를 함유하는 벡터 또는 플라스미드를 포함하는 조성물을 제공한다. 일부 실시양태에서, 조성물은 포유동물, 예컨대 마우스, 개, 원숭이, 또는 인간에서 TAA에 대한 면역 반응을 도출하는데 유용한 면역원성 조성물이다. 일부 실시양태에서, 조성물은 포유동물, 예컨대 인간의 면역화에, 비정상적 세포 증식을 억제하는데, 암의 발생에 대한 보호를 제공하는데 (예방제로서 사용됨), 또는 TAA 과다-발현과 연관된 장애, 예컨대 암, 특히 췌장, 난소, 및 삼중-음성 유방암의 치료에 (치료제로서 사용됨) 유용한 백신 조성물이다.
추가의 다른 측면에서, 본 개시내용은 면역원성 TAA 폴리펩티드, 단리된 핵산 분자, 및 상기 본원에 기재된 면역원성 TAA 폴리펩티드 또는 단리된 핵산 분자를 포함하는 조성물을 사용하는 방법을 제공한다. 일부 실시양태에서, 본 개시내용은 표적 TAA에 대해 면역원성인 본 발명에 의해 제공된 폴리펩티드의 유효량, 이러한 면역원성 폴리펩티드를 코딩하는 단리된 핵산 분자, 또는 이러한 면역원성 TAA 폴리펩티드 또는 이러한 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 조성물의 유효량을 포유동물에게 투여하는 것을 포함하는, 포유동물, 특히 인간에서 TAA에 대한 면역 반응을 도출하는 방법을 제공한다. 폴리펩티드 또는 핵산 백신은 1종 이상의 아주반트 또는 면역 조정제와 함께 사용될 수 있다.
A. 정의
용어 "아주반트"는 백신 면역원에 의해 도출되는 면역 반응을 증강시키거나, 가속화하거나, 연장시킬 수 있는 물질을 지칭한다.
용어 "효능제"는 또 다른 분자 (예컨대 수용체)의 활성을 촉진시키는 (유도하거나, 유발하거나, 증강시키거나, 증가시키는) 물질을 지칭한다. 용어 효능제는 수용체에 결합하는 물질 및 그에 결합하지 않고 수용체 기능을 촉진시키는 물질을 포괄한다.
용어 "길항제" 또는 "억제제"는 또 다른 분자 또는 수용체의 생물학적 활성을 부분적으로 또는 전체적으로 차단하거나, 억제하거나, 중화시키는 물질을 지칭한다.
용어 "공동-투여"는 치료 기간 동안 동일한 대상체에게 2종 이상의 작용제의 투여를 지칭한다. 2종 이상의 작용제는 단일 제제에 포함될 수 있으며, 따라서, 동시에 투여될 수 있다. 대안적으로, 2종 이상의 작용제는 별개의 물리적 제제에 있고, 대상체에게 순차적으로 또는 동시에 별개로 투여될 수 있다. 용어 "동시에 투여되는" 또는 "동시 투여"는 제1 작용제의 투여 및 제2 작용제의 투여가 시간적으로 서로 중첩되는 것을 의미하는 반면, 용어 "순차적으로 투여되는" 또는 "순차적 투여"는 제1 작용제의 투여 및 제2 작용제의 투여가 시간적으로 서로 중첩되지 않는 것을 의미한다.
용어 "세포액성" 또는 "세포질성"은 특정한 폴리펩티드를 코딩하는 뉴클레오티드 서열이 숙주 세포에 의해 발현된 후, 발현된 폴리펩티드가 숙주 세포 내부에 보유될 것으로 예상되는 것을 의미한다.
용어 "축중성 변이체"는 염기의 치환을 갖지만 동일한 폴리펩티드를 코딩하는 핵산 서열을 지칭한다.
용어 "유효량"은 포유동물에서 바람직한 효과를 유발하는데 충분한 포유동물에게 투여되는 양을 지칭한다.
용어 주어진 폴리펩티드의 "단편"은 주어진 폴리펩티드보다 짧고, 주어진 폴리펩티드의 서열과 100% 동일성을 공유하는 폴리펩티드를 지칭한다.
용어 면역원성 TAA 폴리펩티드의 "기능적 변이체"는 참조 면역원성 TAA 폴리펩티드의 아미노산의 수의 90% 내지 110%를 포함하고, 참조 TAA 폴리펩티드의 아미노산 서열과 100%보다 낮지만 95%보다 높은 동일성을 가지며, 참조 면역원성 TAA 폴리펩티드의 동일하거나 유사한 면역원성 특성을 소유하는 폴리펩티드를 지칭한다.
용어 "동일한"은 각각 뉴클레오티드 또는 아미노산의 정확히 동일한 서열을 공유하는 2개 이상의 핵산, 또는 2개 이상의 폴리펩티드를 지칭한다. 용어 "퍼센트 동일성"은 2개 이상의 핵산 또는 폴리펩티드 사이의 유사성의 수준을 기재한다. 2개의 서열이 생물정보학 소프트웨어에 의해 정렬되는 경우, "퍼센트 동일성"은 서열 사이의 정확한 뉴클레오티드/아미노산 매치의 수에 100을 곱하고, 갭을 포함한 정렬된 영역의 길이로 나눔으로써 계산된다. 예를 들어, 10개의 미스매치를 나타내는 2개의 100개-아미노산 길이 폴리펩티드는 90% 동일할 것이다.
용어 "면역-이펙터-세포 증강제" 또는 "IEC 증강제"는 포유동물의 1종 이상의 유형의 면역 이펙터 세포의 수, 품질, 및/또는 기능을 증가시키고/거나 증강시킬 수 있는 물질을 지칭한다. 면역 이펙터 세포의 예로는 세포용해성 CD8 T 세포, CD4 T 세포, NK 세포, 및 B 세포를 들 수 있다.
용어 "면역 조정제"는 포유동물의 선천성, 체액성, 또는 세포성 면역계의 임의의 구성요소의 작용 또는 기능을 변경시킬 (예를 들어, 억제하거나, 감소시키거나, 증가시키거나, 증강시키거나, 자극할) 수 있는 물질을 지칭한다. 따라서, 용어 "면역 조정제"는 본원에 정의된 바와 같은 "면역-이펙터-세포 증강제" 및 본원에 정의된 바와 같은 "면역-억제성-세포 억제제", 뿐만 아니라 포유동물의 면역계의 임의의 다른 구성요소에 영향을 미치는 물질을 포괄한다.
용어 "면역 반응"은 선천성 면역 반응 (예를 들어, 톨-유사 수용체 신호화 캐스케이드의 활성화), 세포-매개된 면역 반응 (예를 들어, 면역계의 T 세포, 예컨대 항원-특이적 T 세포, 및 비-특이적 세포에 의해 매개되는 반응), 및 체액성 면역 반응 (예를 들어, B 세포에 의해 매개되는 반응, 예컨대 혈장, 림프, 및/또는 조직 유체 내로의 항체의 생성 및 분비)을 포함하나 이에 제한되지 않는 숙주 척추 동물의 면역계에 의한 특정한 물질 (예컨대 항원 또는 면역원)에 대한 임의의 검출가능한 반응을 지칭한다. 면역 반응의 예로는 톨-유사 수용체 활성화의 변경 (예를 들어, 증가), 림포카인 (예를 들어, 시토카인 (예를 들어, Th1, Th2 또는 Th17 유형 시토카인) 또는 케모카인) 발현 또는 분비, 대식세포 활성화, 수지상 세포 활성화, T 세포 (예를 들어, CD4+ 또는 CD8+ T 세포) 활성화, NK 세포 활성화, B 세포 활성화 (예를 들어, 항체 생성 및/또는 분비), MHC 분자에의 면역원 (예를 들어, 항원, 면역원성 폴리펩티드)의 결합, 세포독성 T 림프구 ("CTL") 반응의 유도, B 세포 반응의 유도 (예를 들어, 항체 생산), 및 면역계의 세포 (예를 들어, T 세포 및 B 세포)의 팽장 (예를 들어, 세포의 집단의 성장), 및 항원 제시 세포에 의한 항원의 증가된 프로세싱 및 제시를 들 수 있다. 용어 "면역 반응"은 또한 시험관내에서 척추 동물의 면역계의 하나 이상의 구성요소에 의한 특정한 물질 (예컨대 항원 또는 면역원)에 대한 임의의 검출가능한 반응을 포괄한다.
용어 "면역원"은 면역원성인 물질을 지칭한다.
용어 "면역원성"은 아주반트의 존재 또는 부재 하에서, 단독으로든지 또는 담체에 결합되는 경우든지, 포유동물 (예컨대 인간)에의 투여 시 면역 반응을 도출하거나, 도출하거나, 자극하거나, 유도하는, 또는 포유동물에서 특정한 항원에 대한 기존의 면역 반응을 개선시키거나, 증강시키거나, 증가시키거나, 연장시키는 물질의 능력을 지칭한다.
용어 "면역원성 조성물"은 면역원성인 조성물을 지칭한다.
용어 "면역원성 MUC1 폴리펩티드"는 인간 천연 MUC1 단백질에 대해 또는 인간 천연 MUC1 단백질을 발현하는 세포에 대해 면역원성인 폴리펩티드를 지칭한다. 폴리펩티드는 인간 천연 MUC1 단백질의 것과 동일한 아미노산 서열을 갖거나, 인간 천연 MUC1 단백질의 아미노산 서열에 비해 1개 이상의 돌연변이를 나타낼 수 있다.
용어 "면역원성 MSLN 폴리펩티드"는 인간 천연 MSLN 단백질에 대해 또는 인간 천연 MSLN 단백질을 발현하는 세포에 대해 면역원성인 폴리펩티드를 지칭한다. 폴리펩티드는 인간 천연 MSLN 단백질의 것과 동일한 아미노산 서열을 갖거나, 인간 천연 MSLN 단백질의 아미노산 서열에 비해 1개 이상의 돌연변이를 나타낼 수 있다.
용어 "면역원성 TERT 폴리펩티드"는 인간 천연 TERT 단백질에 대해 또는 인간 천연 TERT 단백질을 발현하는 세포에 대해 면역원성인 폴리펩티드를 지칭한다. 폴리펩티드는 인간 천연 TERT 단백질의 것과 동일한 아미노산 서열을 갖거나, 인간 천연 TERT 단백질의 아미노산 서열에 비해 1개 이상의 돌연변이를 나타낼 수 있다.
용어 "면역원성 TAA 폴리펩티드"는 각각 상기 본원에 정의된 바와 같은 "면역원성 MSLN 폴리펩티드", "면역원성 MUC1 폴리펩티드", 또는 "면역원성 TERT 폴리펩티드"를 지칭한다.
용어 "면역원성 MUC1 핵산 분자"는 본원에 정의된 바와 같은 "면역원성 MUC1 폴리펩티드"를 코딩하는 핵산 분자를 지칭한다.
용어 "면역원성 MSLN 핵산 분자"는 본원에 정의된 바와 같은 "면역원성 MSLN 폴리펩티드"를 코딩하는 핵산 분자를 지칭한다.
용어 "면역원성 TERT 핵산 분자"는 본원에 정의된 바와 같은 "면역원성 TERT 폴리펩티드"를 코딩하는 핵산 분자를 지칭한다.
용어 "면역원성 TAA 핵산 분자"는 상기 본원에 정의된 바와 같은 "면역원성 MUC1 폴리펩티드", "면역원성 MSLN 폴리펩티드", 또는 "면역원성 TERT 폴리펩티드"를 코딩하는 핵산 분자를 지칭한다.
용어 "면역-억제성-세포 억제제" 또는 "ISC 억제제"는 포유동물의 면역 억제성 세포 수 및/또는 기능을 감소시키고/거나 억제할 수 있는 물질을 지칭한다. 면역 억제성 세포의 예로는 조절 T 세포 ("Treg"), 골수성-유래된 서프레서 세포, 및 종양-연관된 대식세포를 들 수 있다.
용어 "포유동물"은 포유동물 강의 임의의 동물 종을 지칭한다. 포유동물의 예로는 인간; 비-인간 영장류, 예컨대 원숭이; 실험실 동물, 예컨대 래트, 마우스, 기니 피그; 가축, 예컨대 고양이, 개, 토끼, 소, 양, 염소, 말, 및 돼지; 및 포획된 야생 동물, 예컨대 사자, 호랑이, 코끼리 등을 들 수 있다.
용어 "막-결합된"은 특정한 폴리펩티드를 코딩하는 뉴클레오티드 서열이 숙주 세포에 의해 발현된 후, 발현된 폴리펩티드가 세포의 막에 결합되거나, 그에 부착되거나, 다르게는 그와 회합되는 것을 의미한다.
용어 "신생물성 장애"는 세포가 비정상적으로 높고 비제어된 속도로 증식하는 상태를 지칭하며, 속도는 주위의 정상 조직의 것을 초과하고 그와 조화되지 않는다. 이는 통상적으로 "종양"으로 알려진 고형 병변 또는 덩어리를 초래한다. 이 용어는 양성 및 악성 신생물성 장애를 포괄한다. 본 개시내용에서 용어 "암"과 호환적으로 사용되는 용어 "악성 신생물성 장애"는 신체의 다른 위치로 확산되는 종양 세포의 능력 ("전이"로 알려짐)을 특징으로 하는 신생물성 장애를 지칭한다. 용어 "양성 신생물성 장애"는 종양 세포가 전이되는 능력이 결여된 신생물성 장애를 지칭한다.
용어 "돌연변이"는 참조 단백질 또는 폴리펩티드의 아미노산 서열에 비해 단백질 또는 폴리펩티드의 아미노산 서열에서의 아미노산 잔기의 결실, 부가, 또는 치환을 지칭한다.
용어 "작동적으로 연결된"은 기재된 구성요소가 그들을 그들의 의도된 방식으로 기능하는 것을 허용하는 관계에 있는 병치를 지칭한다. 트랜스진에 "작동적으로 연결된" 제어 서열은 트랜스진의 발현이 제어 서열과 혼화성인 조건 하에서 달성되도록 하는 방식으로 라이게이션된다.
용어 "제약 조성물"은 바람직한 생리학적, 약리학적, 또는 치료 효과를 도출하기 위한 대상체 (예를 들어 인간 환자)에의 투여에 적합한 고체 또는 액체 조성물을 지칭한다. 1종 이상의 활성 성분을 함유하는 것 외에도, 제약 조성물은 1종 이상의 제약상 허용되는 부형제를 함유할 수 있다.
용어 "제약상 허용되는 부형제"는 활성 성분과 혼화성이며 그것이 투여되는 대상체에서 유의한 부작용을 유발하지 않는 활성 성분 (예를 들어, 항원, 항원-코딩 핵산, 면역 조정제, 또는 아주반트) 이외의 면역원성, 제약, 또는 백신 조성물 중의 물질을 지칭한다.
용어 "펩티드", "폴리펩티드", 및 "단백질"은 본원에서 호환적으로 사용되며, 코딩된 및 비-코딩된 아미노산, 화학적으로, 또는 생화학적으로 변형되거나 유도체화된 아미노산, 및 변형된 폴리펩티드 백본을 갖는 폴리펩티드를 포함할 수 있는, 임의의 길이의 아미노산의 중합체성 형태를 지칭한다.
용어 "예방하는" 또는 "예방하다"는 (a) 장애가 발생하지 못하게 하거나, (b) 장애의 발병 또는 장애의 증상의 발병을 지연시키는 것을 지칭한다.
폴리펩티드의 맥락에서 용어 "분비된"은 폴리펩티드를 코딩하는 뉴클레오티드 서열이 숙주 세포에 의해 발현된 후, 발현된 폴리펩티드가 숙주 세포의 외부로 분비되는 것을 의미한다.
면역 조정제, 예컨대 단백질 키나제 억제제의 양을 기재하는데 사용되는 경우 용어 "부최적 용량"은 면역조정제가 단독으로 환자에게 투여되는 경우 치료되는 질환에 대한 바람직한 치료 효과를 생성하는데 요구되는 최소 양 미만인 면역조정제의 용량을 지칭한다.
용어 "대상체"는 인간 또는 비-인간 포유동물을 지칭한다.
용어 "치료하는", "치료", 또는 "치료하다"는 장애를 없애거나, 장애의 중증도를 감소시키거나, 장애의 증상의 중증도 또는 발생 빈도를 감소시키는 것을 지칭한다.
용어 "종양-연관된 항원" 또는 "TAA"는 종양 세포에 의해 특이적으로 발현되거나, 동일한 조직 유형의 비-종양 세포에 의해서보다 종양 세포에 의해서 보다 높은 빈도 또는 밀도로 발현되는 항원을 지칭한다. 종양-연관된 항원은 숙주에 의해 정상적으로 발현되지 않는 항원일 수 있거나; 이들은 돌연변이되거나, 말단절단되거나, 미스폴딩되거나, 다르게는 숙주에 의해 정상적으로 발현되는 분자의 비정상적 징후일 수 있거나; 이들은 정상적으로 발현되지만, 비정상적으로 높은 수준으로 발현되는 분자와 동일할 수 있거나; 이들은 비정상적인 맥락 또는 환경에서 발현될 수 있다. 종양-연관된 항원은 예를 들어, 단백질 또는 단백질 단편, 복합 탄수화물, 강글리오시드, 합텐, 핵산, 또는 이들 또는 다른 생물학적 분자의 임의의 조합일 수 있다.
용어 "백신"은 특정한 항원 또는 항원들에 대해 보호적 면역 반응을 도출하기 위한 포유동물 (예컨대 인간)에의 투여를 위한 면역원성 조성물을 지칭한다. 백신의 1차 활성 성분은 면역원(들)이다.
용어 "벡터"는 외래 핵산 분자를 숙주 세포 내로 수송하거나 전달할 수 있는 핵산 분자, 또는 변형된 미생물을 지칭한다. 외래 핵산 분자는 "삽입물" 또는 "트랜스진"으로 지칭된다. 벡터는 일반적으로 삽입물 및 벡터의 백본으로서 기능하는 보다 큰 서열로 이루어진다. 벡터의 구조 또는 기원에 기초하여, 벡터의 주요 유형으로는 플라스미드 벡터, 코스미드 벡터, 파지 벡터 (예컨대 람다 파지), 바이러스 벡터 (예컨대 아데노바이러스 벡터), 인공 염색체, 및 박테리아 벡터를 들 수 있다.
B. 면역원성 종양-연관된-항원 (TAA) 폴리펩티드
일부 측면에서, 본 개시내용은 생체내에서 (예를 들어 인간을 포함한 동물에서) 또는 시험관내에서 면역 반응을 도출하거나, 이펙터 T 세포를 활성화시키거나, 각각 MUC1, TERT, 및 MSLN에 특이적인 항체를 생성하는데, 또는 암, 예컨대 췌장, 난소, 및 유방암, 특히 삼중-음성 유방암의 치료용 백신 중의 구성요소로서 사용하는데 유용한 단리된 면역원성 MUC1 폴리펩티드, TERT 폴리펩티드, 및 MSLN 폴리펩티드를 제공한다.
이들 면역원성 TAA 폴리펩티드는 본 개시내용을 고려하여 관련 기술분야에 알려진 방법에 의해 제조될 수 있다. 면역 반응을 도출하는 폴리펩티드의 능력은 시험관내 검정 또는 생체내 검정에서 측정될 수 있다. 면역 반응을 도출하는 폴리펩티드 또는 DNA 구축물의 능력을 결정하기 위한 시험관내 검정은 관련 기술분야에 알려져 있다. 이러한 시험관내 검정의 한 예는 그의 개시내용이 본 출원에 포함되는 미국 특허 7,387,882에 기재된 바와 같은 T 세포 반응을 자극하는 폴리펩티드 또는 폴리펩티드를 발현하는 핵산의 능력을 측정하는 것이다. 검정 방법은 (1) 배양물 중의 항원 제시 세포를 항원과 접촉시킴으로써, 항원이 항원 제시 세포에 의해 흡수되고 프로세싱되어, 1종 이상의 프로세싱된 항원을 생성할 수 있게 하는 단계; (2) 항원 제시 세포를 T 세포가 프로세싱된 항원 중 1종 이상에 반응하는데 충분한 조건 하에서 T 세포와 접촉시키는 단계; (3) T 세포가 프로세싱된 항원 중 1종 이상에 반응하는지 여부를 결정하는 단계를 포함한다. 사용되는 T 세포는 CD8+ T 세포 또는 CD4+ T 세포일 수 있다. T 세포 반응은 시토카인 중 1종 이상, 예컨대 인터페론-감마 및 인터류킨-2의 방출, 항원 제시 세포 (종양 세포)의 용해를 측정함으로써 결정될 수 있다. B 세포 반응은 항체의 생산을 측정함으로써 결정될 수 있다.
B-1. 면역원성 MUC1 폴리펩티드
한 측면에서, 본 개시내용은 인간 천연 MUC1로부터 유래된 단리된 면역원성 MUC1 폴리펩티드를 제공하며, 여기서 MUC1 폴리펩티드는 인간 천연 MUC1 단백질에 비해 1개 이상의 도입된 돌연변이를 나타낸다. 돌연변이의 예로는 MUC1 단백질의 VNTR 영역에서의 20개 아미노산의 탠덤 반복의 일부 (그러나 전부는 아님)의 결실, 전체적으로 또는 부분적으로 신호 펩티드 서열의 결실, 및 MUC1 이소형에서 발견되는 비-컨센서스 아미노산 서열의 아미노산의 결실을 포함한다. 따라서, 일부 실시양태에서, 본 개시내용에 의해 제공된 면역원성 MUC1 폴리펩티드는 (1) 인간 MUC1 단백질의 20개 아미노산의 3 내지 30개 탠덤 반복의 아미노산 서열 및 (2) VNTR 영역에 플랭킹된 인간 MUC1 단백질의 아미노산 서열을 포함한다. 일부 특정한 실시양태에서, 면역원성 MUC1 폴리펩티드는 (1) 인간 MUC1의 5 내지 25개 탠덤 반복의 아미노산 서열 및 (2) VNTR 영역에 플랭킹된 인간 MUC1 단백질의 아미노산 서열을 포함한다. 일부 추가의 실시양태에서, 면역원성 MUC1 폴리펩티드는 세포질성 형태 (또는 "cMUC1")이다. 용어 "세포질성 형태"는 인간 천연 MUC1 단백질의 분비 서열 (아미노산 1-23; "신호 펩티드 서열"로도 알려짐)이 전체적으로 또는 부분적으로 결여된 면역원성 MUC1 폴리펩티드를 지칭한다. 분비 서열의 아미노산의 결실은 폴리펩티드가 그것이 세포에서 발현되기 때문에 분비 경로로 진입하는 것을 방지할 것으로 예상된다. 일부 다른 실시양태에서, 면역원성 MUC1 폴리펩티드는 MUC1의 막-결합된 형태의 아미노산 서열을 포함한다.
본 개시내용에 의해 제공된 면역원성 MUC1 폴리펩티드는 예를 들어, 유니프롯 이소형 1, 2, 3, 4, 5, 6, Y, 8, 9, F, Y-LSP, S2, M6, ZD, T10, E2, 및 J13 (각각 유니프롯 P15941-1 내지 P15941-17)을 포함한 관련 기술분야에 알려져 있거나, 장래에 발견될 인간 MUC1 이소형 중 임의의 것의 아미노산 서열로부터 유래되거나, 구축되거나, 제조될 수 있다. 일부 실시양태에서, 면역원성 MUC1 폴리펩티드는 인간 MUC1 이소형 1의 일부인 아미노산 서열을 포함하며, 여기서 인간 MUC1 이소형 1의 아미노산 서열은 서열식별번호: 1에 제시된다. 구체적 실시양태에서, 면역원성 MUC1 폴리펩티드는 서열식별번호: 1의 아미노산 서열의 아미노산 24-225 및 1098-1255를 포함한다. 또 다른 구체적 실시양태에서, 면역원성 MUC1 폴리펩티드는 서열식별번호: 1의 아미노산 서열의 아미노산 22-225 및 946-1255를 포함한다. 일부 다른 구체적 실시양태에서, 면역원성 MUC1 폴리펩티드는 하기로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하거나, 또는 그로 이루어진다:
(1) 서열식별번호: 8의 아미노산 서열 (플라스미드 1027 폴리펩티드);
(2) 서열식별번호: 8의 아미노산 4-537을 포함하는 아미노산 서열;
(3) 서열식별번호: 8의 아미노산 24-537을 포함하는 아미노산 서열;
(4) 서열식별번호: 16의 아미노산 서열 (플라스미드 1197 폴리펩티드);
(5) 서열식별번호: 16의 아미노산 4-517을 포함하는 아미노산 서열; 및
(6) 서열식별번호: 16의 아미노산 4-517을 포함하며, 여기서 서열식별번호: 16에서 위치 513에서의 아미노산이 T인 아미노산 서열.
일부 구체적 실시양태에서, 면역원성 MUC1 폴리펩티드는 서열식별번호: 8 (플라스미드 1027 폴리펩티드) 또는 서열식별번호: 16 (플라스미드 1197 폴리펩티드)의 아미노산 서열을 포함한다.
B-2. 면역원성 MSLN 폴리펩티드
한 측면에서, 본 개시내용은 MSLN 전구체의 부분 또는 전체 신호 펩티드 서열의 결실에 의해 인간 MSLN 전구체로부터 유래된 단리된 면역원성 MSLN 폴리펩티드를 제공한다. 따라서, 면역원성 MSLN 폴리펩티드는 천연 인간 MSLN 전구체의 아미노산 서열을 포함하며, 여기서 MSLN 전구체의 일부 또는 전체 신호 펩티드 서열은 부재한다. 일부 실시양태에서, 천연 인간 MSLN의 GPI 앵커 서열 (즉, 서열식별번호: 2의 아미노산 598-622)의 일부, 또는 전체는 또한 면역원성 MSLN 폴리펩티드에 부재한다. 본원에 사용된 용어 "인간 MSLN"은 임의의 인간 MSLN 이소형, 예컨대 이소형 1, 2, 3, 또는 4를 포괄한다. 일부 특정한 실시양태에서, 인간 MSLN은 인간 MSLN 이소형 2이다.
일부 특정한 실시양태에서, 단리된 면역원성 MSLN 폴리펩티드는 하기로 이루어진 군으로부터 선택된다:
1) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597을 포함하거나, 또는 그로 이루어진 폴리펩티드;
2) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597로 이루어진 아미노산 서열과 적어도 90%, 95%, 98%, 또는 99% 동일한 아미노산 서열을 포함하는 폴리펩티드;
3) 서열식별번호: 6의 아미노산 서열, 또는 서열식별번호: 6의 아미노산 서열의 아미노산 4-564를 포함하거나, 또는 그로 이루어진 폴리펩티드; 및
4) 서열식별번호: 6의 아미노산 서열과 적어도 93% - 99%, 94% - 98%, 또는 94% - 97% 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드 ("플라스미드 1103 폴리펩티드").
B-3. 면역원성 TERT 폴리펩티드
또 다른 측면에서, 본 개시내용은 TERT 단백질의 N-말단 아미노산의 600개 이하의 결실에 의해 인간 TERT 단백질로부터 유래된 단리된 면역원성 TERT 폴리펩티드를 제공한다. 따라서, 일부 실시양태에서, 면역원성 TERT 폴리펩티드는 서열식별번호: 3에 제시된 TERT 이소형 1의 아미노산 서열을 포함하며, 여기서 TERT 이소형 1의 아미노산 서열의 N-말단 (아미노 말단)으로부터의 약 600개 이하의 아미노산은 부재한다. TERT 이소형 1의 N-말단으로부터의 600개 이하의 아미노산의 임의의 수는 면역원성 TERT 폴리펩티드에 부재할 수 있다. 예를 들어, 서열식별번호: 3의 TERT 이소형 1의 위치 1 내지 위치 50, 100, 50, 200, 245, 300, 350, 400, 450, 500, 550, 또는 600으로부터의 N-말단 아미노산은 면역원성 TERT 폴리펩티드로부터 부재할 수 있다. 따라서, 본 개시내용에 의해 제공된 면역원성 TERT 폴리펩티드는 서열식별번호: 3의 아미노산 51-1132, 101-1132, 151-1132, 201-1132, 251-1132, 301-1132, 351-1132, 401-1132, 451-1132, 501-1132, 또는 551-1132를 포함할 수 있다. 면역원성 TERT 폴리펩티드는 또한 다른 TERT 이소형으로부터 구축될 수 있다. 그러나, 폴리펩티드가 C-말단 말단절단을 갖는 TERT 이소형으로부터 구축되는 경우, 보다 적은 아미노산이 N-말단으로부터 결실될 수 있는 것이 바람직하다.
일부 추가의 실시양태에서, 면역원성 TERT 폴리펩티드는 TERT 촉매적 도메인을 불활성화시키는 1개 이상의 아미노산 돌연변이를 추가로 포함한다. 이러한 아미노산 돌연변이의 예로는 서열식별번호: 3의 위치 712에서의 아스파르트산의 알라닌으로의 치환 (D712A) 및 서열식별번호: 3의 위치 713에서의 발린의 이소류신으로의 치환 (V713I)을 들 수 있다. 일부 실시양태에서, 면역원성 TERT 폴리펩티드는 돌연변이 D712A 및 V713I 둘 다를 포함한다.
일부 구체적 실시양태에서, 본 개시내용은 하기로 이루어진 군으로부터 선택되는 면역원성 TERT 폴리펩티드를 제공한다:
1) 서열식별번호: 10의 아미노산 서열 또는 서열식별번호: 10의 아미노산 2-892를 포함하는 폴리펩티드 ("플라스미드 1112 폴리펩티드"), 또는 폴리펩티드의 기능적 변이체;
2) 서열식별번호: 14의 아미노산 서열 또는 서열식별번호: 14의 아미노산 3-789를 포함하는 폴리펩티드 ("플라스미드 1326 폴리펩티드"), 또는 폴리펩티드의 기능적 변이체; 및
3) 서열식별번호: 12의 아미노산 서열 또는 서열식별번호: 12의 아미노산 4-591을 포함하는 폴리펩티드 ("플라스미드 1330 폴리펩티드"), 또는 폴리펩티드의 기능적 변이체.
C. 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자
일부 측면에서, 본 개시내용은 각각 본 개시내용에 의해 제공된 1종, 2종, 3종, 또는 그 초과의 별개의 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자를 제공한다. 핵산 분자는 데옥시리보뉴클레오티드 (DNA) 또는 리보뉴클레오티드 (RNA)일 수 있다. 따라서, 핵산 분자는 티미딘 (T)이 또한 우라실 (U)일 수 있는 본원에 개시된 뉴클레오티드 서열을 포함하며, 이는 DNA 및 RNA의 화학적 구조 사이의 차이를 반영한다. 핵산 분자는 변형된 형태, 단일 또는 이중 가닥 형태, 또는 선형 또는 원형 형태일 수 있다. 핵산 분자는 본 개시내용을 고려하여 관련 기술분야에 알려진 방법을 사용하여 제조될 수 있다.
C-1. 단일-항원 구축물
한 측면에서, 본 개시내용은 본 개시내용에 의해 제공된 단일 면역원성 MUC1 폴리펩티드, 단일 면역원성 MSLN 폴리펩티드, 또는 단일 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열을 포함하는 단리된 핵산 분자를 제공한다. 단지 1종의 면역원성 TAA 폴리펩티드, 예컨대 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 또는 면역원성 TERT를 코딩하는 핵산 분자는 또한 본원에서 "단일-항원 구축물"로 지칭된다.
C-1a. MUC1 단일 항원 구축물
일부 실시양태에서, 본 개시내용은 본 개시내용에 제공된 면역원성 MUC1 폴리펩티드를 코딩하는 단리된 핵산 분자를 제공한다. 핵산 분자에 의해 코딩되는 면역원성 MUC1 폴리펩티드는 세포질성 형태 (또는 cMUC1) 또는 "막-결합된 형태 (또는 mMUC1)일 수 있다. 용어 "막-결합된 형태"는 숙주 세포에 의해 코딩 핵산으로부터 발현된 후, 숙주 세포의 막에 결합되거나, 그에 부착되거나, 다르게는 그와 회합하는 면역원성 MUC1 폴리펩티드를 지칭한다.
일부 구체적 실시양태에서, 본 개시내용에 의해 제공된 단리된 핵산 분자는 하기로 이루어진 군으로부터 선택되는 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열을 포함한다:
(1) 서열식별번호: 8의 아미노산 서열을 포함하는 면역원성 MUC1 폴리펩티드 (플라스미드 1027 폴리펩티드);
(2) 서열식별번호: 8의 아미노산 4-537을 포함하는 면역원성 MUC1 폴리펩티드;
(3) 서열식별번호: 8의 아미노산 24-537을 포함하는 면역원성 MUC1 폴리펩티드;
(4) 서열식별번호: 16의 아미노산 서열을 포함하는 면역원성 MUC1 폴리펩티드 (플라스미드 1197 폴리펩티드);
(5) 서열식별번호: 16의 아미노산 4-517을 포함하는 면역원성 MUC1 폴리펩티드;
(6) 서열식별번호: 16의 아미노산 4-517을 포함하며, 여기서 위치 513에서의 아미노산이 T인 면역원성 MUC1 폴리펩티드; 및
(7) 서열식별번호: 1의 아미노산 24-225 및 946-1255를 포함하는 면역원성 MUC1 폴리펩티드.
일부 다른 구체적 실시양태에서, 본 개시내용에 의해 제공된 단리된 핵산 분자는 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열, 또는 그의 축중성 변이체를 포함한다:
(1) 서열식별번호: 7의 뉴클레오티드 서열 (플라스미드 1027);
(2) 서열식별번호: 7의 뉴클레오티드 10-1611을 포함하는 뉴클레오티드 서열;
(3) 서열식별번호: 15의 뉴클레오티드 서열 (플라스미드 1197); 및
(4) 서열식별번호: 15의 뉴클레오티드 10-1551을 포함하는 뉴클레오티드 서열.
C-1b. MSLN 단일 항원 구축물
일부 실시양태에서, 본 개시내용은 본 개시내용에 제공된 면역원성 MSLN 폴리펩티드를 코딩하는 단리된 핵산 분자를 제공한다.
일부 특정한 실시양태에서, 단리된 핵산 분자는 하기로 이루어진 군으로부터 선택되는 면역원성 MSLN 폴리펩티드를 코딩한다:
1) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597을 포함하거나, 또는 그로 이루어진 면역원성 MSLN 폴리펩티드;
2) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597로 이루어진 아미노산 서열과 적어도 90%, 95%, 98%, 또는 99% 동일한 아미노산 서열을 포함하는 면역원성 MSLN 폴리펩티드;
3) 서열식별번호: 6의 아미노산 서열을 포함하거나, 또는 그로 이루어진 면역원성 MSLN 폴리펩티드; 및
4) 서열식별번호: 6의 아미노산 서열과 적어도 93% - 99%, 94% - 98%, 또는 94% - 97% 동일성을 갖는 아미노산 서열을 포함하는 면역원성 MSLN 폴리펩티드 ("플라스미드 1103 폴리펩티드").
일부 다른 구체적 실시양태에서, 본 개시내용에 의해 제공된 단리된 핵산 분자는 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열, 또는 그의 축중성 변이체를 포함한다:
(1) 서열식별번호: 5의 뉴클레오티드 서열; 및
(2) 서열식별번호: 5의 뉴클레오티드 10-1692를 포함하는 뉴클레오티드 서열.
C-1c. TERT 단일 항원 구축물
일부 다른 실시양태에서, 본 개시내용은 본 개시내용에 제공된 면역원성 TERT 폴리펩티드를 코딩하는 단리된 핵산 분자를 제공한다.
본 개시내용에 의해 제공된 핵산에 의해 코딩되는 면역원성 TERT 폴리펩티드는 TERT 이소형 1의 아미노산 서열의 N-말단으로부터 최대 600개 아미노산의 결실을 함유할 수 있다. 일반적으로, 면역원성 TERT 폴리펩티드는 그것이 TERT 단백질의 N-말단으로부터 보다 적은 아미노산의 결실을 갖는 경우, 보다 강한 면역원성을 소유할 것으로 예상될 수 있다. TERT 단백질로부터 결실될 수 있는 N-말단 아미노산의 수는 폴리펩티드를 코딩하는 핵산 분자가 어떻게 사용되거나 전달되도록 의도되는 지에 기초하여 결정될 수 있다. 예를 들어, 핵산 분자가 특정한 바이러스 벡터를 사용하여 전달되어야 하는 경우, 결실은 사용되는 벡터의 용량에 기초하여 결정될 수 있다.
일부 실시양태에서, 핵산 분자에 의해 코딩되는 면역원성 TERT 폴리펩티드는 TERT 촉매적 도메인을 불활성화시키는 1개 이상의 아미노산 돌연변이를 포함한다. 이러한 아미노산 돌연변이의 예로는 서열식별번호: 3의 위치 712에서의 아스파르트산의 알라닌으로의 치환 (D712A) 및 서열식별번호: 3의 위치 713에서의 발린의 이소류신으로의 치환 (V713I)을 들 수 있다. 일부 실시양태에서, 면역원성 TERT 폴리펩티드는 돌연변이 D712A 및 V713I 둘 다를 포함한다.
일부 구체적 실시양태에서, 단리된 핵산 분자는 하기로 이루어진 군으로부터 선택되는 면역원성 TERT 폴리펩티드를 코딩한다:
(1) 서열식별번호: 10의 아미노산 서열 또는 서열식별번호: 10의 아미노산 2-892를 포함하는 면역원성 TERT 폴리펩티드 ("플라스미드 1112 폴리펩티드"), 또는 폴리펩티드의 기능적 변이체;
(2) 서열식별번호: 14의 아미노산 서열 또는 서열식별번호: 14의 아미노산 3-789를 포함하는 면역원성 TERT 폴리펩티드 ("플라스미드 1326 폴리펩티드"), 또는 폴리펩티드의 기능적 변이체; 및
(3) 서열식별번호: 12의 아미노산 서열 또는 서열식별번호: 12의 아미노산 4-591을 포함하는 면역원성 TERT 폴리펩티드 ("플라스미드 1330 폴리펩티드"), 또는 폴리펩티드의 기능적 변이체.
일부 특정한 실시양태에서, 단리된 핵산 분자는 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열, 또는 그의 축중성 변이체를 포함한다:
(1) 서열식별번호: 9의 뉴클레오티드 서열 (TERT240);
(2) 서열식별번호: 9의 뉴클레오티드 4-2679를 포함하는 뉴클레오티드 서열;
(3) 서열식별번호: 11의 뉴클레오티드 서열 (TERT541);
(4) 서열식별번호: 11의 뉴클레오티드 10-1782를 포함하는 뉴클레오티드 서열;
(5) 서열식별번호: 13의 뉴클레오티드 서열 (TERT342); 및
(6) 서열식별번호: 13의 뉴클레오티드 7-2373을 포함하는 뉴클레오티드 서열.
C-2. 다중-항원 구축물
또 다른 측면에서, 본 개시내용은 각각 2종, 3종, 또는 그 초과의 상이한 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자를 제공한다. 1종 초과의 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자는 또한 본 개시내용에서 "다중-항원 구축물", "다중-항원 백신", "다중-항원 플라스미드" 등으로 지칭된다. 2종의 상이한 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자는 또한 이 개시내용에서 "이중-항원 구축물", "이중 항원 백신", 또는 "이중 항원 플라스미드" 등으로 지칭된다. 3종의 상이한 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자는 또한 이 개시내용에서 "삼중-항원 구축물", "삼중-항원 백신", 또는 "삼중-항원 플라스미드"로 지칭된다.
본 개시내용에 의해 제공된 다중-항원 구축물은 본 개시내용을 고려하여 관련 기술분야에 알려진 다양한 기술을 사용하여 제조될 수 있다. 예를 들어, 다중-항원 구축물은 다중 독립적 프로모터를 단일 플라스미드 내로 혼입시킴으로써 구축될 수 있다 (Huang, Y., Z. Chen, et al. (2008). "Design, construction, and characterization of a dual-promoter multigenic DNA vaccine directed against an HIV-1 subtype C/B' recombinant." J Acquir Immune Defic Syndr 47(4): 403-411; Xu, K., Z. Y. Ling, et al. (2011). "Broad humoral and cellular immunity elicited by a bivalent DNA vaccine encoding HA and NP genes from an H5N1 virus." Viral Immunol 24(1): 45-56). 플라스미드는 각각 a) 인핸서 요소가 있거나 없이 RNA 폴리머라제 의존적 전사를 개시하기 위한 진핵생물 프로모터, b) 표적 항원을 코딩하는 유전자, 및 c) 전사 종결자 서열로 이루어진 다중 발현 카세트를 운반하도록 조작될 수 있다. 형질감염된 세포 핵에의 플라스미드의 전달 시, 전사는 각각의 프로모터로부터 개시될 것이며, 이는 각각 표적 항원 중 하나를 코딩하는 별개의 mRNA의 생성을 초래한다. mRNA는 독립적으로 번역됨으로써, 바람직한 항원을 생성할 것이다.
본 개시내용에 의해 제공된 다중-항원 구축물은 또한 바이러스 2A 펩티드의 사용을 통해 구축될 수 있다 (Szymczak, A. L. and D. A. Vignali (2005). "Development of 2A peptide-based strategies in the design of multicistronic vectors." Expert Opin Biol Ther 5(5): 627-638; de Felipe, P., G. A. Luke, et al. (2006). "E unum pluribus: multiple proteins from a self-processing polyprotein." Trends Biotechnol 24(2): 68-75; Luke, G. A., P. de Felipe, et al. (2008). "Occurrence, function and evolutionary origins of '2A-like' sequences in virus genomes." J Gen Virol 89(Pt 4): 1036-1042; Ibrahimi, A., G. Vande Velde, et al. (2009). "Highly efficient multicistronic lentiviral vectors with peptide 2A sequences." Hum Gene Ther 20(8): 845-860; Kim, J. H., S. R. Lee, et al. (2011). "High cleavage efficiency of a 2A peptide derived from porcine teschovirus-1 in human cell lines, zebrafish and mice." PLoS One 6(4): e18556). 절단 카세트 또는 CHYSEL (시스-작용 히드롤라제 요소)로도 지칭되는 이들 펩티드는 고도로 보존된 카르복시 말단 D-V/I-EXNPGP 모티프를 갖는 대략 20개 아미노산 길이이다 (표 19). 이들 펩티드는 자연에서 희귀하며, 가장 통상적으로 바이러스, 예컨대 구제역 바이러스 (FMDV), 말 비염 A 바이러스 (ERAV), 말 비염 B 바이러스 (ERBV), 뇌심근염 바이러스 (EMCV), 돼지 테스코바이러스 (PTV), 및 토세아 아시그나 바이러스 (TAV)에서 발견된다 (Luke, G. A., P. de Felipe, et al. (2008). "Occurrence, function and evolutionary origins of '2A-like' sequences in virus genomes." J Gen Virol 89(Pt 4): 1036-1042). 2A-기반 다중-항원 발현 전략으로, 다중 표적 항원을 코딩하는 유전자는 바이러스 2A 펩티드를 코딩하는 서열에 의해 분리된, 단일 오픈 리딩 프레임에서 함께 연결된다. 전체 오픈 리딩 프레임은 단일 프로모터 및 종결자를 갖는 벡터 내로 클로닝될 수 있다. 숙주 세포에의 구축물의 전달 시, 다중 항원을 코딩하는 mRNA는 단일 폴리단백질로서 전사되고, 번역될 것이다. 2A 펩티드의 번역 동안, 리보솜은 C-말단 글리신 및 프롤린 사이의 결합을 스키핑한다. 리보솜 스키핑은 2A 펩티드의 하류로부터 그들 상류로 펩티드 서열을 방출하는 공동번역적 자가촉매적 "절단"처럼 작용한다. 2개의 단백질 항원 사이의 2A 펩티드의 혼입은 상류 폴리펩티드의 C-말단 상으로의 ~20개의 아미노산 및 하류 단백질의 N-말단에의 1개의 아미노산 (프롤린)의 첨가를 초래할 수 있다. 이 방법론의 변형에서, 프로테아제 절단 부위는 편재적 프로테아제가 상류 단백질로부터 카세트를 절단할 것이도록 2A 카세트의 N 말단에서 혼입될 수 있다 (Fang, J., S. Yi, et al. (2007). "An antibody delivery system for regulated expression of therapeutic levels of monoclonal antibodies in vivo." Mol Ther 15(6): 1153-1159).
본 개시내용에 의해 제공된 다중-항원 구축물을 구축하는 또 다른 전략은 내부 리보솜 진입 부위, 또는 IRES의 사용을 포함한다. 내부 리보솜 진입 부위는 특정 RNA 분자의 5' 비번역된 영역에서 발견되는 RNA 요소이다 (Bonnal, S., C. Boutonnet, et al. (2003). "IRESdb: the Internal Ribosome Entry Site database." Nucleic Acids Res 31(1): 427-428). 이들은 진핵생물 리보솜을 RNA로 유인하여 하류 오픈 리딩 프레임의 번역을 용이하게 한다. 정상 세포성 7-메틸구아노신 캡-의존적 번역과는 달리, IRES-매개된 번역은 RNA 분자 훨씬 내의 AUG 코돈에서 개시될 수 있다. 다중-시스트론 발현 벡터에 사용하기 위한 매우 유효한 프로세스가 탐구되었다 (Bochkov, Y. A. and A. C. Palmenberg (2006). "Translational efficiency of EMCV IRES in bicistronic vectors is dependent upon IRES sequence and gene location." Biotechniques 41(3): 283-284, 286, 288). 전형적으로, 2개의 트랜스진은 IRES에 의해 분리된 2개의 별개의 오픈 리딩 프레임으로서 프로모터 및 전사 종결자 사이에서 벡터 내로 삽입된다. 숙주 세포에의 구축물의 전달 시, 둘 다의 트랜스진을 코딩하는 단일 길이 전사체가 전사될 것이다. 제1 ORF는 전통적인 캡-의존적 방식으로 번역될 것이며, 이는 IRES의 상류의 정지 코돈에서 종결된다. 제2 ORF는 IRES를 사용하여 캡-독립적 방식으로 번역될 것이다. 이러한 방식으로, 2개의 독립적 단백질은 단일 발현 카세트를 갖는 벡터로부터 전사된 단일 mRNA로부터 생성될 수 있다.
일부 측면에서, 본 개시내용은 2개의 코딩 뉴클레오티드 서열을 포함하는 이중-항원 구축물을 제공하며, 여기서 코딩 뉴클레오티드 서열의 각각은 개별적 면역원성 TAA 폴리펩티드를 코딩한다. 이러한 이중-항원 구축물의 구조는 식 (I)에 나타나 있다:
TAA1-스페이서1-TAA2 (I)
여기서 식 (I)에서:
(i) TAA1 및 TAA2는 각각 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 및 면역원성 TERT 폴리펩티드로 이루어진 군으로부터 선택되는 면역원성 TAA 폴리펩티드를 코딩하는 뉴클레오티드 서열이며, 여기서 TAA1 및 TAA2는 상이한 면역원성 TAA 폴리펩티드를 코딩하고;
(ii) 스페이서1은 스페이서 뉴클레오티드 서열이고, 부재할 수 있다.
일부 실시양태에서, 본 개시내용은 식 (I)의 이중-항원 구축물을 제공하며, 식 (I)에서, TAA1은 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, TAA2는 면역원성 MSLN 폴리펩티드 또는 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 다른 실시양태에서, 본 개시내용은 식 (I)의 이중-항원 구축물을 제공하며, 식 (I)에서, TAA1은 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, TAA2는 면역원성 MUC1 폴리펩티드 또는 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 추가의 실시양태에서, 본 개시내용은 식 (I)의 이중-항원 구축물을 제공하며, 식 (I)에서, TAA1은 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, TAA2는 면역원성 MUC1 폴리펩티드 또는 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 구체적 실시양태에서, 본 개시내용은 하기로 이루어진 군으로부터 선택되는 식의 이중-항원 구축물을 제공하며,
(1) MUC1-2A-TERT (II)
(2) MUC1-2A-MSLN (III)
(3) MSLN-2A-TERT (IV)
(4) MSLN-2A-MUC1 (V)
(5) TERT-2A-MSLN (VI)
(6) TERT-2A-MUC1 (VII),
여기서 식 (II) - (VII)의 각각에서: (i) MUC1, MSLN, 및 TERT는 각각 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 및 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열을 나타내고, (ii) 2A는 2A 펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 다른 측면에서, 본 개시내용은 3개의 코딩 뉴클레오티드 서열을 포함하는 삼중-항원 구축물을 제공하며, 여기서 코딩 뉴클레오티드 서열의 각각은 상이한 개별적 면역원성 TAA 폴리펩티드를 발현한다. 삼중-항원 구축물의 구조는 식 (VIII)에 나타나 있다:
TAA1-스페이서1-TAA2-스페이서2-TAA3 (VIII)
여기서 식 (VIII)에서:
(i) TAA1, TAA2, 및 TAA3은 각각 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 및 면역원성 TERT 폴리펩티드로 이루어진 군으로부터 선택되는 면역원성 TAA 폴리펩티드를 코딩하는 뉴클레오티드 서열이며, 여기서 TAA1, TAA2, 및 TAA3은 상이한 면역원성 TAA 폴리펩티드를 코딩하고;
(ii) 스페이서1 및 스페이서2는 각각 스페이서 뉴클레오티드 서열이며, 여기서 (a) 스페이서1 및 스페이서2는 동일하거나 상이할 수 있고, (b) 스페이서1은 부재할 수 있거나, 스페이서2는 부재할 수 있거나, 스페이서1 및 스페이서2 둘 다는 부재할 수 있다.
본 개시내용에 사용된 용어 "스페이서 뉴클레오티드 서열"은 핵산 분자의 오픈 리딩 프레임에서 2개의 코딩 서열 또는 트랜스진 사이에 삽입되고, 핵산 분자로부터의 2개의 별개의 유전자 생성물의 공동-발현 또는 번역을 허용하도록 기능하는 뉴클레오티드 서열을 지칭한다. 본 개시내용에 의해 제공된 다중-항원 구축물에 사용될 수 있는 스페이서 뉴클레오티드 서열의 예로는 진핵생물 프로모터, 2A 펩티드를 코딩하는 뉴클레오티드 서열, 및 내부 리보솜 진입 부위 (IRES)를 들 수 있다. 2A 펩티드의 예로는 구제역 바이러스 2A 펩티드 (FMD2A), 말 비염 A 바이러스 2A 펩티드 (ERA2A), 말 비염 B 바이러스 2A 펩티드 (ERB2A), 뇌심근염 바이러스 2A 펩티드 (EMC2A), 돼지 테스코바이러스 2A 펩티드 (PT2A), 및 토세아 아시그나 바이러스 2A 펩티드 (T2A)를 들 수 있다. 이들 2A 펩티드의 서열은 표 19에 제공된다.
일부 실시양태에서, 스페이서1 및 스페이서2는 독립적으로 2A 펩티드를 코딩하는 뉴클레오티드 서열, 또는 GGSGG를 코딩하는 뉴클레오티드 서열이다.
일부 실시양태에서, 본 개시내용은 식 (VIII)의 삼중-항원 구축물을 제공하며, 식 (VIII)에서, (i) TAA1는 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (ii) TAA2는 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (iii) TAA3은 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 다른 실시양태에서, 본 개시내용은 식 (VIII)의 삼중-항원 구축물을 제공하며, 식 (VIII)에서, (i) TAA1은 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (ii) TAA2는 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (iii) TAA3은 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 다른 실시양태에서, 본 개시내용은 식 (VIII)의 삼중-항원 구축물을 제공하며, 식 (VIII)에서, (i) TAA1은 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (ii) TAA2는 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (iii) TAA3은 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 다른 실시양태에서, 본 개시내용은 식 (VIII)의 삼중-항원 구축물을 제공하며, 식 (VIII)에서, (i) TAA1은 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (ii) TAA2는 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열이고, (iii) TAA3은 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이다.
일부 구체적 실시양태에서, 본 개시내용은 하기로 이루어진 군으로부터 선택되는 식의 삼중-항원 구축물을 제공하며,
(1) MUC1-2A-MSLN-2A-TERT (IX)
(2) MUC1-2A-TERT-2A-MSLN (X)
(3) MSLN-2A-MUC1-2A-TERT (XI)
(4) MSLN-2A-TERT-2A-MUC1 (XII)
(5) TERT-2A-MUC1-2A-MSLN (XIII)
(6) TERT-2A-MSLN-2A-MUC1 (XIV)
여기서, 식 (IX) - (XIV)의 각각에서: (i) MUC1, MSLN, 및 TERT는 각각 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 및 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열을 나타내고, (ii) 2A는 2A 펩티드를 코딩하는 뉴클레오티드 서열이다.
다중-항원 구축물에 의해 코딩되는 면역원성 MSLN 폴리펩티드는 전장 MSLN 단백질 또는 그의 단편, 예컨대 세포질성, 분비된, 또는 막-결합된 단편일 수 있다. 일부 실시양태에서, 다중-항원 구축물은 하기로 이루어진 군으로부터 선택되는 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열을 포함한다:
1) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597을 포함하거나, 또는 그로 이루어진 폴리펩티드;
2) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597로 이루어진 아미노산 서열과 적어도 90%, 95%, 98%, 또는 99% 동일한 아미노산 서열을 포함하는 폴리펩티드;
3) 서열식별번호: 6의 아미노산 서열, 또는 서열식별번호: 6의 아미노산 서열의 아미노산 4-564를 포함하거나, 또는 그로 이루어진 폴리펩티드; 및
4) 서열식별번호: 8의 아미노산 서열과 적어도 93% - 99%, 94% - 98%, 또는 94% - 97% 동일성을 갖는 아미노산 서열을 포함하는 폴리펩티드 ("플라스미드 1103 폴리펩티드").
일부 특정한 실시양태에서, 다중-항원 구축물은 서열식별번호: 5의 뉴클레오티드 서열 또는 그의 축중성 변이체를 포함한다.
다중-항원 구축물에 의해 코딩되는 면역원성 MUC1 폴리펩티드는 (1) 인간 MUC1 단백질의 20개 아미노산의 3 내지 30개 탠덤 반복의 아미노산 서열 및 (2) VNTR 영역에 플랭킹된 인간 MUC1 단백질의 아미노산 서열을 포함할 수 있다. 일부 실시양태에서, 다중-항원 구축물은 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열을 포함하며, 여기서 면역원성 MUC1 폴리펩티드는 하기로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하거나, 또는 그로 이루어진다:
(1) 서열식별번호: 8의 아미노산 서열 (플라스미드 1027 폴리펩티드);
(2) 서열식별번호: 8의 아미노산 4-537을 포함하는 아미노산 서열;
(3) 서열식별번호: 8의 아미노산 24-537을 포함하는 아미노산 서열;
(4) 서열식별번호: 16의 아미노산 서열 (플라스미드 1197 폴리펩티드);
(5) 서열식별번호: 16의 아미노산 4-517을 포함하는 아미노산 서열; 및
(6) 서열식별번호: 16의 아미노산 4-517을 포함하며, 단 위치 513에서의 아미노산이 T인 아미노산 서열.
일부 특정한 실시양태에서, 다중-항원 구축물은 서열식별번호: 7의 뉴클레오티드 서열, 서열식별번호: 15의 뉴클레오티드 서열, 또는 서열식별번호 7 또는 15의 뉴클레오티드 서열의 축중성 변이체를 포함한다.
다중-항원 구축물에 의해 코딩되는 면역원성 TERT 폴리펩티드는 전장 단백질 또는 임의의 말단절단된 형태일 수 있다. 전장 TERT 단백질은 말단절단된 형태보다 더 강한 면역 반응을 생성할 것으로 예상된다. 그러나, 구축물을 전달하기 위해 선택된 구체적인 벡터에 따라, 벡터는 전체 TERT 단백질을 코딩하는 유전자를 운반하는 능력을 가질 수 있다. 따라서, 단백질로부터의 일부 아미노산의 결실은 트랜스진이 특정한 벡터 내로 적합화될 것이도록 생성될 수 있다. 아미노산의 결실은 TERT 단백질의 N-말단, C-말단, 또는 서열의 어디에서로부터도 생성될 수 있다. 추가의 결실은 핵 국재화 신호를 제거하기 위해 생성될 수 있으며, 그에 의해 폴리펩티드가 세포질성이 되게 하고, 세포성 항원 프로세싱/제시 기구에의 접근을 증가시킬 수 있다. 일부 실시양태에서, TERT 단백질의 N-말단의 위치 200, 300, 400, 500, 또는 600 이하의 아미노산은 면역원성 TERT 폴리펩티드로부터 부재한다. 추가의 아미노산의 돌연변이는 TERT 촉매적 도메인을 불활성화시키기 위해 도입될 수 있다. 이러한 돌연변이의 예로는 D712A 및 V713T를 들 수 있다.
일부 추가의 실시양태에서, 다중-항원 구축물은 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열을 포함하며, 여기서 면역원성 TERT 폴리펩티드는 하기로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하거나, 또는 그로 이루어진다:
1) 서열식별번호: 10의 아미노산 서열 ("플라스미드 1112 폴리펩티드"; TERT 240);
2) 서열식별번호: 12의 아미노산 서열 ("플라스미드 1330 폴리펩티드"; TERT 541); 및
3) 서열식별번호: 14의 아미노산 서열 ("플라스미드 1326 폴리펩티드"; TERT 343).
일부 특정한 실시양태에서, 다중-항원 구축물은 서열식별번호: 9, 11, 또는 13의 뉴클레오티드 서열, 또는 서열식별번호: 9, 11, 또는 13의 뉴클레오티드 서열의 축중성 변이체를 포함한다.
일부 특정한 실시양태에서, 본 개시내용은 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열을 포함하는, 면역원성 MUC1 폴리펩티드 및 면역원성 MSLN 폴리펩티드를 코딩하는 이중 항원 구축물을 제공한다:
(1) 서열식별번호: 18, 20, 22, 또는 24의 아미노산 서열을 코딩하는 뉴클레오티드 서열;
(2) 서열식별번호: 17, 19, 21, 또는 23의 뉴클레오티드 서열; 및
(3) 서열식별번호: 17, 19, 21, 또는 23의 뉴클레오티드 서열의 축중성 변이체.
일부 다른 특정한 실시양태에서, 본 개시내용은 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열을 포함하는, 면역원성 MUC1 폴리펩티드 및 면역원성 TERT 폴리펩티드를 코딩하는 이중 항원 구축물을 제공한다:
(1) 서열식별번호: 26, 28, 30, 32, 또는 34의 아미노산 서열을 코딩하는 뉴클레오티드 서열;
(2) 서열식별번호: 25, 27, 29, 31, 또는 33의 뉴클레오티드 서열; 및
(3) 서열식별번호: 25, 27, 29, 31, 또는 33의 뉴클레오티드 서열의 축중성 변이체.
일부 다른 특정한 실시양태에서, 본 개시내용은 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열을 포함하는, 면역원성 MSLN 폴리펩티드 및 면역원성 TERT 폴리펩티드를 코딩하는 이중 항원 구축물을 제공한다:
(1) 서열식별번호: 36, 38, 40, 또는 42의 아미노산 서열을 코딩하는 뉴클레오티드 서열;
(2) 서열식별번호: 35, 37, 39, 또는 41의 뉴클레오티드 서열; 및
(3) 서열식별번호: 35, 37, 39, 또는 41의 뉴클레오티드 서열의 축중성 변이체.
일부 다른 특정한 실시양태에서, 본 개시내용은 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열을 포함하는, 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 및 면역원성 TERT 폴리펩티드를 코딩하는 삼중-항원 구축물을 제공한다:
(1) 서열식별번호: 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 또는 66의 아미노산 서열을 코딩하는 뉴클레오티드 서열;
(2) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열; 및
(3) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열의 축중성 변이체.
D. 면역원성 TAA 폴리펩티드를 코딩하는 핵산 분자를 함유하는 벡터
본 발명의 또 다른 측면은 단일 항원 구축물, 이중-항원 구축물, 삼중-항원 구축물, 및 다른 다중-항원 구축물을 포함한, 본 개시내용에 의해 제공된 핵산 분자 중 임의의 것 중 1종 이상을 함유하는 벡터에 관한 것이다. 벡터는 핵산 분자에 의해 코딩되는 면역원성 TAA 폴리펩티드를 클로닝하거나 발현하는데, 또는 조성물, 예컨대 백신 중의 핵산 분자를 숙주 세포에 또는 숙주 동물, 예컨대 인간에게 전달하는데 유용하다. 일부 특정한 실시양태에서, 벡터는 면역원성 MUC1 폴리펩티드, 면역원성 MSLN 폴리펩티드, 및 면역원성 TERT 폴리펩티드를 코딩하는 삼중-항원 구축물을 포함하며, 여기서 삼중-항원 구축물은 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열을 포함한다:
(1) 서열식별번호: 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 또는 66의 아미노산 서열을 코딩하는 뉴클레오티드 서열;
(2) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열; 및
(3) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열의 축중성 변이체.
폭넓게 다양한 벡터, 예컨대 플라스미드 벡터, 코스미드 벡터, 파지 벡터, 및 바이러스 벡터는 본 발명의 핵산 분자를 함유하고 발현하도록 제조될 수 있다.
일부 실시양태에서, 본 개시내용은 본 발명의 핵산 분자를 함유하는 플라스미드-기반 벡터를 제공한다. 적합한 플라스미드 벡터의 예로는 pBR325, pUC18, pSKF, pET23D, 및 pGB-2를 들 수 있다. 플라스미드 벡터, 뿐만 아니라 이러한 벡터를 구축하는 방법의 다른 예는 미국 특허 번호 5,580,859, 5,589,466, 5,688,688, 5,814,482, 및 5,580,859에 기재되어 있다.
다른 실시양태에서, 본 발명은 바이러스, 예컨대 레트로바이러스, 알파바이러스, 및 아데노바이러스로부터 구축된 벡터를 제공한다. 레트로바이러스 벡터의 예는 미국 특허 번호 5,219,740, 5,716,613, 5,851,529, 5,591,624, 5,716,826, 5,716,832, 및 5,817,491에 기재되어 있다. 알파바이러스로부터 생성될 수 있는 벡터의 대표적인 예는 미국 특허 번호 5,091,309 및 5,217,879, 5,843,723, 및 5,789,245에 기재되어 있다.
일부 특정한 실시양태에서, 본 개시내용은 비-인간 영장류 아데노바이러스, 예컨대 시미안 아데노바이러스로부터 유래된 아데노바이러스 벡터를 제공한다. 이러한 아데노바이러스 벡터, 뿐만 아니라 그들의 제조의 예는 PCT 출원 공개 WO2005/071093 및 WO 2010/086189에 기재되어 있으며, 시미안 아데노바이러스, 예컨대 ChAd3, ChAd4, ChAd5, ChAd7, ChAd8, ChAd9, ChAd10, ChAd11, ChAd16, ChAd17, ChAd19, ChAd20, ChAd22, ChAd24, ChAd26, ChAd30, ChAd31, ChAd37, ChAd38, ChAd44, ChAd63, ChAd68, ChAd82, ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, Pan Ad2, 및 Pan Ad3으로부터 구축된 비-복제 벡터, 및 시미안 아데노바이러스 Ad4 또는 Ad7로부터 구축된 복제-적격 벡터 구축물을 들 수 있다. 시미안 아데노바이러스로부터 아데노바이러스 벡터를 구축하는데 있어서, E1A, E1B, E2A, E2B, E3, 및 E4로부터 선택되는 바이러스의 게놈 영역으로부터의 초기 유전자 중 1개 이상은 결실되거나, 결실 또는 돌연변이에 의해 비-기능적이 되는 것이 바람직하다. 특정한 실시양태에서, 벡터는 ChAd3 또는 ChAd68로부터 구축된다. 적합한 벡터는 또한 다른 바이러스, 예컨대: (1) 마마 바이러스, 예컨대 카나리아 마마 바이러스 또는 우두 바이러스 (Fisher-Hoch et al., PNAS 86:317-321, 1989; Flexner et al., Ann. N.Y. Acad. Sci. 569:86-103, 1989; Flexner et al., Vaccine 8:17-21, 1990; 미국 특허 번호 4,603,112, 4,769,330 및 5,017,487; WO 89/01973); (2) SV40 (Mulligan et al., Nature 277:108-114, 1979); (3) 포진 (Kit, Adv. Exp. Med. Biol. 215:219-236, 1989; 미국 특허 번호 5,288,641); 및 (4) 렌티바이러스, 예컨대 HIV (Poznansky, J. Virol. 65:532-536, 1991)로부터 생성될 수 있다.
벡터를 구축하는 방법은 관련 기술분야에 널리 알려져 있다. 발현 벡터는 전형적으로 발현되어야 할 핵산 서열에 작동적으로 연결된 1종 이상의 제어 요소를 포함한다. 용어 "제어 요소"는 집합적으로 프로모터 영역, 폴리아데닐화 신호, 전사 종결 서열, 상류 조절 도메인, 복제 기점, 내부 리보솜 진입 부위 ("IRES"), 인핸서 등을 지칭하며, 이는 집합적으로 수용자 세포에서 코딩 서열의 복제, 전사, 및 번역을 제공한다. 선택된 코딩 서열이 적절한 숙주 세포에서 복제되고, 전사되고, 번역될 수 있는 한, 이들 제어 요소의 전부가 항상 존재할 필요가 있는 것은 아니다. 제어 요소는 관련 기술분야의 통상의 기술자에게 알려진 다수의 인자, 예컨대 구체적인 숙주 세포 및 다른 벡터 구성요소의 공급원 또는 구조에 기초하여 선택된다. 면역원성 TAA 폴리펩티드의 발현을 증강시키기 위해, 코작(Kozak) 서열이 면역원성 TAA 폴리펩티드를 코딩하는 서열의 상류에 제공될 수 있다. 척추동물에 대해, 알려진 코작 서열은 (GCC)NCCATGG이며, 여기서 N은 A 또는 G이고, GCC는 덜 보존된다. 사용될 수 있는 예시적인 코작 서열로는 GAACATGG, ACCAUGG, 및 ACCATGG를 들 수 있다.
E. 면역원성 TAA 폴리펩티드를 포함하는 조성물 (폴리펩티드 조성물)
또 다른 측면에서, 본 개시내용은 본 개시내용에 의해 제공된 1종 이상의 단리된 면역원성 TAA 폴리펩티드를 포함하는 폴리펩티드 조성물 ("폴리펩티드 조성물")을 제공한다. 일부 실시양태에서, 폴리펩티드 조성물은 포유동물, 예컨대 마우스, 개, 비인간 영장류 또는 인간에서 TAA 단백질에 대한 면역 반응을 도출하는데 유용한 면역원성 조성물이다. 일부 다른 실시양태에서, 폴리펩티드 조성물은 인간에게 투여하기 위한 제약 조성물이다. 추가의 다른 실시양태에서, 폴리펩티드 조성물은 포유동물, 예컨대 인간의 면역화에, 비정상적 세포 증식을 억제하는데, 암의 발생에 대한 보호를 제공하는데 (예방제로서 사용됨), 또는 TAA 과다 발현과 연관된 장애, 예컨대 암의 치료에 (치료제로서 사용됨) 유용한 백신 조성물이다.
본 개시내용에 의해 제공된 폴리펩티드 조성물은 단일 유형의 면역원성 TAA 폴리펩티드, 예컨대 면역원성 MSLN 폴리펩티드, 면역원성 MUC1 폴리펩티드, 또는 면역원성 TERT 폴리펩티드를 함유할 수 있다. 조성물은 또한 2종 이상의 상이한 유형의 면역원성 TAA 폴리펩티드의 조합을 함유할 수 있다. 예를 들어, 폴리펩티드 조성물은 하기 조합 중 임의의 것으로의 면역원성 TAA 폴리펩티드를 함유할 수 있다:
1) 면역원성 MSLN 폴리펩티드 및 면역원성 MUC1 폴리펩티드;
2) 면역원성 MSLN 폴리펩티드 및 TERT 폴리펩티드; 또는
3) 면역원성 MSLN 폴리펩티드, 면역원성 MUC1 폴리펩티드, 및 TERT 폴리펩티드.
일부 실시양태에서, 본 개시내용에 의해 제공된 폴리펩티드 조성물, 예컨대 면역원성 조성물, 제약 조성물, 또는 백신 조성물은 제약상 허용되는 부형제를 추가로 포함한다. 면역원성, 제약, 또는 백신 조성물에 적합한 제약상 허용되는 부형제는 관련 기술분야에 알려져 있다. 조성물에 사용될 수 있는 적합한 부형제의 예로는 생체적합성 오일, 예컨대 평지씨유, 해바라기유, 땅콩유, 면실유, 호호바 오일, 스쿠알란, 스쿠알렌, 생리 염수 용액, 보존제 및 삼투압 제어제, 담체 기체, pH-제어제, 유기 용매, 소수성화제, 효소 억제제, 수흡수성 중합체, 계면활성제, 흡수 촉진제, pH 조정제, 및 항-산화제를 들 수 있다.
조성물, 특히 면역원성 조성물 또는 백신 조성물 중의 면역원성 TAA 폴리펩티드는 수용자에의 투여를 위한 담체에 연결되거나, 그에 접합되거나, 다르게는 그 내로 혼입될 수 있다. 용어 "담체"는 수용자 (예를 들어, 환자)에의 면역원성 폴리펩티드의 전달을 위해 면역원성 폴리펩티드가 그에 부착되거나, 다르게는 그와 회합될 수 있는 물질 또는 구조를 지칭한다. 담체 자체가 면역원성일 수 있다. 담체의 예로는 면역원성 폴리펩티드, 면역 CpG 섬, 림펫 헤모시아닌 (KLH), 파상풍 톡소이드 (TT), 콜레라 독소 서브유닛 B (CTB), 박테리아 또는 박테리아 고스트, 리포솜, 키토솜, 비로솜, 미소구, 수지상 세포 등을 들 수 있다. 1종 이상의 면역원성 TAA 폴리펩티드 분자는 단일 담체 분자에 연결될 수 있다. 면역원성 폴리펩티드를 담체에 연결시키는 방법은 관련 기술분야에 알려져 있다.
본 개시내용에 의해 제공된 백신 조성물 또는 면역원성 조성물은 1종 이상의 면역 조정제 또는 아주반트와 함께 또는 조합으로 사용될 수 있다. 면역 조정제 또는 아주반트는 백신 조성물 또는 면역원성 조성물로부터 별개로 제제화될 수 있거나, 이들은 동일한 조성물 제제의 일부일 수 있다. 따라서, 일부 실시양태에서, 본 개시내용은 1종 이상의 면역 조정제 또는 아주반트를 추가로 포함하는 백신 조성물을 제공한다. 면역 조정제 및 아주반트의 예는 하기 본원에 제공된다.
면역원성 및 백신 조성물을 포함한 폴리펩티드 조성물은 임의의 적합한 투여 형태, 예컨대 액체 형태 (예를 들어, 용액, 현탁액, 또는 에멀젼) 및 고체 형태 (예를 들어, 캡슐, 정제, 또는 분말)로, 및 관련 기술분야의 통상의 기술자에게 알려진 방법에 의해 제조될 수 있다.
F. 면역원성 TAA 핵산 분자를 포함하는 조성물 (핵산 조성물)
본 개시내용은 또한 본 개시내용에 의해 제공된 단리된 핵산 분자 또는 벡터를 포함하는 핵산 조성물 ("핵산 조성물")을 제공한다. 핵산 조성물은 시험관내에서 또는 인간을 포함한 포유동물에서 생체내에서 TAA 단백질에 대한 면역 반응을 도출하는데 유용하다. 일부 실시양태에서, 핵산 조성물은 면역원성 조성물 또는 제약 조성물이다.
일부 특정한 실시양태에서, 핵산 조성물은 (1) 비정상적 세포 증식을 억제하고, 암의 발생에 대한 보호를 제공하거나 (예방제로서 사용됨), (2) TAA 과다-발현과 연관된 암의 치료 (치료제로서 사용됨), 또는 (3) 특정한 인간 TAA, 예컨대 MSLN, MUC1, 또는 TERT에 대한 면역 반응을 도출하기 위한 인간에의 투여를 위한 DNA 백신 조성물이다. 조성물 중의 핵산 분자는 "네이키드" 핵산 분자, 즉, 단순히 형질감염 또는 발현을 촉진시키는 요소가 없는 단리된 DNA의 형태일 수 있다. 대안적으로, 조성물 중의 핵산 분자는 벡터, 예컨대 플라스미드 벡터 또는 바이러스 벡터 내로 혼입된다.
본 개시내용에 의해 제공된 핵산 조성물은 각각 단지 1가지 유형의 면역원성 TAA 폴리펩티드, 예컨대 면역원성 MSLN 폴리펩티드, 면역원성 MUC1 폴리펩티드, 또는 면역원성 TERT 폴리펩티드를 코딩하는 개별적 단리된 핵산 분자를 포함할 수 있다.
핵산 조성물은 2종 이상의 유형의 면역원성 TAA 폴리펩티드를 코딩하는 다중-항원 구축물을 포함할 수 있다. 예를 들어, 다중-항원 구축물은 하기 조합 중 임의의 것으로 2종 이상의 면역원성 TAA 폴리펩티드를 코딩할 수 있다:
(1) 면역원성 MSLN 폴리펩티드 및 면역원성 MUC1 폴리펩티드;
(2) 면역원성 MSLN 폴리펩티드 및 면역원성 TERT 폴리펩티드;
(3) 면역원성 MUC1 폴리펩티드 및 면역원성 TERT 폴리펩티드; 및
(4) 면역원성 MSLN 폴리펩티드, 면역원성 MUC1 폴리펩티드, 및 면역원성 TERT 폴리펩티드.
일부 특정한 실시양태에서, 본 개시내용에 의해 제공된 조성물은 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열을 포함하는 이중 항원 구축물을 포함한다:
(1) 서열식별번호: 18, 20, 22, 또는 24, 26, 28, 30, 32, 또는 34, 36, 38, 30, 40, 또는 42의 아미노산 서열을 코딩하는 뉴클레오티드 서열;
(2) 서열식별번호: 17, 19, 21, 또는 23, 25, 27, 29, 31, 또는 33, 35, 37, 39, 또는 41의 뉴클레오티드 서열; 및
(3) 서열식별번호: 17, 19, 21, 또는 23, 25, 27, 29, 31, 또는 33, 35, 37, 39, 또는 41의 뉴클레오티드 서열의 축중성 변이체.
일부 다른 특정한 실시양태에서, 본 개시내용에 의해 제공된 조성물은 하기로 이루어진 군으로부터 선택되는 뉴클레오티드 서열을 포함하는 삼중-항원 구축물을 포함한다:
(1) 서열식별번호: 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 또는 66의 아미노산 서열을 코딩하는 뉴클레오티드 서열;
(2) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열; 및
(3) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열의 축중성 변이체.
핵산 조성물, 예컨대 제약 조성물 또는 DNA 백신 조성물은 제약상 허용되는 부형제를 추가로 포함할 수 있다. DNA 백신 조성물을 포함한 핵산 조성물에 적합한 제약상 허용되는 부형제는 관련 기술분야의 통상의 기술자에게 널리 알려져 있다. 이러한 부형제는 수성 또는 비수성 용액, 현탁액, 및 에멀젼일 수 있다. 비-수성 부형제의 예로는 프로필렌 글리콜, 폴리에틸렌 글리콜, 식물유, 예컨대 올리브유, 및 주사가능한 유기 에스테르, 예컨대 에틸 올레에이트를 들 수 있다. 수성 부형제의 예로는 염수 및 완충된 매질을 포함한 물, 알콜성/수성 용액, 에멀젼 또는 현탁액을 들 수 있다. 적합한 부형제는 또한 폴리뉴클레오티드 분자의 세포 흡수에 도움이 되는 작용제를 포함한다. 이러한 작용제의 예는 (i) 세포 투과성을 변형시키는 화학물질, 예컨대 부피바카인, (ii) 폴리뉴클레오티드의 캡슐화를 위한 리포솜 또는 바이러스 입자, 또는 (iii) 그들 자신을 폴리뉴클레오티드와 회합시키는 양이온성 지질 또는 실리카, 금, 또는 텅스텐 미세입자이다. 음이온성 및 중성 리포솜은 관련 기술분야에 널리 알려져 있으며 (리포솜을 제조하는 방법의 상세한 설명에 대해서는 예를 들어, 문헌 [Liposomes: A Practical Approach, RPC New Ed, IRL press (1990)] 참조), 폴리뉴클레오티드를 포함한 다양한 생성물을 전달하는데 유용하다. 양이온성 지질은 또한 관련 기술분야에 알려져 있으며, 유전자 전달에 통상적으로 사용된다. 이러한 지질로는 DOTMA (N-[I-(2,3-디올레일옥시) 프로필 N,N,N-트리메틸암모늄 클로라이드)로도 알려진 리포펙틴(Lipofectin).TM., DOTAP (1,2-비스 (올레일옥시)-3 (트리메틸암모니오) 프로판), DDAB (디메틸디옥타데실-암모늄 브로마이드), DOGS (디옥타데실아미돌로글리실 스페르민) 및 콜레스테롤 유도체, 예컨대 DCChol (3 베타-(N-(N',N'-디메틸 아미노메탄)-카르바모일) 콜레스테롤)을 들 수 있다. 이들 양이온성 지질의 설명은 EP 187,702, WO 90/11092, 미국 특허 번호 5,283,185, WO 91/15501, WO 95/26356, 및 미국 특허 번호 5,527,928에서 발견될 수 있다. 본 개시내용에 의해 제공된 핵산 백신과 함께 사용될 수 있는 특정한 유용한 양이온성 지질 제제는 백스펙틴(VAXFECTIN)이며, 이는 수성 비히클과 배합되는 경우, 자기-어셈블리하여 리포솜을 형성하는 양이온성 지질 (GAP-DMORIE) 및 중성 인지질 (DPyPE)의 혼합물이다. 유전자 전달을 위한 양이온성 지질은 바람직하게는 예로서 WO 90/11092에 기재된 바와 같이, 중성 지질, 예컨대 DOPE (디올레일 포스파티딜에탄올아민)와 공동으로 사용된다. 또한, 핵산 구축물, 예컨대 DNA 구축물은 또한 비이온성 블록 공중합체, 예컨대 CRL1005와 함께 제제화될 수 있다.
본 개시내용에 의해 제공된 핵산 백신 조성물 또는 면역원성 조성물은 1종 이상의 면역 조정제와 함께 또는 조합으로 사용될 수 있다. 본 개시내용에 의해 제공된 핵산 백신 조성물 또는 면역원성 조성물은 또한 1종 이상의 아주반트와 함께 또는 조합으로 사용될 수 있다. 또한, 본 개시내용에 의해 제공된 핵산 백신 조성물 또는 면역원성 조성물은 1종 이상의 면역 조정제 및 1종 이상의 아주반트와 함께 또는 조합으로 사용될 수 있다. 면역 조정제 또는 아주반트는 핵산 조성물로부터 별개로 제제화될 수 있거나, 이들은 동일한 조성물의 일부일 수 있다. 따라서, 일부 실시양태에서, 본 개시내용은 1종 이상의 면역 조정제 및/또는 1종 이상의 아주반트를 추가로 포함하는 핵산 백신 조성물을 제공한다. 면역 조정제 및 아주반트의 예는 하기 본원에 제공된다.
백신 조성물을 포함한 핵산 조성물은 임의의 적합한 투여 형태, 예컨대 액체 형태 (예를 들어, 용액, 현탁액, 또는 에멀젼) 및 고체 형태 (예를 들어, 캡슐, 정제, 또는 분말)로, 및 관련 기술분야의 통상의 기술자에게 알려진 방법에 의해 제조될 수 있다.
G. 면역원성 TAA 폴리펩티드, 핵산 분자, 및 조성물의 용도
다른 측면에서, 본 개시내용은 상기 본원에 기재된 면역원성 TAA 폴리펩티드, 단리된 핵산 분자, 및 조성물을 사용하는 방법을 제공한다.
한 측면에서, 본 개시내용은 유효량의 (1) 표적 TAA에 대해 면역원성인 면역원성 TAA 폴리펩티드, (2) 1종 이상의 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자, (3) 1종 이상의 면역원성 TAA 폴리펩티드를 포함하는 조성물, 또는 (4) 1종 이상의 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 조성물을 포유동물에게 투여하는 것을 포함하는, 포유동물, 특히 인간에서 TAA에 대한 면역 반응을 도출하는 방법을 제공한다. 일부 실시양태에서, 본 개시내용은 유효량의 본 개시내용에 의해 제공된 면역원성 MSLN 조성물을 인간에게 투여하는 것을 포함하는, 인간에서 MSLN에 대한 면역 반응을 도출하는 방법을 제공하며, 여기서 면역원성 MSLN 조성물은 (1) 면역원성 MSLN 폴리펩티드, (2) 면역원성 MSLN 폴리펩티드를 코딩하는 단리된 핵산 분자, (3) 면역원성 MSLN 폴리펩티드를 포함하는 조성물, 또는 (4) 면역원성 MSLN 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 조성물로부터 선택된다. 일부 다른 실시양태에서, 본 개시내용은 유효량의 본 개시내용에 의해 제공된 면역원성 MUC1 조성물을 인간에게 투여하는 것을 포함하는, 인간에서 MUC1에 대한 면역 반응을 도출하는 방법을 제공하며, 여기서 면역원성 MUC1 조성물은 (1) 면역원성 MUC1 폴리펩티드, (2) 면역원성 MUC1 폴리펩티드를 코딩하는 단리된 핵산 분자, (3) 면역원성 MUC1 폴리펩티드를 포함하는 조성물, 또는 (4) 면역원성 MUC1 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 조성물로부터 선택된다. 일부 실시양태에서, 본 개시내용은 유효량의 본 개시내용에 의해 제공된 면역원성 TERT 조성물을 인간에게 투여하는 것을 포함하는, 인간에서 TERT에 대한 면역 반응을 도출하는 방법을 제공하며, 여기서 면역원성 TERT 조성물은 (1) 면역원성 TERT 폴리펩티드, (2) 면역원성 TERT 폴리펩티드를 코딩하는 단리된 핵산 분자, (3) 면역원성 TERT 폴리펩티드를 포함하는 조성물, 또는 (4) 면역원성 TERT 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 조성물로부터 선택된다.
또 다른 측면에서, 본 개시내용은 인간에서 비정상적 세포 증식을 억제하는 방법을 제공하며, 여기서 비정상적 세포 증식은 TAA의 과다-발현과 연관된다. 방법은 과다-발현된 TAA에 대해 면역원성인 유효량의 본 개시내용에 의해 제공된 면역원성 TAA 조성물을 인간에게 투여하는 것을 포함한다. 면역원성 TAA 조성물은 (1) 면역원성 TAA 폴리펩티드, (2) 1종 이상의 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자, (3) 면역원성 TAA 폴리펩티드를 포함하는 조성물, 또는 (4) 1종 이상의 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 조성물일 수 있다. 비정상적 세포 증식은 인간의 임의의 기관 또는 조직, 예컨대 유방, 위, 난소, 폐, 방광, 대장 (예를 들어, 결장 및 직장), 신장, 췌장, 및 전립선에 있을 수 있다. 일부 실시양태에서, 방법은 유방, 난소, 췌장, 결장, 폐, 위, 및 직장에서의 비정상적 세포 증식을 억제하기 위한 것이다.
또 다른 측면에서, 본 개시내용은 인간에서 암을 치료하는 방법을 제공하며, 여기서 암은 TAA의 과다-발현과 연관된다. 방법은 과다-발현된 TAA에 대한 면역 반응을 도출할 수 있는 면역원성 TAA 조성물의 유효량을 인간에게 투여하는 것을 포함한다. 면역원성 TAA 조성물은 (1) 면역원성 TAA 폴리펩티드, (2) 1종 이상의 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자, (3) 면역원성 TAA 폴리펩티드를 포함하는 조성물, 또는 (4) 1종 이상의 면역원성 TAA 폴리펩티드를 코딩하는 단리된 핵산 분자를 포함하는 조성물일 수 있다.
일부 실시양태에서, 본 개시내용은 유효량의 상기 본원에 제공된 핵산 조성물을 인간에게 투여하는 것을 포함하는, 인간에서 암을 치료하는 방법을 제공한다. 조성물 중의 핵산은 단지 1종의 특정한 면역원성 TAA 폴리펩티드, 예컨대 면역원성 MSLN 폴리펩티드, 면역원성 MUC1 폴리펩티드, 또는 면역원성 TERT 폴리펩티드를 코딩하는 단일-항원 구축물일 수 있다. 조성물 중의 핵산은 또한 2종, 3종, 또는 그 초과의 상이한 면역원성 TAA 폴리펩티드를 코딩하는 다중-항원 구축물일 수 있다.
일부 구체적 실시양태에서, 본 개시내용은 유효량의 이중-항원 구축물을 포함하는 조성물을 인간에게 투여하는 것을 포함하는, 인간에서 암을 치료하는 방법을 제공한다. 이중-항원 구축물은 (1) 면역원성 MSLN 폴리펩티드 및 면역원성 MUC1 폴리펩티드; (2) 면역원성 MSLN 폴리펩티드 및 면역원성 TERT 폴리펩티드; (3) 면역원성 TERT 폴리펩티드 및 면역원성 MUC1 폴리펩티드로부터 선택되는 임의의 2종의 상이한 면역원성 TAA 폴리펩티드를 코딩할 수 있다.
일부 다른 구체적 실시양태에서, 본 개시내용은 면역원성 MSLN 폴리펩티드, 면역원성 MUC1 폴리펩티드, 및 면역원성 TERT 폴리펩티드를 코딩하는 삼중-항원 구축물을 포함하는 조성물의 유효량을 인간에게 투여하는 것을 포함하는, 인간에서 암을 치료하는 방법을 제공하며, 여기서 암은 MUC1, MSLN, 및 TERT로부터 선택되는 1종 이상의 TAA의 과다-발현과 연관된다.
종양-연관된 항원 MUC1, MSLN, 및/또는 TERT를 과다-발현하는 임의의 암은 본 개시내용에 의해 제공된 방법에 의해 치료될 수 있다. 암의 예로는 유방암, 난소암, 폐암 (예컨대 소세포 폐암 및 비소세포 폐암), 결장직장암, 위암, 및 췌장암을 들 수 있다. 일부 특정한 실시양태에서, 본 개시내용은 삼중-항원 구축물을 포함하는 조성물의 유효량을 인간에게 투여하는 것을 포함하는, 인간에서 암을 치료하는 방법을 제공하며, 여기서 암은 (1) 유방암, 예컨대 삼중-음성 유방암, (2) 췌장암, 예컨대 췌장관 선암종, 또는 (3) 난소암, 예컨대 난소 선암종이다.
폴리펩티드 및 핵산 조성물은 관련 기술분야에 알려진 다수의 적합한 방법에 의해 인간을 포함한 동물에게 투여될 수 있다. 적합한 방법의 예로는 (1) 근육내, 진피내, 표피내, 또는 피하 투여, (2) 경구 투여, 및 (3) 국소 적용 (예컨대 안구, 비내, 및 질내 적용)을 들 수 있다. 사용될 수 있는 핵산 백신 조성물의 진피내 또는 표피내 투여의 한 특정한 방법은 파우더메드(PowderMed)에 의해 시판되는 입자 매개 표피 전달 (Particle Mediated Epidermal Delivery: PMEDTM) 백신 전달 장치를 사용한 유전자 총 전달이다. PMED는 동물 또는 인간에게 백신을 투여하는 무-침 방법이다. PMED 시스템은 DNA를 현미경적 금 입자 상으로 침전시킨 후, 헬륨 기체에 의해 표피 내로 추진시키는 것을 포함한다. DNA-코팅된 금 입자는 표피의 APC 및 각질세포에 전달되고, 일단 이들 세포의 핵 내부에서, DNA는 금을 용출해내고, 전사적으로 활성화되어, 코딩된 단백질을 생성한다. 핵산 백신의 근육내 투여를 위한 한 특정한 방법은 전기천공이다. 전기천공은 세포막에서 일시적 기공을 생성하는 제어된 전기적 펄스를 사용하며, 이는 근육 내로 주사된 핵산 백신의 세포 흡수를 용이하게 한다. CpG가 핵산 백신과 조합으로 사용되는 경우, CpG 및 핵산 백신은 한 제제에 공동-제제화될 수 있으며, 제제는 전기천공에 의해 근육내로 투여된다.
본 개시내용에 의해 제공된 주어진 방법에서 투여되는 조성물 중의 면역원성 TAA 폴리펩티드 또는 면역원성 TAA 폴리펩티드를 코딩하는 핵산의 유효량은 관련 기술분야의 통상의 기술자에 의해 용이하게 결정될 수 있으며, 다수의 인자에 의존할 것이다. 암, 예컨대 췌장암, 난소암, 및 유방암을 치료하는 방법에서, 면역원성 TAA 폴리펩티드 또는 핵산의 유효량을 결정하는데 있어서 고려될 수 있는 인자로는 (1) 대상체의 면역 상태 및 건강을 포함한 치료되는 대상체, (2) 치료되는 암의 중증도 또는 단계, (3) 사용되거나 발현되는 구체적인 면역원성 TAA 폴리펩티드, (4) 바람직한 보호 또는 치료의 정도, (5) 투여 방법 및 일정, 및 (6) 사용되는 다른 치료제 (예컨대 아주반트 또는 면역 조정제)를 들 수 있으나, 이에 제한되지는 않는다. 다중-항원 백신 조성물을 포함한 핵산 백신 조성물의 경우, 제제화 및 전달의 방법은 핵심 인자 중에서 유효한 면역 반응을 도출하는데 요구되는 핵산의 용량을 결정하기 위한 것이다. 예를 들어, 핵산의 유효량은 핵산 백신 조성물이 수용액으로서 제제화되고, 피하주사 침 주사 또는 압착공기 주사에 의해 투여되는 경우 2 μg/용량 - 10 mg/용량의 범위일 수 있는 반면, 핵산이 코팅된 금 비드로서 제조되고, 유전자 총 기술을 사용하여 전달되는 경우 단지 16 ng/용량 - 16 μg/용량이 요구될 수 있다. 전기천공에 의한 핵산 백신에 대한 용량 범위는 일반적으로 0.5 - 10 mg/용량의 범위이다. 핵산 백신이 공동-제제에서 전기천공에 의해 CpG와 함께 투여되는 경우, 핵산 백신의 용량은 0.5 - 5 mg/용량의 범위일 수 있으며, CpG의 용량은 전형적으로 0.05 mg - 5 mg/용량의 범위, 예컨대 사람당 0.05, 0.2, 0.6, 또는 1.2 mg/용량이다.
본 발명의 핵산 또는 폴리펩티드 백신 조성물은 견고하고 오래-지속되는 면역 반응을 유도하는 프라임-부스트 전략에 사용될 수 있다. 동일한 면역원성 구축물의 반복된 주사에 기초한 프라이밍 및 부스팅 백신접종 프로토콜은 널리 알려져 있다. 일반적으로, 제1 용량은 보호적 면역을 생성하지 않을 수 있지만, 단지 면역계를 "프라이밍"한다. 보호적 면역 반응은 제2 또는 제3 용량 ("부스트") 후에 발생한다. 부스트는 통상적인 기술에 따라 수행되며, 투여의 일정, 투여의 경로, 아주반트의 선택, 용량, 및 또 다른 백신과 함께 투여되는 경우 잠재적인 순서의 관점에서 경험적으로 추가로 최적화될 수 있다. 한 실시양태에서, 본 발명의 핵산 또는 폴리펩티드 백신은 통상적인 동종 프라임-부스트 전략에 사용되며, 여기서 동일한 백신은 다중 용량으로 동물에게 투여된다. 또 다른 실시양태에서, 핵산 또는 폴리펩티드 백신 조성물은 이종 프라임-부스트 백신접종에 사용되며, 여기서 동일한 항원을 함유하는 상이한 유형의 백신이 미리 결정된 시간 간격으로 투여된다. 예를 들어, 핵산 구축물은 초기 용량 ("프라임")에서 플라스미드의 형태로 및 후속 용량 ("부스트")에서 벡터의 일부로서 투여될 수 있으며, 그 역도 마찬가지다.
본 개시내용의 폴리펩티드 또는 핵산 백신 조성물은 1종 이상의 아주반트와 함께 사용될 수 있다. 적합한 아주반트의 예로는 (1) 수중유 에멀젼 제제 (다른 특이적 면역자극제, 예컨대 뮤라밀 폴리펩티드 또는 박테리아 세포벽 구성요소를 갖거나 갖지 않음), 예컨대 (a) 미세유동화기를 사용하여 마이크로미터 미만 입자로 제제화된 5% 스쿠알렌, 0.5% 트윈 80 (폴리옥시에틸렌 소르비탄 모노-올레에이트), 및 0.5% 스팬 85 (소르비탄 트리올레에이트)를 함유하는 MF59™ (PCT 공개 번호 WO 90/14837; Chapter 10 in Vaccine design: the subunit and adjuvant approach, eds. Powell & Newman, Plenum Press 1995), (b) 마이크로미터 미만 에멀젼으로 미세유동화되거나, 볼텍싱되어 보다 큰 입자 크기 에멀젼을 생성하는 10% 스쿠알렌, 0.4% 트윈 80, 5% 플루로닉-블록화된 중합체 L121, 및 thr-MDP를 함유하는 SAF, 및 (c) 2% 스쿠알렌, 0.2% 트윈 80, 및 1종 이상의 박테리아 세포벽 구성요소, 예컨대 모노포스포릴리피드 A (MPL), 트레할로스 디미콜레이트 (TDM), 및 세포벽 골격 (CWS)을 함유하는 RIBI™ 아주반트 시스템 (RAS) (리비 이뮤노켐(Ribi Immunochem), 미국 몬타나주 해밀턴); (2) 사포닌 아주반트, 예컨대 QS21, 스티뮬론(STIMULON)™ (캠브리지 바이오사이언스(Cambridge Bioscience), 미국 매사추세츠주 워세스터), 아비스코(Abisco)® (이스코노바(Isconova), 스웨덴), 또는 이스코매트릭스(Iscomatrix)® (커먼웰쓰 세럼 래보러토리즈(Commonwealth Serum Laboratories), 오스트레일리아); (3) 완전 프로인트 아주반트 (CFA: Complete Freund's Adjuvant) 및 불완전 프로인트 아주반트 (IFA: Incomplete Freund's Adjuvant); (4) 시토카인, 예컨대 인터류킨 (예를 들어 IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12 (PCT 공개 번호 WO 99/44636) 등), 인터페론 (예를 들어 감마 인터페론), 대식세포 콜로니 자극 인자 (M-CSF), 및 종양 괴사 인자 (TNF); (5) 모노포스포릴 리피드 A (MPL) 또는 3-O-데아세틸화된 MPL (3dMPL), (WO 00/56358); (6) 3dMPL과 QS21 및/또는 수중유 에멀젼의 조합 (EP-A-0835318, EP-A-0735898, EP-A-0761231); (7) CpG 모티프를 포함하는, 즉 적어도 1종의 CG 디뉴클레오티드를 함유하는 올리고뉴클레오티드 (여기서 시토신은 비메틸화됨) (WO 98/40100, WO 98/55495, WO 98/37919 및 WO 98/52581); (8) 폴리옥시에틸렌 에테르 또는 폴리옥시에틸렌 에스테르 (WO 99/52549); (9) 옥톡시놀과 조합으로의 폴리옥시에틸렌 소르비탄 에스테르 계면활성제 (WO 01/21207) 또는 적어도 1종의 추가의 비-이온성 계면활성제, 예컨대 옥톡시놀과 조합으로의 폴리옥시에틸렌 알킬 에테르 또는 에스테르 계면활성제 (WO 01/21152); (10) 사포닌 및 면역자극성 올리고뉴클레오티드 (예를 들어 CpG 올리고뉴클레오티드) (WO 00/62800); (11) 알루미늄 염 (명반), 예컨대 인산알루미늄 및 수산화알루미늄을 포함한 금속 염; (12) 사포닌 및 수중유 에멀젼 (WO 99/11241); 및 (13) 사포닌 (예를 들어 QS21), 3dMPL, 및 IM2의 조합 (WO 98/57659)을 들 수 있다.
또한, 포유동물에서의 암을 포함한 신생물성 장애의 치료를 위해, 본 개시내용에 의해 제공된, 백신 조성물을 포함한 폴리펩티드 또는 핵산 조성물은 1종 이상의 면역 조정제와 조합으로 투여될 수 있다. 면역 조정제는 면역-억제성-세포 억제제 (ISC 억제제) 또는 면역-이펙터-세포 증강제 (IEC 증강제)일 수 있다. 또한, 1종 이상의 ISC 억제제는 1종 이상의 IEC 증강제와 조합으로 사용될 수 있다. 면역 조정제는 (1) 전신적 투여, 예컨대 정맥내, 근육내, 또는 경구 투여, 및 (2) 국소 투여, 예컨대 진피내 및 피하 투여를 포함한 임의의 적합한 방법 및 경로에 의해 투여될 수 있다. 적절하거나 적합할 경우, 국소 투여는 일반적으로 전신적 투여에 비해 바람직하다. 임의의 면역 조정제의 국소 투여는 제약의 국소 투여에 적합한 포유동물의 신체의 임의의 위치에서 수행될 수 있지만; 그러나, 이들 면역 조정제는 백신 배수 림프절에 아주 근접하여 국소적으로 투여된다.
조성물, 예컨대 백신은 임의의 또는 모든 사용되는 면역 조정제와 동시에 또는 순차적으로 투여될 수 있다. 유사하게, 2종 이상의 면역 조정제가 사용되는 경우, 이들은 서로에 대해 동시에 또는 순차적으로 투여될 수 있다. 일부 실시양태에서, 백신은 1종의 면역 조정제에 대해 동시에 (예를 들어, 혼합물 중에), 그러나, 1종 이상의 추가의 면역 조정제에 대해 순차적으로 투여된다. 백신 및 면역 조정제의 공동-투여는 백신 및 적어도 1종의 면역 조정제가, 비록 항원 및 면역 조정제가 동시에 투여되지 않는다고 할지라도, 각각이 투여 부위, 예컨대 백신 배수 림프절에 동시에 존재하도록 투여되는 경우를 포함할 수 있다. 백신 및 면역 조정제의 공동-투여는 또한 백신 또는 면역 조정제가 투여 부위로부터 청소되지만, 청소된 백신 또는 면역 조정제의 적어도 하나의 세포 효과가 투여 부위, 예컨대 백신 배수 림프절에서, 적어도 1종 이상의 추가의 면역 조정제가 투여 부위에 투여될 때까지 지속하는 경우를 포함할 수 있다. 핵산 백신이 CpG와 조합으로 투여되는 경우, 백신 및 CpG는 단일 제제에 함유되고, 임의의 적합한 방법에 의해 함께 투여될 수 있다. 일부 실시양태에서, 공동-제제 (혼합물) 중의 핵산 백신 및 CpG는 전기천공과 조합으로의 근육내 주사에 의해 투여된다.
일부 실시양태에서, 폴리펩티드 또는 핵산 조성물과 조합으로 사용되는 면역 조정제는 ISC 억제제이다.
ISC 억제제의 예로는 (1) 단백질 키나제 억제제, 예컨대 이마티닙, 소라페닙, 라파티닙, BIRB-796, 및 AZD-1152, AMG706, 작티마 (ZD6474), MP-412, 소라페닙 (BAY 43-9006), 다사티닙, CEP-701 (레스타우르티닙), XL647, XL999, 티케르브 (라파티닙), MLN518 (이전에 CT53518로 알려짐), PKC412, ST1571, AEE 788, OSI-930, OSI-817, 수니티닙 말레이트 (수텐트(SUTENT)), 악시티닙 (AG-013736), 에를로티닙, 게피티닙, 악시티닙, 보수티닙, 템시롤리스무스 및 닐로티닙 (AMN107)을 들 수 있다. 일부 특정한 실시양태에서, 티로신 키나제 억제제는 수니티닙, 소라페닙, 또는 수니티닙 또는 소라페닙의 제약상 허용되는 염 또는 유도체 (예컨대 말레이트 또는 토실레이트); (2) 시클로옥시게나제-2 (COX-2) 억제제, 예컨대 셀레콕시브 및 로페콕시브; (3) 포스포디에스테라제 유형 5 (PDE5) 억제제 (예컨대 PDE5 억제제의 예로는 아바나필, 로데나필, 미로데나필, 실데나필, 타달라필, 바르데나필, 우데나필, 및 자프리나스트를 들 수 있음), 및 (4) DNA 가교제, 예컨대 시클로포스파미드이다.
일부 실시양태에서, 폴리펩티드 또는 핵산 조성물과 조합으로 사용되는 면역 조정제는 IEC 증강제이다. 2종 이상의 IEC 증강제가 함께 사용될 수 있다.
사용될 수 있는 IEC 증강제의 예로는 (1) TNFR 효능제, 예컨대 OX40, 4-1BB (예컨대 BMS-663513), GITR (예컨대 TRX518), 및 CD40 (예컨대 CD40 효능제성 항체)의 효능제; (2) CTLA-4 억제제 (예컨대 이플리무맙 및 트레멜리무맙; (3) TLR 효능제, 예컨대 CpG 7909 (5' TCGTCGTTTTGTCGTTTTGTCGTT3'), CpG 24555 (5' TCGTCGTTTTTCGGTGCTTTT3' (CpG 24555); 및 CpG 10103 (5' TCGTCGTTTTTCGGTCGTTTT3'); (4) 프로그래밍된 세포 사멸 단백질 1 (PD-1) 억제제, 예컨대 니볼루맙 및 펨브롤리주맙; 및 (5) PD-L1 억제제, 예컨대 아테졸리주맙, 두르발루맙, 및 벨루맙; 및 (6) IDO1 억제제를 들 수 있다.
일부 실시양태에서, IEC 증강제는 CD40 효능제 항체이며, 이는 인간, 인간화 또는 부분-인간 키메라 항-CD40 항체일 수 있다. 구체적인 CD40 효능제 항체의 예로는 G28-5, mAb89, EA-5 또는 S2C6 모노클로날 항체, 및 CP870,893을 들 수 있다. CP-870,893은 항-종양 요법으로서 임상적으로 조사된 전체 인간 효능제성 CD40 모노클로날 항체 (mAb)이다. CP870,893의 구조 및 제조는 WO2003041070에 개시되어 있다 (여기서 항체는 내부 확인된 "21.4.1"에 의해 확인되며, 항체의 중쇄 및 경쇄의 아미노산 서열은 각각 서열식별번호: 40 및 서열식별번호: 41에 제시됨). 본 개시내용에 의해 제공된 조성물과 조합으로 사용하기 위해, CP-870,893은 임의의 적합한 경로, 예컨대 진피내, 피하, 또는 근육내 주사에 의해 투여될 수 있다. CP870893의 유효량은 일반적으로 0.01 - 0.25 mg/kg의 범위이다. 일부 실시양태에서, CP870893은 0.05 - 0.1 mg/kg의 양으로 투여된다.
일부 다른 실시양태에서, IEC 증강제는 CTLA-4 억제제, 예컨대 이플리무맙 및 트레멜리무맙이다. 예르보이(YERVOY)로서 시판되는 이플리무맙 (MEX-010 또는 MDX-101로도 알려짐)은 인간 항-인간 CTLA-4 항체이다. 이플리무맙은 또한 그의 CAS 등록 번호 477202-00-9에 의해 지칭될 수 있으며, PCT 공개 번호 WO 01/14424에 항체 10DI로서 개시되어 있다. 트레멜리무맙 (CP-675,206으로도 알려짐)은 전체 인간 IgG2 모노클로날 항체이며, CAS 번호 745013-59-6을 갖는다. 트레멜리무맙은 미국 특허 번호 6,682,736 (그 전문이 본원에 참조로 포함됨)에 개시되어 있고, 여기서 이는 항체 11.2.1로서 확인되며, 그의 중쇄 및 경쇄의 아미노산 서열은 각각 서열식별번호: 42 및 43에 제시된다. 본 개시내용에 의해 제공된 조성물과 조합으로 사용하기 위해, 트레멜리무맙은 국소적으로, 특히 진피내로 또는 피하로 투여될 수 있다. 진피내로 또는 피하로 투여되는 트레멜리무맙의 유효량은 전형적으로 사람당 5 - 200 mg/용량의 범위이다. 일부 실시양태에서, 트레멜리무맙의 유효량은 용량당 사람당 10 - 150 mg/용량의 범위이다. 일부 특정한 실시양태에서, 트레멜리무맙의 유효량은 사람당 약 10, 25, 50, 75, 100, 125, 150, 175, 또는 200 mg/용량이다.
일부 다른 실시양태에서, 면역 조정제는 PD-1 억제제 또는 PD-L1 억제제, 예컨대 니볼루맙, 펨브롤리주맙, RN888 (항-PD-1 항체), 아테졸리주맙 (로슈(Roche)로부터의 PD-L1-특이적 mAb), 두르발루맙 (아스트라 제네카(Astra Zeneca)로부터의 PD-L1-특이적 mAb), 및 아벨루맙(Avelumab) (머크(Merck)로부터의 PD-L1-특이적 mAb)이다. (Okazaki T et al., International Immunology (2007);19,7:813-824, Sunshine J et al., Curr Opin Pharmacol. 2015 Aug;23:32-8).
다른 실시양태에서, 본 개시내용은 항암 백신을 포함한 백신과 함께 면역 조정제의 용도를 제공하며, 여기서 면역 조정제는 인돌아민 2,3-디옥시게나제 1 ("IDO1"로도 알려짐)의 억제제이다. IDO1은 면역 세포 기능을 억제성 표현형으로 조정하는 것으로 밝혀졌으며, 따라서, 숙주 면역 감시로부터의 종양 이탈에 부분적으로 설명하는 것으로 믿어진다. 상기 효소는 필수 아미노산 트립토판을 키누레닌 및 다른 대사물로 분해한다. 이들 대사물 및 소량의 트립토판은 이펙터 T-세포 기능의 억제 및 조절성 T 세포의 증가된 분화를 초래하는 것으로 밝혀졌다. IDO1 억제제는 대분자, 예컨대 항체, 또는 소분자, 예컨대 화학적 화합물일 수 있다.
일부 특정한 실시양태에서, 본 개시내용에 의해 제공된 폴리펩티드 또는 핵산 조성물은 WO2010/005958에 개시된 1,2,5-옥사디아졸 유도체 IDO1 억제제와 조합으로 사용된다. 구체적인 1,2,5-옥사디아졸 유도체 IDO1 억제제의 예로는 하기 화합물을 들 수 있다:
4-({2-[(아미노술포닐)아미노]에틸}아미노)-N-(3-브로모-4-플루오로페닐)-N'-히드록시-1,2,5-옥사디아졸-3-카르복스이미드아미드;
4-({2 [(아미노술포닐)아미노]에틸} 아미노)-N-(3-클로로-4-플루오로페닐)-N'-히드록시-1,2,5-옥사디아졸 3-카르복스이미드아미드;
4-({2 [(아미노술포닐)아미노] 에틸} 아미노)-N-[4-플루오로-3-(트리플루오로메틸)페닐]-N'-히드록시-1,2,5 옥사디아졸-3-카르복스이미드아미드;
4-({2 [(아미노술포닐)아미노] 에틸} 아미노)-N'-히드록시-N-[3-(트리플루오로메틸)페닐]-1,2,5 옥사디아졸-3-카르복스이미드아미드;
4-({2 [(아미노술포닐)아미노]에틸} 아미노)-N-(3-시아노-4-플루오로페닐)-N'-히드록시-1,2,5-옥사디아졸 3-카르복스이미드아미드;
4-({2 [(아미노술포닐)아미노] 에틸} 아미노)-N-[(4-브로모-2-푸릴)메틸]-N'-히드록시-1,2,5 옥사디아졸-3-카르복스이미드아미드; 또는
4-({2 [(아미노술포닐)아미노] 에틸} 아미노)-N-[(4-클로로-2-푸릴)메틸]-N'-히드록시-1,2,5 옥사디아졸-3-카르복스이미드아미드.
1,2,5-옥사디아졸 유도체 IDO1 억제제는 전형적으로 경구로 1일당 1회 또는 2회 투여되며, 경구 투여에 의한 유효량은 일반적으로 환자당 용량당 25 mg - 1000 mg의 범위, 예컨대 25 mg, 50 mg, 100 mg, 200 mg, 300 mg, 400 mg, 500 mg, 600 mg, 700 mg, 800 mg, 또는 1000 mg이다. 특정한 실시양태에서, 본 개시내용에 의해 제공된 폴리펩티드 또는 핵산 조성물은 경구로 1일당 2회 용량당 25 mg 또는 50 mg으로 투여되는 4-({2-[(아미노술포닐)아미노]에틸}아미노)-N-(3-브로모-4-플루오로페닐)-N'-히드록시-1,2,5-옥사디아졸-3-카르복스이미드아미드와 조합으로 사용된다. 1,2,5-옥사디아졸 유도체는 그 전문이 본원에 참조로 포함되는 미국 특허 번호 8,088,803에 기재된 바와 같이 합성될 수 있다.
일부 다른 구체적 실시양태에서, 본 개시내용에 의해 제공된 폴리펩티드 또는 핵산 조성물은 WO2015/173764에 개시된 피롤리딘-2,5-디온 유도체 IDO1 억제제와 조합으로 사용된다. 구체적인 피롤리딘-2,5-디온 유도체 억제제의 예로는 하기 화합물을 들 수 있다:
3-(5-플루오로-1H-인돌-3-일)피롤리딘-2,5-디온;
(3-2H)-3-(5-플루오로-1H-인돌-3-일)피롤리딘-2,5-디온;
(-)-(R)-3-(5-플루오로-1H-인돌-3-일)피롤리딘-2,5-디온;
3-(1H-인돌-3-일)피롤리딘-2,5-디온;
(-)-(R)-3-(1H-인돌-3-일)피롤리딘-2,5-디온;
3-(5-클로로-1H-인돌-3-일)피롤리딘-2,5-디온;
(-)-(R)-3-(5-클로로-1H-인돌-3-일)피롤리딘-2,5-디온;
3-(5-브로모-1H-인돌-3-일)피롤리딘-2,5-디온;
3-(5,6-디플루오로-1H-인돌-3-일)피롤리딘-2,5-디온; 및
3-(6-클로로-1H-인돌-3-일)피롤리딘-2,5-디온.
피롤리딘-2,5-디온 유도체 IDO1 억제제는 전형적으로 경구로 1일당 1회 또는 2회 투여되며, 경구 투여에 의한 유효량은 일반적으로 환자당 용량당 50 mg - 1000 mg의 범위, 예컨대 125 mg, 250 mg, 500 mg, 750 mg, 또는 1000 mg이다. 특정한 실시양태에서, 본 개시내용에 의해 제공된 폴리펩티드 또는 핵산 조성물은 경구로 1일당 1회 환자당 용량당 125-100 mg으로 투여되는 3-(5-플루오로-1H-인돌-3-일)피롤리딘-2,5-디온과 조합으로 사용된다. 피롤리딘-2,5-디온 유도체는 그 전문이 본원에 참조로 포함되는 미국 특허 출원 공개 US2015329525에 기재된 바와 같이 합성될 수 있다.
H. 실시예
하기 실시예는 본 발명의 특정 실시양태를 예시하기 위해 제공된다. 이는 본 발명의 범위를 어떠한 방식으로도 제한하는 것으로 해석되지 않아야 한다. 상기 논의 및 이들 실시예로부터, 관련 기술분야의 통상의 기술자는 본 발명의 본질적 특징을 확인할 수 있으며, 그의 정신 및 범위로부터 벗어나지 않고, 이를 다양한 용법 및 조건에 적응시키기 위해 본 발명의 다양한 변화 및 변형을 생성할 수 있다.
실시예 1. 단일-항원, 이중-항원, 및 삼중-항원 구축물의 구축
실시예 1은 단일 항원 구축물, 이중-항원 구축물, 및 삼중 항원 구축물의 구축을 예시한다. 달리 언급되지 않는다면, MUC1, MSLN, 및 TERT 단백질의 아미노산 위치 또는 잔기에 대한 언급은 각각 서열식별번호: 1에 제시된 바와 같은 인간 MUC1 이소형 1 전구체의 아미노산 서열, 서열식별번호: 2에 제시된 바와 같은 인간 메소텔린 (MSLN) 이소형 2 전구체 단백질의 아미노산 서열, 및 서열식별번호: 3에 제시된 바와 같은 인간 TERT 이소형 1 전구체 단백질의 아미노산 서열을 지칭한다.
1A. 단일-항원 구축물
플라스미드 1027 (MUC1). 플라스미드 1027을 유전자 합성 및 제한 단편 교환의 기술을 사용하여 생성하였다. 5X 탠덤 반복 VNTR 영역을 갖는 인간 MUC1의 아미노산 서열을 유전자 최적화 및 합성을 위해 진아트(GeneArt)에 제출하였다. 폴리펩티드를 코딩하는 유전자를 발현을 위해 최적화하고, 합성하고, 클로닝하였다. MUC-1 오픈 리딩 프레임을 NheI 및 BglII로의 소화에 의해 진아트 벡터로부터 삭제하고, 유사하게 소화된 플라스미드 pPJV7563 내로 삽입하였다. 플라스미드 1027의 오픈 리딩 프레임 (ORF) 뉴클레오티드 서열은 서열식별번호: 7에 제시된다. 플라스미드 1027에 의해 코딩되는 아미노산 서열은 서열식별번호: 8에 제시된다.
플라스미드 1103 (cMSLN). 플라스미드 1103을 PCR 및 제한 단편 교환의 기술을 사용하여 구축하였다. 먼저, 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1084로부터 프라이머 MSLN34 및 MSLN598로 증폭시켜, 각각 앰플리콘의 5' 및 3' 말단에서 NheI 및 BglII 제한 부위의 부가를 초래하였다. 앰플리콘을 NheI 및 Bgl II로 소화시키고, 유사하게 소화된 플라스미드 pPJV7563 내로 삽입하였다. 플라스미드 1103의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 5에 제시된다. 플라스미드 1103에 의해 코딩되는 아미노산 서열은 서열식별번호: 6에 제시된다.
플라스미드 1112 (TERT240). 플라스미드 1112를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, TERT 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1065로부터 프라이머 f pmed TERT 241G 및 r TERT co# pMed로 증폭시켰다. 앰플리콘을 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1112의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 9에 제시된다. 플라스미드 1112에 의해 코딩되는 아미노산 서열은 서열식별번호: 10에 제시된다.
플라스미드 1197 (cMUC1). 플라스미드 1197을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, MUC1 아미노산 22-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 ID1197F 및 ID1197R로 증폭시켰다. 앰플리콘을 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1197의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 15에 제시된다. 플라스미드 1197에 의해 코딩되는 아미노산 서열은 서열식별번호: 16에 제시된다.
플라스미드 1326 (TERT343). 플라스미드 1326을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, TERT 아미노산 344-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 TertΔ343-F 및 Tert-R로 증폭시켰다. 앰플리콘을 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1326의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 13에 제시된다. 플라스미드 1326에 의해 코딩되는 아미노산 서열은 서열식별번호: 14에 제시된다.
플라스미드 1330 (TERT541). 플라스미드 1330을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, TERT 아미노산 542-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 TertΔ541-F 및 Tert-R로 증폭시켰다. 앰플리콘을 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1330의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 11에 제시된다. 플라스미드 1330에 의해 코딩되는 아미노산 서열은 서열식별번호: 12에 제시된다.
1B. 이중-항원 구축물
플라스미드 1158 (cMSLN - PT2A - Muc1). 플라스미드 1158을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f pmed Nhe cMSLN 및 r PTV2A Bamh cMSLN으로 증폭시켰다. 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 f1 PTV2A Muc, f2 PTV2A, 및 r pmed Bgl Muc로 증폭시켰다. PCR은 cMSLN의 3' 말단 및 Muc1의 5' 말단에서 중첩하는 PTV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1158의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 23에 제시된다. 플라스미드 1158에 의해 코딩되는 아미노산 서열은 서열식별번호: 24에 제시된다.
플라스미드 1159 (Muc1 - PT2A - cMSLN). 플라스미드 1159를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f1 PTV2A cMSLN , f2 PTV2A, 및 r pmed Bgl cMSLN으로 증폭시켰다. 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 f pmed Nhe Muc 및 r PTV2A Bamh Muc로 증폭시켰다. PCR은 cMSLN의 5' 말단 및 Muc1의 3' 말단에서 중첩하는 PTV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1159의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 21에 제시된다. 플라스미드 1159에 의해 코딩되는 아미노산 서열은 서열식별번호: 22에 제시된다.
플라스미드 1269 (Muc1 - Ter240). 플라스미드 1269를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f tg link Ter240 및 r pmed Bgl Ter240으로 증폭시켰다. 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 f pmed Nhe Muc 및 r link muc로 증폭시켰다. PCR은 Tert의 5' 말단 및 Muc1의 3' 말단에서 중첩하는 GGSGG 링커의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1269의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 25에 제시된다. 플라스미드 1269에 의해 코딩되는 아미노산 서열은 서열식별번호: 26에 제시된다.
플라스미드 1270 (Muc1 - ERB2A - Ter240). 플라스미드 1270을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f2 ERBV2A, f1 ERBV2A Ter240, 및 r pmed Bgl Ter240으로 증폭시켰다. 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 f pmed Nhe Muc 및 r ERB2A Bamh Muc로 증폭시켰다. PCR은 Tert의 5' 말단 및 Muc1의 3' 말단에서 중첩하는 ERBV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1270의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 27에 제시된다. 플라스미드 1270에 의해 코딩되는 아미노산 서열은 서열식별번호: 28에 제시된다.
플라스미드 1271 (Ter240 - ERB2A - Muc1). 플라스미드 1271을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f pmed Nhe Ter240 및 r ERB2A Bamh Ter240으로 증폭시켰다. 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 f2 ERBV2A, f1 ERBV2A Muc, 및 r pmed Bgl Muc로 증폭시켰다. PCR은 Tert의 3' 말단 및 Muc1의 5' 말단에서 중첩하는 ERBV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1271의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 29에 제시된다. 플라스미드 1271에 의해 코딩되는 아미노산 서열은 서열식별번호: 30에 제시된다.
플라스미드 1272 (Ter240 - T2A - cMSLN). 플라스미드 1272를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f pmed Nhe Ter240 및 r T2A Tert240으로 증폭시켰다. 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f2 T2A, f1 T2A cMSLN, 및 r pmed Bgl cMSLN으로 증폭시켰다. PCR은 Tert의 3' 말단 및 cMSLN의 5' 말단에서 중첩하는 TAV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1272의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 35에 제시된다. 플라스미드 1272에 의해 코딩되는 아미노산 서열은 서열식별번호: 36에 제시된다.
플라스미드 1273 (Tert240 - cMSLN). 플라스미드 1273을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f pmed Nhe Ter240 및 r link Tert240으로 증폭시켰다. 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f tert ink cMSLN 및 r pmed Bgl cMSLN으로 증폭시켰다. PCR은 Tert의 3' 말단 및 cMSLN의 5' 말단에서 중첩하는 GGSGG 링커의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1273의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 37에 제시된다. 플라스미드 1273에 의해 코딩되는 아미노산 서열은 서열식별번호: 38에 제시된다.
플라스미드 1274 (cMSLN - T2A - Tert240). 플라스미드 1274를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f2 T2A, f1 T2A Tert240 및 r pmed Bgl Ter240으로 증폭시켰다. 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f pmed Nhe cMSLN 및 r T2A Bamh cMSLN으로 증폭시켰다. PCR은 Tert의 5' 말단 및 cMSLN의 3' 말단에서 중첩하는 TAV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1274의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 39에 제시된다. 플라스미드 1274에 의해 코딩되는 아미노산 서열은 서열식별번호: 40에 제시된다.
플라스미드 1275 (cMSLN - Tert240). 플라스미드 1275를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f tg link Ter240 및 r pmed Bgl Ter240으로 증폭시켰다. 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f pmed Nhe cMSLN 및 r link cMSLN으로 증폭시켰다. PCR은 Tert의 5' 말단 및 cMSLN의 3' 말단에서 중첩하는 GGSGG 링커의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1275의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 41에 제시된다. 플라스미드 1275에 의해 코딩되는 아미노산 서열은 서열식별번호: 42에 제시된다.
플라스미드 1286 (cMuc1 - ERB2A - Tert240). 플라스미드 1286을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f2 ERBV2A, f1 ERBV2A Ter240, 및 r pmed Bgl Ter240으로 증폭시켰다. 인간 뮤신-1 아미노산 22-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1197로부터 프라이머 f pmed Nhe cytMuc 및 r ERB2A Bamh Muc로 증폭시켰다. PCR은 Tert의 5' 말단 및 Muc1의 3' 말단에서 중첩하는 ERBV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1286의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 31에 제시된다. 플라스미드 1286에 의해 코딩되는 아미노산 서열은 서열식별번호: 32에 제시된다.
플라스미드 1287 (Tert240 - ERB2A - cMuc1). 플라스미드 1287을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f pmed Nhe Ter240 및 r ERB2A Bamh Ter240으로 증폭시켰다. 인간 뮤신-1 아미노산 22-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1197로부터 프라이머 f2 ERBV2A, f1 ERBV2A cMuc, 및 r pmed Bgl Muc로 증폭시켰다. PCR은 Tert의 3' 말단 및 Muc1의 5' 말단에서 중첩하는 ERBV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1287의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 33에 제시된다. 플라스미드 1287에 의해 코딩되는 아미노산 서열은 서열식별번호: 34에 제시된다.
플라스미드 1313 (Muc1 - EMC2A - cMSLN). 플라스미드 1313을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 EMCV_cMSLN_F - 33, EMCV2A_F - 34 및 pMED_cMSLN_R - 37로 증폭시켰다. 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 pMED_MUC1_F - 31, EMCV2A_R - 36, 및 EMCV_Muc1_R - 35로 증폭시켰다. PCR은 cMSLN의 5' 말단 및 Muc1의 3' 말단에서 중첩하는 EMCV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1313의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 19에 제시된다. 플라스미드 1313에 의해 코딩되는 아미노산 서열은 서열식별번호: 20에 제시된다.
플라스미드 1316 (cMSLN - EMC2A - Muc1). 플라스미드 1316을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f pmed Nhe cMSLN 및 r EM2A Bamh cMSLN으로 증폭시켰다. 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1027로부터 프라이머 f1 EM2A Muc, f2 EMCV2A, 및 r pmed Bgl Muc로 증폭시켰다. PCR은 cMSLN의 3' 말단 및 Muc1의 5' 말단에서 중첩하는 EMCV 2A 서열의 부가를 초래하였다. 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1316의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 17에 제시된다. 플라스미드 1316에 의해 코딩되는 아미노산 서열은 서열식별번호: 18에 제시된다.
1C. 삼중-항원 구축물
플라스미드 1317 (Muc1 - EMC2A - cMSLN - T2A - Tert240). 플라스미드 1317을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 뮤신-1 아미노산 2-225, 946-1255, EMCV 2A 펩티드, 및 메소텔린 전구체의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 플라스미드 1313으로부터 프라이머 f pmed Nhe Muc 및 r MSLN 1051 - 1033으로 증폭시켰다. 메소텔린 전구체의 카르복시 말단 절반, TAV 2A 펩티드, 및 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1274로부터 프라이머 f MSLN 1028 -1051 및 r pmed Bgl Ter240으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1317의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 43에 제시된다. 플라스미드 1317에 의해 코딩되는 아미노산 서열은 서열식별번호: 44에 제시된다.
플라스미드 1318 (Muc1 - ERB2A - Tert240 - T2A - cMSLN). 플라스미드 1318을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 뮤신-1 아미노산 2-225, 946-1255, ERBV 2A 펩티드, 및 인간 텔로머라제의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 플라스미드 1270으로부터 프라이머 f pmed Nhe Muc 및 r tert 1602 -1579로 증폭시켰다. 텔로머라제의 카르복시 말단 절반, TAV 2A 펩티드, 및 인간 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1272로부터 프라이머 f tert 1584 -1607 및 r pmed Bgl cMSLN으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1318의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 45에 제시된다. 플라스미드 1318에 의해 코딩되는 아미노산 서열은 서열식별번호: 46에 제시된다.
플라스미드 1319 (cMSLN - EMC2A - Muc1 - ERB2A - Tert240). 플라스미드 1319를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 메소텔린 전구체 아미노산 37-597, EMCV 2A 펩티드, 및 인간 뮤신-1의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 플라스미드 1316으로부터 프라이머 f pmed Nhe cMSLN 및 r muc 986 - 963으로 증폭시켰다. 뮤신-1의 카르복시 말단 절반, ERBV 2A 펩티드, 및 인간 텔로머라제 아미노산 241-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1270으로부터 프라이머 f Muc 960 - 983 및 r pmed Bgl Ter240으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1319의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 47에 제시된다. 플라스미드 1319에 의해 코딩되는 아미노산 서열은 서열식별번호: 48에 제시된다.
플라스미드 1320 (cMSLN - T2A - Tert240 - ERB2A - Muc1). 플라스미드 1320을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 메소텔린 전구체 아미노산 37-597, TAV 2A 펩티드, 및 인간 텔로머라제의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 플라스미드 1274로부터 프라이머 f pmed Nhe cMSLN 및 r tert 1602 -1579로 증폭시켰다. 텔로머라제의 카르복시 말단 절반, ERBV 2A 펩티드, 및 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1271로부터 프라이머 f tert 1584 -1607 및 r pmed Bgl Muc로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1320의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 49에 제시된다. 플라스미드 1320에 의해 코딩되는 아미노산 서열은 서열식별번호: 50에 제시된다.
플라스미드 1321 (Tert240 - T2A - cMSLN - EMC2A - Muc1). 플라스미드 1321을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 플라스미드 1112로부터 프라이머 f pmed Nhe Ter240 및 r tert 1602 -1579로 증폭시켰다. 텔로머라제의 카르복시 말단 절반, TAV 2A 펩티드, 및 인간 메소텔린 전구체의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 플라스미드 1272로부터 프라이머 f tert 1584 -1607 및 r MSLN 1051 - 1033으로 증폭시켰다. 인간 메소텔린 전구체의 카르복시 말단 절반, EMCV 2A 펩티드, 및 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1316으로부터 프라이머 f MSLN 1028 -1051 및 r pmed Bgl Muc로 증폭시켰다. 3개의 부분적으로 중첩하는 앰플리콘을 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1321의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 51에 제시된다. 플라스미드 1321에 의해 코딩되는 아미노산 서열은 서열식별번호: 52에 제시된다.
플라스미드 1322 (Tert240 - ERB2A - Muc1 - EMC2A - cMSLN). 플라스미드 1322를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 텔로머라제 아미노산 241-1132, ERBV 2A 펩티드, 및 인간 뮤신-1의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 플라스미드 1271로부터 프라이머 f pmed Nhe Ter240 및 r muc 986 - 963으로 증폭시켰다. 뮤신-1의 카르복시 말단 절반, EMCV 2A 펩티드, 및 인간 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1313으로부터 프라이머 f Muc 960 - 983 및 r pmed Bgl cMSLN으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1322의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 53에 제시된다. 플라스미드 1322에 의해 코딩되는 아미노산 서열은 서열식별번호: 54에 제시된다.
플라스미드 1351 (Muc1 - EMC2A - cMSLN - T2A - Tert541). 플라스미드 1351을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 뮤신-1 아미노산 2-225, 946-1255, EMCV 2A 펩티드, 및 인간 메소텔린 전구체를 코딩하는 유전자를 PCR에 의해 플라스미드 1313으로부터 프라이머 f pmed Nhe Muc 및 r T2A Bamh cMSLN으로 증폭시켰다. TAV 2A 펩티드 및 인간 텔로머라제 아미노산 541-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1330으로부터 프라이머 f1 T2A Tert d541, f2 T2A, 및 r pmed Bgl Ter240으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1351의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 55에 제시된다. 플라스미드 1351에 의해 코딩되는 아미노산 서열은 서열식별번호: 56에 제시된다.
플라스미드 1352 (cMSLN - EMC2A - Muc1 - ERB2A - Tert541). 플라스미드 1352를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 메소텔린 전구체 아미노산 37-597, EMCV 2A 펩티드, 및 인간 뮤신-1을 코딩하는 유전자를 PCR에 의해 플라스미드 1316으로부터 프라이머 f pmed Nhe cMSLN 및 r ERB2A Bamh Muc로 증폭시켰다. ERBV 2A 펩티드 및 인간 텔로머라제 아미노산 541-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1330으로부터 프라이머 f1 ERBV2A Tert d541, f2 ERBV2A, 및 r pmed Bgl Ter240으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1352의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 57에 제시된다. 플라스미드 1352에 의해 코딩되는 아미노산 서열은 서열식별번호: 58에 제시된다.
플라스미드 1353 (cMSLN - T2A - Tert541 - ERB2A - Muc1). 플라스미드 1353을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f pmed Nhe cMSLN, r2 T2A, 및 r T2A Bamh cMSLN으로 증폭시켰다. TAV 2A 펩티드, 인간 텔로머라제 아미노산 541-1132, ERBV 2A 펩티드, 및 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1271로부터 프라이머 f1 T2A Tert d541 및 r pmed Bgl Muc로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1353의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 59에 제시된다. 플라스미드 1353에 의해 코딩되는 아미노산 서열은 서열식별번호: 60에 제시된다.
플라스미드 1354 (Muc1 - EMC2A - cMSLN - T2A -Tert342). 플라스미드 1354를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 뮤신-1 아미노산 2-225, 946-1255, EMCV 2A 펩티드, 및 인간 메소텔린 전구체를 코딩하는 유전자를 PCR에 의해 플라스미드 1313으로부터 프라이머 f pmed Nhe Muc 및 r T2A Bamh cMSLN으로 증폭시켰다. TAV 2A 펩티드 및 인간 텔로머라제 아미노산 342-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1326으로부터 프라이머 f1 T2A Tert d342, f2 T2A, 및 r pmed Bgl Ter240으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1354의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 61에 제시된다. 플라스미드 1354에 의해 코딩되는 아미노산 서열은 서열식별번호: 62에 제시된다.
플라스미드 1355 (cMSLN - EMC2A - Muc1 - ERB2A -Tert342). 플라스미드 1355를 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 메소텔린 전구체 아미노산 37-597, EMCV 2A 펩티드, 및 인간 뮤신-1을 코딩하는 유전자를 PCR에 의해 플라스미드 1316으로부터 프라이머 f pmed Nhe cMSLN 및 r ERB2A Bamh Muc로 증폭시켰다. ERBV 2A 펩티드, 및 인간 텔로머라제 아미노산 342-1132를 코딩하는 유전자를 PCR에 의해 플라스미드 1326으로부터 프라이머 f1 ERBV2A Ter d342, f2 ERBV2A, 및 r pmed Bgl Ter240으로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1355의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 63에 제시된다. 플라스미드 1355에 의해 코딩되는 아미노산 서열은 서열식별번호: 64에 제시된다.
플라스미드 1356 (cMSLN - T2A - Tert342 - ERB2A - Muc1). 플라스미드 1356을 PCR 및 심리스 클로닝의 기술을 사용하여 구축하였다. 먼저, 인간 메소텔린 전구체 아미노산 37-597을 코딩하는 유전자를 PCR에 의해 플라스미드 1103으로부터 프라이머 f pmed Nhe cMSLN, r2 T2A, 및 r T2A Bamh cMSLN으로 증폭시켰다. TAV 2A 펩티드, 인간 텔로머라제 아미노산 342-1132, ERBV 2A 펩티드, 및 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 플라스미드 1271로부터 프라이머 f1 T2A Tert d342 및 r pmed Bgl Muc로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 생성된 클론 #3은 비의도된 단일 염기 돌연변이를 함유하였다. 돌연변이를 교정하기 위해, PCR 및 심리스 클로닝을 클론 #3을 주형으로서 사용하여 반복하였다. 인간 메소텔린 전구체 아미노산 37-597, TAV 2A 펩티드, 및 인간 텔로머라제의 아미노 말단 절반을 코딩하는 유전자를 PCR에 의해 클론 #3으로부터 프라이머 f pmed Nhe cMSLN 및 r tert 1602 -1579로 증폭시켰다. 텔로머라제의 카르복시 말단 절반, ERBV 2A 펩티드, 및 인간 뮤신-1 아미노산 2-225, 946-1255를 코딩하는 유전자를 PCR에 의해 클론 #3으로부터 프라이머 f tert 1584 -1607 및 r pmed Bgl Muc로 증폭시켰다. 부분적으로 중첩하는 앰플리콘을 Dpn I로 소화시키고, 함께 혼합하고, 심리스 클로닝에 의해 pPJV7563의 Nhe I / Bgl II 부위 내로 클로닝하였다. 플라스미드 1356의 오픈 리딩 프레임 뉴클레오티드 서열은 서열식별번호: 65에 제시된다. 플라스미드 1356에 의해 코딩되는 아미노산 서열은 서열식별번호: 66에 제시된다.
1D. 벡터 구축
단일 또는 다중-항원 구축물을 발현시키기 위한 벡터를 침팬지 아데노바이러스 Ad68 게놈 서열로부터 구축하였다. 트랜스진이 없는 AdC68 백본 ("비어있는 벡터"로 지칭됨)의 3가지 버전을 인 실리코 디자인하였다. 벡터는 단지 바이러스 내로 조작되어 그들을 복제에 부적격이 되게 하고 트랜스진 삽입에 대한 공간을 생성한 E1 및 E3 결실의 정도에서만 상이하였다. 벡터 AdC68W 및 AdC68X는 국제 특허 출원 WO2015/063647A1에 기재되었다. 염기 456-3256 및 27476-31831의 결실을 운반하는 벡터 AdC68Y를 AdC68X에 비해 개선된 성장 특성 및 AdC68W보다 더 큰 트랜스진 운반능을 갖도록 조작하였다. 모든 3가지 비어있는 벡터를 시험관내 올리고 합성 및 에스케리키아 콜라이(Escherichia coli) (이. 콜라이(E. coli)) 및 효모에서의 후속의 재조합-매개된 중간체 어셈블리를 이용하여 다중-단계 공정으로 생화학적으로 합성하였다. 다양한 면역원성 TAA 폴리펩티드를 코딩하는 오픈 리딩 프레임을 PCR에 의해 실시예에 기재된 플라스미드로부터 증폭시켰다. 그 후, 오픈 리딩 프레임을 비어있는 벡터 바크미드 내로 삽입하였다. 재조합 바이러스 게놈을 PacI로의 소화에 의해 바크미드로부터 방출시키고, 선형화된 핵산을 E1 보충 부착 HEK293 세포주 내로 형질감염시켰다. 가시적 세포변성 효과 및 아데노바이러스 초점 형성 시, 배양물을 다중 라운드의 동결/해동에 의해 수확하여 세포로부터 바이러스를 방출시켰다. 바이러스를 증폭시키고, 표준 기술에 의해 정제하였다.
실시예 2. 면역원성 MUC1 단일-항원 구축물의 면역원성
HLA-A2/DR1 마우스에서의 연구
연구 디자인. 12마리의 혼합 성별 HLA-A2/DR1 마우스를 PMED 방법을 사용하여 DNA 구축물 플라스미드 1027 (서열식별번호: 8의 막-결합된 면역원성 MUC1 폴리펩티드를 코딩함) 또는 플라스미드 1197 (서열식별번호: 16의 세포액성 면역원성 MUC1 폴리펩티드를 코딩함)로 제0일에 프라이밍하고, 제14일에 부스팅하였다. 제21일에, 마우스를 희생시키고, 비장세포를 인터페론-감마 (IFN-γ) 엘리스폿(ELISpot) 및 세포내 시토카인 염색 (ICS) 검정에서 MUC1-특이적 세포성 면역원성에 대해 평가하였다.
입자 매개 표피 전달 (PMED). PMED는 핵산 백신을 동물 또는 인간에게 투여하는 무-침 방법이다. PMED 시스템은 DNA를 현미경적 금 입자 상으로 침전시킨 후, 헬륨 기체에 의해 표피 내로 추진시키는 것을 포함한다. 단일 사용 장치인 ND10은 내부 실린더로부터 가압된 헬륨을 사용하여 금 입자를 전달하고, 리피터 전달 장치인 X15는 고압 호스를 통해 X15에 연결된 외부 헬륨 탱크를 사용하여 금 입자를 전달한다. 이들 장치 둘 다는 MUC1 DNA 플라스미드를 전달하기 위한 연구에 사용되었다. 금 입자는 통상적으로 직경이 1-3 μm였으며, 입자는 1 mg의 금 입자당 2 μg의 항원 DNA 플라스미드를 함유하도록 제제화되었다. (Sharpe, M. et al.: P. Protection of mice from H5N1 influenza challenge by prophylactic DNA vaccination using particle mediated epidermal delivery. Vaccine, 2007, 25(34): 6392-98: Roberts LK, et al.: Clinical safety and efficacy of a powdered Hepatitis B nucleic acid vaccine delivered to the epidermis by a commercial prototype device. Vaccine, 2005; 23(40):4867-78).
IFN-γ 엘리스폿 검정. 개별적 동물로부터의 비장세포를 IFN-γ 엘리스폿 플레이트에서 개별적 Ag-특이적 펩티드 (2-10 ug/ml의 각각의 펩티드, 웰당 2.5-5e5 세포) 또는 15mer Ag-특이적 펩티드의 풀 (11개의 아미노산에 의해 중첩함, 전체 Ag-특이적 아미노산 서열을 커버함; 2-5 ug/ml의 각각의 펩티드, 웰당 1.25-5e5 세포)로 삼중으로 공동-인큐베이션하였다 (또한 펩티드 풀 표 (표 18), 및 표 15-17 참조). 플레이트를 37℃, 5% CO2에서 ~16시간 동안 인큐베이션한 후, 세척하고, 제조자의 지시서에 따라 성장시켰다. IFN-γ 스폿 형성 세포 (SFC)의 수를 CTL 판독기로 카운팅하였다. 삼중실험의 평균을 계산하고, 펩티드를 함유하지 않은 음성 대조군 웰의 반응을 차감하였다. 그 후, SFC 카운트를 정규화하여 1e6 비장세포당 반응을 기재하였다. 표에서의 항원-특이적 반응은 Ag-특이적 펩티드 또는 펩티드 풀에 대한 반응의 합계를 나타낸다.
ICS 검정. 개별적 동물로부터의 비장세포를 U-바닥 96-웰-플레이트 조직 배양 플레이트에서 H-2b-, HLA-A2-, 또는 HLA-A24-제한된 Ag-특이적 펩티드 (각각 5-10 ug/ml의 펩티드, 웰당 1-2e6 비장세포) 또는 15mer Ag-특이적 펩티드의 풀 (11개의 아미노산에 의해 중첩함, 전체 Ag-특이적 아미노산 서열을 커버함; 2-5 ug/ml의 각각의 펩티드, 웰당 1-2e6 비장세포)과 공동-인큐베이션하였다 (또한 펩티드 풀 표 (표 18) 및 표 15-17 참조). 플레이트를 37℃, 5% CO2에서 ~16시간 동안 인큐베이션하였다. 그 후, 세포를 염색하여 CD8+ T 세포로부터의 세포내 IFN-γ 발현을 검출하고, 고정시켰다. 세포를 유동 세포측정기 상에서 획득하였다. 펩티드를 함유하지 않은 음성 대조군 웰에서 얻어진 반응의 차감 후에, 데이터를 동물당 펩티드(들) Ag- 또는 펩티드 풀 Ag-특이적 IFN-γ+ CD8+ T 세포의 빈도로서 제시하였다.
샌드위치 ELISA 검정. 표준 샌드위치 ELISA 검정을 테칸 에보 (Tecan Evo), 바이오멕(Biomek) FxP, 및 바이오텍(BioTek) 405 셀렉트(Select) TS 자동화 기기를 사용하여 수행하였다. 384 웰 마이크로플레이트 (평평-웰, 고 결합)를 1X PBS 중 1.0 μg/mL 인간 MUC1 또는 인간 MSLN 단백질 (항원)로 25 μl/웰에서 코팅하고, 4℃에서 밤새 인큐베이션하였다. 다음날 아침에, 플레이트를 실온에서 1시간 동안 0.05% 트윈 20을 갖는 PBS (PBS-T) 중 5% FBS로 차단하였다. 마우스 혈청을 96 U-바닥 웰 플레이트에서 PBS-T에서 1/100 출발 희석에서 제조하였다. 테칸 에보를 9개의 희석 증분 점에 걸쳐 PBS-T에서 ½ 로그 연속 희석을 수행한 후, 25 μl/웰의 희석된 혈청을 96 웰 플레이트에서 384 웰 플레이트로 스탬핑하였다. 384 웰 플레이트를 진탕기 상에서 600 RPM에서 실온에서 1시간 동안 인큐베이션한 후, 바이오텍 EL 405 셀렉트 TS 플레이트 세척기를 사용하여, 플레이트를 PBS-T에서 4회 세척하였다. 2차 마우스 항-IgG-HRP 항체를 적절한 희석으로 희석하고, 바이오멕 FxP에 의해 25 μl/웰에서 384 웰 플레이트 내로 스탬핑하고, 진탕기 상에서 600 RPM에서 실온에서 1시간 동안 인큐베이션한 후, 5회 반복 세척하였다. 바이오멕 FxP를 사용하여, 플레이트를 RT TMB 기질의 25 μl/웰에서 스탬핑하고, 암흑에서 실온에서 30분 동안 인큐베이션한 후, 1N H2SO4 산을 25 μl/웰 스탬핑하여 효소 반응을 정지시켰다. 플레이트를 몰리큘라 디바이시즈(Molecular Devices), 스펙트라맥스(Spectramax) 340PC/384 플러스 상에서 450 nm 파장에서 판독하였다. 데이터를 99.0의 검출 한계로 1.0의 OD에서 계산된 역가로서 보고하였다. 항원-특이적 상업적 모노클로날 항체를 플레이트-대-플레이트 변동 성능을 추적하기 위한 양성 대조군으로서 각각의 플레이트에 사용하고; 무관한 백신화된 마우스 혈청을 음성 대조군으로서 사용하고, PBS-T 유일 웰을 비-특이적 결합 배경을 모니터링하기 위해 사용하였다. 표에서의 역가는 개별적 동물로부터 도출된 항원-특이적 IgG 역가를 나타낸다.
결과. 표 1은 각각 MUC1 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 표 15 및 18 참조) 또는 MUC1 펩티드 aa516-530으로 배양된 HLA-A2/DR1 비장세포로부터의 엘리스폿 및 ICS 데이터를 나타낸다. 칼럼 3에서의 수는 MUC1 펩티드 풀로의 재자극, 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 칼럼 4에서의 수는 MUC1 펩티드 aa516-530으로의 재자극 및 배경 차감 후의 IFN-γ+인 CD8+ T 세포의 빈도를 나타낸다. 양성 반응은 SFC >100 및 IFN-γ+ CD8+ T 세포의 빈도 >0.05%를 갖는 것으로서 정의된다. 표 1에 나타난 바와 같이, 상기 실시예 1A에 기재된 전장 막-결합된 (플라스미드 1027) 및 세포액성 (플라스미드 1197) MUC1 구축물로 제조된 면역원성 MUC1 폴리펩티드는 HLA-A2-제한된 MUC1 펩티드 aa516-530-특이적 CD8+ T 세포 반응을 포함한 MUC1-특이적 T 세포 반응을 유도할 수 있다. 세포액성 MUC1 항원 형태는 가장 높은 규모의 T 세포 반응을 유도하였다. 중요하게는, MUC1 펩티드 aa516-530에 대한 암 환자로부터 유래된 T 세포 반응은 시험관내에서의 항-종양 효능과 상관되는 것으로 나타났으며 (Jochems C et al., Cancer Immunol Immunother (2014) 63:161-174), 이는 이 특이적 에피토프에 대한 세포성 반응을 일으키는 것의 중요성을 입증한다.
표 1. HLA-A2/DR1 마우스에서의 단일-항원 MUC1 DNA 구축물 (플라스미드 1027 및 플라스미드 1197)에 의해 유도된 T 세포 반응
HLA-A24 마우스에서의 연구
연구 디자인. 혼합 성별 HLA-A24 마우스를 DNA 구축물 플라스미드 1027로 PMED 투여에 의해 제0일에 프라이밍하고, 제14일, 제28일 및 제42일에 부스팅하였다. 제21일에, 마우스를 희생시키고, 비장세포를 MUC1-특이적 세포성 면역원성 (엘리스폿)에 대해 평가하였다.
결과. 표 2는 MUC1 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 15 참조)로 배양된 HLA-A24 비장세포로부터의 엘리스폿 데이터를 나타낸다. 칼럼 3에서의 수는 MUC1 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 볼드체의 수는 시험된 적어도 1개의 펩티드 풀이 너무 많아서 카운팅할 수 없었으며, 따라서, 진정한 수치는 적어도 언급된 값임을 지시한다. 양성 반응은 SFC >100을 갖는 것으로서 정의된다. 표 2에 나타난 바와 같이, 막-결합된 MUC1 구축물은 MUC1-특이적 세포성 반응을 유도할 수 있다.
표 2. HLA-A24 마우스에서의 인간 천연 전장 막-결합된 MUC1 항원을 코딩하는 단일-항원 DNA 구축물 플라스미드 1027에 의해 유도된 T 세포 반응
원숭이에서의 연구
연구 디자인. 14마리의 차이니즈 시노몰구스 마카크를 세포액성 (플라스미드 1197) 또는 전장 막-결합된 MUC1 항원 (플라스미드 1027)을 코딩하는 AdC68W 아데노바이러스 벡터로 2e11 바이러스 입자로 양측 근육내 주사 (총 1 mL)에 의해 프라이밍하였다. 29일 후에, 동물을 전기천공을 통해 근육내로 양측으로 전달된 (총 2 mL) 세포액성 또는 전장 막-결합된 MUC1 항원을 코딩하는 DNA로 부스팅하였다. 항-CTLA-4를 제1일 (32 mg) 및 제29일 (50 mg)에 피하로 투여하였다. 마지막 면역화 후 14일에, 동물을 채혈하고, PBMC 및 혈청을 단리하여 각각 MUC1-특이적 세포성 (엘리스폿, ICS) 및 체액성 (ELISA) 반응을 평가하였다.
NHP-특이적 면역 검정.
엘리스폿 검정. 개별적 동물로부터의 PBMC를 IFN-y 엘리스폿 플레이트에서 15mer Ag-특이적 펩티드의 풀 (11개의 아미노산에 의해 중첩함, 전체 Ag-특이적 아미노산 서열을 커버함)과 2 ug/ml의 각각의 펩티드, 웰당 4e5 세포로 이중으로 공동-인큐베이션하였다 (또한 펩티드 풀 표 (표 18) 및 표 15-17 참조). 플레이트를 37℃, 5% CO2에서 ~16시간 동안 인큐베이션한 후, 세척하고, 제조자의 지시서에 따라 성장시켰다. IFN-γ 스폿 형성 세포 (SFC)의 수를 CTL 판독기로 카운팅하였다. 이중실험의 평균을 계산하고, 펩티드를 함유하지 않은 음성 대조군 웰의 반응을 차감하였다. 그 후, SFC 카운트를 정규화하여 1e6 PBMC당 반응을 기재하였다. 표에서의 항원-특이적 반응은 Ag-특이적 펩티드 풀에 대한 반응의 합계를 나타낸다.
ICS 검정. 개별적 동물로부터의 PBMC를 U-바닥 96-웰-플레이트 조직 배양 플레이트에서 15mer MUC1 펩티드의 풀 (11개의 아미노산에 의해 중첩함, 전체 천연 전장 MUC1 아미노산 서열을 커버함; 표 15 참조)과 2 ug/mL의 각각의 펩티드, 웰당 1.5-2e6 PBMC로 공동-인큐베이션하였다. 플레이트를 37℃, 5% CO2에서 ~16시간 동안 인큐베이션한 후, 염색하여 CD8 T 세포로부터의 세포내 IFN-γ 발현을 검출하였다. 고정 후, 세포를 유동 세포측정기 상에서 획득하였다. 결과를 펩티드를 함유하지 않은 음성 대조군 웰에서 얻어진 반응의 차감 후에 개별적 동물당 MUC1, MSLN, 또는 TERT-특이적 IFN-γ+ CD8+ T 세포의 수로서 제시하고, 1e6 CD8+ T 세포에 대해 정규화하였다.
샌드위치 ELISA 검정. 표준 샌드위치 ELISA 검정을 테칸 에보, 바이오멕 FxP, 및 바이오텍 405 셀렉트 TS 자동화 기기를 사용하여 수행하였다. 384 웰 마이크로플레이트 (평평-웰, 고 결합)를 25 μl/웰에서 1X PBS 중 1.0 μg/mL 인간 MUC1 또는 인간 MSLN 단백질 (항원)로 코팅하고, 4℃에서 밤새 인큐베이션하였다. 다음날 아침에, 플레이트를 실온에서 1시간 동안 0.05% 트윈 20을 갖는 PBS (PBS-T) 중 5% FBS로 차단하였다. 차이니즈 시노몰구스 마카크로부터의 혈청을 96 U-바닥 웰 플레이트에서 PBS-T에서 1/100 출발 희석에서 제조하였다. 테칸 에보를 9개의 희석 증분 점에 걸쳐 PBS-T에서 ½ 로그 연속 희석을 수행한 후, 25 μl/웰의 희석된 혈청을 96 웰 플레이트에서 384 웰 플레이트로 스탬핑하였다. 384 웰 플레이트를 진탕기 상에서 600 RPM에서 실온에서 1시간 동안 인큐베이션한 후, 바이오텍 EL 405 셀렉트 TS 플레이트 세척기를 사용하여, 플레이트를 PBS-T에서 4회 세척하였다. 시노몰구스 IgG와 교차-반응시킨 2차 레수스 항-IgG-HRP 항체를 적절한 희석으로 희석하고, 바이오멕 FxP에 의해 25 μl/웰에서 384 웰 플레이트 내로 스탬핑하고, 진탕기 상에서 600 RPM에서 실온에서 1시간 동안 인큐베이션한 후, 5회 반복 세척하였다. 바이오멕 FxP를 사용하여, 플레이트를 RT TMB 기질의 25 μl/웰에서 스탬핑하고, 암흑에서 실온에서 30분 동안 인큐베이션한 후, 1N H2SO4 산을 25 μl/웰 스탬핑하여 효소 반응을 정지시켰다. 플레이트를 몰리큘라 디바이시즈, 스펙트라맥스 340PC/384 플러스 상에서 450 nm 파장에서 판독하였다. 데이터를 99.0의 검출 한계로 1.0의 OD에서 계산된 역가로서 보고하였다. 항원-특이적 상업적 모노클로날 항체를 플레이트-대-플레이트 변동 성능을 추적하기 위한 양성 대조군으로서 각각의 플레이트에 사용하고; 무관한 백신화된 마우스 혈청을 음성 대조군으로서 사용하고, PBS-T 유일 웰을 비-특이적 결합 배경을 모니터링하기 위해 사용하였다. 표에서의 역가는 개별적 동물로부터 도출된 항원-특이적 IgG 역가를 나타낸다.
결과. 표 3은 MUC1 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 15 참조)과 배양된 차이니즈 시노몰구스 마카크의 PBMC로부터의 엘리스폿 및 ICS 데이터, 및 차이니즈 시노몰구스 마카크의 혈청으로부터의 ELISA 데이터를 나타낸다. 칼럼 3에서의 수는 MUC1 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 PBMC를 나타낸다. 칼럼 4에서의 수는 MUC1 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ+ CD8+ T 세포/106 CD8+ T 세포를 나타낸다. 칼럼 5에서의 수는 항-MUC1 IgG 역가 (광학 밀도 (O.D) =1, 검출 한계 (L.O.D) = 99.0)를 나타낸다. 양성 반응은 SFC > 50, IFN-γ+ CD8+ T 세포 / 1e6 CD8+ T 세포 >50, 및 IgG 역가 >99를 갖는 것으로서 정의된다. 표 3에 나타난 바와 같이, 세포액성 (1197) 및 천연 전장 막-결합된 (1027) MUC1 구축물로 제조된 면역원성 MUC1 폴리펩티드는 MUC1-특이적 T 및 B 세포 반응을 유도할 수 있다. 천연 전장 막-결합된 MUC1 구축물 (1027)은 전체적인 가장 양호한 MUC1-특이적 세포성 및 체액성 반응을 유도하는 것으로 나타났다.
표 3. 차이니즈 시노몰구스 마카크에서의 단일-항원 아데노바이러스 AdC68W 및 단일-항원 DNA 구축물 (플라스미드 1197; 플라스미드 1027)에 의해 유도된 T 및 B 세포 반응
실시예 3. MSLN 단일-항원 구축물의 면역원성
파스퇴르 (HLA-A2/DR1) 마우스에서의 면역 반응 연구
연구 디자인. 12마리의 암컷 HLA-A2/DR1 마우스를 막-결합된 (플라스미드 1084) 또는 세포액성 MSLN 항원 (플라스미드 1103)을 코딩하는 AdC68W 아데노바이러스 벡터로 1e10 바이러스 입자로 근육내 주사 (50 ul)에 의해 프라이밍하였다. 28일 후에, 동물을 실시예 2에 기재된 바와 같은 PMED 방법을 사용하여 면역원성 MSLN 폴리펩티드를 코딩하는 DNA 단일-항원 구축물로 부스팅하였다. 항원-특이적 T 세포 반응을 7일 후에 IFN-γ 엘리스폿 및 ICS 검정에서 측정하였다.
결과. 표 4는 각각 MSLN 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 16 참조), 또는 MSLN 펩티드 aa50-64, aa102-116, 및 aa542-556과 배양된 HLA-A2/DR1 비장세포로부터의 엘리스폿 및 ICS 데이터를 나타낸다. 칼럼 3에서의 수는 MSLN 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 칼럼 4에서의 수는 MSLN 펩티드 aa50-64, aa102-116 및 aa542-556으로의 재자극, 및 배경 차감 후의 IFN-γ+인 CD8+ T 세포의 빈도를 나타낸다. 양성 반응은 SFC >100 및 IFN-γ+ CD8+ T 세포의 빈도 >0.05%를 갖는 것으로서 정의된다. 표 4에 나타난 바와 같이, 상기 실시예 1A에 기재된 막-결합된 (1084) 및 세포액성 (1103) MSLN 구축물로 제조된 면역원성 MSLN 폴리펩티드는 MSLN-특이적 T 세포 반응을 유도할 수 있다. 세포액성 MSLN 항원 형태는 가장 높은 규모의 MSLN-특이적 T 세포 반응을 유도하였다.
표 4. HLA-A2/DR1 마우스에서의 단일-항원 아데노바이러스 AdC68W 및 단일-항원 DNA 구축물에 의해 유도된 T 세포 반응
HLA A24 마우스에서의 면역 반응 연구
연구 디자인. 12마리의 혼합-성별 HLA-A24 마우스를 막-결합된 (1084) 또는 세포액성 MSLN (1103) DNA 구축물로 PMED 방법을 사용하여 프라임/부스트/부스트/부스트 처방으로, 각각의 백신접종 사이에 2주 떨어져서 면역화하였다. MSLN-특이적 T 세포 반응을 마지막 면역화 후 7일에 IFN-γ 엘리스폿 및 ICS 검정에서 측정하였다.
결과. 표 5는 각각 MSLN 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 16 참조) 또는 MSLN 펩티드 aa130-144 및 aa230-244와 배양된 HLA-A24 비장세포로부터의 엘리스폿 및 ICS 데이터를 나타낸다. 칼럼 3에서의 수는 MSLN 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 칼럼 4에서의 수는 MSLN 펩티드 aa130-144 및 aa230-244로의 재자극, 및 배경 차감 후의 IFN-γ+인 CD8+ T 세포의 빈도를 나타낸다. 양성 반응은 SFC >100 및 IFN-γ+ CD8+ T 세포의 빈도 >0.05%를 갖는 것으로서 정의된다. 표 5에 나타난 바와 같이, 막-결합된 (1084) 및 세포액성 MSLN (1103) 구축물로 제조된 면역원성 MSLN 폴리펩티드는 MSLN-특이적 T 세포 반응을 유도할 수 있다. 세포액성 MSLN 항원 형태는 가장 높은 규모의 MSLN-특이적 T 세포 반응을 유도하였다.
표 5. HLA-A24 마우스에서의 단일-항원 DNA 구축물에 의해 유도된 T 세포 반응
원숭이에서의 면역 반응 연구
연구 디자인. 14마리의 차이니즈 시노몰구스 마카크를 막-결합된 (플라스미드 1084) 또는 세포액성 MSLN 항원 (플라스미드 1103)을 코딩하는 AdC68W 아데노바이러스 벡터로 2e11 바이러스 입자로 양측 근육내 주사 (총 1 mL)에 의해 프라이밍하였다. 29일 후에, 동물을 전기천공을 통해 근육내로 양측으로 전달된 (총 2 mL) 막-결합된 (1084) 또는 세포액성 MSLN 항원 (1103)을 코딩하는 DNA로 부스팅하였다. 항-CTLA-4를 제1일 (32 mg) 및 제29일 (50 mg)에 피하로 투여하였다. 마지막 면역화 후 14일에, 동물을 채혈하고, PBMC 및 혈청을 단리하여 각각 MSLN-특이적 세포성 (엘리스폿, ICS) 및 체액성 (ELISA) 반응을 평가하였다.
결과. 표 6은 MSLN 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 16 참조)과 배양된 차이니즈 시노몰구스 마카크의 PBMC로부터의 엘리스폿 및 ICS 데이터, 및 차이니즈 시노몰구스 마카크의 혈청으로부터의 ELISA 데이터를 나타낸다. 칼럼 3에서의 수는 MSLN 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 칼럼 4에서의 수는 MSLN 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ+ CD8+ T 세포/106 CD8+ T 세포를 나타낸다. 칼럼 5에서의 수는 항-MSLN IgG 역가 (광학 밀도 (O.D) =1, 검출 한계 (L.O.D) = 99.0)를 나타낸다. 양성 반응은 SFC >50, IFN-γ+ CD8+ T 세포 / 1e6 CD8+ T 세포 >50, 및 IgG 역가 >99를 갖는 것으로서 정의된다. 표 6에 나타난 바와 같이, 막-결합된 (1084) 및 세포액성 (1103) MSLN 구축물로 제조된 면역원성 MSLN 폴리펩티드는 MSLN-특이적 T 및 B 세포 반응을 유도할 수 있다. 세포질성 MSLN 구축물 (플라스미드 1103)은 가장 강한 MSLN-특이적 세포성 반응을 유도하는 것으로 나타났으며; 대조적으로, 막-결합된 MSLN 구축물 (플라스미드 1084)은 가장 강한 MSLN-특이적 체액성 반응을 유도하는 것으로 나타났다.
표 6. 차이니즈 시노몰구스 마카크에서의 단일-항원 아데노바이러스 AdC68W 및 단일-항원 DNA 구축물에 의해 유도된 T 및 B 세포 반응
실시예 4. TERT 단일-항원 구축물의 면역원성
파스퇴르 마우스에서의 면역 반응 연구
연구 디자인. 6마리의 혼합 성별 HLA-A2/DR1 마우스를 말단절단된 (Δ240) 세포액성 면역원성 TERT 폴리펩티드 (플라스미드 1112)를 코딩하는 AdC68W 아데노바이러스 벡터로 1e10 바이러스 입자로 근육내 주사 (50 ul)에 의해 프라이밍하였다. 28일 후에, 동물을 말단절단된 (Δ240) 세포액성 TERT 항원 (플라스미드 1112)을 코딩하는 전기천공을 통해 양측으로 전달된 50 ug DNA (2x20ul)로 근육내로 부스팅하였다. 항원-특이적 T 세포 반응을 7일 후에 IFN-γ 엘리스폿 및 ICS 검정에서 측정하였다.
결과. 표 7은 각각 TERT 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 17 참조) 또는 TERT 펩티드 aa861-875와 배양된 HLA-A2/DR1 비장세포로부터의 엘리스폿 및 ICS 데이터를 나타낸다. 칼럼 3에서의 수는 TERT 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 칼럼 4에서의 수는 TERT 펩티드 aa861-875로의 재자극 및 배경 차감 후의 IFN-γ+인 CD8+ T 세포의 빈도를 나타낸다. 양성 반응은 SFC >100 및 IFN-γ+ CD8+ T 세포의 빈도 >0.05%를 갖는 것으로서 정의된다. 표 7에 나타난 바와 같이, 상기 실시예 1A에 기재된 말단절단된 (Δ240) 세포액성 TERT 구축물로 제조된 면역원성 TERT 폴리펩티드는 HLA-A2-제한된 TERT-특이적 CD8 T 세포 반응을 유도할 수 있다.
표 7. HLA-A2/DR1 마우스에서의 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 단일-항원 아데노바이러스 AdC68W 및 단일-항원 DNA 구축물 (플라스미드 1112)에 의해 유도된 T 세포 반응
HLA A24 마우스에서의 면역 반응 연구
연구 디자인. 8마리의 혼합 성별 HLA-A24 마우스를 말단절단된 (Δ240) 세포액성 TERT 항원 (플라스미드 1112)을 코딩하는 AdC68W 아데노바이러스 벡터로 총 1e10 바이러스 입자로 양측 근육내 주사 (각각의 앞정강근 내로 50 ul)에 의해 프라이밍하였다. 14일 후에, 동물을 말단절단된 (Δ240) 세포액성 TERT 항원 (플라스미드 1112)을 코딩하는 전기천공을 통해 양측으로 전달된 (2x20 ul) 50 ug DNA로 근육내로 부스팅하였다. 항원-특이적 T 세포 반응을 7일 후에 IFN-γ 엘리스폿 및 ICS 검정에서 측정하였다.
결과. 표 8은 각각 TERT 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 17 참조) 또는 TERT 펩티드 aa841-855와 배양된 HLA-A24 비장세포로부터의 IFN-γ 엘리스폿 및 ICS 데이터를 나타낸다. 칼럼 3에서의 수는 TERT 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 칼럼 4에서의 수는 TERT 펩티드 aa841-855로의 재자극, 및 배경 차감 후의 IFN-γ+인 CD8+ T 세포의 빈도를 나타낸다. 볼드체로의 수는 시험된 적어도 1개의 펩티드 풀이 너무 많아서 카운팅할 수 없었으며, 따라서 진정한 수치는 적어도 언급된 값임을 지시한다. 양성 반응은 SFC >100 및 IFN-γ+ CD8+ T 세포의 빈도 >0.1%를 갖는 것으로서 정의된다. 표 8에 나타난 바와 같이, 말단절단된 (Δ240) 세포액성 TERT (1112) 구축물로 제조된 면역원성 TERT 폴리펩티드는 HLA-A24-제한된 TERT-특이적 CD8+ T 세포 반응을 유도할 수 있다.
표 8. HLA-A24 마우스에서의 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 단일-항원 아데노바이러스 AdC68W 및 단일-항원 DNA 구축물 (플라스미드 1112)에 의해 유도된 T 세포 반응
원숭이에서의 면역 반응 연구
연구 디자인. 8마리의 차이니즈 시노몰구스 마카크를 말단절단된 (Δ240) 세포액성 TERT 항원 (플라스미드 1112)을 코딩하는 AdC68W 아데노바이러스 벡터로 2e11 바이러스 입자로 양측 근육내 주사 (총 1 mL)에 의해 프라이밍하였다. 30일 및 64일 후에, 동물을 전기천공을 통해 근육내로 양측으로 전달된 (총 2 mL) 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 DNA (플라스미드 1112)로 부스팅하였다. 항-CTLA-4를 제1일 (32 mg), 제31일 (50 mg) 및 제65일 (75 mg)에 피하로 투여하였다. 마지막 면역화 후 14일에, 동물을 채혈하고, PBMC를 단리하여 TERT-특이적 세포성 (엘리스폿, ICS) 반응을 평가하였다.
결과. 표 9는 TERT 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 17 참조)과 배양된 차이니즈 시노몰구스 마카크의 PBMC로부터의 엘리스폿 및 ICS 데이터를 나타낸다. 칼럼 3에서의 수는 TERT 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 칼럼 4에서의 수는 TERT 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ+ CD8+ T 세포/106 CD8+ T 세포를 나타낸다. 양성 반응은 SFC >50 및 IFN-γ+ CD8+ T 세포 / 1e6 CD8+ T 세포 >50을 갖는 것으로서 정의된다. 표 9에 나타난 바와 같이, 말단절단된 (Δ240) 세포액성 (플라스미드 1112) TERT 구축물로 제조된 면역원성 TERT 폴리펩티드는 TERT-특이적 T 세포 반응을 유도할 수 있다.
표 9. 차이니즈 시노몰구스 마카크에서의 TERT 단일-항원 아데노바이러스 AdC68W 및 TERT 단일-항원 DNA 구축물에 의해 유도된 T 세포 반응
실시예 5. 이중-항원 구축물의 면역원성
원숭이에서의 면역 반응 연구
연구 디자인. 24마리의 차이니즈 시노몰구스 마카크를 인간 천연 전장 막-결합된 MUC1 (MUC1) 및 인간 말단절단된 (Δ240) 세포액성 TERT (TERTΔ240) 항원을 코딩하는 이중-항원 아데노바이러스 AdC68W 벡터로 2e11 바이러스 입자로 양측 근육내 주사 (총 1 mL)에 의해 프라이밍하였다. 30일 및 64일 후에, 동물을 전기천공을 통해 근육내로 양측으로 전달된 (총 2 mL) 동일한 2개의 항원을 코딩하는 이중-항원 DNA 구축물 (플라스미드 1270, 1271, 및 1269)로 부스팅하였다. 항-CTLA-4를 제1일 (32 mg), 제31일 (50 mg) 및 제65일 (75 mg)에 피하로 투여하였다. 마지막 면역화 후 14일에, 동물을 채혈하고, PBMC 및 혈청을 단리하여 각각 MUC1- 및 TERT-특이적 세포성 (엘리스폿, ICS) 및 MUC1-특이적 체액성 (ELISA) 반응을 평가하였다. 전체적으로, 둘 다의 항원을 공동-발현한 3가지의 상이한 이중-항원 백신 구축물을 평가하였다: a) 2A 펩티드에 의해 연결된 MUC1 및 TERT를 코딩하는 MUC1-2A-TERTΔ240 (플라스미드 1270), AdC68W 벡터 및 DNA 플라스미드; b) 2A 펩티드에 의해 연결된 TERT 및 MUC1을 코딩하는 TERTΔ240-2A-MUC1 (플라스미드 1271), AdC68W 벡터 및 DNA 플라스미드; c) MUC1-TERT 융합 단백질을 코딩하는 MUC1-TERTΔ240 (플라스미드 1269), AdC68W 벡터 및 DNA 플라스미드 (또한 구축물 표 참조).
결과. 표 10은 MUC1 및 TERT 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 15 및 17 참조)과 배양된 차이니즈 시노몰구스 마카크의 PBMC로부터의 엘리스폿 및 ICS 데이터, 및 차이니즈 시노몰구스 마카크의 혈청으로부터의 ELISA 데이터를 나타낸다. 양성 반응은 SFC >50, IFN-γ+ CD8+ T 세포 / 1e6 CD8+ T 세포 >50, 및 IgG 역가 >99를 갖는 것으로서 정의된다. 칼럼 3 및 6에서의 수는 각각 MUC1 및 TERT 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 볼드체로의 수는 시험된 적어도 1개의 펩티드 풀이 너무 많아서 카운팅할 수 없었으며, 따라서 진정한 수치는 적어도 언급된 값임을 지시한다. 칼럼 4 및 7에서의 수는 각각 MUC1 펩티드 풀 및 TERT 펩티드 풀로의 재자극, 및 배경 차감 후의 # IFN-γ+ CD8+ T 세포/106 CD8+ T 세포를 나타낸다. 칼럼 5에서의 수는 항-MUC1 IgG 역가 (광학 밀도 (O.D) =1, 검출 한계 (L.O.D) = 99.0)를 나타낸다. 표 10에 나타난 바와 같이, MUC1- 및 TERT-발현 이중-항원 구축물 (플라스미드 1270, 1271, 및 1269)로 제조된 면역원성 MUC1 및 TERT 폴리펩티드는 MUC1- 및 TERT-특이적 T 세포 반응, 및 MUC1-특이적 B 세포 반응을 유도할 수 있다. MUC1-TERT 융합 단백질을 코딩하는 이중-항원 구축물 1269는 가장 강한 전체적 MUC1-특이적 세포성 반응을 유도하는 것으로 나타났으며; 대조적으로, 이중-항원 구축물 플라스미드 1271 (TERT-2A-MUC1)은 가장 강한 전체적 TERT-특이적 세포성 반응을 유도하는 것으로 나타났다. 모든 3가지 이중-항원 구축물은 필적하는 MUC1-특이적 체액성 반응을 유도하는 것으로 나타났다.
표 10. 차이니즈 시노몰구스 마카크에서의 면역원성 MUC1 및/또는 TERT 폴리펩티드를 코딩하는 이중-항원 아데노바이러스 AdC68W 및 단일-항원 DNA 구축물 (플라스미드 1270, 1271, 및 1269)에 의해 유도된 T 및 B 세포 반응
실시예 6. 삼중-항원 구축물의 면역원성
실시예 6은 모든 3가지 코딩된 암 항원에 대한 Ag-특이적 T 및 B 세포 반응을 도출하는 인간 천연 전장 막-결합된 MUC1 항원 (MUC1), 인간 세포액성 MSLN 항원 (cMSLN), 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원 (TERTΔ240 또는 TERTΔ541)을 발현하는 삼중-항원 아데노바이러스 및 핵산 백신 구축물의 능력을 예시한다.
전기천공을 사용한 C57BL/6J 마우스에서의 면역 반응 연구
연구 디자인. 48마리의 암컷 C57BL/6J 마우스를 인간 MUC1, cMSLN, 및 TERTΔ240을 코딩하는 삼중-항원 DNA 구축물로 면역화하였다. 삼중-항원 DNA 백신 (100 ug)을 프라임/부스트 처방에서 수반 전기천공으로 근육내로 양측으로 (각각의 앞정강근 내로 총 20 ul), 각각의 백신접종 사이에 2주 떨어져서 전달하였다. MUC1-, MSLN-, 및 TERT-특이적 세포성 반응, 및 MUC1- 및 MSLN-특이적 체액성 반응을 마지막 면역화 후 7일에 각각 IFN-γ 엘리스폿 검정 및 ELISA 검정에서 측정하였다. 전체적으로, 2A 펩티드에 의해 연결된 모든 3가지 항원을 코딩하는 6가지 상이한 삼중-항원 DNA 구축물을 하기와 같이 사용하였다: MUC1-2A-cMSLN-2A-TERTΔ240 (플라스미드 1317), MUC1-2A-TERTΔ240-2A-cMSLN (플라스미드 1318), cMSLN-2A-MUC1-2A-TERTΔ240 (플라스미드 1319), cMSLN-2A-TERTΔ240-2A-MUC1 (플라스미드 1320), TERTΔ240-2A-cMSLN-2A-MUC1 (플라스미드 1321), TERTΔ240-2A-MUC1-2A-cMSLN (플라스미드 1322) (또한 구축물 표 참조).
결과. 표 11은 MUC1, MSLN, 및 TERT 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 15-17 참조)과 배양된 C57BL/6J 비장세포로부터의 엘리스폿 데이터, 및 C57BL/6J 마우스 혈청으로부터의 ELISA 데이터를 나타낸다. 양성 반응은 SFC >100 및 IgG 역가 >99를 갖는 것으로서 정의된다. 칼럼 3, 5 및 7에서의 수는 각각 MUC1, MSLN 및 TERT 펩티드 풀로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 볼드체로의 수는 시험된 적어도 1개의 펩티드 풀이 너무 많아서 카운팅할 수 없었으며, 따라서 진정한 수치는 적어도 언급된 값임을 지시한다. 칼럼 4 및 6에서의 수는 각각 항-MUC1 및 MSLN IgG 역가 (광학 밀도 (O.D) =1, 검출 한계 (L.O.D) = 99.0)를 나타낸다. 표 11에 나타난 바와 같이, MUC1-, MSLN-, 및 TERT-발현 삼중-항원 구축물로 제조된 면역원성 MUC1, MSLN, 및 TERT 폴리펩티드는 모든 3가지 항원에 대한 T 세포 반응, 및 MUC1에 대한 B 세포 반응을 유도할 수 있으며; 대조적으로, 단지 삼중-항원 구축물 플라스미드 1317, 1318, 및 1322만이 MSLN에 대한 B 세포 반응을 유도할 수 있다.
표 11. C57BL/6J 마우스에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 DNA 구축물 (1317-1322)에 의해 유도된 T 및 B 세포 반응
아데노바이러스 벡터를 사용한 C57BL/6J 마우스에서의 면역 반응 연구
연구 디자인. 36마리의 암컷 C57BL/6J 마우스를 인간 MUC1, cMSLN, 및 TERTΔ240 또는 TERTΔ541을 코딩하는 삼중-항원 아데노바이러스 벡터로 1e10 바이러스 입자로 근육내 주사 (50 ul)에 의해 프라이밍하였다. 28일 후에, 동물을 수반 전기천공으로 근육내로 양측으로 전달된 (각각의 앞정강근 내로 총 20 ul) 삼중-항원 DNA 구축물 (50 ug)로 부스팅하였다. MUC1-, MSLN-, 및 TERT-특이적 세포성 반응, 및 MUC1- 및 MSLN-특이적 체액성 반응을 마지막 면역화 후 7일에 각각 IFN-γ 엘리스폿 및 ICS 검정, 및 ELISA 검정에서 측정하였다. 전체적으로, 2A 펩티드에 의해 연결된 MUC1, cMSLN, 및 TERTΔ240을 코딩하는 3가지 삼중-항원 아데노바이러스 및 DNA 구축물, 및 2A 펩티드에 의해 연결된 MUC1, cMSLN, 및 TERTΔ541을 코딩하는 3가지 삼중-항원 아데노바이러스 및 DNA 구축물을 하기와 같이 사용하였다: MUC1-2A-cMSLN-2A-TERTΔ240 (플라스미드 1317), cMSLN-2A-MUC1-2A-TERTΔ240 (플라스미드 1319), cMSLN-2A-TERTΔ240-2A-MUC1 (플라스미드 1320), 및 MUC1-2A-cMSLN-2A-TERTΔ541 (플라스미드 1351), cMSLN-2A-MUC1-2A-TERTΔ541 (플라스미드 1352), cMSLN-2A-TERTΔ541-2A-MUC1 (플라스미드 1353) (또한 구축물 표 참조).
결과. 표 12는 MUC1, MSLN, 및 TERT 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 15-17 참조)과 배양된 C57BL/6J 비장세포로부터의 엘리스폿 데이터, TERT 펩티드 aa1025-1039와 배양된 C57BL/6J 비장세포로부터의 ICS 데이터, 및 C57BL/6J 마우스 혈청으로부터의 ELISA 데이터를 나타낸다. 양성 반응은 SFC >100, IFN-γ+ CD8+ T 세포의 빈도 >0.1%, 및 IgG 역가 >99를 갖는 것으로서 정의된다. 칼럼 3, 5, 및 7에서의 수는 각각 MUC1, MSLN 및 TERT 펩티드 풀로의 재자극, 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 볼드체로의 수는 시험된 적어도 1개의 펩티드 풀이 너무 많아서 카운팅할 수 없었으며, 따라서 진정한 수치는 적어도 언급된 값임을 지시한다. 칼럼 8에서의 수는 TERT-특이적 펩티드 TERT aa1025-1039로의 재자극, 및 배경 차감 후의 # IFN-γ+ CD8+ T 세포/106 CD8+ T 세포를 나타낸다. 칼럼 4 및 6에서의 수는 각각 항-MUC1 및 항-MSLN IgG 역가 (광학 밀도 (O.D) =1, 검출 한계 (L.O.D) = 99.0)를 나타낸다. 표 12에 나타난 바와 같이, MUC1-, MSLN-, 및 TERT-발현 삼중-항원 구축물로 제조된 면역원성 MUC1, MSLN, 및 TERT 폴리펩티드는 모든 3가지 항원에 대한 T 세포 반응, 및 MUC1에 대한 B 세포 반응을 유도할 수 있으며; 대조적으로, 단지 삼중-항원 구축물 1317 및 1351만이 MSLN에 대한 B 세포 반응을 유도할 수 있다.
표 12A. C57BL/6J 마우스에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1317, 1319, 및 1320)에 의해, 및 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ541) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1351-1353)에 의해 유도된 MUC1-특이적 T 및 B 세포 반응
표 12B. C57BL/6J 마우스에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1317, 1319, 및 1320)에 의해, 및 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ541) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1351-1353)에 의해 유도된 MSLN-특이적 T 및 B 세포 반응
표 12C. C57BL/6J 마우스에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1317, 1319, 및 1320)에 의해, 및 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ541) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1351-1353)에 의해 유도된 TERT-특이적 T 세포 반응
HLA-A24 마우스에서의 면역 반응 연구
연구 디자인. 8마리의 혼합 성별 HLA-A24 마우스를 인간 MUC1, cMSLN, 및 TERTΔ240을 코딩하는 아데노바이러스 AdC68Y 삼중-항원 구축물 (플라스미드 1317; MUC1-2A-cMSLN-2A-TERTΔ240)로 1e10 바이러스 입자로 근육내 주사 (각각의 앞정강근 내로 50 ul)에 의해 프라이밍하였다. 14일 후에, 동물을 동일한 3개의 항원 (수반 전기천공으로 각각의 앞정강근 내로 전달된 20 ul)을 코딩하는 50 ug 삼중-항원 DNA 구축물 (플라스미드 1317)로 근육내로 부스팅하였다. HLA-A24-제한된 MUC1-특이적 세포성 반응을 마지막 면역화 후 7일에 IFN-γ 엘리스폿 검정에서 측정하였다.
결과. 표 13은 MUC1 펩티드 aa524-532와 배양된 HLA-A24 비장세포로부터의 엘리스폿 데이터를 나타낸다. 양성 반응은 SFC >50을 갖는 것으로서 정의된다. 칼럼 3에서의 수는 MUC1 펩티드 aa524-532로의 재자극 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 표 13에 나타난 바와 같이, MUC1-, MSLN-, 및 TERT-발현 삼중-항원 구축물 1317로 제조된 면역원성 MUC1 폴리펩티드는 HLA-A24-제한된 MUC1 펩티드 aa524-532-특이적 CD8+ T 세포 반응을 유도할 수 있다. 중요하게는, 이 특이적 MUC1 펩티드에 대해 암 환자로부터 유도된 T 세포 반응은 시험관내에서의 항-종양 효능과 상관되는 것으로 나타났으며 (Jochems C et al., Cancer Immunol Immunother (2014) 63:161-174), 이는 이 특이적 에피토프에 대한 세포성 반응을 일으키는 것의 중요성을 입증한다.
표 13. HLA-A24 마우스에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포액성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 및 DNA 구축물 플라스미드 1317 (MUC1-2A-cMSLN-2A-TERTΔ240)에 의해 유도된 HLA-A24-제한된 MUC1 펩티드 aa524-532-특이적 T 세포 반응
원숭이에서의 면역 반응 연구
연구 디자인. 24마리의 차이니즈 시노몰구스 마카크를 인간 천연 전장 막-결합된 MUC1 (MUC1), 인간 세포질성 MSLN (cMSLN), 및 인간 말단절단된 (Δ240) 세포액성 TERT (TERTΔ240) 항원을 코딩하는 AdC68Y 아데노바이러스 벡터로 2e11 바이러스 입자로 양측 근육내 주사 (총 1 mL)에 의해 프라이밍하였다. 28일 및 56일 후에, 동물을 전기천공을 통해 근육내로 양측으로 전달된 (총 2 mL) 동일한 3개의 항원을 코딩하는 DNA로 부스팅하였다. 항-CTLA-4를 제1일 (32 mg), 제29일 (50 mg) 및 제57일 (75 mg)에 피하로 투여하였다. 마지막 면역화 후 21일에, 동물을 채혈하고, PBMC 및 혈청을 단리하여 각각 MUC1-, MSLN-, 및 TERT-특이적 세포성 (엘리스폿, ICS) 및 MUC1- 및 MSLN-특이적 체액성 (ELISA) 반응을 평가하였다. 전체적으로, 2A 펩티드에 의해 연결된 MUC1, cMSLN, 및 TERTΔ240을 코딩하는 3가지 삼중-항원 아데노바이러스 및 DNA 구축물을 평가하였다: MUC1-2A-cMSLN-2A-TERTΔ240 (플라스미드 1317), cMSLN-2A-MUC1-2A-TERTΔ240 (플라스미드 1319), 및 cMSLN-2A-TERTΔ240-2A-MUC1 (플라스미드 1320).
결과. 표 14A, 14B, 및 14C는 MUC1, MSLN, 및 TERT 펩티드 라이브러리로부터 유래된 펩티드 풀 (또한 펩티드 풀 표 (표 18) 및 표 15-17 참조)과 배양된 차이니즈 시노몰구스 마카크의 PBMC로부터의 엘리스폿 및 ICS 데이터, 및 차이니즈 시노몰구스 마카크의 혈청으로부터의 ELISA 데이터를 나타낸다. 양성 반응은 SFC >50, IFN-γ+ CD8+ T 세포 / 1e6 CD8+ T 세포 >50, 및 IgG 역가 >99를 갖는 것으로서 정의된다. 칼럼 3, 6, 및 9에서의 수는 각각 MUC1, MSLN, 및 TERT 펩티드 풀로의 재자극, 및 배경 차감 후의 # IFN-γ 스폿/106 비장세포를 나타낸다. 볼드체로의 수는 시험된 적어도 1개의 펩티드 풀이 너무 많아서 카운팅할 수 없었으며, 따라서 진정한 수치는 적어도 언급된 값임을 지시한다. 칼럼 4, 7, 및 10에서의 수는 각각 MUC1, MSLN, 및 TERT 펩티드 풀로의 재자극, 및 배경 차감 후의 # IFN-γ+ CD8+ T 세포/106 CD8+ T 세포를 나타낸다. 칼럼 5 및 8에서의 수는 각각 항-MUC1 및 항-MSLN IgG 역가 (광학 밀도 (O.D) =1, 검출 한계 (L.O.D) = 99.0)를 나타낸다. 표 14에 나타난 바와 같이, MUC1-, MSLN-, 및 TERT-발현 삼중-Ag 구축물로 제조된 면역원성 MUC1, MSLN, 및 TERT 폴리펩티드는 모든 3가지 항원에 대한 세포성 반응, 및 MUC1에 대한 체액성 반응을 유도할 수 있다. 그러나, 단지 삼중-항원 구축물 1317만이 유의한 MSLN-특이적 B 세포 반응을 유도할 수 있다.
표 14A. 차이니즈 시노몰구스 마카크에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포질성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1317, 1319, 및 1320)에 의해 유도된 MUC1-특이적 T 및 B 세포 반응
표 14B. 차이니즈 시노몰구스 마카크에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포질성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1317, 1319, 및 1320)에 의해 유도된 MSLN-특이적 T 및 B 세포 반응
표 14C. 차이니즈 시노몰구스 마카크에서의 인간 천연 전장 막-결합된 MUC1, 인간 세포질성 MSLN, 및 인간 말단절단된 (Δ240) 세포액성 TERT 항원을 코딩하는 삼중-항원 아데노바이러스 AdC68Y 및 DNA 구축물 (플라스미드 1317, 1319, 및 1320)에 의해 유도된 TERT-특이적 T 세포 반응
표 15. 인간 MUC1 펩티드 라이브러리 펩티드 풀 및 상응하는 아미노산 서열
표 16. 인간 MSLN 펩티드 라이브러리 펩티드 풀 및 상응하는 아미노산 서열
표 17. 인간 TERT 펩티드 라이브러리 펩티드 풀 및 상응하는 아미노산 서열
표 18. 펩티드 풀
표 19. 2A 펩티드
실시예 7. 백신과 면역 조정제의 조합
하기 실시예는 항암 백신이 항-세포독성 T-림프구 항원 (CTLA4) 항체 및/또는 인돌아민 2,3-디옥시게나제 1 (IDO1) 억제제와 조합으로 투여된 경우 증강된 종양 성장 억제 효과를 예시하기 위해 제공된다.
연구 절차.
BALB-neuT 마우스를 연구 제0일에 TUBO 종양 세포로 피하 주사에 의해 삽입하였다. 마우스를 200 mg/Kg의 3-(5-플루오로-1H-인돌-3-일)피롤리딘-2,5-디온 (IDO1 억제제) 또는 비히클로 연구 제7일로부터 매일 2회 경구 섭식에 의해 투약하였다. 비교자 군을 연구 제7일로부터 줄곧 비히클로 샴 투약하였다. 적절한 마우스를 연구 제10일에 래트 HER2 (rHER2)를 발현하도록 조작된 아데노바이러스 벡터 (rHER2 백신) 또는 rHER2 트랜스진이 결여된 벡터 (대조군 백신)의 1e10 바이러스 입자로 근육내 주사에 의해 면역화하였다. 이어서, 250 ug의 항-CTLA4 항체 (CTLA-4에 대한 뮤린 모노클로날 항체, 클론 9D9) 또는 IgG2 이소형 대조군 모노클로날 항체를 아데노바이러스 벡터 주사의 부위를 배수하는 림프절과 아주 근접하게 피하로 주사하였다. 그로부터 2주마다, 마우스를 rHER2를 코딩하는 DNA 플라스미드 (rHER2 백신) 또는 rHER2 트랜스진이 결여된 DNA 플라스미드 (대조군 백신) 100 ug으로 전기천공에 의해 면역화하였다. DNA 플라스미드 투여에 이어, 250 ug의 항-CTLA4 항체를 DNA 플라스미드 주사의 부위를 배수하는 림프절과 아주 근접하게 피하로 주사하였다. 종양 진행을 추적하기 위해, 피하 종양 부피를 연구 전반에 걸쳐 주 2회 측정하였다. 2000 mm3에 도달한 피하 종양 부피를 갖거나 질환의 비가역적 징후를 나타내는 동물을 안락사시켰다.
결과.
각각의 처리군에서의 개별적 동물의 피하 종양 부피는 표 20-A-20-H에 제시된다.
항-CTLA4 항체 단독으로 또는 IDO1 억제제 단독으로 처리된 마우스에서 종양 성장 속도에 대한 효과는 관찰되지 않았다. 그러나, rHER2 백신 단독으로 처리된 동물 중 일부에서 보다 느린 성장 속도가 관찰되었다. 항-CTLA4 항체와 조합으로 rHER2 백신으로 처리된 마우스 및 IDO1 억제제와 조합으로 rHER2 백신으로 처리된 마우스는 상응하는 대조군 동물에 비해 감소된 종양 성장 속도를 가졌다. 종양 성장 억제는 rHER2 백신, 항-CTLA4 항체, 및 IDO1 억제제로 처리된 마우스에서 가장 현저하였다.
표 20-A. rHER2 백신, 이소형 대조군 항체, 및 비히클로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
표 20-B. rHER2 백신, 항-CTLA4 항체, 및 비히클로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
표 20-C. rHER2 백신, 이소형 모노클로날 항체, 및 IDO1 억제제로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
표 20-D. rHER2 백신, 항-CTLA4 항체, 및 IDO1 억제제로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
표 20-E. 대조군 백신, 이소형 모노클로날 항체, 및 비히클로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
표 20-F. 대조군 백신, 항-CTLA4 항체, 및 비히클로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
표 20-G. 대조군 백신, 이소형 대조군 모노클로날 항체, 및 IDO1 억제제로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
표 20-H. 대조군 백신, 항-CTLA4 항체, 및 IDO1 억제제로 처리된 BALB-neuT 마우스로부터의 피하 종양 부피
실시예에 기재된 플라스미드의 구축에 사용된 프라이머
SEQUENCE LISTING
<110> Pfizer Inc
<120> Cancer Vaccines
<130> PC71855A
<150> US 62/280,636
<151> 2016-01-19
<150> US 62/419,190
<151> 2016-11-08
<160> 621
<170> PatentIn version 3.5
<210> 1
<211> 1255
<212> PRT
<213> Homo sapiens
<400> 1
Met Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr
1 5 10 15
Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly
20 25 30
Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser
35 40 45
Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His
50 55 60
Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu
65 70 75 80
Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln
85 90 95
Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr
100 105 110
Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro
115 120 125
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
130 135 140
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
145 150 155 160
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
165 170 175
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
180 185 190
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
195 200 205
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
210 215 220
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
225 230 235 240
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
245 250 255
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
260 265 270
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
275 280 285
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
290 295 300
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
305 310 315 320
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
325 330 335
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
340 345 350
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
355 360 365
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
370 375 380
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
385 390 395 400
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
405 410 415
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
420 425 430
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
435 440 445
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
450 455 460
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
465 470 475 480
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
485 490 495
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
500 505 510
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
515 520 525
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
530 535 540
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
545 550 555 560
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
565 570 575
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
580 585 590
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
595 600 605
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
610 615 620
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
625 630 635 640
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
645 650 655
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
660 665 670
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
675 680 685
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
690 695 700
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
705 710 715 720
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
725 730 735
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
740 745 750
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
755 760 765
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
770 775 780
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
785 790 795 800
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
805 810 815
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
820 825 830
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
835 840 845
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
850 855 860
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
865 870 875 880
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
885 890 895
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
900 905 910
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro
915 920 925
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Asn
930 935 940
Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser
945 950 955 960
Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly
965 970 975
Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe
980 985 990
Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His
995 1000 1005
Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro
1010 1015 1020
Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
1025 1030 1035
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln
1040 1045 1050
Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu
1055 1060 1065
Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln
1070 1075 1080
Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser
1085 1090 1095
Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn
1100 1105 1110
Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala
1115 1120 1125
Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser Asp
1130 1135 1140
Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly
1145 1150 1155
Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu
1160 1165 1170
Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg
1175 1180 1185
Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr
1190 1195 1200
His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr
1205 1210 1215
Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser
1220 1225 1230
Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val
1235 1240 1245
Ala Ala Thr Ser Ala Asn Leu
1250 1255
<210> 2
<211> 622
<212> PRT
<213> Homo sapiens
<400> 2
Met Ala Leu Pro Thr Ala Arg Pro Leu Leu Gly Ser Cys Gly Thr Pro
1 5 10 15
Ala Leu Gly Ser Leu Leu Phe Leu Leu Phe Ser Leu Gly Trp Val Gln
20 25 30
Pro Ser Arg Thr Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu
35 40 45
Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg
50 55 60
Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu
65 70 75 80
Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu
85 90 95
Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro
100 105 110
Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro
115 120 125
Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile
130 135 140
Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln
145 150 155 160
Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu
165 170 175
Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu
180 185 190
Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu
195 200 205
Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg
210 215 220
Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp
225 230 235 240
Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly
245 250 255
Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg
260 265 270
Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile
275 280 285
Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser
290 295 300
Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys
305 310 315 320
Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met
325 330 335
Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu
340 345 350
Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val
355 360 365
Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile
370 375 380
Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu
385 390 395 400
Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp
405 410 415
Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr
420 425 430
Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu
435 440 445
Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp
450 455 460
Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala
465 470 475 480
Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile
485 490 495
Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser
500 505 510
Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr
515 520 525
Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly
530 535 540
Pro His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg
545 550 555 560
Asp Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu
565 570 575
Gly Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser
580 585 590
Met Gln Glu Ala Leu Ser Gly Thr Pro Cys Leu Leu Gly Pro Gly Pro
595 600 605
Val Leu Thr Val Leu Ala Leu Leu Leu Ala Ser Thr Leu Ala
610 615 620
<210> 3
<211> 1132
<212> PRT
<213> Homo sapiens
<400> 3
Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser
1 5 10 15
His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly
20 25 30
Pro Gln Gly Trp Arg Leu Val Gln Arg Gly Asp Pro Ala Ala Phe Arg
35 40 45
Ala Leu Val Ala Gln Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro
50 55 60
Pro Pro Ala Ala Pro Ser Phe Arg Gln Val Ser Cys Leu Lys Glu Leu
65 70 75 80
Val Ala Arg Val Leu Gln Arg Leu Cys Glu Arg Gly Ala Lys Asn Val
85 90 95
Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro
100 105 110
Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr
115 120 125
Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val
130 135 140
Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val
145 150 155 160
Leu Val Ala Pro Ser Cys Ala Tyr Gln Val Cys Gly Pro Pro Leu Tyr
165 170 175
Gln Leu Gly Ala Ala Thr Gln Ala Arg Pro Pro Pro His Ala Ser Gly
180 185 190
Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg
195 200 205
Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg
210 215 220
Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg
225 230 235 240
Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp
245 250 255
Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val
260 265 270
Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala
275 280 285
Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His His
290 295 300
Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro
305 310 315 320
Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly
325 330 335
Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro
340 345 350
Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser
355 360 365
Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln
370 375 380
Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His
385 390 395 400
Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg
405 410 415
Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gln
420 425 430
Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu
435 440 445
Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe
450 455 460
Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser
465 470 475 480
Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser
485 490 495
Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met
500 505 510
Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys
515 520 525
Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe
530 535 540
Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe
545 550 555 560
Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr
565 570 575
Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His
580 585 590
Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln
595 600 605
His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile
610 615 620
Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val
625 630 635 640
Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser
645 650 655
Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg
660 665 670
Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg
675 680 685
Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro
690 695 700
Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr Ile
705 710 715 720
Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln
725 730 735
Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His
740 745 750
Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp
755 760 765
Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser
770 775 780
Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu
785 790 795 800
Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His
805 810 815
Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro
820 825 830
Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp
835 840 845
Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu
850 855 860
Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala
865 870 875 880
Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys
885 890 895
Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu
900 905 910
Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe
915 920 925
Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser
930 935 940
Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe
945 950 955 960
Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly
965 970 975
Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn
980 985 990
Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln
995 1000 1005
Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln
1010 1015 1020
Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp
1025 1030 1035
Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
1040 1045 1050
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu
1055 1060 1065
Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr
1070 1075 1080
Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr
1085 1090 1095
Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr
1100 1105 1110
Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys
1115 1120 1125
Thr Ile Leu Asp
1130
<210> 4
<211> 30252
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 4
ccatcttcaa taatatacct caaacttttt gtgcgcgtta atatgcaaat gaggcgtttg 60
aatttgggga ggaagggcgg tgattggtcg agggatgagc gaccgttagg ggcggggcga 120
gtgacgtttt gatgacgtgg ttgcgaggag gagccagttt gcaagttctc gtgggaaaag 180
tgacgtcaaa cgaggtgtgg tttgaacacg gaaatactca attttcccgc gctctctgac 240
aggaaatgag gtgtttctgg gcggatgcaa gtgaaaacgg gccattttcg cgcgaaaact 300
gaatgaggaa gtgaaaatct gagtaatttc gcgtttatgg cagggaggag tatttgccga 360
gggccgagta gactttgacc gattacgtgg gggtttcgat taccgtgttt ttcacctaaa 420
tttccgcgta cggtgtcaaa gtccggtgtt tttactactg taatagtaat caattacggg 480
gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 540
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 600
agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 660
ccacttggca gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga 720
cggtaaatgg cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg 780
gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat 840
caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 900
caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc 960
cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc 1020
tgtccctatc agtgatagag atctccctat cagtgataga gagtttagtg aaccgtcaga 1080
tccgctaggg taccgcgatc gcacctcgag ctgatcataa tcagccatac cacatttgta 1140
gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa acataaaatg 1200
aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggttacaa ataaagcaat 1260
agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg tggtttgtcc 1320
aaactcatca atgtatctta ccaggtgccg agcctgcgag tgcggaggga agcatgccag 1380
gttccagccc gtgtgtgtgg atgtgacgga ggacctgcga cccgatcatt tggtgttgcc 1440
ctgcaccggg acggagttcg gttccagcgg ggaagaatct gactagagtg agtagtgttc 1500
tggggcgggg gaggacctgc atgagggcca gaataactga aatctgtgct tttctgtgtg 1560
ttgcagcagc atgagcggaa gcggctcctt tgagggaggg gtattcagcc cttatctgac 1620
ggggcgtctc ccctcctggg cgggagtgcg tcagaatgtg atgggatcca cggtggacgg 1680
ccggcccgtg cagcccgcga actcttcaac cctgacctat gcaaccctga gctcttcgtc 1740
gttggacgca gctgccgccg cagctgctgc atctgccgcc agcgccgtgc gcggaatggc 1800
catgggcgcc ggctactacg gcactctggt ggccaactcg agttccacca ataatcccgc 1860
cagcctgaac gaggagaagc tgttgctgct gatggcccag ctcgaggcct tgacccagcg 1920
cctgggcgag ctgacccagc aggtggctca gctgcaggag cagacgcggg ccgcggttgc 1980
cacggtgaaa tccaaataaa aaatgaatca ataaataaac ggagacggtt gttgatttta 2040
acacagagtc tgaatcttta tttgattttt cgcgcgcggt aggccctgga ccaccggtct 2100
cgatcattga gcacccggtg gatcttttcc aggacccggt agaggtgggc ttggatgttg 2160
aggtacatgg gcatgagccc gtcccggggg tggaggtagc tccattgcag ggcctcgtgc 2220
tcgggggtgg tgttgtaaat cacccagtca tagcaggggc gcagggcatg gtgttgcaca 2280
atatctttga ggaggagact gatggccacg ggcagccctt tggtgtaggt gtttacaaat 2340
ctgttgagct gggagggatg catgcggggg gagatgaggt gcatcttggc ctggatcttg 2400
agattggcga tgttaccgcc cagatcccgc ctggggttca tgttgtgcag gaccaccagc 2460
acggtgtatc cggtgcactt ggggaattta tcatgcaact tggaagggaa ggcgtgaaag 2520
aatttggcga cgcctttgtg cccgcccagg ttttccatgc actcatccat gatgatggcg 2580
atgggcccgt gggcggcggc ctgggcaaag acgtttcggg ggtcggacac atcatagttg 2640
tggtcctggg tgaggtcatc ataggccatt ttaatgaatt tggggcggag ggtgccggac 2700
tgggggacaa aggtaccctc gatcccgggg gcgtagttcc cctcacagat ctgcatctcc 2760
caggctttga gctcggaggg ggggatcatg tccacctgcg gggcgataaa gaacacggtt 2820
tccggggcgg gggagatgag ctgggccgaa agcaagttcc ggagcagctg ggacttgccg 2880
cagccggtgg ggccgtagat gaccccgatg accggctgca ggtggtagtt gagggagaga 2940
cagctgccgt cctcccggag gaggggggcc acctcgttca tcatctcgcg cacgtgcatg 3000
ttctcgcgca ccagttccgc caggaggcgc tctcccccca gggataggag ctcctggagc 3060
gaggcgaagt ttttcagcgg cttgagtccg tcggccatgg gcattttgga gagggtttgt 3120
tgcaagagtt ccaggcggtc ccagagctcg gtgatgtgct ctacggcatc tcgatccagc 3180
agacctcctc gtttcgcggg ttgggacggc tgcgggagta gggcaccaga cgatgggcgt 3240
ccagcgcagc cagggtccgg tccttccagg gtcgcagcgt ccgcgtcagg gtggtctccg 3300
tcacggtgaa ggggtgcgcg ccgggctggg cgcttgcgag ggtgcgcttc aggctcatcc 3360
ggctggtcga aaaccgctcc cgatcggcgc cctgcgcgtc ggccaggtag caattgacca 3420
tgagttcgta gttgagcgcc tcggccgcgt ggcctttggc gcggagctta cctttggaag 3480
tctgcccgca ggcgggacag aggagggact tgagggcgta gagcttgggg gcgaggaaga 3540
cggactcggg ggcgtaggcg tccgcgccgc agtgggcgca gacggtctcg cactccacga 3600
gccaggtgag gtcgggctgg tcggggtcaa aaaccagttt cccgccgttc tttttgatgc 3660
gtttcttacc tttggtctcc atgagctcgt gtccccgctg ggtgacaaag aggctgtccg 3720
tgtccccgta gaccgacttt atgggccggt cctcgagcgg tgtgccgcgg tcctcctcgt 3780
agaggaaccc cgcccactcc gagacgaaag cccgggtcca ggccagcacg aaggaggcca 3840
cgtgggacgg gtagcggtcg ttgtccacca gcgggtccac cttttccagg gtatgcaaac 3900
acatgtcccc ctcgtccaca tccaggaagg tgattggctt gtaagtgtag gccacgtgac 3960
cgggggtccc ggccgggggg gtataaaagg gtgcgggtcc ctgctcgtcc tcactgtctt 4020
ccggatcgct gtccaggagc gccagctgtt ggggtaggta ttccctctcg aaggcgggca 4080
tgacctcggc actcaggttg tcagtttcta gaaacgagga ggatttgata ttgacggtgc 4140
cggcggagat gcctttcaag agcccctcgt ccatctggtc agaaaagacg atctttttgt 4200
tgtcgagctt ggtggcgaag gagccgtaga gggcgttgga gaggagcttg gcgatggagc 4260
gcatggtctg gtttttttcc ttgtcggcgc gctccttggc ggcgatgttg agctgcacgt 4320
actcgcgcgc cacgcacttc cattcgggga agacggtggt cagctcgtcg ggcacgattc 4380
tgacctgcca gccccgatta tgcagggtga tgaggtccac actggtggcc acctcgccgc 4440
gcaggggctc attagtccag cagaggcgtc cgcccttgcg cgagcagaag gggggcaggg 4500
ggtccagcat gacctcgtcg ggggggtcgg catcgatggt gaagatgccg ggcaggaggt 4560
cggggtcaaa gtagctgatg gaagtggcca gatcgtccag ggcagcttgc cattcgcgca 4620
cggccagcgc gcgctcgtag ggactgaggg gcgtgcccca gggcatggga tgggtaagcg 4680
cggaggcgta catgccgcag atgtcgtaga cgtagagggg ctcctcgagg atgccgatgt 4740
aggtggggta gcagcgcccc ccgcggatgc tggcgcgcac gtagtcatac agctcgtgcg 4800
agggggcgag gagccccggg cccaggttgg tgcgactggg cttttcggcg cggtagacga 4860
tctggcggaa aatggcatgc gagttggagg agatggtggg cctttggaag atgttgaagt 4920
gggcgtgggg cagtccgacc gagtcgcgga tgaagtgggc gtaggagtct tgcagcttgg 4980
cgacgagctc ggcggtgact aggacgtcca gagcgcagta gtcgagggtc tcctggatga 5040
tgtcatactt gagctgtccc ttttgtttcc acagctcgcg gttgagaagg aactcttcgc 5100
ggtccttcca gtactcttcg agggggaacc cgtcctgatc tgcacggtaa gagcctagca 5160
tgtagaactg gttgacggcc ttgtaggcgc agcagccctt ctccacgggg agggcgtagg 5220
cctgggcggc cttgcgcagg gaggtgtgcg tgagggcgaa agtgtccctg accatgacct 5280
tgaggaactg gtgcttgaag tcgatatcgt cgcagccccc ctgctcccag agctggaagt 5340
ccgtgcgctt cttgtaggcg gggttgggca aagcgaaagt aacatcgttg aagaggatct 5400
tgcccgcgcg gggcataaag ttgcgagtga tgcggaaagg ttggggcacc tcggcccggt 5460
tgttgatgac ctgggcggcg agcacgatct cgtcgaagcc gttgatgttg tggcccacga 5520
tgtagagttc cacgaatcgc ggacggccct tgacgtgggg cagtttcttg agctcctcgt 5580
aggtgagctc gtcggggtcg ctgagcccgt gctgctcgag cgcccagtcg gcgagatggg 5640
ggttggcgcg gaggaaggaa gtccagagat ccacggccag ggcggtttgc agacggtccc 5700
ggtactgacg gaactgctgc ccgacggcca ttttttcggg ggtgacgcag tagaaggtgc 5760
gggggtcccc gtgccagcga tcccatttga gctggagggc gagatcgagg gcgagctcga 5820
cgagccggtc gtccccggag agtttcatga ccagcatgaa ggggacgagc tgcttgccga 5880
aggaccccat ccaggtgtag gtttccacat cgtaggtgag gaagagcctt tcggtgcgag 5940
gatgcgagcc gatggggaag aactggatct cctgccacca attggaggaa tggctgttga 6000
tgtgatggaa gtagaaatgc cgacggcgcg ccgaacactc gtgcttgtgt ttatacaagc 6060
ggccacagtg ctcgcaacgc tgcacgggat gcacgtgctg cacgagctgt acctgagttc 6120
ctttgacgag gaatttcagt gggaagtgga gtcgtggcgc ctgcatctcg tgctgtacta 6180
cgtcgtggtg gtcggcctgg ccctcttctg cctcgatggt ggtcatgctg acgagcccgc 6240
gcgggaggca ggtccagacc tcggcgcgag cgggtcggag agcgaggacg agggcgcgca 6300
ggccggagct gtccagggtc ctgagacgct gcggagtcag gtcagtgggc agcggcggcg 6360
cgcggttgac ttgcaggagt ttttccaggg cgcgcgggag gtccagatgg tacttgatct 6420
ccaccgcgcc attggtggcg acgtcgatgg cttgcagggt cccgtgcccc tggggtgtga 6480
ccaccgtccc ccgtttcttc ttgggcggct ggggcgacgg gggcggtgcc tcttccatgg 6540
ttagaagcgg cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag 6600
gggcggcagg ggcacgtcgg cgccgcgcgc gggtaggttc tggtactgcg cccggagaag 6660
actggcgtga gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc 6720
cacgggaccc gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt 6780
gacggcggcc tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc 6840
ggtcatgaac tgctcgatct cctcctcttg aaggtctccg cggccggcgc gctccacggt 6900
ggccgcgagg tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgcccgcctc 6960
gttccagacg cggctgtaga ccacgacgcc ctcgggatcg cgggcgcgca tgaccacctg 7020
ggcgaggttg agctccacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag 7080
gtagttgagc gtggtggcga tgtgctcggt gacgaagaaa tacatgatcc agcggcggag 7140
cggcatctcg ctgacgtcgc ccagcgcctc caaacgttcc atggcctcgt aaaagtccac 7200
ggcgaagttg aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg 7260
gatgagctcg gcgatggtgg cgcgcacctc gcgctcgaag gcccccggga gttcctccac 7320
ttcctcttct tcctcctcca ctaacatctc ttctacttcc tcctcaggcg gcagtggtgg 7380
cgggggaggg ggcctgcgtc gccggcggcg cacgggcaga cggtcgatga agcgctcgat 7440
ggtctcgccg cgccggcgtc gcatggtctc ggtgacggcg cgcccgtcct cgcggggccg 7500
cagcgtgaag acgccgccgc gcatctccag gtggccgggg gggtccccgt tgggcaggga 7560
gagggcgctg acgatgcatc ttatcaattg ccccgtaggg actccgcgca aggacctgag 7620
cgtctcgaga tccacgggat ctgaaaaccg ctgaacgaag gcttcgagcc agtcgcagtc 7680
gcaaggtagg ctgagcacgg tttcttctgg cgggtcatgt tggttgggag cggggcgggc 7740
gatgctgctg gtgatgaagt tgaaataggc ggttctgaga cggcggatgg tggcgaggag 7800
caccaggtct ttgggcccgg cttgctggat gcgcagacgg tcggccatgc cccaggcgtg 7860
gtcctgacac ctggccaggt ccttgtagta gtcctgcatg agccgctcca cgggcacctc 7920
ctcctcgccc gcgcggccgt gcatgcgcgt gagcccgaag ccgcgctggg gctggacgag 7980
cgccaggtcg gcgacgacgc gctcggcgag gatggcttgc tggatctggg tgagggtggt 8040
ctggaagtca tcaaagtcga cgaagcggtg gtaggctccg gtgttgatgg tgtaggagca 8100
gttggccatg acggaccagt tgacggtctg gtggcccgga cgcacgagct cgtggtactt 8160
gaggcgcgag taggcgcgcg tgtcgaagat gtagtcgttg caggtgcgca ccaggtactg 8220
gtagccgatg aggaagtgcg gcggcggctg gcggtagagc ggccatcgct cggtggcggg 8280
ggcgccgggc gcgaggtcct cgagcatggt gcggtggtag ccgtagatgt acctggacat 8340
ccaggtgatg ccggcggcgg tggtggaggc gcgcgggaac tcgcggacgc ggttccagat 8400
gttgcgcagc ggcaggaagt agttcatggt gggcacggtc tggcccgtga ggcgcgcgca 8460
gtcgtggatg ctctatacgg gcaaaaacga aagcggtcag cggctcgact ccgtggcctg 8520
gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt tcgaatctcg aatcaggctg 8580
gagccgcagc taacgtggta ttggcactcc cgtctcgacc caagcctgca ccaaccctcc 8640
aggatacgga ggcgggtcgt tttgcaactt ttttttggag gccggatgag actagtaagc 8700
gcggaaagcg gccgaccgcg atggctcgct gccgtagtct ggagaagaat cgccagggtt 8760
gcgttgcggt gtgccccggt tcgaggccgg ccggattccg cggctaacga gggcgtggct 8820
gccccgtcgt ttccaagacc ccatagccag ccgacttctc cagttacgga gcgagcccct 8880
cttttgtttt gtttgttttt gccagatgca tcccgtactg cggcagatgc gcccccacca 8940
ccctccaccg caacaacagc cccctccaca gccggcgctt ctgcccccgc cccagcagca 9000
acttccagcc acgaccgccg cggccgccgt gagcggggct ggacagagtt atgatcacca 9060
gctggccttg gaagagggcg aggggctggc gcgcctgggg gcgtcgtcgc cggagcggca 9120
cccgcgcgtg cagatgaaaa gggacgctcg cgaggcctac gtgcccaagc agaacctgtt 9180
cagagacagg agcggcgagg agcccgagga gatgcgcgcg gcccggttcc acgcggggcg 9240
ggagctgcgg cgcggcctgg accgaaagag ggtgctgagg gacgaggatt tcgaggcgga 9300
cgagctgacg gggatcagcc ccgcgcgcgc gcacgtggcc gcggccaacc tggtcacggc 9360
gtacgagcag accgtgaagg aggagagcaa cttccaaaaa tccttcaaca accacgtgcg 9420
caccctgatc gcgcgcgagg aggtgaccct gggcctgatg cacctgtggg acctgctgga 9480
ggccatcgtg cagaacccca ccagcaagcc gctgacggcg cagctgttcc tggtggtgca 9540
gcatagtcgg gacaacgaag cgttcaggga ggcgctgctg aatatcaccg agcccgaggg 9600
ccgctggctc ctggacctgg tgaacattct gcagagcatc gtggtgcagg agcgcgggct 9660
gccgctgtcc gagaagctgg cggccatcaa cttctcggtg ctgagtttgg gcaagtacta 9720
cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga agatcgacgg 9780
gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg gggtgtaccg 9840
caacgacagg atgcaccgtg cggtgagcgc cagcaggcgg cgcgagctga gcgaccagga 9900
gctgatgcat agtctgcagc gggccctgac cggggccggg accgaggggg agagctactt 9960
tgacatgggc gcggacctgc actggcagcc cagccgccgg gccttggagg cggcggcagg 10020
accctacgta gaagaggtgg acgatgaggt ggacgaggag ggcgagtacc tggaagactg 10080
atggcgcgac cgtatttttg ctagatgcaa caacaacagc cacctcctga tcccgcgatg 10140
cgggcggcgc tgcagagcca gccgtccggc attaactcct cggacgattg gacccaggcc 10200
atgcaacgca tcatggcgct gacgacccgc aaccccgaag cctttagaca gcagccccag 10260
gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctccaa ccccacgcac 10320
gagaaggtcc tggccatcgt gaacgcgctg gtggagaaca aggccatccg cggcgacgag 10380
gccggcctgg tgtacaacgc gctgctggag cgcgtggccc gctacaacag caccaacgtg 10440
cagaccaacc tggaccgcat ggtgaccgac gtgcgcgagg ccgtggccca gcgcgagcgg 10500
ttccaccgcg agtccaacct gggatccatg gtggcgctga acgccttcct cagcacccag 10560
cccgccaacg tgccccgggg ccaggaggac tacaccaact tcatcagcgc cctgcgcctg 10620
atggtgaccg aggtgcccca gagcgaggtg taccagtccg ggccggacta cttcttccag 10680
accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa cttgcagggc 10740
ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg tgtcgagcct gctgacgccg 10800
aactcgcgcc tgctgctgct gctggtggcc cccttcacgg acagcggcag catcaaccgc 10860
aactcgtacc tgggctacct gattaacctg taccgcgagg ccatcggcca ggcgcacgtg 10920
gacgagcaga cctaccagga gatcacccac gtgagccgcg ccctgggcca ggacgacccg 10980
ggcaacctgg aagccaccct gaactttttg ctgaccaacc ggtcgcagaa gatcccgccc 11040
cagtacgcgc tcagcaccga ggaggagcgc atcctgcgtt acgtgcagca gagcgtgggc 11100
ctgttcctga tgcaggaggg ggccaccccc agcgccgcgc tcgacatgac cgcgcgcaac 11160
atggagccca gcatgtacgc cagcaaccgc ccgttcatca ataaactgat ggactacttg 11220
catcgggcgg ccgccatgaa ctctgactat ttcaccaacg ccatcctgaa tccccactgg 11280
ctcccgccgc cggggttcta cacgggcgag tacgacatgc ccgaccccaa tgacgggttc 11340
ctgtgggacg atgtggacag cagcgtgttc tccccccgac cgggtgctaa cgagcgcccc 11400
ttgtggaaga aggaaggcag cgaccgacgc ccgtcctcgg cgctgtccgg ccgcgagggt 11460
gctgccgcgg cggtgcccga ggccgccagt cctttcccga gcttgccctt ctcgctgaac 11520
agtatccgca gcagcgagct gggcaggatc acgcgcccgc gcttgctggg cgaagaggag 11580
tacttgaatg actcgctgtt gagacccgag cgggagaaga acttccccaa taacgggata 11640
gaaagcctgg tggacaagat gagccgctgg aagacgtatg cgcaggagca cagggacgat 11700
ccccgggcgt cgcagggggc cacgagccgg ggcagcgccg cccgtaaacg ccggtggcac 11760
gacaggcagc ggggacagat gtgggacgat gaggactccg ccgacgacag cagcgtgttg 11820
gacttgggtg ggagtggtaa cccgttcgct cacctgcgcc cccgtatcgg gcgcatgatg 11880
taagagaaac cgaaaataaa tgatactcac caaggccatg gcgaccagcg tgcgttcgtt 11940
tcttctctgt tgttgttgta tctagtatga tgaggcgtgc gtacccggag ggtcctcctc 12000
cctcgtacga gagcgtgatg cagcaggcga tggcggcggc ggcgatgcag cccccgctgg 12060
aggctcctta cgtgcccccg cggtacctgg cgcctacgga ggggcggaac agcattcgtt 12120
actcggagct ggcacccttg tacgatacca cccggttgta cctggtggac aacaagtcgg 12180
cggacatcgc ctcgctgaac taccagaacg accacagcaa cttcctgacc accgtggtgc 12240
agaacaatga cttcaccccc acggaggcca gcacccagac catcaacttt gacgagcgct 12300
cgcggtgggg cggccagctg aaaaccatca tgcacaccaa catgcccaac gtgaacgagt 12360
tcatgtacag caacaagttc aaggcgcggg tgatggtctc ccgcaagacc cccaatgggg 12420
tgacagtgac agaggattat gatggtagtc aggatgagct gaagtatgaa tgggtggaat 12480
ttgagctgcc cgaaggcaac ttctcggtga ccatgaccat cgacctgatg aacaacgcca 12540
tcatcgacaa ttacttggcg gtggggcggc agaacggggt gctggagagc gacatcggcg 12600
tgaagttcga cactaggaac ttcaggctgg gctgggaccc cgtgaccgag ctggtcatgc 12660
ccggggtgta caccaacgag gctttccatc ccgatattgt cttgctgccc ggctgcgggg 12720
tggacttcac cgagagccgc ctcagcaacc tgctgggcat tcgcaagagg cagcccttcc 12780
aggaaggctt ccagatcatg tacgaggatc tggagggggg caacatcccc gcgctcctgg 12840
atgtcgacgc ctatgagaaa agcaaggagg atgcagcagc tgaagcaact gcagccgtag 12900
ctaccgcctc taccgaggtc aggggcgata attttgcaag cgccgcagca gtggcagcgg 12960
ccgaggcggc tgaaaccgaa agtaagatag tcattcagcc ggtggagaag gatagcaaga 13020
acaggagcta caacgtacta ccggacaaga taaacaccgc ctaccgcagc tggtacctag 13080
cctacaacta tggcgacccc gagaagggcg tgcgctcctg gacgctgctc accacctcgg 13140
acgtcacctg cggcgtggag caagtctact ggtcgctgcc cgacatgatg caagacccgg 13200
tcaccttccg ctccacgcgt caagttagca actacccggt ggtgggcgcc gagctcctgc 13260
ccgtctactc caagagcttc ttcaacgagc aggccgtcta ctcgcagcag ctgcgcgcct 13320
tcacctcgct tacgcacgtc ttcaaccgct tccccgagaa ccagatcctc gtccgcccgc 13380
ccgcgcccac cattaccacc gtcagtgaaa acgttcctgc tctcacagat cacgggaccc 13440
tgccgctgcg cagcagtatc cggggagtcc agcgcgtgac cgttactgac gccagacgcc 13500
gcacctgccc ctacgtctac aaggccctgg gcatagtcgc gccgcgcgtc ctctcgagcc 13560
gcaccttcta aatgtccatt ctcatctcgc ccagtaataa caccggttgg ggcctgcgcg 13620
cgcccagcaa gatgtacgga ggcgctcgcc aacgctccac gcaacacccc gtgcgcgtgc 13680
gcgggcactt ccgcgctccc tggggcgccc tcaagggccg cgtgcggtcg cgcaccaccg 13740
tcgacgacgt gatcgaccag gtggtggccg acgcgcgcaa ctacaccccc gccgccgcgc 13800
ccgtctccac cgtggacgcc gtcatcgaca gcgtggtggc cgacgcgcgc cggtacgccc 13860
gcgccaagag ccggcggcgg cgcatcgccc ggcggcaccg gagcaccccc gccatgcgcg 13920
cggcgcgagc cttgctgcgc agggccaggc gcacgggacg cagggccatg ctcagggcgg 13980
ccagacgcgc ggcttcaggc gccagcgccg gcaggacccg gagacgcgcg gccacggcgg 14040
cggcagcggc catcgccagc atgtcccgcc cgcggcgagg gaacgtgtac tgggtgcgcg 14100
acgccgccac cggtgtgcgc gtgcccgtgc gcacccgccc ccctcgcact tgaagatgtt 14160
cacttcgcga tgttgatgtg tcccagcggc gaggaggatg tccaagcgca aattcaagga 14220
agagatgctc caggtcatcg cgcctgagat ctacggccct gcggtggtga aggaggaaag 14280
aaagccccgc aaaatcaagc gggtcaaaaa ggacaaaaag gaagaagaaa gtgatgtgga 14340
cggattggtg gagtttgtgc gcgagttcgc cccccggcgg cgcgtgcagt ggcgcgggcg 14400
gaaggtgcaa ccggtgctga gacccggcac caccgtggtc ttcacgcccg gcgagcgctc 14460
cggcaccgct tccaagcgct cctacgacga ggtgtacggg gatgatgata ttctggagca 14520
ggcggccgag cgcctgggcg agtttgctta cggcaagcgc agccgttccg caccgaagga 14580
agaggcggtg tccatcccgc tggaccacgg caaccccacg ccgagcctca agcccgtgac 14640
cttgcagcag gtgctgccga ccgcggcgcc gcgccggggg ttcaagcgcg agggcgagga 14700
tctgtacccc accatgcagc tgatggtgcc caagcgccag aagctggaag acgtgctgga 14760
gaccatgaag gtggacccgg acgtgcagcc cgaggtcaag gtgcggccca tcaagcaggt 14820
ggccccgggc ctgggcgtgc agaccgtgga catcaagatt cccacggagc ccatggaaac 14880
gcagaccgag cccatgatca agcccagcac cagcaccatg gaggtgcaga cggatccctg 14940
gatgccatcg gctcctagtc gaagaccccg gcgcaagtac ggcgcggcca gcctgctgat 15000
gcccaactac gcgctgcatc cttccatcat ccccacgccg ggctaccgcg gcacgcgctt 15060
ctaccgcggt cataccagca gccgccgccg caagaccacc actcgccgcc gccgtcgccg 15120
caccgccgct gcaaccaccc ctgccgccct ggtgcggaga gtgtaccgcc gcggccgcgc 15180
acctctgacc ctgccgcgcg cgcgctacca cccgagcatc gccatttaaa ctttcgcctg 15240
ctttgcagat caatggccct cacatgccgc cttcgcgttc ccattacggg ctaccgagga 15300
agaaaaccgc gccgtagaag gctggcgggg aacgggatgc gtcgccacca ccaccggcgg 15360
cggcgcgcca tcagcaagcg gttgggggga ggcttcctgc ccgcgctgat ccccatcatc 15420
gccgcggcga tcggggcgat ccccggcatt gcttccgtgg cggtgcaggc ctctcagcgc 15480
cactgagaca cacttggaaa catcttgtaa taaaccaatg gactctgacg ctcctggtcc 15540
tgtgatgtgt tttcgtagac agatggaaga catcaatttt tcgtccctgg ctccgcgaca 15600
cggcacgcgg ccgttcatgg gcacctggag cgacatcggc accagccaac tgaacggggg 15660
cgccttcaat tggagcagtc tctggagcgg gcttaagaat ttcgggtcca cgcttaaaac 15720
ctatggcagc aaggcgtgga acagcaccac agggcaggcg ctgagggata agctgaaaga 15780
gcagaacttc cagcagaagg tggtcgatgg gctcgcctcg ggcatcaacg gggtggtgga 15840
cctggccaac caggccgtgc agcggcagat caacagccgc ctggacccgg tgccgcccgc 15900
cggctccgtg gagatgccgc aggtggagga ggagctgcct cccctggaca agcggggcga 15960
gaagcgaccc cgccccgatg cggaggagac gctgctgacg cacacggacg agccgccccc 16020
gtacgaggag gcggtgaaac tgggtctgcc caccacgcgg cccatcgcgc ccctggccac 16080
cggggtgctg aaacccgaaa agcccgcgac cctggacttg cctcctcccc agccttcccg 16140
cccctctaca gtggctaagc ccctgccgcc ggtggccgtg gcccgcgcgc gacccggggg 16200
caccgcccgc cctcatgcga actggcagag cactctgaac agcatcgtgg gtctgggagt 16260
gcagagtgtg aagcgccgcc gctgctatta aacctaccgt agcgcttaac ttgcttgtct 16320
gtgtgtgtat gtattatgtc gccgccgccg ctgtccacca gaaggaggag tgaagaggcg 16380
cgtcgccgag ttgcaagatg gccaccccat cgatgctgcc ccagtgggcg tacatgcaca 16440
tcgccggaca ggacgcttcg gagtacctga gtccgggtct ggtgcagttt gcccgcgcca 16500
cagacaccta cttcagtctg gggaacaagt ttaggaaccc cacggtggcg cccacgcacg 16560
atgtgaccac cgaccgcagc cagcggctga cgctgcgctt cgtgcccgtg gaccgcgagg 16620
acaacaccta ctcgtacaaa gtgcgctaca cgctggccgt gggcgacaac cgcgtgctgg 16680
acatggccag cacctacttt gacatccgcg gcgtgctgga tcggggccct agcttcaaac 16740
cctactccgg caccgcctac aacagtctgg cccccaaggg agcacccaac acttgtcagt 16800
ggacatataa agccgatggt gaaactgcca cagaaaaaac ctatacatat ggaaatgcac 16860
ccgtgcaggg cattaacatc acaaaagatg gtattcaact tggaactgac accgatgatc 16920
agccaatcta cgcagataaa acctatcagc ctgaacctca agtgggtgat gctgaatggc 16980
atgacatcac tggtactgat gaaaagtatg gaggcagagc tcttaagcct gataccaaaa 17040
tgaagccttg ttatggttct tttgccaagc ctactaataa agaaggaggt caggcaaatg 17100
tgaaaacagg aacaggcact actaaagaat atgacataga catggctttc tttgacaaca 17160
gaagtgcggc tgctgctggc ctagctccag aaattgtttt gtatactgaa aatgtggatt 17220
tggaaactcc agatacccat attgtataca aagcaggcac agatgacagc agctcttcta 17280
ttaatttggg tcagcaagcc atgcccaaca gacctaacta cattggtttc agagacaact 17340
ttatcgggct catgtactac aacagcactg gcaatatggg ggtgctggcc ggtcaggctt 17400
ctcagctgaa tgctgtggtt gacttgcaag acagaaacac cgagctgtcc taccagctct 17460
tgcttgactc tctgggtgac agaacccggt atttcagtat gtggaatcag gcggtggaca 17520
gctatgatcc tgatgtgcgc attattgaaa atcatggtgt ggaggatgaa cttcccaact 17580
attgtttccc tctggatgct gttggcagaa cagatactta tcagggaatt aaggctaatg 17640
gaactgatca aaccacatgg accaaagatg acagtgtcaa tgatgctaat gagataggca 17700
agggtaatcc attcgccatg gaaatcaaca tccaagccaa cctgtggagg aacttcctct 17760
acgccaacgt ggccctgtac ctgcccgact cttacaagta cacgccggcc aatgttaccc 17820
tgcccaccaa caccaacacc tacgattaca tgaacggccg ggtggtggcg ccctcgctgg 17880
tggactccta catcaacatc ggggcgcgct ggtcgctgga tcccatggac aacgtgaacc 17940
ccttcaacca ccaccgcaat gcggggctgc gctaccgctc catgctcctg ggcaacgggc 18000
gctacgtgcc cttccacatc caggtgcccc agaaattttt cgccatcaag agcctcctgc 18060
tcctgcccgg gtcctacacc tacgagtgga acttccgcaa ggacgtcaac atgatcctgc 18120
agagctccct cggcaacgac ctgcgcacgg acggggcctc catctccttc accagcatca 18180
acctctacgc caccttcttc cccatggcgc acaacacggc ctccacgctc gaggccatgc 18240
tgcgcaacga caccaacgac cagtccttca acgactacct ctcggcggcc aacatgctct 18300
accccatccc ggccaacgcc accaacgtgc ccatctccat cccctcgcgc aactgggccg 18360
ccttccgcgg ctggtccttc acgcgtctca agaccaagga gacgccctcg ctgggctccg 18420
ggttcgaccc ctacttcgtc tactcgggct ccatccccta cctcgacggc accttctacc 18480
tcaaccacac cttcaagaag gtctccatca ccttcgactc ctccgtcagc tggcccggca 18540
acgaccggct cctgacgccc aacgagttcg aaatcaagcg caccgtcgac ggcgagggct 18600
acaacgtggc ccagtgcaac atgaccaagg actggttcct ggtccagatg ctggcccact 18660
acaacatcgg ctaccagggc ttctacgtgc ccgagggcta caaggaccgc atgtactcct 18720
tcttccgcaa cttccagccc atgagccgcc aggtggtgga cgaggtcaac tacaaggact 18780
accaggccgt caccctggcc taccagcaca acaactcggg cttcgtcggc tacctcgcgc 18840
ccaccatgcg ccagggccag ccctaccccg ccaactaccc ctacccgctc atcggcaaga 18900
gcgccgtcac cagcgtcacc cagaaaaagt tcctctgcga cagggtcatg tggcgcatcc 18960
ccttctccag caacttcatg tccatgggcg cgctcaccga cctcggccag aacatgctct 19020
atgccaactc cgcccacgcg ctagacatga atttcgaagt cgaccccatg gatgagtcca 19080
cccttctcta tgttgtcttc gaagtcttcg acgtcgtccg agtgcaccag ccccaccgcg 19140
gcgtcatcga ggccgtctac ctgcgcaccc ccttctcggc cggtaacgcc accacctaag 19200
ctcttgcttc ttgcaagcca tggccgcggg ctccggcgag caggagctca gggccatcat 19260
ccgcgacctg ggctgcgggc cctacttcct gggcaccttc gataagcgct tcccgggatt 19320
catggccccg cacaagctgg cctgcgccat cgtcaacacg gccggccgcg agaccggggg 19380
cgagcactgg ctggccttcg cctggaaccc gcgctcgaac acctgctacc tcttcgaccc 19440
cttcgggttc tcggacgagc gcctcaagca gatctaccag ttcgagtacg agggcctgct 19500
gcgccgcagc gccctggcca ccgaggaccg ctgcgtcacc ctggaaaagt ccacccagac 19560
cgtgcagggt ccgcgctcgg ccgcctgcgg gctcttctgc tgcatgttcc tgcacgcctt 19620
cgtgcactgg cccgaccgcc ccatggacaa gaaccccacc atgaacttgc tgacgggggt 19680
gcccaacggc atgctccagt cgccccaggt ggaacccacc ctgcgccgca accaggaggc 19740
gctctaccgc ttcctcaact cccactccgc ctactttcgc tcccaccgcg cgcgcatcga 19800
gaaggccacc gccttcgacc gcatgaatca agacatgtaa accgtgtgtg tatgttaaat 19860
gtctttaata aacagcactt tcatgttaca catgcatctg agatgattta tttagaaatc 19920
gaaagggttc tgccgggtct cggcatggcc cgcgggcagg gacacgttgc ggaactggta 19980
cttggccagc cacttgaact cggggatcag cagtttgggc agcggggtgt cggggaagga 20040
gtcggtccac agcttccgcg tcagttgcag ggcgcccagc aggtcgggcg cggagatctt 20100
gaaatcgcag ttgggacccg cgttctgcgc gcgggagttg cggtacacgg ggttgcagca 20160
ctggaacacc atcagggccg ggtgcttcac gctcgccagc accgtcgcgt cggtgatgct 20220
ctccacgtcg aggtcctcgg cgttggccat cccgaagggg gtcatcttgc aggtctgcct 20280
tcccatggtg ggcacgcacc cgggcttgtg gttgcaatcg cagtgcaggg ggatcagcat 20340
catctgggcc tggtcggcgt tcatccccgg gtacatggcc ttcatgaaag cctccaattg 20400
cctgaacgcc tgctgggcct tggctccctc ggtgaagaag accccgcagg acttgctaga 20460
gaactggttg gtggcgcacc cggcgtcgtg cacgcagcag cgcgcgtcgt tgttggccag 20520
ctgcaccacg ctgcgccccc agcggttctg ggtgatcttg gcccggtcgg ggttctcctt 20580
cagcgcgcgc tgcccgttct cgctcgccac atccatctcg atcatgtgct ccttctggat 20640
catggtggtc ccgtgcaggc accgcagctt gccctcggcc tcggtgcacc cgtgcagcca 20700
cagcgcgcac ccggtgcact cccagttctt gtgggcgatc tgggaatgcg cgtgcacgaa 20760
gccctgcagg aagcggccca tcatggtggt cagggtcttg ttgctagtga aggtcagcgg 20820
aatgccgcgg tgctcctcgt tgatgtacag gtggcagatg cggcggtaca cctcgccctg 20880
ctcgggcatc agctggaagt tggctttcag gtcggtctcc acgcggtagc ggtccatcag 20940
catagtcatg atttccatac ccttctccca ggccgagacg atgggcaggc tcatagggtt 21000
cttcaccatc atcttagcgc tagcagccgc ggccaggggg tcgctctcgt ccagggtctc 21060
aaagctccgc ttgccgtcct tctcggtgat ccgcaccggg gggtagctga agcccacggc 21120
cgccagctcc tcctcggcct gtctttcgtc ctcgctgtcc tggctgacgt cctgcaggac 21180
cacatgcttg gtcttgcggg gtttcttctt gggcggcagc ggcggcggag atgttggaga 21240
tggcgagggg gagcgcgagt tctcgctcac cactactatc tcttcctctt cttggtccga 21300
ggccacgcgg cggtaggtat gtctcttcgg gggcagaggc ggaggcgacg ggctctcgcc 21360
gccgcgactt ggcggatggc tggcagagcc ccttccgcgt tcgggggtgc gctcccggcg 21420
gcgctctgac tgacttcctc cgcggccggc cattgtgttc tcctagggag gaacaacaag 21480
catggagact cagccatcgc caacctcgcc atctgccccc accgccgacg agaagcagca 21540
gcagcagaat gaaagcttaa ccgccccgcc gcccagcccc gccacctccg acgcggccgt 21600
cccagacatg caagagatgg aggaatccat cgagattgac ctgggctatg tgacgcccgc 21660
ggagcacgag gaggagctgg cagtgcgctt ttcacaagaa gagatacacc aagaacagcc 21720
agagcaggaa gcagagaatg agcagagtca ggctgggctc gagcatgacg gcgactacct 21780
ccacctgagc gggggggagg acgcgctcat caagcatctg gcccggcagg ccaccatcgt 21840
caaggatgcg ctgctcgacc gcaccgaggt gcccctcagc gtggaggagc tcagccgcgc 21900
ctacgagttg aacctcttct cgccgcgcgt gccccccaag cgccagccca atggcacctg 21960
cgagcccaac ccgcgcctca acttctaccc ggtcttcgcg gtgcccgagg ccctggccac 22020
ctaccacatc tttttcaaga accaaaagat ccccgtctcc tgccgcgcca accgcacccg 22080
cgccgacgcc cttttcaacc tgggtcccgg cgcccgccta cctgatatcg cctccttgga 22140
agaggttccc aagatcttcg agggtctggg cagcgacgag actcgggccg cgaacgctct 22200
gcaaggagaa ggaggagagc atgagcacca cagcgccctg gtcgagttgg aaggcgacaa 22260
cgcgcggctg gcggtgctca aacgcacggt cgagctgacc catttcgcct acccggctct 22320
gaacctgccc cccaaagtca tgagcgcggt catggaccag gtgctcatca agcgcgcgtc 22380
gcccatctcc gaggacgagg gcatgcaaga ctccgaggag ggcaagcccg tggtcagcga 22440
cgagcagctg gcccggtggc tgggtcctaa tgctagtccc cagagtttgg aagagcggcg 22500
caaactcatg atggccgtgg tcctggtgac cgtggagctg gagtgcctgc gccgcttctt 22560
cgccgacgcg gagaccctgc gcaaggtcga ggagaacctg cactacctct tcaggcacgg 22620
gttcgtgcgc caggcctgca agatctccaa cgtggagctg accaacctgg tctcctacat 22680
gggcatcttg cacgagaacc gcctggggca gaacgtgctg cacaccaccc tgcgcgggga 22740
ggcccggcgc gactacatcc gcgactgcgt ctacctctac ctctgccaca cctggcagac 22800
gggcatgggc gtgtggcagc agtgtctgga ggagcagaac ctgaaagagc tctgcaagct 22860
cctgcagaag aacctcaagg gtctgtggac cgggttcgac gagcgcacca ccgcctcgga 22920
cctggccgac ctcattttcc ccgagcgcct caggctgacg ctgcgcaacg gcctgcccga 22980
ctttatgagc caaagcatgt tgcaaaactt tcgctctttc atcctcgaac gctccggaat 23040
cctgcccgcc acctgctccg cgctgccctc ggacttcgtg ccgctgacct tccgcgagtg 23100
ccccccgccg ctgtggagcc actgctacct gctgcgcctg gccaactacc tggcctacca 23160
ctcggacgtg atcgaggacg tcagcggcga gggcctgctc gagtgccact gccgctgcaa 23220
cctctgcacg ccgcaccgct ccctggcctg caacccccag ctgctgagcg agacccagat 23280
catcggcacc ttcgagttgc aagggcccag cgaaggcgag ggttcagccg ccaagggggg 23340
tctgaaactc accccggggc tgtggacctc ggcctacttg cgcaagttcg tgcccgagga 23400
ctaccatccc ttcgagatca ggttctacga ggaccaatcc catccgccca aggccgagct 23460
gtcggcctgc gtcatcaccc agggggcgat cctggcccaa ttgcaagcca tccagaaatc 23520
ccgccaagaa ttcttgctga aaaagggccg cggggtctac ctcgaccccc agaccggtga 23580
ggagctcaac cccggcttcc cccaggatgc cccgaggaaa caagaagctg aaagtggagc 23640
tgccgcccgt ggaggatttg gaggaagact gggagaacag cagtcaggca gaggaggagg 23700
agatggagga agactgggac agcactcagg cagaggagga cagcctgcaa gacagtctgg 23760
aggaagacga ggaggaggca gaggaggagg tggaagaagc agccgccgcc agaccgtcgt 23820
cctcggcggg ggagaaagca agcagcacgg ataccatctc cgctccgggt cggggtcccg 23880
ctcgaccaca cagtagatgg gacgagaccg gacgattccc gaaccccacc acccagaccg 23940
gtaagaagga gcggcaggga tacaagtcct ggcgggggca caaaaacgcc atcgtctcct 24000
gcttgcaggc ctgcgggggc aacatctcct tcacccggcg ctacctgctc ttccaccgcg 24060
gggtgaactt tccccgcaac atcttgcatt actaccgtca cctccacagc ccctactact 24120
tccaagaaga ggcagcagca gcagaaaaag accagcagaa aaccagcagc tagaaaatcc 24180
acagcggcgg cagcaggtgg actgaggatc gcggcgaacg agccggcgca aacccgggag 24240
ctgaggaacc ggatctttcc caccctctat gccatcttcc agcagagtcg ggggcaggag 24300
caggaactga aagtcaagaa ccgttctctg cgctcgctca cccgcagttg tctgtatcac 24360
aagagcgaag accaacttca gcgcactctc gaggacgccg aggctctctt caacaagtac 24420
tgcgcgctca ctcttaaaga gtagcccgcg cccgcccagt cgcagaaaaa ggcgggaatt 24480
acgtcacctg tgcccttcgc cctagccgcc tccacccatc atcatgagca aagagattcc 24540
cacgccttac atgtggagct accagcccca gatgggcctg gccgccggtg ccgcccagga 24600
ctactccacc cgcatgaatt ggctcagcgc cgggcccgcg atgatctcac gggtgaatga 24660
catccgcgcc caccgaaacc agatactcct agaacagtca gcgctcaccg ccacgccccg 24720
caatcacctc aatccgcgta attggcccgc cgccctggtg taccaggaaa ttccccagcc 24780
cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta actcaggtgt 24840
ccagctggcg ggcggcgcca ccctgtgtcg tcaccgcccc gctcagggta taaagcggct 24900
ggtgatccgg ggcagaggca cacagctcaa cgacgaggtg gtgagctctt cgctgggtct 24960
gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca cgcctcgtca 25020
ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggtggca tcggcactct 25080
ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg gctcccccgg 25140
ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg tggacggcta 25200
cgattgaatg tcccatggtg gcgcagctga cctagctcgg cttcgacacc tggaccactg 25260
ccgccgcttc cgctgcttcg ctcgggatct cgccgagttt gcctactttg agctgcccga 25320
ggagcaccct cagggcccgg cccacggagt gcggatcgtc gtcgaagggg gcctcgactc 25380
ccacctgctt cggatcttca gccagcgtcc gatcctggtc gagcgcgagc aaggacagac 25440
ccttctgact ctgtactgca tctgcaacca ccccggcctg catgaaagtc tttgttgtct 25500
gctgtgtact gagtataata aaagctgaga tcagcgacta ctccggactt ccgtgtgttt 25560
aaactcaccc ccttatccag tgaaataaag atcatattga tgatgatttt acagaaataa 25620
aaaataatca tttgatttga aataaagata caatcatatt gatgatttga gtttaacaaa 25680
aaaataaaga atcacttact tgaaatctga taccaggtct ctgtccatgt tttctgccaa 25740
caccacttca ctcccctctt cccagctctg gtactgcagg ccccggcggg ctgcaaactt 25800
cctccacacg ctgaagggga tgtcaaattc ctcctgtccc tcaatcttca ttttatcttc 25860
tatcagatgt ccaaaaagcg cgtccgggtg gatgatgact tcgaccccgt ctacccctac 25920
gatgcagaca acgcaccgac cgtgcccttc atcaaccccc ccttcgtctc ttcagatgga 25980
ttccaagaga agcccctggg ggtgttgtcc ctgcgactgg ccgaccccgt caccaccaag 26040
aacggggaaa tcaccctcaa gctgggagag ggggtggacc tcgattcctc gggaaaactc 26100
atctccaaca cggccaccaa ggccgccgcc cctctcagtt tttccaacaa caccatttcc 26160
cttaacatgg atcacccctt ttacactaaa gatggaaaat tatccttaca agtttctcca 26220
ccattaaata tactgagaac aagcattcta aacacactag ctttaggttt tggatcaggt 26280
ttaggactcc gtggctctgc cttggcagta cagttagtct ctccacttac atttgatact 26340
gatggaaaca taaagcttac cttagacaga ggtttgcatg ttacaacagg agatgcaatt 26400
gaaagcaaca taagctgggc taaaggttta aaatttgaag atggagccat agcaaccaac 26460
attggaaatg ggttagagtt tggaagcagt agtacagaaa caggtgttga tgatgcttac 26520
ccaatccaag ttaaacttgg atctggcctt agctttgaca gtacaggagc cataatggct 26580
ggtaacaaag aagacgataa actcactttg tggacaacac ctgatccatc accaaactgt 26640
caaatactcg cagaaaatga tgcaaaacta acactttgct tgactaaatg tggtagtcaa 26700
atactggcca ctgtgtcagt cttagttgta ggaagtggaa acctaaaccc cattactggc 26760
accgtaagca gtgctcaggt gtttctacgt tttgatgcaa acggtgttct tttaacagaa 26820
cattctacac taaaaaaata ctgggggtat aggcagggag atagcataga tggcactcca 26880
tataccaatg ctgtaggatt catgcccaat ttaaaagctt atccaaagtc acaaagttct 26940
actactaaaa ataatatagt agggcaagta tacatgaatg gagatgtttc aaaacctatg 27000
cttctcacta taaccctcaa tggtactgat gacagcaaca gtacatattc aatgtcattt 27060
tcatacacct ggactaatgg aagctatgtt ggagcaacat ttggggctaa ctcttatacc 27120
ttctcataca tcgcccaaga atgaacactg tatcccaccc tgcatgccaa cccttcccac 27180
cccactctgt ggaacaaact ctgaaacaca aaataaaata aagttcaagt gttttattga 27240
ttcaacagtt ttacaggatt cgagcagtta tttttcctcc accctcccag gacatggaat 27300
acaccaccct ctccccccgc acagccttga acatctgaat gccattggtg atggacatgc 27360
ttttggtctc cacgttccac acagtttcag agcgagccag tctcgggtcg gtcagggaga 27420
tgaaaccctc cgggcactcc cgcatctgca cctcacagct caacagctga ggattgtcct 27480
cggtggtcgg gatcacggtt atctggaaga agcagaagag cggcggtggg aatcatagtc 27540
cgcgaacggg atcggccggt ggtgtcgcat caggccccgc agcagtcgct gccgccgccg 27600
ctccgtcaag ctgctgctca gggggtccgg gtccagggac tccctcagca tgatgcccac 27660
ggccctcagc atcagtcgtc tggtgcggcg ggcgcagcag cgcatgcgga tctcgctcag 27720
gtcgctgcag tacgtgcaac acagaaccac caggttgttc aacagtccat agttcaacac 27780
gctccagccg aaactcatcg cgggaaggat gctacccacg tggccgtcgt accagatcct 27840
caggtaaatc aagtggtgcc ccctccagaa cacgctgccc acgtacatga tctccttggg 27900
catgtggcgg ttcaccacct cccggtacca catcaccctc tggttgaaca tgcagccccg 27960
gatgatcctg cggaaccaca gggccagcac cgccccgccc gccatgcagc gaagagaccc 28020
cgggtcccgg caatggcaat ggaggaccca ccgctcgtac ccgtggatca tctgggagct 28080
gaacaagtct atgttggcac agcacaggca tatgctcatg catctcttca gcactctcaa 28140
ctcctcgggg gtcaaaacca tatcccaggg cacggggaac tcttgcagga cagcgaaccc 28200
cgcagaacag ggcaatcctc gcacagaact tacattgtgc atggacaggg tatcgcaatc 28260
aggcagcacc gggtgatcct ccaccagaga agcgcgggtc tcggtctcct cacagcgtgg 28320
taagggggcc ggccgatacg ggtgatggcg ggacgcggct gatcgtgttc gcgaccgtgt 28380
catgatgcag ttgctttcgg acattttcgt acttgctgta gcagaacctg gtccgggcgc 28440
tgcacaccga tcgccggcgg cggtctcggc gcttggaacg ctcggtgttg aaattgtaaa 28500
acagccactc tctcagaccg tgcagcagat ctagggcctc aggagtgatg aagatcccat 28560
catgcctgat ggctctgatc acatcgacca ccgtggaatg ggccagaccc agccagatga 28620
tgcaattttg ttgggtttcg gtgacggcgg gggagggaag aacaggaaga accatgatta 28680
acttttaatc caaacggtct cggagtactt caaaatgaag atcgcggaga tggcacctct 28740
cgcccccgct gtgttggtgg aaaataacag ccaggtcaaa ggtgatacgg ttctcgagat 28800
gttccacggt ggcttccagc aaagcctcca cgcgcacatc cagaaacaag acaatagcga 28860
aagcgggagg gttctctaat tcctcaatca tcatgttaca ctcctgcacc atccccagat 28920
aattttcatt tttccagcct tgaatgattc gaactagttc ctgaggtaaa tccaagccag 28980
ccatgataaa gagctcgcgc agagcgccct ccaccggcat tcttaagcac accctcataa 29040
ttccaagata ttctgctcct ggttcacctg cagcagattg acaagcggaa tatcaaaatc 29100
tctgccgcga tccctgagct cctccctcag caataactgt aagtactctt tcatatcctc 29160
tccgaaattt ttagccatag gaccaccagg aataagatta gggcaagcca cagtacagat 29220
aaaccgaagt cctccccagt gagcattgcc aaatgcaaga ctgctataag catgctggct 29280
agacccggtg atatcttcca gataactgga cagaaaatcg cccaggcaat ttttaagaaa 29340
atcaacaaaa gaaaaatcct ccaggtggac gtttagagcc tcgggaacaa cgatgaagta 29400
aatgcaagcg gtgcgttcca gcatggttag ttagctgatc tgtagaaaaa acaaaaatga 29460
acattaaacc atgctagcct ggcgaacagg tgggtaaatc gttctctcca gcaccaggca 29520
ggccacgggg tctccggcgc gaccctcgta aaaattgtcg ctatgattga aaaccatcac 29580
agagagacgt tcccggtggc cggcgtgaat gattcgacaa gatgaataca cccccggaac 29640
attggcgtcc gcgagtgaaa aaaagcgccc gaggaagcaa taaggcacta caatgctcag 29700
tctcaagtcc agcaaagcga tgccatgcgg atgaagcaca aaattctcag gtgcgtacaa 29760
aatgtaatta ctcccctcct gcacaggcag caaagccccc gatccctcca ggtacacata 29820
caaagcctca gcgtccatag cttaccgagc agcagcacac aacaggcgca agagtcagag 29880
aaaggctgag ctctaacctg tccacccgct ctctgctcaa tatatagccc agatctacac 29940
tgacgtaaag gccaaagtct aaaaataccc gccaaataat cacacacgcc cagcacacgc 30000
ccagaaaccg gtgacacact caaaaaaata cgcgcacttc ctcaaacgcc caaaactgcc 30060
gtcatttccg ggttcccacg ctacgtcatc aaaacacgac tttcaaattc cgtcgaccgt 30120
taaaaacgtc acccgccccg cccctaacgg tcgcccgtct ctcagccaat cagcgccccg 30180
catccccaaa ttcaaacacc tcatttgcat attaacgcgc acaaaaagtt tgaggtatat 30240
tattgatgat gg 30252
<210> 5
<211> 1692
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 5
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tg 1692
<210> 6
<211> 564
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 6
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu
<210> 7
<211> 1611
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 7
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 1611
<210> 8
<211> 536
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 8
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn
530 535
<210> 9
<211> 2679
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 9
atgggagctg ccccggagcc ggagaggacc cccgttggcc agggatcgtg ggcccatccg 60
ggacgcacca ggggaccatc cgacagggga ttctgtgtgg tgtcaccggc caggccagca 120
gaagaggcaa ccagcctcga gggagcgttg tctggaacca gacattccca cccgtcggtg 180
ggccggcagc accacgcggg accaccgtcc acttccagac cgccacggcc atgggacacc 240
ccttgcccgc ctgtgtatgc cgagactaaa cacttcctgt actcatccgg agacaaggaa 300
cagcttcggc cgtccttcct cctgtcgtcg ctcagaccga gcctgaccgg agcacgcaga 360
ttggtggaaa ctatcttcct tgggtcacgt ccgtggatgc caggtacccc acggcgcctc 420
ccgcgcctcc cacagagata ctggcagatg cggcctctgt tcctggaatt gctgggaaac 480
cacgctcagt gcccgtacgg agtcctgctc aagactcact gccctctgag ggcggcggtc 540
actccggcgg ccggagtgtg cgcacgggag aagccccagg gaagcgtggc agctccggaa 600
gaggaggaca ccgatccgcg ccgcctcgtg caacttctgc gccagcactc ctcgccctgg 660
caagtctacg ggttcgtccg cgcctgcctg cgccgcctgg tgccgcctgg gctctggggt 720
tcccggcata acgagcgccg cttcctgaga aatactaaga agtttatctc acttggaaaa 780
catgccaagt tgtcgctgca agaactcacg tggaagatgt cagtccgcga ttgcgcctgg 840
ctgcgccgct cgccgggcgt cgggtgtgtt ccagctgcag aacaccgcct gagagaagaa 900
attctggcca aatttctgca ttggctgatg tcagtgtacg tggtcgagct gctgcgctcc 960
tttttctacg tcactgagac tacctttcaa aagaaccgcc tgttcttcta ccgcaaatct 1020
gtgtggagca agctgcagtc aatcggcatt cgccagcatc tgaagagggt gcagctgcgg 1080
gaactttccg aggcagaagt ccgccagcac cgggaggccc ggccggcgct tctcacgtcg 1140
cgtctgagat tcatcccaaa gcccgacggg ctgaggccta tcgtcaacat ggattacgtc 1200
gtgggcgctc gcacctttcg ccgtgaaaag cgggccgaac gcttgacctc acgggtgaag 1260
gccctcttct ccgtgctgaa ctacgagaga gcaagacggc ctggcctgct gggagcttcg 1320
gtgctgggac tggacgatat ccaccgggct tggcggacct ttgttctccg ggtgagagcc 1380
caagaccctc cgccggaact gtacttcgtg aaggtggcga tcaccggagc ctatgatact 1440
attccgcaag atcgactcac cgaagtcatc gcctcgatca tcaaaccgca gaacacttac 1500
tgcgtcaggc ggtacgccgt ggtccagaag gccgcgcatg gccacgtgag aaaggcgttc 1560
aagtcgcacg tgtccactct caccgacctc cagccttaca tgaggcaatt cgttgcgcat 1620
ttgcaagaga cttcgcccct gagagatgcg gtggtcatcg agcagagctc cagcctgaac 1680
gaagcgagca gcggtctgtt tgacgtgttc ctccgcttca tgtgtcatca cgcggtgcga 1740
atcaggggaa aatcatacgt gcagtgccag ggaatcccac aaggcagcat tctgtcgact 1800
ctcttgtgtt ccctttgcta cggcgatatg gaaaacaagc tgttcgctgg gatcagacgg 1860
gacgggttgc tgctcagact ggtggacgac ttcctgctgg tgactccgca cctcactcac 1920
gccaaaacct ttctccgcac tctggtgagg ggagtgccag aatacggctg tgtggtcaat 1980
ctccggaaaa ctgtggtgaa tttccctgtc gaggatgagg cactcggagg aaccgcattt 2040
gtccaaatgc cagcacatgg cctgttccca tggtgcggtc tgctgctgga cacccgaact 2100
cttgaagtgc agtccgacta ctccagctat gcccggacga gcatccgcgc cagcctcact 2160
ttcaatcgcg gctttaaggc cggacgaaac atgcgcagaa agcttttcgg agtcctccgg 2220
cttaaatgcc attcgctctt tctcgatctc caagtcaatt cgctgcagac cgtgtgcacg 2280
aacatctaca agatcctgct gctccaagcc taccggttcc acgcttgcgt gcttcagctg 2340
ccgtttcacc aacaggtgtg gaagaacccg accttctttc tgcgggtcat tagcgatact 2400
gcctccctgt gttactcaat cctcaaggca aagaacgccg gaatgtcgct gggtgcgaaa 2460
ggagccgcgg gacctcttcc tagcgaagcg gtgcagtggc tctgccacca ggctttcctc 2520
ctgaagctga ccaggcacag agtgacctac gtcccgctgc tgggctcgct gcgcactgca 2580
cagacccagc tgtctagaaa actccccggc accaccctga ccgctctgga agccgccgcc 2640
aacccagcat tgccgtcaga tttcaagacc atcttggac 2679
<210> 10
<211> 893
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 10
Met Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser
1 5 10 15
Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys
20 25 30
Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly
35 40 45
Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His
50 55 60
His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr
65 70 75 80
Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser
85 90 95
Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg
100 105 110
Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly
115 120 125
Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro
130 135 140
Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn
145 150 155 160
His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu
165 170 175
Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro
180 185 190
Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg
195 200 205
Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly
210 215 220
Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly
225 230 235 240
Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile
245 250 255
Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys
260 265 270
Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly
275 280 285
Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys
290 295 300
Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser
305 310 315 320
Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe
325 330 335
Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln
340 345 350
His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg
355 360 365
Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe
370 375 380
Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val
385 390 395 400
Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr
405 410 415
Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg
420 425 430
Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His
435 440 445
Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro
450 455 460
Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr
465 470 475 480
Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro
485 490 495
Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala
500 505 510
His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr
515 520 525
Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr
530 535 540
Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn
545 550 555 560
Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His
565 570 575
His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile
580 585 590
Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly
595 600 605
Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu
610 615 620
Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His
625 630 635 640
Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly
645 650 655
Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp
660 665 670
Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu
675 680 685
Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln
690 695 700
Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr
705 710 715 720
Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe
725 730 735
Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val
740 745 750
Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu
755 760 765
Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln
770 775 780
Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr
785 790 795 800
Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser
805 810 815
Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln
820 825 830
Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val
835 840 845
Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu
850 855 860
Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala
865 870 875 880
Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
885 890
<210> 11
<211> 1782
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 11
atggctagcg ccaaatttct gcattggctg atgtcagtgt acgtggtcga gctgctgcgc 60
tcctttttct acgtcactga gactaccttt caaaagaacc gcctgttctt ctaccgcaaa 120
tctgtgtgga gcaagctgca gtcaatcggc attcgccagc atctgaagag ggtgcagctg 180
cgggaacttt ccgaggcaga agtccgccag caccgggagg cccggccggc gcttctcacg 240
tcgcgtctga gattcatccc aaagcccgac gggctgaggc ctatcgtcaa catggattac 300
gtcgtgggcg ctcgcacctt tcgccgtgaa aagcgggccg aacgcttgac ctcacgggtg 360
aaggccctct tctccgtgct gaactacgag agagcaagac ggcctggcct gctgggagct 420
tcggtgctgg gactggacga tatccaccgg gcttggcgga cctttgttct ccgggtgaga 480
gcccaagacc ctccgccgga actgtacttc gtgaaggtgg cgatcaccgg agcctatgat 540
actattccgc aagatcgact caccgaagtc atcgcctcga tcatcaaacc gcagaacact 600
tactgcgtca ggcggtacgc cgtggtccag aaggccgcgc atggccacgt gagaaaggcg 660
ttcaagtcgc acgtgtccac tctcaccgac ctccagcctt acatgaggca attcgttgcg 720
catttgcaag agacttcgcc cctgagagat gcggtggtca tcgagcagag ctccagcctg 780
aacgaagcga gcagcggtct gtttgacgtg ttcctccgct tcatgtgtca tcacgcggtg 840
cgaatcaggg gaaaatcata cgtgcagtgc cagggaatcc cacaaggcag cattctgtcg 900
actctcttgt gttccctttg ctacggcgat atggaaaaca agctgttcgc tgggatcaga 960
cgggacgggt tgctgctcag actggtggac gacttcctgc tggtgactcc gcacctcact 1020
cacgccaaaa cctttctccg cactctggtg aggggagtgc cagaatacgg ctgtgtggtc 1080
aatctccgga aaactgtggt gaatttccct gtcgaggatg aggcactcgg aggaaccgca 1140
tttgtccaaa tgccagcaca tggcctgttc ccatggtgcg gtctgctgct ggacacccga 1200
actcttgaag tgcagtccga ctactccagc tatgcccgga cgagcatccg cgccagcctc 1260
actttcaatc gcggctttaa ggccggacga aacatgcgca gaaagctttt cggagtcctc 1320
cggcttaaat gccattcgct ctttctcgat ctccaagtca attcgctgca gaccgtgtgc 1380
acgaacatct acaagatcct gctgctccaa gcctaccggt tccacgcttg cgtgcttcag 1440
ctgccgtttc accaacaggt gtggaagaac ccgaccttct ttctgcgggt cattagcgat 1500
actgcctccc tgtgttactc aatcctcaag gcaaagaacg ccggaatgtc gctgggtgcg 1560
aaaggagccg cgggacctct tcctagcgaa gcggtgcagt ggctctgcca ccaggctttc 1620
ctcctgaagc tgaccaggca cagagtgacc tacgtcccgc tgctgggctc gctgcgcact 1680
gcacagaccc agctgtctag aaaactcccc ggcaccaccc tgaccgctct ggaagccgcc 1740
gccaacccag cattgccgtc agatttcaag accatcttgg ac 1782
<210> 12
<211> 594
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 12
Met Ala Ser Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val
1 5 10 15
Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys
20 25 30
Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser
35 40 45
Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser
50 55 60
Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr
65 70 75 80
Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val
85 90 95
Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg
100 105 110
Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn
115 120 125
Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly
130 135 140
Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg
145 150 155 160
Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr
165 170 175
Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala
180 185 190
Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val
195 200 205
Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His
210 215 220
Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala
225 230 235 240
His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln
245 250 255
Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu
260 265 270
Arg Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val
275 280 285
Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys
290 295 300
Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg
305 310 315 320
Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr
325 330 335
Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly
340 345 350
Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn
355 360 365
Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met
370 375 380
Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg
385 390 395 400
Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile
405 410 415
Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met
420 425 430
Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe
435 440 445
Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr
450 455 460
Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln
465 470 475 480
Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg
485 490 495
Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys
500 505 510
Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro
515 520 525
Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu
530 535 540
Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr
545 550 555 560
Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala
565 570 575
Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile
580 585 590
Leu Asp
<210> 13
<211> 2373
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 13
atggctagct tcctcctgtc gtcgctcaga ccgagcctga ccggagcacg cagattggtg 60
gaaactatct tccttgggtc acgtccgtgg atgccaggta ccccacggcg cctcccgcgc 120
ctcccacaga gatactggca gatgcggcct ctgttcctgg aattgctggg aaaccacgct 180
cagtgcccgt acggagtcct gctcaagact cactgccctc tgagggcggc ggtcactccg 240
gcggccggag tgtgcgcacg ggagaagccc cagggaagcg tggcagctcc ggaagaggag 300
gacaccgatc cgcgccgcct cgtgcaactt ctgcgccagc actcctcgcc ctggcaagtc 360
tacgggttcg tccgcgcctg cctgcgccgc ctggtgccgc ctgggctctg gggttcccgg 420
cataacgagc gccgcttcct gagaaatact aagaagttta tctcacttgg aaaacatgcc 480
aagttgtcgc tgcaagaact cacgtggaag atgtcagtcc gcgattgcgc ctggctgcgc 540
cgctcgccgg gcgtcgggtg tgttccagct gcagaacacc gcctgagaga agaaattctg 600
gccaaatttc tgcattggct gatgtcagtg tacgtggtcg agctgctgcg ctcctttttc 660
tacgtcactg agactacctt tcaaaagaac cgcctgttct tctaccgcaa atctgtgtgg 720
agcaagctgc agtcaatcgg cattcgccag catctgaaga gggtgcagct gcgggaactt 780
tccgaggcag aagtccgcca gcaccgggag gcccggccgg cgcttctcac gtcgcgtctg 840
agattcatcc caaagcccga cgggctgagg cctatcgtca acatggatta cgtcgtgggc 900
gctcgcacct ttcgccgtga aaagcgggcc gaacgcttga cctcacgggt gaaggccctc 960
ttctccgtgc tgaactacga gagagcaaga cggcctggcc tgctgggagc ttcggtgctg 1020
ggactggacg atatccaccg ggcttggcgg acctttgttc tccgggtgag agcccaagac 1080
cctccgccgg aactgtactt cgtgaaggtg gcgatcaccg gagcctatga tactattccg 1140
caagatcgac tcaccgaagt catcgcctcg atcatcaaac cgcagaacac ttactgcgtc 1200
aggcggtacg ccgtggtcca gaaggccgcg catggccacg tgagaaaggc gttcaagtcg 1260
cacgtgtcca ctctcaccga cctccagcct tacatgaggc aattcgttgc gcatttgcaa 1320
gagacttcgc ccctgagaga tgcggtggtc atcgagcaga gctccagcct gaacgaagcg 1380
agcagcggtc tgtttgacgt gttcctccgc ttcatgtgtc atcacgcggt gcgaatcagg 1440
ggaaaatcat acgtgcagtg ccagggaatc ccacaaggca gcattctgtc gactctcttg 1500
tgttcccttt gctacggcga tatggaaaac aagctgttcg ctgggatcag acgggacggg 1560
ttgctgctca gactggtgga cgacttcctg ctggtgactc cgcacctcac tcacgccaaa 1620
acctttctcc gcactctggt gaggggagtg ccagaatacg gctgtgtggt caatctccgg 1680
aaaactgtgg tgaatttccc tgtcgaggat gaggcactcg gaggaaccgc atttgtccaa 1740
atgccagcac atggcctgtt cccatggtgc ggtctgctgc tggacacccg aactcttgaa 1800
gtgcagtccg actactccag ctatgcccgg acgagcatcc gcgccagcct cactttcaat 1860
cgcggcttta aggccggacg aaacatgcgc agaaagcttt tcggagtcct ccggcttaaa 1920
tgccattcgc tctttctcga tctccaagtc aattcgctgc agaccgtgtg cacgaacatc 1980
tacaagatcc tgctgctcca agcctaccgg ttccacgctt gcgtgcttca gctgccgttt 2040
caccaacagg tgtggaagaa cccgaccttc tttctgcggg tcattagcga tactgcctcc 2100
ctgtgttact caatcctcaa ggcaaagaac gccggaatgt cgctgggtgc gaaaggagcc 2160
gcgggacctc ttcctagcga agcggtgcag tggctctgcc accaggcttt cctcctgaag 2220
ctgaccaggc acagagtgac ctacgtcccg ctgctgggct cgctgcgcac tgcacagacc 2280
cagctgtcta gaaaactccc cggcaccacc ctgaccgctc tggaagccgc cgccaaccca 2340
gcattgccgt cagatttcaa gaccatcttg gac 2373
<210> 14
<211> 791
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 14
Met Ala Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala
1 5 10 15
Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro
20 25 30
Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met
35 40 45
Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr
50 55 60
Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro
65 70 75 80
Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala
85 90 95
Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg
100 105 110
Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu
115 120 125
Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg
130 135 140
Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala
145 150 155 160
Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys
165 170 175
Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu
180 185 190
His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met
195 200 205
Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu
210 215 220
Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp
225 230 235 240
Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln
245 250 255
Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg
260 265 270
Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly
275 280 285
Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe
290 295 300
Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu
305 310 315 320
Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly
325 330 335
Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe
340 345 350
Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val
355 360 365
Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu
370 375 380
Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val
385 390 395 400
Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys
405 410 415
Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met
420 425 430
Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala
435 440 445
Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu
450 455 460
Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile Arg
465 470 475 480
Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu
485 490 495
Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu
500 505 510
Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp
515 520 525
Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg
530 535 540
Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg
545 550 555 560
Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr
565 570 575
Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu
580 585 590
Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr
595 600 605
Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys
610 615 620
Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys
625 630 635 640
Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val
645 650 655
Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His
660 665 670
Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro
675 680 685
Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser
690 695 700
Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala
705 710 715 720
Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala
725 730 735
Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu
740 745 750
Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly
755 760 765
Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser
770 775 780
Asp Phe Lys Thr Ile Leu Asp
785 790
<210> 15
<211> 1551
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 15
atggctagca caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 60
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 120
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 180
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 240
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 300
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 360
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 420
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 480
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 540
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 600
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 660
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 720
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 780
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 840
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 900
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 960
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1020
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1080
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1140
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1200
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1260
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1320
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1380
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1440
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1500
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 1551
<210> 16
<211> 517
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 16
Met Ala Ser Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu
1 5 10 15
Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu
20 25 30
Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro
35 40 45
Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro
50 55 60
Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val
65 70 75 80
Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro
85 90 95
Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser
100 105 110
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala Ser
210 215 220
Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr Ser
225 230 235 240
Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile
245 250 255
Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr
260 265 270
Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu Thr
275 280 285
Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe
290 295 300
Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu
305 310 315 320
Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser
325 330 335
Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser
340 345 350
Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
355 360 365
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn
370 375 380
Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp
385 390 395 400
Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala
405 410 415
Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu
420 425 430
Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys
435 440 445
Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr
450 455 460
Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr
465 470 475 480
Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala
485 490 495
Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala
500 505 510
Ala Ser Ala Asn Leu
515
<210> 17
<211> 3378
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 17
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cagaatcttc aacgcccact acgccggcta cttcgccgac 1740
ctgctgatcc acgacatcga gacaaaccct ggccccaccc ctggaaccca gagccccttc 1800
ttccttctgc tgctgctgac cgtgctgact gtcgtgacag gctctggcca cgccagctct 1860
acacctggcg gcgagaaaga gacaagcgcc acccagagaa gcagcgtgcc aagcagcacc 1920
gagaagaacg ccgtgtccat gaccagctcc gtgctgagca gccactctcc tggcagcggc 1980
agcagcacaa cacagggcca ggatgtgaca ctggcccctg ccacagaacc tgcctctgga 2040
tctgccgcca cctggggaca ggacgtgaca agcgtgccag tgaccagacc tgccctgggc 2100
tctacaacac cccctgccca cgatgtgacc agcgcccctg ataacaagcc tgcccctgga 2160
agcacagccc ctccagctca tggcgtgacc tctgccccag ataccagacc agccccagga 2220
tctacagccc cacccgcaca cggcgtgaca agtgcccctg acacaagacc cgctccaggc 2280
tctactgctc ctcctgccca tggcgtgaca agcgctcccg atacaaggcc agctcctggc 2340
tccacagcac caccagcaca tggcgtgaca tcagctcccg acactagacc tgctcccgga 2400
tcaaccgctc caccagctca cggcgtgacc agcgcacctg ataccagacc tgctctggga 2460
agcaccgccc ctcccgtgca caatgtgaca tctgcttccg gcagcgccag cggctctgcc 2520
tctacactgg tgcacaacgg caccagcgcc agagccacaa caaccccagc cagcaagagc 2580
acccccttca gcatccctag ccaccacagc gacaccccta ccacactggc cagccactcc 2640
accaagaccg atgcctctag cacccaccac tccagcgtgc cccctctgac cagcagcaac 2700
cacagcacaa gcccccagct gtctaccggc gtctcattct tctttctgtc cttccacatc 2760
agcaacctgc agttcaacag cagcctggaa gatcccagca ccgactacta ccaggaactg 2820
cagcgggata tcagcgagat gttcctgcaa atctacaagc agggcggctt cctgggcctg 2880
agcaacatca agttcagacc cggcagcgtg gtggtgcagc tgaccctggc tttccgggaa 2940
ggcaccatca acgtgcacga cgtggaaacc cagttcaacc agtacaagac cgaggccgcc 3000
agccggtaca acctgaccat ctccgatgtg tccgtgtccg acgtgccctt cccattctct 3060
gcccagtctg gcgcaggcgt gccaggatgg ggaattgctc tgctggtgct cgtgtgcgtg 3120
ctggtggccc tggccatcgt gtatctgatt gccctggccg tgtgccagtg ccggcggaag 3180
aattacggcc agctggacat cttccccgcc agagacacct accaccccat gagcgagtac 3240
cccacatacc acacccacgg cagatacgtg ccacccagct ccaccgacag atccccctac 3300
gagaaagtgt ctgccggcaa cggcggcagc tccctgagct acacaaatcc tgccgtggcc 3360
gctgcctccg ccaacctg 3378
<210> 18
<211> 1126
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 18
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Arg Ile Phe Asn Ala His Tyr Ala Gly
565 570 575
Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu Thr Asn Pro Gly Pro
580 585 590
Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val
595 600 605
Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly
610 615 620
Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr
625 630 635 640
Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser
645 650 655
Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala
660 665 670
Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp
675 680 685
Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro
690 695 700
Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly
705 710 715 720
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
725 730 735
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
740 745 750
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly
755 760 765
Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro
770 775 780
Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
785 790 795 800
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
805 810 815
Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala
820 825 830
Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr
835 840 845
Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser
850 855 860
Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser
865 870 875 880
Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu
885 890 895
Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser
900 905 910
Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser
915 920 925
Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile
930 935 940
Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu
945 950 955 960
Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu
965 970 975
Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
980 985 990
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser
995 1000 1005
Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser
1010 1015 1020
Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val
1025 1030 1035
Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
1040 1045 1050
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1055 1060 1065
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1070 1075 1080
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1085 1090 1095
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1100 1105 1110
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1115 1120 1125
<210> 19
<211> 3378
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 19
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 1680
acaaaccctg gccccctggc tggcgagaca ggacaggaag ccgctcctct ggacggcgtg 1740
ctggccaacc ctcccaatat cagcagcctg agccccagac agctgctggg attcccttgt 1800
gccgaggtgt ccggcctgag cacagagaga gtgcgggaac tggctgtggc cctggcccag 1860
aaaaacgtga agctgagcac cgagcagctg cggtgcctgg cccacagact gtctgagcct 1920
cccgaggatc tggacgccct gcctctggat ctgctgctgt tcctgaaccc cgacgccttc 1980
agcggacctc aggcctgcac ccggttcttc agcagaatca ccaaggccaa cgtggacctg 2040
ctgcccagag gcgcccctga gagacagaga ctgctgcctg ctgctctggc ctgttgggga 2100
gtgcggggct ctctgctgtc tgaagctgat gtgcgggccc tgggaggcct ggcttgtgat 2160
ctgcctggaa gattcgtggc cgagagcgcc gaagtgctgc tgcctagact ggtgtcctgt 2220
cccggccctc tggaccagga tcagcaggaa gctgccagag ctgctctgca gggcggaggc 2280
cctccttatg gacctcctag cacttggagc gtgtccacca tggatgccct gaggggcctg 2340
ctgccagtgc tgggccagcc tatcatcaga tccatcccac agggcatcgt ggccgcctgg 2400
cggcagagaa gctctagaga tccctcttgg cggcagcccg agcggacaat cctgcggccc 2460
aggtttcgga gagaggtgga aaagaccgcc tgcccctctg gcaagaaggc cagagagatc 2520
gacgagagcc tgatcttcta caagaagtgg gagctggaag cctgcgtgga cgccgctctg 2580
ctggccaccc agatggacag agtgaacgcc atccccttca cctatgagca gctggacgtg 2640
ctgaagcaca agctggatga gctgtacccc cagggctacc ccgagagcgt gatccagcac 2700
ctgggctacc tgtttctgaa gatgagcccc gaggacatcc ggaagtggaa cgtgaccagc 2760
ctggaaaccc tgaaggccct gctggaagtg aacaagggcc acgagatgtc cccccaggtg 2820
gccacactga tcgacagatt cgtgaagggc agaggccagc tggacaagga caccctggat 2880
acactgaccg ccttctaccc cggctatctg tgcagcctgt cccccgagga actgagcagc 2940
gtgccaccta gctctatctg ggctgtgcgg ccccaggacc tggatacctg cgatcctaga 3000
cagctggatg tgctgtatcc caaggcccgg ctggccttcc agaacatgaa cggcagcgag 3060
tacttcgtga agatccagtc cttcctgggc ggagccccta ccgaggacct gaaagctctg 3120
agccagcaga acgtgtccat ggatctggcc acctttatga agctgcggac cgacgccgtg 3180
ctgcctctga cagtggctga ggtgcagaaa ctgctgggcc cccatgtgga agggctgaag 3240
gccgaagaac ggcacagacc cgtgcgcgac tggatcctga ggcagagaca ggatgacctg 3300
gacacactgg gcctgggact gcaggggggc atccctaatg gctacctggt gctggacctg 3360
agcatgcagg aagccctg 3378
<210> 20
<211> 1126
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 20
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Arg Ile Phe Asn
530 535 540
Ala His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu
545 550 555 560
Thr Asn Pro Gly Pro Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro
565 570 575
Leu Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro
580 585 590
Arg Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr
595 600 605
Glu Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys
610 615 620
Leu Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro
625 630 635 640
Pro Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn
645 650 655
Pro Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg
660 665 670
Ile Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg
675 680 685
Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser
690 695 700
Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp
705 710 715 720
Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg
725 730 735
Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala
740 745 750
Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr
755 760 765
Trp Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu
770 775 780
Gly Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp
785 790 795 800
Arg Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr
805 810 815
Ile Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro
820 825 830
Ser Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys
835 840 845
Lys Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln
850 855 860
Met Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val
865 870 875 880
Leu Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser
885 890 895
Val Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp
900 905 910
Ile Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu
915 920 925
Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile
930 935 940
Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp
945 950 955 960
Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu
965 970 975
Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln
980 985 990
Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys
995 1000 1005
Ala Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val
1010 1015 1020
Lys Ile Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys
1025 1030 1035
Ala Leu Ser Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met
1040 1045 1050
Lys Leu Arg Thr Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val
1055 1060 1065
Gln Lys Leu Leu Gly Pro His Val Glu Gly Leu Lys Ala Glu Glu
1070 1075 1080
Arg His Arg Pro Val Arg Asp Trp Ile Leu Arg Gln Arg Gln Asp
1085 1090 1095
Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln Gly Gly Ile Pro Asn
1100 1105 1110
Gly Tyr Leu Val Leu Asp Leu Ser Met Gln Glu Ala Leu
1115 1120 1125
<210> 21
<211> 3360
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 21
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
gccaccaatt tcagcctgct gaaacaggcc ggcgacgtgg aagagaaccc tggccctctg 1680
gctggcgaga caggacagga agccgctcct ctggacggcg tgctggccaa ccctcccaat 1740
atcagcagcc tgagccccag acagctgctg ggattccctt gtgccgaggt gtccggcctg 1800
agcacagaga gagtgcggga actggctgtg gccctggccc agaaaaacgt gaagctgagc 1860
accgagcagc tgcggtgcct ggcccacaga ctgtctgagc ctcccgagga tctggacgcc 1920
ctgcctctgg atctgctgct gttcctgaac cccgacgcct tcagcggacc tcaggcctgc 1980
acccggttct tcagcagaat caccaaggcc aacgtggacc tgctgcccag aggcgcccct 2040
gagagacaga gactgctgcc tgctgctctg gcctgttggg gagtgcgggg ctctctgctg 2100
tctgaagctg atgtgcgggc cctgggaggc ctggcttgtg atctgcctgg aagattcgtg 2160
gccgagagcg ccgaagtgct gctgcctaga ctggtgtcct gtcccggccc tctggaccag 2220
gatcagcagg aagctgccag agctgctctg cagggcggag gccctcctta tggacctcct 2280
agcacttgga gcgtgtccac catggatgcc ctgaggggcc tgctgccagt gctgggccag 2340
cctatcatca gatccatccc acagggcatc gtggccgcct ggcggcagag aagctctaga 2400
gatccctctt ggcggcagcc cgagcggaca atcctgcggc ccaggtttcg gagagaggtg 2460
gaaaagaccg cctgcccctc tggcaagaag gccagagaga tcgacgagag cctgatcttc 2520
tacaagaagt gggagctgga agcctgcgtg gacgccgctc tgctggccac ccagatggac 2580
agagtgaacg ccatcccctt cacctatgag cagctggacg tgctgaagca caagctggat 2640
gagctgtacc cccagggcta ccccgagagc gtgatccagc acctgggcta cctgtttctg 2700
aagatgagcc ccgaggacat ccggaagtgg aacgtgacca gcctggaaac cctgaaggcc 2760
ctgctggaag tgaacaaggg ccacgagatg tccccccagg tggccacact gatcgacaga 2820
ttcgtgaagg gcagaggcca gctggacaag gacaccctgg atacactgac cgccttctac 2880
cccggctatc tgtgcagcct gtcccccgag gaactgagca gcgtgccacc tagctctatc 2940
tgggctgtgc ggccccagga cctggatacc tgcgatccta gacagctgga tgtgctgtat 3000
cccaaggccc ggctggcctt ccagaacatg aacggcagcg agtacttcgt gaagatccag 3060
tccttcctgg gcggagcccc taccgaggac ctgaaagctc tgagccagca gaacgtgtcc 3120
atggatctgg ccacctttat gaagctgcgg accgacgccg tgctgcctct gacagtggct 3180
gaggtgcaga aactgctggg cccccatgtg gaagggctga aggccgaaga acggcacaga 3240
cccgtgcgcg actggatcct gaggcagaga caggatgacc tggacacact gggcctggga 3300
ctgcaggggg gcatccctaa tggctacctg gtgctggacc tgagcatgca ggaagccctg 3360
<210> 22
<211> 1120
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 22
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Ala Thr Asn Phe
530 535 540
Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Leu
545 550 555 560
Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp Gly Val Leu Ala
565 570 575
Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln Leu Leu Gly Phe
580 585 590
Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg Val Arg Glu Leu
595 600 605
Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser Thr Glu Gln Leu
610 615 620
Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu Asp Leu Asp Ala
625 630 635 640
Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp Ala Phe Ser Gly
645 650 655
Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr Lys Ala Asn Val
660 665 670
Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg Leu Leu Pro Ala
675 680 685
Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu Ser Glu Ala Asp
690 695 700
Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro Gly Arg Phe Val
705 710 715 720
Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val Ser Cys Pro Gly
725 730 735
Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala Ala Leu Gln Gly
740 745 750
Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser Val Ser Thr Met
755 760 765
Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln Pro Ile Ile Arg
770 775 780
Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln Arg Ser Ser Arg
785 790 795 800
Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu Arg Pro Arg Phe
805 810 815
Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly Lys Lys Ala Arg
820 825 830
Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp Glu Leu Glu Ala
835 840 845
Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp Arg Val Asn Ala
850 855 860
Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys His Lys Leu Asp
865 870 875 880
Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile Gln His Leu Gly
885 890 895
Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg Lys Trp Asn Val
900 905 910
Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val Asn Lys Gly His
915 920 925
Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg Phe Val Lys Gly
930 935 940
Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu Thr Ala Phe Tyr
945 950 955 960
Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu Ser Ser Val Pro
965 970 975
Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu Asp Thr Cys Asp
980 985 990
Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg Leu Ala Phe Gln
995 1000 1005
Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln Ser Phe Leu
1010 1015 1020
Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln Gln Asn
1025 1030 1035
Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp Ala
1040 1045 1050
Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
1055 1060 1065
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg
1070 1075 1080
Asp Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly
1085 1090 1095
Leu Gly Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp
1100 1105 1110
Leu Ser Met Gln Glu Ala Leu
1115 1120
<210> 23
<211> 3360
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 23
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cgccaccaat ttcagcctgc tgaaacaggc cggcgacgtg 1740
gaagagaacc ctggccctac ccctggaacc cagagcccct tcttccttct gctgctgctg 1800
accgtgctga ctgtcgtgac aggctctggc cacgccagct ctacacctgg cggcgagaaa 1860
gagacaagcg ccacccagag aagcagcgtg ccaagcagca ccgagaagaa cgccgtgtcc 1920
atgaccagct ccgtgctgag cagccactct cctggcagcg gcagcagcac aacacagggc 1980
caggatgtga cactggcccc tgccacagaa cctgcctctg gatctgccgc cacctgggga 2040
caggacgtga caagcgtgcc agtgaccaga cctgccctgg gctctacaac accccctgcc 2100
cacgatgtga ccagcgcccc tgataacaag cctgcccctg gaagcacagc ccctccagct 2160
catggcgtga cctctgcccc agataccaga ccagccccag gatctacagc cccacccgca 2220
cacggcgtga caagtgcccc tgacacaaga cccgctccag gctctactgc tcctcctgcc 2280
catggcgtga caagcgctcc cgatacaagg ccagctcctg gctccacagc accaccagca 2340
catggcgtga catcagctcc cgacactaga cctgctcccg gatcaaccgc tccaccagct 2400
cacggcgtga ccagcgcacc tgataccaga cctgctctgg gaagcaccgc ccctcccgtg 2460
cacaatgtga catctgcttc cggcagcgcc agcggctctg cctctacact ggtgcacaac 2520
ggcaccagcg ccagagccac aacaacccca gccagcaaga gcaccccctt cagcatccct 2580
agccaccaca gcgacacccc taccacactg gccagccact ccaccaagac cgatgcctct 2640
agcacccacc actccagcgt gccccctctg accagcagca accacagcac aagcccccag 2700
ctgtctaccg gcgtctcatt cttctttctg tccttccaca tcagcaacct gcagttcaac 2760
agcagcctgg aagatcccag caccgactac taccaggaac tgcagcggga tatcagcgag 2820
atgttcctgc aaatctacaa gcagggcggc ttcctgggcc tgagcaacat caagttcaga 2880
cccggcagcg tggtggtgca gctgaccctg gctttccggg aaggcaccat caacgtgcac 2940
gacgtggaaa cccagttcaa ccagtacaag accgaggccg ccagccggta caacctgacc 3000
atctccgatg tgtccgtgtc cgacgtgccc ttcccattct ctgcccagtc tggcgcaggc 3060
gtgccaggat ggggaattgc tctgctggtg ctcgtgtgcg tgctggtggc cctggccatc 3120
gtgtatctga ttgccctggc cgtgtgccag tgccggcgga agaattacgg ccagctggac 3180
atcttccccg ccagagacac ctaccacccc atgagcgagt accccacata ccacacccac 3240
ggcagatacg tgccacccag ctccaccgac agatccccct acgagaaagt gtctgccggc 3300
aacggcggca gctccctgag ctacacaaat cctgccgtgg ccgctgcctc cgccaacctg 3360
<210> 24
<211> 1120
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 24
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Ala Thr Asn Phe Ser Leu Leu Lys Gln
565 570 575
Ala Gly Asp Val Glu Glu Asn Pro Gly Pro Thr Pro Gly Thr Gln Ser
580 585 590
Pro Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr Val Val Thr Gly
595 600 605
Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala
610 615 620
Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser
625 630 635 640
Met Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser
645 650 655
Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala
660 665 670
Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val
675 680 685
Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr
690 695 700
Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
705 710 715 720
His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
725 730 735
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala
740 745 750
Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp
755 760 765
Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr
770 775 780
Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
785 790 795 800
His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr
805 810 815
Ala Pro Pro Val His Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly
820 825 830
Ser Ala Ser Thr Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr
835 840 845
Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser
850 855 860
Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser
865 870 875 880
Ser Thr His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser
885 890 895
Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe
900 905 910
His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr
915 920 925
Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln
930 935 940
Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg
945 950 955 960
Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr
965 970 975
Ile Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu
980 985 990
Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser Asp
995 1000 1005
Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly
1010 1015 1020
Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu
1025 1030 1035
Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg
1040 1045 1050
Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr
1055 1060 1065
His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr
1070 1075 1080
Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser
1085 1090 1095
Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val
1100 1105 1110
Ala Ala Ala Ser Ala Asn Leu
1115 1120
<210> 25
<211> 4302
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 25
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggaggctcc 1620
ggcggaggag ctgccccgga gccggagagg acccccgttg gccagggatc gtgggcccat 1680
ccgggacgca ccaggggacc atccgacagg ggattctgtg tggtgtcacc ggccaggcca 1740
gcagaagagg caaccagcct cgagggagcg ttgtctggaa ccagacattc ccacccgtcg 1800
gtgggccggc agcaccacgc gggaccaccg tccacttcca gaccgccacg gccatgggac 1860
accccttgcc cgcctgtgta tgccgagact aaacacttcc tgtactcatc cggagacaag 1920
gaacagcttc ggccgtcctt cctcctgtcg tcgctcagac cgagcctgac cggagcacgc 1980
agattggtgg aaactatctt ccttgggtca cgtccgtgga tgccaggtac cccacggcgc 2040
ctcccgcgcc tcccacagag atactggcag atgcggcctc tgttcctgga attgctggga 2100
aaccacgctc agtgcccgta cggagtcctg ctcaagactc actgccctct gagggcggcg 2160
gtcactccgg cggccggagt gtgcgcacgg gagaagcccc agggaagcgt ggcagctccg 2220
gaagaggagg acaccgatcc gcgccgcctc gtgcaacttc tgcgccagca ctcctcgccc 2280
tggcaagtct acgggttcgt ccgcgcctgc ctgcgccgcc tggtgccgcc tgggctctgg 2340
ggttcccggc ataacgagcg ccgcttcctg agaaatacta agaagtttat ctcacttgga 2400
aaacatgcca agttgtcgct gcaagaactc acgtggaaga tgtcagtccg cgattgcgcc 2460
tggctgcgcc gctcgccggg cgtcgggtgt gttccagctg cagaacaccg cctgagagaa 2520
gaaattctgg ccaaatttct gcattggctg atgtcagtgt acgtggtcga gctgctgcgc 2580
tcctttttct acgtcactga gactaccttt caaaagaacc gcctgttctt ctaccgcaaa 2640
tctgtgtgga gcaagctgca gtcaatcggc attcgccagc atctgaagag ggtgcagctg 2700
cgggaacttt ccgaggcaga agtccgccag caccgggagg cccggccggc gcttctcacg 2760
tcgcgtctga gattcatccc aaagcccgac gggctgaggc ctatcgtcaa catggattac 2820
gtcgtgggcg ctcgcacctt tcgccgtgaa aagcgggccg aacgcttgac ctcacgggtg 2880
aaggccctct tctccgtgct gaactacgag agagcaagac ggcctggcct gctgggagct 2940
tcggtgctgg gactggacga tatccaccgg gcttggcgga cctttgttct ccgggtgaga 3000
gcccaagacc ctccgccgga actgtacttc gtgaaggtgg cgatcaccgg agcctatgat 3060
actattccgc aagatcgact caccgaagtc atcgcctcga tcatcaaacc gcagaacact 3120
tactgcgtca ggcggtacgc cgtggtccag aaggccgcgc atggccacgt gagaaaggcg 3180
ttcaagtcgc acgtgtccac tctcaccgac ctccagcctt acatgaggca attcgttgcg 3240
catttgcaag agacttcgcc cctgagagat gcggtggtca tcgagcagag ctccagcctg 3300
aacgaagcga gcagcggtct gtttgacgtg ttcctccgct tcatgtgtca tcacgcggtg 3360
cgaatcaggg gaaaatcata cgtgcagtgc cagggaatcc cacaaggcag cattctgtcg 3420
actctcttgt gttccctttg ctacggcgat atggaaaaca agctgttcgc tgggatcaga 3480
cgggacgggt tgctgctcag actggtggac gacttcctgc tggtgactcc gcacctcact 3540
cacgccaaaa cctttctccg cactctggtg aggggagtgc cagaatacgg ctgtgtggtc 3600
aatctccgga aaactgtggt gaatttccct gtcgaggatg aggcactcgg aggaaccgca 3660
tttgtccaaa tgccagcaca tggcctgttc ccatggtgcg gtctgctgct ggacacccga 3720
actcttgaag tgcagtccga ctactccagc tatgcccgga cgagcatccg cgccagcctc 3780
actttcaatc gcggctttaa ggccggacga aacatgcgca gaaagctttt cggagtcctc 3840
cggcttaaat gccattcgct ctttctcgat ctccaagtca attcgctgca gaccgtgtgc 3900
acgaacatct acaagatcct gctgctccaa gcctaccggt tccacgcttg cgtgcttcag 3960
ctgccgtttc accaacaggt gtggaagaac ccgaccttct ttctgcgggt cattagcgat 4020
actgcctccc tgtgttactc aatcctcaag gcaaagaacg ccggaatgtc gctgggtgcg 4080
aaaggagccg cgggacctct tcctagcgaa gcggtgcagt ggctctgcca ccaggctttc 4140
ctcctgaagc tgaccaggca cagagtgacc tacgtcccgc tgctgggctc gctgcgcact 4200
gcacagaccc agctgtctag aaaactcccc ggcaccaccc tgaccgctct ggaagccgcc 4260
gccaacccag cattgccgtc agatttcaag accatcttgg ac 4302
<210> 26
<211> 1434
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 26
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Gly Ser Gly Gly Gly Ala
530 535 540
Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His
545 550 555 560
Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser
565 570 575
Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser
580 585 590
Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His His Ala Gly
595 600 605
Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro
610 615 620
Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys
625 630 635 640
Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu
645 650 655
Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro
660 665 670
Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr
675 680 685
Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln
690 695 700
Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala
705 710 715 720
Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser
725 730 735
Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln
740 745 750
Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg
755 760 765
Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His
770 775 780
Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly
785 790 795 800
Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val
805 810 815
Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro
820 825 830
Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His
835 840 845
Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr
850 855 860
Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys
865 870 875 880
Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys
885 890 895
Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg
900 905 910
Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys
915 920 925
Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala
930 935 940
Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val
945 950 955 960
Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly
965 970 975
Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp
980 985 990
Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu
995 1000 1005
Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro
1010 1015 1020
Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln
1025 1030 1035
Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala
1040 1045 1050
His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu
1055 1060 1065
Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
1070 1075 1080
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser
1085 1090 1095
Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg
1100 1105 1110
Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val
1115 1120 1125
Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu
1130 1135 1140
Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly
1145 1150 1155
Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu
1160 1165 1170
Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr
1175 1180 1185
Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg
1190 1195 1200
Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly
1205 1210 1215
Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys
1220 1225 1230
Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr
1235 1240 1245
Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn
1250 1255 1260
Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly
1265 1270 1275
Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val
1280 1285 1290
Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu
1295 1300 1305
Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
1310 1315 1320
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile
1325 1330 1335
Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn
1340 1345 1350
Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro
1355 1360 1365
Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys
1370 1375 1380
Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu
1385 1390 1395
Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr
1400 1405 1410
Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp
1415 1420 1425
Phe Lys Thr Ile Leu Asp
1430
<210> 27
<211> 4371
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 27
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
acaatcctgt ctgagggcgc caccaacttc agcctgctga aactggccgg cgacgtggaa 1680
ctgaaccctg gccctggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1740
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1800
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1860
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1920
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 1980
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 2040
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2100
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2160
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2220
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2280
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2340
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2400
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2460
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2520
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2580
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2640
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2700
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2760
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2820
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2880
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 2940
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 3000
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3060
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3120
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3180
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3240
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3300
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3360
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3420
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3480
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3540
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3600
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3660
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3720
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3780
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3840
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3900
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 3960
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 4020
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4080
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4140
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4200
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4260
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4320
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga c 4371
<210> 28
<211> 1457
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 28
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Thr Ile Leu Ser
530 535 540
Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu
545 550 555 560
Leu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val
565 570 575
Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp
580 585 590
Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr
595 600 605
Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val
610 615 620
Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg
625 630 635 640
Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe
645 650 655
Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu
660 665 670
Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr
675 680 685
Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu
690 695 700
Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu
705 710 715 720
Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr
725 730 735
His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala
740 745 750
Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr
755 760 765
Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp
770 775 780
Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro
785 790 795 800
Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr
805 810 815
Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu
820 825 830
Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser
835 840 845
Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu
850 855 860
Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu
865 870 875 880
Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn
885 890 895
Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile
900 905 910
Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu
915 920 925
Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser
930 935 940
Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn
945 950 955 960
Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala
965 970 975
Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr
980 985 990
Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu
995 1000 1005
Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg
1010 1015 1020
Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile
1025 1030 1035
Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val
1040 1045 1050
Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg
1055 1060 1065
Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala
1070 1075 1080
Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met
1085 1090 1095
Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp
1100 1105 1110
Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser
1115 1120 1125
Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val
1130 1135 1140
Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln
1145 1150 1155
Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp
1160 1165 1170
Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu
1175 1180 1185
Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr
1190 1195 1200
His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
1205 1210 1215
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro
1220 1225 1230
Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro
1235 1240 1245
Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg
1250 1255 1260
Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser
1265 1270 1275
Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg
1280 1285 1290
Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His
1295 1300 1305
Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys
1310 1315 1320
Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His
1325 1330 1335
Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn
1340 1345 1350
Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys
1355 1360 1365
Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala
1370 1375 1380
Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu
1385 1390 1395
Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr
1400 1405 1410
Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu
1415 1420 1425
Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala
1430 1435 1440
Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
1445 1450 1455
<210> 29
<211> 4371
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 29
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcacaatc 2700
ctgtctgagg gcgccaccaa cttcagcctg ctgaaactgg ccggcgacgt ggaactgaac 2760
cctggcccta cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 2820
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 2880
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 2940
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 3000
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 3060
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 3120
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 3180
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 3240
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 3300
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 3360
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 3420
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 3480
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 3540
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 3600
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 3660
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 3720
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 3780
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 3840
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 3900
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 3960
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 4020
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 4080
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 4140
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 4200
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 4260
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 4320
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 4371
<210> 30
<211> 1457
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 30
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
900 905 910
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Thr Pro Gly Thr Gln
915 920 925
Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr Val Val Thr
930 935 940
Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser
945 950 955 960
Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val
965 970 975
Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser
980 985 990
Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro
995 1000 1005
Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val
1010 1015 1020
Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His
1025 1030 1035
Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr
1040 1045 1050
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1055 1060 1065
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1070 1075 1080
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1085 1090 1095
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1100 1105 1110
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1115 1120 1125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1130 1135 1140
Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His
1145 1150 1155
Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr
1160 1165 1170
Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala
1175 1180 1185
Ser Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr
1190 1195 1200
Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser
1205 1210 1215
Thr His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser
1220 1225 1230
Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser
1235 1240 1245
Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro
1250 1255 1260
Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met
1265 1270 1275
Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn
1280 1285 1290
Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
1295 1300 1305
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
1310 1315 1320
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile
1325 1330 1335
Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln
1340 1345 1350
Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
1355 1360 1365
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu
1370 1375 1380
Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile
1385 1390 1395
Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr
1400 1405 1410
Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg
1415 1420 1425
Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu
1430 1435 1440
Ser Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1445 1450 1455
<210> 31
<211> 4311
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 31
atggctagca caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 60
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 120
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 180
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 240
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 300
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 360
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 420
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 480
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 540
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 600
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 660
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 720
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 780
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 840
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 900
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 960
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1020
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1080
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1140
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1200
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1260
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1320
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1380
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1440
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1500
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1560
acaatcctgt ctgagggcgc caccaacttc agcctgctga aactggccgg cgacgtggaa 1620
ctgaaccctg gccctggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1680
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1740
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1800
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1860
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 1920
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 1980
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2040
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2100
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2160
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2220
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2280
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2340
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2400
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2460
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2520
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2580
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2640
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2700
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2760
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2820
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 2880
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 2940
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3000
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3060
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3120
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3180
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3240
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3300
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3360
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3420
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3480
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3540
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3600
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3660
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3720
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3780
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3840
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 3900
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 3960
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4020
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4080
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4140
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4200
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4260
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga c 4311
<210> 32
<211> 1437
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 32
Met Ala Ser Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu
1 5 10 15
Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu
20 25 30
Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro
35 40 45
Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro
50 55 60
Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val
65 70 75 80
Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro
85 90 95
Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser
100 105 110
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala Ser
210 215 220
Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr Ser
225 230 235 240
Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile
245 250 255
Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr
260 265 270
Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu Thr
275 280 285
Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe
290 295 300
Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu
305 310 315 320
Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser
325 330 335
Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser
340 345 350
Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
355 360 365
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn
370 375 380
Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp
385 390 395 400
Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala
405 410 415
Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu
420 425 430
Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys
435 440 445
Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr
450 455 460
Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr
465 470 475 480
Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala
485 490 495
Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala
500 505 510
Ala Ser Ala Asn Leu Gly Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr
515 520 525
Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu Leu Asn Pro Gly
530 535 540
Pro Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser
545 550 555 560
Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys
565 570 575
Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly
580 585 590
Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His
595 600 605
His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr
610 615 620
Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser
625 630 635 640
Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg
645 650 655
Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly
660 665 670
Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro
675 680 685
Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn
690 695 700
His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu
705 710 715 720
Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro
725 730 735
Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg
740 745 750
Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly
755 760 765
Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly
770 775 780
Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile
785 790 795 800
Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys
805 810 815
Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly
820 825 830
Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys
835 840 845
Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser
850 855 860
Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe
865 870 875 880
Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln
885 890 895
His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg
900 905 910
Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe
915 920 925
Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val
930 935 940
Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr
945 950 955 960
Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg
965 970 975
Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His
980 985 990
Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro
995 1000 1005
Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp
1010 1015 1020
Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
1025 1030 1035
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln
1040 1045 1050
Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val
1055 1060 1065
Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala
1070 1075 1080
His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu
1085 1090 1095
Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val
1100 1105 1110
Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys
1115 1120 1125
Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser
1130 1135 1140
Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu
1145 1150 1155
Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp
1160 1165 1170
Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe
1175 1180 1185
Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val
1190 1195 1200
Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala
1205 1210 1215
Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe
1220 1225 1230
Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln
1235 1240 1245
Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu
1250 1255 1260
Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
1265 1270 1275
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp
1280 1285 1290
Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys
1295 1300 1305
Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln
1310 1315 1320
Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu
1325 1330 1335
Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys
1340 1345 1350
Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly
1355 1360 1365
Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe
1370 1375 1380
Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu
1385 1390 1395
Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro
1400 1405 1410
Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu
1415 1420 1425
Pro Ser Asp Phe Lys Thr Ile Leu Asp
1430 1435
<210> 33
<211> 4311
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 33
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcacaatc 2700
ctgtctgagg gcgccaccaa cttcagcctg ctgaaactgg ccggcgacgt ggaactgaac 2760
cctggcccta caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 2820
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 2880
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 2940
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 3000
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 3060
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 3120
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 3180
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 3240
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 3300
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 3360
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 3420
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 3480
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 3540
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 3600
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 3660
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 3720
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 3780
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 3840
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 3900
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 3960
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 4020
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 4080
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 4140
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 4200
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 4260
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct g 4311
<210> 34
<211> 1437
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 34
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
900 905 910
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Thr Gly Ser Gly His
915 920 925
Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg
930 935 940
Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser
945 950 955 960
Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln
965 970 975
Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser
980 985 990
Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro
995 1000 1005
Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala
1010 1015 1020
Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1025 1030 1035
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1040 1045 1050
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1055 1060 1065
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1070 1075 1080
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1085 1090 1095
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1100 1105 1110
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1115 1120 1125
Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala
1130 1135 1140
Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly
1145 1150 1155
Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro
1160 1165 1170
Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
1175 1180 1185
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser
1190 1195 1200
Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu
1205 1210 1215
Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn
1220 1225 1230
Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr
1235 1240 1245
Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr
1250 1255 1260
Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro
1265 1270 1275
Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr
1280 1285 1290
Ile Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr
1295 1300 1305
Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val
1310 1315 1320
Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val
1325 1330 1335
Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu Val
1340 1345 1350
Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys
1355 1360 1365
Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp
1370 1375 1380
Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly
1385 1390 1395
Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys
1400 1405 1410
Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
1415 1420 1425
Ala Val Ala Ala Ala Ser Ala Asn Leu
1430 1435
<210> 35
<211> 4431
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 35
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcgagggc 2700
agaggcagcc tgctgacatg tggcgacgtg gaagagaacc ctggccccct ggctggcgag 2760
acaggacagg aagccgctcc tctggacggc gtgctggcca accctcccaa tatcagcagc 2820
ctgagcccca gacagctgct gggattccct tgtgccgagg tgtccggcct gagcacagag 2880
agagtgcggg aactggctgt ggccctggcc cagaaaaacg tgaagctgag caccgagcag 2940
ctgcggtgcc tggcccacag actgtctgag cctcccgagg atctggacgc cctgcctctg 3000
gatctgctgc tgttcctgaa ccccgacgcc ttcagcggac ctcaggcctg cacccggttc 3060
ttcagcagaa tcaccaaggc caacgtggac ctgctgccca gaggcgcccc tgagagacag 3120
agactgctgc ctgctgctct ggcctgttgg ggagtgcggg gctctctgct gtctgaagct 3180
gatgtgcggg ccctgggagg cctggcttgt gatctgcctg gaagattcgt ggccgagagc 3240
gccgaagtgc tgctgcctag actggtgtcc tgtcccggcc ctctggacca ggatcagcag 3300
gaagctgcca gagctgctct gcagggcgga ggccctcctt atggacctcc tagcacttgg 3360
agcgtgtcca ccatggatgc cctgaggggc ctgctgccag tgctgggcca gcctatcatc 3420
agatccatcc cacagggcat cgtggccgcc tggcggcaga gaagctctag agatccctct 3480
tggcggcagc ccgagcggac aatcctgcgg cccaggtttc ggagagaggt ggaaaagacc 3540
gcctgcccct ctggcaagaa ggccagagag atcgacgaga gcctgatctt ctacaagaag 3600
tgggagctgg aagcctgcgt ggacgccgct ctgctggcca cccagatgga cagagtgaac 3660
gccatcccct tcacctatga gcagctggac gtgctgaagc acaagctgga tgagctgtac 3720
ccccagggct accccgagag cgtgatccag cacctgggct acctgtttct gaagatgagc 3780
cccgaggaca tccggaagtg gaacgtgacc agcctggaaa ccctgaaggc cctgctggaa 3840
gtgaacaagg gccacgagat gtccccccag gtggccacac tgatcgacag attcgtgaag 3900
ggcagaggcc agctggacaa ggacaccctg gatacactga ccgccttcta ccccggctat 3960
ctgtgcagcc tgtcccccga ggaactgagc agcgtgccac ctagctctat ctgggctgtg 4020
cggccccagg acctggatac ctgcgatcct agacagctgg atgtgctgta tcccaaggcc 4080
cggctggcct tccagaacat gaacggcagc gagtacttcg tgaagatcca gtccttcctg 4140
ggcggagccc ctaccgagga cctgaaagct ctgagccagc agaacgtgtc catggatctg 4200
gccaccttta tgaagctgcg gaccgacgcc gtgctgcctc tgacagtggc tgaggtgcag 4260
aaactgctgg gcccccatgt ggaagggctg aaggccgaag aacggcacag acccgtgcgc 4320
gactggatcc tgaggcagag acaggatgac ctggacacac tgggcctggg actgcagggg 4380
ggcatcccta atggctacct ggtgctggac ctgagcatgc aggaagccct g 4431
<210> 36
<211> 1477
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 36
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
900 905 910
Asn Pro Gly Pro Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu
915 920 925
Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg
930 935 940
Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu
945 950 955 960
Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu
965 970 975
Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro
980 985 990
Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro
995 1000 1005
Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg
1010 1015 1020
Ile Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu
1025 1030 1035
Arg Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg
1040 1045 1050
Gly Ser Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu
1055 1060 1065
Ala Cys Asp Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val
1070 1075 1080
Leu Leu Pro Arg Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp
1085 1090 1095
Gln Gln Glu Ala Ala Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro
1100 1105 1110
Tyr Gly Pro Pro Ser Thr Trp Ser Val Ser Thr Met Asp Ala Leu
1115 1120 1125
Arg Gly Leu Leu Pro Val Leu Gly Gln Pro Ile Ile Arg Ser Ile
1130 1135 1140
Pro Gln Gly Ile Val Ala Ala Trp Arg Gln Arg Ser Ser Arg Asp
1145 1150 1155
Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu Arg Pro Arg Phe
1160 1165 1170
Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly Lys Lys Ala
1175 1180 1185
Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp Glu Leu
1190 1195 1200
Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp Arg
1205 1210 1215
Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
1220 1225 1230
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val
1235 1240 1245
Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp
1250 1255 1260
Ile Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu
1265 1270 1275
Leu Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr
1280 1285 1290
Leu Ile Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp
1295 1300 1305
Thr Leu Asp Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser
1310 1315 1320
Leu Ser Pro Glu Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp
1325 1330 1335
Ala Val Arg Pro Gln Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu
1340 1345 1350
Asp Val Leu Tyr Pro Lys Ala Arg Leu Ala Phe Gln Asn Met Asn
1355 1360 1365
Gly Ser Glu Tyr Phe Val Lys Ile Gln Ser Phe Leu Gly Gly Ala
1370 1375 1380
Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln Gln Asn Val Ser Met
1385 1390 1395
Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp Ala Val Leu Pro
1400 1405 1410
Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro His Val Glu
1415 1420 1425
Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp Trp Ile
1430 1435 1440
Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly Leu
1445 1450 1455
Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
1460 1465 1470
Gln Glu Ala Leu
1475
<210> 37
<211> 4383
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 37
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggagg ctccggcgga 2700
ctggctggcg agacaggaca ggaagccgct cctctggacg gcgtgctggc caaccctccc 2760
aatatcagca gcctgagccc cagacagctg ctgggattcc cttgtgccga ggtgtccggc 2820
ctgagcacag agagagtgcg ggaactggct gtggccctgg cccagaaaaa cgtgaagctg 2880
agcaccgagc agctgcggtg cctggcccac agactgtctg agcctcccga ggatctggac 2940
gccctgcctc tggatctgct gctgttcctg aaccccgacg ccttcagcgg acctcaggcc 3000
tgcacccggt tcttcagcag aatcaccaag gccaacgtgg acctgctgcc cagaggcgcc 3060
cctgagagac agagactgct gcctgctgct ctggcctgtt ggggagtgcg gggctctctg 3120
ctgtctgaag ctgatgtgcg ggccctggga ggcctggctt gtgatctgcc tggaagattc 3180
gtggccgaga gcgccgaagt gctgctgcct agactggtgt cctgtcccgg ccctctggac 3240
caggatcagc aggaagctgc cagagctgct ctgcagggcg gaggccctcc ttatggacct 3300
cctagcactt ggagcgtgtc caccatggat gccctgaggg gcctgctgcc agtgctgggc 3360
cagcctatca tcagatccat cccacagggc atcgtggccg cctggcggca gagaagctct 3420
agagatccct cttggcggca gcccgagcgg acaatcctgc ggcccaggtt tcggagagag 3480
gtggaaaaga ccgcctgccc ctctggcaag aaggccagag agatcgacga gagcctgatc 3540
ttctacaaga agtgggagct ggaagcctgc gtggacgccg ctctgctggc cacccagatg 3600
gacagagtga acgccatccc cttcacctat gagcagctgg acgtgctgaa gcacaagctg 3660
gatgagctgt acccccaggg ctaccccgag agcgtgatcc agcacctggg ctacctgttt 3720
ctgaagatga gccccgagga catccggaag tggaacgtga ccagcctgga aaccctgaag 3780
gccctgctgg aagtgaacaa gggccacgag atgtcccccc aggtggccac actgatcgac 3840
agattcgtga agggcagagg ccagctggac aaggacaccc tggatacact gaccgccttc 3900
taccccggct atctgtgcag cctgtccccc gaggaactga gcagcgtgcc acctagctct 3960
atctgggctg tgcggcccca ggacctggat acctgcgatc ctagacagct ggatgtgctg 4020
tatcccaagg cccggctggc cttccagaac atgaacggca gcgagtactt cgtgaagatc 4080
cagtccttcc tgggcggagc ccctaccgag gacctgaaag ctctgagcca gcagaacgtg 4140
tccatggatc tggccacctt tatgaagctg cggaccgacg ccgtgctgcc tctgacagtg 4200
gctgaggtgc agaaactgct gggcccccat gtggaagggc tgaaggccga agaacggcac 4260
agacccgtgc gcgactggat cctgaggcag agacaggatg acctggacac actgggcctg 4320
ggactgcagg ggggcatccc taatggctac ctggtgctgg acctgagcat gcaggaagcc 4380
ctg 4383
<210> 38
<211> 1461
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 38
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Gly Ser Gly Gly Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu
900 905 910
Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg
915 920 925
Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu
930 935 940
Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu
945 950 955 960
Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro
965 970 975
Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro
980 985 990
Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile
995 1000 1005
Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg
1010 1015 1020
Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly
1025 1030 1035
Ser Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala
1040 1045 1050
Cys Asp Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu
1055 1060 1065
Leu Pro Arg Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln
1070 1075 1080
Gln Glu Ala Ala Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr
1085 1090 1095
Gly Pro Pro Ser Thr Trp Ser Val Ser Thr Met Asp Ala Leu Arg
1100 1105 1110
Gly Leu Leu Pro Val Leu Gly Gln Pro Ile Ile Arg Ser Ile Pro
1115 1120 1125
Gln Gly Ile Val Ala Ala Trp Arg Gln Arg Ser Ser Arg Asp Pro
1130 1135 1140
Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu Arg Pro Arg Phe Arg
1145 1150 1155
Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly Lys Lys Ala Arg
1160 1165 1170
Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp Glu Leu Glu
1175 1180 1185
Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp Arg Val
1190 1195 1200
Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys His
1205 1210 1215
Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
1220 1225 1230
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile
1235 1240 1245
Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu
1250 1255 1260
Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu
1265 1270 1275
Ile Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr
1280 1285 1290
Leu Asp Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu
1295 1300 1305
Ser Pro Glu Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala
1310 1315 1320
Val Arg Pro Gln Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp
1325 1330 1335
Val Leu Tyr Pro Lys Ala Arg Leu Ala Phe Gln Asn Met Asn Gly
1340 1345 1350
Ser Glu Tyr Phe Val Lys Ile Gln Ser Phe Leu Gly Gly Ala Pro
1355 1360 1365
Thr Glu Asp Leu Lys Ala Leu Ser Gln Gln Asn Val Ser Met Asp
1370 1375 1380
Leu Ala Thr Phe Met Lys Leu Arg Thr Asp Ala Val Leu Pro Leu
1385 1390 1395
Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro His Val Glu Gly
1400 1405 1410
Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp Trp Ile Leu
1415 1420 1425
Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln
1430 1435 1440
Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met Gln
1445 1450 1455
Glu Ala Leu
1460
<210> 39
<211> 4431
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 39
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cgagggcaga ggcagcctgc tgacatgtgg cgacgtggaa 1740
gagaaccctg gccccggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1800
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1860
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1920
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1980
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 2040
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 2100
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2160
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2220
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2280
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2340
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2400
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2460
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2520
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2580
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2640
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2700
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2760
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2820
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2880
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2940
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 3000
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 3060
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3120
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3180
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3240
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3300
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3360
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3420
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3480
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3540
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3600
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3660
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3720
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3780
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3840
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3900
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3960
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 4020
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 4080
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4140
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4200
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4260
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4320
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4380
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga c 4431
<210> 40
<211> 1477
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 40
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
565 570 575
Gly Asp Val Glu Glu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu
580 585 590
Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg
595 600 605
Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala
610 615 620
Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser
625 630 635 640
His Pro Ser Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser
645 650 655
Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu
660 665 670
Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro
675 680 685
Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg
690 695 700
Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr
705 710 715 720
Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro
725 730 735
Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val
740 745 750
Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala
755 760 765
Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu
770 775 780
Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His
785 790 795 800
Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg
805 810 815
Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe
820 825 830
Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu
835 840 845
Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp
850 855 860
Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg
865 870 875 880
Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val
885 890 895
Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr
900 905 910
Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys
915 920 925
Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg
930 935 940
Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala
945 950 955 960
Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg
965 970 975
Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg
980 985 990
Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser
995 1000 1005
Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala
1010 1015 1020
Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe
1025 1030 1035
Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe
1040 1045 1050
Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp
1055 1060 1065
Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr
1070 1075 1080
Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly
1085 1090 1095
His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp
1100 1105 1110
Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr
1115 1120 1125
Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu
1130 1135 1140
Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
1145 1150 1155
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys
1160 1165 1170
Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser
1175 1180 1185
Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg
1190 1195 1200
Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val
1205 1210 1215
Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val
1220 1225 1230
Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr
1235 1240 1245
Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala
1250 1255 1260
Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu
1265 1270 1275
Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser
1280 1285 1290
Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly
1295 1300 1305
Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu
1310 1315 1320
Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser
1325 1330 1335
Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln
1340 1345 1350
Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln
1355 1360 1365
Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp
1370 1375 1380
Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
1385 1390 1395
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu
1400 1405 1410
Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr
1415 1420 1425
Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr
1430 1435 1440
Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr
1445 1450 1455
Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys
1460 1465 1470
Thr Ile Leu Asp
1475
<210> 41
<211> 4383
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 41
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggaggctc cggcggagga gctgccccgg agccggagag gacccccgtt 1740
ggccagggat cgtgggccca tccgggacgc accaggggac catccgacag gggattctgt 1800
gtggtgtcac cggccaggcc agcagaagag gcaaccagcc tcgagggagc gttgtctgga 1860
accagacatt cccacccgtc ggtgggccgg cagcaccacg cgggaccacc gtccacttcc 1920
agaccgccac ggccatggga caccccttgc ccgcctgtgt atgccgagac taaacacttc 1980
ctgtactcat ccggagacaa ggaacagctt cggccgtcct tcctcctgtc gtcgctcaga 2040
ccgagcctga ccggagcacg cagattggtg gaaactatct tccttgggtc acgtccgtgg 2100
atgccaggta ccccacggcg cctcccgcgc ctcccacaga gatactggca gatgcggcct 2160
ctgttcctgg aattgctggg aaaccacgct cagtgcccgt acggagtcct gctcaagact 2220
cactgccctc tgagggcggc ggtcactccg gcggccggag tgtgcgcacg ggagaagccc 2280
cagggaagcg tggcagctcc ggaagaggag gacaccgatc cgcgccgcct cgtgcaactt 2340
ctgcgccagc actcctcgcc ctggcaagtc tacgggttcg tccgcgcctg cctgcgccgc 2400
ctggtgccgc ctgggctctg gggttcccgg cataacgagc gccgcttcct gagaaatact 2460
aagaagttta tctcacttgg aaaacatgcc aagttgtcgc tgcaagaact cacgtggaag 2520
atgtcagtcc gcgattgcgc ctggctgcgc cgctcgccgg gcgtcgggtg tgttccagct 2580
gcagaacacc gcctgagaga agaaattctg gccaaatttc tgcattggct gatgtcagtg 2640
tacgtggtcg agctgctgcg ctcctttttc tacgtcactg agactacctt tcaaaagaac 2700
cgcctgttct tctaccgcaa atctgtgtgg agcaagctgc agtcaatcgg cattcgccag 2760
catctgaaga gggtgcagct gcgggaactt tccgaggcag aagtccgcca gcaccgggag 2820
gcccggccgg cgcttctcac gtcgcgtctg agattcatcc caaagcccga cgggctgagg 2880
cctatcgtca acatggatta cgtcgtgggc gctcgcacct ttcgccgtga aaagcgggcc 2940
gaacgcttga cctcacgggt gaaggccctc ttctccgtgc tgaactacga gagagcaaga 3000
cggcctggcc tgctgggagc ttcggtgctg ggactggacg atatccaccg ggcttggcgg 3060
acctttgttc tccgggtgag agcccaagac cctccgccgg aactgtactt cgtgaaggtg 3120
gcgatcaccg gagcctatga tactattccg caagatcgac tcaccgaagt catcgcctcg 3180
atcatcaaac cgcagaacac ttactgcgtc aggcggtacg ccgtggtcca gaaggccgcg 3240
catggccacg tgagaaaggc gttcaagtcg cacgtgtcca ctctcaccga cctccagcct 3300
tacatgaggc aattcgttgc gcatttgcaa gagacttcgc ccctgagaga tgcggtggtc 3360
atcgagcaga gctccagcct gaacgaagcg agcagcggtc tgtttgacgt gttcctccgc 3420
ttcatgtgtc atcacgcggt gcgaatcagg ggaaaatcat acgtgcagtg ccagggaatc 3480
ccacaaggca gcattctgtc gactctcttg tgttcccttt gctacggcga tatggaaaac 3540
aagctgttcg ctgggatcag acgggacggg ttgctgctca gactggtgga cgacttcctg 3600
ctggtgactc cgcacctcac tcacgccaaa acctttctcc gcactctggt gaggggagtg 3660
ccagaatacg gctgtgtggt caatctccgg aaaactgtgg tgaatttccc tgtcgaggat 3720
gaggcactcg gaggaaccgc atttgtccaa atgccagcac atggcctgtt cccatggtgc 3780
ggtctgctgc tggacacccg aactcttgaa gtgcagtccg actactccag ctatgcccgg 3840
acgagcatcc gcgccagcct cactttcaat cgcggcttta aggccggacg aaacatgcgc 3900
agaaagcttt tcggagtcct ccggcttaaa tgccattcgc tctttctcga tctccaagtc 3960
aattcgctgc agaccgtgtg cacgaacatc tacaagatcc tgctgctcca agcctaccgg 4020
ttccacgctt gcgtgcttca gctgccgttt caccaacagg tgtggaagaa cccgaccttc 4080
tttctgcggg tcattagcga tactgcctcc ctgtgttact caatcctcaa ggcaaagaac 4140
gccggaatgt cgctgggtgc gaaaggagcc gcgggacctc ttcctagcga agcggtgcag 4200
tggctctgcc accaggcttt cctcctgaag ctgaccaggc acagagtgac ctacgtcccg 4260
ctgctgggct cgctgcgcac tgcacagacc cagctgtcta gaaaactccc cggcaccacc 4320
ctgaccgctc tggaagccgc cgccaaccca gcattgccgt cagatttcaa gaccatcttg 4380
gac 4383
<210> 42
<211> 1461
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 42
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Gly Ser Gly Gly Gly Ala Ala Pro Glu Pro Glu
565 570 575
Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg
580 585 590
Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala
595 600 605
Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser
610 615 620
His Pro Ser Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser
625 630 635 640
Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu
645 650 655
Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro
660 665 670
Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg
675 680 685
Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr
690 695 700
Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro
705 710 715 720
Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val
725 730 735
Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala
740 745 750
Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu
755 760 765
Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His
770 775 780
Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg
785 790 795 800
Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe
805 810 815
Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu
820 825 830
Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp
835 840 845
Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg
850 855 860
Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val
865 870 875 880
Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr
885 890 895
Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys
900 905 910
Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg
915 920 925
Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala
930 935 940
Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg
945 950 955 960
Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg
965 970 975
Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser
980 985 990
Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser
995 1000 1005
Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val
1010 1015 1020
Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val
1025 1030 1035
Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg
1040 1045 1050
Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr
1055 1060 1065
Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His
1070 1075 1080
Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu
1085 1090 1095
Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser
1100 1105 1110
Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn
1115 1120 1125
Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys
1130 1135 1140
His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
1145 1150 1155
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu
1160 1165 1170
Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg
1175 1180 1185
Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr
1190 1195 1200
Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg
1205 1210 1215
Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val
1220 1225 1230
Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe
1235 1240 1245
Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu
1250 1255 1260
Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr
1265 1270 1275
Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe
1280 1285 1290
Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg
1295 1300 1305
Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu
1310 1315 1320
Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala
1325 1330 1335
Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln
1340 1345 1350
Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr
1355 1360 1365
Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met
1370 1375 1380
Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
1385 1390 1395
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg
1400 1405 1410
His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala
1415 1420 1425
Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala
1430 1435 1440
Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr
1445 1450 1455
Ile Leu Asp
1460
<210> 43
<211> 6117
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 43
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 1680
acaaaccctg gccccctggc tggcgagaca ggacaggaag ccgctcctct ggacggcgtg 1740
ctggccaacc ctcccaatat cagcagcctg agccccagac agctgctggg attcccttgt 1800
gccgaggtgt ccggcctgag cacagagaga gtgcgggaac tggctgtggc cctggcccag 1860
aaaaacgtga agctgagcac cgagcagctg cggtgcctgg cccacagact gtctgagcct 1920
cccgaggatc tggacgccct gcctctggat ctgctgctgt tcctgaaccc cgacgccttc 1980
agcggacctc aggcctgcac ccggttcttc agcagaatca ccaaggccaa cgtggacctg 2040
ctgcccagag gcgcccctga gagacagaga ctgctgcctg ctgctctggc ctgttgggga 2100
gtgcggggct ctctgctgtc tgaagctgat gtgcgggccc tgggaggcct ggcttgtgat 2160
ctgcctggaa gattcgtggc cgagagcgcc gaagtgctgc tgcctagact ggtgtcctgt 2220
cccggccctc tggaccagga tcagcaggaa gctgccagag ctgctctgca gggcggaggc 2280
cctccttatg gacctcctag cacttggagc gtgtccacca tggatgccct gaggggcctg 2340
ctgccagtgc tgggccagcc tatcatcaga tccatcccac agggcatcgt ggccgcctgg 2400
cggcagagaa gctctagaga tccctcttgg cggcagcccg agcggacaat cctgcggccc 2460
aggtttcgga gagaggtgga aaagaccgcc tgcccctctg gcaagaaggc cagagagatc 2520
gacgagagcc tgatcttcta caagaagtgg gagctggaag cctgcgtgga cgccgctctg 2580
ctggccaccc agatggacag agtgaacgcc atccccttca cctatgagca gctggacgtg 2640
ctgaagcaca agctggatga gctgtacccc cagggctacc ccgagagcgt gatccagcac 2700
ctgggctacc tgtttctgaa gatgagcccc gaggacatcc ggaagtggaa cgtgaccagc 2760
ctggaaaccc tgaaggccct gctggaagtg aacaagggcc acgagatgtc cccccaggtg 2820
gccacactga tcgacagatt cgtgaagggc agaggccagc tggacaagga caccctggat 2880
acactgaccg ccttctaccc cggctatctg tgcagcctgt cccccgagga actgagcagc 2940
gtgccaccta gctctatctg ggctgtgcgg ccccaggacc tggatacctg cgatcctaga 3000
cagctggatg tgctgtatcc caaggcccgg ctggccttcc agaacatgaa cggcagcgag 3060
tacttcgtga agatccagtc cttcctgggc ggagccccta ccgaggacct gaaagctctg 3120
agccagcaga acgtgtccat ggatctggcc acctttatga agctgcggac cgacgccgtg 3180
ctgcctctga cagtggctga ggtgcagaaa ctgctgggcc cccatgtgga agggctgaag 3240
gccgaagaac ggcacagacc cgtgcgcgac tggatcctga ggcagagaca ggatgacctg 3300
gacacactgg gcctgggact gcaggggggc atccctaatg gctacctggt gctggacctg 3360
agcatgcagg aagccctggg atccggcgag ggcagaggca gcctgctgac atgtggcgac 3420
gtggaagaga accctggccc cggagctgcc ccggagccgg agaggacccc cgttggccag 3480
ggatcgtggg cccatccggg acgcaccagg ggaccatccg acaggggatt ctgtgtggtg 3540
tcaccggcca ggccagcaga agaggcaacc agcctcgagg gagcgttgtc tggaaccaga 3600
cattcccacc cgtcggtggg ccggcagcac cacgcgggac caccgtccac ttccagaccg 3660
ccacggccat gggacacccc ttgcccgcct gtgtatgccg agactaaaca cttcctgtac 3720
tcatccggag acaaggaaca gcttcggccg tccttcctcc tgtcgtcgct cagaccgagc 3780
ctgaccggag cacgcagatt ggtggaaact atcttccttg ggtcacgtcc gtggatgcca 3840
ggtaccccac ggcgcctccc gcgcctccca cagagatact ggcagatgcg gcctctgttc 3900
ctggaattgc tgggaaacca cgctcagtgc ccgtacggag tcctgctcaa gactcactgc 3960
cctctgaggg cggcggtcac tccggcggcc ggagtgtgcg cacgggagaa gccccaggga 4020
agcgtggcag ctccggaaga ggaggacacc gatccgcgcc gcctcgtgca acttctgcgc 4080
cagcactcct cgccctggca agtctacggg ttcgtccgcg cctgcctgcg ccgcctggtg 4140
ccgcctgggc tctggggttc ccggcataac gagcgccgct tcctgagaaa tactaagaag 4200
tttatctcac ttggaaaaca tgccaagttg tcgctgcaag aactcacgtg gaagatgtca 4260
gtccgcgatt gcgcctggct gcgccgctcg ccgggcgtcg ggtgtgttcc agctgcagaa 4320
caccgcctga gagaagaaat tctggccaaa tttctgcatt ggctgatgtc agtgtacgtg 4380
gtcgagctgc tgcgctcctt tttctacgtc actgagacta cctttcaaaa gaaccgcctg 4440
ttcttctacc gcaaatctgt gtggagcaag ctgcagtcaa tcggcattcg ccagcatctg 4500
aagagggtgc agctgcggga actttccgag gcagaagtcc gccagcaccg ggaggcccgg 4560
ccggcgcttc tcacgtcgcg tctgagattc atcccaaagc ccgacgggct gaggcctatc 4620
gtcaacatgg attacgtcgt gggcgctcgc acctttcgcc gtgaaaagcg ggccgaacgc 4680
ttgacctcac gggtgaaggc cctcttctcc gtgctgaact acgagagagc aagacggcct 4740
ggcctgctgg gagcttcggt gctgggactg gacgatatcc accgggcttg gcggaccttt 4800
gttctccggg tgagagccca agaccctccg ccggaactgt acttcgtgaa ggtggcgatc 4860
accggagcct atgatactat tccgcaagat cgactcaccg aagtcatcgc ctcgatcatc 4920
aaaccgcaga acacttactg cgtcaggcgg tacgccgtgg tccagaaggc cgcgcatggc 4980
cacgtgagaa aggcgttcaa gtcgcacgtg tccactctca ccgacctcca gccttacatg 5040
aggcaattcg ttgcgcattt gcaagagact tcgcccctga gagatgcggt ggtcatcgag 5100
cagagctcca gcctgaacga agcgagcagc ggtctgtttg acgtgttcct ccgcttcatg 5160
tgtcatcacg cggtgcgaat caggggaaaa tcatacgtgc agtgccaggg aatcccacaa 5220
ggcagcattc tgtcgactct cttgtgttcc ctttgctacg gcgatatgga aaacaagctg 5280
ttcgctggga tcagacggga cgggttgctg ctcagactgg tggacgactt cctgctggtg 5340
actccgcacc tcactcacgc caaaaccttt ctccgcactc tggtgagggg agtgccagaa 5400
tacggctgtg tggtcaatct ccggaaaact gtggtgaatt tccctgtcga ggatgaggca 5460
ctcggaggaa ccgcatttgt ccaaatgcca gcacatggcc tgttcccatg gtgcggtctg 5520
ctgctggaca cccgaactct tgaagtgcag tccgactact ccagctatgc ccggacgagc 5580
atccgcgcca gcctcacttt caatcgcggc tttaaggccg gacgaaacat gcgcagaaag 5640
cttttcggag tcctccggct taaatgccat tcgctctttc tcgatctcca agtcaattcg 5700
ctgcagaccg tgtgcacgaa catctacaag atcctgctgc tccaagccta ccggttccac 5760
gcttgcgtgc ttcagctgcc gtttcaccaa caggtgtgga agaacccgac cttctttctg 5820
cgggtcatta gcgatactgc ctccctgtgt tactcaatcc tcaaggcaaa gaacgccgga 5880
atgtcgctgg gtgcgaaagg agccgcggga cctcttccta gcgaagcggt gcagtggctc 5940
tgccaccagg ctttcctcct gaagctgacc aggcacagag tgacctacgt cccgctgctg 6000
ggctcgctgc gcactgcaca gacccagctg tctagaaaac tccccggcac caccctgacc 6060
gctctggaag ccgccgccaa cccagcattg ccgtcagatt tcaagaccat cttggac 6117
<210> 44
<211> 2039
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 44
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Arg Ile Phe Asn
530 535 540
Ala His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu
545 550 555 560
Thr Asn Pro Gly Pro Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro
565 570 575
Leu Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro
580 585 590
Arg Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr
595 600 605
Glu Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys
610 615 620
Leu Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro
625 630 635 640
Pro Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn
645 650 655
Pro Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg
660 665 670
Ile Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg
675 680 685
Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser
690 695 700
Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp
705 710 715 720
Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg
725 730 735
Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala
740 745 750
Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr
755 760 765
Trp Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu
770 775 780
Gly Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp
785 790 795 800
Arg Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr
805 810 815
Ile Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro
820 825 830
Ser Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys
835 840 845
Lys Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln
850 855 860
Met Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val
865 870 875 880
Leu Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser
885 890 895
Val Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp
900 905 910
Ile Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu
915 920 925
Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile
930 935 940
Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp
945 950 955 960
Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu
965 970 975
Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln
980 985 990
Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys
995 1000 1005
Ala Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val
1010 1015 1020
Lys Ile Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys
1025 1030 1035
Ala Leu Ser Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met
1040 1045 1050
Lys Leu Arg Thr Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val
1055 1060 1065
Gln Lys Leu Leu Gly Pro His Val Glu Gly Leu Lys Ala Glu Glu
1070 1075 1080
Arg His Arg Pro Val Arg Asp Trp Ile Leu Arg Gln Arg Gln Asp
1085 1090 1095
Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln Gly Gly Ile Pro Asn
1100 1105 1110
Gly Tyr Leu Val Leu Asp Leu Ser Met Gln Glu Ala Leu Gly Ser
1115 1120 1125
Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
1130 1135 1140
Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val
1145 1150 1155
Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser
1160 1165 1170
Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu
1175 1180 1185
Ala Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His
1190 1195 1200
Pro Ser Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser
1205 1210 1215
Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala
1220 1225 1230
Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu
1235 1240 1245
Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly
1250 1255 1260
Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp
1265 1270 1275
Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr
1280 1285 1290
Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala
1295 1300 1305
Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg
1310 1315 1320
Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro
1325 1330 1335
Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg
1340 1345 1350
Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
1355 1360 1365
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly
1370 1375 1380
Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr
1385 1390 1395
Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln
1400 1405 1410
Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg
1415 1420 1425
Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu
1430 1435 1440
Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val
1445 1450 1455
Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr
1460 1465 1470
Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp
1475 1480 1485
Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val
1490 1495 1500
Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu
1505 1510 1515
Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys
1520 1525 1530
Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly
1535 1540 1545
Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser
1550 1555 1560
Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg
1565 1570 1575
Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile
1580 1585 1590
His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
1595 1600 1605
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala
1610 1615 1620
Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser
1625 1630 1635
Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val
1640 1645 1650
Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser
1655 1660 1665
His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe
1670 1675 1680
Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val
1685 1690 1695
Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe
1700 1705 1710
Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile Arg
1715 1720 1725
Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile
1730 1735 1740
Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn
1745 1750 1755
Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu
1760 1765 1770
Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys
1775 1780 1785
Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys
1790 1795 1800
Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp
1805 1810 1815
Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly
1820 1825 1830
Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
1835 1840 1845
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala
1850 1855 1860
Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg
1865 1870 1875
Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe
1880 1885 1890
Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile
1895 1900 1905
Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val
1910 1915 1920
Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe
1925 1930 1935
Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile
1940 1945 1950
Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala
1955 1960 1965
Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys His Gln
1970 1975 1980
Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro
1985 1990 1995
Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys
2000 2005 2010
Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro
2015 2020 2025
Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
2030 2035
<210> 45
<211> 6117
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 45
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
acaatcctgt ctgagggcgc caccaacttc agcctgctga aactggccgg cgacgtggaa 1680
ctgaaccctg gccctggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1740
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1800
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1860
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1920
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 1980
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 2040
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2100
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2160
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2220
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2280
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2340
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2400
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2460
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2520
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2580
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2640
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2700
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2760
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2820
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2880
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 2940
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 3000
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3060
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3120
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3180
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3240
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3300
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3360
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3420
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3480
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3540
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3600
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3660
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3720
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3780
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3840
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3900
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 3960
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 4020
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4080
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4140
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4200
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4260
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4320
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga cggatccggc 4380
gagggcagag gcagcctgct gacatgtggc gacgtggaag agaaccctgg ccccctggct 4440
ggcgagacag gacaggaagc cgctcctctg gacggcgtgc tggccaaccc tcccaatatc 4500
agcagcctga gccccagaca gctgctggga ttcccttgtg ccgaggtgtc cggcctgagc 4560
acagagagag tgcgggaact ggctgtggcc ctggcccaga aaaacgtgaa gctgagcacc 4620
gagcagctgc ggtgcctggc ccacagactg tctgagcctc ccgaggatct ggacgccctg 4680
cctctggatc tgctgctgtt cctgaacccc gacgccttca gcggacctca ggcctgcacc 4740
cggttcttca gcagaatcac caaggccaac gtggacctgc tgcccagagg cgcccctgag 4800
agacagagac tgctgcctgc tgctctggcc tgttggggag tgcggggctc tctgctgtct 4860
gaagctgatg tgcgggccct gggaggcctg gcttgtgatc tgcctggaag attcgtggcc 4920
gagagcgccg aagtgctgct gcctagactg gtgtcctgtc ccggccctct ggaccaggat 4980
cagcaggaag ctgccagagc tgctctgcag ggcggaggcc ctccttatgg acctcctagc 5040
acttggagcg tgtccaccat ggatgccctg aggggcctgc tgccagtgct gggccagcct 5100
atcatcagat ccatcccaca gggcatcgtg gccgcctggc ggcagagaag ctctagagat 5160
ccctcttggc ggcagcccga gcggacaatc ctgcggccca ggtttcggag agaggtggaa 5220
aagaccgcct gcccctctgg caagaaggcc agagagatcg acgagagcct gatcttctac 5280
aagaagtggg agctggaagc ctgcgtggac gccgctctgc tggccaccca gatggacaga 5340
gtgaacgcca tccccttcac ctatgagcag ctggacgtgc tgaagcacaa gctggatgag 5400
ctgtaccccc agggctaccc cgagagcgtg atccagcacc tgggctacct gtttctgaag 5460
atgagccccg aggacatccg gaagtggaac gtgaccagcc tggaaaccct gaaggccctg 5520
ctggaagtga acaagggcca cgagatgtcc ccccaggtgg ccacactgat cgacagattc 5580
gtgaagggca gaggccagct ggacaaggac accctggata cactgaccgc cttctacccc 5640
ggctatctgt gcagcctgtc ccccgaggaa ctgagcagcg tgccacctag ctctatctgg 5700
gctgtgcggc cccaggacct ggatacctgc gatcctagac agctggatgt gctgtatccc 5760
aaggcccggc tggccttcca gaacatgaac ggcagcgagt acttcgtgaa gatccagtcc 5820
ttcctgggcg gagcccctac cgaggacctg aaagctctga gccagcagaa cgtgtccatg 5880
gatctggcca cctttatgaa gctgcggacc gacgccgtgc tgcctctgac agtggctgag 5940
gtgcagaaac tgctgggccc ccatgtggaa gggctgaagg ccgaagaacg gcacagaccc 6000
gtgcgcgact ggatcctgag gcagagacag gatgacctgg acacactggg cctgggactg 6060
caggggggca tccctaatgg ctacctggtg ctggacctga gcatgcagga agccctg 6117
<210> 46
<211> 2039
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 46
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Thr Ile Leu Ser
530 535 540
Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu
545 550 555 560
Leu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val
565 570 575
Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp
580 585 590
Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr
595 600 605
Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val
610 615 620
Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg
625 630 635 640
Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe
645 650 655
Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu
660 665 670
Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr
675 680 685
Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu
690 695 700
Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu
705 710 715 720
Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr
725 730 735
His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala
740 745 750
Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr
755 760 765
Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp
770 775 780
Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro
785 790 795 800
Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr
805 810 815
Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu
820 825 830
Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser
835 840 845
Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu
850 855 860
Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu
865 870 875 880
Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn
885 890 895
Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile
900 905 910
Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu
915 920 925
Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser
930 935 940
Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn
945 950 955 960
Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala
965 970 975
Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr
980 985 990
Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu
995 1000 1005
Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg
1010 1015 1020
Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile
1025 1030 1035
Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val
1040 1045 1050
Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg
1055 1060 1065
Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala
1070 1075 1080
Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met
1085 1090 1095
Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp
1100 1105 1110
Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser
1115 1120 1125
Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val
1130 1135 1140
Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln
1145 1150 1155
Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp
1160 1165 1170
Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu
1175 1180 1185
Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr
1190 1195 1200
His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
1205 1210 1215
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro
1220 1225 1230
Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro
1235 1240 1245
Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg
1250 1255 1260
Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser
1265 1270 1275
Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg
1280 1285 1290
Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His
1295 1300 1305
Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys
1310 1315 1320
Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His
1325 1330 1335
Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn
1340 1345 1350
Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys
1355 1360 1365
Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala
1370 1375 1380
Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu
1385 1390 1395
Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr
1400 1405 1410
Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu
1415 1420 1425
Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala
1430 1435 1440
Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
1445 1450 1455
Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu
1460 1465 1470
Glu Asn Pro Gly Pro Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala
1475 1480 1485
Pro Leu Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu
1490 1495 1500
Ser Pro Arg Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly
1505 1510 1515
Leu Ser Thr Glu Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln
1520 1525 1530
Lys Asn Val Lys Leu Ser Thr Glu Gln Leu Arg Cys Leu Ala His
1535 1540 1545
Arg Leu Ser Glu Pro Pro Glu Asp Leu Asp Ala Leu Pro Leu Asp
1550 1555 1560
Leu Leu Leu Phe Leu Asn Pro Asp Ala Phe Ser Gly Pro Gln Ala
1565 1570 1575
Cys Thr Arg Phe Phe Ser Arg Ile Thr Lys Ala Asn Val Asp Leu
1580 1585 1590
Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg Leu Leu Pro Ala Ala
1595 1600 1605
Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu Ser Glu Ala Asp
1610 1615 1620
Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro Gly Arg Phe
1625 1630 1635
Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val Ser Cys
1640 1645 1650
Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala Ala
1655 1660 1665
Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
1670 1675 1680
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly
1685 1690 1695
Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp
1700 1705 1710
Arg Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg
1715 1720 1725
Thr Ile Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala
1730 1735 1740
Cys Pro Ser Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile
1745 1750 1755
Phe Tyr Lys Lys Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu
1760 1765 1770
Leu Ala Thr Gln Met Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr
1775 1780 1785
Glu Gln Leu Asp Val Leu Lys His Lys Leu Asp Glu Leu Tyr Pro
1790 1795 1800
Gln Gly Tyr Pro Glu Ser Val Ile Gln His Leu Gly Tyr Leu Phe
1805 1810 1815
Leu Lys Met Ser Pro Glu Asp Ile Arg Lys Trp Asn Val Thr Ser
1820 1825 1830
Leu Glu Thr Leu Lys Ala Leu Leu Glu Val Asn Lys Gly His Glu
1835 1840 1845
Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg Phe Val Lys Gly
1850 1855 1860
Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu Thr Ala Phe
1865 1870 1875
Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu Ser Ser
1880 1885 1890
Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu Asp
1895 1900 1905
Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
1910 1915 1920
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile
1925 1930 1935
Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu
1940 1945 1950
Ser Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu
1955 1960 1965
Arg Thr Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys
1970 1975 1980
Leu Leu Gly Pro His Val Glu Gly Leu Lys Ala Glu Glu Arg His
1985 1990 1995
Arg Pro Val Arg Asp Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu
2000 2005 2010
Asp Thr Leu Gly Leu Gly Leu Gln Gly Gly Ile Pro Asn Gly Tyr
2015 2020 2025
Leu Val Leu Asp Leu Ser Met Gln Glu Ala Leu
2030 2035
<210> 47
<211> 6138
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 47
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cagaatcttc aacgcccact acgccggcta cttcgccgac 1740
ctgctgatcc acgacatcga gacaaaccct ggccccaccc ctggaaccca gagccccttc 1800
ttccttctgc tgctgctgac cgtgctgact gtcgtgacag gctctggcca cgccagctct 1860
acacctggcg gcgagaaaga gacaagcgcc acccagagaa gcagcgtgcc aagcagcacc 1920
gagaagaacg ccgtgtccat gaccagctcc gtgctgagca gccactctcc tggcagcggc 1980
agcagcacaa cacagggcca ggatgtgaca ctggcccctg ccacagaacc tgcctctgga 2040
tctgccgcca cctggggaca ggacgtgaca agcgtgccag tgaccagacc tgccctgggc 2100
tctacaacac cccctgccca cgatgtgacc agcgcccctg ataacaagcc tgcccctgga 2160
agcacagccc ctccagctca tggcgtgacc tctgccccag ataccagacc agccccagga 2220
tctacagccc cacccgcaca cggcgtgaca agtgcccctg acacaagacc cgctccaggc 2280
tctactgctc ctcctgccca tggcgtgaca agcgctcccg atacaaggcc agctcctggc 2340
tccacagcac caccagcaca tggcgtgaca tcagctcccg acactagacc tgctcccgga 2400
tcaaccgctc caccagctca cggcgtgacc agcgcacctg ataccagacc tgctctggga 2460
agcaccgccc ctcccgtgca caatgtgaca tctgcttccg gcagcgccag cggctctgcc 2520
tctacactgg tgcacaacgg caccagcgcc agagccacaa caaccccagc cagcaagagc 2580
acccccttca gcatccctag ccaccacagc gacaccccta ccacactggc cagccactcc 2640
accaagaccg atgcctctag cacccaccac tccagcgtgc cccctctgac cagcagcaac 2700
cacagcacaa gcccccagct gtctaccggc gtctcattct tctttctgtc cttccacatc 2760
agcaacctgc agttcaacag cagcctggaa gatcccagca ccgactacta ccaggaactg 2820
cagcgggata tcagcgagat gttcctgcaa atctacaagc agggcggctt cctgggcctg 2880
agcaacatca agttcagacc cggcagcgtg gtggtgcagc tgaccctggc tttccgggaa 2940
ggcaccatca acgtgcacga cgtggaaacc cagttcaacc agtacaagac cgaggccgcc 3000
agccggtaca acctgaccat ctccgatgtg tccgtgtccg acgtgccctt cccattctct 3060
gcccagtctg gcgcaggcgt gccaggatgg ggaattgctc tgctggtgct cgtgtgcgtg 3120
ctggtggccc tggccatcgt gtatctgatt gccctggccg tgtgccagtg ccggcggaag 3180
aattacggcc agctggacat cttccccgcc agagacacct accaccccat gagcgagtac 3240
cccacatacc acacccacgg cagatacgtg ccacccagct ccaccgacag atccccctac 3300
gagaaagtgt ctgccggcaa cggcggcagc tccctgagct acacaaatcc tgccgtggcc 3360
gctgcctccg ccaacctggg atccggcaca atcctgtctg agggcgccac caacttcagc 3420
ctgctgaaac tggccggcga cgtggaactg aaccctggcc ctggagctgc cccggagccg 3480
gagaggaccc ccgttggcca gggatcgtgg gcccatccgg gacgcaccag gggaccatcc 3540
gacaggggat tctgtgtggt gtcaccggcc aggccagcag aagaggcaac cagcctcgag 3600
ggagcgttgt ctggaaccag acattcccac ccgtcggtgg gccggcagca ccacgcggga 3660
ccaccgtcca cttccagacc gccacggcca tgggacaccc cttgcccgcc tgtgtatgcc 3720
gagactaaac acttcctgta ctcatccgga gacaaggaac agcttcggcc gtccttcctc 3780
ctgtcgtcgc tcagaccgag cctgaccgga gcacgcagat tggtggaaac tatcttcctt 3840
gggtcacgtc cgtggatgcc aggtacccca cggcgcctcc cgcgcctccc acagagatac 3900
tggcagatgc ggcctctgtt cctggaattg ctgggaaacc acgctcagtg cccgtacgga 3960
gtcctgctca agactcactg ccctctgagg gcggcggtca ctccggcggc cggagtgtgc 4020
gcacgggaga agccccaggg aagcgtggca gctccggaag aggaggacac cgatccgcgc 4080
cgcctcgtgc aacttctgcg ccagcactcc tcgccctggc aagtctacgg gttcgtccgc 4140
gcctgcctgc gccgcctggt gccgcctggg ctctggggtt cccggcataa cgagcgccgc 4200
ttcctgagaa atactaagaa gtttatctca cttggaaaac atgccaagtt gtcgctgcaa 4260
gaactcacgt ggaagatgtc agtccgcgat tgcgcctggc tgcgccgctc gccgggcgtc 4320
gggtgtgttc cagctgcaga acaccgcctg agagaagaaa ttctggccaa atttctgcat 4380
tggctgatgt cagtgtacgt ggtcgagctg ctgcgctcct ttttctacgt cactgagact 4440
acctttcaaa agaaccgcct gttcttctac cgcaaatctg tgtggagcaa gctgcagtca 4500
atcggcattc gccagcatct gaagagggtg cagctgcggg aactttccga ggcagaagtc 4560
cgccagcacc gggaggcccg gccggcgctt ctcacgtcgc gtctgagatt catcccaaag 4620
cccgacgggc tgaggcctat cgtcaacatg gattacgtcg tgggcgctcg cacctttcgc 4680
cgtgaaaagc gggccgaacg cttgacctca cgggtgaagg ccctcttctc cgtgctgaac 4740
tacgagagag caagacggcc tggcctgctg ggagcttcgg tgctgggact ggacgatatc 4800
caccgggctt ggcggacctt tgttctccgg gtgagagccc aagaccctcc gccggaactg 4860
tacttcgtga aggtggcgat caccggagcc tatgatacta ttccgcaaga tcgactcacc 4920
gaagtcatcg cctcgatcat caaaccgcag aacacttact gcgtcaggcg gtacgccgtg 4980
gtccagaagg ccgcgcatgg ccacgtgaga aaggcgttca agtcgcacgt gtccactctc 5040
accgacctcc agccttacat gaggcaattc gttgcgcatt tgcaagagac ttcgcccctg 5100
agagatgcgg tggtcatcga gcagagctcc agcctgaacg aagcgagcag cggtctgttt 5160
gacgtgttcc tccgcttcat gtgtcatcac gcggtgcgaa tcaggggaaa atcatacgtg 5220
cagtgccagg gaatcccaca aggcagcatt ctgtcgactc tcttgtgttc cctttgctac 5280
ggcgatatgg aaaacaagct gttcgctggg atcagacggg acgggttgct gctcagactg 5340
gtggacgact tcctgctggt gactccgcac ctcactcacg ccaaaacctt tctccgcact 5400
ctggtgaggg gagtgccaga atacggctgt gtggtcaatc tccggaaaac tgtggtgaat 5460
ttccctgtcg aggatgaggc actcggagga accgcatttg tccaaatgcc agcacatggc 5520
ctgttcccat ggtgcggtct gctgctggac acccgaactc ttgaagtgca gtccgactac 5580
tccagctatg cccggacgag catccgcgcc agcctcactt tcaatcgcgg ctttaaggcc 5640
ggacgaaaca tgcgcagaaa gcttttcgga gtcctccggc ttaaatgcca ttcgctcttt 5700
ctcgatctcc aagtcaattc gctgcagacc gtgtgcacga acatctacaa gatcctgctg 5760
ctccaagcct accggttcca cgcttgcgtg cttcagctgc cgtttcacca acaggtgtgg 5820
aagaacccga ccttctttct gcgggtcatt agcgatactg cctccctgtg ttactcaatc 5880
ctcaaggcaa agaacgccgg aatgtcgctg ggtgcgaaag gagccgcggg acctcttcct 5940
agcgaagcgg tgcagtggct ctgccaccag gctttcctcc tgaagctgac caggcacaga 6000
gtgacctacg tcccgctgct gggctcgctg cgcactgcac agacccagct gtctagaaaa 6060
ctccccggca ccaccctgac cgctctggaa gccgccgcca acccagcatt gccgtcagat 6120
ttcaagacca tcttggac 6138
<210> 48
<211> 2046
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 48
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Arg Ile Phe Asn Ala His Tyr Ala Gly
565 570 575
Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu Thr Asn Pro Gly Pro
580 585 590
Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val
595 600 605
Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly
610 615 620
Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr
625 630 635 640
Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser
645 650 655
Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala
660 665 670
Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp
675 680 685
Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro
690 695 700
Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly
705 710 715 720
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
725 730 735
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
740 745 750
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly
755 760 765
Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro
770 775 780
Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
785 790 795 800
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
805 810 815
Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala
820 825 830
Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr
835 840 845
Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser
850 855 860
Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser
865 870 875 880
Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu
885 890 895
Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser
900 905 910
Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser
915 920 925
Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile
930 935 940
Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu
945 950 955 960
Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu
965 970 975
Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
980 985 990
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser
995 1000 1005
Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser
1010 1015 1020
Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val
1025 1030 1035
Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
1040 1045 1050
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1055 1060 1065
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1070 1075 1080
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1085 1090 1095
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1100 1105 1110
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser
1115 1120 1125
Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
1130 1135 1140
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Gly Ala Ala Pro
1145 1150 1155
Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro
1160 1165 1170
Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser
1175 1180 1185
Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu
1190 1195 1200
Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His His
1205 1210 1215
Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr
1220 1225 1230
Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser
1235 1240 1245
Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
1250 1255 1260
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile
1265 1270 1275
Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu
1280 1285 1290
Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu
1295 1300 1305
Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu
1310 1315 1320
Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly
1325 1330 1335
Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu
1340 1345 1350
Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln
1355 1360 1365
His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu
1370 1375 1380
Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu
1385 1390 1395
Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys
1400 1405 1410
His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val
1415 1420 1425
Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val
1430 1435 1440
Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe
1445 1450 1455
Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser
1460 1465 1470
Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe
1475 1480 1485
Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
1490 1495 1500
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala
1505 1510 1515
Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser
1520 1525 1530
Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val
1535 1540 1545
Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys
1550 1555 1560
Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val
1565 1570 1575
Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser
1580 1585 1590
Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val
1595 1600 1605
Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val
1610 1615 1620
Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg
1625 1630 1635
Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr
1640 1645 1650
Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His
1655 1660 1665
Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu
1670 1675 1680
Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser
1685 1690 1695
Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn
1700 1705 1710
Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys
1715 1720 1725
His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
1730 1735 1740
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu
1745 1750 1755
Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg
1760 1765 1770
Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr
1775 1780 1785
Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg
1790 1795 1800
Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val
1805 1810 1815
Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe
1820 1825 1830
Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu
1835 1840 1845
Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr
1850 1855 1860
Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe
1865 1870 1875
Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg
1880 1885 1890
Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu
1895 1900 1905
Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala
1910 1915 1920
Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln
1925 1930 1935
Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr
1940 1945 1950
Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met
1955 1960 1965
Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
1970 1975 1980
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg
1985 1990 1995
His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala
2000 2005 2010
Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala
2015 2020 2025
Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr
2030 2035 2040
Ile Leu Asp
2045
<210> 49
<211> 6117
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 49
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cgagggcaga ggcagcctgc tgacatgtgg cgacgtggaa 1740
gagaaccctg gccccggagc tgccccggag ccggagagga cccccgttgg ccagggatcg 1800
tgggcccatc cgggacgcac caggggacca tccgacaggg gattctgtgt ggtgtcaccg 1860
gccaggccag cagaagaggc aaccagcctc gagggagcgt tgtctggaac cagacattcc 1920
cacccgtcgg tgggccggca gcaccacgcg ggaccaccgt ccacttccag accgccacgg 1980
ccatgggaca ccccttgccc gcctgtgtat gccgagacta aacacttcct gtactcatcc 2040
ggagacaagg aacagcttcg gccgtccttc ctcctgtcgt cgctcagacc gagcctgacc 2100
ggagcacgca gattggtgga aactatcttc cttgggtcac gtccgtggat gccaggtacc 2160
ccacggcgcc tcccgcgcct cccacagaga tactggcaga tgcggcctct gttcctggaa 2220
ttgctgggaa accacgctca gtgcccgtac ggagtcctgc tcaagactca ctgccctctg 2280
agggcggcgg tcactccggc ggccggagtg tgcgcacggg agaagcccca gggaagcgtg 2340
gcagctccgg aagaggagga caccgatccg cgccgcctcg tgcaacttct gcgccagcac 2400
tcctcgccct ggcaagtcta cgggttcgtc cgcgcctgcc tgcgccgcct ggtgccgcct 2460
gggctctggg gttcccggca taacgagcgc cgcttcctga gaaatactaa gaagtttatc 2520
tcacttggaa aacatgccaa gttgtcgctg caagaactca cgtggaagat gtcagtccgc 2580
gattgcgcct ggctgcgccg ctcgccgggc gtcgggtgtg ttccagctgc agaacaccgc 2640
ctgagagaag aaattctggc caaatttctg cattggctga tgtcagtgta cgtggtcgag 2700
ctgctgcgct cctttttcta cgtcactgag actacctttc aaaagaaccg cctgttcttc 2760
taccgcaaat ctgtgtggag caagctgcag tcaatcggca ttcgccagca tctgaagagg 2820
gtgcagctgc gggaactttc cgaggcagaa gtccgccagc accgggaggc ccggccggcg 2880
cttctcacgt cgcgtctgag attcatccca aagcccgacg ggctgaggcc tatcgtcaac 2940
atggattacg tcgtgggcgc tcgcaccttt cgccgtgaaa agcgggccga acgcttgacc 3000
tcacgggtga aggccctctt ctccgtgctg aactacgaga gagcaagacg gcctggcctg 3060
ctgggagctt cggtgctggg actggacgat atccaccggg cttggcggac ctttgttctc 3120
cgggtgagag cccaagaccc tccgccggaa ctgtacttcg tgaaggtggc gatcaccgga 3180
gcctatgata ctattccgca agatcgactc accgaagtca tcgcctcgat catcaaaccg 3240
cagaacactt actgcgtcag gcggtacgcc gtggtccaga aggccgcgca tggccacgtg 3300
agaaaggcgt tcaagtcgca cgtgtccact ctcaccgacc tccagcctta catgaggcaa 3360
ttcgttgcgc atttgcaaga gacttcgccc ctgagagatg cggtggtcat cgagcagagc 3420
tccagcctga acgaagcgag cagcggtctg tttgacgtgt tcctccgctt catgtgtcat 3480
cacgcggtgc gaatcagggg aaaatcatac gtgcagtgcc agggaatccc acaaggcagc 3540
attctgtcga ctctcttgtg ttccctttgc tacggcgata tggaaaacaa gctgttcgct 3600
gggatcagac gggacgggtt gctgctcaga ctggtggacg acttcctgct ggtgactccg 3660
cacctcactc acgccaaaac ctttctccgc actctggtga ggggagtgcc agaatacggc 3720
tgtgtggtca atctccggaa aactgtggtg aatttccctg tcgaggatga ggcactcgga 3780
ggaaccgcat ttgtccaaat gccagcacat ggcctgttcc catggtgcgg tctgctgctg 3840
gacacccgaa ctcttgaagt gcagtccgac tactccagct atgcccggac gagcatccgc 3900
gccagcctca ctttcaatcg cggctttaag gccggacgaa acatgcgcag aaagcttttc 3960
ggagtcctcc ggcttaaatg ccattcgctc tttctcgatc tccaagtcaa ttcgctgcag 4020
accgtgtgca cgaacatcta caagatcctg ctgctccaag cctaccggtt ccacgcttgc 4080
gtgcttcagc tgccgtttca ccaacaggtg tggaagaacc cgaccttctt tctgcgggtc 4140
attagcgata ctgcctccct gtgttactca atcctcaagg caaagaacgc cggaatgtcg 4200
ctgggtgcga aaggagccgc gggacctctt cctagcgaag cggtgcagtg gctctgccac 4260
caggctttcc tcctgaagct gaccaggcac agagtgacct acgtcccgct gctgggctcg 4320
ctgcgcactg cacagaccca gctgtctaga aaactccccg gcaccaccct gaccgctctg 4380
gaagccgccg ccaacccagc attgccgtca gatttcaaga ccatcttgga cggatccggc 4440
acaatcctgt ctgagggcgc caccaacttc agcctgctga aactggccgg cgacgtggaa 4500
ctgaaccctg gccctacccc tggaacccag agccccttct tccttctgct gctgctgacc 4560
gtgctgactg tcgtgacagg ctctggccac gccagctcta cacctggcgg cgagaaagag 4620
acaagcgcca cccagagaag cagcgtgcca agcagcaccg agaagaacgc cgtgtccatg 4680
accagctccg tgctgagcag ccactctcct ggcagcggca gcagcacaac acagggccag 4740
gatgtgacac tggcccctgc cacagaacct gcctctggat ctgccgccac ctggggacag 4800
gacgtgacaa gcgtgccagt gaccagacct gccctgggct ctacaacacc ccctgcccac 4860
gatgtgacca gcgcccctga taacaagcct gcccctggaa gcacagcccc tccagctcat 4920
ggcgtgacct ctgccccaga taccagacca gccccaggat ctacagcccc acccgcacac 4980
ggcgtgacaa gtgcccctga cacaagaccc gctccaggct ctactgctcc tcctgcccat 5040
ggcgtgacaa gcgctcccga tacaaggcca gctcctggct ccacagcacc accagcacat 5100
ggcgtgacat cagctcccga cactagacct gctcccggat caaccgctcc accagctcac 5160
ggcgtgacca gcgcacctga taccagacct gctctgggaa gcaccgcccc tcccgtgcac 5220
aatgtgacat ctgcttccgg cagcgccagc ggctctgcct ctacactggt gcacaacggc 5280
accagcgcca gagccacaac aaccccagcc agcaagagca cccccttcag catccctagc 5340
caccacagcg acacccctac cacactggcc agccactcca ccaagaccga tgcctctagc 5400
acccaccact ccagcgtgcc ccctctgacc agcagcaacc acagcacaag cccccagctg 5460
tctaccggcg tctcattctt ctttctgtcc ttccacatca gcaacctgca gttcaacagc 5520
agcctggaag atcccagcac cgactactac caggaactgc agcgggatat cagcgagatg 5580
ttcctgcaaa tctacaagca gggcggcttc ctgggcctga gcaacatcaa gttcagaccc 5640
ggcagcgtgg tggtgcagct gaccctggct ttccgggaag gcaccatcaa cgtgcacgac 5700
gtggaaaccc agttcaacca gtacaagacc gaggccgcca gccggtacaa cctgaccatc 5760
tccgatgtgt ccgtgtccga cgtgcccttc ccattctctg cccagtctgg cgcaggcgtg 5820
ccaggatggg gaattgctct gctggtgctc gtgtgcgtgc tggtggccct ggccatcgtg 5880
tatctgattg ccctggccgt gtgccagtgc cggcggaaga attacggcca gctggacatc 5940
ttccccgcca gagacaccta ccaccccatg agcgagtacc ccacatacca cacccacggc 6000
agatacgtgc cacccagctc caccgacaga tccccctacg agaaagtgtc tgccggcaac 6060
ggcggcagct ccctgagcta cacaaatcct gccgtggccg ctgcctccgc caacctg 6117
<210> 50
<211> 2039
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 50
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
565 570 575
Gly Asp Val Glu Glu Asn Pro Gly Pro Gly Ala Ala Pro Glu Pro Glu
580 585 590
Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg
595 600 605
Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala
610 615 620
Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser
625 630 635 640
His Pro Ser Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser
645 650 655
Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu
660 665 670
Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro
675 680 685
Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg
690 695 700
Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr
705 710 715 720
Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro
725 730 735
Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val
740 745 750
Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala
755 760 765
Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu
770 775 780
Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His
785 790 795 800
Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg
805 810 815
Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe
820 825 830
Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu
835 840 845
Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp
850 855 860
Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg
865 870 875 880
Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val
885 890 895
Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr
900 905 910
Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys
915 920 925
Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg
930 935 940
Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala
945 950 955 960
Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg
965 970 975
Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg
980 985 990
Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser
995 1000 1005
Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala
1010 1015 1020
Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe
1025 1030 1035
Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe
1040 1045 1050
Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp
1055 1060 1065
Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr
1070 1075 1080
Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly
1085 1090 1095
His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp
1100 1105 1110
Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr
1115 1120 1125
Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu
1130 1135 1140
Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
1145 1150 1155
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys
1160 1165 1170
Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser
1175 1180 1185
Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg
1190 1195 1200
Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val
1205 1210 1215
Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val
1220 1225 1230
Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr
1235 1240 1245
Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala
1250 1255 1260
Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu
1265 1270 1275
Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser
1280 1285 1290
Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly
1295 1300 1305
Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu
1310 1315 1320
Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser
1325 1330 1335
Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln
1340 1345 1350
Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln
1355 1360 1365
Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp
1370 1375 1380
Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
1385 1390 1395
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu
1400 1405 1410
Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr
1415 1420 1425
Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr
1430 1435 1440
Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr
1445 1450 1455
Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys
1460 1465 1470
Thr Ile Leu Asp Gly Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr
1475 1480 1485
Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu Leu Asn Pro
1490 1495 1500
Gly Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1505 1510 1515
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser
1520 1525 1530
Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser
1535 1540 1545
Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser
1550 1555 1560
Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln
1565 1570 1575
Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly
1580 1585 1590
Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val Thr
1595 1600 1605
Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr
1610 1615 1620
Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro
1625 1630 1635
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
1640 1645 1650
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
1655 1660 1665
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr
1670 1675 1680
Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
1685 1690 1695
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
1700 1705 1710
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
1715 1720 1725
Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr
1730 1735 1740
Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
1745 1750 1755
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser
1760 1765 1770
Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr
1775 1780 1785
Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His
1790 1795 1800
Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro
1805 1810 1815
Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile
1820 1825 1830
Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp
1835 1840 1845
Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln
1850 1855 1860
Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe
1865 1870 1875
Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu
1880 1885 1890
Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr
1895 1900 1905
Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val
1910 1915 1920
Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala
1925 1930 1935
Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val
1940 1945 1950
Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys
1955 1960 1965
Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala
1970 1975 1980
Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
1985 1990 1995
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr
2000 2005 2010
Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr
2015 2020 2025
Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
2030 2035
<210> 51
<211> 6117
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 51
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcgagggc 2700
agaggcagcc tgctgacatg tggcgacgtg gaagagaacc ctggccccct ggctggcgag 2760
acaggacagg aagccgctcc tctggacggc gtgctggcca accctcccaa tatcagcagc 2820
ctgagcccca gacagctgct gggattccct tgtgccgagg tgtccggcct gagcacagag 2880
agagtgcggg aactggctgt ggccctggcc cagaaaaacg tgaagctgag caccgagcag 2940
ctgcggtgcc tggcccacag actgtctgag cctcccgagg atctggacgc cctgcctctg 3000
gatctgctgc tgttcctgaa ccccgacgcc ttcagcggac ctcaggcctg cacccggttc 3060
ttcagcagaa tcaccaaggc caacgtggac ctgctgccca gaggcgcccc tgagagacag 3120
agactgctgc ctgctgctct ggcctgttgg ggagtgcggg gctctctgct gtctgaagct 3180
gatgtgcggg ccctgggagg cctggcttgt gatctgcctg gaagattcgt ggccgagagc 3240
gccgaagtgc tgctgcctag actggtgtcc tgtcccggcc ctctggacca ggatcagcag 3300
gaagctgcca gagctgctct gcagggcgga ggccctcctt atggacctcc tagcacttgg 3360
agcgtgtcca ccatggatgc cctgaggggc ctgctgccag tgctgggcca gcctatcatc 3420
agatccatcc cacagggcat cgtggccgcc tggcggcaga gaagctctag agatccctct 3480
tggcggcagc ccgagcggac aatcctgcgg cccaggtttc ggagagaggt ggaaaagacc 3540
gcctgcccct ctggcaagaa ggccagagag atcgacgaga gcctgatctt ctacaagaag 3600
tgggagctgg aagcctgcgt ggacgccgct ctgctggcca cccagatgga cagagtgaac 3660
gccatcccct tcacctatga gcagctggac gtgctgaagc acaagctgga tgagctgtac 3720
ccccagggct accccgagag cgtgatccag cacctgggct acctgtttct gaagatgagc 3780
cccgaggaca tccggaagtg gaacgtgacc agcctggaaa ccctgaaggc cctgctggaa 3840
gtgaacaagg gccacgagat gtccccccag gtggccacac tgatcgacag attcgtgaag 3900
ggcagaggcc agctggacaa ggacaccctg gatacactga ccgccttcta ccccggctat 3960
ctgtgcagcc tgtcccccga ggaactgagc agcgtgccac ctagctctat ctgggctgtg 4020
cggccccagg acctggatac ctgcgatcct agacagctgg atgtgctgta tcccaaggcc 4080
cggctggcct tccagaacat gaacggcagc gagtacttcg tgaagatcca gtccttcctg 4140
ggcggagccc ctaccgagga cctgaaagct ctgagccagc agaacgtgtc catggatctg 4200
gccaccttta tgaagctgcg gaccgacgcc gtgctgcctc tgacagtggc tgaggtgcag 4260
aaactgctgg gcccccatgt ggaagggctg aaggccgaag aacggcacag acccgtgcgc 4320
gactggatcc tgaggcagag acaggatgac ctggacacac tgggcctggg actgcagggg 4380
ggcatcccta atggctacct ggtgctggac ctgagcatgc aggaagccct gggatccggc 4440
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 4500
acaaaccctg gccccacccc tggaacccag agccccttct tccttctgct gctgctgacc 4560
gtgctgactg tcgtgacagg ctctggccac gccagctcta cacctggcgg cgagaaagag 4620
acaagcgcca cccagagaag cagcgtgcca agcagcaccg agaagaacgc cgtgtccatg 4680
accagctccg tgctgagcag ccactctcct ggcagcggca gcagcacaac acagggccag 4740
gatgtgacac tggcccctgc cacagaacct gcctctggat ctgccgccac ctggggacag 4800
gacgtgacaa gcgtgccagt gaccagacct gccctgggct ctacaacacc ccctgcccac 4860
gatgtgacca gcgcccctga taacaagcct gcccctggaa gcacagcccc tccagctcat 4920
ggcgtgacct ctgccccaga taccagacca gccccaggat ctacagcccc acccgcacac 4980
ggcgtgacaa gtgcccctga cacaagaccc gctccaggct ctactgctcc tcctgcccat 5040
ggcgtgacaa gcgctcccga tacaaggcca gctcctggct ccacagcacc accagcacat 5100
ggcgtgacat cagctcccga cactagacct gctcccggat caaccgctcc accagctcac 5160
ggcgtgacca gcgcacctga taccagacct gctctgggaa gcaccgcccc tcccgtgcac 5220
aatgtgacat ctgcttccgg cagcgccagc ggctctgcct ctacactggt gcacaacggc 5280
accagcgcca gagccacaac aaccccagcc agcaagagca cccccttcag catccctagc 5340
caccacagcg acacccctac cacactggcc agccactcca ccaagaccga tgcctctagc 5400
acccaccact ccagcgtgcc ccctctgacc agcagcaacc acagcacaag cccccagctg 5460
tctaccggcg tctcattctt ctttctgtcc ttccacatca gcaacctgca gttcaacagc 5520
agcctggaag atcccagcac cgactactac caggaactgc agcgggatat cagcgagatg 5580
ttcctgcaaa tctacaagca gggcggcttc ctgggcctga gcaacatcaa gttcagaccc 5640
ggcagcgtgg tggtgcagct gaccctggct ttccgggaag gcaccatcaa cgtgcacgac 5700
gtggaaaccc agttcaacca gtacaagacc gaggccgcca gccggtacaa cctgaccatc 5760
tccgatgtgt ccgtgtccga cgtgcccttc ccattctctg cccagtctgg cgcaggcgtg 5820
ccaggatggg gaattgctct gctggtgctc gtgtgcgtgc tggtggccct ggccatcgtg 5880
tatctgattg ccctggccgt gtgccagtgc cggcggaaga attacggcca gctggacatc 5940
ttccccgcca gagacaccta ccaccccatg agcgagtacc ccacatacca cacccacggc 6000
agatacgtgc cacccagctc caccgacaga tccccctacg agaaagtgtc tgccggcaac 6060
ggcggcagct ccctgagcta cacaaatcct gccgtggccg ctgcctccgc caacctg 6117
<210> 52
<211> 2039
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 52
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
900 905 910
Asn Pro Gly Pro Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu
915 920 925
Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg
930 935 940
Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu
945 950 955 960
Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu
965 970 975
Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro
980 985 990
Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro
995 1000 1005
Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg
1010 1015 1020
Ile Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu
1025 1030 1035
Arg Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg
1040 1045 1050
Gly Ser Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu
1055 1060 1065
Ala Cys Asp Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val
1070 1075 1080
Leu Leu Pro Arg Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp
1085 1090 1095
Gln Gln Glu Ala Ala Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro
1100 1105 1110
Tyr Gly Pro Pro Ser Thr Trp Ser Val Ser Thr Met Asp Ala Leu
1115 1120 1125
Arg Gly Leu Leu Pro Val Leu Gly Gln Pro Ile Ile Arg Ser Ile
1130 1135 1140
Pro Gln Gly Ile Val Ala Ala Trp Arg Gln Arg Ser Ser Arg Asp
1145 1150 1155
Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu Arg Pro Arg Phe
1160 1165 1170
Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly Lys Lys Ala
1175 1180 1185
Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp Glu Leu
1190 1195 1200
Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp Arg
1205 1210 1215
Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
1220 1225 1230
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val
1235 1240 1245
Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp
1250 1255 1260
Ile Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu
1265 1270 1275
Leu Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr
1280 1285 1290
Leu Ile Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp
1295 1300 1305
Thr Leu Asp Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser
1310 1315 1320
Leu Ser Pro Glu Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp
1325 1330 1335
Ala Val Arg Pro Gln Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu
1340 1345 1350
Asp Val Leu Tyr Pro Lys Ala Arg Leu Ala Phe Gln Asn Met Asn
1355 1360 1365
Gly Ser Glu Tyr Phe Val Lys Ile Gln Ser Phe Leu Gly Gly Ala
1370 1375 1380
Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln Gln Asn Val Ser Met
1385 1390 1395
Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp Ala Val Leu Pro
1400 1405 1410
Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro His Val Glu
1415 1420 1425
Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp Trp Ile
1430 1435 1440
Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly Leu
1445 1450 1455
Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
1460 1465 1470
Gln Glu Ala Leu Gly Ser Gly Arg Ile Phe Asn Ala His Tyr Ala
1475 1480 1485
Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu Thr Asn Pro
1490 1495 1500
Gly Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1505 1510 1515
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser
1520 1525 1530
Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser
1535 1540 1545
Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser
1550 1555 1560
Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln
1565 1570 1575
Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly
1580 1585 1590
Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val Thr
1595 1600 1605
Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr
1610 1615 1620
Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro
1625 1630 1635
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
1640 1645 1650
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
1655 1660 1665
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr
1670 1675 1680
Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
1685 1690 1695
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
1700 1705 1710
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr
1715 1720 1725
Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr
1730 1735 1740
Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
1745 1750 1755
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser
1760 1765 1770
Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr
1775 1780 1785
Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His
1790 1795 1800
Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro
1805 1810 1815
Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile
1820 1825 1830
Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp
1835 1840 1845
Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln
1850 1855 1860
Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe
1865 1870 1875
Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu
1880 1885 1890
Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr
1895 1900 1905
Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val
1910 1915 1920
Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala
1925 1930 1935
Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val
1940 1945 1950
Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys
1955 1960 1965
Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala
1970 1975 1980
Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
1985 1990 1995
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr
2000 2005 2010
Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr
2015 2020 2025
Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
2030 2035
<210> 53
<211> 6138
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 53
atggctagcg gagctgcccc ggagccggag aggacccccg ttggccaggg atcgtgggcc 60
catccgggac gcaccagggg accatccgac aggggattct gtgtggtgtc accggccagg 120
ccagcagaag aggcaaccag cctcgaggga gcgttgtctg gaaccagaca ttcccacccg 180
tcggtgggcc ggcagcacca cgcgggacca ccgtccactt ccagaccgcc acggccatgg 240
gacacccctt gcccgcctgt gtatgccgag actaaacact tcctgtactc atccggagac 300
aaggaacagc ttcggccgtc cttcctcctg tcgtcgctca gaccgagcct gaccggagca 360
cgcagattgg tggaaactat cttccttggg tcacgtccgt ggatgccagg taccccacgg 420
cgcctcccgc gcctcccaca gagatactgg cagatgcggc ctctgttcct ggaattgctg 480
ggaaaccacg ctcagtgccc gtacggagtc ctgctcaaga ctcactgccc tctgagggcg 540
gcggtcactc cggcggccgg agtgtgcgca cgggagaagc cccagggaag cgtggcagct 600
ccggaagagg aggacaccga tccgcgccgc ctcgtgcaac ttctgcgcca gcactcctcg 660
ccctggcaag tctacgggtt cgtccgcgcc tgcctgcgcc gcctggtgcc gcctgggctc 720
tggggttccc ggcataacga gcgccgcttc ctgagaaata ctaagaagtt tatctcactt 780
ggaaaacatg ccaagttgtc gctgcaagaa ctcacgtgga agatgtcagt ccgcgattgc 840
gcctggctgc gccgctcgcc gggcgtcggg tgtgttccag ctgcagaaca ccgcctgaga 900
gaagaaattc tggccaaatt tctgcattgg ctgatgtcag tgtacgtggt cgagctgctg 960
cgctcctttt tctacgtcac tgagactacc tttcaaaaga accgcctgtt cttctaccgc 1020
aaatctgtgt ggagcaagct gcagtcaatc ggcattcgcc agcatctgaa gagggtgcag 1080
ctgcgggaac tttccgaggc agaagtccgc cagcaccggg aggcccggcc ggcgcttctc 1140
acgtcgcgtc tgagattcat cccaaagccc gacgggctga ggcctatcgt caacatggat 1200
tacgtcgtgg gcgctcgcac ctttcgccgt gaaaagcggg ccgaacgctt gacctcacgg 1260
gtgaaggccc tcttctccgt gctgaactac gagagagcaa gacggcctgg cctgctggga 1320
gcttcggtgc tgggactgga cgatatccac cgggcttggc ggacctttgt tctccgggtg 1380
agagcccaag accctccgcc ggaactgtac ttcgtgaagg tggcgatcac cggagcctat 1440
gatactattc cgcaagatcg actcaccgaa gtcatcgcct cgatcatcaa accgcagaac 1500
acttactgcg tcaggcggta cgccgtggtc cagaaggccg cgcatggcca cgtgagaaag 1560
gcgttcaagt cgcacgtgtc cactctcacc gacctccagc cttacatgag gcaattcgtt 1620
gcgcatttgc aagagacttc gcccctgaga gatgcggtgg tcatcgagca gagctccagc 1680
ctgaacgaag cgagcagcgg tctgtttgac gtgttcctcc gcttcatgtg tcatcacgcg 1740
gtgcgaatca ggggaaaatc atacgtgcag tgccagggaa tcccacaagg cagcattctg 1800
tcgactctct tgtgttccct ttgctacggc gatatggaaa acaagctgtt cgctgggatc 1860
agacgggacg ggttgctgct cagactggtg gacgacttcc tgctggtgac tccgcacctc 1920
actcacgcca aaacctttct ccgcactctg gtgaggggag tgccagaata cggctgtgtg 1980
gtcaatctcc ggaaaactgt ggtgaatttc cctgtcgagg atgaggcact cggaggaacc 2040
gcatttgtcc aaatgccagc acatggcctg ttcccatggt gcggtctgct gctggacacc 2100
cgaactcttg aagtgcagtc cgactactcc agctatgccc ggacgagcat ccgcgccagc 2160
ctcactttca atcgcggctt taaggccgga cgaaacatgc gcagaaagct tttcggagtc 2220
ctccggctta aatgccattc gctctttctc gatctccaag tcaattcgct gcagaccgtg 2280
tgcacgaaca tctacaagat cctgctgctc caagcctacc ggttccacgc ttgcgtgctt 2340
cagctgccgt ttcaccaaca ggtgtggaag aacccgacct tctttctgcg ggtcattagc 2400
gatactgcct ccctgtgtta ctcaatcctc aaggcaaaga acgccggaat gtcgctgggt 2460
gcgaaaggag ccgcgggacc tcttcctagc gaagcggtgc agtggctctg ccaccaggct 2520
ttcctcctga agctgaccag gcacagagtg acctacgtcc cgctgctggg ctcgctgcgc 2580
actgcacaga cccagctgtc tagaaaactc cccggcacca ccctgaccgc tctggaagcc 2640
gccgccaacc cagcattgcc gtcagatttc aagaccatct tggacggatc cggcacaatc 2700
ctgtctgagg gcgccaccaa cttcagcctg ctgaaactgg ccggcgacgt ggaactgaac 2760
cctggcccta cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 2820
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 2880
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 2940
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 3000
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 3060
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 3120
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 3180
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 3240
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 3300
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 3360
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 3420
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 3480
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 3540
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 3600
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 3660
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 3720
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 3780
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 3840
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 3900
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 3960
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 4020
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 4080
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 4140
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 4200
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 4260
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 4320
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 4380
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 4440
acaaaccctg gccccctggc tggcgagaca ggacaggaag ccgctcctct ggacggcgtg 4500
ctggccaacc ctcccaatat cagcagcctg agccccagac agctgctggg attcccttgt 4560
gccgaggtgt ccggcctgag cacagagaga gtgcgggaac tggctgtggc cctggcccag 4620
aaaaacgtga agctgagcac cgagcagctg cggtgcctgg cccacagact gtctgagcct 4680
cccgaggatc tggacgccct gcctctggat ctgctgctgt tcctgaaccc cgacgccttc 4740
agcggacctc aggcctgcac ccggttcttc agcagaatca ccaaggccaa cgtggacctg 4800
ctgcccagag gcgcccctga gagacagaga ctgctgcctg ctgctctggc ctgttgggga 4860
gtgcggggct ctctgctgtc tgaagctgat gtgcgggccc tgggaggcct ggcttgtgat 4920
ctgcctggaa gattcgtggc cgagagcgcc gaagtgctgc tgcctagact ggtgtcctgt 4980
cccggccctc tggaccagga tcagcaggaa gctgccagag ctgctctgca gggcggaggc 5040
cctccttatg gacctcctag cacttggagc gtgtccacca tggatgccct gaggggcctg 5100
ctgccagtgc tgggccagcc tatcatcaga tccatcccac agggcatcgt ggccgcctgg 5160
cggcagagaa gctctagaga tccctcttgg cggcagcccg agcggacaat cctgcggccc 5220
aggtttcgga gagaggtgga aaagaccgcc tgcccctctg gcaagaaggc cagagagatc 5280
gacgagagcc tgatcttcta caagaagtgg gagctggaag cctgcgtgga cgccgctctg 5340
ctggccaccc agatggacag agtgaacgcc atccccttca cctatgagca gctggacgtg 5400
ctgaagcaca agctggatga gctgtacccc cagggctacc ccgagagcgt gatccagcac 5460
ctgggctacc tgtttctgaa gatgagcccc gaggacatcc ggaagtggaa cgtgaccagc 5520
ctggaaaccc tgaaggccct gctggaagtg aacaagggcc acgagatgtc cccccaggtg 5580
gccacactga tcgacagatt cgtgaagggc agaggccagc tggacaagga caccctggat 5640
acactgaccg ccttctaccc cggctatctg tgcagcctgt cccccgagga actgagcagc 5700
gtgccaccta gctctatctg ggctgtgcgg ccccaggacc tggatacctg cgatcctaga 5760
cagctggatg tgctgtatcc caaggcccgg ctggccttcc agaacatgaa cggcagcgag 5820
tacttcgtga agatccagtc cttcctgggc ggagccccta ccgaggacct gaaagctctg 5880
agccagcaga acgtgtccat ggatctggcc acctttatga agctgcggac cgacgccgtg 5940
ctgcctctga cagtggctga ggtgcagaaa ctgctgggcc cccatgtgga agggctgaag 6000
gccgaagaac ggcacagacc cgtgcgcgac tggatcctga ggcagagaca ggatgacctg 6060
gacacactgg gcctgggact gcaggggggc atccctaatg gctacctggt gctggacctg 6120
agcatgcagg aagccctg 6138
<210> 54
<211> 2046
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 54
Met Ala Ser Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
20 25 30
Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
35 40 45
Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
50 55 60
Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
65 70 75 80
Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
85 90 95
Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
100 105 110
Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
115 120 125
Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
130 135 140
Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
145 150 155 160
Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
165 170 175
Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
180 185 190
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
195 200 205
Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
210 215 220
Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
225 230 235 240
Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
245 250 255
Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
260 265 270
Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
275 280 285
Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
290 295 300
Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
305 310 315 320
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
325 330 335
Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
340 345 350
Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
355 360 365
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
370 375 380
Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
385 390 395 400
Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
405 410 415
Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
420 425 430
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
435 440 445
Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
450 455 460
Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
465 470 475 480
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
485 490 495
Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
500 505 510
Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
515 520 525
Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
530 535 540
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
545 550 555 560
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
565 570 575
Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
580 585 590
Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
595 600 605
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
610 615 620
Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
625 630 635 640
Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
645 650 655
Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
660 665 670
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
675 680 685
Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
690 695 700
Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
705 710 715 720
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
725 730 735
Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
740 745 750
Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
755 760 765
Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
770 775 780
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
785 790 795 800
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
805 810 815
Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
820 825 830
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
835 840 845
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
850 855 860
Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
865 870 875 880
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp Gly
885 890 895
Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
900 905 910
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Thr Pro Gly Thr Gln
915 920 925
Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr Val Val Thr
930 935 940
Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser
945 950 955 960
Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val
965 970 975
Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser
980 985 990
Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro
995 1000 1005
Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val
1010 1015 1020
Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His
1025 1030 1035
Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr
1040 1045 1050
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1055 1060 1065
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1070 1075 1080
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His
1085 1090 1095
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1100 1105 1110
Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
1115 1120 1125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
1130 1135 1140
Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His
1145 1150 1155
Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr
1160 1165 1170
Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala
1175 1180 1185
Ser Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr
1190 1195 1200
Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser
1205 1210 1215
Thr His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser
1220 1225 1230
Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser
1235 1240 1245
Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro
1250 1255 1260
Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met
1265 1270 1275
Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn
1280 1285 1290
Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala
1295 1300 1305
Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
1310 1315 1320
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile
1325 1330 1335
Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln
1340 1345 1350
Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
1355 1360 1365
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu
1370 1375 1380
Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile
1385 1390 1395
Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr
1400 1405 1410
Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg
1415 1420 1425
Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu
1430 1435 1440
Ser Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu Gly
1445 1450 1455
Ser Gly Arg Ile Phe Asn Ala His Tyr Ala Gly Tyr Phe Ala Asp
1460 1465 1470
Leu Leu Ile His Asp Ile Glu Thr Asn Pro Gly Pro Leu Ala Gly
1475 1480 1485
Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp Gly Val Leu Ala Asn
1490 1495 1500
Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln Leu Leu Gly Phe
1505 1510 1515
Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg Val Arg Glu
1520 1525 1530
Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser Thr Glu
1535 1540 1545
Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu Asp
1550 1555 1560
Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
1565 1570 1575
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile
1580 1585 1590
Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg
1595 1600 1605
Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly
1610 1615 1620
Ser Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala
1625 1630 1635
Cys Asp Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu
1640 1645 1650
Leu Pro Arg Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln
1655 1660 1665
Gln Glu Ala Ala Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr
1670 1675 1680
Gly Pro Pro Ser Thr Trp Ser Val Ser Thr Met Asp Ala Leu Arg
1685 1690 1695
Gly Leu Leu Pro Val Leu Gly Gln Pro Ile Ile Arg Ser Ile Pro
1700 1705 1710
Gln Gly Ile Val Ala Ala Trp Arg Gln Arg Ser Ser Arg Asp Pro
1715 1720 1725
Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu Arg Pro Arg Phe Arg
1730 1735 1740
Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly Lys Lys Ala Arg
1745 1750 1755
Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp Glu Leu Glu
1760 1765 1770
Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp Arg Val
1775 1780 1785
Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys His
1790 1795 1800
Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
1805 1810 1815
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile
1820 1825 1830
Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu
1835 1840 1845
Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu
1850 1855 1860
Ile Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr
1865 1870 1875
Leu Asp Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu
1880 1885 1890
Ser Pro Glu Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala
1895 1900 1905
Val Arg Pro Gln Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp
1910 1915 1920
Val Leu Tyr Pro Lys Ala Arg Leu Ala Phe Gln Asn Met Asn Gly
1925 1930 1935
Ser Glu Tyr Phe Val Lys Ile Gln Ser Phe Leu Gly Gly Ala Pro
1940 1945 1950
Thr Glu Asp Leu Lys Ala Leu Ser Gln Gln Asn Val Ser Met Asp
1955 1960 1965
Leu Ala Thr Phe Met Lys Leu Arg Thr Asp Ala Val Leu Pro Leu
1970 1975 1980
Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro His Val Glu Gly
1985 1990 1995
Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp Trp Ile Leu
2000 2005 2010
Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln
2015 2020 2025
Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met Gln
2030 2035 2040
Glu Ala Leu
2045
<210> 55
<211> 5214
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 55
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 1680
acaaaccctg gccccctggc tggcgagaca ggacaggaag ccgctcctct ggacggcgtg 1740
ctggccaacc ctcccaatat cagcagcctg agccccagac agctgctggg attcccttgt 1800
gccgaggtgt ccggcctgag cacagagaga gtgcgggaac tggctgtggc cctggcccag 1860
aaaaacgtga agctgagcac cgagcagctg cggtgcctgg cccacagact gtctgagcct 1920
cccgaggatc tggacgccct gcctctggat ctgctgctgt tcctgaaccc cgacgccttc 1980
agcggacctc aggcctgcac ccggttcttc agcagaatca ccaaggccaa cgtggacctg 2040
ctgcccagag gcgcccctga gagacagaga ctgctgcctg ctgctctggc ctgttgggga 2100
gtgcggggct ctctgctgtc tgaagctgat gtgcgggccc tgggaggcct ggcttgtgat 2160
ctgcctggaa gattcgtggc cgagagcgcc gaagtgctgc tgcctagact ggtgtcctgt 2220
cccggccctc tggaccagga tcagcaggaa gctgccagag ctgctctgca gggcggaggc 2280
cctccttatg gacctcctag cacttggagc gtgtccacca tggatgccct gaggggcctg 2340
ctgccagtgc tgggccagcc tatcatcaga tccatcccac agggcatcgt ggccgcctgg 2400
cggcagagaa gctctagaga tccctcttgg cggcagcccg agcggacaat cctgcggccc 2460
aggtttcgga gagaggtgga aaagaccgcc tgcccctctg gcaagaaggc cagagagatc 2520
gacgagagcc tgatcttcta caagaagtgg gagctggaag cctgcgtgga cgccgctctg 2580
ctggccaccc agatggacag agtgaacgcc atccccttca cctatgagca gctggacgtg 2640
ctgaagcaca agctggatga gctgtacccc cagggctacc ccgagagcgt gatccagcac 2700
ctgggctacc tgtttctgaa gatgagcccc gaggacatcc ggaagtggaa cgtgaccagc 2760
ctggaaaccc tgaaggccct gctggaagtg aacaagggcc acgagatgtc cccccaggtg 2820
gccacactga tcgacagatt cgtgaagggc agaggccagc tggacaagga caccctggat 2880
acactgaccg ccttctaccc cggctatctg tgcagcctgt cccccgagga actgagcagc 2940
gtgccaccta gctctatctg ggctgtgcgg ccccaggacc tggatacctg cgatcctaga 3000
cagctggatg tgctgtatcc caaggcccgg ctggccttcc agaacatgaa cggcagcgag 3060
tacttcgtga agatccagtc cttcctgggc ggagccccta ccgaggacct gaaagctctg 3120
agccagcaga acgtgtccat ggatctggcc acctttatga agctgcggac cgacgccgtg 3180
ctgcctctga cagtggctga ggtgcagaaa ctgctgggcc cccatgtgga agggctgaag 3240
gccgaagaac ggcacagacc cgtgcgcgac tggatcctga ggcagagaca ggatgacctg 3300
gacacactgg gcctgggact gcaggggggc atccctaatg gctacctggt gctggacctg 3360
agcatgcagg aagccctggg atccggcgag ggcagaggca gcctgctgac atgtggcgac 3420
gtggaagaga accctggccc cgccaaattt ctgcattggc tgatgtcagt gtacgtggtc 3480
gagctgctgc gctccttttt ctacgtcact gagactacct ttcaaaagaa ccgcctgttc 3540
ttctaccgca aatctgtgtg gagcaagctg cagtcaatcg gcattcgcca gcatctgaag 3600
agggtgcagc tgcgggaact ttccgaggca gaagtccgcc agcaccggga ggcccggccg 3660
gcgcttctca cgtcgcgtct gagattcatc ccaaagcccg acgggctgag gcctatcgtc 3720
aacatggatt acgtcgtggg cgctcgcacc tttcgccgtg aaaagcgggc cgaacgcttg 3780
acctcacggg tgaaggccct cttctccgtg ctgaactacg agagagcaag acggcctggc 3840
ctgctgggag cttcggtgct gggactggac gatatccacc gggcttggcg gacctttgtt 3900
ctccgggtga gagcccaaga ccctccgccg gaactgtact tcgtgaaggt ggcgatcacc 3960
ggagcctatg atactattcc gcaagatcga ctcaccgaag tcatcgcctc gatcatcaaa 4020
ccgcagaaca cttactgcgt caggcggtac gccgtggtcc agaaggccgc gcatggccac 4080
gtgagaaagg cgttcaagtc gcacgtgtcc actctcaccg acctccagcc ttacatgagg 4140
caattcgttg cgcatttgca agagacttcg cccctgagag atgcggtggt catcgagcag 4200
agctccagcc tgaacgaagc gagcagcggt ctgtttgacg tgttcctccg cttcatgtgt 4260
catcacgcgg tgcgaatcag gggaaaatca tacgtgcagt gccagggaat cccacaaggc 4320
agcattctgt cgactctctt gtgttccctt tgctacggcg atatggaaaa caagctgttc 4380
gctgggatca gacgggacgg gttgctgctc agactggtgg acgacttcct gctggtgact 4440
ccgcacctca ctcacgccaa aacctttctc cgcactctgg tgaggggagt gccagaatac 4500
ggctgtgtgg tcaatctccg gaaaactgtg gtgaatttcc ctgtcgagga tgaggcactc 4560
ggaggaaccg catttgtcca aatgccagca catggcctgt tcccatggtg cggtctgctg 4620
ctggacaccc gaactcttga agtgcagtcc gactactcca gctatgcccg gacgagcatc 4680
cgcgccagcc tcactttcaa tcgcggcttt aaggccggac gaaacatgcg cagaaagctt 4740
ttcggagtcc tccggcttaa atgccattcg ctctttctcg atctccaagt caattcgctg 4800
cagaccgtgt gcacgaacat ctacaagatc ctgctgctcc aagcctaccg gttccacgct 4860
tgcgtgcttc agctgccgtt tcaccaacag gtgtggaaga acccgacctt ctttctgcgg 4920
gtcattagcg atactgcctc cctgtgttac tcaatcctca aggcaaagaa cgccggaatg 4980
tcgctgggtg cgaaaggagc cgcgggacct cttcctagcg aagcggtgca gtggctctgc 5040
caccaggctt tcctcctgaa gctgaccagg cacagagtga cctacgtccc gctgctgggc 5100
tcgctgcgca ctgcacagac ccagctgtct agaaaactcc ccggcaccac cctgaccgct 5160
ctggaagccg ccgccaaccc agcattgccg tcagatttca agaccatctt ggac 5214
<210> 56
<211> 1738
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 56
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Arg Ile Phe Asn
530 535 540
Ala His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu
545 550 555 560
Thr Asn Pro Gly Pro Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro
565 570 575
Leu Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro
580 585 590
Arg Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr
595 600 605
Glu Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys
610 615 620
Leu Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro
625 630 635 640
Pro Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn
645 650 655
Pro Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg
660 665 670
Ile Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg
675 680 685
Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser
690 695 700
Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp
705 710 715 720
Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg
725 730 735
Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala
740 745 750
Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr
755 760 765
Trp Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu
770 775 780
Gly Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp
785 790 795 800
Arg Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr
805 810 815
Ile Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro
820 825 830
Ser Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys
835 840 845
Lys Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln
850 855 860
Met Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val
865 870 875 880
Leu Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser
885 890 895
Val Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp
900 905 910
Ile Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu
915 920 925
Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile
930 935 940
Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp
945 950 955 960
Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu
965 970 975
Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln
980 985 990
Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys
995 1000 1005
Ala Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val
1010 1015 1020
Lys Ile Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys
1025 1030 1035
Ala Leu Ser Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met
1040 1045 1050
Lys Leu Arg Thr Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val
1055 1060 1065
Gln Lys Leu Leu Gly Pro His Val Glu Gly Leu Lys Ala Glu Glu
1070 1075 1080
Arg His Arg Pro Val Arg Asp Trp Ile Leu Arg Gln Arg Gln Asp
1085 1090 1095
Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln Gly Gly Ile Pro Asn
1100 1105 1110
Gly Tyr Leu Val Leu Asp Leu Ser Met Gln Glu Ala Leu Gly Ser
1115 1120 1125
Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
1130 1135 1140
Asn Pro Gly Pro Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr
1145 1150 1155
Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr
1160 1165 1170
Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser
1175 1180 1185
Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln
1190 1195 1200
Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala
1205 1210 1215
Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro
1220 1225 1230
Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala
1235 1240 1245
Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg
1250 1255 1260
Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg
1265 1270 1275
Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His
1280 1285 1290
Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro
1295 1300 1305
Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr
1310 1315 1320
Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile
1325 1330 1335
Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val
1340 1345 1350
Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His
1355 1360 1365
Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val
1370 1375 1380
Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile
1385 1390 1395
Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp
1400 1405 1410
Val Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile Arg Gly
1415 1420 1425
Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu
1430 1435 1440
Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys
1445 1450 1455
Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val
1460 1465 1470
Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys Thr
1475 1480 1485
Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val
1490 1495 1500
Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu
1505 1510 1515
Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu
1520 1525 1530
Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val
1535 1540 1545
Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
1550 1555 1560
Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg
1565 1570 1575
Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu
1580 1585 1590
Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr
1595 1600 1605
Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu
1610 1615 1620
Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe
1625 1630 1635
Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu
1640 1645 1650
Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala
1655 1660 1665
Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala
1670 1675 1680
Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu
1685 1690 1695
Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu
1700 1705 1710
Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala
1715 1720 1725
Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
1730 1735
<210> 57
<211> 5235
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 57
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cagaatcttc aacgcccact acgccggcta cttcgccgac 1740
ctgctgatcc acgacatcga gacaaaccct ggccccaccc ctggaaccca gagccccttc 1800
ttccttctgc tgctgctgac cgtgctgact gtcgtgacag gctctggcca cgccagctct 1860
acacctggcg gcgagaaaga gacaagcgcc acccagagaa gcagcgtgcc aagcagcacc 1920
gagaagaacg ccgtgtccat gaccagctcc gtgctgagca gccactctcc tggcagcggc 1980
agcagcacaa cacagggcca ggatgtgaca ctggcccctg ccacagaacc tgcctctgga 2040
tctgccgcca cctggggaca ggacgtgaca agcgtgccag tgaccagacc tgccctgggc 2100
tctacaacac cccctgccca cgatgtgacc agcgcccctg ataacaagcc tgcccctgga 2160
agcacagccc ctccagctca tggcgtgacc tctgccccag ataccagacc agccccagga 2220
tctacagccc cacccgcaca cggcgtgaca agtgcccctg acacaagacc cgctccaggc 2280
tctactgctc ctcctgccca tggcgtgaca agcgctcccg atacaaggcc agctcctggc 2340
tccacagcac caccagcaca tggcgtgaca tcagctcccg acactagacc tgctcccgga 2400
tcaaccgctc caccagctca cggcgtgacc agcgcacctg ataccagacc tgctctggga 2460
agcaccgccc ctcccgtgca caatgtgaca tctgcttccg gcagcgccag cggctctgcc 2520
tctacactgg tgcacaacgg caccagcgcc agagccacaa caaccccagc cagcaagagc 2580
acccccttca gcatccctag ccaccacagc gacaccccta ccacactggc cagccactcc 2640
accaagaccg atgcctctag cacccaccac tccagcgtgc cccctctgac cagcagcaac 2700
cacagcacaa gcccccagct gtctaccggc gtctcattct tctttctgtc cttccacatc 2760
agcaacctgc agttcaacag cagcctggaa gatcccagca ccgactacta ccaggaactg 2820
cagcgggata tcagcgagat gttcctgcaa atctacaagc agggcggctt cctgggcctg 2880
agcaacatca agttcagacc cggcagcgtg gtggtgcagc tgaccctggc tttccgggaa 2940
ggcaccatca acgtgcacga cgtggaaacc cagttcaacc agtacaagac cgaggccgcc 3000
agccggtaca acctgaccat ctccgatgtg tccgtgtccg acgtgccctt cccattctct 3060
gcccagtctg gcgcaggcgt gccaggatgg ggaattgctc tgctggtgct cgtgtgcgtg 3120
ctggtggccc tggccatcgt gtatctgatt gccctggccg tgtgccagtg ccggcggaag 3180
aattacggcc agctggacat cttccccgcc agagacacct accaccccat gagcgagtac 3240
cccacatacc acacccacgg cagatacgtg ccacccagct ccaccgacag atccccctac 3300
gagaaagtgt ctgccggcaa cggcggcagc tccctgagct acacaaatcc tgccgtggcc 3360
gctgcctccg ccaacctggg atccggcaca atcctgtctg agggcgccac caacttcagc 3420
ctgctgaaac tggccggcga cgtggaactg aaccctggcc ctgccaaatt tctgcattgg 3480
ctgatgtcag tgtacgtggt cgagctgctg cgctcctttt tctacgtcac tgagactacc 3540
tttcaaaaga accgcctgtt cttctaccgc aaatctgtgt ggagcaagct gcagtcaatc 3600
ggcattcgcc agcatctgaa gagggtgcag ctgcgggaac tttccgaggc agaagtccgc 3660
cagcaccggg aggcccggcc ggcgcttctc acgtcgcgtc tgagattcat cccaaagccc 3720
gacgggctga ggcctatcgt caacatggat tacgtcgtgg gcgctcgcac ctttcgccgt 3780
gaaaagcggg ccgaacgctt gacctcacgg gtgaaggccc tcttctccgt gctgaactac 3840
gagagagcaa gacggcctgg cctgctggga gcttcggtgc tgggactgga cgatatccac 3900
cgggcttggc ggacctttgt tctccgggtg agagcccaag accctccgcc ggaactgtac 3960
ttcgtgaagg tggcgatcac cggagcctat gatactattc cgcaagatcg actcaccgaa 4020
gtcatcgcct cgatcatcaa accgcagaac acttactgcg tcaggcggta cgccgtggtc 4080
cagaaggccg cgcatggcca cgtgagaaag gcgttcaagt cgcacgtgtc cactctcacc 4140
gacctccagc cttacatgag gcaattcgtt gcgcatttgc aagagacttc gcccctgaga 4200
gatgcggtgg tcatcgagca gagctccagc ctgaacgaag cgagcagcgg tctgtttgac 4260
gtgttcctcc gcttcatgtg tcatcacgcg gtgcgaatca ggggaaaatc atacgtgcag 4320
tgccagggaa tcccacaagg cagcattctg tcgactctct tgtgttccct ttgctacggc 4380
gatatggaaa acaagctgtt cgctgggatc agacgggacg ggttgctgct cagactggtg 4440
gacgacttcc tgctggtgac tccgcacctc actcacgcca aaacctttct ccgcactctg 4500
gtgaggggag tgccagaata cggctgtgtg gtcaatctcc ggaaaactgt ggtgaatttc 4560
cctgtcgagg atgaggcact cggaggaacc gcatttgtcc aaatgccagc acatggcctg 4620
ttcccatggt gcggtctgct gctggacacc cgaactcttg aagtgcagtc cgactactcc 4680
agctatgccc ggacgagcat ccgcgccagc ctcactttca atcgcggctt taaggccgga 4740
cgaaacatgc gcagaaagct tttcggagtc ctccggctta aatgccattc gctctttctc 4800
gatctccaag tcaattcgct gcagaccgtg tgcacgaaca tctacaagat cctgctgctc 4860
caagcctacc ggttccacgc ttgcgtgctt cagctgccgt ttcaccaaca ggtgtggaag 4920
aacccgacct tctttctgcg ggtcattagc gatactgcct ccctgtgtta ctcaatcctc 4980
aaggcaaaga acgccggaat gtcgctgggt gcgaaaggag ccgcgggacc tcttcctagc 5040
gaagcggtgc agtggctctg ccaccaggct ttcctcctga agctgaccag gcacagagtg 5100
acctacgtcc cgctgctggg ctcgctgcgc actgcacaga cccagctgtc tagaaaactc 5160
cccggcacca ccctgaccgc tctggaagcc gccgccaacc cagcattgcc gtcagatttc 5220
aagaccatct tggac 5235
<210> 58
<211> 1745
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 58
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Arg Ile Phe Asn Ala His Tyr Ala Gly
565 570 575
Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu Thr Asn Pro Gly Pro
580 585 590
Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val
595 600 605
Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly
610 615 620
Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr
625 630 635 640
Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser
645 650 655
Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala
660 665 670
Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp
675 680 685
Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro
690 695 700
Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly
705 710 715 720
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
725 730 735
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
740 745 750
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly
755 760 765
Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro
770 775 780
Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
785 790 795 800
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
805 810 815
Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala
820 825 830
Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr
835 840 845
Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser
850 855 860
Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser
865 870 875 880
Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu
885 890 895
Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser
900 905 910
Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser
915 920 925
Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile
930 935 940
Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu
945 950 955 960
Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu
965 970 975
Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
980 985 990
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser
995 1000 1005
Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser
1010 1015 1020
Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val
1025 1030 1035
Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
1040 1045 1050
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1055 1060 1065
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1070 1075 1080
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1085 1090 1095
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1100 1105 1110
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser
1115 1120 1125
Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
1130 1135 1140
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Ala Lys Phe Leu
1145 1150 1155
His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe
1160 1165 1170
Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe
1175 1180 1185
Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg
1190 1195 1200
Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
1205 1210 1215
Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg
1220 1225 1230
Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn
1235 1240 1245
Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg
1250 1255 1260
Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu
1265 1270 1275
Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val
1280 1285 1290
Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu
1295 1300 1305
Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys
1310 1315 1320
Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu
1325 1330 1335
Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys
1340 1345 1350
Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val
1355 1360 1365
Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln
1370 1375 1380
Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro
1385 1390 1395
Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu
1400 1405 1410
Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His
1415 1420 1425
His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly
1430 1435 1440
Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
1445 1450 1455
Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp
1460 1465 1470
Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro
1475 1480 1485
His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly
1490 1495 1500
Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val
1505 1510 1515
Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val
1520 1525 1530
Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu
1535 1540 1545
Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala
1550 1555 1560
Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys
1565 1570 1575
Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu
1580 1585 1590
Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln
1595 1600 1605
Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr
1610 1615 1620
Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val
1625 1630 1635
Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala
1640 1645 1650
Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser
1655 1660 1665
Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val
1670 1675 1680
Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
1685 1690 1695
Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln
1700 1705 1710
Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu
1715 1720 1725
Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile
1730 1735 1740
Leu Asp
1745
<210> 59
<211> 5214
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 59
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cgagggcaga ggcagcctgc tgacatgtgg cgacgtggaa 1740
gagaaccctg gccccgccaa atttctgcat tggctgatgt cagtgtacgt ggtcgagctg 1800
ctgcgctcct ttttctacgt cactgagact acctttcaaa agaaccgcct gttcttctac 1860
cgcaaatctg tgtggagcaa gctgcagtca atcggcattc gccagcatct gaagagggtg 1920
cagctgcggg aactttccga ggcagaagtc cgccagcacc gggaggcccg gccggcgctt 1980
ctcacgtcgc gtctgagatt catcccaaag cccgacgggc tgaggcctat cgtcaacatg 2040
gattacgtcg tgggcgctcg cacctttcgc cgtgaaaagc gggccgaacg cttgacctca 2100
cgggtgaagg ccctcttctc cgtgctgaac tacgagagag caagacggcc tggcctgctg 2160
ggagcttcgg tgctgggact ggacgatatc caccgggctt ggcggacctt tgttctccgg 2220
gtgagagccc aagaccctcc gccggaactg tacttcgtga aggtggcgat caccggagcc 2280
tatgatacta ttccgcaaga tcgactcacc gaagtcatcg cctcgatcat caaaccgcag 2340
aacacttact gcgtcaggcg gtacgccgtg gtccagaagg ccgcgcatgg ccacgtgaga 2400
aaggcgttca agtcgcacgt gtccactctc accgacctcc agccttacat gaggcaattc 2460
gttgcgcatt tgcaagagac ttcgcccctg agagatgcgg tggtcatcga gcagagctcc 2520
agcctgaacg aagcgagcag cggtctgttt gacgtgttcc tccgcttcat gtgtcatcac 2580
gcggtgcgaa tcaggggaaa atcatacgtg cagtgccagg gaatcccaca aggcagcatt 2640
ctgtcgactc tcttgtgttc cctttgctac ggcgatatgg aaaacaagct gttcgctggg 2700
atcagacggg acgggttgct gctcagactg gtggacgact tcctgctggt gactccgcac 2760
ctcactcacg ccaaaacctt tctccgcact ctggtgaggg gagtgccaga atacggctgt 2820
gtggtcaatc tccggaaaac tgtggtgaat ttccctgtcg aggatgaggc actcggagga 2880
accgcatttg tccaaatgcc agcacatggc ctgttcccat ggtgcggtct gctgctggac 2940
acccgaactc ttgaagtgca gtccgactac tccagctatg cccggacgag catccgcgcc 3000
agcctcactt tcaatcgcgg ctttaaggcc ggacgaaaca tgcgcagaaa gcttttcgga 3060
gtcctccggc ttaaatgcca ttcgctcttt ctcgatctcc aagtcaattc gctgcagacc 3120
gtgtgcacga acatctacaa gatcctgctg ctccaagcct accggttcca cgcttgcgtg 3180
cttcagctgc cgtttcacca acaggtgtgg aagaacccga ccttctttct gcgggtcatt 3240
agcgatactg cctccctgtg ttactcaatc ctcaaggcaa agaacgccgg aatgtcgctg 3300
ggtgcgaaag gagccgcggg acctcttcct agcgaagcgg tgcagtggct ctgccaccag 3360
gctttcctcc tgaagctgac caggcacaga gtgacctacg tcccgctgct gggctcgctg 3420
cgcactgcac agacccagct gtctagaaaa ctccccggca ccaccctgac cgctctggaa 3480
gccgccgcca acccagcatt gccgtcagat ttcaagacca tcttggacgg atccggcaca 3540
atcctgtctg agggcgccac caacttcagc ctgctgaaac tggccggcga cgtggaactg 3600
aaccctggcc ctacccctgg aacccagagc cccttcttcc ttctgctgct gctgaccgtg 3660
ctgactgtcg tgacaggctc tggccacgcc agctctacac ctggcggcga gaaagagaca 3720
agcgccaccc agagaagcag cgtgccaagc agcaccgaga agaacgccgt gtccatgacc 3780
agctccgtgc tgagcagcca ctctcctggc agcggcagca gcacaacaca gggccaggat 3840
gtgacactgg cccctgccac agaacctgcc tctggatctg ccgccacctg gggacaggac 3900
gtgacaagcg tgccagtgac cagacctgcc ctgggctcta caacaccccc tgcccacgat 3960
gtgaccagcg cccctgataa caagcctgcc cctggaagca cagcccctcc agctcatggc 4020
gtgacctctg ccccagatac cagaccagcc ccaggatcta cagccccacc cgcacacggc 4080
gtgacaagtg cccctgacac aagacccgct ccaggctcta ctgctcctcc tgcccatggc 4140
gtgacaagcg ctcccgatac aaggccagct cctggctcca cagcaccacc agcacatggc 4200
gtgacatcag ctcccgacac tagacctgct cccggatcaa ccgctccacc agctcacggc 4260
gtgaccagcg cacctgatac cagacctgct ctgggaagca ccgcccctcc cgtgcacaat 4320
gtgacatctg cttccggcag cgccagcggc tctgcctcta cactggtgca caacggcacc 4380
agcgccagag ccacaacaac cccagccagc aagagcaccc ccttcagcat ccctagccac 4440
cacagcgaca cccctaccac actggccagc cactccacca agaccgatgc ctctagcacc 4500
caccactcca gcgtgccccc tctgaccagc agcaaccaca gcacaagccc ccagctgtct 4560
accggcgtct cattcttctt tctgtccttc cacatcagca acctgcagtt caacagcagc 4620
ctggaagatc ccagcaccga ctactaccag gaactgcagc gggatatcag cgagatgttc 4680
ctgcaaatct acaagcaggg cggcttcctg ggcctgagca acatcaagtt cagacccggc 4740
agcgtggtgg tgcagctgac cctggctttc cgggaaggca ccatcaacgt gcacgacgtg 4800
gaaacccagt tcaaccagta caagaccgag gccgccagcc ggtacaacct gaccatctcc 4860
gatgtgtccg tgtccgacgt gcccttccca ttctctgccc agtctggcgc aggcgtgcca 4920
ggatggggaa ttgctctgct ggtgctcgtg tgcgtgctgg tggccctggc catcgtgtat 4980
ctgattgccc tggccgtgtg ccagtgccgg cggaagaatt acggccagct ggacatcttc 5040
cccgccagag acacctacca ccccatgagc gagtacccca cataccacac ccacggcaga 5100
tacgtgccac ccagctccac cgacagatcc ccctacgaga aagtgtctgc cggcaacggc 5160
ggcagctccc tgagctacac aaatcctgcc gtggccgctg cctccgccaa cctg 5214
<210> 60
<211> 1738
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 60
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
565 570 575
Gly Asp Val Glu Glu Asn Pro Gly Pro Ala Lys Phe Leu His Trp Leu
580 585 590
Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr
595 600 605
Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val
610 615 620
Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val
625 630 635 640
Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala
645 650 655
Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp
660 665 670
Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr
675 680 685
Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala
690 695 700
Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu
705 710 715 720
Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr
725 730 735
Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe
740 745 750
Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg
755 760 765
Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys
770 775 780
Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg
785 790 795 800
Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr
805 810 815
Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp
820 825 830
Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly
835 840 845
Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile
850 855 860
Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile
865 870 875 880
Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys
885 890 895
Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp
900 905 910
Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu
915 920 925
Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu
930 935 940
Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly
945 950 955 960
Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly
965 970 975
Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser
980 985 990
Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe
995 1000 1005
Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg
1010 1015 1020
Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu
1025 1030 1035
Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala
1040 1045 1050
Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln
1055 1060 1065
Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr
1070 1075 1080
Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met
1085 1090 1095
Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
1100 1105 1110
Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg
1115 1120 1125
His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala
1130 1135 1140
Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala
1145 1150 1155
Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr
1160 1165 1170
Ile Leu Asp Gly Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn
1175 1180 1185
Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu Leu Asn Pro Gly
1190 1195 1200
Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu
1205 1210 1215
Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
1220 1225 1230
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val
1235 1240 1245
Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val
1250 1255 1260
Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly
1265 1270 1275
Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser
1280 1285 1290
Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg
1295 1300 1305
Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr Ser
1310 1315 1320
Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
1325 1330 1335
His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
1340 1345 1350
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
1355 1360 1365
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser
1370 1375 1380
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
1385 1390 1395
His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
1400 1405 1410
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
1415 1420 1425
Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser
1430 1435 1440
Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn
1445 1450 1455
Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
1460 1465 1470
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu
1475 1480 1485
Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser
1490 1495 1500
Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln
1505 1510 1515
Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser
1520 1525 1530
Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr
1535 1540 1545
Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile
1550 1555 1560
Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg
1565 1570 1575
Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly
1580 1585 1590
Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys
1595 1600 1605
Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser
1610 1615 1620
Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly
1625 1630 1635
Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu
1640 1645 1650
Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln
1655 1660 1665
Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg
1670 1675 1680
Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His
1685 1690 1695
Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
1700 1705 1710
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn
1715 1720 1725
Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1730 1735
<210> 61
<211> 5808
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 61
atggctagca cccctggaac ccagagcccc ttcttccttc tgctgctgct gaccgtgctg 60
actgtcgtga caggctctgg ccacgccagc tctacacctg gcggcgagaa agagacaagc 120
gccacccaga gaagcagcgt gccaagcagc accgagaaga acgccgtgtc catgaccagc 180
tccgtgctga gcagccactc tcctggcagc ggcagcagca caacacaggg ccaggatgtg 240
acactggccc ctgccacaga acctgcctct ggatctgccg ccacctgggg acaggacgtg 300
acaagcgtgc cagtgaccag acctgccctg ggctctacaa caccccctgc ccacgatgtg 360
accagcgccc ctgataacaa gcctgcccct ggaagcacag cccctccagc tcatggcgtg 420
acctctgccc cagataccag accagcccca ggatctacag ccccacccgc acacggcgtg 480
acaagtgccc ctgacacaag acccgctcca ggctctactg ctcctcctgc ccatggcgtg 540
acaagcgctc ccgatacaag gccagctcct ggctccacag caccaccagc acatggcgtg 600
acatcagctc ccgacactag acctgctccc ggatcaaccg ctccaccagc tcacggcgtg 660
accagcgcac ctgataccag acctgctctg ggaagcaccg cccctcccgt gcacaatgtg 720
acatctgctt ccggcagcgc cagcggctct gcctctacac tggtgcacaa cggcaccagc 780
gccagagcca caacaacccc agccagcaag agcaccccct tcagcatccc tagccaccac 840
agcgacaccc ctaccacact ggccagccac tccaccaaga ccgatgcctc tagcacccac 900
cactccagcg tgccccctct gaccagcagc aaccacagca caagccccca gctgtctacc 960
ggcgtctcat tcttctttct gtccttccac atcagcaacc tgcagttcaa cagcagcctg 1020
gaagatccca gcaccgacta ctaccaggaa ctgcagcggg atatcagcga gatgttcctg 1080
caaatctaca agcagggcgg cttcctgggc ctgagcaaca tcaagttcag acccggcagc 1140
gtggtggtgc agctgaccct ggctttccgg gaaggcacca tcaacgtgca cgacgtggaa 1200
acccagttca accagtacaa gaccgaggcc gccagccggt acaacctgac catctccgat 1260
gtgtccgtgt ccgacgtgcc cttcccattc tctgcccagt ctggcgcagg cgtgccagga 1320
tggggaattg ctctgctggt gctcgtgtgc gtgctggtgg ccctggccat cgtgtatctg 1380
attgccctgg ccgtgtgcca gtgccggcgg aagaattacg gccagctgga catcttcccc 1440
gccagagaca cctaccaccc catgagcgag taccccacat accacaccca cggcagatac 1500
gtgccaccca gctccaccga cagatccccc tacgagaaag tgtctgccgg caacggcggc 1560
agctccctga gctacacaaa tcctgccgtg gccgctgcct ccgccaacct gggatccggc 1620
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcgag 1680
acaaaccctg gccccctggc tggcgagaca ggacaggaag ccgctcctct ggacggcgtg 1740
ctggccaacc ctcccaatat cagcagcctg agccccagac agctgctggg attcccttgt 1800
gccgaggtgt ccggcctgag cacagagaga gtgcgggaac tggctgtggc cctggcccag 1860
aaaaacgtga agctgagcac cgagcagctg cggtgcctgg cccacagact gtctgagcct 1920
cccgaggatc tggacgccct gcctctggat ctgctgctgt tcctgaaccc cgacgccttc 1980
agcggacctc aggcctgcac ccggttcttc agcagaatca ccaaggccaa cgtggacctg 2040
ctgcccagag gcgcccctga gagacagaga ctgctgcctg ctgctctggc ctgttgggga 2100
gtgcggggct ctctgctgtc tgaagctgat gtgcgggccc tgggaggcct ggcttgtgat 2160
ctgcctggaa gattcgtggc cgagagcgcc gaagtgctgc tgcctagact ggtgtcctgt 2220
cccggccctc tggaccagga tcagcaggaa gctgccagag ctgctctgca gggcggaggc 2280
cctccttatg gacctcctag cacttggagc gtgtccacca tggatgccct gaggggcctg 2340
ctgccagtgc tgggccagcc tatcatcaga tccatcccac agggcatcgt ggccgcctgg 2400
cggcagagaa gctctagaga tccctcttgg cggcagcccg agcggacaat cctgcggccc 2460
aggtttcgga gagaggtgga aaagaccgcc tgcccctctg gcaagaaggc cagagagatc 2520
gacgagagcc tgatcttcta caagaagtgg gagctggaag cctgcgtgga cgccgctctg 2580
ctggccaccc agatggacag agtgaacgcc atccccttca cctatgagca gctggacgtg 2640
ctgaagcaca agctggatga gctgtacccc cagggctacc ccgagagcgt gatccagcac 2700
ctgggctacc tgtttctgaa gatgagcccc gaggacatcc ggaagtggaa cgtgaccagc 2760
ctggaaaccc tgaaggccct gctggaagtg aacaagggcc acgagatgtc cccccaggtg 2820
gccacactga tcgacagatt cgtgaagggc agaggccagc tggacaagga caccctggat 2880
acactgaccg ccttctaccc cggctatctg tgcagcctgt cccccgagga actgagcagc 2940
gtgccaccta gctctatctg ggctgtgcgg ccccaggacc tggatacctg cgatcctaga 3000
cagctggatg tgctgtatcc caaggcccgg ctggccttcc agaacatgaa cggcagcgag 3060
tacttcgtga agatccagtc cttcctgggc ggagccccta ccgaggacct gaaagctctg 3120
agccagcaga acgtgtccat ggatctggcc acctttatga agctgcggac cgacgccgtg 3180
ctgcctctga cagtggctga ggtgcagaaa ctgctgggcc cccatgtgga agggctgaag 3240
gccgaagaac ggcacagacc cgtgcgcgac tggatcctga ggcagagaca ggatgacctg 3300
gacacactgg gcctgggact gcaggggggc atccctaatg gctacctggt gctggacctg 3360
agcatgcagg aagccctggg atccggcgag ggcagaggca gcctgctgac atgtggcgac 3420
gtggaagaga accctggccc cagcttcctc ctgtcgtcgc tcagaccgag cctgaccgga 3480
gcacgcagat tggtggaaac tatcttcctt gggtcacgtc cgtggatgcc aggtacccca 3540
cggcgcctcc cgcgcctccc acagagatac tggcagatgc ggcctctgtt cctggaattg 3600
ctgggaaacc acgctcagtg cccgtacgga gtcctgctca agactcactg ccctctgagg 3660
gcggcggtca ctccggcggc cggagtgtgc gcacgggaga agccccaggg aagcgtggca 3720
gctccggaag aggaggacac cgatccgcgc cgcctcgtgc aacttctgcg ccagcactcc 3780
tcgccctggc aagtctacgg gttcgtccgc gcctgcctgc gccgcctggt gccgcctggg 3840
ctctggggtt cccggcataa cgagcgccgc ttcctgagaa atactaagaa gtttatctca 3900
cttggaaaac atgccaagtt gtcgctgcaa gaactcacgt ggaagatgtc agtccgcgat 3960
tgcgcctggc tgcgccgctc gccgggcgtc gggtgtgttc cagctgcaga acaccgcctg 4020
agagaagaaa ttctggccaa atttctgcat tggctgatgt cagtgtacgt ggtcgagctg 4080
ctgcgctcct ttttctacgt cactgagact acctttcaaa agaaccgcct gttcttctac 4140
cgcaaatctg tgtggagcaa gctgcagtca atcggcattc gccagcatct gaagagggtg 4200
cagctgcggg aactttccga ggcagaagtc cgccagcacc gggaggcccg gccggcgctt 4260
ctcacgtcgc gtctgagatt catcccaaag cccgacgggc tgaggcctat cgtcaacatg 4320
gattacgtcg tgggcgctcg cacctttcgc cgtgaaaagc gggccgaacg cttgacctca 4380
cgggtgaagg ccctcttctc cgtgctgaac tacgagagag caagacggcc tggcctgctg 4440
ggagcttcgg tgctgggact ggacgatatc caccgggctt ggcggacctt tgttctccgg 4500
gtgagagccc aagaccctcc gccggaactg tacttcgtga aggtggcgat caccggagcc 4560
tatgatacta ttccgcaaga tcgactcacc gaagtcatcg cctcgatcat caaaccgcag 4620
aacacttact gcgtcaggcg gtacgccgtg gtccagaagg ccgcgcatgg ccacgtgaga 4680
aaggcgttca agtcgcacgt gtccactctc accgacctcc agccttacat gaggcaattc 4740
gttgcgcatt tgcaagagac ttcgcccctg agagatgcgg tggtcatcga gcagagctcc 4800
agcctgaacg aagcgagcag cggtctgttt gacgtgttcc tccgcttcat gtgtcatcac 4860
gcggtgcgaa tcaggggaaa atcatacgtg cagtgccagg gaatcccaca aggcagcatt 4920
ctgtcgactc tcttgtgttc cctttgctac ggcgatatgg aaaacaagct gttcgctggg 4980
atcagacggg acgggttgct gctcagactg gtggacgact tcctgctggt gactccgcac 5040
ctcactcacg ccaaaacctt tctccgcact ctggtgaggg gagtgccaga atacggctgt 5100
gtggtcaatc tccggaaaac tgtggtgaat ttccctgtcg aggatgaggc actcggagga 5160
accgcatttg tccaaatgcc agcacatggc ctgttcccat ggtgcggtct gctgctggac 5220
acccgaactc ttgaagtgca gtccgactac tccagctatg cccggacgag catccgcgcc 5280
agcctcactt tcaatcgcgg ctttaaggcc ggacgaaaca tgcgcagaaa gcttttcgga 5340
gtcctccggc ttaaatgcca ttcgctcttt ctcgatctcc aagtcaattc gctgcagacc 5400
gtgtgcacga acatctacaa gatcctgctg ctccaagcct accggttcca cgcttgcgtg 5460
cttcagctgc cgtttcacca acaggtgtgg aagaacccga ccttctttct gcgggtcatt 5520
agcgatactg cctccctgtg ttactcaatc ctcaaggcaa agaacgccgg aatgtcgctg 5580
ggtgcgaaag gagccgcggg acctcttcct agcgaagcgg tgcagtggct ctgccaccag 5640
gctttcctcc tgaagctgac caggcacaga gtgacctacg tcccgctgct gggctcgctg 5700
cgcactgcac agacccagct gtctagaaaa ctccccggca ccaccctgac cgctctggaa 5760
gccgccgcca acccagcatt gccgtcagat ttcaagacca tcttggac 5808
<210> 62
<211> 1936
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 62
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu
1 5 10 15
Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr
20 25 30
Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro
35 40 45
Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser
50 55 60
Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val
65 70 75 80
Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp
85 90 95
Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser
100 105 110
Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro
115 120 125
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
130 135 140
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val
145 150 155 160
Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro
165 170 175
Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser
180 185 190
Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro
195 200 205
Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
210 215 220
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val
225 230 235 240
Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His
245 250 255
Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr
260 265 270
Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala
275 280 285
Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val
290 295 300
Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr
305 310 315 320
Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe
325 330 335
Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln
340 345 350
Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe
355 360 365
Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln
370 375 380
Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu
385 390 395 400
Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu
405 410 415
Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala
420 425 430
Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu
435 440 445
Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
450 455 460
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro
465 470 475 480
Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr
485 490 495
His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu
500 505 510
Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro
515 520 525
Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser Gly Arg Ile Phe Asn
530 535 540
Ala His Tyr Ala Gly Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu
545 550 555 560
Thr Asn Pro Gly Pro Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro
565 570 575
Leu Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro
580 585 590
Arg Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr
595 600 605
Glu Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys
610 615 620
Leu Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro
625 630 635 640
Pro Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn
645 650 655
Pro Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg
660 665 670
Ile Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg
675 680 685
Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser
690 695 700
Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp
705 710 715 720
Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg
725 730 735
Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala
740 745 750
Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr
755 760 765
Trp Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu
770 775 780
Gly Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp
785 790 795 800
Arg Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr
805 810 815
Ile Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro
820 825 830
Ser Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys
835 840 845
Lys Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln
850 855 860
Met Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val
865 870 875 880
Leu Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser
885 890 895
Val Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp
900 905 910
Ile Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu
915 920 925
Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile
930 935 940
Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp
945 950 955 960
Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu
965 970 975
Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln
980 985 990
Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys
995 1000 1005
Ala Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val
1010 1015 1020
Lys Ile Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys
1025 1030 1035
Ala Leu Ser Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met
1040 1045 1050
Lys Leu Arg Thr Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val
1055 1060 1065
Gln Lys Leu Leu Gly Pro His Val Glu Gly Leu Lys Ala Glu Glu
1070 1075 1080
Arg His Arg Pro Val Arg Asp Trp Ile Leu Arg Gln Arg Gln Asp
1085 1090 1095
Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln Gly Gly Ile Pro Asn
1100 1105 1110
Gly Tyr Leu Val Leu Asp Leu Ser Met Gln Glu Ala Leu Gly Ser
1115 1120 1125
Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu Glu
1130 1135 1140
Asn Pro Gly Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu
1145 1150 1155
Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg
1160 1165 1170
Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln
1175 1180 1185
Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn
1190 1195 1200
His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro
1205 1210 1215
Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
1220 1225 1230
Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp
1235 1240 1245
Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp
1250 1255 1260
Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro
1265 1270 1275
Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg
1280 1285 1290
Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser
1295 1300 1305
Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp
1310 1315 1320
Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His
1325 1330 1335
Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met
1340 1345 1350
Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr
1355 1360 1365
Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser
1370 1375 1380
Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys
1385 1390 1395
Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His
1400 1405 1410
Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile
1415 1420 1425
Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val
1430 1435 1440
Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu
1445 1450 1455
Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
1460 1465 1470
Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp
1475 1480 1485
Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala
1490 1495 1500
Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr
1505 1510 1515
Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile
1520 1525 1530
Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr
1535 1540 1545
Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe
1550 1555 1560
Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg
1565 1570 1575
Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala
1580 1585 1590
Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly
1595 1600 1605
Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg
1610 1615 1620
Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly
1625 1630 1635
Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met
1640 1645 1650
Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu
1655 1660 1665
Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His
1670 1675 1680
Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr
1685 1690 1695
Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
1700 1705 1710
Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala
1715 1720 1725
His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr
1730 1735 1740
Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile
1745 1750 1755
Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn
1760 1765 1770
Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser
1775 1780 1785
Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr
1790 1795 1800
Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala
1805 1810 1815
Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro
1820 1825 1830
Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr
1835 1840 1845
Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys
1850 1855 1860
Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys
1865 1870 1875
His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr
1880 1885 1890
Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser
1895 1900 1905
Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala
1910 1915 1920
Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
1925 1930 1935
<210> 63
<211> 5829
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 63
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cagaatcttc aacgcccact acgccggcta cttcgccgac 1740
ctgctgatcc acgacatcga gacaaaccct ggccccaccc ctggaaccca gagccccttc 1800
ttccttctgc tgctgctgac cgtgctgact gtcgtgacag gctctggcca cgccagctct 1860
acacctggcg gcgagaaaga gacaagcgcc acccagagaa gcagcgtgcc aagcagcacc 1920
gagaagaacg ccgtgtccat gaccagctcc gtgctgagca gccactctcc tggcagcggc 1980
agcagcacaa cacagggcca ggatgtgaca ctggcccctg ccacagaacc tgcctctgga 2040
tctgccgcca cctggggaca ggacgtgaca agcgtgccag tgaccagacc tgccctgggc 2100
tctacaacac cccctgccca cgatgtgacc agcgcccctg ataacaagcc tgcccctgga 2160
agcacagccc ctccagctca tggcgtgacc tctgccccag ataccagacc agccccagga 2220
tctacagccc cacccgcaca cggcgtgaca agtgcccctg acacaagacc cgctccaggc 2280
tctactgctc ctcctgccca tggcgtgaca agcgctcccg atacaaggcc agctcctggc 2340
tccacagcac caccagcaca tggcgtgaca tcagctcccg acactagacc tgctcccgga 2400
tcaaccgctc caccagctca cggcgtgacc agcgcacctg ataccagacc tgctctggga 2460
agcaccgccc ctcccgtgca caatgtgaca tctgcttccg gcagcgccag cggctctgcc 2520
tctacactgg tgcacaacgg caccagcgcc agagccacaa caaccccagc cagcaagagc 2580
acccccttca gcatccctag ccaccacagc gacaccccta ccacactggc cagccactcc 2640
accaagaccg atgcctctag cacccaccac tccagcgtgc cccctctgac cagcagcaac 2700
cacagcacaa gcccccagct gtctaccggc gtctcattct tctttctgtc cttccacatc 2760
agcaacctgc agttcaacag cagcctggaa gatcccagca ccgactacta ccaggaactg 2820
cagcgggata tcagcgagat gttcctgcaa atctacaagc agggcggctt cctgggcctg 2880
agcaacatca agttcagacc cggcagcgtg gtggtgcagc tgaccctggc tttccgggaa 2940
ggcaccatca acgtgcacga cgtggaaacc cagttcaacc agtacaagac cgaggccgcc 3000
agccggtaca acctgaccat ctccgatgtg tccgtgtccg acgtgccctt cccattctct 3060
gcccagtctg gcgcaggcgt gccaggatgg ggaattgctc tgctggtgct cgtgtgcgtg 3120
ctggtggccc tggccatcgt gtatctgatt gccctggccg tgtgccagtg ccggcggaag 3180
aattacggcc agctggacat cttccccgcc agagacacct accaccccat gagcgagtac 3240
cccacatacc acacccacgg cagatacgtg ccacccagct ccaccgacag atccccctac 3300
gagaaagtgt ctgccggcaa cggcggcagc tccctgagct acacaaatcc tgccgtggcc 3360
gctgcctccg ccaacctggg atccggcaca atcctgtctg agggcgccac caacttcagc 3420
ctgctgaaac tggccggcga cgtggaactg aaccctggcc ctagcttcct cctgtcgtcg 3480
ctcagaccga gcctgaccgg agcacgcaga ttggtggaaa ctatcttcct tgggtcacgt 3540
ccgtggatgc caggtacccc acggcgcctc ccgcgcctcc cacagagata ctggcagatg 3600
cggcctctgt tcctggaatt gctgggaaac cacgctcagt gcccgtacgg agtcctgctc 3660
aagactcact gccctctgag ggcggcggtc actccggcgg ccggagtgtg cgcacgggag 3720
aagccccagg gaagcgtggc agctccggaa gaggaggaca ccgatccgcg ccgcctcgtg 3780
caacttctgc gccagcactc ctcgccctgg caagtctacg ggttcgtccg cgcctgcctg 3840
cgccgcctgg tgccgcctgg gctctggggt tcccggcata acgagcgccg cttcctgaga 3900
aatactaaga agtttatctc acttggaaaa catgccaagt tgtcgctgca agaactcacg 3960
tggaagatgt cagtccgcga ttgcgcctgg ctgcgccgct cgccgggcgt cgggtgtgtt 4020
ccagctgcag aacaccgcct gagagaagaa attctggcca aatttctgca ttggctgatg 4080
tcagtgtacg tggtcgagct gctgcgctcc tttttctacg tcactgagac tacctttcaa 4140
aagaaccgcc tgttcttcta ccgcaaatct gtgtggagca agctgcagtc aatcggcatt 4200
cgccagcatc tgaagagggt gcagctgcgg gaactttccg aggcagaagt ccgccagcac 4260
cgggaggccc ggccggcgct tctcacgtcg cgtctgagat tcatcccaaa gcccgacggg 4320
ctgaggccta tcgtcaacat ggattacgtc gtgggcgctc gcacctttcg ccgtgaaaag 4380
cgggccgaac gcttgacctc acgggtgaag gccctcttct ccgtgctgaa ctacgagaga 4440
gcaagacggc ctggcctgct gggagcttcg gtgctgggac tggacgatat ccaccgggct 4500
tggcggacct ttgttctccg ggtgagagcc caagaccctc cgccggaact gtacttcgtg 4560
aaggtggcga tcaccggagc ctatgatact attccgcaag atcgactcac cgaagtcatc 4620
gcctcgatca tcaaaccgca gaacacttac tgcgtcaggc ggtacgccgt ggtccagaag 4680
gccgcgcatg gccacgtgag aaaggcgttc aagtcgcacg tgtccactct caccgacctc 4740
cagccttaca tgaggcaatt cgttgcgcat ttgcaagaga cttcgcccct gagagatgcg 4800
gtggtcatcg agcagagctc cagcctgaac gaagcgagca gcggtctgtt tgacgtgttc 4860
ctccgcttca tgtgtcatca cgcggtgcga atcaggggaa aatcatacgt gcagtgccag 4920
ggaatcccac aaggcagcat tctgtcgact ctcttgtgtt ccctttgcta cggcgatatg 4980
gaaaacaagc tgttcgctgg gatcagacgg gacgggttgc tgctcagact ggtggacgac 5040
ttcctgctgg tgactccgca cctcactcac gccaaaacct ttctccgcac tctggtgagg 5100
ggagtgccag aatacggctg tgtggtcaat ctccggaaaa ctgtggtgaa tttccctgtc 5160
gaggatgagg cactcggagg aaccgcattt gtccaaatgc cagcacatgg cctgttccca 5220
tggtgcggtc tgctgctgga cacccgaact cttgaagtgc agtccgacta ctccagctat 5280
gcccggacga gcatccgcgc cagcctcact ttcaatcgcg gctttaaggc cggacgaaac 5340
atgcgcagaa agcttttcgg agtcctccgg cttaaatgcc attcgctctt tctcgatctc 5400
caagtcaatt cgctgcagac cgtgtgcacg aacatctaca agatcctgct gctccaagcc 5460
taccggttcc acgcttgcgt gcttcagctg ccgtttcacc aacaggtgtg gaagaacccg 5520
accttctttc tgcgggtcat tagcgatact gcctccctgt gttactcaat cctcaaggca 5580
aagaacgccg gaatgtcgct gggtgcgaaa ggagccgcgg gacctcttcc tagcgaagcg 5640
gtgcagtggc tctgccacca ggctttcctc ctgaagctga ccaggcacag agtgacctac 5700
gtcccgctgc tgggctcgct gcgcactgca cagacccagc tgtctagaaa actccccggc 5760
accaccctga ccgctctgga agccgccgcc aacccagcat tgccgtcaga tttcaagacc 5820
atcttggac 5829
<210> 64
<211> 1943
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 64
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Arg Ile Phe Asn Ala His Tyr Ala Gly
565 570 575
Tyr Phe Ala Asp Leu Leu Ile His Asp Ile Glu Thr Asn Pro Gly Pro
580 585 590
Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val
595 600 605
Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly
610 615 620
Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr
625 630 635 640
Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser
645 650 655
Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala
660 665 670
Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp
675 680 685
Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro
690 695 700
Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly
705 710 715 720
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
725 730 735
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala
740 745 750
Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly
755 760 765
Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro
770 775 780
Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly
785 790 795 800
Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg
805 810 815
Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala
820 825 830
Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr
835 840 845
Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser
850 855 860
Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser
865 870 875 880
Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu
885 890 895
Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser
900 905 910
Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser
915 920 925
Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile
930 935 940
Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu
945 950 955 960
Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu
965 970 975
Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe
980 985 990
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser
995 1000 1005
Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser
1010 1015 1020
Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val
1025 1030 1035
Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
1040 1045 1050
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1055 1060 1065
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1070 1075 1080
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1085 1090 1095
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1100 1105 1110
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu Gly Ser
1115 1120 1125
Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu Lys
1130 1135 1140
Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro Ser Phe Leu Leu
1145 1150 1155
Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu
1160 1165 1170
Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg
1175 1180 1185
Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu
1190 1195 1200
Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val
1205 1210 1215
Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala
1220 1225 1230
Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala
1235 1240 1245
Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu
1250 1255 1260
Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala
1265 1270 1275
Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His
1280 1285 1290
Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu
1295 1300 1305
Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met
1310 1315 1320
Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly
1325 1330 1335
Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala
1340 1345 1350
Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
1355 1360 1365
Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg
1370 1375 1380
Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile
1385 1390 1395
Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser
1400 1405 1410
Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu
1415 1420 1425
Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro
1430 1435 1440
Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg
1445 1450 1455
Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe
1460 1465 1470
Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly
1475 1480 1485
Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr
1490 1495 1500
Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr
1505 1510 1515
Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp Thr Ile Pro Gln
1520 1525 1530
Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn
1535 1540 1545
Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His
1550 1555 1560
Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr
1565 1570 1575
Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu
1580 1585 1590
Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
1595 1600 1605
Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe
1610 1615 1620
Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln
1625 1630 1635
Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys
1640 1645 1650
Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile
1655 1660 1665
Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu
1670 1675 1680
Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu
1685 1690 1695
Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys
1700 1705 1710
Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr
1715 1720 1725
Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly
1730 1735 1740
Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser
1745 1750 1755
Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg
1760 1765 1770
Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val
1775 1780 1785
Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn
1790 1795 1800
Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu
1805 1810 1815
Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His
1820 1825 1830
Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
1835 1840 1845
Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala
1850 1855 1860
Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser
1865 1870 1875
Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu
1880 1885 1890
Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg
1895 1900 1905
Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu
1910 1915 1920
Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe
1925 1930 1935
Lys Thr Ile Leu Asp
1940
<210> 65
<211> 5808
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 65
atggctagcc tggctggcga gacaggacag gaagccgctc ctctggacgg cgtgctggcc 60
aaccctccca atatcagcag cctgagcccc agacagctgc tgggattccc ttgtgccgag 120
gtgtccggcc tgagcacaga gagagtgcgg gaactggctg tggccctggc ccagaaaaac 180
gtgaagctga gcaccgagca gctgcggtgc ctggcccaca gactgtctga gcctcccgag 240
gatctggacg ccctgcctct ggatctgctg ctgttcctga accccgacgc cttcagcgga 300
cctcaggcct gcacccggtt cttcagcaga atcaccaagg ccaacgtgga cctgctgccc 360
agaggcgccc ctgagagaca gagactgctg cctgctgctc tggcctgttg gggagtgcgg 420
ggctctctgc tgtctgaagc tgatgtgcgg gccctgggag gcctggcttg tgatctgcct 480
ggaagattcg tggccgagag cgccgaagtg ctgctgccta gactggtgtc ctgtcccggc 540
cctctggacc aggatcagca ggaagctgcc agagctgctc tgcagggcgg aggccctcct 600
tatggacctc ctagcacttg gagcgtgtcc accatggatg ccctgagggg cctgctgcca 660
gtgctgggcc agcctatcat cagatccatc ccacagggca tcgtggccgc ctggcggcag 720
agaagctcta gagatccctc ttggcggcag cccgagcgga caatcctgcg gcccaggttt 780
cggagagagg tggaaaagac cgcctgcccc tctggcaaga aggccagaga gatcgacgag 840
agcctgatct tctacaagaa gtgggagctg gaagcctgcg tggacgccgc tctgctggcc 900
acccagatgg acagagtgaa cgccatcccc ttcacctatg agcagctgga cgtgctgaag 960
cacaagctgg atgagctgta cccccagggc taccccgaga gcgtgatcca gcacctgggc 1020
tacctgtttc tgaagatgag ccccgaggac atccggaagt ggaacgtgac cagcctggaa 1080
accctgaagg ccctgctgga agtgaacaag ggccacgaga tgtcccccca ggtggccaca 1140
ctgatcgaca gattcgtgaa gggcagaggc cagctggaca aggacaccct ggatacactg 1200
accgccttct accccggcta tctgtgcagc ctgtcccccg aggaactgag cagcgtgcca 1260
cctagctcta tctgggctgt gcggccccag gacctggata cctgcgatcc tagacagctg 1320
gatgtgctgt atcccaaggc ccggctggcc ttccagaaca tgaacggcag cgagtacttc 1380
gtgaagatcc agtccttcct gggcggagcc cctaccgagg acctgaaagc tctgagccag 1440
cagaacgtgt ccatggatct ggccaccttt atgaagctgc ggaccgacgc cgtgctgcct 1500
ctgacagtgg ctgaggtgca gaaactgctg ggcccccatg tggaagggct gaaggccgaa 1560
gaacggcaca gacccgtgcg cgactggatc ctgaggcaga gacaggatga cctggacaca 1620
ctgggcctgg gactgcaggg gggcatccct aatggctacc tggtgctgga cctgagcatg 1680
caggaagccc tgggatccgg cgagggcaga ggcagcctgc tgacatgtgg cgacgtggaa 1740
gagaaccctg gccccagctt cctcctgtcg tcgctcagac cgagcctgac cggagcacgc 1800
agattggtgg aaactatctt ccttgggtca cgtccgtgga tgccaggtac cccacggcgc 1860
ctcccgcgcc tcccacagag atactggcag atgcggcctc tgttcctgga attgctggga 1920
aaccacgctc agtgcccgta cggagtcctg ctcaagactc actgccctct gagggcggcg 1980
gtcactccgg cggccggagt gtgcgcacgg gagaagcccc agggaagcgt ggcagctccg 2040
gaagaggagg acaccgatcc gcgccgcctc gtgcaacttc tgcgccagca ctcctcgccc 2100
tggcaagtct acgggttcgt ccgcgcctgc ctgcgccgcc tggtgccgcc tgggctctgg 2160
ggttcccggc ataacgagcg ccgcttcctg agaaatacta agaagtttat ctcacttgga 2220
aaacatgcca agttgtcgct gcaagaactc acgtggaaga tgtcagtccg cgattgcgcc 2280
tggctgcgcc gctcgccggg cgtcgggtgt gttccagctg cagaacaccg cctgagagaa 2340
gaaattctgg ccaaatttct gcattggctg atgtcagtgt acgtggtcga gctgctgcgc 2400
tcctttttct acgtcactga gactaccttt caaaagaacc gcctgttctt ctaccgcaaa 2460
tctgtgtgga gcaagctgca gtcaatcggc attcgccagc atctgaagag ggtgcagctg 2520
cgggaacttt ccgaggcaga agtccgccag caccgggagg cccggccggc gcttctcacg 2580
tcgcgtctga gattcatccc aaagcccgac gggctgaggc ctatcgtcaa catggattac 2640
gtcgtgggcg ctcgcacctt tcgccgtgaa aagcgggccg aacgcttgac ctcacgggtg 2700
aaggccctct tctccgtgct gaactacgag agagcaagac ggcctggcct gctgggagct 2760
tcggtgctgg gactggacga tatccaccgg gcttggcgga cctttgttct ccgggtgaga 2820
gcccaagacc ctccgccgga actgtacttc gtgaaggtgg cgatcaccgg agcctatgat 2880
actattccgc aagatcgact caccgaagtc atcgcctcga tcatcaaacc gcagaacact 2940
tactgcgtca ggcggtacgc cgtggtccag aaggccgcgc atggccacgt gagaaaggcg 3000
ttcaagtcgc acgtgtccac tctcaccgac ctccagcctt acatgaggca attcgttgcg 3060
catttgcaag agacttcgcc cctgagagat gcggtggtca tcgagcagag ctccagcctg 3120
aacgaagcga gcagcggtct gtttgacgtg ttcctccgct tcatgtgtca tcacgcggtg 3180
cgaatcaggg gaaaatcata cgtgcagtgc cagggaatcc cacaaggcag cattctgtcg 3240
actctcttgt gttccctttg ctacggcgat atggaaaaca agctgttcgc tgggatcaga 3300
cgggacgggt tgctgctcag actggtggac gacttcctgc tggtgactcc gcacctcact 3360
cacgccaaaa cctttctccg cactctggtg aggggagtgc cagaatacgg ctgtgtggtc 3420
aatctccgga aaactgtggt gaatttccct gtcgaggatg aggcactcgg aggaaccgca 3480
tttgtccaaa tgccagcaca tggcctgttc ccatggtgcg gtctgctgct ggacacccga 3540
actcttgaag tgcagtccga ctactccagc tatgcccgga cgagcatccg cgccagcctc 3600
actttcaatc gcggctttaa ggccggacga aacatgcgca gaaagctttt cggagtcctc 3660
cggcttaaat gccattcgct ctttctcgat ctccaagtca attcgctgca gaccgtgtgc 3720
acgaacatct acaagatcct gctgctccaa gcctaccggt tccacgcttg cgtgcttcag 3780
ctgccgtttc accaacaggt gtggaagaac ccgaccttct ttctgcgggt cattagcgat 3840
actgcctccc tgtgttactc aatcctcaag gcaaagaacg ccggaatgtc gctgggtgcg 3900
aaaggagccg cgggacctct tcctagcgaa gcggtgcagt ggctctgcca ccaggctttc 3960
ctcctgaagc tgaccaggca cagagtgacc tacgtcccgc tgctgggctc gctgcgcact 4020
gcacagaccc agctgtctag aaaactcccc ggcaccaccc tgaccgctct ggaagccgcc 4080
gccaacccag cattgccgtc agatttcaag accatcttgg acggatccgg cacaatcctg 4140
tctgagggcg ccaccaactt cagcctgctg aaactggccg gcgacgtgga actgaaccct 4200
ggccctaccc ctggaaccca gagccccttc ttccttctgc tgctgctgac cgtgctgact 4260
gtcgtgacag gctctggcca cgccagctct acacctggcg gcgagaaaga gacaagcgcc 4320
acccagagaa gcagcgtgcc aagcagcacc gagaagaacg ccgtgtccat gaccagctcc 4380
gtgctgagca gccactctcc tggcagcggc agcagcacaa cacagggcca ggatgtgaca 4440
ctggcccctg ccacagaacc tgcctctgga tctgccgcca cctggggaca ggacgtgaca 4500
agcgtgccag tgaccagacc tgccctgggc tctacaacac cccctgccca cgatgtgacc 4560
agcgcccctg ataacaagcc tgcccctgga agcacagccc ctccagctca tggcgtgacc 4620
tctgccccag ataccagacc agccccagga tctacagccc cacccgcaca cggcgtgaca 4680
agtgcccctg acacaagacc cgctccaggc tctactgctc ctcctgccca tggcgtgaca 4740
agcgctcccg atacaaggcc agctcctggc tccacagcac caccagcaca tggcgtgaca 4800
tcagctcccg acactagacc tgctcccgga tcaaccgctc caccagctca cggcgtgacc 4860
agcgcacctg ataccagacc tgctctggga agcaccgccc ctcccgtgca caatgtgaca 4920
tctgcttccg gcagcgccag cggctctgcc tctacactgg tgcacaacgg caccagcgcc 4980
agagccacaa caaccccagc cagcaagagc acccccttca gcatccctag ccaccacagc 5040
gacaccccta ccacactggc cagccactcc accaagaccg atgcctctag cacccaccac 5100
tccagcgtgc cccctctgac cagcagcaac cacagcacaa gcccccagct gtctaccggc 5160
gtctcattct tctttctgtc cttccacatc agcaacctgc agttcaacag cagcctggaa 5220
gatcccagca ccgactacta ccaggaactg cagcgggata tcagcgagat gttcctgcaa 5280
atctacaagc agggcggctt cctgggcctg agcaacatca agttcagacc cggcagcgtg 5340
gtggtgcagc tgaccctggc tttccgggaa ggcaccatca acgtgcacga cgtggaaacc 5400
cagttcaacc agtacaagac cgaggccgcc agccggtaca acctgaccat ctccgatgtg 5460
tccgtgtccg acgtgccctt cccattctct gcccagtctg gcgcaggcgt gccaggatgg 5520
ggaattgctc tgctggtgct cgtgtgcgtg ctggtggccc tggccatcgt gtatctgatt 5580
gccctggccg tgtgccagtg ccggcggaag aattacggcc agctggacat cttccccgcc 5640
agagacacct accaccccat gagcgagtac cccacatacc acacccacgg cagatacgtg 5700
ccacccagct ccaccgacag atccccctac gagaaagtgt ctgccggcaa cggcggcagc 5760
tccctgagct acacaaatcc tgccgtggcc gctgcctccg ccaacctg 5808
<210> 66
<211> 1936
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 66
Met Ala Ser Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp
1 5 10 15
Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln
20 25 30
Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg
35 40 45
Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser
50 55 60
Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu
65 70 75 80
Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp
85 90 95
Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr
100 105 110
Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg
115 120 125
Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu
130 135 140
Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro
145 150 155 160
Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val
165 170 175
Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala
180 185 190
Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser
195 200 205
Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln
210 215 220
Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln
225 230 235 240
Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu
245 250 255
Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly
260 265 270
Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp
275 280 285
Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp
290 295 300
Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys
305 310 315 320
His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile
325 330 335
Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg
340 345 350
Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val
355 360 365
Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg
370 375 380
Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu
385 390 395 400
Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu
405 410 415
Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu
420 425 430
Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg
435 440 445
Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln
450 455 460
Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln
465 470 475 480
Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp
485 490 495
Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro
500 505 510
His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp
515 520 525
Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly
530 535 540
Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met
545 550 555 560
Gln Glu Ala Leu Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys
565 570 575
Gly Asp Val Glu Glu Asn Pro Gly Pro Ser Phe Leu Leu Ser Ser Leu
580 585 590
Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu
595 600 605
Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu
610 615 620
Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly
625 630 635 640
Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro
645 650 655
Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys
660 665 670
Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg
675 680 685
Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr
690 695 700
Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp
705 710 715 720
Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe
725 730 735
Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp
740 745 750
Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val
755 760 765
Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala
770 775 780
Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg
785 790 795 800
Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe
805 810 815
Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg
820 825 830
Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val
835 840 845
Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg
850 855 860
Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr
865 870 875 880
Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu
885 890 895
Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala
900 905 910
Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile
915 920 925
His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro
930 935 940
Pro Pro Glu Leu Tyr Phe Val Lys Val Ala Ile Thr Gly Ala Tyr Asp
945 950 955 960
Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys
965 970 975
Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala
980 985 990
Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu
995 1000 1005
Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
1010 1015 1020
Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser
1025 1030 1035
Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg
1040 1045 1050
Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val
1055 1060 1065
Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu
1070 1075 1080
Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly
1085 1090 1095
Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu
1100 1105 1110
Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr
1115 1120 1125
Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg
1130 1135 1140
Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly
1145 1150 1155
Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys
1160 1165 1170
Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr
1175 1180 1185
Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn
1190 1195 1200
Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly
1205 1210 1215
Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val
1220 1225 1230
Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu
1235 1240 1245
Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
1250 1255 1260
His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile
1265 1270 1275
Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn
1280 1285 1290
Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro
1295 1300 1305
Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys
1310 1315 1320
Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu
1325 1330 1335
Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr
1340 1345 1350
Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp
1355 1360 1365
Phe Lys Thr Ile Leu Asp Gly Ser Gly Thr Ile Leu Ser Glu Gly
1370 1375 1380
Ala Thr Asn Phe Ser Leu Leu Lys Leu Ala Gly Asp Val Glu Leu
1385 1390 1395
Asn Pro Gly Pro Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu
1400 1405 1410
Leu Leu Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala
1415 1420 1425
Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg
1430 1435 1440
Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr
1445 1450 1455
Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr
1460 1465 1470
Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala
1475 1480 1485
Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro
1490 1495 1500
Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp
1505 1510 1515
Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala
1520 1525 1530
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala
1535 1540 1545
Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
1550 1555 1560
Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly
1565 1570 1575
Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala
1580 1585 1590
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala
1595 1600 1605
Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro
1610 1615 1620
Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn
1625 1630 1635
Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu
1640 1645 1650
Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser
1655 1660 1665
Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro
1670 1675 1680
Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr
1685 1690 1695
His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr
1700 1705 1710
Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe
1715 1720 1725
His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser
1730 1735 1740
Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe
1745 1750 1755
Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile
1760 1765 1770
Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe
1775 1780 1785
Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn
1790 1795 1800
Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser
1805 1810 1815
Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser
1820 1825 1830
Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val
1835 1840 1845
Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala
1850 1855 1860
Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe
1865 1870 1875
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1880 1885 1890
His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser
1895 1900 1905
Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser
1910 1915 1920
Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1925 1930 1935
<210> 67
<211> 84
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 67
ggatccggca gaatcttcaa cgcccactac gccggctact tcgccgacct gctgatccac 60
gacatcgaga caaaccctgg cccc 84
<210> 68
<211> 28
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 68
Gly Ser Gly Arg Ile Phe Asn Ala His Tyr Ala Gly Tyr Phe Ala Asp
1 5 10 15
Leu Leu Ile His Asp Ile Glu Thr Asn Pro Gly Pro
20 25
<210> 69
<211> 63
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 69
ggatccggcg agggcagagg cagcctgctg acatgtggcg acgtggaaga gaaccctggc 60
ccc 63
<210> 70
<211> 21
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 70
Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu
1 5 10 15
Glu Asn Pro Gly Pro
20
<210> 71
<211> 84
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 71
ggatccggca caatcctgtc tgagggcgcc accaacttca gcctgctgaa actggccggc 60
gacgtggaac tgaaccctgg ccct 84
<210> 72
<211> 28
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 72
Gly Ser Gly Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser Leu Leu
1 5 10 15
Lys Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro
20 25
<210> 73
<211> 66
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 73
ggatccggcg ccaccaattt cagcctgctg aaacaggccg gcgacgtgga agagaaccct 60
ggccct 66
<210> 74
<211> 22
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 74
Gly Ser Gly Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val
1 5 10 15
Glu Glu Asn Pro Gly Pro
20
<210> 75
<211> 43
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 75
gagacaaacc ctggccccct ggctggcgag acaggacagg aag 43
<210> 76
<211> 46
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 76
gttgaagatt ctgccggatc ccaggttggc ggaggcagcg gccacg 46
<210> 77
<211> 47
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 77
gctacttcgc cgacctgctg atccacgaca tcgagacaaa ccctggc 47
<210> 78
<211> 47
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 78
ggtcggcgaa gtagccggcg tagtgggcgt tgaagattct gccggat 47
<210> 79
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 79
ttctgaagat gagccccgag gaca 24
<210> 80
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 80
cggcgtctca ttcttctttc tgtc 24
<210> 81
<211> 46
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 81
accctgtgac gaacatggct agcctggctg gcgagacagg acagga 46
<210> 82
<211> 43
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 82
accctgtgac gaacatggct agcacaggct ctggccacgc cag 43
<210> 83
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 83
accctgtgac gaacatggct agcacccctg gaacccagag cc 42
<210> 84
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 84
accctgtgac gaacatggct agcggagctg ccccggagcc gg 42
<210> 85
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 85
tctcaccgac ctccagcctt acat 24
<210> 86
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 86
acggaggctc cggcggactg gctggcgaga caggaca 37
<210> 87
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 87
tgggaggctc cggcggagga gctgccccgg agccgg 36
<210> 88
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 88
cctgctgatc cacgacatcg agacaaaccc tggccccacc cctggaaccc agagcc 56
<210> 89
<211> 52
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 89
tggccggcga cgtggaactg aaccctggcc ctacaggctc tggccacgcc ag 52
<210> 90
<211> 51
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 90
tggccggcga cgtggaactg aaccctggcc ctacccctgg aacccagagc c 51
<210> 91
<211> 54
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 91
tggccggcga cgtggaactg aaccctggcc ctagcttcct cctgtcgtcg ctca 54
<210> 92
<211> 51
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 92
tggccggcga cgtggaactg aaccctggcc ctggagctgc cccggagccg g 51
<210> 93
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 93
tggccggcga cgtggaactg aaccctggcc ctgccaaatt tctgcattgg ctgatg 56
<210> 94
<211> 43
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 94
tggaagagaa ccctggccct ctggctggcg agacaggaca gga 43
<210> 95
<211> 39
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 95
tggaagagaa ccctggccct acccctggaa cccagagcc 39
<210> 96
<211> 49
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 96
gcgacgtgga agagaaccct ggccccctgg ctggcgagac aggacagga 49
<210> 97
<211> 48
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 97
gcgacgtgga agagaaccct ggccccagct tcctcctgtc gtcgctca 48
<210> 98
<211> 50
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 98
gcgacgtgga agagaaccct ggccccgcca aatttctgca ttggctgatg 50
<210> 99
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 99
gcgacgtgga agagaaccct ggccccggag ctgccccgga gccgg 45
<210> 100
<211> 59
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 100
agaatcttca acgcccacta cgccggctac ttcgccgacc tgctgatcca cgacatcga 59
<210> 101
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 101
tgtctgaggg cgccaccaac ttcagcctgc tgaaactggc cggcgacgtg gaactg 56
<210> 102
<211> 48
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 102
ttcagcctgc tgaaacaggc cggcgacgtg gaagagaacc ctggccct 48
<210> 103
<211> 54
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 103
ccggcgaggg cagaggcagc ctgctgacat gtggcgacgt ggaagagaac cctg 54
<210> 104
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 104
gggcccagat cttcacaggg cttcctgcat gctcaggtcc agcac 45
<210> 105
<211> 39
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 105
acgaacatgg ctagcacccc tggaacccag agccccttc 39
<210> 106
<211> 50
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 106
gtgggcgttg aagattctgc cggatcccag ggcttcctgc atgctcaggt 50
<210> 107
<211> 53
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 107
tggtggcgcc ctcagacagg attgtgccgg atcccaggtt ggcggaggca gcg 53
<210> 108
<211> 58
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 108
tggtggcgcc ctcagacagg attgtgccgg atccgtccaa gatggtcttg aaatctga 58
<210> 109
<211> 38
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 109
tccgccggag cctcccaggg cttcctgcat gctcaggt 38
<210> 110
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 110
tccgccggag cctcccaggt tggcggaggc agcg 34
<210> 111
<211> 39
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 111
tccgccggag cctccgtcca agatggtctt gaaatctga 39
<210> 112
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 112
tgtcctcggg gctcatctt 19
<210> 113
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 113
aaggacagaa agaagaatga gacg 24
<210> 114
<211> 48
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 114
ttgttttgtt agggcccaga tcttcacagg gcttcctgca tgctcagg 48
<210> 115
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 115
ttgttttgtt agggcccaga tcttcacagg ttggcggagg cagcg 45
<210> 116
<211> 50
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 116
ttgttttgtt agggcccaga tcttcagtcc aagatggtct tgaaatctga 50
<210> 117
<211> 59
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 117
ctgtttcagc aggctgaaat tggtggcgcc ggatcccagg gcttcctgca tgctcaggt 59
<210> 118
<211> 55
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 118
ctgtttcagc aggctgaaat tggtggcgcc ggatcccagg ttggcggagg cagcg 55
<210> 119
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 119
tgcctctgcc ctcgccggat cccagggctt cctgcatgct caggt 45
<210> 120
<211> 46
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 120
tgcctctgcc ctcgccggat ccgtccaaga tggtcttgaa atctga 46
<210> 121
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 121
aggctggagg tcggtgagag tgga 24
<210> 122
<211> 57
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 122
agggttctct tccacgtcgc cacatgtcag caggctgcct ctgccctcgc cggatcc 57
<210> 123
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 123
acgaacatgg ctagcttcct cctgtcgtcg ctcagaccga g 41
<210> 124
<211> 47
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 124
ttgttttgtt agggcccaga tcttcagtcc aagatggtct tgaaatc 47
<210> 125
<211> 41
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 125
acgaacatgg ctagcgccaa atttctgcat tggctgatgt c 41
<210> 126
<211> 47
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 126
ttgttttgtt agggcccaga tcttcagtcc aagatggtct tgaaatc 47
<210> 127
<211> 39
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 127
accctgtgac gaacatggga gctgccccgg agccggaga 39
<210> 128
<211> 32
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 128
caacaagcta gcctggctgg cgagacagga ca 32
<210> 129
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 129
caacaaagat ctttacaggg cttcctgcat gcacag 36
<210> 130
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 130
accctgtgac gaacatggct agc 23
<210> 131
<211> 17
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 131
agatctgggc cctaaca 17
<210> 132
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 132
Met Ala Ser Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu
1 5 10 15
<210> 133
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 133
Thr Pro Gly Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr
1 5 10 15
<210> 134
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 134
Thr Gln Ser Pro Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr
1 5 10 15
<210> 135
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 135
Phe Phe Leu Leu Leu Leu Leu Thr Val Leu Thr Val Val Thr Gly
1 5 10 15
<210> 136
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 136
Leu Leu Leu Thr Val Leu Thr Val Val Thr Gly Ser Gly His Ala
1 5 10 15
<210> 137
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 137
Val Leu Thr Val Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro
1 5 10 15
<210> 138
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 138
Val Thr Gly Ser Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys
1 5 10 15
<210> 139
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 139
Gly His Ala Ser Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala
1 5 10 15
<210> 140
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 140
Ser Thr Pro Gly Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser
1 5 10 15
<210> 141
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 141
Gly Glu Lys Glu Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser
1 5 10 15
<210> 142
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 142
Thr Ser Ala Thr Gln Arg Ser Ser Val Pro Ser Ser Thr Glu Lys
1 5 10 15
<210> 143
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 143
Gln Arg Ser Ser Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser
1 5 10 15
<210> 144
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 144
Val Pro Ser Ser Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser
1 5 10 15
<210> 145
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 145
Thr Glu Lys Asn Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser
1 5 10 15
<210> 146
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 146
Ala Val Ser Met Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly
1 5 10 15
<210> 147
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 147
Thr Ser Ser Val Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser
1 5 10 15
<210> 148
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 148
Leu Ser Ser His Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly
1 5 10 15
<210> 149
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 149
Ser Pro Gly Ser Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr
1 5 10 15
<210> 150
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 150
Gly Ser Ser Thr Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala
1 5 10 15
<210> 151
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 151
Thr Gln Gly Gln Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala
1 5 10 15
<210> 152
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 152
Asp Val Thr Leu Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala
1 5 10 15
<210> 153
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 153
Ala Pro Ala Thr Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly
1 5 10 15
<210> 154
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 154
Glu Pro Ala Ser Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr
1 5 10 15
<210> 155
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 155
Gly Ser Ala Ala Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val
1 5 10 15
<210> 156
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 156
Thr Trp Gly Gln Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala
1 5 10 15
<210> 157
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 157
Asp Val Thr Ser Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr
1 5 10 15
<210> 158
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 158
Val Pro Val Thr Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala
1 5 10 15
<210> 159
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 159
Arg Pro Ala Leu Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr
1 5 10 15
<210> 160
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 160
Gly Ser Thr Thr Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp
1 5 10 15
<210> 161
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 161
Pro Pro Ala His Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala
1 5 10 15
<210> 162
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 162
Asp Val Thr Ser Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr
1 5 10 15
<210> 163
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 163
Ala Pro Asp Asn Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
1 5 10 15
<210> 164
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 164
Lys Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr
1 5 10 15
<210> 165
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 165
Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp
1 5 10 15
<210> 166
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 166
Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala
1 5 10 15
<210> 167
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 167
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr
1 5 10 15
<210> 168
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 168
Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala
1 5 10 15
<210> 169
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 169
Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly Val Thr
1 5 10 15
<210> 170
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 170
Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr
1 5 10 15
<210> 171
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 171
Ala Pro Asp Thr Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val
1 5 10 15
<210> 172
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 172
Arg Pro Ala Leu Gly Ser Thr Ala Pro Pro Val His Asn Val Thr
1 5 10 15
<210> 173
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 173
Gly Ser Thr Ala Pro Pro Val His Asn Val Thr Ser Ala Ser Gly
1 5 10 15
<210> 174
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 174
Pro Pro Val His Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly
1 5 10 15
<210> 175
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 175
Asn Val Thr Ser Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr
1 5 10 15
<210> 176
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 176
Ala Ser Gly Ser Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn
1 5 10 15
<210> 177
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 177
Ala Ser Gly Ser Ala Ser Thr Leu Val His Asn Gly Thr Ser Ala
1 5 10 15
<210> 178
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 178
Ala Ser Thr Leu Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr
1 5 10 15
<210> 179
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 179
Val His Asn Gly Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser
1 5 10 15
<210> 180
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 180
Thr Ser Ala Arg Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro
1 5 10 15
<210> 181
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 181
Ala Thr Thr Thr Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile Pro
1 5 10 15
<210> 182
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 182
Pro Ala Ser Lys Ser Thr Pro Phe Ser Ile Pro Ser His His Ser
1 5 10 15
<210> 183
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 183
Ser Thr Pro Phe Ser Ile Pro Ser His His Ser Asp Thr Pro Thr
1 5 10 15
<210> 184
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 184
Ser Ile Pro Ser His His Ser Asp Thr Pro Thr Thr Leu Ala Ser
1 5 10 15
<210> 185
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 185
His His Ser Asp Thr Pro Thr Thr Leu Ala Ser His Ser Thr Lys
1 5 10 15
<210> 186
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 186
Thr Pro Thr Thr Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser
1 5 10 15
<210> 187
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 187
Leu Ala Ser His Ser Thr Lys Thr Asp Ala Ser Ser Thr His His
1 5 10 15
<210> 188
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 188
Ser Thr Lys Thr Asp Ala Ser Ser Thr His His Ser Ser Val Pro
1 5 10 15
<210> 189
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 189
Asp Ala Ser Ser Thr His His Ser Ser Val Pro Pro Leu Thr Ser
1 5 10 15
<210> 190
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 190
Thr His His Ser Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser
1 5 10 15
<210> 191
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 191
Ser Val Pro Pro Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln
1 5 10 15
<210> 192
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 192
Leu Thr Ser Ser Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly
1 5 10 15
<210> 193
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 193
Asn His Ser Thr Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe
1 5 10 15
<210> 194
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 194
Ser Pro Gln Leu Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe
1 5 10 15
<210> 195
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 195
Ser Thr Gly Val Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn
1 5 10 15
<210> 196
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 196
Ser Phe Phe Phe Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn
1 5 10 15
<210> 197
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 197
Leu Ser Phe His Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu
1 5 10 15
<210> 198
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 198
Ile Ser Asn Leu Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr
1 5 10 15
<210> 199
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 199
Gln Phe Asn Ser Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln
1 5 10 15
<210> 200
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 200
Ser Leu Glu Asp Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg
1 5 10 15
<210> 201
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 201
Pro Ser Thr Asp Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu
1 5 10 15
<210> 202
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 202
Tyr Tyr Gln Glu Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln
1 5 10 15
<210> 203
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 203
Leu Gln Arg Asp Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln
1 5 10 15
<210> 204
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 204
Ile Ser Glu Met Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu
1 5 10 15
<210> 205
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 205
Phe Leu Gln Ile Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn
1 5 10 15
<210> 206
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 206
Tyr Lys Gln Gly Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg
1 5 10 15
<210> 207
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 207
Gly Phe Leu Gly Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val
1 5 10 15
<210> 208
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 208
Leu Ser Asn Ile Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu
1 5 10 15
<210> 209
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 209
Lys Phe Arg Pro Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe
1 5 10 15
<210> 210
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 210
Gly Ser Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr
1 5 10 15
<210> 211
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 211
Val Val Val Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn
1 5 10 15
<210> 212
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 212
Gln Leu Thr Leu Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp
1 5 10 15
<210> 213
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 213
Ala Phe Arg Glu Gly Thr Ile Asn Val His Asp Val Glu Thr Gln
1 5 10 15
<210> 214
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 214
Gly Thr Ile Asn Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr
1 5 10 15
<210> 215
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 215
Val His Asp Val Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala
1 5 10 15
<210> 216
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 216
Glu Thr Gln Phe Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr
1 5 10 15
<210> 217
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 217
Asn Gln Tyr Lys Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile
1 5 10 15
<210> 218
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 218
Thr Glu Ala Ala Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser
1 5 10 15
<210> 219
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 219
Ser Arg Tyr Asn Leu Thr Ile Ser Asp Val Ser Val Ser Asp Val
1 5 10 15
<210> 220
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 220
Leu Thr Ile Ser Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe
1 5 10 15
<210> 221
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 221
Asp Val Ser Val Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser
1 5 10 15
<210> 222
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 222
Ser Asp Val Pro Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val
1 5 10 15
<210> 223
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 223
Phe Pro Phe Ser Ala Gln Ser Gly Ala Gly Val Pro Gly Trp Gly
1 5 10 15
<210> 224
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 224
Ala Gln Ser Gly Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu
1 5 10 15
<210> 225
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 225
Ala Gly Val Pro Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys
1 5 10 15
<210> 226
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 226
Gly Trp Gly Ile Ala Leu Leu Val Leu Val Cys Val Leu Val Ala
1 5 10 15
<210> 227
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 227
Ala Leu Leu Val Leu Val Cys Val Leu Val Ala Leu Ala Ile Val
1 5 10 15
<210> 228
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 228
Leu Val Cys Val Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala
1 5 10 15
<210> 229
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 229
Leu Val Ala Leu Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys
1 5 10 15
<210> 230
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 230
Ala Ile Val Tyr Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg
1 5 10 15
<210> 231
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 231
Leu Ile Ala Leu Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly
1 5 10 15
<210> 232
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 232
Ala Val Cys Gln Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile
1 5 10 15
<210> 233
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 233
Cys Arg Arg Lys Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg
1 5 10 15
<210> 234
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 234
Asn Tyr Gly Gln Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr His
1 5 10 15
<210> 235
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 235
Leu Asp Ile Phe Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu
1 5 10 15
<210> 236
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 236
Pro Ala Arg Asp Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr
1 5 10 15
<210> 237
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 237
Thr Tyr His Pro Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly
1 5 10 15
<210> 238
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 238
Met Ser Glu Tyr Pro Thr Tyr His Thr His Gly Arg Tyr Val Pro
1 5 10 15
<210> 239
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 239
Pro Thr Tyr His Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr
1 5 10 15
<210> 240
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 240
Thr His Gly Arg Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro
1 5 10 15
<210> 241
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 241
Tyr Val Pro Pro Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val
1 5 10 15
<210> 242
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 242
Ser Ser Thr Asp Arg Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn
1 5 10 15
<210> 243
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 243
Arg Ser Pro Tyr Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser
1 5 10 15
<210> 244
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 244
Glu Lys Val Ser Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr
1 5 10 15
<210> 245
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 245
Ala Gly Asn Gly Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val
1 5 10 15
<210> 246
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 246
Gly Ser Ser Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser
1 5 10 15
<210> 247
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 247
Leu Ser Tyr Thr Asn Pro Ala Val Ala Ala Ala Ser Ala Asn Leu
1 5 10 15
<210> 248
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 248
Met Ala Ser Leu Pro Thr Ala Arg Pro Leu Leu Gly Ser Cys Gly
1 5 10 15
<210> 249
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 249
Thr Ala Arg Pro Leu Leu Gly Ser Cys Gly Thr Pro Ala Leu Gly
1 5 10 15
<210> 250
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 250
Leu Leu Gly Ser Cys Gly Thr Pro Ala Leu Gly Ser Leu Leu Phe
1 5 10 15
<210> 251
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 251
Cys Gly Thr Pro Ala Leu Gly Ser Leu Leu Phe Leu Leu Phe Ser
1 5 10 15
<210> 252
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 252
Ala Leu Gly Ser Leu Leu Phe Leu Leu Phe Ser Leu Gly Trp Val
1 5 10 15
<210> 253
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 253
Leu Leu Phe Leu Leu Phe Ser Leu Gly Trp Val Gln Pro Ser Arg
1 5 10 15
<210> 254
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 254
Leu Phe Ser Leu Gly Trp Val Gln Pro Ser Arg Thr Leu Ala Gly
1 5 10 15
<210> 255
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 255
Gly Trp Val Gln Pro Ser Arg Thr Leu Ala Gly Glu Thr Gly Gln
1 5 10 15
<210> 256
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 256
Pro Ser Arg Thr Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro
1 5 10 15
<210> 257
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 257
Thr Leu Ala Gly Glu Thr Gly Gln Glu Ala Ala Pro Leu Asp Gly
1 5 10 15
<210> 258
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 258
Thr Gly Gln Glu Ala Ala Pro Leu Asp Gly Val Leu Ala Asn Pro
1 5 10 15
<210> 259
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 259
Ala Ala Pro Leu Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser
1 5 10 15
<210> 260
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 260
Asp Gly Val Leu Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro
1 5 10 15
<210> 261
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 261
Ala Asn Pro Pro Asn Ile Ser Ser Leu Ser Pro Arg Gln Leu Leu
1 5 10 15
<210> 262
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 262
Asn Ile Ser Ser Leu Ser Pro Arg Gln Leu Leu Gly Phe Pro Cys
1 5 10 15
<210> 263
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 263
Leu Ser Pro Arg Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser
1 5 10 15
<210> 264
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 264
Gln Leu Leu Gly Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr
1 5 10 15
<210> 265
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 265
Phe Pro Cys Ala Glu Val Ser Gly Leu Ser Thr Glu Arg Val Arg
1 5 10 15
<210> 266
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 266
Glu Val Ser Gly Leu Ser Thr Glu Arg Val Arg Glu Leu Ala Val
1 5 10 15
<210> 267
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 267
Leu Ser Thr Glu Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln
1 5 10 15
<210> 268
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 268
Arg Val Arg Glu Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys
1 5 10 15
<210> 269
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 269
Leu Ala Val Ala Leu Ala Gln Lys Asn Val Lys Leu Ser Thr Glu
1 5 10 15
<210> 270
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 270
Leu Ala Gln Lys Asn Val Lys Leu Ser Thr Glu Gln Leu Arg Cys
1 5 10 15
<210> 271
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 271
Asn Val Lys Leu Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg
1 5 10 15
<210> 272
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 272
Ser Thr Glu Gln Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro
1 5 10 15
<210> 273
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 273
Leu Arg Cys Leu Ala His Arg Leu Ser Glu Pro Pro Glu Asp Leu
1 5 10 15
<210> 274
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 274
Ala His Arg Leu Ser Glu Pro Pro Glu Asp Leu Asp Ala Leu Pro
1 5 10 15
<210> 275
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 275
Ser Glu Pro Pro Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu
1 5 10 15
<210> 276
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 276
Glu Asp Leu Asp Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn
1 5 10 15
<210> 277
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 277
Ala Leu Pro Leu Asp Leu Leu Leu Phe Leu Asn Pro Asp Ala Phe
1 5 10 15
<210> 278
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 278
Asp Leu Leu Leu Phe Leu Asn Pro Asp Ala Phe Ser Gly Pro Gln
1 5 10 15
<210> 279
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 279
Phe Leu Asn Pro Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg
1 5 10 15
<210> 280
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 280
Asp Ala Phe Ser Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg
1 5 10 15
<210> 281
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 281
Gly Pro Gln Ala Cys Thr Arg Phe Phe Ser Arg Ile Thr Lys Ala
1 5 10 15
<210> 282
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 282
Cys Thr Arg Phe Phe Ser Arg Ile Thr Lys Ala Asn Val Asp Leu
1 5 10 15
<210> 283
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 283
Phe Ser Arg Ile Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly
1 5 10 15
<210> 284
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 284
Thr Lys Ala Asn Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg
1 5 10 15
<210> 285
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 285
Val Asp Leu Leu Pro Arg Gly Ala Pro Glu Arg Gln Arg Leu Leu
1 5 10 15
<210> 286
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 286
Pro Arg Gly Ala Pro Glu Arg Gln Arg Leu Leu Pro Ala Ala Leu
1 5 10 15
<210> 287
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 287
Pro Glu Arg Gln Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly
1 5 10 15
<210> 288
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 288
Arg Leu Leu Pro Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser
1 5 10 15
<210> 289
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 289
Ala Ala Leu Ala Cys Trp Gly Val Arg Gly Ser Leu Leu Ser Glu
1 5 10 15
<210> 290
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 290
Cys Trp Gly Val Arg Gly Ser Leu Leu Ser Glu Ala Asp Val Arg
1 5 10 15
<210> 291
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 291
Arg Gly Ser Leu Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly
1 5 10 15
<210> 292
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 292
Leu Ser Glu Ala Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp
1 5 10 15
<210> 293
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 293
Asp Val Arg Ala Leu Gly Gly Leu Ala Cys Asp Leu Pro Gly Arg
1 5 10 15
<210> 294
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 294
Leu Gly Gly Leu Ala Cys Asp Leu Pro Gly Arg Phe Val Ala Glu
1 5 10 15
<210> 295
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 295
Ala Cys Asp Leu Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val
1 5 10 15
<210> 296
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 296
Pro Gly Arg Phe Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg
1 5 10 15
<210> 297
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 297
Val Ala Glu Ser Ala Glu Val Leu Leu Pro Arg Leu Val Ser Cys
1 5 10 15
<210> 298
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 298
Ala Glu Val Leu Leu Pro Arg Leu Val Ser Cys Pro Gly Pro Leu
1 5 10 15
<210> 299
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 299
Leu Pro Arg Leu Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln
1 5 10 15
<210> 300
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 300
Val Ser Cys Pro Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala
1 5 10 15
<210> 301
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 301
Gly Pro Leu Asp Gln Asp Gln Gln Glu Ala Ala Arg Ala Ala Leu
1 5 10 15
<210> 302
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 302
Gln Asp Gln Gln Glu Ala Ala Arg Ala Ala Leu Gln Gly Gly Gly
1 5 10 15
<210> 303
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 303
Glu Ala Ala Arg Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly
1 5 10 15
<210> 304
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 304
Ala Ala Leu Gln Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr
1 5 10 15
<210> 305
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 305
Gly Gly Gly Pro Pro Tyr Gly Pro Pro Ser Thr Trp Ser Val Ser
1 5 10 15
<210> 306
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 306
Pro Tyr Gly Pro Pro Ser Thr Trp Ser Val Ser Thr Met Asp Ala
1 5 10 15
<210> 307
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 307
Pro Ser Thr Trp Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu
1 5 10 15
<210> 308
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 308
Ser Val Ser Thr Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu
1 5 10 15
<210> 309
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 309
Met Asp Ala Leu Arg Gly Leu Leu Pro Val Leu Gly Gln Pro Ile
1 5 10 15
<210> 310
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 310
Arg Gly Leu Leu Pro Val Leu Gly Gln Pro Ile Ile Arg Ser Ile
1 5 10 15
<210> 311
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 311
Pro Val Leu Gly Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile
1 5 10 15
<210> 312
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 312
Gln Pro Ile Ile Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp
1 5 10 15
<210> 313
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 313
Arg Ser Ile Pro Gln Gly Ile Val Ala Ala Trp Arg Gln Arg Ser
1 5 10 15
<210> 314
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 314
Gln Gly Ile Val Ala Ala Trp Arg Gln Arg Ser Ser Arg Asp Pro
1 5 10 15
<210> 315
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 315
Ala Ala Trp Arg Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln
1 5 10 15
<210> 316
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 316
Gln Arg Ser Ser Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr
1 5 10 15
<210> 317
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 317
Arg Asp Pro Ser Trp Arg Gln Pro Glu Arg Thr Ile Leu Arg Pro
1 5 10 15
<210> 318
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 318
Trp Arg Gln Pro Glu Arg Thr Ile Leu Arg Pro Arg Phe Arg Arg
1 5 10 15
<210> 319
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 319
Glu Arg Thr Ile Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys
1 5 10 15
<210> 320
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 320
Leu Arg Pro Arg Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro
1 5 10 15
<210> 321
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 321
Phe Arg Arg Glu Val Glu Lys Thr Ala Cys Pro Ser Gly Lys Lys
1 5 10 15
<210> 322
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 322
Val Glu Lys Thr Ala Cys Pro Ser Gly Lys Lys Ala Arg Glu Ile
1 5 10 15
<210> 323
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 323
Ala Cys Pro Ser Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu
1 5 10 15
<210> 324
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 324
Gly Lys Lys Ala Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys
1 5 10 15
<210> 325
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 325
Arg Glu Ile Asp Glu Ser Leu Ile Phe Tyr Lys Lys Trp Glu Leu
1 5 10 15
<210> 326
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 326
Glu Ser Leu Ile Phe Tyr Lys Lys Trp Glu Leu Glu Ala Cys Val
1 5 10 15
<210> 327
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 327
Phe Tyr Lys Lys Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu
1 5 10 15
<210> 328
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 328
Trp Glu Leu Glu Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln
1 5 10 15
<210> 329
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 329
Ala Cys Val Asp Ala Ala Leu Leu Ala Thr Gln Met Asp Arg Val
1 5 10 15
<210> 330
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 330
Ala Ala Leu Leu Ala Thr Gln Met Asp Arg Val Asn Ala Ile Pro
1 5 10 15
<210> 331
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 331
Ala Thr Gln Met Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu
1 5 10 15
<210> 332
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 332
Asp Arg Val Asn Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val
1 5 10 15
<210> 333
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 333
Ala Ile Pro Phe Thr Tyr Glu Gln Leu Asp Val Leu Lys His Lys
1 5 10 15
<210> 334
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 334
Thr Tyr Glu Gln Leu Asp Val Leu Lys His Lys Leu Asp Glu Leu
1 5 10 15
<210> 335
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 335
Leu Asp Val Leu Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly
1 5 10 15
<210> 336
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 336
Lys His Lys Leu Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser
1 5 10 15
<210> 337
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 337
Asp Glu Leu Tyr Pro Gln Gly Tyr Pro Glu Ser Val Ile Gln His
1 5 10 15
<210> 338
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 338
Pro Gln Gly Tyr Pro Glu Ser Val Ile Gln His Leu Gly Tyr Leu
1 5 10 15
<210> 339
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 339
Pro Glu Ser Val Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met
1 5 10 15
<210> 340
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 340
Ile Gln His Leu Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp
1 5 10 15
<210> 341
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 341
Gly Tyr Leu Phe Leu Lys Met Ser Pro Glu Asp Ile Arg Lys Trp
1 5 10 15
<210> 342
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 342
Leu Lys Met Ser Pro Glu Asp Ile Arg Lys Trp Asn Val Thr Ser
1 5 10 15
<210> 343
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 343
Pro Glu Asp Ile Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu
1 5 10 15
<210> 344
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 344
Arg Lys Trp Asn Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu
1 5 10 15
<210> 345
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 345
Val Thr Ser Leu Glu Thr Leu Lys Ala Leu Leu Glu Val Asn Lys
1 5 10 15
<210> 346
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 346
Glu Thr Leu Lys Ala Leu Leu Glu Val Asn Lys Gly His Glu Met
1 5 10 15
<210> 347
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 347
Ala Leu Leu Glu Val Asn Lys Gly His Glu Met Ser Pro Gln Val
1 5 10 15
<210> 348
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 348
Val Asn Lys Gly His Glu Met Ser Pro Gln Val Ala Thr Leu Ile
1 5 10 15
<210> 349
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 349
His Glu Met Ser Pro Gln Val Ala Thr Leu Ile Asp Arg Phe Val
1 5 10 15
<210> 350
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 350
Pro Gln Val Ala Thr Leu Ile Asp Arg Phe Val Lys Gly Arg Gly
1 5 10 15
<210> 351
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 351
Thr Leu Ile Asp Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys
1 5 10 15
<210> 352
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 352
Arg Phe Val Lys Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp
1 5 10 15
<210> 353
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 353
Gly Arg Gly Gln Leu Asp Lys Asp Thr Leu Asp Thr Leu Thr Ala
1 5 10 15
<210> 354
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 354
Leu Asp Lys Asp Thr Leu Asp Thr Leu Thr Ala Phe Tyr Pro Gly
1 5 10 15
<210> 355
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 355
Thr Leu Asp Thr Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser
1 5 10 15
<210> 356
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 356
Leu Thr Ala Phe Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu
1 5 10 15
<210> 357
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 357
Tyr Pro Gly Tyr Leu Cys Ser Leu Ser Pro Glu Glu Leu Ser Ser
1 5 10 15
<210> 358
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 358
Leu Cys Ser Leu Ser Pro Glu Glu Leu Ser Ser Val Pro Pro Ser
1 5 10 15
<210> 359
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 359
Ser Pro Glu Glu Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala
1 5 10 15
<210> 360
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 360
Leu Ser Ser Val Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln
1 5 10 15
<210> 361
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 361
Pro Pro Ser Ser Ile Trp Ala Val Arg Pro Gln Asp Leu Asp Thr
1 5 10 15
<210> 362
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 362
Ile Trp Ala Val Arg Pro Gln Asp Leu Asp Thr Cys Asp Pro Arg
1 5 10 15
<210> 363
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 363
Arg Pro Gln Asp Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val
1 5 10 15
<210> 364
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 364
Leu Asp Thr Cys Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys
1 5 10 15
<210> 365
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 365
Asp Pro Arg Gln Leu Asp Val Leu Tyr Pro Lys Ala Arg Leu Ala
1 5 10 15
<210> 366
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 366
Leu Asp Val Leu Tyr Pro Lys Ala Arg Leu Ala Phe Gln Asn Met
1 5 10 15
<210> 367
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 367
Tyr Pro Lys Ala Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu
1 5 10 15
<210> 368
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 368
Arg Leu Ala Phe Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys
1 5 10 15
<210> 369
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 369
Gln Asn Met Asn Gly Ser Glu Tyr Phe Val Lys Ile Gln Ser Phe
1 5 10 15
<210> 370
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 370
Gly Ser Glu Tyr Phe Val Lys Ile Gln Ser Phe Leu Gly Gly Ala
1 5 10 15
<210> 371
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 371
Phe Val Lys Ile Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp
1 5 10 15
<210> 372
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 372
Gln Ser Phe Leu Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu
1 5 10 15
<210> 373
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 373
Gly Gly Ala Pro Thr Glu Asp Leu Lys Ala Leu Ser Gln Gln Asn
1 5 10 15
<210> 374
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 374
Thr Glu Asp Leu Lys Ala Leu Ser Gln Gln Asn Val Ser Met Asp
1 5 10 15
<210> 375
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 375
Lys Ala Leu Ser Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe
1 5 10 15
<210> 376
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 376
Gln Gln Asn Val Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg
1 5 10 15
<210> 377
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 377
Ser Met Asp Leu Ala Thr Phe Met Lys Leu Arg Thr Asp Ala Val
1 5 10 15
<210> 378
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 378
Ala Thr Phe Met Lys Leu Arg Thr Asp Ala Val Leu Pro Leu Thr
1 5 10 15
<210> 379
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 379
Lys Leu Arg Thr Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val
1 5 10 15
<210> 380
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 380
Asp Ala Val Leu Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu
1 5 10 15
<210> 381
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 381
Pro Leu Thr Val Ala Glu Val Gln Lys Leu Leu Gly Pro His Val
1 5 10 15
<210> 382
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 382
Ala Glu Val Gln Lys Leu Leu Gly Pro His Val Glu Gly Leu Lys
1 5 10 15
<210> 383
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 383
Lys Leu Leu Gly Pro His Val Glu Gly Leu Lys Ala Glu Glu Arg
1 5 10 15
<210> 384
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 384
Pro His Val Glu Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val
1 5 10 15
<210> 385
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 385
Gly Leu Lys Ala Glu Glu Arg His Arg Pro Val Arg Asp Trp Ile
1 5 10 15
<210> 386
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 386
Glu Glu Arg His Arg Pro Val Arg Asp Trp Ile Leu Arg Gln Arg
1 5 10 15
<210> 387
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 387
Arg Pro Val Arg Asp Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu
1 5 10 15
<210> 388
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 388
Asp Trp Ile Leu Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly
1 5 10 15
<210> 389
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 389
Arg Gln Arg Gln Asp Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln
1 5 10 15
<210> 390
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 390
Asp Asp Leu Asp Thr Leu Gly Leu Gly Leu Gln Gly Gly Ile Pro
1 5 10 15
<210> 391
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 391
Thr Leu Gly Leu Gly Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu
1 5 10 15
<210> 392
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 392
Gly Leu Gln Gly Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu
1 5 10 15
<210> 393
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 393
Gly Ile Pro Asn Gly Tyr Leu Val Leu Asp Leu Ser Met Gln Glu
1 5 10 15
<210> 394
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 394
Tyr Leu Val Leu Asp Leu Ser Met Gln Glu Ala Leu Ser Gly Thr
1 5 10 15
<210> 395
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 395
Leu Asp Leu Ser Met Gln Glu Ala Leu Ser Gly Thr Pro Cys Leu
1 5 10 15
<210> 396
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 396
Met Gln Glu Ala Leu Ser Gly Thr Pro Cys Leu Leu Gly Pro Gly
1 5 10 15
<210> 397
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 397
Leu Ser Gly Thr Pro Cys Leu Leu Gly Pro Gly Pro Val Leu Thr
1 5 10 15
<210> 398
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 398
Pro Cys Leu Leu Gly Pro Gly Pro Val Leu Thr Val Leu Ala Leu
1 5 10 15
<210> 399
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 399
Gly Pro Gly Pro Val Leu Thr Val Leu Ala Leu Leu Leu Ala Ser
1 5 10 15
<210> 400
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 400
Pro Val Leu Thr Val Leu Ala Leu Leu Leu Ala Ser Thr Leu Ala
1 5 10 15
<210> 401
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 401
Arg Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln
1 5 10 15
<210> 402
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 402
Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gln Gly Ser Trp Ala
1 5 10 15
<210> 403
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 403
Glu Arg Thr Pro Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg
1 5 10 15
<210> 404
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 404
Val Gly Gln Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro
1 5 10 15
<210> 405
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 405
Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly
1 5 10 15
<210> 406
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 406
Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val
1 5 10 15
<210> 407
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 407
Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg
1 5 10 15
<210> 408
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 408
Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu
1 5 10 15
<210> 409
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 409
Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu
1 5 10 15
<210> 410
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 410
Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu
1 5 10 15
<210> 411
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 411
Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg
1 5 10 15
<210> 412
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 412
Thr Ser Leu Glu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro
1 5 10 15
<210> 413
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 413
Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg
1 5 10 15
<210> 414
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 414
Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gln His His Ala
1 5 10 15
<210> 415
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 415
Ser His Pro Ser Val Gly Arg Gln His His Ala Gly Pro Pro Ser
1 5 10 15
<210> 416
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 416
Val Gly Arg Gln His His Ala Gly Pro Pro Ser Thr Ser Arg Pro
1 5 10 15
<210> 417
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 417
His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp
1 5 10 15
<210> 418
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 418
Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys
1 5 10 15
<210> 419
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 419
Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr
1 5 10 15
<210> 420
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 420
Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys
1 5 10 15
<210> 421
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 421
Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr
1 5 10 15
<210> 422
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 422
Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp
1 5 10 15
<210> 423
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 423
Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu
1 5 10 15
<210> 424
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 424
Phe Leu Tyr Ser Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe
1 5 10 15
<210> 425
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 425
Ser Gly Asp Lys Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser
1 5 10 15
<210> 426
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 426
Glu Gln Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser
1 5 10 15
<210> 427
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 427
Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala
1 5 10 15
<210> 428
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 428
Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val
1 5 10 15
<210> 429
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 429
Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe
1 5 10 15
<210> 430
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 430
Thr Gly Ala Arg Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg
1 5 10 15
<210> 431
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 431
Arg Leu Val Glu Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro
1 5 10 15
<210> 432
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 432
Thr Ile Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg
1 5 10 15
<210> 433
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 433
Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg
1 5 10 15
<210> 434
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 434
Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg
1 5 10 15
<210> 435
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 435
Thr Pro Arg Arg Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met
1 5 10 15
<210> 436
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 436
Leu Pro Arg Leu Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe
1 5 10 15
<210> 437
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 437
Pro Gln Arg Tyr Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu
1 5 10 15
<210> 438
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 438
Trp Gln Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala
1 5 10 15
<210> 439
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 439
Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr
1 5 10 15
<210> 440
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 440
Glu Leu Leu Gly Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu
1 5 10 15
<210> 441
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 441
Asn His Ala Gln Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys
1 5 10 15
<210> 442
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 442
Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala
1 5 10 15
<210> 443
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 443
Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro
1 5 10 15
<210> 444
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 444
Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val
1 5 10 15
<210> 445
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 445
Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu
1 5 10 15
<210> 446
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 446
Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly
1 5 10 15
<210> 447
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 447
Ala Gly Val Cys Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala
1 5 10 15
<210> 448
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 448
Ala Arg Glu Lys Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu
1 5 10 15
<210> 449
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 449
Pro Gln Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro
1 5 10 15
<210> 450
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 450
Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val
1 5 10 15
<210> 451
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 451
Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg
1 5 10 15
<210> 452
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 452
Thr Asp Pro Arg Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser
1 5 10 15
<210> 453
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 453
Arg Leu Val Gln Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val
1 5 10 15
<210> 454
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 454
Leu Leu Arg Gln His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val
1 5 10 15
<210> 455
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 455
His Ser Ser Pro Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu
1 5 10 15
<210> 456
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 456
Trp Gln Val Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val
1 5 10 15
<210> 457
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 457
Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu
1 5 10 15
<210> 458
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 458
Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg
1 5 10 15
<210> 459
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 459
Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg
1 5 10 15
<210> 460
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 460
Pro Gly Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg
1 5 10 15
<210> 461
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 461
Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys
1 5 10 15
<210> 462
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 462
Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu
1 5 10 15
<210> 463
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 463
Phe Leu Arg Asn Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala
1 5 10 15
<210> 464
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 464
Thr Lys Lys Phe Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu
1 5 10 15
<210> 465
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 465
Ile Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr
1 5 10 15
<210> 466
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 466
Lys His Ala Lys Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser
1 5 10 15
<210> 467
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 467
Leu Ser Leu Gln Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys
1 5 10 15
<210> 468
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 468
Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg
1 5 10 15
<210> 469
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 469
Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly
1 5 10 15
<210> 470
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 470
Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val
1 5 10 15
<210> 471
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 471
Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu
1 5 10 15
<210> 472
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 472
Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg
1 5 10 15
<210> 473
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 473
Gly Cys Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu
1 5 10 15
<210> 474
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 474
Ala Ala Glu His Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu
1 5 10 15
<210> 475
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 475
Arg Leu Arg Glu Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met
1 5 10 15
<210> 476
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 476
Glu Ile Leu Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val
1 5 10 15
<210> 477
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 477
Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu
1 5 10 15
<210> 478
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 478
Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe
1 5 10 15
<210> 479
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 479
Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu
1 5 10 15
<210> 480
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 480
Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln
1 5 10 15
<210> 481
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 481
Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu
1 5 10 15
<210> 482
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 482
Val Thr Glu Thr Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg
1 5 10 15
<210> 483
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 483
Thr Phe Gln Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp
1 5 10 15
<210> 484
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 484
Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln
1 5 10 15
<210> 485
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 485
Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile
1 5 10 15
<210> 486
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 486
Ser Val Trp Ser Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu
1 5 10 15
<210> 487
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 487
Lys Leu Gln Ser Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln
1 5 10 15
<210> 488
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 488
Ile Gly Ile Arg Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu
1 5 10 15
<210> 489
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 489
Gln His Leu Lys Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu
1 5 10 15
<210> 490
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 490
Arg Val Gln Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His
1 5 10 15
<210> 491
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 491
Arg Glu Leu Ser Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg
1 5 10 15
<210> 492
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 492
Glu Ala Glu Val Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu
1 5 10 15
<210> 493
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 493
Arg Gln His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu
1 5 10 15
<210> 494
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 494
Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro
1 5 10 15
<210> 495
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 495
Ala Leu Leu Thr Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly
1 5 10 15
<210> 496
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 496
Ser Arg Leu Arg Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile
1 5 10 15
<210> 497
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 497
Phe Ile Pro Lys Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp
1 5 10 15
<210> 498
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 498
Pro Asp Gly Leu Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly
1 5 10 15
<210> 499
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 499
Arg Pro Ile Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe
1 5 10 15
<210> 500
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 500
Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys
1 5 10 15
<210> 501
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 501
Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg
1 5 10 15
<210> 502
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 502
Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg
1 5 10 15
<210> 503
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 503
Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu
1 5 10 15
<210> 504
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 504
Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu
1 5 10 15
<210> 505
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 505
Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg
1 5 10 15
<210> 506
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 506
Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro
1 5 10 15
<210> 507
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 507
Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly
1 5 10 15
<210> 508
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 508
Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu
1 5 10 15
<210> 509
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 509
Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp
1 5 10 15
<210> 510
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 510
Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala
1 5 10 15
<210> 511
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 511
Ser Val Leu Gly Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe
1 5 10 15
<210> 512
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 512
Leu Asp Asp Ile His Arg Ala Trp Arg Thr Phe Val Leu Arg Val
1 5 10 15
<210> 513
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 513
His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp
1 5 10 15
<210> 514
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 514
Arg Thr Phe Val Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu
1 5 10 15
<210> 515
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 515
Leu Arg Val Arg Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val
1 5 10 15
<210> 516
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 516
Ala Gln Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val
1 5 10 15
<210> 517
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 517
Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr
1 5 10 15
<210> 518
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 518
Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr Ile Pro
1 5 10 15
<210> 519
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 519
Val Asp Val Thr Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu
1 5 10 15
<210> 520
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 520
Gly Ala Tyr Asp Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile
1 5 10 15
<210> 521
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 521
Thr Ile Pro Gln Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile
1 5 10 15
<210> 522
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 522
Asp Arg Leu Thr Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn
1 5 10 15
<210> 523
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 523
Glu Val Ile Ala Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val
1 5 10 15
<210> 524
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 524
Ser Ile Ile Lys Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala
1 5 10 15
<210> 525
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 525
Pro Gln Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys
1 5 10 15
<210> 526
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 526
Tyr Cys Val Arg Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly
1 5 10 15
<210> 527
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 527
Arg Tyr Ala Val Val Gln Lys Ala Ala His Gly His Val Arg Lys
1 5 10 15
<210> 528
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 528
Val Gln Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser
1 5 10 15
<210> 529
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 529
Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr
1 5 10 15
<210> 530
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 530
Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu
1 5 10 15
<210> 531
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 531
Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met
1 5 10 15
<210> 532
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 532
Val Ser Thr Leu Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val
1 5 10 15
<210> 533
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 533
Thr Asp Leu Gln Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln
1 5 10 15
<210> 534
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 534
Pro Tyr Met Arg Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro
1 5 10 15
<210> 535
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 535
Gln Phe Val Ala His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala
1 5 10 15
<210> 536
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 536
His Leu Gln Glu Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu
1 5 10 15
<210> 537
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 537
Thr Ser Pro Leu Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser
1 5 10 15
<210> 538
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 538
Arg Asp Ala Val Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala
1 5 10 15
<210> 539
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 539
Val Ile Glu Gln Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu
1 5 10 15
<210> 540
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 540
Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe
1 5 10 15
<210> 541
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 541
Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met
1 5 10 15
<210> 542
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 542
Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala
1 5 10 15
<210> 543
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 543
Asp Val Phe Leu Arg Phe Met Cys His His Ala Val Arg Ile Arg
1 5 10 15
<210> 544
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 544
Arg Phe Met Cys His His Ala Val Arg Ile Arg Gly Lys Ser Tyr
1 5 10 15
<210> 545
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 545
His His Ala Val Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln
1 5 10 15
<210> 546
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 546
Arg Ile Arg Gly Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln
1 5 10 15
<210> 547
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 547
Lys Ser Tyr Val Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu
1 5 10 15
<210> 548
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 548
Gln Cys Gln Gly Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu
1 5 10 15
<210> 549
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 549
Ile Pro Gln Gly Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys
1 5 10 15
<210> 550
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 550
Ser Ile Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met
1 5 10 15
<210> 551
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 551
Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu
1 5 10 15
<210> 552
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 552
Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile
1 5 10 15
<210> 553
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 553
Gly Asp Met Glu Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly
1 5 10 15
<210> 554
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 554
Asn Lys Leu Phe Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg
1 5 10 15
<210> 555
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 555
Ala Gly Ile Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp
1 5 10 15
<210> 556
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 556
Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val
1 5 10 15
<210> 557
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 557
Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu
1 5 10 15
<210> 558
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 558
Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys
1 5 10 15
<210> 559
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 559
Leu Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg
1 5 10 15
<210> 560
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 560
Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg
1 5 10 15
<210> 561
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 561
His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu
1 5 10 15
<210> 562
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 562
Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val
1 5 10 15
<210> 563
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 563
Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg
1 5 10 15
<210> 564
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 564
Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr Val Val
1 5 10 15
<210> 565
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 565
Gly Cys Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val
1 5 10 15
<210> 566
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 566
Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala
1 5 10 15
<210> 567
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 567
Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr
1 5 10 15
<210> 568
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 568
Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln
1 5 10 15
<210> 569
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 569
Asp Glu Ala Leu Gly Gly Thr Ala Phe Val Gln Met Pro Ala His
1 5 10 15
<210> 570
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 570
Gly Gly Thr Ala Phe Val Gln Met Pro Ala His Gly Leu Phe Pro
1 5 10 15
<210> 571
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 571
Phe Val Gln Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu
1 5 10 15
<210> 572
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 572
Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr
1 5 10 15
<210> 573
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 573
Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu
1 5 10 15
<210> 574
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 574
Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp
1 5 10 15
<210> 575
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 575
Leu Asp Thr Arg Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr
1 5 10 15
<210> 576
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 576
Thr Leu Glu Val Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser
1 5 10 15
<210> 577
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 577
Gln Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser
1 5 10 15
<210> 578
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 578
Ser Ser Tyr Ala Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn
1 5 10 15
<210> 579
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 579
Arg Thr Ser Ile Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys
1 5 10 15
<210> 580
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 580
Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn
1 5 10 15
<210> 581
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 581
Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys
1 5 10 15
<210> 582
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 582
Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val
1 5 10 15
<210> 583
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 583
Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys
1 5 10 15
<210> 584
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 584
Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu
1 5 10 15
<210> 585
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 585
Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu
1 5 10 15
<210> 586
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 586
Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser
1 5 10 15
<210> 587
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 587
His Ser Leu Phe Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val
1 5 10 15
<210> 588
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 588
Leu Asp Leu Gln Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile
1 5 10 15
<210> 589
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 589
Val Asn Ser Leu Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu
1 5 10 15
<210> 590
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 590
Gln Thr Val Cys Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala
1 5 10 15
<210> 591
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 591
Thr Asn Ile Tyr Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His
1 5 10 15
<210> 592
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 592
Lys Ile Leu Leu Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu
1 5 10 15
<210> 593
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 593
Leu Gln Ala Tyr Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe
1 5 10 15
<210> 594
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 594
Arg Phe His Ala Cys Val Leu Gln Leu Pro Phe His Gln Gln Val
1 5 10 15
<210> 595
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 595
Cys Val Leu Gln Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro
1 5 10 15
<210> 596
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 596
Leu Pro Phe His Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu
1 5 10 15
<210> 597
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 597
Gln Gln Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser
1 5 10 15
<210> 598
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 598
Lys Asn Pro Thr Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser
1 5 10 15
<210> 599
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 599
Phe Phe Leu Arg Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser
1 5 10 15
<210> 600
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 600
Val Ile Ser Asp Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala
1 5 10 15
<210> 601
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 601
Thr Ala Ser Leu Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly
1 5 10 15
<210> 602
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 602
Cys Tyr Ser Ile Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly
1 5 10 15
<210> 603
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 603
Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala
1 5 10 15
<210> 604
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 604
Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu
1 5 10 15
<210> 605
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 605
Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala
1 5 10 15
<210> 606
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 606
Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu
1 5 10 15
<210> 607
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 607
Gly Pro Leu Pro Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala
1 5 10 15
<210> 608
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 608
Ser Glu Ala Val Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys
1 5 10 15
<210> 609
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 609
Gln Trp Leu Cys His Gln Ala Phe Leu Leu Lys Leu Thr Arg His
1 5 10 15
<210> 610
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 610
His Gln Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr
1 5 10 15
<210> 611
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 611
Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu
1 5 10 15
<210> 612
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 612
Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg
1 5 10 15
<210> 613
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 613
Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr
1 5 10 15
<210> 614
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 614
Pro Leu Leu Gly Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg
1 5 10 15
<210> 615
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 615
Ser Leu Arg Thr Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly
1 5 10 15
<210> 616
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 616
Ala Gln Thr Gln Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr
1 5 10 15
<210> 617
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 617
Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala
1 5 10 15
<210> 618
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 618
Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro
1 5 10 15
<210> 619
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 619
Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser
1 5 10 15
<210> 620
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 620
Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr
1 5 10 15
<210> 621
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 621
Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr Ile Leu Asp
1 5 10 15
Claims (23)
- 하기로 이루어진 군으로부터 선택되는 핵산 분자를 포함하는 다중-항원 구축물:
(i) 면역원성 MUC1 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열 및 면역원성 TERT 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열을 포함하는 핵산 분자;
(ii) 면역원성 MUC1 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열 및 면역원성 MSLN 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열을 포함하는 핵산 분자; 및
(iii) 면역원성 TERT 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열 및 면역원성 MSLN 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열을 포함하는 핵산 분자. - (i) 면역원성 MSLN 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열; (ii) 면역원성 MUC1 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열; 및 (iii) 면역원성 TERT 폴리펩티드를 코딩하는 적어도 1개의 뉴클레오티드 서열을 포함하는 핵산 분자를 포함하는 다중-항원 구축물.
- 제1항 또는 제2항에 있어서, (i) 면역원성 MUC1 폴리펩티드가 하기로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하는 것인 다중-항원 구축물:
(1) 서열식별번호: 8의 아미노산 서열;
(2) 서열식별번호: 8의 아미노산 4-537을 포함하는 아미노산 서열;
(3) 서열식별번호: 8의 아미노산 24-537을 포함하는 아미노산 서열;
(4) 서열식별번호: 16의 아미노산 서열;
(5) 서열식별번호: 16의 아미노산 4-517을 포함하는 아미노산 서열; 및
(6) 서열식별번호: 16의 아미노산 4-517을 포함하며, 여기서 위치 513에서의 아미노산이 T인 아미노산 서열. - 제1항 또는 제2항에 있어서, 면역원성 MSLN 폴리펩티드가 하기로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하는 것인 다중-항원 구축물:
(1) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597을 포함하는 아미노산 서열;
(2) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597로 이루어진 아미노산 서열;
(3) 서열식별번호: 6의 아미노산 서열; 및
(4) 서열식별번호: 6의 아미노산 서열의 아미노산 4-564를 포함하는 아미노산 서열. - 제1항 또는 제2항에 있어서, 면역원성 TERT 폴리펩티드가 하기로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하는 것인 다중-항원 구축물:
(1) 서열식별번호: 10의 아미노산 서열;
(2) 서열식별번호: 12의 아미노산 서열; 및
(3) 서열식별번호: 14의 아미노산 서열. - 제2항에 있어서,
(i) 면역원성 MUC1 폴리펩티드가
(1) 서열식별번호: 8의 아미노산 서열;
(2) 서열식별번호: 8의 아미노산 4-537을 포함하는 아미노산 서열;
(3) 서열식별번호: 8의 아미노산 24-537을 포함하는 아미노산 서열;
(4) 서열식별번호: 16의 아미노산 서열;
(5) 서열식별번호: 16의 아미노산 4-517을 포함하는 아미노산 서열; 및
(6) 서열식별번호: 16의 아미노산 4-517을 포함하며, 단 위치 513에서의 아미노산이 T인 아미노산 서열
로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하고;
(ii) 면역원성 MSLN 폴리펩티드가
(1) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597을 포함하는 아미노산 서열;
(2) 서열식별번호: 2의 아미노산 서열의 아미노산 37-597로 이루어진 아미노산 서열;
(3) 서열식별번호: 6의 아미노산 서열; 및
(4) 서열식별번호: 6의 아미노산 서열의 아미노산 4-564를 포함하는 아미노산 서열
로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하고;
(iii) 면역원성 TERT 폴리펩티드가
(1) 서열식별번호: 10의 아미노산 서열;
(2) 서열식별번호: 12의 아미노산 서열; 및
(3) 서열식별번호: 14의 아미노산 서열
로 이루어진 군으로부터 선택되는 아미노산 서열을 포함하는 것인
다중-항원 구축물. - 제1항 또는 제2항에 있어서, 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열이 하기로 이루어진 군으로부터 선택되는 것인 다중-항원 구축물:
(1) 서열식별번호: 7의 뉴클레오티드 서열;
(2) 서열식별번호: 15의 뉴클레오티드 서열; 및
(3) 서열식별번호: 7 또는 15의 뉴클레오티드 서열의 축중성 변이체. - 제1항 또는 제2항에 있어서, 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이 하기로 이루어진 군으로부터 선택되는 것인 다중-항원 구축물:
(1) 서열식별번호: 5의 뉴클레오티드 서열; 및
(2) 서열식별번호: 5의 뉴클레오티드 서열의 축중성 변이체. - 제1항 또는 제2항에 있어서, 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이 하기로 이루어진 군으로부터 선택되는 것인 다중-항원 구축물:
(1) 서열식별번호: 17, 19, 21, 또는 23의 뉴클레오티드 서열; 및
(2) 서열식별번호: 17, 19, 21, 또는 23의 뉴클레오티드 서열의 축중성 변이체. - 제2항에 있어서,
(i) 면역원성 MUC1 폴리펩티드를 코딩하는 뉴클레오티드 서열이
(1) 서열식별번호: 7의 뉴클레오티드 서열:
(2) 서열식별번호: 15의 뉴클레오티드 서열; 및
(3) 서열식별번호: 7 또는 15의 뉴클레오티드 서열의 축중성 변이체
로 이루어진 군으로부터 선택되고;
(ii) 면역원성 MSLN 폴리펩티드를 코딩하는 뉴클레오티드 서열이
(1) 서열식별번호: 5의 뉴클레오티드 서열; 및
(2) 서열식별번호: 5의 뉴클레오티드 서열의 축중성 변이체
로 이루어진 군으로부터 선택되고;
(iii) 면역원성 TERT 폴리펩티드를 코딩하는 뉴클레오티드 서열이
(1) 서열식별번호: 17, 19, 21, 또는 23의 뉴클레오티드 서열; 및
(2) 서열식별번호: 17, 19, 21, 또는 23의 뉴클레오티드 서열의 축중성 변이체
로 이루어진 군으로부터 선택되는 것인
다중-항원 구축물. - 제6항에 있어서, 서열식별번호: 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 또는 66의 아미노산 서열을 코딩하는 뉴클레오티드 서열을 포함하는 다중-항원 구축물.
- 제6항에 있어서, 하기로부터 선택되는 뉴클레오티드 서열을 포함하는 다중-항원 구축물:
(1) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열; 및
(2) 서열식별번호: 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 또는 65의 뉴클레오티드 서열의 축중성 변이체. - 제1항 내지 제12항 중 어느 한 항에 따른 다중-항원 구축물을 포함하는 벡터.
- 제1항 내지 제12항 중 어느 한 항에 따른 다중-항원 구축물을 포함하는 조성물.
- (i) 제1항 내지 제12항 중 어느 한 항에 따른 다중-항원 구축물 및 (ii) 제약상 허용되는 담체를 포함하는 제약 조성물.
- 유효량의 제15항에 따른 제약 조성물을 환자에게 투여하는 것을 포함하는, 환자에서 암을 치료하는 방법.
- 제16항에 있어서, 암이 MUC1, MSLN, 또는 TERT로부터 선택되는 1종 이상의 종양-연관된 항원을 과다-발현하는 것인 방법.
- 제16항에 있어서, 암이 유방암, 난소암, 또는 췌장암인 방법.
- 제16항에 있어서, 암이 삼중 음성 유방암인 방법.
- 제16항에 있어서, 유효량의 면역 조정제를 환자에게 투여하는 것을 추가로 포함하는 방법.
- 제20항에 있어서, 면역 조정제가 CTLA-4 억제제, IDO1 억제제, PD-1 억제제, 또는 PD-L1 억제제인 방법.
- 제15항에 있어서, 면역 조정제를 추가로 포함하는 제약 조성물.
- 제22항에 있어서, 면역 조정제가 CTLA-4 억제제, IDO1 억제제, PD-1 억제제, 또는 PD-L1 억제제인 제약 조성물.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662280636P | 2016-01-19 | 2016-01-19 | |
US62/280,636 | 2016-01-19 | ||
US201662419190P | 2016-11-08 | 2016-11-08 | |
US62/419,190 | 2016-11-08 | ||
PCT/IB2017/050229 WO2017125844A1 (en) | 2016-01-19 | 2017-01-16 | Cancer vaccines |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020207026620A Division KR20200109395A (ko) | 2016-01-19 | 2017-01-16 | 암 백신 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20180101537A true KR20180101537A (ko) | 2018-09-12 |
KR102158923B1 KR102158923B1 (ko) | 2020-09-23 |
Family
ID=57960563
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020187023390A KR102158923B1 (ko) | 2016-01-19 | 2017-01-16 | 암 백신 |
KR1020207026620A KR20200109395A (ko) | 2016-01-19 | 2017-01-16 | 암 백신 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020207026620A KR20200109395A (ko) | 2016-01-19 | 2017-01-16 | 암 백신 |
Country Status (22)
Country | Link |
---|---|
US (3) | US10251944B2 (ko) |
EP (2) | EP3733201A1 (ko) |
JP (2) | JP6820935B2 (ko) |
KR (2) | KR102158923B1 (ko) |
CN (1) | CN108778321A (ko) |
AU (1) | AU2017208848A1 (ko) |
BR (1) | BR112018013967A2 (ko) |
CA (1) | CA2954892A1 (ko) |
CO (1) | CO2018007442A2 (ko) |
DK (1) | DK3405212T3 (ko) |
ES (1) | ES2811523T3 (ko) |
HU (1) | HUE051722T2 (ko) |
IL (2) | IL260321B (ko) |
MX (1) | MX2018008797A (ko) |
PH (1) | PH12018501532A1 (ko) |
PL (1) | PL3405212T3 (ko) |
PT (1) | PT3405212T (ko) |
RU (2) | RU2718663C2 (ko) |
SG (1) | SG11201805621SA (ko) |
SI (1) | SI3405212T1 (ko) |
TW (3) | TWI709647B (ko) |
WO (1) | WO2017125844A1 (ko) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018204760A1 (en) * | 2017-05-05 | 2018-11-08 | David Weiner | Ctla4 antibodies and vaccine combinations and use of same for immunotherapy |
AU2022290634A1 (en) * | 2021-06-09 | 2024-01-25 | Ultimovacs Ab | A conjugate |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014127296A1 (en) * | 2013-02-14 | 2014-08-21 | Immunocellular Therapeutics, Ltd | Cancer vaccines and vaccination methods |
Family Cites Families (84)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4603112A (en) | 1981-12-24 | 1986-07-29 | Health Research, Incorporated | Modified vaccinia virus |
US4769330A (en) | 1981-12-24 | 1988-09-06 | Health Research, Incorporated | Modified vaccinia virus and methods for making and using the same |
US5288641A (en) | 1984-06-04 | 1994-02-22 | Arch Development Corporation | Herpes Simplex virus as a vector |
CA1341482C (en) | 1984-10-31 | 2005-05-10 | Paul A. Luciw | Process for preparing fragments of aids-associated retroviruses |
JPH0662517B2 (ja) | 1985-01-07 | 1994-08-17 | シンテツクス(ユー・エス・エイ)インコーポレイテツド | 置換アルキルートリ置換アンモニウム界面活性剤 |
GB8508845D0 (en) | 1985-04-04 | 1985-05-09 | Hoffmann La Roche | Vaccinia dna |
US5091309A (en) | 1986-01-16 | 1992-02-25 | Washington University | Sindbis virus vectors |
US5219740A (en) | 1987-02-13 | 1993-06-15 | Fred Hutchinson Cancer Research Center | Retroviral gene transfer into diploid fibroblasts for gene therapy |
WO1989001973A2 (en) | 1987-09-02 | 1989-03-09 | Applied Biotechnology, Inc. | Recombinant pox virus for immunization against tumor-associated antigens |
US5716826A (en) | 1988-03-21 | 1998-02-10 | Chiron Viagene, Inc. | Recombinant retroviruses |
US5591624A (en) | 1988-03-21 | 1997-01-07 | Chiron Viagene, Inc. | Retroviral packaging cell lines |
US5217879A (en) | 1989-01-12 | 1993-06-08 | Washington University | Infectious Sindbis virus vectors |
DE69034078T2 (de) | 1989-03-21 | 2004-04-01 | Vical, Inc., San Diego | Expression von exogenen Polynukleotidsequenzen in Wirbeltieren |
US5703055A (en) | 1989-03-21 | 1997-12-30 | Wisconsin Alumni Research Foundation | Generation of antibodies through lipid mediated DNA delivery |
HU212924B (en) | 1989-05-25 | 1996-12-30 | Chiron Corp | Adjuvant formulation comprising a submicron oil droplet emulsion |
US5279833A (en) | 1990-04-04 | 1994-01-18 | Yale University | Liposomal transfection of nucleic acids into animal cells |
US5817491A (en) | 1990-09-21 | 1998-10-06 | The Regents Of The University Of California | VSV G pseusdotyped retroviral vectors |
US5283185A (en) | 1991-08-28 | 1994-02-01 | University Of Tennessee Research Corporation | Method for delivering nucleic acids into cells |
AU671971B2 (en) | 1991-11-29 | 1996-09-19 | Chiron Corporation | Anti-cancer immunotherapeutic vector constructs |
ES2143716T3 (es) | 1992-06-25 | 2000-05-16 | Smithkline Beecham Biolog | Composicion de vacuna que contiene adyuvantes. |
IL107366A (en) | 1992-10-23 | 2003-03-12 | Chugai Pharmaceutical Co Ltd | Genes coding for megakaryocyte potentiator |
US6015686A (en) | 1993-09-15 | 2000-01-18 | Chiron Viagene, Inc. | Eukaryotic layered vector initiation systems |
GB9326253D0 (en) | 1993-12-23 | 1994-02-23 | Smithkline Beecham Biolog | Vaccines |
US5651981A (en) | 1994-03-29 | 1997-07-29 | Northwestern University | Cationic phospholipids for transfection |
US5527928A (en) | 1994-09-30 | 1996-06-18 | Nantz; Michael H. | Cationic transport reagents |
GB9513261D0 (en) | 1995-06-29 | 1995-09-06 | Smithkline Beecham Biolog | Vaccines |
US6083502A (en) | 1996-01-05 | 2000-07-04 | The United States Of America As Represented By The Department Of Health And Human Services | Mesothelium antigen and methods and kits for targeting it |
US7375183B1 (en) | 1996-01-05 | 2008-05-20 | The United States Of America As Represented By The Department Of Health And Human Services | Mesothelin, immunogenic peptides derived therefrom, and compositions comprising mesothelin, or immunogenic peptides thereof |
WO1997034921A1 (en) | 1996-03-20 | 1997-09-25 | Sloan-Kettering Institute For Cancer Research | Conjugated mucin peptide vaccines |
US6093809A (en) | 1996-10-01 | 2000-07-25 | University Technology Corporation | Telomerase |
EP1783139B8 (en) | 1996-10-01 | 2009-09-23 | Geron Corporation | Human telomerase catalytic subunit |
US6475789B1 (en) | 1996-10-01 | 2002-11-05 | University Technology Corporation | Human telomerase catalytic subunit: diagnostic and therapeutic methods |
US7585622B1 (en) | 1996-10-01 | 2009-09-08 | Geron Corporation | Increasing the proliferative capacity of cells using telomerase reverse transcriptase |
US6261836B1 (en) | 1996-10-01 | 2001-07-17 | Geron Corporation | Telomerase |
CA2281838A1 (en) | 1997-02-28 | 1998-09-03 | University Of Iowa Research Foundation | Use of nucleic acids containing unmethylated cpg dinucleotide in the treatment of lps-associated disorders |
AU753688B2 (en) | 1997-03-10 | 2002-10-24 | Ottawa Civic Loeb Research Institute | Use of nucleic acids containing unmethylated CpG dinucleotide as an adjuvant |
US7622549B2 (en) | 1997-04-18 | 2009-11-24 | Geron Corporation | Human telomerase reverse transcriptase polypeptides |
US20050013825A1 (en) | 1997-04-18 | 2005-01-20 | Geron Corporation | Vaccine containing the catalytic subunit of telomerase for treating cancer |
DE69838294T2 (de) | 1997-05-20 | 2009-08-13 | Ottawa Health Research Institute, Ottawa | Verfahren zur Herstellung von Nukleinsäurekonstrukten |
PT1003850E (pt) | 1997-06-06 | 2009-08-13 | Dynavax Tech Corp | Inibidores da actividade de sequências de adn imunoestimulantes |
GB9712347D0 (en) | 1997-06-14 | 1997-08-13 | Smithkline Beecham Biolog | Vaccine |
DE69815692T2 (de) | 1997-09-05 | 2004-04-29 | Glaxosmithkline Biologicals S.A. | Öl in wasser emulsionen mit saponinen |
US6303114B1 (en) | 1998-03-05 | 2001-10-16 | The Medical College Of Ohio | IL-12 enhancement of immune responses to T-independent antigens |
US6337200B1 (en) | 1998-03-31 | 2002-01-08 | Geron Corporation | Human telomerase catalytic subunit variants |
JP2002511423A (ja) | 1998-04-09 | 2002-04-16 | スミスクライン ビーチャム バイオロジカルズ ソシエテ アノニム | ワクチン |
US6962790B1 (en) | 1998-09-23 | 2005-11-08 | University Of Massachusetts Medical Center | Predictive assay for immune response |
EE05627B1 (et) | 1998-12-23 | 2013-02-15 | Pfizer Inc. | CTLA-4 vastased inimese monoklonaalsed antikehad |
HU228499B1 (en) | 1999-03-19 | 2013-03-28 | Smithkline Beecham Biolog | Streptococcus vaccine |
PT1187629E (pt) | 1999-04-19 | 2005-02-28 | Glaxosmithkline Biolog Sa | Composicao adjuvante que compreende saponina e um oligonucleotido imunoestimulador |
US6472028B1 (en) | 1999-08-12 | 2002-10-29 | Joseph Frazzitta | Method of producing a high gloss coating on a printed surface |
EP3214175A1 (en) | 1999-08-24 | 2017-09-06 | E. R. Squibb & Sons, L.L.C. | Human ctla-4 antibodies and their uses |
CO5200837A1 (es) | 1999-09-24 | 2002-09-27 | Smithkline Beecham Corp | Vacunas |
PL355232A1 (en) | 1999-09-24 | 2004-04-05 | Smithkline Beecham Biologicals S.A. | Adjuvant comprising a polyxyethylene alkyl ether or ester and at least one nonionic surfactant |
GB9930359D0 (en) | 1999-12-22 | 2000-02-09 | Glaxo Group Ltd | Novel polypeptides |
AU2001241533A1 (en) | 2000-02-15 | 2001-08-27 | The Regents Of The University Of California | A universal vaccine and method for treating cancer employing telomerase reverse transcriptase |
WO2002016555A2 (en) | 2000-08-17 | 2002-02-28 | University Of Wales College Of Medicine | Htert-immortalised cell lines, their preparation and use |
US20040136963A1 (en) | 2001-06-22 | 2004-07-15 | The Trustees Of The University Of Pennsylvania | Simian adenovirus vectors and methods of use |
US20030143228A1 (en) | 2001-10-29 | 2003-07-31 | Baylor College Of Medicine | Human telomerase reverse transcriptase as a class-II restricted tumor-associated antigen |
TWI228718B (en) | 2001-11-05 | 2005-03-01 | Tdk Corp | Manufacturing method and device of mold plate for information medium |
DE10162480A1 (de) | 2001-12-19 | 2003-08-07 | Ingmar Hoerr | Die Applikation von mRNA für den Einsatz als Therapeutikum gegen Tumorerkrankungen |
GB0212046D0 (en) * | 2002-05-24 | 2002-07-03 | Glaxo Group Ltd | Vaccines |
EP1575500A4 (en) | 2002-07-12 | 2007-01-03 | Univ Johns Hopkins | MESOTHELIN VACCINE AND MODEL SYSTEMS |
US9200036B2 (en) | 2002-07-12 | 2015-12-01 | The Johns Hopkins University | Mesothelin vaccines and model systems |
CN1315536C (zh) * | 2002-09-13 | 2007-05-16 | 李进 | 肿瘤抗原疫苗及其制备方法和疫苗组合物 |
EP1711518B1 (en) | 2004-01-23 | 2009-11-18 | Istituto di Richerche di Biologia Molecolare P. Angeletti S.p.A. | Chimpanzee adenovirus vaccine carriers |
DE102004035227A1 (de) | 2004-07-21 | 2006-02-16 | Curevac Gmbh | mRNA-Gemisch zur Vakzinierung gegen Tumorerkrankungen |
EP1832654B1 (en) * | 2004-12-28 | 2011-07-27 | Sekisui Medical Co., Ltd. | Anti-human soluble fibrin monoclonal antibody and immunological assay method using the antibody |
EP1748067A1 (en) | 2005-07-29 | 2007-01-31 | Institut Pasteur | Polynucleotides encoding MHC class I-restricted hTERT epitopes, analogues thereof or polyepitopes |
CA2652310A1 (en) | 2006-01-19 | 2007-08-23 | The Regents Of The University Of California | Human telomerase reverse transcriptase peptides |
BRPI0719865A2 (pt) * | 2006-10-12 | 2016-03-15 | Angeletti P Ist Richerche Bio | molécula de ácido nucleico, proteína de fusão purificada, vetor, célula hospedeira, processo para expressar uma proteína de fusão, e, método para tratar um paciente sofrendo de ou predisposto a um câncer |
EP1994942A1 (en) | 2007-05-25 | 2008-11-26 | INSERM (Institut National de la Santé et de la Recherche Médicale) | Pharmaceutical compositions comprising telomerase, and uses thereof |
WO2009025871A1 (en) | 2007-08-23 | 2009-02-26 | University Of Medicine And Dentistry Of Nj | Telomerase reverse transcriptase variant |
MY171866A (en) | 2008-07-08 | 2019-11-05 | Incyte Holdings Corp | 1,2,5-oxadiazoles as inhibitors of indoleamine 2,3-dioxygenase |
BRPI1008018A2 (pt) | 2009-02-02 | 2016-03-15 | Okairos Ag | ácidos nucleicos de adenovírus símio e sequências de aminoácidos, vetores contendo os mesmos e uso dos mesmos |
CN101920009B (zh) | 2010-03-06 | 2012-08-08 | 河北医科大学 | 一种用于预防和治疗肿瘤的疫苗 |
AU2012262520A1 (en) * | 2011-05-27 | 2014-01-23 | Cytocure Llc | Methods, compositions, and kits for the treatment of cancer |
CA2843200A1 (en) | 2011-07-27 | 2013-01-31 | Baylor Research Institute | Dendritic cell (dc)-vaccine therapy for pancreatic cancer |
US8523590B2 (en) * | 2011-07-29 | 2013-09-03 | General Electric Company | Cable system and methods of assembling a cable system |
CN114617958A (zh) * | 2011-08-17 | 2022-06-14 | 全球免疫股份有限公司 | 酵母-muc1免疫治疗组合物及其用途 |
MX359257B (es) * | 2012-05-04 | 2018-09-19 | Pfizer | Antígenos asociados a próstata y regímenes de inmunoterapia basados en vacuna. |
US10184216B2 (en) * | 2012-07-18 | 2019-01-22 | Thomas Urbanek | Advantageous detectable warning area and methods of forming the same |
US11419925B2 (en) | 2013-03-15 | 2022-08-23 | The Trustees Of The University Of Pennsylvania | Cancer vaccines and methods of treatment using the same |
KR102006527B1 (ko) | 2013-11-01 | 2019-08-02 | 화이자 인코포레이티드 | 전립선-연관 항원의 발현을 위한 벡터 |
CN106536480B (zh) | 2014-05-15 | 2019-09-03 | 艾特奥斯治疗公司 | 吡咯烷-2,5-二酮衍生物、药物组合物及用作ido1抑制剂的方法 |
-
2017
- 2017-01-16 JP JP2018537450A patent/JP6820935B2/ja not_active Expired - Fee Related
- 2017-01-16 CA CA2954892A patent/CA2954892A1/en active Pending
- 2017-01-16 ES ES17702951T patent/ES2811523T3/es active Active
- 2017-01-16 HU HUE17702951A patent/HUE051722T2/hu unknown
- 2017-01-16 SG SG11201805621SA patent/SG11201805621SA/en unknown
- 2017-01-16 RU RU2018126487A patent/RU2718663C2/ru active
- 2017-01-16 EP EP20175571.7A patent/EP3733201A1/en not_active Withdrawn
- 2017-01-16 AU AU2017208848A patent/AU2017208848A1/en not_active Abandoned
- 2017-01-16 RU RU2020110148A patent/RU2020110148A/ru unknown
- 2017-01-16 TW TW108100405A patent/TWI709647B/zh not_active IP Right Cessation
- 2017-01-16 PT PT177029519T patent/PT3405212T/pt unknown
- 2017-01-16 CN CN201780018354.0A patent/CN108778321A/zh active Pending
- 2017-01-16 WO PCT/IB2017/050229 patent/WO2017125844A1/en active Application Filing
- 2017-01-16 DK DK17702951.9T patent/DK3405212T3/da active
- 2017-01-16 EP EP17702951.9A patent/EP3405212B1/en active Active
- 2017-01-16 TW TW106101436A patent/TWI654302B/zh not_active IP Right Cessation
- 2017-01-16 SI SI201730373T patent/SI3405212T1/sl unknown
- 2017-01-16 PL PL17702951T patent/PL3405212T3/pl unknown
- 2017-01-16 KR KR1020187023390A patent/KR102158923B1/ko active IP Right Grant
- 2017-01-16 BR BR112018013967A patent/BR112018013967A2/pt not_active IP Right Cessation
- 2017-01-16 KR KR1020207026620A patent/KR20200109395A/ko not_active Application Discontinuation
- 2017-01-16 MX MX2018008797A patent/MX2018008797A/es unknown
- 2017-01-16 TW TW109135756A patent/TW202111125A/zh unknown
- 2017-01-17 US US15/407,890 patent/US10251944B2/en not_active Expired - Fee Related
-
2018
- 2018-06-27 IL IL260321A patent/IL260321B/en active IP Right Grant
- 2018-07-16 CO CONC2018/0007442A patent/CO2018007442A2/es unknown
- 2018-07-18 PH PH12018501532A patent/PH12018501532A1/en unknown
-
2019
- 2019-01-18 US US16/252,239 patent/US11058753B2/en active Active
-
2020
- 2020-10-01 IL IL277752A patent/IL277752A/en unknown
-
2021
- 2021-01-05 JP JP2021000428A patent/JP2021061851A/ja active Pending
- 2021-05-13 US US17/319,395 patent/US20220323558A1/en not_active Abandoned
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014127296A1 (en) * | 2013-02-14 | 2014-08-21 | Immunocellular Therapeutics, Ltd | Cancer vaccines and vaccination methods |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11173196B2 (en) | Vectors for expression of prostate-associated antigens | |
DK2155243T3 (en) | Compositions and methods comprising KLK3, PSCA, or antigen FOLH1 | |
JP2009509553A (ja) | マトリックスメタロプロテイナーゼ11ワクチン | |
KR102158923B1 (ko) | 암 백신 | |
JP2022031653A (ja) | Cea、muc1およびtertを含む免疫原性組成物 | |
JP2006518202A (ja) | アカゲザル癌胎児性抗原、前記抗原をコードするヌクレオチド、及びその使用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
A107 | Divisional application of patent | ||
GRNT | Written decision to grant |